Classifying and fact-checking health-related information about COVID-19 on Twitter/X using machine learning and deep learning models

Table 1 Details of the used datasets

Datasets	Number of data	Labels provided in the source	The labels used in the model	#Trustworthy information label	#Misinformation label
DATASET-1: FaCov [38]	3088	True, False	True, False	72	3016
DATASET-2: FakeCOVID [39]	7621	Collections, Correct, Correct attribution, Explanatory, Fake, Fake news, False, False and misleading, Half true, Half truth in dispute, labeled satire, Misattributed, Miscaptioned, Misinformation / Conspiracy theory, Misleading, Misleading/false, Mixed, Mixture, Mostly false, Mostly true, News, No evidence, Not true, Pants on fire, Partially correct, Partially false, Partially true, Partly false, Partly true, Scam, Suspicions, True, True but, Two pinocchios, Unlikely, Unproven, Unverified	Correct, Mostly true, True, News, True but, Half truth, Half true, Fake, Fake news, False, False and misleading, Mostly false, Misinformation / Conspiracy theory, Misleading, Misleading/False, Not true, Scam	88	7149
DATASET-3: Check-COVID [40]	1504	Not enough info, Refute, Support	Refute, Support	506	504
DATASET-4: Esoc-covid-19-misinformation-dataset [41]	5952	Conspiracy, Fake remedy, False Reporting	Conspiracy, Fake remedy, False reporting	0	4112
DATASET-5: WHO Myth Busters [42]	30	True	True	29	0
DATASET-6: healthfeedback.org [43]	784	True	True	765	0
DATASET-7: Lopez and Gallemore [29]	13,150	-	True, False	13,080	70

ISSN: 1472-6947