Dataset | Number of records | Labels provided in the source | Labels used in the model | # Trustworthy information labels | # Misinformation labels |
---|---|---|---|---|---|
DATASET-1: FaCov [38] | 3088 | True, False | True, False | 72 | 3016 |
DATASET-2: FakeCOVID [39] | 7621 | Collections, Correct, Correct attribution, Explanatory, Fake, Fake news, False, False and misleading, Half true, Half truth in dispute, Labeled satire, Misattributed, Miscaptioned, Misinformation / Conspiracy theory, Misleading, Misleading/false, Mixed, Mixture, Mostly false, Mostly true, News, No evidence, Not true, Pants on fire, Partially correct, Partially false, Partially true, Partly false, Partly true, Scam, Suspicions, True, True but, Two pinocchios, Unlikely, Unproven, Unverified | Correct, Mostly true, True, News, True but, Half truth, Half true, Fake, Fake news, False, False and misleading, Mostly false, Misinformation / Conspiracy theory, Misleading, Misleading/false, Not true, Scam | 88 | 7149 |
DATASET-3: Check-COVID [40] | 1504 | Not enough info, Refute, Support | Refute, Support | 506 | 504 |
DATASET-4: Esoc-covid-19-misinformation-dataset [41] | 5952 | Conspiracy, Fake remedy, False reporting | Conspiracy, Fake remedy, False reporting | 0 | 4112 |
DATASET-5: WHO Myth Busters [42] | 30 | True | True | 29 | 0 |
DATASET-6: healthfeedback.org [43] | 784 | True | True | 765 | 0 |
DATASET-7: Lopez and Gallemore [29] | 13150 | - | True, False | 13080 | 70 |
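
For several datasets the two class counts do not add up to the number of records. The sketch below computes that gap per dataset and the overall class totals. The counts are copied from the table; the names `COUNTS`, `unassigned`, and `class_totals` are hypothetical and introduced only for illustration. The table does not state why records fall outside both classes; one plausible reading, assumed here, is that their source label is not among the labels used in the model.

```python
# Counts copied from the table: (number of records, trustworthy, misinformation).
COUNTS = {
    "DATASET-1": (3088, 72, 3016),
    "DATASET-2": (7621, 88, 7149),
    "DATASET-3": (1504, 506, 504),
    "DATASET-4": (5952, 0, 4112),
    "DATASET-5": (30, 29, 0),
    "DATASET-6": (784, 765, 0),
    "DATASET-7": (13150, 13080, 70),
}


def unassigned(counts):
    """Return, per dataset, the number of records counted in neither class."""
    return {
        name: total - (trustworthy + misinformation)
        for name, (total, trustworthy, misinformation) in counts.items()
    }


def class_totals(counts):
    """Return (trustworthy, misinformation) totals summed over all datasets."""
    trustworthy = sum(t for _, t, _ in counts.values())
    misinformation = sum(m for _, _, m in counts.values())
    return trustworthy, misinformation


if __name__ == "__main__":
    for name, gap in unassigned(COUNTS).items():
        print(f"{name}: {gap} records in neither class")
    print("class totals (trustworthy, misinformation):", class_totals(COUNTS))
```

On these figures the combined classes are close to balanced (14540 trustworthy versus 14851 misinformation), even though most individual datasets are heavily skewed toward one class.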