Mide22: an annotated multi-event tweet dataset for misinformation detection

buir.contributor.authorÖzçelik, Oğuzhan
buir.contributor.authorCan, Fazlı
buir.contributor.orcidÖzçelik, Oğuzhan|0000-0002-9420-9854
buir.contributor.orcidCan, Fazlı|0000-0003-0016-4278
dc.citation.epage11295
dc.citation.spage11283
dc.contributor.authorToraman, Çağrı
dc.contributor.authorÖzçelik, Oğuzhan
dc.contributor.authorŞahinuç, Furkan
dc.contributor.authorCan, Fazlı
dc.date.accessioned2025-02-23T20:30:29Z
dc.date.available2025-02-23T20:30:29Z
dc.date.issued2024
dc.departmentDepartment of Computer Engineering
dc.descriptionConference Name:Joint 30th International Conference on Computational Linguistics and 14th International Conference on Language Resources and Evaluation, LREC-COLING 2024
dc.descriptionDate of Conference::20 May 2024through 25 May 2024
dc.description.abstractThe rapid dissemination of misinformation through online social networks poses a pressing issue with harmful consequences jeopardizing human health, public safety, democracy, and the economy; therefore, urgent action is required to address this problem. In this study, we construct a new human-annotated dataset, called MiDe22, having 5,284 English and 5,064 Turkish tweets with their misinformation labels for several recent events between 2020 and 2022, including the Russia-Ukraine war, COVID-19 pandemic, and Refugees. The dataset includes user engagements with the tweets in terms of likes, replies, retweets, and quotes. We also provide a detailed data analysis with descriptive statistics and the experimental results of a benchmark evaluation for misinformation detection. © 2024 ELRA Language Resource Association: CC BY-NC 4.0.
dc.identifier.isbn978-249381410-4
dc.identifier.urihttps://hdl.handle.net/11693/116710
dc.language.isoEnglish
dc.publisherEuropean Language Resources Association (ELRA)
dc.rightsCC BY-NC 4.0 (Attribution 4.0 International Deed)
dc.rights.urihttps://creativecommons.org/licenses/by-nc/4.0/
dc.subjectHuman-annotation
dc.subjectMisinformation detection
dc.subjectMulti-event dataset
dc.subjectTweet
dc.titleMide22: an annotated multi-event tweet dataset for misinformation detection
dc.typeConference Paper

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Mide22_an_annotated_multi-event_tweet_dataset_for_misinformation_detection.pdf
Size:
7.47 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: