Finnish Rumor Detection Dataset and Models

dc.contributor.affiliationUniversity of Helsinki - Hämäläinen, Mika
dc.contributor.affiliationUniversity of Helsinki - Alnajjar, Khalid
dc.contributor.affiliationUniversity of Helsinki - Partanen, Niko
dc.contributor.affiliationUniversity of Helsinki - Rueter, Jack
dc.contributor.authorHämäläinen, Mika
dc.contributor.authorAlnajjar, Khalid
dc.contributor.authorPartanen, Niko
dc.contributor.authorRueter, Jack
dc.date.accessioned2025-03-24T15:11:12Z
dc.date.issued2021-04-16
dc.date.issued2021-04-16
dc.descriptionData and models for Finnish rumor detection. The entire dataset is in rumor_dataset.json. Look into dataset_splits.zip for data splits used in the paper (*_test.txt and *_train.txt). The .pt files are the OpenNMT models described in the paper. bert-models.zip has the BERT based models trained with the bert.py (FinBERT) and bert_multi.py (Multilingual BERT) scripts. Cite: Hämäläinen, M., Alnajjar, K., Partanen, N., & Rueter, J. (2021) Never guess what I heard... Rumor Detection in Finnish News: a Dataset and a Baseline. In the Proceedings of the Third Workshop on NLP for Internet Freedom (NLP4IF): Censorship, Disinformation, and Propaganda
dc.identifierhttps://doi.org/10.5281/zenodo.4697529
dc.identifier.urihttps://hydatakatalogi-test-24.it.helsinki.fi/handle/123456789/9006
dc.rightsOpen
dc.rights.licensecc-by-nc-nd-4.0
dc.titleFinnish Rumor Detection Dataset and Models
dc.typedataset
dc.typedataset

Files

Repositories