nikopartanen/old-literary-finnish-lemmatization: Old Literary Finnish Lemmatization Dataset

dc.contributor.affiliation: University of Helsinki - Niko Partanen
dc.contributor.affiliation: RootRoo - Khalid Alnajjar
dc.contributor.affiliation: RootRoo - Mika Hämäläinen
dc.contributor.affiliation: University of Helsinki - Jack Rueter
dc.contributor.author: Niko Partanen
dc.contributor.author: Khalid Alnajjar
dc.contributor.author: Mika Hämäläinen
dc.contributor.author: Jack Rueter
dc.date.accessioned: 2025-03-24T15:15:27Z
dc.date.issued: 2021-06-07
dc.description: This dataset contains randomly selected and manually lemmatized sentences from the Corpus of Old Literary Finnish. Please cite and consult the original corpus as well: Institute for the Languages of Finland (2013). Corpus of Old Literary Finnish [text corpus]. The Language Bank of Finland. Retrieved from http://urn.fi/urn:nbn:fi:lb-201407165 Currently, some individual decades have not been lemmatized: 1690, 1720, 1740 and 1770. Additionally, many decades are not present in the dataset at all. Adding these and completing the material in other ways is an important goal for further research. We also welcome corrections and additions to the dataset from other researchers.
dc.identifier: https://doi.org/10.5281/zenodo.4906627
dc.identifier.uri: https://hydatakatalogi-test-24.it.helsinki.fi/handle/123456789/9739
dc.rights: Open
dc.rights.license: other-open
dc.title: nikopartanen/old-literary-finnish-lemmatization: Old Literary Finnish Lemmatization Dataset
dc.type: dataset
