Finnish Wikipedia 2017, Korp
dc.contributor.affiliation | University of Helsinki - Tatu Huovilainen | |
dc.contributor.author | Tatu Huovilainen | |
dc.date.accessioned | 2024-10-09T10:06:08Z | |
dc.date.available | 2024-10-09T10:06:08Z | |
dc.description | The Finnish Wikipedia 2017 Corpus will be available in the concordance tool Korp. The corpus contains all the Finnish articles from the online encyclopedia Wikipedia available in 1 January 2018. The text parts of the articles have been extracted from [Wikipedia Dumps](https://dumps.wikimedia.org/) with [WikiExtractor](https://github.com/attardi/wikiextractor). The corpus has been tokenized and annotated with morpho-syntactic analysis produced with the [Turku Dependency Parser](http://turkunlp.github.io/Finnish-dep-parser/) | |
dc.discipline | Languages | |
dc.identifier | http://urn.fi/urn:nbn:fi:lb-2018060401 | |
dc.identifier.uri | http://localhost:4000/handle/123456789/4782 | |
dc.language | Finnish | |
dc.language | Finnish | |
dc.rights | Open | |
dc.rights.license | Creative Commons Attribution 4.0 International (CC BY 4.0) | |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
dc.title | Finnish Wikipedia 2017, Korp |