Description
Finnish Web text corpus based on material from 2002
Details
| Name |
fin_web_2002 |
Sentences |
4,737,045 |
| Language |
Finnish
()
|
Types |
3,590,142 |
| Genre |
Web |
Tokens |
55,173,465 |
| Year |
2002 |
Link to the corpus
https://corpora.wortschatz-leipzig.de?corpusId=fin_web_2002
Annotations
coocSim
GDEX
POS (TreeTagger - unknown)
wordsLevenshteinSim
Cite this corpus
Leipzig Corpora Collection: Finnish Web text corpus based on material from 2002. Leipzig Corpora Collection. Dataset. https://corpora.wortschatz-leipzig.de?corpusId=fin_web_2002.
BibTeX
@misc{fin_web_2002,
author = {Leipzig Corpora Collection},
title = {Finnish Web text corpus based on material from 2002},
howpublished = {https://corpora.wortschatz-leipzig.de?corpusId=fin_web_2002},
note = {Accessed: 2025-12-18}
}