Description
Portuguese news corpus based on material from 2019
Details
| Name |
por_news_2019 |
Sentences |
4,490,898 |
| Language |
Portuguese
()
|
Types |
785,287 |
| Genre |
News |
Tokens |
89,449,824 |
| Year |
2019 |
Link to the corpus
https://corpora.wortschatz-leipzig.de?corpusId=por_news_2019
Annotations
coocSim
GDEX
POS (OpenNLP - https://opennlp.apache.org/download.html)
Cite this corpus
Leipzig Corpora Collection: Portuguese news corpus based on material from 2019. Leipzig Corpora Collection. Dataset. https://corpora.wortschatz-leipzig.de?corpusId=por_news_2019.
BibTeX
@misc{por_news_2019,
author = {Leipzig Corpora Collection},
title = {Portuguese news corpus based on material from 2019},
howpublished = {https://corpora.wortschatz-leipzig.de?corpusId=por_news_2019},
note = {Accessed: 2025-12-20}
}