Wortschatz – bul_newscrawl_2011 – Институт за международни отношения

Bulgarian news corpus based on material crawled in 2011 with 6,170,388 sentences.

Description

Bulgarian news corpus based on material crawled in 2011

Details

Name	bul_newscrawl_2011
Language	Bulgarian
Genre	Newscrawl
Year	2011
Sentences	6,170,388
Types	1,285,435
Tokens	104,958,221

Link to the corpus

https://corpora.wortschatz-leipzig.de?corpusId=bul_newscrawl_2011

Annotations

coocSim
GDEX
wordsLevenshteinSim

Cite this corpus

Leipzig Corpora Collection: Bulgarian news corpus based on material crawled in 2011. Leipzig Corpora Collection. Dataset. https://corpora.wortschatz-leipzig.de?corpusId=bul_newscrawl_2011.

BibTeX

@misc{bul_newscrawl_2011,
    author = {Leipzig Corpora Collection},
    title = {Bulgarian news corpus based on material crawled in 2011},
    howpublished = {https://corpora.wortschatz-leipzig.de?corpusId=bul_newscrawl_2011},
    note = {Accessed: 2025-03-15}
}

Word:

Институт за международни отношения ("Institute for International Relations")

Number of occurrences: 2 Rank: 483,744 Frequency class: 21
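
The frequency class relates a word's count to the count of the most frequent word in the corpus: a class of N means the top word is roughly 2^N times more frequent. The following is a minimal sketch of that relation, assuming the class is log2 of the count ratio rounded to the nearest integer. The exact rounding rule and the function name `frequency_class` are assumptions, and the top-word count used below is a hypothetical value chosen for illustration, not a figure from this corpus.

```python
import math


def frequency_class(word_count: int, top_count: int) -> int:
    """Return the frequency class of a word (assumed definition).

    Assumption: class = floor(log2(top_count / word_count) + 0.5),
    i.e. the base-2 log of the count ratio, rounded to nearest.
    """
    if word_count <= 0:
        raise ValueError("word_count must be positive")
    if top_count < word_count:
        raise ValueError("top_count must be at least word_count")
    return math.floor(math.log2(top_count / word_count) + 0.5)


# Hypothetical top-word count: 2 occurrences times 2**21.
hypothetical_top_count = 2 * 2**21
print(frequency_class(2, hypothetical_top_count))
```

Under this assumed definition, a word with 2 occurrences and frequency class 21 implies a most frequent word with a count on the order of a few million, which is plausible for a corpus of about 105 million tokens.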

Subwords: отношения, международни, Институт

Neighbour Cooccurrences:
