Corpus of Czech Verse

The Corpus of Czech Verse (CCV) is a corpus of Czech poetry from the 19th and early 20th centuries, lemmatized and annotated phonetically, morphologically, metrically, and strophically. Each lexical unit carries its basic entry form (lemma), phonetic transcription, and grammatical categories; each line is annotated with its metre (iamb, trochee, ...), length (n-meter), ending type (masculine, feminine, ...), and metrical formula. (Currently, metrical annotation covers only syllabotonic lines.) At higher levels, rhyme pairs or n-tuples and fixed verse forms (sonnet, rondel, ...) are annotated. The CCV data can be downloaded for free and are also accessible through user-friendly online tools.
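The annotation levels described above can be pictured as a small nested data model. The following is only an illustrative sketch: the class names, field names, and the `rhyme_groups` helper are hypothetical and do not reflect the actual format of the CCV download files, which should be checked against the corpus documentation.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Word:
    """One lexical unit (hypothetical field names)."""
    token: str     # surface form as printed
    lemma: str     # basic entry form
    phonetic: str  # phonetic transcription
    morph: str     # grammatical categories, as an opaque tag string


@dataclass
class Line:
    """One verse line with line-level annotation (hypothetical field names)."""
    words: List[Word]
    metre: Optional[str] = None    # e.g. "iamb", "trochee"; None if not annotated
    feet: Optional[int] = None     # length of the line (n-meter)
    ending: Optional[str] = None   # e.g. "masculine", "feminine"
    formula: Optional[str] = None  # metrical formula; notation left open here
    rhyme: Optional[int] = None    # id shared by lines that rhyme together


@dataclass
class Poem:
    """One poem; `form` holds a fixed verse form such as "sonnet", if any."""
    title: str
    lines: List[Line] = field(default_factory=list)
    form: Optional[str] = None


def rhyme_groups(poem: Poem) -> List[List[int]]:
    """Return line indices grouped by shared rhyme id (pairs or n-tuples)."""
    groups = {}
    for index, line in enumerate(poem.lines):
        if line.rhyme is not None:
            groups.setdefault(line.rhyme, []).append(index)
    # A rhyme needs at least two lines; drop ids that occur only once.
    return [indices for indices in groups.values() if len(indices) > 1]


# Placeholder poem with an ABAB rhyme scheme and no word-level data.
demo = Poem(title="demo", lines=[Line(words=[], rhyme=r) for r in (1, 2, 1, 2)])
print(rhyme_groups(demo))  # → [[0, 2], [1, 3]]
```

Keeping the rhyme annotation as a shared id on each line, rather than as explicit pairs, lets the same structure represent both rhyme pairs and larger n-tuples.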
Basic Features of the CCV
  • 1 689 poetry collections
  • 76 699 poems
  • 2 664 989 lines
  • 14 592 037 words
CCV Content

[Interactive chart: number of poems, lines, or words, by publication year or by year of the author's birth]

This website was created with the support of the Czech Science Foundation as part of projects P406/11/1825 (The History and Theory of the 19th Century Czech Verse) and 17-01723S (Stylometric Analysis of Poetic Texts), and with institutional support for the long-term conceptual development of a research institution (no. 68378068).
© 2018 Petr Plecháč