Translations:Manually annotated corpora/27/en

From Clarin K-Centre
Jump to navigation Jump to search

Crowll corpora

Manually annotated corpora for teaching and learning purposes of Brazilian Portuguese, Dutch, Estonian, and Slovene. Sentences are annotated with “problematic” or “non-problematic” labels, from the point of usage for pedagogical purposes. Sentences labelled as problematic also have annotations defining the category of the problem (offensive, vulgar, sensitive content, grammar and/or spelling problems, incomprehensible and/or lack of context). Each corpus consists of 10.000 sentences, which were annotated by language experts.