Translations:Manually annotated corpora/27/en

From Clarin K-Centre
Revision as of 08:02, 28 May 2024 by FuzzyBot (talk | contribs) (Importing a new version from external source)

Crowll corpora

Manually annotated corpora for the teaching and learning of Brazilian Portuguese, Dutch, Estonian, and Slovene. Each sentence is labelled “problematic” or “non-problematic” according to its suitability for pedagogical use. Sentences labelled as problematic are additionally annotated with the category of the problem (offensive, vulgar, sensitive content, grammar and/or spelling problems, incomprehensible and/or lack of context). Each corpus consists of 10,000 sentences, annotated by language experts.
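
The annotation scheme described above could be modelled roughly as follows. This is only an illustrative sketch: the class names, field names, and the `usable_for_teaching` helper are hypothetical and do not reflect the actual file format or schema of the corpora. It also assumes that a problematic sentence may carry more than one category, which the description does not state.

```python
from dataclasses import dataclass, field
from enum import Enum


class Category(Enum):
    """Problem categories listed in the corpus description."""
    OFFENSIVE = "offensive"
    VULGAR = "vulgar"
    SENSITIVE_CONTENT = "sensitive content"
    GRAMMAR_SPELLING = "grammar and/or spelling problems"
    INCOMPREHENSIBLE_NO_CONTEXT = "incomprehensible and/or lack of context"


@dataclass
class AnnotatedSentence:
    """Hypothetical record for one annotated sentence (not the real schema)."""
    text: str
    problematic: bool
    # Assumption: several categories may apply to a single sentence.
    categories: set[Category] = field(default_factory=set)

    def __post_init__(self):
        # Only sentences labelled as problematic have category annotations.
        if not self.problematic and self.categories:
            raise ValueError("only problematic sentences carry categories")


def usable_for_teaching(sentences):
    """Keep the sentences labelled non-problematic."""
    return [s for s in sentences if not s.problematic]
```

A filter of this kind reflects the intended use of the labels: selecting example sentences that are suitable for teaching material.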