Newspaper corpora: Difference between revisions
Jump to navigation
Jump to search
No edit summary |
No edit summary |
||
Line 3: | Line 3: | ||
We have the following corpora: | We have the following corpora: | ||
* Wablieft corpus | * Wablieft corpus | ||
** 2011-2017 archive of easy language newspaper in Belgian Dutch. | |||
** tagged, lemmatized, parsed, available in several file formats | |||
** version 1.2 | |||
** [https://limo.libis.be/primo-explore/fulldisplay?docid=LIRIAS2859003&context=L&vid=Lirias&search_scope=Lirias&tab=default_tab&lang=en_US&fromSitemap=1 Vincent Vandeghinste, Bram Bulté & Liesbeth Augustinus (2019). Wablieft: An Easy-to-Read Newspaper corpus for Dutch. In CLARIN Annual Conference 2019 Proceedings. pp.188-191. Leipzig, Germany.] |
Revision as of 15:17, 1 February 2021
Newspaper corpora are corpora which exclusively consist of newspaper material.
We have the following corpora:
- Wablieft corpus
- 2011-2017 archive of easy language newspaper in Belgian Dutch.
- tagged, lemmatized, parsed, available in several file formats
- version 1.2
- Vincent Vandeghinste, Bram Bulté & Liesbeth Augustinus (2019). Wablieft: An Easy-to-Read Newspaper corpus for Dutch. In CLARIN Annual Conference 2019 Proceedings. pp.188-191. Leipzig, Germany.