Children's language
Revision as of 11:20, 29 November 2024
Jasmin Speech corpus
BasiLex-corpus
The BasiLex corpus is an annotated collection of texts written for children aged four to twelve.
- version 1.0 (2015)
- Tellings, A., Hulsbosch, M., Vermeer, A. & van den Bosch, A. (2015). BasiLex: an 11.5-million words corpus of Dutch texts written for children. Computational Linguistics in the Netherlands Journal 4, 191-208
- Download page
BasiScript-corpus
The BasiScript corpus is an annotated collection of texts written by children aged four to twelve.
- version 1.0 (2015)
- Project page
- Download page
CHILDES
CHILDES contains a large collection of corpora: datasets of transcripts of child-adult interactions, typically annotated and searchable. They include conversations, storytelling, and other linguistic exchanges, collected from children across various languages, ages, and contexts.
- Index to CHILDES data for Dutch and Afrikaans
- Browse the Dutch database online
Subcorpora: