Jump to content

Translations:Spoken corpora/13/en

From Clarin K-Centre
Revision as of 17:45, 3 April 2025 by FuzzyBot (talk | contribs) (Importing a new version from external source)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

SABeD -- Spoken Academic Belgian Dutch

The Spoken Academic Belgian Dutch Corpus consists of 200 lectures given in higher education institutions in Flanders. The first 25 and the last 5 minutes of each lecture were transcribed using an ASR system tuned to Belgian Dutch and then manual utterance segmentation was applied, followed by manual correction of the automated transcription. The resulting text is processed with the FROG language analyser.