Jump to content

Speech recognition

From Clarin K-Centre
Revision as of 11:23, 13 October 2025 by Vincent (talk | contribs)

This page contains information on Dutch speech recognition systems.

Online services

BAS Web Services

The BAS Web Services are a rich set of tools for speech sciences and technology. Tools include:

  • Automated speech recognition, including several models for Dutch
  • Anonymizer
  • Audio segmentation tool on the basis of transcripts
  • Speaker diarisation
  • Voice activity detection

LaMachine webservices

There are several speech recognition web services at Radboud University

Speech Recognition for Belgian Dutch: NeLF

API and browser access to a state-of-the-art speech recognition system for Belgian Dutch, including dialect speech recognition, developed by KU Leuven and UGent.

Requires a login which can be requested, but you have to await manual approval.

NeLF Website

HENSOLDT ANALYTICS Speech-to-text for Dutch (demo)

The European Language Grid hosts this speech recognition service with demo at https://live.european-language-grid.eu/catalogue/tool-service/23090/try%20out/

Microsoft Transcriber

To install

noScribe

  • AI-based software that transcribes interviews for qualitative social research or journalistic use
  • free and open source (GPL-3.0)
  • runs completely local on your computer
  • can distinguish different speakers and understands around 60 languages
  • includes a nice editor to review, verify and correct the resulting transcript
  • standing on the shoulders of giants: Whisper from OpenAI, faster-whisper by Guillaume Klein and pyannote from Hervé Bredin
  • Github page

Whisper model from OpenAI

ASR for multiple languages, including Dutch is available from Whisper. Full model download is possible.


Punctuation Insertion

AS ASR output often consists of streams of words, you may want to automatically insert punctuation.