Speech recognition: Difference between revisions
No edit summary |
Marked this version for translation |
||
| (13 intermediate revisions by 3 users not shown) | |||
| Line 1: | Line 1: | ||
<languages/> | |||
<translate> | <translate> | ||
<!--T:17--> | |||
This page contains information on Dutch speech recognition systems. | |||
==Online services== <!--T:18--> | |||
==HENSOLDT ANALYTICS Speech-to-text for Dutch== | === BAS Web Services=== <!--T:10--> | ||
<!--T:11--> | |||
The BAS Web Services are a rich set of tools for speech sciences and technology. Tools include: | |||
* Automated speech recognition, including several models for Dutch | |||
* Anonymizer | |||
* Audio segmentation tool on the basis of transcripts | |||
* Speaker diarisation | |||
* Voice activity detection | |||
<!--T:12--> | |||
*[https://clarin.phonetik.uni-muenchen.de/BASWebServices/interface Webinterface] (requires CLARIN login) | |||
===Digital Europe Speech-to-Text=== <!--T:25--> | |||
<!--T:26--> | |||
Speech recognition built by the European Commission. Requires an EU login. | |||
<!--T:27--> | |||
*[https://language-tools.ec.europa.eu/SpeechServices/Transcription Website] | |||
===LaMachine webservices=== <!--T:1--> | |||
<!--T:28--> | |||
LaMachine is end-of-life and being deprecated. See [https://github.com/proycon/LaMachine/issues/214 this post] for reasons and alternative solutions. | |||
===Speech Recognition for Belgian Dutch: NeLF=== <!--T:2--> | |||
<!--T:13--> | |||
API and browser access to a state-of-the-art speech recognition system for Belgian Dutch, including dialect speech recognition, developed by KU Leuven and UGent. | |||
<!--T:14--> | |||
Requires a login which can be requested, but you have to await manual approval. | |||
<!--T:15--> | |||
[https://www.nelfproject.be/web_service.php NeLF Website] | |||
<!--T:16--> | |||
===HENSOLDT ANALYTICS Speech-to-text for Dutch (demo)=== | |||
The [https://european-language-grid.eu European Language Grid] hosts this speech recognition service with demo at | The [https://european-language-grid.eu European Language Grid] hosts this speech recognition service with demo at | ||
[https://live.european-language-grid.eu/catalogue/tool-service/ | [https://live.european-language-grid.eu/catalogue/tool-service/23090/try%20out/ https://live.european-language-grid.eu/catalogue/tool-service/23090/try%20out/] | ||
===Microsoft Transcriber=== <!--T:9--> | |||
<!--T:19--> | |||
* in Word 365 | |||
*[https://support.microsoft.com/nl-nl/office/uw-opnamen-transcriberen-7fc2efec-245e-45f0-b053-2a97531ecf57 Website in Dutch] | |||
==To install== <!--T:20--> | |||
== | ===noScribe=== <!--T:21--> | ||
<!--T:22--> | |||
*[https://github.com/ | *AI-based software that transcribes interviews for qualitative social research or journalistic use | ||
*free and open source (GPL-3.0) | |||
*runs completely local on your computer | |||
* can distinguish different speakers and understands around 60 languages | |||
* includes a nice editor to review, verify and correct the resulting transcript | |||
* standing on the shoulders of giants: Whisper from OpenAI, faster-whisper by Guillaume Klein and pyannote from Hervé Bredin | |||
* [https://github.com/kaixxx/noScribe Github page] | |||
==Whisper model from OpenAI== | <!--T:23--> | ||
===Whisper model from OpenAI=== | |||
ASR for multiple languages, including Dutch is available from Whisper. Full model download is possible. | ASR for multiple languages, including Dutch is available from Whisper. Full model download is possible. | ||
<!--T:8--> | |||
*[https://openai.com/research/whisper Webpage] | *[https://openai.com/research/whisper Webpage] | ||
*[https://github.com/openai/whisper Github page] | *[https://github.com/openai/whisper Github page] | ||
*[https://www.youtube.com/watch?v=ABFqbY_rmEk YouTube video] explaining how to install whisper on your windows machine | *[https://www.youtube.com/watch?v=ABFqbY_rmEk YouTube video] explaining how to install whisper on your windows machine | ||
== | <!--T:24--> | ||
*[https:// | ==Leaderboard== | ||
* [https://opensource-spraakherkenning-nl.github.io/ASR_NL_results/UT/N-Best/nbest_res.html Website] | |||
<!--T:5--> | |||
==Punctuation Insertion== | |||
AS ASR output often consists of streams of words, you may want to automatically insert punctuation. | |||
<!--T:6--> | |||
*[https://huggingface.co/oliverguhr/fullstop-dutch-sonar-punctuation-prediction?text=hervatting+van+de+zitting+ik+verklaar+de+zitting+van+het+europees+parlement+die+op+vrijdag+17+december+werd+onderbroken+te+zijn+hervat HuggingFace model] | |||
*[https://github.com/VincentCCL/Segment_FullStop/blob/main/Segment_FullStop.py Python script that accepts txt file as input and returns punctuated txt as output] | |||
</translate> | </translate> | ||
Latest revision as of 18:05, 13 November 2025
This page contains information on Dutch speech recognition systems.
Online services
BAS Web Services
The BAS Web Services are a rich set of tools for speech sciences and technology. Tools include:
- Automated speech recognition, including several models for Dutch
- Anonymizer
- Audio segmentation tool on the basis of transcripts
- Speaker diarisation
- Voice activity detection
- Webinterface (requires CLARIN login)
Digital Europe Speech-to-Text
Speech recognition built by the European Commission. Requires an EU login.
LaMachine webservices
LaMachine is end-of-life and being deprecated. See this post for reasons and alternative solutions.
Speech Recognition for Belgian Dutch: NeLF
API and browser access to a state-of-the-art speech recognition system for Belgian Dutch, including dialect speech recognition, developed by KU Leuven and UGent.
Requires a login which can be requested, but you have to await manual approval.
HENSOLDT ANALYTICS Speech-to-text for Dutch (demo)
The European Language Grid hosts this speech recognition service with demo at https://live.european-language-grid.eu/catalogue/tool-service/23090/try%20out/
Microsoft Transcriber
- in Word 365
- Website in Dutch
To install
noScribe
- AI-based software that transcribes interviews for qualitative social research or journalistic use
- free and open source (GPL-3.0)
- runs completely local on your computer
- can distinguish different speakers and understands around 60 languages
- includes a nice editor to review, verify and correct the resulting transcript
- standing on the shoulders of giants: Whisper from OpenAI, faster-whisper by Guillaume Klein and pyannote from Hervé Bredin
- Github page
Whisper model from OpenAI
ASR for multiple languages, including Dutch is available from Whisper. Full model download is possible.
- Webpage
- Github page
- YouTube video explaining how to install whisper on your windows machine
Leaderboard
Punctuation Insertion
AS ASR output often consists of streams of words, you may want to automatically insert punctuation.