Speech recognition: Difference between revisions

From Clarin K-Centre
Jump to navigation Jump to search
(3 intermediate revisions by the same user not shown)
Line 3: Line 3:


==Speech Recognition for Belgian Dutch==
==Speech Recognition for Belgian Dutch==
Since April 2022, there is a new ASR engine available, specifically suited for speech recognition for Belgian Dutch.
Since April 2022, there is a new ASR engine available, specifically suited for speech recognition for Belgian Dutch. It is running at KU Leuven.


*[https://www.spraak.org/webservice/dutch_asr/ Online webservice]
*[https://www.spraak.org/webservice/dutch_asr/ Online webservice]
Line 17: Line 17:
*[https://huggingface.co/oliverguhr/fullstop-dutch-sonar-punctuation-prediction?text=hervatting+van+de+zitting+ik+verklaar+de+zitting+van+het+europees+parlement+die+op+vrijdag+17+december+werd+onderbroken+te+zijn+hervat HuggingFace model]
*[https://huggingface.co/oliverguhr/fullstop-dutch-sonar-punctuation-prediction?text=hervatting+van+de+zitting+ik+verklaar+de+zitting+van+het+europees+parlement+die+op+vrijdag+17+december+werd+onderbroken+te+zijn+hervat HuggingFace model]
*[https://github.com/VincentCCL/Segment_FullStop/blob/main/Segment_FullStop.py Python script that accepts txt file as input and returns punctuated txt as output]
*[https://github.com/VincentCCL/Segment_FullStop/blob/main/Segment_FullStop.py Python script that accepts txt file as input and returns punctuated txt as output]
==Whisper model from OpenAI==
ASR for multiple languages, including Dutch is available from Whisper. Full model download is possible.
*[https://openai.com/research/whisper Webpage]
*[https://github.com/openai/whisper Github page]
*[https://www.youtube.com/watch?v=ABFqbY_rmEk YouTube video] explaining how to install whisper on your windows machine
==Microsoft Transcriber==
*[https://support.microsoft.com/nl-nl/office/uw-opnamen-transcriberen-7fc2efec-245e-45f0-b053-2a97531ecf57 Website in Dutch]

Revision as of 21:28, 24 October 2023

LaMachine webservices

There are several speech recognition services web services at Radboud University

Speech Recognition for Belgian Dutch

Since April 2022, there is a new ASR engine available, specifically suited for speech recognition for Belgian Dutch. It is running at KU Leuven.

HENSOLDT ANALYTICS Speech-to-text for Dutch

The European Language Grid hosts this speech recognition service with demo at https://live.european-language-grid.eu/catalogue/tool-service/20900

Punctuation Insertion

AS ASR output often consists of streams of words, you may want to automatically insert punctuation.

Whisper model from OpenAI

ASR for multiple languages, including Dutch is available from Whisper. Full model download is possible.

Microsoft Transcriber