Speech recognition: Difference between revisions

From Clarin K-Centre
Jump to navigation Jump to search
No edit summary
Line 17: Line 17:
*[https://huggingface.co/oliverguhr/fullstop-dutch-sonar-punctuation-prediction?text=hervatting+van+de+zitting+ik+verklaar+de+zitting+van+het+europees+parlement+die+op+vrijdag+17+december+werd+onderbroken+te+zijn+hervat HuggingFace model]
*[https://huggingface.co/oliverguhr/fullstop-dutch-sonar-punctuation-prediction?text=hervatting+van+de+zitting+ik+verklaar+de+zitting+van+het+europees+parlement+die+op+vrijdag+17+december+werd+onderbroken+te+zijn+hervat HuggingFace model]
*[https://github.com/VincentCCL/Segment_FullStop/blob/main/Segment_FullStop.py Python script that accepts txt file as input and returns punctuated txt as output]
*[https://github.com/VincentCCL/Segment_FullStop/blob/main/Segment_FullStop.py Python script that accepts txt file as input and returns punctuated txt as output]
==Whisper model from OpenAI==
ASR for multiple languages, including Dutch is available from Whisper. Full model download is possible.
*[https://openai.com/research/whisper Webpage]
*[https://github.com/openai/whisper Github page]

Revision as of 07:26, 8 September 2023

LaMachine webservices

There are several speech recognition services web services at Radboud University

Speech Recognition for Belgian Dutch

Since April 2022, there is a new ASR engine available, specifically suited for speech recognition for Belgian Dutch.

HENSOLDT ANALYTICS Speech-to-text for Dutch

The European Language Grid hosts this speech recognition service with demo at https://live.european-language-grid.eu/catalogue/tool-service/20900

Punctuation Insertion

AS ASR output often consists of streams of words, you may want to automatically insert punctuation.

Whisper model from OpenAI

ASR for multiple languages, including Dutch is available from Whisper. Full model download is possible.