
Speech recognition: Difference between revisions

From Clarin K-Centre
<languages/>
<translate>
==BAS Web Services== <!--T:10-->
<!--T:11-->
The BAS Web Services are a rich set of tools for speech sciences and technology. Tools include:
* Automated speech recognition, including several models for Dutch
* Anonymizer
* Audio segmentation tool on the basis of transcripts
* Speaker diarisation
* Voice activity detection
<!--T:12-->
*[https://clarin.phonetik.uni-muenchen.de/BASWebServices/interface Webinterface] (requires CLARIN login)
<!--T:1-->
==LaMachine webservices==
There are several speech recognition [https://webservices.cls.ru.nl/ web services] at Radboud University.
 
==Speech Recognition for Belgian Dutch: NeLF== <!--T:2-->
 
<!--T:13-->
API and browser access to a state-of-the-art speech recognition system for Belgian Dutch, including dialect speech recognition, developed by KU Leuven and UGent.
 
<!--T:14-->
Access requires a login, which can be requested but is only activated after manual approval.


<!--T:15-->
[https://www.nelfproject.be/web_service.php NeLF Website]


<!--T:16-->
==HENSOLDT ANALYTICS Speech-to-text for Dutch==
The [https://european-language-grid.eu European Language Grid] hosts this speech recognition service, together with a demo, at
[https://live.european-language-grid.eu/catalogue/tool-service/20900 https://live.european-language-grid.eu/catalogue/tool-service/20900]
 
<!--T:5-->
==Punctuation Insertion==
As ASR output often consists of an unpunctuated stream of words, you may want to insert punctuation automatically.
 
<!--T:6-->
*[https://huggingface.co/oliverguhr/fullstop-dutch-sonar-punctuation-prediction?text=hervatting+van+de+zitting+ik+verklaar+de+zitting+van+het+europees+parlement+die+op+vrijdag+17+december+werd+onderbroken+te+zijn+hervat HuggingFace model]
*[https://github.com/VincentCCL/Segment_FullStop/blob/main/Segment_FullStop.py Python script that accepts txt file as input and returns punctuated txt as output]
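
As a rough illustration of how the HuggingFace model listed above might be applied to ASR output, the sketch below uses the third-party <code>deepmultilingualpunctuation</code> package, which is an assumption here and is not necessarily what the linked script uses. The model is assumed to predict punctuation only, so the helper <code>capitalize_sentences</code> (a hypothetical name introduced for this example) restores sentence-initial capitals afterwards.

```python
import re

# Model identifier taken from the HuggingFace link above.
MODEL_NAME = "oliverguhr/fullstop-dutch-sonar-punctuation-prediction"


def capitalize_sentences(text: str) -> str:
    """Uppercase the first character of the text and of every word
    that follows sentence-final punctuation (. ? !)."""
    return re.sub(
        r"(^|[.?!]\s+)(\w)",
        lambda m: m.group(1) + m.group(2).upper(),
        text,
    )


def punctuate(text: str) -> str:
    """Insert punctuation into raw ASR text, then fix capitalisation.

    Assumption: the deepmultilingualpunctuation package is installed
    (pip install deepmultilingualpunctuation). It is imported lazily so
    that the helper above can be used without it.
    """
    from deepmultilingualpunctuation import PunctuationModel

    model = PunctuationModel(model=MODEL_NAME)
    return capitalize_sentences(model.restore_punctuation(text))
```

Calling <code>punctuate("hervatting van de zitting ik verklaar de zitting hervat")</code> downloads the model on first use; the exact punctuation returned depends on the model and is not shown here.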
 
<!--T:7-->
==Whisper model from OpenAI==
Whisper provides ASR for multiple languages, including Dutch. The full models can be downloaded and run locally.
 
<!--T:8-->
*[https://openai.com/research/whisper Webpage]
*[https://github.com/openai/whisper GitHub page]
*[https://www.youtube.com/watch?v=ABFqbY_rmEk YouTube video] explaining how to install Whisper on a Windows machine
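
Once Whisper is installed, transcribing a Dutch recording from Python could look roughly like the sketch below. It assumes the <code>openai-whisper</code> package and <code>ffmpeg</code> are available; the model size <code>"small"</code> and the helper names <code>format_timestamp</code>, <code>format_segments</code> and <code>transcribe_dutch</code> are illustrative choices, not part of Whisper itself.

```python
def format_timestamp(seconds: float) -> str:
    """Render a time offset in seconds as HH:MM:SS (fractions dropped)."""
    hours, remainder = divmod(int(seconds), 3600)
    minutes, secs = divmod(remainder, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


def format_segments(segments) -> str:
    """Turn Whisper-style segments (dicts with 'start' and 'text')
    into one timestamped line per segment."""
    return "\n".join(
        f"[{format_timestamp(seg['start'])}] {seg['text'].strip()}"
        for seg in segments
    )


def transcribe_dutch(audio_path: str, model_size: str = "small") -> str:
    """Transcribe an audio file as Dutch and return timestamped lines.

    Assumption: openai-whisper is installed (pip install openai-whisper)
    and ffmpeg is on the PATH. Imported lazily so the formatting helpers
    work without it.
    """
    import whisper

    model = whisper.load_model(model_size)  # downloads weights on first use
    result = model.transcribe(audio_path, language="nl")
    return format_segments(result["segments"])
```

Fixing <code>language="nl"</code> skips Whisper's automatic language detection; larger model sizes are generally more accurate but slower and need more memory.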
 
<!--T:9-->
==Microsoft Transcriber==
*[https://support.microsoft.com/nl-nl/office/uw-opnamen-transcriberen-7fc2efec-245e-45f0-b053-2a97531ecf57 Website in Dutch]
</translate>

Revision as of 09:41, 5 May 2025
