Speech recognition: Difference between revisions

From Clarin K-Centre
Jump to navigation Jump to search
No edit summary
(Marked this version for translation)
 
(2 intermediate revisions by 2 users not shown)
Line 1: Line 1:
<languages/>
<languages/>
<translate>
<translate>
== BAS Web Services== <!--T:10-->
<!--T:11-->
The BAS Web Services are a rich set of tools for speech sciences and technology. Tools include:
* Automated speech recognition, including several models for Dutch
* Anonymizer
* Audio segmentation tool on the basis of transcripts
* Speaker diarisation
* Voice activity detection
<!--T:12-->
*[https://clarin.phonetik.uni-muenchen.de/BASWebServices/interface Webinterface]
<!--T:1-->
<!--T:1-->
==LaMachine webservices==
==LaMachine webservices==
There are several speech recognition services [https://webservices.cls.ru.nl/ web services] at Radboud University
There are several speech recognition [https://webservices.cls.ru.nl/ web services] at Radboud University


<!--T:2-->
<!--T:2-->

Latest revision as of 14:10, 19 November 2024

Other languages:

BAS Web Services

The BAS Web Services are a rich set of tools for speech sciences and technology. Tools include:

  • Automated speech recognition, including several models for Dutch
  • Anonymizer
  • Audio segmentation tool on the basis of transcripts
  • Speaker diarisation
  • Voice activity detection


LaMachine webservices

There are several speech recognition web services at Radboud University

Speech Recognition for Belgian Dutch

Since April 2022, there is a new ASR engine available, specifically suited for speech recognition for Belgian Dutch. It is running at KU Leuven.

HENSOLDT ANALYTICS Speech-to-text for Dutch

The European Language Grid hosts this speech recognition service with demo at https://live.european-language-grid.eu/catalogue/tool-service/20900

Punctuation Insertion

AS ASR output often consists of streams of words, you may want to automatically insert punctuation.

Whisper model from OpenAI

ASR for multiple languages, including Dutch is available from Whisper. Full model download is possible.

Microsoft Transcriber