Speech recognition: Difference between revisions
Jump to navigation
Jump to search
No edit summary |
(Marked this version for translation) |
||
(7 intermediate revisions by 3 users not shown) | |||
Line 1: | Line 1: | ||
<languages/> | |||
<translate> | |||
== BAS Web Services== <!--T:10--> | |||
<!--T:11--> | |||
The BAS Web Services are a rich set of tools for speech sciences and technology. Tools include: | |||
* Automated speech recognition, including several models for Dutch | |||
* Anonymizer | |||
* Audio segmentation tool on the basis of transcripts | |||
* Speaker diarisation | |||
* Voice activity detection | |||
<!--T:12--> | |||
*[https://clarin.phonetik.uni-muenchen.de/BASWebServices/interface Webinterface] | |||
<!--T:1--> | |||
==LaMachine webservices== | ==LaMachine webservices== | ||
There are several speech recognition | There are several speech recognition [https://webservices.cls.ru.nl/ web services] at Radboud University | ||
<!--T:2--> | |||
==Speech Recognition for Belgian Dutch== | ==Speech Recognition for Belgian Dutch== | ||
Since April 2022, there is a new ASR engine available, specifically suited for speech recognition for Belgian Dutch. | Since April 2022, there is a new ASR engine available, specifically suited for speech recognition for Belgian Dutch. It is running at KU Leuven. | ||
<!--T:3--> | |||
*[https://www.spraak.org/webservice/dutch_asr/ Online webservice] | *[https://www.spraak.org/webservice/dutch_asr/ Online webservice] | ||
*[https://clinjournal.org/clinj/article/view/119 Scientific publication about speech recognition engine] | *[https://clinjournal.org/clinj/article/view/119 Scientific publication about speech recognition engine] | ||
<!--T:4--> | |||
==HENSOLDT ANALYTICS Speech-to-text for Dutch== | ==HENSOLDT ANALYTICS Speech-to-text for Dutch== | ||
The [https://european-language-grid.eu European Language Grid] hosts this speech recognition service with demo at | The [https://european-language-grid.eu European Language Grid] hosts this speech recognition service with demo at | ||
[https://live.european-language-grid.eu/catalogue/tool-service/20900 https://live.european-language-grid.eu/catalogue/tool-service/20900] | [https://live.european-language-grid.eu/catalogue/tool-service/20900 https://live.european-language-grid.eu/catalogue/tool-service/20900] | ||
<!--T:5--> | |||
==Punctuation Insertion== | ==Punctuation Insertion== | ||
AS ASR output often consists of streams of words, you may want to automatically insert punctuation. | AS ASR output often consists of streams of words, you may want to automatically insert punctuation. | ||
<!--T:6--> | |||
*[https://huggingface.co/oliverguhr/fullstop-dutch-sonar-punctuation-prediction?text=hervatting+van+de+zitting+ik+verklaar+de+zitting+van+het+europees+parlement+die+op+vrijdag+17+december+werd+onderbroken+te+zijn+hervat HuggingFace model] | *[https://huggingface.co/oliverguhr/fullstop-dutch-sonar-punctuation-prediction?text=hervatting+van+de+zitting+ik+verklaar+de+zitting+van+het+europees+parlement+die+op+vrijdag+17+december+werd+onderbroken+te+zijn+hervat HuggingFace model] | ||
*[https://github.com/VincentCCL/Segment_FullStop/blob/main/Segment_FullStop.py Python script that accepts txt file as input and returns punctuated txt as output] | *[https://github.com/VincentCCL/Segment_FullStop/blob/main/Segment_FullStop.py Python script that accepts txt file as input and returns punctuated txt as output] | ||
<!--T:7--> | |||
==Whisper model from OpenAI== | ==Whisper model from OpenAI== | ||
ASR for multiple languages, including Dutch is available from Whisper. Full model download is possible. | ASR for multiple languages, including Dutch is available from Whisper. Full model download is possible. | ||
<!--T:8--> | |||
*[https://openai.com/research/whisper Webpage] | *[https://openai.com/research/whisper Webpage] | ||
*[https://github.com/openai/whisper Github page] | *[https://github.com/openai/whisper Github page] | ||
*[https://www.youtube.com/watch?v=ABFqbY_rmEk YouTube video] explaining how to install whisper on your windows machine | |||
<!--T:9--> | |||
==Microsoft Transcriber== | ==Microsoft Transcriber== | ||
*[https://support.microsoft.com/nl-nl/office/uw-opnamen-transcriberen-7fc2efec-245e-45f0-b053-2a97531ecf57 Website in Dutch] | *[https://support.microsoft.com/nl-nl/office/uw-opnamen-transcriberen-7fc2efec-245e-45f0-b053-2a97531ecf57 Website in Dutch] | ||
</translate> |
Latest revision as of 14:10, 19 November 2024
BAS Web Services
The BAS Web Services are a rich set of tools for speech sciences and technology. Tools include:
- Automated speech recognition, including several models for Dutch
- Anonymizer
- Audio segmentation tool on the basis of transcripts
- Speaker diarisation
- Voice activity detection
LaMachine webservices
There are several speech recognition web services at Radboud University
Speech Recognition for Belgian Dutch
Since April 2022, there is a new ASR engine available, specifically suited for speech recognition for Belgian Dutch. It is running at KU Leuven.
HENSOLDT ANALYTICS Speech-to-text for Dutch
The European Language Grid hosts this speech recognition service with demo at https://live.european-language-grid.eu/catalogue/tool-service/20900
Punctuation Insertion
AS ASR output often consists of streams of words, you may want to automatically insert punctuation.
Whisper model from OpenAI
ASR for multiple languages, including Dutch is available from Whisper. Full model download is possible.
- Webpage
- Github page
- YouTube video explaining how to install whisper on your windows machine