K-Dutch/nl: Difference between revisions

From Clarin K-Centre
Jump to navigation Jump to search
No edit summary
(Updating to match new version of source page)
Line 1: Line 1:
<languages/>
<languages/>
<div lang="en" dir="ltr" class="mw-content-ltr">
<span style="color:white">Mediawiki:Mainpage</span><br>
<span style="color:white">Mediawiki:Mainpage</span><br>
</div>


<div lang="en" dir="ltr" class="mw-content-ltr">
[[File:K-centre-logo.jpg|frameless|right]]
[[File:K-centre-logo.jpg|frameless|right]]
Welkom op [[K-Dutch]], de plek voor iedereen die iets over het Nederlands wil weten: taalkundige eigenschappen, taaladvies, beschikbare applicaties en data, etymologie, dialect,...  
Welcome to [[K-Dutch]], the place for anyone who wants to know anything about the Dutch language: linguistic properties, language advice, available tools and resources, etymology, dialects...
</div>


K-Dutch is een [https://www.clarin.eu/content/knowledge-centres CLARIN-kenniscentrum]. Het wordt gehost door het [https://www.ivdnt.org Instituut voor de Nederlandse Taal], dat ook een gecertificeerd [https://www.clarin.eu/content/certified-centres CLARIN-B-centrum] is en dat verschillende  resources voor het Nederlands host. Over het algemeen zijn deze gratis en vrij beschikbaar voor onderzoeksdoeleinden. K-Dutch is een initiatief van [https://www.clarin.eu CLARIN-ERIC] en [https://clarin-be.ivdnt.org CLARIN-BE].
<div lang="en" dir="ltr" class="mw-content-ltr">
K-Dutch is a [https://www.clarin.eu/content/knowledge-centres CLARIN Knowledge Centre]. It is hosted by the [https://www.ivdnt.org Instituut voor de Nederlandse Taal] (Dutch Language Institute) , which is also a [https://www.clarin.eu/content/certified-centres CLARIN-B centre] and host of many resources for Dutch, which are, in general, freely available for research purposes. K-Dutch is an initiative of [https://www.clarin.eu CLARIN-ERIC] and [https://clarin-be.ivdnt.org CLARIN-BE].
</div>


De status van het Nederlands met betrekking tot taaltechnologie wordt beschreven in
<div lang="en" dir="ltr" class="mw-content-ltr">
The status of Dutch with respect to language technologies is described in
</div>


* Korte versie (Engels): [https://link.springer.com/chapter/10.1007/978-3-031-28819-7_12https://link.springer.com/chapter/10.1007/978-3-031-28819-7_12 Steurs, Vandeghinste and Daelemans (2023).] Language Report Dutch. In : Rehm, G., Way, A. (eds) ''European Language Equality''. Cognitive Technologies. Springer, Cham. <nowiki>https://doi.org/10.1007/978-3-031-28819-7_12</nowiki>
<div lang="en" dir="ltr" class="mw-content-ltr">
* Langere versie (Engels): [https://european-language-equality.eu/wp-content/uploads/2022/03/ELE___Deliverable_D1_10__Language_Report_Dutch_.pdf Steurs, Vandeghinste and Daelemans (2022). Report on Dutch.] Project deliverable. European Language Equality.
* Short version: [https://doi.org/10.1007/978-3-031-28819-7_12 Steurs, Vandeghinste and Daelemans (2023).] Language Report Dutch. In : Rehm, G., Way, A. (eds) ''European Language Equality''. Cognitive Technologies. Springer, Cham.  
* Longer version: [https://european-language-equality.eu/wp-content/uploads/2022/03/ELE___Deliverable_D1_10__Language_Report_Dutch_.pdf Steurs, Vandeghinste and Daelemans (2022). Report on Dutch.] Project deliverable. European Language Equality.
</div>


Uw bijdrage aan deze pagina's is welkom. Gelieve de INT-servicedesk te contacteren [mailto://servicedesk@ivdnt.org servicedesk@ivdnt.org] met als onderwerp K-Dutch
<div lang="en" dir="ltr" class="mw-content-ltr">
You are most welcome to contribute to these pages, please contact [mailto://servicedesk@ivdnt.org servicedesk@ivdnt.org] with as subject K-Dutch, and we will be in touch.
==Linguisitic topics==
</div>


==Taalkundige onderwerpen==
<div lang="en" dir="ltr" class="mw-content-ltr">
 
===[[Grammar]]===
<span id="Grammar"></span>
</div>
===[[Grammatica]]===


<div lang="en" dir="ltr" class="mw-content-ltr">
* [[Grammar#Phonology,_Morphology_and_Syntax:_Taalportaal|Phonology, Morphology and Syntax: Taalportaal]]
* [[Grammar#Phonology,_Morphology_and_Syntax:_Taalportaal|Phonology, Morphology and Syntax: Taalportaal]]
* [[Grammar#Morfosyntaxis|Morfosyntaxis]]
* [[Grammar#Morphosyntax|Morphosyntax]]
* [[Grammar#Syntactische_Atlas_van_de_Nederlandse_Dialecten_(SAND)|Syntactische Atlas van de Nederlandse Dialecten (SAND)]]
* [[Grammar#Syntactic_Atlas_of_the_Dutch_dialects_(SAND)|Syntactic Atlas of the Dutch dialects (SAND)]]
* [[Grammar#Nederlandse_beschrijvende_grammatica:_e-ANS|Nederlandse beschrijvende grammatica: e-ANS]]
* [[Grammar#Dutch_descriptive_grammar:_e-ANS_(in_Dutch)|Dutch descriptive grammar: e-ANS (in Dutch)]]
* [[Grammar#Grambank|Grambank]]
* [[Grammar#Grambank|Grambank]]
</div>


<span id="Lexicography"></span>
<div lang="en" dir="ltr" class="mw-content-ltr">
===[[Lexicografie]]===
===[[Lexicography]]===
</div>


* [[Lexicografie#Nederlandse_woordenboeken|Nederlandse woordenboeken]]
<div lang="en" dir="ltr" class="mw-content-ltr">
* [[Lexicografie#ELEXIS|Het ELEXIS-project]]
* [[Lexicography#Dutch_dictionaries|Dutch dictionaries]]
* [[Lexicography#Elexis|The Elexis Project]]
* [https://ivdnt.org/wp-content/uploads/2021/02/The-Future-of-Academic-Lexicography-A-White-Paper.pdf White paper]: The Future of Academic Lexicography
* [https://ivdnt.org/wp-content/uploads/2021/02/The-Future-of-Academic-Lexicography-A-White-Paper.pdf White paper]: The Future of Academic Lexicography
</div>


===[[Terminologie]]===
<div lang="en" dir="ltr" class="mw-content-ltr">
*[[Terminologie#Expertisecentrum_Nederlandstalige_Terminologie|Het Expertisecentrum Nederlandstalige Terminologie]]
===[[Terminology]]===
*[[Terminologie#Academische_taal|Academische taal]]
*[[Terminology#Centre_of_Expertise_for_Dutch_Terminology|The Centre of Expertise for Dutch Terminology]]
*[[Terminologie#Medische_terminologie|Medische terminologie]]
*[[Terminology#Academic_Language|Academic Language]]
*[[Terminologie#Nederlands_als_wetenschapstaal|Nederlands als wetenschapstaal]]
*[[Terminology#Medical_Terminology|Medical Terminology]]
*[[Terminologie#Juridische_terminologie|Juridische terminologie]]
*[[Terminology#Dutch_as_a_scientific_language|Dutch as a scientific language]]
*[[Terminology#Legal_Terminology|Legal Terminology]]
</div>


<div lang="en" dir="ltr" class="mw-content-ltr">
===[[Spelling]]===  
===[[Spelling]]===  
*[[Spelling#Woordenlijst.org_(Officiёle_Nederlandse_woorden_Lijst)|Woordenlijst.org (Officiёle Nederlandse woordenlijst)]]
*[[Spelling#Woordenlijst.org_(Official_Dutch_Word_List)|Woordenlijst.org (Official Dutch Word List)]]
*[[Spelling#Keurmerk_Spelling|Keurmerk Spelling]]
*[[Spelling#Spelling_Certification_Mark|Spelling Certification Mark]]
</div>


==Taalkundige bronnen: datasets==
<div lang="en" dir="ltr" class="mw-content-ltr">
==Linguistic resources: datasets==
===[[Corpora]]===
===[[Corpora]]===
===[[Lexicale bronnen]]===
* [[Newspaper corpora]]: corpora exclusively consisting of newspaper text
* [[Parliamentary corpora]]
* [[Computer-mediated communication corpora]]
* [[Corpora of academic texts]]
* [[Historical corpora]]
* [[L2 learner corpora]]
* [[Manually annotated corpora]]
* [[Multimodal corpora]]
* [[Parallel corpora]]
* [[Reference corpora]]
* [[Social media corpora]]
* [[Spoken corpora]]
* [[Sign Language corpora]]
* [[Propbanks]]: contains semantic role labels
* [[Treebanks]]
* [[Other corpora]]
</div>
 
<div lang="en" dir="ltr" class="mw-content-ltr">
===Lexical Resources===
* [[Lexica]]
* [[Dictionaries]]
* [[Conceptual Resources]]
* [[Wordlists]]
* [[Embeddings]]
* [[Lexica of terminology]]
* [[Ontologies]]
</div>
 
<div lang="en" dir="ltr" class="mw-content-ltr">
===N-grams===
===N-grams===
* [[Letterteken N-grams]]
* [[Character N-grams]]
</div>


==Applicaties voor het Nederlands==
<div lang="en" dir="ltr" class="mw-content-ltr">
===Normalisatie===
==Tools for Dutch==
* [[Formaatconversie]]
===Normalisation===
* [[Controleren van spelling]]
* [[Format conversion]]
*[https://dev.clarin.nl/node/1914 TiCCLops]: Text-Induced Corpus Clean-up online processing system: niet langer beschikbaar
* [[Spell checking]]
*[https://lt3.ugent.be/normalisation-demo/ Normalisatie-Demo]
*[https://dev.clarin.nl/node/1914 TiCCLops]: Text-Induced Corpus Clean-up online processing system: no longer available
*[https://lt3.ugent.be/normalisation-demo/ Normalisation Demo]
</div>


===De Nederlandse Taal leren===
<div lang="en" dir="ltr" class="mw-content-ltr">
===Language Learning===
*[https://schrijfassistent.be Schrijfassistent]
*[https://schrijfassistent.be Schrijfassistent]
*[http://schrijfassistent.standaard.be/ Schrijfassistent] van De Standaard
*[http://schrijfassistent.standaard.be/ Schrijfassistent] at De Standaard
*[https://www.nedbox.be NedBox]: online oefeningen om Nederlands te leren
*[https://www.nedbox.be NedBox]: Online exercises to learn Dutch
*[https://oefenen.nl/programma/soort/taal Oefenen.nl]: online oefeningen om Nederlands te leren
*[https://oefenen.nl/programma/soort/taal Oefenen.nl]: Online exercises to learn Dutch
*[http://woordcombinaties.ivdnt.org/ Woordcombinaties]: werkwoorden en hun combinatiepatronen
*[http://woordcombinaties.ivdnt.org/ Woordcombinaties]: Verbs and their combination patterns
*[https://orientplus.ucll.be/ Orient+]: een serieus spel om de academische woordenschat uit te breiden
*[https://orientplus.ucll.be/ Orient+]: A serious game to enhance academic vocabulary
*[https://www.taalwinkel.nl/ Taalwinkel]: taaladvies
*[https://www.taalwinkel.nl/ Taalwinkel]: Language Advice
</div>
 
<div lang="en" dir="ltr" class="mw-content-ltr">
===Automatic linguistic annotation===
* [[Basic language processing]]
* [[Deep parsing]]
<!-- ===Information extraction!-->
<!--* Processing of historical variants of Dutch!-->
<!--* Text mining!-->
</div>


===Automatische taalkundige annotatie===
<div lang="en" dir="ltr" class="mw-content-ltr">
* [[Eenvoudige Taalverwerking]]
===Speech processing===
* [[Diepe syntactische analyse (parsing)]]
* [[Spoken Language Recognition]]
<!-- ===Informatie-extractie!-->
* [[Speech recognition]]
<!--* Het verwerken van historische varianten van het Nederlands!-->
* Speech synthesis
<!--* Tekst-mining!-->
</div>


===Spraakverwerking===
<div lang="en" dir="ltr" class="mw-content-ltr">
* [[Gesproken taalherkenning]]
===Natural Language Processing===
* [[Spraakherkenning]]
* [[Language Modeling]]
* Spraaksynthese
* [[Machine translation]]
* [[Coreference resolution]]
* [[Compound splitting]]
* [[Word Sense Disambiguation]]
* [[Text classification]]
* [[Sentiment analysis]]
* [[Readability]]
* [[Clinical NLP]]
* [[Topic Modelling]]
</div>


===Natuurlijke Taalverwerking===
<div lang="en" dir="ltr" class="mw-content-ltr">
* [[Taalmodellering]]
===Resource querying===
* [[Machinevertaling]]
* [[Corpus querying]]
* [[Coreferentieresolutie]]
* [[Treebank querying]]
* [[Samenstellingen splitsen]]
</div>
* [[Woordbetekenisverheldering]]
* [[Tekstclassificatie]]
* [[Sentimentanalyse]]
* [[Leesbaarheid]]
* [[Klinische NTV]]


===Zoeken in bronnen===
<div lang="en" dir="ltr" class="mw-content-ltr">
* [[Zoeken in een Corpus]]
===Text simplification ===
* [[Zoeken in een Treebank]]
Publicly available websites:
</div>


===Machinevertaling===
<div lang="en" dir="ltr" class="mw-content-ltr">
====Vertaalmachines====
* [https://languagetool.org/nl/herschrijvingsfunctie Uw schrijfassistent]
Openbaar beschikbare vertaalmachines van of naar het Nederlands:
* [https://neuralwriter.com/nl/ Neural Writer]
</div>
 
<div lang="en" dir="ltr" class="mw-content-ltr">
Publicly available apps to download:
</div>
 
<div lang="en" dir="ltr" class="mw-content-ltr">
* [https://www.leessimpel.nl/ Lees Simpel App]
</div>
 
<div lang="en" dir="ltr" class="mw-content-ltr">
Not publicly available:
</div>
 
<div lang="en" dir="ltr" class="mw-content-ltr">
* [https://www2.deloitte.com/nl/nl/pages/customer-and-marketing/solutions/simone.html Simone]
* [https://tscan.hum.uu.nl/tscan/ T-scan]
</div>
 
<div lang="en" dir="ltr" class="mw-content-ltr">
* [https://tolkie.nl/schrijfhulp/ Tolkie]
</div>
 
<div lang="en" dir="ltr" class="mw-content-ltr">
===Machine translation===
====Translation Engines====
Publicly available machine translation engines from or to Dutch:
*[https://www.deepl.com/translator DeepL]
*[https://www.deepl.com/translator DeepL]
*[https://translate.google.com/ Google Vertalen]
*[https://translate.google.com/ Google translate]
*[https://www.bing.com/translator Bing Microsoftvertaler]
*[https://www.bing.com/translator Bing Microsoft translator]
*[https://www.reverso.net/ Reverso]
*[https://www.reverso.net/ Reverso]
*[https://webgate.ec.europa.eu/etranslation/public/welcome.html eTranslation van de Europese Unie]
*[https://webgate.ec.europa.eu/etranslation/public/welcome.html eTranslation from the European Union]
*[https://mateo.ivdnt.org/Translate MATEO No Language Left Behind]
*[https://mateo.ivdnt.org/Translate MATEO No Language Left Behind]
</div>


====Evaluatie van machinevertaling====
<div lang="en" dir="ltr" class="mw-content-ltr">
====MT Evaluation====
*[https://mateo.ivdnt.org/Evaluate MATEO Machine Translation Evaluation Online]
*[https://mateo.ivdnt.org/Evaluate MATEO Machine Translation Evaluation Online]
</div>


===Terminologie-extractie===
<div lang="en" dir="ltr" class="mw-content-ltr">
* [https://termtreffer.org/ Termtreffer]. Vraag een login aan via [mailto:terminologie@ivdnt.org terminologie@ivdnt.org].
===Terminology extraction===
* [https://lt3.ugent.be/dterminer D-Terminer demo]. Terminologie-extractie voor het Nederlands, Engels, Frans en Duits. (Rigouts Terryn, A. (2021). D-TERMINE: Data driven Term Extraction Methodologies Investigated [Doctoral thesis]. Ghent University.)
* [https://termtreffer.org/ Termtreffer]. Ask for login at [mailto:terminologie@ivdnt.org terminologie@ivdnt.org].
* [https://lt3.ugent.be/dterminer D-Terminer demo]. Terminology extraction for Dutch, English, French and German. (Rigouts Terryn, A. (2021). D-TERMINE: Data-driven Term Extraction Methodologies Investigated [Doctoral thesis]. Ghent University.)
</div>


===Terminologiebeheer===
<div lang="en" dir="ltr" class="mw-content-ltr">
* [https://iate.europa.eu/home IATE] (Interactive Terminology for Europe) is het terminologiebeheerssysteem van de Europese Unie. Het is het gedeelde beheerssysteem van de instellingen van de Europese Unie en het omvat meer dan 7 miljoen termen in 26 talen, die bij elkaar meer dan 100 domeinen van EU-wetgeving beslaan.
===Terminology management===
* [https://iate.europa.eu/home IATE] (Interactive Terminology for Europe) is the EU's terminology management system. It’s the shared terminology management system of the institutions of the European Union and it contains more than 7 million terms in 26 languages covering more than 100 domains of the EU legislation.
</div>


===Anders===
<div lang="en" dir="ltr" class="mw-content-ltr">
* Niet eerder genoemde [[CLARIN-projecten]] bij het Instituut voor de Nederlandse Taal (INT)
===Other===
* [https://webservices.cls.ru.nl/ Taal en Spraakapplicaties] van de Radboud Universiteit Nijmegen. bijv. [https://webservices.cls.ru.nl/tscan T-scan], een hulpmiddel voor de analyse van Nederlandse teksten waarmee de complexiteit van de tekst kan worden vastgesteld.
* Previously unmentioned [[CLARIN projects]] at INT
* [https://www.opener-project.eu/ OpeNER] is een taalanalyse-toolchain die (academische) onderzoekers en bedrijven helpt bij het begrijpen van natuurlijketaalanalyse. Het bestaat uit gemakkelijk te installeren, verbeteren en te configureren onderdelen om bijv. de taal van een tekst te detecteren, de tekstpolarisatie vast te stellen (sentimentanalyse), te detecteren welke onderwerpen zijn ingesloten in de tekst,... De ondersteunde taalset bestaat momenteel uit: Engels, Spaans, Italiaans, Duits en Nederlands.
* [https://webservices.cls.ru.nl/ Language and Speech Tools] at Radboud Nijmegen. e.g. [https://webservices.cls.ru.nl/tscan T-scan], an analysis tool for dutch texts to assess the complexity of the text.
* [https://gate.ac.uk/ GATE] (General Architecture for Text Engineering) is een verzameling van Java-hulpmiddelen die oorspronkelijk zijn ontwikkeld aan de Universiteit van Sheffield en die worden gebruikt voor meerdere natuurlijketaalverwerkingstaken, waaronder informatie-extractie. ([https://cloud.gate.ac.uk/shopfront#tagged=Dutch Nederlandse diensten in GATE Cloud]).
* [https://www.opener-project.eu/ OpeNER] is a language analysis toolchain helping (academic) researchers and companies make sense out of natural language analysis”. It consist of easy to install, improve and configure components to e.g. detect the language of a text, determine polarisation of texts (sentiment analysis), detect what topics are included in the text,... The supported language set currently consists of: English, Spanish, Italian, German and Dutch.
* [https://speech-repository.webcloud.ec.europa.eu/ Speech Repository] is een online e-learninghulpmiddel. Het bevat video-opnames van real-life speeches en op maat gemaakt pedagogisch materiaal, die de vertaler en vertalende studenten een kans bieden om te oefenen met hun vertaalvaardigheden en deze te verbeteren.
* [https://gate.ac.uk/ GATE] (General Architecture for Text Engineering) is a Java suite of tools originally developed at the University of Sheffield and it is used for many natural language processing tasks, including information extraction. ([https://cloud.gate.ac.uk/shopfront#tagged=Dutch Dutch services in GATE Cloud]).
* [https://subworkshop.sourceforge.net/ Subtitle Workshop] is een gratis applicatie voor het creëren, bewerken en omzetten van op tekst gebaseerde ondertitelingbestanden.  
* [https://speech-repository.webcloud.ec.europa.eu/ Speech Repository] is an online e-learning tool. It contains video recordings of real-life speeches and tailor-made pedagogical material speeches which give the interpreter and interpreting students an opportunity to practise and improve their interpretation skills.
* [https://youdescribe.org/ YouDescribe] is een gratis internetplatform voor het toevoegen van audiobeschrijving aan YouTube-content.
* [https://subworkshop.sourceforge.net/ Subtitle Workshop] is a free application for creating, editing, and converting text-based subtitle files.  
* [https://www.audacityteam.org/ Audacity] is een audio-opname- en bewerkingssoftwareapplicatie die open source is.
* [https://youdescribe.org/ YouDescribe] is a free, web-based platform for adding audio description to YouTube content.
* [https://www.audacityteam.org/ Audacity] is an audio recording and editing software application that is open source.
</div>


<div lang="en" dir="ltr" class="mw-content-ltr">
==Helpdesk==
==Helpdesk==
Voor informatie over het Nederlands: als u op deze wikipagina geen antwoord op uw vragen vindt, stuur dan een mail met uw vraag naar [mailto://servicedesk@ivdnt.org servicedesk@ivdnt.org ]. Uw vragen worden zo spoedig mogelijk doorgestuurd naar de geschikte experts en u zou binnen twee werkdagen een antwoord moeten ontvangen.
For information about Dutch: If you cannot find the answers to your questions on this wiki, you can send your question to [mailto://servicedesk@ivdnt.org servicedesk@ivdnt.org ]. Your questions will be forwarded as soon as possible to the appropriate experts and you should receive an answer within two working days.
</div>


U kunt ons ook om informatie en assistentie vragen i.v.m. het gebruik van taalmaterialen en applicaties.
<div lang="en" dir="ltr" class="mw-content-ltr">
You can also ask us for information and assistance with the use of data and tools.
</div>


==Andere Diensten==
<div lang="en" dir="ltr" class="mw-content-ltr">
* [[Beste toepassingsdocumenten en richtlijnen]]
==Other Services==
* [[Stages]]
* [[Best practice documents and guidelines]]
* [[Internships]]
* [[Consulting]]
* [[Consulting]]
* [[CLARIN]] voor het Nederlands
* [[CLARIN]] for Dutch
</div>
 
<div lang="en" dir="ltr" class="mw-content-ltr">
==Questions and Answers==
On the [[Q&A|Questions and Answers page]] we keep track of all questions we receive concerning Dutch. This will grow into a repository of K-Dutch answers to your questions.
</div>


==Vragen en Antwoorden==
<div lang="en" dir="ltr" class="mw-content-ltr">
Op de [[Q&A|Vragen- en antwoordenpagina]] houden we de vragen bij die we ontvingen. Dit groeit uit tot een opslagplaats van K-Dutch-antwoorden op uw vragen.
Note that there is also a very active Discord server concerning Dutch NLP: https://discord.gg/jn94Ux5j
</div>

Revision as of 14:18, 29 May 2024

Other languages:

Mediawiki:Mainpage

Welcome to K-Dutch, the place for anyone who wants to know anything about the Dutch language: linguistic properties, language advice, available tools and resources, etymology, dialects...

K-Dutch is a CLARIN Knowledge Centre. It is hosted by the Instituut voor de Nederlandse Taal (Dutch Language Institute) , which is also a CLARIN-B centre and host of many resources for Dutch, which are, in general, freely available for research purposes. K-Dutch is an initiative of CLARIN-ERIC and CLARIN-BE.

The status of Dutch with respect to language technologies is described in

Tools for Dutch

Normalisation

Language Learning

Automatic linguistic annotation

Speech processing

Text simplification

Publicly available websites:

Publicly available apps to download:

Not publicly available:

Machine translation

Translation Engines

Publicly available machine translation engines from or to Dutch:

Terminology extraction

  • Termtreffer. Ask for login at terminologie@ivdnt.org.
  • D-Terminer demo. Terminology extraction for Dutch, English, French and German. (Rigouts Terryn, A. (2021). D-TERMINE: Data-driven Term Extraction Methodologies Investigated [Doctoral thesis]. Ghent University.)

Terminology management

  • IATE (Interactive Terminology for Europe) is the EU's terminology management system. It’s the shared terminology management system of the institutions of the European Union and it contains more than 7 million terms in 26 languages covering more than 100 domains of the EU legislation.

Other

  • Previously unmentioned CLARIN projects at INT
  • Language and Speech Tools at Radboud Nijmegen. e.g. T-scan, an analysis tool for dutch texts to assess the complexity of the text.
  • OpeNER is a language analysis toolchain helping (academic) researchers and companies make sense out of natural language analysis”. It consist of easy to install, improve and configure components to e.g. detect the language of a text, determine polarisation of texts (sentiment analysis), detect what topics are included in the text,... The supported language set currently consists of: English, Spanish, Italian, German and Dutch.
  • GATE (General Architecture for Text Engineering) is a Java suite of tools originally developed at the University of Sheffield and it is used for many natural language processing tasks, including information extraction. (Dutch services in GATE Cloud).
  • Speech Repository is an online e-learning tool. It contains video recordings of real-life speeches and tailor-made pedagogical material speeches which give the interpreter and interpreting students an opportunity to practise and improve their interpretation skills.
  • Subtitle Workshop is a free application for creating, editing, and converting text-based subtitle files.
  • YouDescribe is a free, web-based platform for adding audio description to YouTube content.
  • Audacity is an audio recording and editing software application that is open source.

Helpdesk

For information about Dutch: If you cannot find the answers to your questions on this wiki, you can send your question to servicedesk@ivdnt.org . Your questions will be forwarded as soon as possible to the appropriate experts and you should receive an answer within two working days.

You can also ask us for information and assistance with the use of data and tools.

Questions and Answers

On the Questions and Answers page we keep track of all questions we receive concerning Dutch. This will grow into a repository of K-Dutch answers to your questions.

Note that there is also a very active Discord server concerning Dutch NLP: https://discord.gg/jn94Ux5j