K-Dutch: Difference between revisions

From Clarin K-Centre
Jump to navigation Jump to search
(Marked this version for translation)
No edit summary
Line 93: Line 93:
<!--T:44-->
<!--T:44-->
==Tools for Dutch==
==Tools for Dutch==
===Normalisation===
===Normalisation===
* [[Format conversion]]
* [[Format conversion]]
* [[Spell checking]]
* [[Spell checking]]
Line 101: Line 103:
<!--T:45-->
<!--T:45-->
===Language Learning===
===Language Learning===
*[https://schrijfassistent.be Schrijfassistent]
*[https://schrijfassistent.be Schrijfassistent]
*[http://schrijfassistent.standaard.be/ Schrijfassistent] at De Standaard
*[http://schrijfassistent.standaard.be/ Schrijfassistent] at De Standaard
Line 111: Line 114:
<!--T:46-->
<!--T:46-->
===Automatic linguistic annotation===
===Automatic linguistic annotation===
* [[Basic language processing]]
* [[Basic language processing]]
* [[Deep parsing]]
* [[Deep parsing]]
Line 119: Line 123:
<!--T:47-->
<!--T:47-->
===Speech processing===
===Speech processing===
* [[Spoken Language Recognition]]
* [[Spoken Language Recognition]]
* [[Speech recognition]]
* [[Speech recognition]]
Line 125: Line 130:
<!--T:48-->
<!--T:48-->
===Natural Language Processing===
===Natural Language Processing===
* [[Language Modeling]]
* [[Language Modeling]]
* [[Machine translation]]
* [[Machine translation]]
Line 137: Line 143:
<!--T:49-->
<!--T:49-->
===Resource querying===
===Resource querying===
* [[Corpus querying]]
* [[Corpus querying]]
* [[Treebank querying]]
* [[Treebank querying]]
Line 142: Line 149:
<!--T:50-->
<!--T:50-->
===Text simplification ===
===Text simplification ===
Publicly available websites:
Publicly available websites:


Line 166: Line 174:
<!--T:57-->
<!--T:57-->
===Machine translation===
===Machine translation===
====Translation Engines====
====Translation Engines====
Publicly available machine translation engines from or to Dutch:
Publicly available machine translation engines from or to Dutch:
*[https://www.deepl.com/translator DeepL]
*[https://www.deepl.com/translator DeepL]
Line 177: Line 187:
<!--T:58-->
<!--T:58-->
====MT Evaluation====
====MT Evaluation====
*[https://mateo.ivdnt.org/Evaluate MATEO Machine Translation Evaluation Online]
*[https://mateo.ivdnt.org/Evaluate MATEO Machine Translation Evaluation Online]


<!--T:59-->
<!--T:59-->
===Terminology extraction===
===Terminology extraction===
* [https://termtreffer.org/ Termtreffer]. Ask for login at [mailto:terminologie@ivdnt.org terminologie@ivdnt.org].
* [https://termtreffer.org/ Termtreffer]. Ask for login at [mailto:terminologie@ivdnt.org terminologie@ivdnt.org].
* [https://lt3.ugent.be/dterminer D-Terminer demo]. Terminology extraction for Dutch, English, French and German. (Rigouts Terryn, A. (2021). D-TERMINE: Data-driven Term Extraction Methodologies Investigated [Doctoral thesis]. Ghent University.)
* [https://lt3.ugent.be/dterminer D-Terminer demo]. Terminology extraction for Dutch, English, French and German. (Rigouts Terryn, A. (2021). D-TERMINE: Data-driven Term Extraction Methodologies Investigated [Doctoral thesis]. Ghent University.)
Line 186: Line 198:
<!--T:60-->
<!--T:60-->
===Terminology management===
===Terminology management===
* [https://iate.europa.eu/home IATE] (Interactive Terminology for Europe) is the EU's terminology management system. It’s the shared terminology management system of the institutions of the European Union and it contains more than 7 million terms in 26 languages covering more than 100 domains of the EU legislation.
* [https://iate.europa.eu/home IATE] (Interactive Terminology for Europe) is the EU's terminology management system. It’s the shared terminology management system of the institutions of the European Union and it contains more than 7 million terms in 26 languages covering more than 100 domains of the EU legislation.


<!--T:61-->
<!--T:61-->
===Other===
===Other===
* Previously unmentioned [[CLARIN projects]] at INT
* Previously unmentioned [[CLARIN projects]] at INT
* [https://webservices.cls.ru.nl/ Language and Speech Tools] at Radboud Nijmegen. e.g. [https://webservices.cls.ru.nl/tscan T-scan], an analysis tool for dutch texts to assess the complexity of the text.
* [https://webservices.cls.ru.nl/ Language and Speech Tools] at Radboud Nijmegen. e.g. [https://webservices.cls.ru.nl/tscan T-scan], an analysis tool for dutch texts to assess the complexity of the text.
Line 201: Line 215:
<!--T:62-->
<!--T:62-->
==Helpdesk==
==Helpdesk==
For information about Dutch: If you cannot find the answers to your questions on this wiki, you can send your question to [mailto://servicedesk@ivdnt.org servicedesk@ivdnt.org ]. Your questions will be forwarded as soon as possible to the appropriate experts and you should receive an answer within two working days.
For information about Dutch: If you cannot find the answers to your questions on this wiki, you can send your question to [mailto://servicedesk@ivdnt.org servicedesk@ivdnt.org ]. Your questions will be forwarded as soon as possible to the appropriate experts and you should receive an answer within two working days.


Line 208: Line 223:
<!--T:64-->
<!--T:64-->
==Other Services==
==Other Services==
* [[Best practice documents and guidelines]]
* [[Best practice documents and guidelines]]
* [[Internships]]
* [[Internships]]
Line 215: Line 231:
<!--T:65-->
<!--T:65-->
==Questions and Answers==
==Questions and Answers==
On the [[Q&A|Questions and Answers page]] we keep track of all questions we receive concerning Dutch. This will grow into a repository of K-Dutch answers to your questions.
On the [[Q&A|Questions and Answers page]] we keep track of all questions we receive concerning Dutch. This will grow into a repository of K-Dutch answers to your questions.



Revision as of 08:32, 30 May 2024

Other languages:

Mediawiki:Mainpage

Welcome to K-Dutch, the place for anyone who wants to know anything about the Dutch language: linguistic properties, language advice, available tools and resources, etymology, dialects...

K-Dutch is a CLARIN Knowledge Centre. It is hosted by the Instituut voor de Nederlandse Taal (Dutch Language Institute) , which is also a CLARIN-B centre and host of many resources for Dutch, which are, in general, freely available for research purposes. K-Dutch is an initiative of CLARIN-ERIC and CLARIN-BE.

The status of Dutch with respect to language technologies is described in

You are most welcome to contribute to these pages, please contact servicedesk@ivdnt.org with as subject K-Dutch, and we will be in touch.

Linguisitic topics

Grammar

Lexicography

Terminology

Spelling

Linguistic resources: datasets

Corpora

Lexical Resources

N-grams

Tools for Dutch

Normalisation

Language Learning

Automatic linguistic annotation

Speech processing

Natural Language Processing

Resource querying

Text simplification

Publicly available websites:

Publicly available apps to download:

Not publicly available:

Machine translation

Translation Engines

Publicly available machine translation engines from or to Dutch:

MT Evaluation

Terminology extraction

  • Termtreffer. Ask for login at terminologie@ivdnt.org.
  • D-Terminer demo. Terminology extraction for Dutch, English, French and German. (Rigouts Terryn, A. (2021). D-TERMINE: Data-driven Term Extraction Methodologies Investigated [Doctoral thesis]. Ghent University.)

Terminology management

  • IATE (Interactive Terminology for Europe) is the EU's terminology management system. It’s the shared terminology management system of the institutions of the European Union and it contains more than 7 million terms in 26 languages covering more than 100 domains of the EU legislation.

Other

  • Previously unmentioned CLARIN projects at INT
  • Language and Speech Tools at Radboud Nijmegen. e.g. T-scan, an analysis tool for dutch texts to assess the complexity of the text.
  • OpeNER is a language analysis toolchain helping (academic) researchers and companies make sense out of natural language analysis”. It consist of easy to install, improve and configure components to e.g. detect the language of a text, determine polarisation of texts (sentiment analysis), detect what topics are included in the text,... The supported language set currently consists of: English, Spanish, Italian, German and Dutch.
  • GATE (General Architecture for Text Engineering) is a Java suite of tools originally developed at the University of Sheffield and it is used for many natural language processing tasks, including information extraction. (Dutch services in GATE Cloud).
  • Speech Repository is an online e-learning tool. It contains video recordings of real-life speeches and tailor-made pedagogical material speeches which give the interpreter and interpreting students an opportunity to practise and improve their interpretation skills.
  • Subtitle Workshop is a free application for creating, editing, and converting text-based subtitle files.
  • YouDescribe is a free, web-based platform for adding audio description to YouTube content.
  • Audacity is an audio recording and editing software application that is open source.

Helpdesk

For information about Dutch: If you cannot find the answers to your questions on this wiki, you can send your question to servicedesk@ivdnt.org . Your questions will be forwarded as soon as possible to the appropriate experts and you should receive an answer within two working days.

You can also ask us for information and assistance with the use of data and tools.

Other Services

Questions and Answers

On the Questions and Answers page we keep track of all questions we receive concerning Dutch. This will grow into a repository of K-Dutch answers to your questions.

Note that there is also a very active Discord server concerning Dutch NLP: https://discord.gg/jn94Ux5j