Sign Language corpora: Difference between revisions

From Clarin K-Centre
Jump to navigation Jump to search
No edit summary
Line 1: Line 1:
==Dutch Sign Language (NGT) ==
==Dutch Sign Language (NGT) ==
===Corpus Nederlandse Gebarentaal===
In the Netherlands, Nederlandse Gebarentaal (NGT) or Dutch Sign Language is officially recognized.
In the Netherlands, Nederlandse Gebarentaal (NGT) or Dutch Sign Language is officially recognized.
A corpus is available for research. For download, you can contact [mailto://servicedesk@ivdnt.org servicedesk@ivdnt.org]
A corpus is available for research. For download, you can contact [mailto://servicedesk@ivdnt.org servicedesk@ivdnt.org]


* [https://www.corpusngt.nl Corpus website (in Dutch)]
* [https://www.corpusngt.nl Corpus website (in Dutch)]
===NGT-HoReCo===
A multimodal parallel corpus of Dutch and Sign Language of the Netherlands (NGT). 283 hotel reviews in written Dutch were translated into NGT videos by 6 professional, deaf translators. Each review was translated by only one translator. The word length of the Dutch reviews varies from around 15 to 400 words; the NGT videos duration ranged from around 10 seconds to around 4 minutes. The total amount of words contained in the corpus is 21,825; the NGT translations consist of over 3.5 hours of videos.
*[http://hdl.handle.net/10032/tm-a2-w2 NGT-HoReCo Download page]


==Flemish Sign Language (VGT) ==
==Flemish Sign Language (VGT) ==
===Corpus Vlaamse Gebarentaal===
In Flanders, Vlaamse Gebarentaal (VGT) or Flemish Sign Language is officially recognized. A corpus is available for research.  
In Flanders, Vlaamse Gebarentaal (VGT) or Flemish Sign Language is officially recognized. A corpus is available for research.  


Line 11: Line 17:
* [http://hdl.handle.net/10032/tm-a2-v6 Download from INT]
* [http://hdl.handle.net/10032/tm-a2-v6 Download from INT]


==Belgian Covid Sign Language Corpus (BeCoS Corpus) ==
===Belgian Covid Sign Language Corpus (BeCoS Corpus) ===


An automatically transcribed and annotated parallel corpus with speech (Flemish, French, German) and sign language (VGT, LSFB) based on press conferences from the Belgian federal government concerning COVID-19.
An automatically transcribed and annotated parallel corpus with speech (Flemish, French, German) and sign language (VGT, LSFB) based on press conferences from the Belgian federal government concerning COVID-19.


*[http://hdl.handle.net/10032/tm-a2-v5 Download page]
*[http://hdl.handle.net/10032/tm-a2-v5 Download page]
===GoSt-ParC Sign===
A parallel corpus of authentic Flemish Sign Language (VGT) and written Dutch which can be employed as gold standard in automated sign language translation.
*[https://aclanthology.org/2023.eamt-1.54 GoST-ParC Sign short publication]

Revision as of 14:47, 4 October 2023

Dutch Sign Language (NGT)

Corpus Nederlandse Gebarentaal

In the Netherlands, Nederlandse Gebarentaal (NGT) or Dutch Sign Language is officially recognized. A corpus is available for research. For download, you can contact servicedesk@ivdnt.org

NGT-HoReCo

A multimodal parallel corpus of Dutch and Sign Language of the Netherlands (NGT). 283 hotel reviews in written Dutch were translated into NGT videos by 6 professional, deaf translators. Each review was translated by only one translator. The word length of the Dutch reviews varies from around 15 to 400 words; the NGT videos duration ranged from around 10 seconds to around 4 minutes. The total amount of words contained in the corpus is 21,825; the NGT translations consist of over 3.5 hours of videos.

Flemish Sign Language (VGT)

Corpus Vlaamse Gebarentaal

In Flanders, Vlaamse Gebarentaal (VGT) or Flemish Sign Language is officially recognized. A corpus is available for research.

Belgian Covid Sign Language Corpus (BeCoS Corpus)

An automatically transcribed and annotated parallel corpus with speech (Flemish, French, German) and sign language (VGT, LSFB) based on press conferences from the Belgian federal government concerning COVID-19.

GoSt-ParC Sign

A parallel corpus of authentic Flemish Sign Language (VGT) and written Dutch which can be employed as gold standard in automated sign language translation.