Compound splitting: Difference between revisions

From Clarin K-Centre
Jump to navigation Jump to search
Line 3: Line 3:
This demo allows Dutch input up to 500 characters. You can either input running text or single words (one word per line). If you are interested in using the compound splitter for other purposes contact Lieve.Macken@UGent.be.
This demo allows Dutch input up to 500 characters. You can either input running text or single words (one word per line). If you are interested in using the compound splitter for other purposes contact Lieve.Macken@UGent.be.


*[https://biblio.ugent.be/publication/7126122 Paper]
*[https://biblio.ugent.be/publication/7126122 Lieve Macken and Arda Tezcan. 2018. “Dutch Compound Splitting for Bilingual Terminology Extraction.” In Multiword Units in Machine Translation and Translation Technology, ed. Ruslan Mitkov, Johanna Monti, Gloria Corpas Pastor, and Violeta Seretan. Vol. 341. John Benjamins, pp. 148–162.]
*[https://lt3.ugent.be/compound-splitter-demo/ Demo]
*[https://lt3.ugent.be/compound-splitter-demo/ Demo]

Revision as of 16:24, 8 February 2022

Compound splitter demo

A compound splitter splits compounds into their component parts, e.g. liefde+s+drank or [post+zegel]+verzamelaar. This demo allows Dutch input up to 500 characters. You can either input running text or single words (one word per line). If you are interested in using the compound splitter for other purposes contact Lieve.Macken@UGent.be.