Compound splitting

From Clarin K-Centre
Revision as of 15:29, 6 November 2024 by Vincent (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search
Other languages:

Compound splitter demo

A compound splitter splits compounds into their component parts, e.g. liefde+s+drank or [post+zegel]+verzamelaar. This demo allows Dutch input up to 500 characters. You can either input running text or single words (one word per line). If you are interested in using the compound splitter for other purposes contact Lieve.Macken@UGent.be.

CharSplit - An ngram-based compound splitter

Python module that splits a compound into its body and head. So far German and Dutch are supported.

Wordbuilder