Language modeling
n-gram modeling
Colibri Core is an NLP tool and a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (patterns with one or more gaps, of either fixed or dynamic size) quickly and memory-efficiently. Its central tool, colibri-patternmodeller, lets you build, view, manipulate, and query pattern models.
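To make the terms concrete, here is a minimal pure-Python sketch of what n-grams and fixed-size skipgrams look like. It does not use Colibri Core or its API; the function names `ngrams` and `skipgrams` and the gap marker `{*}` are assumptions chosen for illustration, and dynamic-size gaps are not covered.

```python
from collections import Counter
from itertools import combinations

GAP = "{*}"  # hypothetical gap marker, chosen for this sketch only


def ngrams(tokens, n):
    """Return every contiguous window of n tokens as a tuple."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def skipgrams(tokens, n):
    """Return fixed-size skipgrams derived from the n-grams of `tokens`.

    For each n-gram, every non-empty subset of the inner positions is
    replaced by GAP; the first and last tokens are always kept.
    """
    results = []
    inner = range(1, n - 1)  # positions that are allowed to become gaps
    for gram in ngrams(tokens, n):
        for k in range(1, len(inner) + 1):
            for positions in combinations(inner, k):
                results.append(
                    tuple(GAP if i in positions else tok
                          for i, tok in enumerate(gram))
                )
    return results


tokens = "to be or not to be".split()

# Count bigrams: a tiny in-memory stand-in for a pattern model.
bigram_counts = Counter(ngrams(tokens, 2))
print(bigram_counts.most_common(1))

# Skipgrams of length 3 have exactly one possible gap (the middle token).
for pattern in skipgrams(tokens, 3):
    print(" ".join(pattern))
```

A dedicated tool becomes worthwhile once the corpus is large: storing every pattern as a tuple of strings, as this sketch does, quickly becomes expensive in memory.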
BERT-like models
- Hugging Face Dutch Models
- RobBERT: A Dutch RoBERTa-based Language Model
- Multilingual Language Models including Dutch
spaCy
spaCy is a free open-source library for Natural Language Processing in Python.