Language modeling

From Clarin K-Centre
Revision as of 15:26, 23 March 2021 by Vincent (talk | contribs)
Jump to navigation Jump to search

n-gram modeling

Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool colibri-patternmodeller which allows you to build, view, manipulate and query pattern models.

BERT-like models

  • RobBERT: A Dutch RoBERTa-based Language Model

SpaCy