Language Modeling

From Clarin K-Centre

Revision as of 12:09, 27 June 2023

n-gram modeling

Colibri Core is an NLP tool, as well as a C++ and Python library, for working with basic linguistic constructions such as n-grams and skipgrams (i.e. patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At its core is the tool colibri-patternmodeller, which allows you to build, view, manipulate and query pattern models.
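
To make the terms concrete, the following is a minimal plain-Python sketch of what n-grams and fixed-size skipgrams are. It does not use Colibri Core or its API; the function names `ngrams` and `skipgrams` and the gap marker `GAP` are assumptions made for this illustration only, and the sketch makes no attempt at the memory efficiency Colibri Core is designed for.

```python
from collections import Counter
from itertools import combinations

# Marker for a one-token gap; the notation is an assumption of this sketch.
GAP = "{*}"


def ngrams(tokens, n):
    """Return all contiguous n-token windows as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def skipgrams(tokens, n):
    """Return fixed-size skipgrams derived from the n-grams of `tokens`.

    Each skipgram is an n-gram in which one or more inner tokens are
    replaced by GAP. The first and last token are always kept, so a
    pattern never starts or ends with a gap.
    """
    patterns = []
    inner = range(1, n - 1)  # positions that may become gaps
    for gram in ngrams(tokens, n):
        for k in range(1, n - 1):  # how many positions to gap
            for gapped in combinations(inner, k):
                patterns.append(
                    tuple(GAP if i in gapped else tok
                          for i, tok in enumerate(gram))
                )
    return patterns


tokens = "to be or not to be".split()

# Count how often each bigram and each 3-token skipgram occurs.
bigram_counts = Counter(ngrams(tokens, 2))
skipgram_counts = Counter(skipgrams(tokens, 3))

for pattern, count in skipgram_counts.items():
    print(" ".join(pattern), count)
```

Counting patterns in a `Counter` roughly mirrors the idea of a pattern model at a very small scale: a mapping from each pattern to its frequency in the corpus.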

BERT-like models

Multilingual Language Models including Dutch

spaCy

spaCy is a free, open-source library for Natural Language Processing in Python.
* [https://spacy.io/models/nl Dutch models]