Translations:Other corpora/14/en

From Clarin K-Centre
Jump to navigation Jump to search

DBRD

The DBRD (pronounced dee-bird) dataset contains over 110k book reviews of which 22k have associated binary sentiment polarity labels. It is intended as a benchmark for sentiment classification in Dutch. The dataset can be used to train a model for sequence modeling, more specifically language modeling and it can be used to train a model for text classification, more specifically sentiment classification, using the provided positive/negative sentiment polarity labels.