Conceptual resources: Difference between revisions

From Clarin K-Centre
Jump to navigation Jump to search
No edit summary
(Marked this version for translation)
 
(31 intermediate revisions by 4 users not shown)
Line 1: Line 1:
<languages/>
<translate>
<!--T:1-->
== Open Dutch Wordnet ==
Open Dutch WordNet is a Dutch lexical semantic database. It was created by removing the proprietary content from Cornetto, and by using open source resources to replace this proprietary content.
*[http://wordpress.let.vupr.nl/odwn/ Project website]
<!--T:2-->
==  Cornetto-LMF (Lexicon Markup Framework) ==
==  Cornetto-LMF (Lexicon Markup Framework) ==
Cornetto is a lexical resource for the Dutch language which combines two resources with different semantic structures.
Cornetto is a lexical resource for the Dutch language which combines two resources with different semantic structures.
It includes the Dutch Wordnet which organizes words in sets of synonyms (synsets) and records semantic relations between them. It also includes the Dutch Reference Lexicon which organizes words in form-meaning units (lexical entries) and describes them with short definitions, usage constraints, selection restrictions, syntactic behaviours, combinatorial information and illustrative contexts. Cornetto can be considered as the combination of a thesaurus and a dictionary. It is accessible for human use via a web browser and it is also available in XML for computational use (opensourcewordnet). Cornetto has circa 177,000 lexical entries and 70,000 synsets.
It includes the Dutch Wordnet which organizes words in sets of synonyms (synsets) and records semantic relations between them. It also includes the Dutch Reference Lexicon which organizes words in form-meaning units (lexical entries) and describes them with short definitions, usage constraints, selection restrictions, syntactic behaviours, combinatorial information and illustrative contexts. Cornetto can be considered as the combination of a thesaurus and a dictionary. It is accessible for human use via a web browser and it is also available in XML for computational use (opensourcewordnet). Cornetto has circa 177,000 lexical entries and 70,000 synsets.


<!--T:3-->
*[http://cornetto.clarin.inl.nl/index.html Online demo]
*[http://wordpress.let.vupr.nl/cornetto/ Project website]
<!--T:4-->
== Dutch SemCor ==
Dutch SemCor provides a corpus, fully sense-tagged with senses and domain tags from the Cornetto database
*[http://wordpress.let.vupr.nl/dutchsemcor/ Project website]
*[https://ssh.datastations.nl/dataset.xhtml?persistentId=doi:10.17026/dans-zcg-4ndy Download page]
<!--T:5-->
== EuroWordNet Dutch ==
EuroWordNet is a multilingual database with wordnets for several European languages (Dutch, Italian, Spanish, German, French, Czech and Estonian). The wordnets are structured in the same way as the American wordnet for English ( Princeton WordNet, Miller et al 1990) in terms of synsets (sets of synonymous words) with basic semantic relations between them. Each wordnet represents a unique language-internal system of lexicalizations.
<!--T:6-->
*[http://catalog.elra.info/en-us/repository/browse/ELRA-M0016/ Information]
*[http://catalog.elra.info/en-us/purchase-procedure-info/ Purchase procedure]
<!--T:7-->
== Open Multilingual WordNet ==
The goal of this project is to make it easy to use wordnets in multiple languages.
There are currently two versions of the OMW.
<!--T:8-->
* [https://omwn.org/omw1.html OMW Version 1] links hand created wordnets and automatically created wordnets for over a 150 languages through the Princeton Wordnet of English.
* [https://omwn.org/omw2.html OMW Version 2], the current (experimental) version, uses the Collaborative Interlingual Index to link the wordnets together
<!--T:9-->
== Word Association Data ==
Word association and participant data for 100 primary, secondary and tertiary responses to 12,571 cues as reported in De Deyne, Navarro and Storms (2013).
<!--T:10-->
*[https://smallworldofwords.org/en/project/research#download Data download]
*[https://smallworldofwords.org/articles/2012.DeDeyneNavarroStorms.Wordassociations.BRM.pdf De Deyne, S., Navarro, D., Storms, G. (2013). Better explanations of lexical and semantic cognition using networks derived from continued rather than single word associations. Behavior Research Methods, 45 (2), 480-498.]
<!--T:11-->
== Age of Acquisition Data ==
Following similar studies in English, ratings of age of acquisition for 30,000 Dutch words are presented and validated. These include nearly all lemmas language researchers are likely to be interested in.
<!--T:12-->
*[https://www.ugent.be/pp/experimentele-psychologie/en/research/documents/aoa/appnet.htm Data download]
*[https://www.ugent.be/pp/experimentele-psychologie/en/research/documents/aoa Website] 
*[https://www.researchgate.net/publication/262385555_Norms_of_age_of_acquisition_and_concreteness_for_30000_Dutch_words Brysbaert, M., Stevens, M., De Deyne, S., Voorspoels, W., & Storms, G. (2014). Norms of age of acquisition and concreteness for 30,000 Dutch words. Acta Psychologica, 150, 80-84]
== Concreteness Data == <!--T:13-->


===[http://cornetto.clarin.inl.nl/index.html Online demo]===
<!--T:17-->
Van der Goten, K., De Vooght, G., & Kemps, E. (1999). Concreteness and emotional valence ratings of 399 Dutch nouns. Psychologica Belgica, 39, 49-70.  


== DuELME-LMF (Lexicon Markup Framework) ==
<!--T:15-->
* [https://www.ugent.be/pp/experimentele-psychologie/en/research/documents/nouns/1.xls 1-syllable Norms]
* [https://www.ugent.be/pp/experimentele-psychologie/en/research/documents/nouns/3.xls 3-syllable Norms]


DuELME is a lexicon of more than 5,000 Dutch multiple-word expressions.
<!--T:14-->
Brysbaert, M., Stevens, M., De Deyne, S., Voorspoels, W., & Storms, G. (2014). Norms of age of acquisition and concreteness for 30,000 Dutch words. Acta Psychologica, 150, 80-84]


Expressions with the same syntactic pattern are divided into so-called Equivalence Classes, which makes it possible to integrate the lexicon with minimal manual effort into an NLP system. The lexicon has been developed within the framework of the IRME project.
<!--T:18-->
* no data found


===[http://duelme.clarin.inl.nl/ Duelme website]===
<!--T:19-->
== Datasets for Psychological Research ==
https://www.ugent.be/pp/experimentele-psychologie/en/research/documents
</translate>

Latest revision as of 07:42, 11 October 2024

Other languages:

Open Dutch Wordnet

Open Dutch WordNet is a Dutch lexical semantic database. It was created by removing the proprietary content from Cornetto, and by using open source resources to replace this proprietary content.

Cornetto-LMF (Lexicon Markup Framework)

Cornetto is a lexical resource for the Dutch language which combines two resources with different semantic structures. It includes the Dutch Wordnet which organizes words in sets of synonyms (synsets) and records semantic relations between them. It also includes the Dutch Reference Lexicon which organizes words in form-meaning units (lexical entries) and describes them with short definitions, usage constraints, selection restrictions, syntactic behaviours, combinatorial information and illustrative contexts. Cornetto can be considered as the combination of a thesaurus and a dictionary. It is accessible for human use via a web browser and it is also available in XML for computational use (opensourcewordnet). Cornetto has circa 177,000 lexical entries and 70,000 synsets.

Dutch SemCor

Dutch SemCor provides a corpus, fully sense-tagged with senses and domain tags from the Cornetto database

EuroWordNet Dutch

EuroWordNet is a multilingual database with wordnets for several European languages (Dutch, Italian, Spanish, German, French, Czech and Estonian). The wordnets are structured in the same way as the American wordnet for English ( Princeton WordNet, Miller et al 1990) in terms of synsets (sets of synonymous words) with basic semantic relations between them. Each wordnet represents a unique language-internal system of lexicalizations.

Open Multilingual WordNet

The goal of this project is to make it easy to use wordnets in multiple languages. There are currently two versions of the OMW.

  • OMW Version 1 links hand created wordnets and automatically created wordnets for over a 150 languages through the Princeton Wordnet of English.
  • OMW Version 2, the current (experimental) version, uses the Collaborative Interlingual Index to link the wordnets together

Word Association Data

Word association and participant data for 100 primary, secondary and tertiary responses to 12,571 cues as reported in De Deyne, Navarro and Storms (2013).

Age of Acquisition Data

Following similar studies in English, ratings of age of acquisition for 30,000 Dutch words are presented and validated. These include nearly all lemmas language researchers are likely to be interested in.

Concreteness Data

Van der Goten, K., De Vooght, G., & Kemps, E. (1999). Concreteness and emotional valence ratings of 399 Dutch nouns. Psychologica Belgica, 39, 49-70.

Brysbaert, M., Stevens, M., De Deyne, S., Voorspoels, W., & Storms, G. (2014). Norms of age of acquisition and concreteness for 30,000 Dutch words. Acta Psychologica, 150, 80-84]

  • no data found

Datasets for Psychological Research

https://www.ugent.be/pp/experimentele-psychologie/en/research/documents