Conceptual Resources: Difference between revisions

From Clarin K-Centre
Jump to navigation Jump to search
No edit summary
 
(6 intermediate revisions by 2 users not shown)
Line 1: Line 1:
<languages/>
<translate>
<!--T:1-->
== Open Dutch Wordnet ==
== Open Dutch Wordnet ==
Open Dutch WordNet is a Dutch lexical semantic database. It was created by removing the proprietary content from Cornetto, and by using open source resources to replace this proprietary content.
Open Dutch WordNet is a Dutch lexical semantic database. It was created by removing the proprietary content from Cornetto, and by using open source resources to replace this proprietary content.
*[http://wordpress.let.vupr.nl/odwn/ Project website]
*[http://wordpress.let.vupr.nl/odwn/ Project website]


<!--T:2-->
==  Cornetto-LMF (Lexicon Markup Framework) ==
==  Cornetto-LMF (Lexicon Markup Framework) ==
Cornetto is a lexical resource for the Dutch language which combines two resources with different semantic structures.
Cornetto is a lexical resource for the Dutch language which combines two resources with different semantic structures.
It includes the Dutch Wordnet which organizes words in sets of synonyms (synsets) and records semantic relations between them. It also includes the Dutch Reference Lexicon which organizes words in form-meaning units (lexical entries) and describes them with short definitions, usage constraints, selection restrictions, syntactic behaviours, combinatorial information and illustrative contexts. Cornetto can be considered as the combination of a thesaurus and a dictionary. It is accessible for human use via a web browser and it is also available in XML for computational use (opensourcewordnet). Cornetto has circa 177,000 lexical entries and 70,000 synsets.
It includes the Dutch Wordnet which organizes words in sets of synonyms (synsets) and records semantic relations between them. It also includes the Dutch Reference Lexicon which organizes words in form-meaning units (lexical entries) and describes them with short definitions, usage constraints, selection restrictions, syntactic behaviours, combinatorial information and illustrative contexts. Cornetto can be considered as the combination of a thesaurus and a dictionary. It is accessible for human use via a web browser and it is also available in XML for computational use (opensourcewordnet). Cornetto has circa 177,000 lexical entries and 70,000 synsets.


<!--T:3-->
*[http://cornetto.clarin.inl.nl/index.html Online demo]
*[http://cornetto.clarin.inl.nl/index.html Online demo]
*[http://wordpress.let.vupr.nl/cornetto/ Project website]
*[http://wordpress.let.vupr.nl/cornetto/ Project website]


<!--T:4-->
== Dutch SemCor ==
== Dutch SemCor ==
Dutch SemCor provides a corpus, fully sense-tagged with senses and domain tags from the Cornetto database
Dutch SemCor provides a corpus, fully sense-tagged with senses and domain tags from the Cornetto database
Line 16: Line 23:
*[http://wordpress.let.vupr.nl/dutchsemcor/results/annotated-data/machine_sonar/ Download machine annotations]
*[http://wordpress.let.vupr.nl/dutchsemcor/results/annotated-data/machine_sonar/ Download machine annotations]


<!--T:5-->
== EuroWordNet Dutch ==
== EuroWordNet Dutch ==
EuroWordNet is a multilingual database with wordnets for several European languages (Dutch, Italian, Spanish, German, French, Czech and Estonian). The wordnets are structured in the same way as the American wordnet for English ( Princeton WordNet, Miller et al 1990) in terms of synsets (sets of synonymous words) with basic semantic relations between them. Each wordnet represents a unique language-internal system of lexicalizations.
EuroWordNet is a multilingual database with wordnets for several European languages (Dutch, Italian, Spanish, German, French, Czech and Estonian). The wordnets are structured in the same way as the American wordnet for English ( Princeton WordNet, Miller et al 1990) in terms of synsets (sets of synonymous words) with basic semantic relations between them. Each wordnet represents a unique language-internal system of lexicalizations.


<!--T:6-->
*[http://catalog.elra.info/en-us/repository/browse/ELRA-M0016/ Information]
*[http://catalog.elra.info/en-us/repository/browse/ELRA-M0016/ Information]
*[http://catalog.elra.info/en-us/purchase-procedure-info/ Purchase procedure]
*[http://catalog.elra.info/en-us/purchase-procedure-info/ Purchase procedure]


<!--T:7-->
== Open Multilingual WordNet ==
The goal of this project is to make it easy to use wordnets in multiple languages.
There are currently two versions of the OMW.
<!--T:8-->
* [https://omwn.org/omw1.html OMW Version 1] links hand created wordnets and automatically created wordnets for over a 150 languages through the Princeton Wordnet of English.
* [https://omwn.org/omw2.html OMW Version 2], the current (experimental) version, uses the Collaborative Interlingual Index to link the wordnets together
<!--T:9-->
== Word Association Data ==
== Word Association Data ==
Word association and participant data for 100 primary, secondary and tertiary responses to 12,571 cues as reported in De Deyne, Navarro and Storms (2013).
Word association and participant data for 100 primary, secondary and tertiary responses to 12,571 cues as reported in De Deyne, Navarro and Storms (2013).


<!--T:10-->
*[https://smallworldofwords.org/en/project/research#download Data download]
*[https://smallworldofwords.org/en/project/research#download Data download]
*[https://smallworldofwords.org/articles/2012.DeDeyneNavarroStorms.Wordassociations.BRM.pdf De Deyne, S., Navarro, D., Storms, G. (2013). Better explanations of lexical and semantic cognition using networks derived from continued rather than single word associations. Behavior Research Methods, 45 (2), 480-498.]
*[https://smallworldofwords.org/articles/2012.DeDeyneNavarroStorms.Wordassociations.BRM.pdf De Deyne, S., Navarro, D., Storms, G. (2013). Better explanations of lexical and semantic cognition using networks derived from continued rather than single word associations. Behavior Research Methods, 45 (2), 480-498.]


<!--T:11-->
== Age of Acquisition Data ==
== Age of Acquisition Data ==
Following similar studies in English, ratings of age of acquisition for 30,000 Dutch words are presented and validated. These include nearly all lemmas language researchers are likely to be interested in.
Following similar studies in English, ratings of age of acquisition for 30,000 Dutch words are presented and validated. These include nearly all lemmas language researchers are likely to be interested in.


*[http://crr.ugent.be/papers/Overzicht%20AoA%20ratings%20alle%20studies.xlsx Data download]
<!--T:12-->
*[http://crr.ugent.be/archives/1602 Website]
*[https://www.ugent.be/pp/experimentele-psychologie/en/research/documents/aoa/appnet.htm Data download]
*[http://crr.ugent.be/papers/Norms%20of%20AoA%20and%20Concreteness%20for%2030000%20Dutch%20words%20final.pdf Brysbaert, M., Stevens, M., De Deyne, S., Voorspoels, W., & Storms, G. (2014). Norms of age of acquisition and concreteness for 30,000 Dutch words. Acta Psychologica, 150, 80-84]
*[https://www.ugent.be/pp/experimentele-psychologie/en/research/documents/aoa Website]
*[https://www.researchgate.net/publication/262385555_Norms_of_age_of_acquisition_and_concreteness_for_30000_Dutch_words Brysbaert, M., Stevens, M., De Deyne, S., Voorspoels, W., & Storms, G. (2014). Norms of age of acquisition and concreteness for 30,000 Dutch words. Acta Psychologica, 150, 80-84]


<!--T:13-->
== Concreteness Data ==
== Concreteness Data ==
Following similar studies in English, ratings of concreteness for 30,000 Dutch words are presented and validated. These include nearly all lemmas language researchers are likely to be interested in.
Following similar studies in English, ratings of concreteness for 30,000 Dutch words are presented and validated. These include nearly all lemmas language researchers are likely to be interested in.
*[http://crr.ugent.be/papers/Concreteness%20ratings%20Brysbaert%20et%20al.xlsx Concreteness norms]
*[http://crr.ugent.be/papers/Concreteness%20ratings%20Brysbaert%20et%20al.xlsx Concreteness norms] (links are dead)
*[http://crr.ugent.be/archives/1602 Website]
*[http://crr.ugent.be/archives/1602 Website]
*[http://crr.ugent.be/papers/Norms%20of%20AoA%20and%20Concreteness%20for%2030000%20Dutch%20words%20final.pdf Brysbaert, M., Stevens, M., De Deyne, S., Voorspoels, W., & Storms, G. (2014). Norms of age of acquisition and concreteness for 30,000 Dutch words. Acta Psychologica, 150, 80-84]
*[http://crr.ugent.be/papers/Norms%20of%20AoA%20and%20Concreteness%20for%2030000%20Dutch%20words%20final.pdf Brysbaert, M., Stevens, M., De Deyne, S., Voorspoels, W., & Storms, G. (2014). Norms of age of acquisition and concreteness for 30,000 Dutch words. Acta Psychologica, 150, 80-84]
<!--T:14-->
== Datasets for Psychological Research ==
https://www.ugent.be/pp/experimentele-psychologie/en/research/documents
</translate>

Latest revision as of 10:06, 29 March 2024

Other languages:

Open Dutch Wordnet

Open Dutch WordNet is a Dutch lexical semantic database. It was created by removing the proprietary content from Cornetto, and by using open source resources to replace this proprietary content.

Cornetto-LMF (Lexicon Markup Framework)

Cornetto is a lexical resource for the Dutch language which combines two resources with different semantic structures. It includes the Dutch Wordnet which organizes words in sets of synonyms (synsets) and records semantic relations between them. It also includes the Dutch Reference Lexicon which organizes words in form-meaning units (lexical entries) and describes them with short definitions, usage constraints, selection restrictions, syntactic behaviours, combinatorial information and illustrative contexts. Cornetto can be considered as the combination of a thesaurus and a dictionary. It is accessible for human use via a web browser and it is also available in XML for computational use (opensourcewordnet). Cornetto has circa 177,000 lexical entries and 70,000 synsets.

Dutch SemCor

Dutch SemCor provides a corpus, fully sense-tagged with senses and domain tags from the Cornetto database

EuroWordNet Dutch

EuroWordNet is a multilingual database with wordnets for several European languages (Dutch, Italian, Spanish, German, French, Czech and Estonian). The wordnets are structured in the same way as the American wordnet for English ( Princeton WordNet, Miller et al 1990) in terms of synsets (sets of synonymous words) with basic semantic relations between them. Each wordnet represents a unique language-internal system of lexicalizations.

Open Multilingual WordNet

The goal of this project is to make it easy to use wordnets in multiple languages. There are currently two versions of the OMW.

  • OMW Version 1 links hand created wordnets and automatically created wordnets for over a 150 languages through the Princeton Wordnet of English.
  • OMW Version 2, the current (experimental) version, uses the Collaborative Interlingual Index to link the wordnets together

Word Association Data

Word association and participant data for 100 primary, secondary and tertiary responses to 12,571 cues as reported in De Deyne, Navarro and Storms (2013).

Age of Acquisition Data

Following similar studies in English, ratings of age of acquisition for 30,000 Dutch words are presented and validated. These include nearly all lemmas language researchers are likely to be interested in.

Concreteness Data

Following similar studies in English, ratings of concreteness for 30,000 Dutch words are presented and validated. These include nearly all lemmas language researchers are likely to be interested in.

Datasets for Psychological Research

https://www.ugent.be/pp/experimentele-psychologie/en/research/documents