Download area
Trial and test data
- Note: The dtd in the trial data was enriched to accommodate sentence boundaries and, in the case of Dutch, to properly encode component lemmas of compounds. You can retrieve the newer version here or with the test data.
Background text
Wordnets
Wordnets are included in LMF format.
- Dutch WordNet: The Dutch Wordnet is now part of the Cornetto database which is distributed free for research by the Nederlandse Taalunie (Dutch-Flemish Language Union). To obtain a license for the Dutch wordnet in WN-LMF you need to obtain a license for the Cornetto database first. The Taalunie will notify Piek Vossen at the Faculty of Arts of the VU University about the license and Piek Vossen will then generate the WN-LMF file for the Dutch wordnet and provide it to you. Information on the Cornetto database can be found at http://www2.let.vu.nl/oz/cltl/cornetto/index.html. The Dutch TST centrale is distributing Dutch language resources for the Taalunie. The Cornetto database is not yet listed as an available as a product but you can email the service desk for the license: servicedesk@inl.nl. Information on the license can be found a http://www2.let.vu.nl/oz/cltl/cornetto/license.html. The license document can be found at: http://www2.let.vu.nl/oz/cltl/cornetto/docs/lic-cornetto-tussenresultaten-eng.doc. You can print and sign the license file and send it to the service desk of the TST centrale. For questions about the licensing you can email to: Piek Vossen (p.vossen[at]let.vu.nl)
- English WordNet: the English WordNet can be obtained from http://wordnet.princeton.edu/wordnet/download. A LMF version has been included in the trial data.
- Italian WordNet: participants can obtain "ItalWordNet for SEMEVAL" through ELDA (Evaluations and Language resources Distribution Agency) by contacting Ms Valerie Mapelli at mapelli[at]elda.org, who will inform abount licensing and delivery procedure. Please mention explicitly that the purpose of ItalWordNet is to participate in the Semeval task.
- Chinese WordNet: The Chinese WordNet is now partially open to the public through Institute of Linguistics, Academia Sinica in Taiwan. The LMF version for the Chinese WordNet includes lemma, part of speech, gloss, and synset (examples are not included). To obtain the LMF file for the Chinese WordNet, please contact Ms. Jessie Lo at jessielo[at]gate.sinica.edu.tw.