Collected Item: “A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment”
Врста публикације
Рад у зборнику
Верзија документа
објављена
Језик
енглески
Аутор/и (Милан Марковић, Никола Николић)
Ahmadi, Sina and McCrae, John P and Nimb, Sanni and Khan, Fahad and Monachini, Monica and Pedersen, Bolette S and Declerck, Thierry and Wissik, Tanja and Bellandi, Andrea and Pisani, Irene and ... Ranka Stanković and others
Наслов рада (Наслов - поднаслов)
A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment
Назив конференције (зборника), место и датум одржавања
Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), Marseille
Издавач (Београд : Просвета)
European Language Resources Association (ELRA)
Година издавања
2020
Сажетак рада на енглеском језику
Aligning senses across resources and languages is a challenging task with beneficial applications in the field of natural language processing and electronic lexicography. In this paper, we describe our efforts in manually aligning monolingual dictionaries. The alignment is carried out at sense-level for various resources in 15 languages. Moreover, senses are annotated with possible semantic relationships such as broadness, narrowness, relatedness, and equivalence. In comparison to previous datasets for this task, this dataset covers a wide range of languages and resources and focuses on the more challenging task of linking general-purpose language. We believe that our data will pave the way for further advances in alignment and evaluation of word senses by creating new solutions, particularly those notoriously requiring data such as neural networks. Our resources are publicly available at https://github.com/elexis-eu/MWSA.
Почетна страна рада
3232
Завршна страна рада
3242
Кључне речи на енглеском (одвојене знаком ", ")
lexical semantic resources, sense alignment, lexicography, language resource
Шира категорија рада према правилнику МПНТ
М30
Ужа категорија рада према правилнику МПНТ
М33
Ниво приступа
Отворени приступ
Лиценца
Creative Commons – Attribution-Share Alike 4.0 International
Формат датотеке
.pdf