An Approach to Efficient Processing of Multi-Word Units
Објеката
- Тип
- Поглавље у монографији
- Верзија рада
- објављена верзија
- Језик
- енглески
- Креатор
- Cvetana Krstev, Ivan Obradović, Ranka Stanković, Duško Vitas
- Извор
- Computational Linguistics - Applications, Studies in Computational Intelligence 458
- Уредник
- Adam Przepiórkowski, Maciej Piasecki, Krzysztof Jassem, Piotr Fuglewicz
- Издавач
- Berlin Heidelberg : Springer-Verlag
- Датум издавања
- 2013
- doi
- 10.1007/978-3-642-34399-5_6
- isbn
- 978-3642-343-98-8
- Subject
- Natural Language Processing, Grammatical Category, Lexical Representation, MWU, multi-word unit
- Шира категорија рада
- M10
- Ужа категорија рада
- M14
- Права
- Затворени приступ
- Лиценца
- All rights reserved
- Формат
- Број
- 458
- Почетна страна
- 109
- Завршна страна
- 129
- issn
- 1860-949X
- Сажетак
- Efficient processing of Multi-Word Units in the course of development of morphological MWU dictionaries is not easy to achieve, especially when languages with complex morphological structures are concerned, such as Serbian. Manual development of this type of dictionaries is a tedious and extremely slow process. To alleviate this problem we turned to our multipurpose software tool, dubbed LeXimir, in the production of lemmas for e-dictionaries of multi-word units. In addition to that, we developed a procedure aimed at making the production of MWU dictionary lemmas more efficient. This procedure, which strongly relies on our comprehensive e-dictionaries of Serbian simple words, was subsequently implemented as a new functionality LeXimir. In this paper we present our approach, and offer an evaluation of the performance of the new functionality of LeXimir, and hence of our procedure, obtained through two rounds of experiments on various types of data. The paper ends with a brief discussion of some further possible applications of both the procedure and LeXimir in various language processing tasks.
Cvetana Krstev, Ivan Obradović, Ranka Stanković, Duško Vitas. "An Approach to Efficient Processing of Multi-Word Units" in Computational Linguistics - Applications, Studies in Computational Intelligence 458 no. 458, Berlin Heidelberg : Springer-Verlag (2013): 109-129. https://doi.org/10.1007/978-3-642-34399-5_6