Collected Item: “Managing mining project documentation using human language technology”
Publication type
Journal article
Version of the work
Published version
Language of the work
English
Author(s) (e.g. Milan Marković, Nikola Nikolić)
Aleksandra Tomašević, Ranka Stanković, Miloš Utvić, Ivan Obradović, Božo Kolonja
Title of the work (Title - subtitle)
Managing mining project documentation using human language technology
Journal title
The Electronic Library
Year of publication
2018
Abstract in Serbian
Purpose: This paper aims to develop a system, which would enable efficient management and exploitation of documentation in electronic form, related to mining projects, with information retrieval and information extraction (IE) features, using various language resources and natural language processing.
Design/methodology/approach: The system is designed to integrate textual, lexical, semantic and terminological resources, enabling advanced document search and extraction of information. These resources are integrated with a set of Web services and applications, for different user profiles and use-cases.
Findings: The use of the system is illustrated by examples demonstrating keyword search supported by Web query expansion services, search based on regular expressions, corpus search based on local grammars, followed by extraction of information based on this search and finally, search with lexical masks using domain and semantic markers.
Originality/value: The presented system is the first software solution for implementation of human language technology in management of documentation from the mining engineering domain, but it is also applicable to other engineering and non-engineering domains. The system is independent of the type of alphabet (Cyrillic and Latin), which makes it applicable to other languages of the Balkan region related to Serbian, and its support for morphological dictionaries can be applied in most morphologically complex languages, such as Slavic languages. Significant search improvements and the efficiency of IE are based on semantic networks and terminology dictionaries, with the support of local grammars.
Journal volume
36
Journal issue
6
First page
993
Last page
1009
DOI
10.1108/EL-11-2017-0239
Journal ISSN
0264-0473
Keywords in Serbian (separated by ", ")
Digital libraries, Information retrieval, Data mining, Human language technologies, Project documentation
Link
https://www.emerald.com/insight/content/doi/10.1108/EL-11-2017-0239/full/html?af=R
Broader category of the work according to the MPNT rulebook
M20
Narrower category of the work according to the MPNT rulebook
M23
Project under which the work was produced
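Unapređenje tehnologije površinske eksploatacije lignita u cilju povećanja energetske efikasnosti, sigurnosti i zaštite na radu [Improving lignite surface mining technology to increase energy efficiency, safety, and occupational protection]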
Access level
Closed access
License
All rights reserved
Digital object format
.pdf