A WordNet Ontology in Improving Searches of Digital Dialect Dictionary
In this paper, we present a method for automatic generation of a digital resource, which connects all indirect synonyms of a dialect term to all indirect synonyms of a corresponding term in the standard language, aiming to improve the search of a digital dialect dictionary. The method uses SWRL rules defined in the Serbian WordNet ontology to identify sets of synonymous words. It also uses e-dictionaries to produce correct lemmas in standard language that users usually employ in searches. ... ... for representing digital resources as knowledge based resources and as Linked Open Data (LOD) on the Web [6], [15]. Digital dictionary of the South Serbian dialect, containing over 20 thousand terms, is the first comprehensive implementation [7] of a digital version of a dialect vocabulary of the Serbian ...
... management of dialectal dictionaries [10], the software for data visualization and presenting of linguistic dialectal maps [12], [11], the analysis of the geographical distribution of a language and geographical information relevant for linguistic research [9], using of Semantic Web-based techniques for ...
... records). One of the records is shown below. It is related to the first example in the step (2) and represents an infinitive of a verb upropastiti linked to the 8 entries found in the dialect dictionary whose definitions contained this verb in the form of the first person singular upropastim. upropastiti ... Miljana Mladenović, Ranka Stanković, Cvetana Krstev. "A WordNet Ontology in Improving Searches of Digital Dialect Dictionary" in New Trends in Databases and Information Systems: ADBIS 2017 Short Papers and Workshops - SW4CH (Semantic Web for Cultural Heritage) 767, Springer International Publishing (2017). https://doi.org/10.1007/978-3-319-67162-8_37
An update on the mineral-like Sr-containing transition metal arsenates
Tamara Đorđević, Ljiljana Karanović (2021) We report on the crystal structures of three novel synthetic SrM-arsenates (M = Ni and Fe3+), isostructural or structurally related to the minerals from tsumcorite, carminite and brackebuschite groups. They were synthesised under mild hydrothermal conditions and further characterised using single-crystal X-ray diffraction (SXRD), scanning electron microscopy with energy dispersive spectroscopy (SEM-EDS) and Raman spectroscopy. SXRD and SEM-EDS yielded formulae: (I) SrNi2(AsO4)2·2H2O, (II) Sr1.4Fe3+1.6(AsO4)2(OH)1.6 and (III) SrFe3+(AsO4)(AsO3OH). All three structures are built up of slightly distorted MO6 octahedra ... metal arsenates, crystal structure, single-crystal X-ray diffraction, hydrothermal synthesis, Raman spectroscopy, environmentally relevant compounds, arsenic. Geochemistry and Petrology. Tamara Đorđević, Ljiljana Karanović. "An update on the mineral-like Sr-containing transition metal arsenates" in Mineralogical Magazine, Mineralogical Society (2021). https://doi.org/10.1180/mgm.2021.41
From DELA Based Dictionary to Leximirka Lexical Database
Biljana Lazić, Mihailo Škorić (2020) In this paper, we present an approach to transforming Serbian morphological dictionaries from a DELA text format into a lexical database dubbed Leximirka. Considering the benefits of storing data in a database rather than in textual documents, we outline some of the functionality that the database has made possible. We also show how hand-made rules that use the category labels with which lexical entries are marked can be used to link lexical entries. ... ... Database”. In Proceedings of the 11th International Conference on Language Resources and Evaluation - W23 6th Workshop on Linked Data in Linguistics : Towards Linguistic Data Science (LDL-2018), McCrae, John P., Christian Chiarcos, Thierry Declerck, Jorge Gracia and Bettina Klimek. Paris, France: ...
... dedicated to corpora. The plan is to link lexical records to the WordNet for the Serbian language. It is also envisaged to prepare the data for display in the form of Linked Open Data on the web, which would enable connection with other lexical resources. Since the application is independent of the language for ...
... domain, and semantic markers. Figure 2. A lexicographic database model for data category information. The DataCategories table stores information about marker categories, that is, marker type information. The table is linked to itself, allowing for hierarchical representation of categories that are ... Biljana Lazić, Mihailo Škorić. "From DELA Based Dictionary to Leximirka Lexical Database" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.4
Distant Reading in Digital Humanities: Case Study on the Serbian Part of the ELTeC Collection
Ranka Stanković, Cvetana Krstev, Branislava Šandrih Todorović, Duško Vitas, Mihailo Škorić, Milica Ikonić Nešić (2022) In this paper we present the Serbian part of the ELTeC multilingual corpus of novels written in the time period 1840-1920. The corpus is being built in order to test various distant reading methods and tools with the aim of re-thinking the European literary history. We present the various steps that led to the production of the Serbian sub-collection: the novel selection and retrieval, text preparation, structural annotation, POS-tagging, lemmatization and named entity recognition. The Serbian sub-collection was published ... Ranka Stanković, Cvetana Krstev, Branislava Šandrih Todorović, Duško Vitas, Mihailo Škorić, Milica Ikonić Nešić. "Distant Reading in Digital Humanities: Case Study on the Serbian Part of the ELTeC Collection" in Proceedings of the Language Resources and Evaluation Conference, June 2022, Marseille, France, European Language Resources Association (2022)
Combining Heterogeneous Lexical Resources
... instance, (2) devojcyin,A1+Pos+Ek_N=4ka devojka,N617+Hum+Ek_A=2cyin The information in the first line states that the adjective devojcyin is linked to the noun entry and also indicates the way to identify this noun in the dictionary. Conversely, the information in the second line links the noun ...
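The two entries in example (2) appear to encode each link as a rewrite rule: a marker such as `_N=4ka` seems to say "the linked entry is a noun; drop the last 4 characters of this lemma and append `ka`". The snippet does not spell out this rule, so the reading below is an assumption inferred only from the two entries shown, and the helper name `linked_lemma` is hypothetical. Under that assumption, a minimal Python sketch recovers each linked lemma from the other:

```python
import re

# Assumed marker shape: "_<POS>=<n><suffix>", e.g. "_N=4ka".
# This interpretation is inferred from example (2); it is not confirmed by the source.
LINK = re.compile(r"_([A-Z]+)=(\d+)(\w*)")

def linked_lemma(entry):
    """Return (part-of-speech code, lemma) of the entry that `entry` links to."""
    lemma, codes = entry.split(",", 1)
    match = LINK.search(codes)
    if match is None:
        raise ValueError("no link marker found in: " + entry)
    pos = match.group(1)
    strip = int(match.group(2))      # number of trailing characters to drop
    suffix = match.group(3)          # characters to append afterwards
    stem = lemma[:len(lemma) - strip]
    return pos, stem + suffix

print(linked_lemma("devojcyin,A1+Pos+Ek_N=4ka"))    # ('N', 'devojka')
print(linked_lemma("devojka,N617+Hum+Ek_A=2cyin"))  # ('A', 'devojcyin')
```

Both directions round-trip on the two sample entries, which is consistent with the assumed reading but does not prove it holds for the rest of the dictionary.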
... +Imperf mark and the passive past participle by the +PP mark. Similarly, the progressive verb povezivati in the second line of the example (5) is linked to its corresponding perfective verb povezati, the passive past participle povezivan and the verbal noun povezivanxe. This information can be used ...
... used to link and/or add synsets containing such literals through the DERIVED relation. For example, the following two synsets in Serbian WN were linked by the DERIVED relation using this information: (6) (zdruzxiti:1, povezati:1, …) � (join:2, bring together:1) (povezan:4) � (connected:2) ... Cvetana Krstev, Duško Vitas, Ranka Stanković, Ivan Obradović, Gordana Pavlović-Lažetić. "Combining Heterogeneous Lexical Resources" in Proceedings of the Fourth International Conference on Language Resources and Evaluation, Lisbon, Portugal, May 2004, vol. 4, ELRA - European Language Resources Association (2004)
3D modeling and monitoring of karst system as a base for its evaluation and utilization: a case study from eastern Serbia
Earth-Surface Processes, Geology, Pollution, Soil Science, Water Science and Technology, Environmental Chemistry, Global and Planetary Change. Saša Milanović, Zoran Stevanović, Ljiljana Vasić, Vesna Ristić-Vakanjac. "3D modeling and monitoring of karst system as a base for its evaluation and utilization: a case study from eastern Serbia" in Environmental Earth Sciences, Springer Science and Business Media LLC (2013). https://doi.org/10.1007/s12665-013-2591-9
Novi pristup proučavanju tekstura minerala značajnih za istraživanje rudnih ležišta
Aleksandar Pačevski (2022) ... implications Ranka Stanković, Harmonizacija geopodataka korišćenjem povezanih otvorenih podataka Ranka Stanković, Harmonization of Geodata Using Linked Open Data 18. Kongres geologa Srbije "Geologija rešava probleme", Divčibare, 01-04 jun 2022. Zbornik apstrakata Book of abstracts 18th Serbian Geological ...
... laser ablation (LA-ICP-MS) and other similar methods put a very "powerful weapon" in the researcher's hands. However, if such a weapon is aimed even slightly wrongly, the result is a series of erroneous interpretations and conclusions that can seriously harm the research itself. The proper use of modern methods ...
... such as exsolution textures, replacement textures and others, which have contributed significantly to the understanding of the origin of minerals and of ore-forming processes. It can be said that ore microscopy, as a separate discipline of polarized-light microscopy, reached its peak in the middle and second half of the 20th century, where one should certainly ... Aleksandar Pačevski. "Novi pristup proučavanju tekstura minerala značajnih za istraživanje rudnih ležišta" in 18. Kongres geologa Srbije “Geologija rešava probleme” Divčibare, 01-04 jun 2022., Beograd : Srpsko geološko društvo (2022)
A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian
Abusive speech on social media, including profanity, derogatory speech and hate speech, has reached pandemic levels. A system able to detect such texts could help make the internet and social media a better and more respectful virtual space. Research and commercial applications in this area have so far focused mainly on English. This paper presents work on building AbCoSER, the first corpus of abusive speech in Serbian. The corpus consists of 6,436 manually annotated ... ... Twitter, lexicon, corpus Digital Object Identifier 10.4230/OASIcs.LDK.2021.13 Funding Linked data development is supported by the COST Action CA18209-NexusLinguarum “European network for Web-centred linguistic data science”. Access to SketchEngine and Lexonomy is provided by the ELEXIS project funded ...
... for information integration is the Linked (Open) Data (LOD) paradigm that is used for publishing lexical resources by using URIs to unambiguously identify lexical entries, their components and their relations in the web of data. Moreover, it is used to make lexical data sets accessible via http(s), to publish ...
... between lexical data sets and other LOD resources [8]. The goal of our research is to make its results compatible with the Linked Data approach, using its set of design principles for sharing machine-readable interlinked data on the Web. This vision of globally accessible and linked data on the internet ...Danka Jokić, Ranka Stanković, Cvetana Krstev, Branislava Šandrih. "A Twitter Corpus and Lexicon for Abusive Speech Detection in Serbian" in 3rd Conference on Language, Data and Knowledge (LDK 2021), MDPI AG (2021). https://doi.org/10.4230/OASIcs.LDK.2021.13
Ontološki model upravljanja rizikom u rudarstvu
Olivera Kitanović (2021) Mining production involves complex technological systems, which calls for establishing and improving risk management systems. The heterogeneity and volume of the data needed for risk management require a system that integrates them flexibly and enables their optimal use. The main goal of this dissertation is the development of an ontology for the mining domain and of a risk management model based on it. Its realization also involves the implementation of information extraction algorithms for populating the ontology, as well as an appropriate software solution. The development of the model also includes a significant expansion of the mining corpus, as ... mining, risk, risk management, risk assessment, ontology, semantic network, information extraction, knowledge management, computational linguistics ... Database.” In Proceedings of the Eleventh International Conference on Language Resources and Evaluation - W23 6th Workshop on Linked Data in Linguistics : Towards Linguistic Data Science (LDL-2018), LREC 2018, edited by John P. McCrae, Christian Chiarcos, Thierry Declerck, Jorge Gracia, and Bettina Klimek ...
... ds/semanticweb/data 64 legal, technological or social barriers, using open standards. Linked Open Data is today an increasingly popular term denoting open data in RDF format, regardless of the domain they come from (public administration or the business sector), so Serbia has also joined ...
... of their relations in the web of data. The part of the open data web called Linked Open Data (LOD) comprises content that is available for use and distribution without 55 https://www.w3.org/standards/semanticweb/data 56 A Uniform Resource Identifier (URI) is a unique string of characters that ... Olivera Kitanović. Ontološki model upravljanja rizikom u rudarstvu, Beograd : [O. Kitanović], 2021
A Mathematical Learning Environment Based on Serbian Language Resources
In recent years, in line with ever growing usage of Information technology, the learning environments are changing. The amount of available learning materials in various forms has increased. These new environments demand comprehensive learning systems, which enable management of the learning corpus with special attention paid to relevant lexical resources. In this paper we present the concept of a Mathematical Learning Environment in Serbian (MLES), which is based on a corpus of mathematical materials and various lexical resources, enabling ...... engineering practice based on mathematical concepts (Figure 3). Results of the third component are annotated and linked texts, where every mathematical term in the text is linked to the appropriate dictionary entry or relevant corpus content related to that term. This system component also ...
... presented. Annotation also alleviates the statistical analysis of the corpus, namely automatic assignment of the distribution of annotated linguistic properties. 5. APPLICATION TERMI The Termi application has recently been launched to serve as a support for the development of terminological ...
... be developed for special functions, integrals, equations and the like. For corpus management, we have used the IMS Open Corpus Workbench (CWB) as a collection of open-source tools [14] and an adaptation of CQPweb, a web-based graphical user interface designed specifically for CWB query ...Radojičić Marija, Obradović Ivan, Stanković Ranka, Utvić Miloć, Kaplar Sebastijan. "A Mathematical Learning Environment Based on Serbian Language Resources" in Proceedings of the 7th International Scientific Conference Technics and Informatics in Education, Faculty of Technical Sciences, Čačak (2018)
Razvoj integralnog modela za optimizaciju diskontinualnog sistema proizvodnje na površinskim kopovima nemetala
Miodrag Čelebić (2024) The selection of optimal machinery in a discontinuous transport system is one of the most important decisions to be made when designing open-pit mines. The selection is influenced by a large number of factors, both natural (geological and environmental) and technical, economic and social. Some of them can be expressed numerically, in specific units of measurement, and others descriptively, by linguistic variables, according to the conditions prevailing in ore deposits, which are characterized by a high degree of uncertainty and indeterminacy both during exploration and during exploitation ... machinery selection, exploitation technology, discontinuous system, expert decision-making, linguistic variables, MCDM, FAHP. Miodrag Čelebić. Razvoj integralnog modela za optimizaciju diskontinualnog sistema proizvodnje na površinskim kopovima nemetala, Beograd : [M. Čelebić], 2024
Improvement of geodatabase queries within GeolISS
Ranka Stanković (2008)... highlighted spatial objects that satisfy query conditions. It is further possible to automatically open the data form with selected (highlighted) objects and to perform further filtering, exploration and data management. Figure 4. Form for spatial objects searching Ranka Stanković ...
... raster and the vector data model to represent reality. Raster data sets record a value for each point in the area of interest, which may require more storage space than the representation in vector format that stores only the necessary data. Vector data can be displayed as vector ...
... vector data is usually much smaller than the space required for raster data. Another advantage of vector data is that they can be easily updated and maintained. In GeolISS vectorization of geologic maps is chosen as the approach to digitization of geological structures, namely geospatial data in general ...Ranka Stanković. "Improvement of geodatabase queries within GeolISS" in Review of the National Center for Digitization, Beograd : Faculty of Mathematics, Belgrade (2008)
Knowledge and Rule-Based Diacritic Restoration in Serbian
In this paper we present a procedure for the restoration of diacritics in Serbian texts written using the degraded Latin alphabet. The procedure relies on the comprehensive lexical resources for Serbian: the morphological electronic dictionaries, the Corpus of Contemporary Serbian and local grammars. Dictionaries are used to identify possible candidates for the restoration, while the data obtained from SrpKor and local grammars assists in making a decision between several candidates in cases of ambiguity. The evaluation results reveal that, depending on the text, accuracy ranges from 95.03% to 99.36%, while the precision (average 98.93%) is always higher than the recall (average 94.94%). ... and Geology archives faculty publications available in open access, as well as the employees' publications. - The Repository is available at: www.dr.rgf.bg.ac.rs Conventional information retrieval thesauri can also be considered as linguistic ontologies because they are based on real terms of a subject ...
... WordNet Conference, Seged, Hungary, pages 162–177. Dobrov, B. and Loukachevitch, N. (2006). Development of linguistic ontology on natural sciences and technology. In Proceedings of Linguistic Resources and Evaluation Conference, pages 1077–1082. Fellbaum, C., Ed. (1998). WordNet: An Electronic Lexical ...
... ontologies. Rules for inclusion of phrases in the thesaurus are more similar to information-retrieval thesauri guidelines (NISO, 2005). Each concept is linked with words and phrases conveying the concept in texts (text entries). Detailed description of lexical units (words in specific senses), representation ...Cvetana Krstev, Ranka Stanković, Duško Vitas. "Knowledge and Rule-Based Diacritic Restoration in Serbian" in Proceedings of the Third International Conference Computational Linguistics in Bulgaria (CLIB 2018), May 27-29, 2018, Sofia, Bulgaria, Sofia : The Institute for Bulgarian Language Prof. Lyubomir Andreychin, Bulgarian Academy of Sciences (2018): 41-51
E-Connecting Balkan Languages
In this paper we present a versatile language processing tool that can be successfully used for many Balkan languages. This tool relies for its work on several sophisticated textual and lexical resources that were developed for most Balkan languages. These resources are based on several de facto standards in natural language processing. ... analysis of text. The tool WS4LR expects them to be in the form of wordnets, that is, nodes representing sets of synonymous words (synsets) which are linked by various semantic relations. The first built wordnet was English wordnet, so-called Princeton Wordnet (PWN), having today approximately 140,000 ...
... languages and the way they are connected. Serbian wordnet today consists of more than 15,000 synsets built by approx. 25,000 literals. All of them are linked to PWN, except for 532 Balkan specific concepts that are connected with other Balkan languages, and 155 Serbian specific concepts that remain ...
... unconnected with other languages. Bulgarian wordnet consists of more than 31,000 synsets built by more than 66,000 literals. The synsets are linked with the PWN as well, again there are 436 Balkan specific concepts shared with other Balkan languages and 182 Bulgarian language specific concepts ... Cvetana Krstev, Ranka Stanković, Duško Vitas, Svetla Koeva. "E-Connecting Balkan Languages" in Proceedings of the Workshop on Multilingual resources, technologies and evaluation for Central and Eastern European Languages, 17 September 2009, eds. C. Vertan, S. Piperidis, E. Paskaleva and Milena Slavcheva, Borovets, Bulgaria : Association for Computational Linguistics Stroudsburg, PA, USA (2009)
Sentiment Analysis of Serbian Old Novels
In this paper we present the first study of Sentiment Analysis (SA) of Serbian novels from the 1840-1920 period. The preparation of the sentiment lexicon was based on three existing lexicons: NRC, AFINN and Bing, with additional extensive corrections. The first phase of dataset refinement included filtering out the words that are not found in the Serbian morphological dictionary, and in the second, the automatic POS tags and lemmas were manually corrected. The polarity lexicon was extracted and transformed into ontolex-lemon and published as initial ... Ranka Stanković, Miloš Košprdić, Milica Ikonić Nešić, Tijana Radović. "Sentiment Analysis of Serbian Old Novels" in Proceedings of the 2nd Workshop on Sentiment Analysis and Linguistic Linked Data, June 2022, Marseille, France, European Language Resources Association (2022)
Keyword-Based Search on Bilingual Digital Libraries
This paper outlines the main features of Biblisha, a tool that offers various possibilities of enhancing queries submitted to large collections of aligned parallel text residing in a bilingual digital library. Biblisha supports keyword queries as an intuitive way of specifying information needs. The keyword queries initiated, in Serbian or English, can be expanded semantically, morphologically and into the other language, using different supporting monolingual and bilingual resources. Terminological and lexical resources are of various types, such as wordnets, electronic ... Ranka Stanković, Cvetana Krstev, Duško Vitas, Nikola Vulović, Olivera Kitanović. "Keyword-Based Search on Bilingual Digital Libraries" in Semantic Keyword-Based Search on Structured Data Sources - Second COST Action IC1302 International KEYSTONE Conference, IKC 2016, Springer (2017). https://doi.org/10.1007/978-3-319-53640-8_10
Coal homogenization stockyard sizing “Tamnava – Zapad” case study
... for the geometry of a stockpile, using both actual grade data and a variogram to describe the input variation. An application of efficient methods of conditional simulation for optimizing coal blending strategies in large continuous open pit mining operations (Benndorf, 2013) integrates simulated ...
... considered a small number, and above that a large number of layers. The number of layers that ensures consistent quality of the stacked coal is linked to the value range of the observed parameter, and is determined by statistical methods - through standard deviation. When quality is assessed based ...
... methods. (Pavloudakis & Agioutantis, 2001) For "Tamnava Zapadno Polje" a simulation was run on a small test site using data from the technological block model, as well as simulated data based on statistical parameters: average value and standard deviation. It should be pointed out that increasing the ... Božo Kolonja, Dinko Knežević, Ranka Stanković, Dejan Stevanović, Detlef Trummer. "Coal homogenization stockyard sizing “Tamnava – Zapad” case study" in Proceedings of 13th International Symposium Continuous Surface Mining, ISCSM 2016, / (2016)
Towards a Mining Equipment Ontology
... all types of specific equipment (excavation machinery, conveying and auxiliary machinery, pumps etc.). Data on exploitation areas, open pit or work bench are modeled by the class Područje. The open pit, that is, the area where the equipment is operating is modeled by the class OpremaUPodrucju. Values ...
... with the basic interface, used for invoking all functionalities implemented within the system. The interface for managing system data pertaining to user profiles, users, open pits, excavation machinery and conveying mechanization and technological systems is realized within the SukuMine library. The ...
... ">machine for excavating. Figure 7: Panel for data export from RudOnto A part of the OWL code resulting from the export is given below: The exported OWL code can be viewed and verified through the free, open source ontology editor and knowledge-base framework Protégé ( ...Ranka Stanković, Ivan Obradović, Olivera Kitanović, Ljiljana Kolonja. "Towards a Mining Equipment Ontology" in Proceedings of the 12th International Conference Research and Development in Mechanical Industry, RaDMI 2012, September 2012, Vrnjačka Banja, Serbia no. 1, Vrnjačka Banja, Serbia : SaTCIP (Scientific and Technical Center for Intellectual Property) Ltd. (2012)
Low-temperature phase transition and magnetic properties of K3YbSi2O7
Predrag Dabić, Volker Kahlenberg, Biljana Krüger, Marko Rodić, Sabina Kovač, Jovan Blanuša, Zvonko Jagličić, Ljiljana Karanović, Václav Petříček, Aleksandar Kremenović (2021) alkali rare-earth silicates, phase transitions, magnetic properties, crystal field splitting, lanthanide silicates ... from a regular alternation of layers of two types, which are parallel to the (001) plane. In the octahedral layer, YbO6 octahedra are isolated and linked by K1O6+3 polyhedra. The second, slightly thicker sorosilicate layer is formed by a combination of Si2O7 dimers and K2O6+3 polyhedra. The boundary ...
... situated at z ≈ 1/4 is significantly thicker (thickness: ~4.3 Å). By sharing common vertices each YbO6 octahedron from the octahedral layer is linked to six Si2O7 groups from two neighbouring sorosilicate slabs via Yb–O–Si linkages. At the boundary between adjacent octahedral and sorosilicate ...
Two approaches to compilation of bilingual multi-word terminology lists from lexical resources
In this paper, we present two approaches and the implemented system for bilingual terminology extraction that rely on an aligned bilingual domain corpus, a terminology extractor for a target language, and a tool for chunk alignment. The two approaches differ in the way terminology for the source language is obtained: the first relies on an existing domain terminology lexicon, while the second one uses a term extraction tool. For both approaches, four experiments were performed with two parameters being ...Branislava Šandrih, Cvetana Krstev, Ranka Stanković. "Two approaches to compilation of bilingual multi-word terminology lists from lexical resources" in Natural Language Engineering, Cambridge University Press (CUP) (2020). https://doi.org/10.1017/S1351324919000615