Show simple item record

dc.creator: Stanković, Ranka
dc.creator: Stijović, Rada
dc.creator: Vitas, Duško
dc.creator: Krstev, Cvetana
dc.creator: Sabo, Olga
dc.date.accessioned: 2019-02-20T11:08:41Z
dc.date.available: 2019-02-20T11:08:41Z
dc.date.issued: 2018
dc.identifier.isbn: 978-961-06-0097-8
dc.identifier.uri: https://dais.sanu.ac.rs/123456789/4927
dc.description.abstract: In this paper we discuss the project of digitization of the Dictionary of the Serbo-Croatian Standard and Vernacular Language. Scanning and character recognition were a particular challenge, since various non-standard character set encodings were used in the course of the almost 60-year-long production of the dictionary. The first aim of the project was to formalize the micro-structure of the dictionary articles in order to parse the digitized text and transform it into structured data stored in a relational lexical database. This approach is compatible with several standard structured forms and ontologies (TEI, LMF, Ontolex, LexInfo). A lexical database model was designed in compliance with these structured forms, following mostly the lemon model. Mapping of the lexical entry markers to LexInfo and TEI enabled export of the lexical data to the mentioned formats. A software solution for the dictionary text analysis, parsing and lexical database population was developed and tested on the first and the last published volumes of the dictionary (which contain 27,141 articles in total). An evaluation of the results shows that the developed model and software solution can be successfully used for the other volumes as well. [en]
dc.language.iso: en [sr]
dc.publisher: Ljubljana : Ljubljana University Press, Faculty of Arts [sr]
dc.relation: info:eu-repo/grantAgreement/MESTD/Basic Research (BR or ON)/178009/RS// [sr]
dc.rights: openAccess [sr]
dc.rights.uri: https://creativecommons.org/licenses/by-nc-nd/4.0/
dc.source: Proceedings of the XVIII EURALEX International Congress: Lexicography in Global Contexts [sr]
dc.subject: computer lexicography [sr]
dc.subject: lexical database [sr]
dc.subject: language resources [sr]
dc.subject: dictionary [sr]
dc.subject: Serbian language [sr]
dc.title: The Dictionary of the Serbian Academy: from the Text to the Lexical Database [en]
dc.type: article [sr]
dc.rights.license: BY-NC-ND [sr]
dcterms.abstract: Стијовић, Рада; Сабо, Олга; Крстев, Цветана; Станковић, Ранка; Витас, Душко;
dc.citation.spage: 941
dc.citation.epage: 949
dc.type.version: publishedVersion [sr]
dc.identifier.fulltext: https://dais.sanu.ac.rs/bitstream/id/15375/stankovic.stijovic.vitas.krstev.sabo.dictionary.pdf
dc.identifier.rcub: https://hdl.handle.net/21.15107/rcub_dais_4927


Files


This item appears in the following collections
