You have already added 0 works in your ORCID record related to the merged Research product.
You have already added 0 works in your ORCID record related to the merged Research product.
<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=undefined&type=result"></script>');
-->
</script>
TEI Lex-0: A Target Format for TEI-Encoded Dictionaries and Lexical Resources
TEI Lex-0: A Target Format for TEI-Encoded Dictionaries and Lexical Resources
International audience; Achieving consistent encoding within a given community of practice has been a recurrent issue for the TEI Guidelines. The topic is of particular importance for lexical data if we think of the potential wealth of content we could gain from pooling together the information available in the variety of highly structured, historical and contemporary lexical resources. Still, the encoding possibilities offered by the Dictionaries Chapter in the Guidelines are too numerous and too flexible to guarantee sufficient interoperability and a coherent model for searching, visualising or enriching multiple lexical resources.Following the spirit of TEI Analytics [Zillig, 2009], developed in the context of the MONK project, TEI Lex-0 aims at establishing a target format to facilitate the interoperability of heterogeneously encoded lexical resources. This is important both in the context of building lexical infrastructures as such [Ermolaev and Tasovac, 2012] and in the context of developing generic TEI-aware tools such as dictionary viewers and profilers. The format itself should not necessarily be one which is used for editing or managing individual resources, but one to which they can be univocally transformed to be queried, visualised, or mined in a uniform way. We are also aiming to stay as aligned as possible with the TEI subset developed in conjunction with the revision of the ISO LMF (Lexical Markup Framework) standard so that coherent design guidelines can be provided to the community (cf. [Romary, 2015]).The paper will provide an overview of the various domains covered by TEI Lex- 0 and the main decisions that were taken over the last 18 months: constraining the general structure of a lexical entry; offering mechanisms to overcome the limits of when used in retro-digitized dictionaries (by allowing, for instance, and as children of ); systematizing the representation of morpho-syntactic information [Bański et al., 2017]; providing a strict -based encoding of sense-related information; deprecating ; dealing with internal and external references in dictionary entries, providing more advanced encodings of etymology (see submission by Bowers, Herold and Romary); as well as defining technical constraints on the systematic use of @xml:id at different levels of the dictionary microstructure. The activity of the group has already lead to changes in the Guidelines in response to specific GitHub tickets.
dictionaries, [INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL], TEI Lex-0, modeling, [SCCO.LING] Cognitive science/Linguistics, [SCCO.LING]Cognitive science/Linguistics, TEI, [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
dictionaries, [INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL], TEI Lex-0, modeling, [SCCO.LING] Cognitive science/Linguistics, [SCCO.LING]Cognitive science/Linguistics, TEI, [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
Bański, Piotr, Jack Bowers and Tomaz Erjavec (2017). TEI-Lex0 guidelines for the encoding of dictionary information on written and spoken forms. Electronic Lexicography in the 21st Century: Proceedings of ELex 2017 Conference, Sep Leiden, Netherlands. hal-01757108
Ermolaev, Natalia, and Toma Tasovac (2012) “Building a Lexicographic Infrastructure for Serbian Digital Libraries.” Libraries in the Digital Age (LIDA) Proceedings 12, no. 0 (June 12). http://ozk.unizd.hr/proceedings/index.php/lida/article/view/55.
ISO 24613:2008 Language resource management - Lexical markup framework (LMF), currently revised as a multipart standards with Part 1: core model, Part 2: Machine Readable Dictionaries, Part 3: Etymology, Part 4: TEI serialisation
Romary, Laurent (2015). TEI and LMF crosswalks. JLCL - Journal for Language Technology and Computational Linguistics, 30 (1), http://www.jlcl.org . hal00762664v4
Zillig, Brian (2009) “TEI Analytics: Converting Documents into a TEI Format for CrossCollection Text Analysis.” Literary and Linguistic Computing 24 (2009): 187-192. https://doi.org/10.1093/llc/fqp005
2 Research products, page 1 of 1
citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).0 popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.Average influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).Average impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.Average visibility views 118 download downloads 80 citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).0 popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.Average influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).Average impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.Average Powered byBIP!- 118views80downloads