publication . Article . Conference object . 2020

Modelling Etymology in LMF/TEI: The Grande Dicionário Houaiss da Língua Portuguesa Dictionary as a Use Case

Fahad Khan; Laurent Romary; Ana Salgado; Jack Bowers; Mohamed Khemakhem; Toma Tasovac;
Open Access English
  • Published: 11 May 2020
  • Publisher: HAL CCSD
  • Country: France
Due to COVID19 pandemic, the 12th edition is cancelled. The LREC 2020 Proceedings are available at; International audience; In this article, we will introduce two of the new parts of the new multi-part version of the Lexical Markup Framework (LMF) ISO standard, namely Part 3 of the standard (ISO 24613-3), which deals with etymological and diachronic data, and Part 4 (ISO 24613-4), which consists of a TEI serialisation of all of the prior parts of the model. We will demonstrate the use of both standards by describing the LMF encoding of a small number of examples taken from a sample conversion of the reference Portuguese dictionary Grande Dicionário Houaiss da Língua Portuguesa, part of a broader experiment comprising the analysis of different, heterogeneously encoded, Portuguese lexical resources. We present the examples in the Unified Modelling Language (UML) and also in a couple of cases in TEI.
free text keywords: Dictionaries, Portuguese Language Resources, LMF, TEI, [INFO.INFO-DL]Computer Science [cs]/Digital Libraries [cs.DL], [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL], WP1, WP5, strategies, tools, standards for lexicographic resources (objective 3), access to data and tools (objective 4), opennesse in lexicography (objective 5), LMF, TEI, Portuguese Language Resources, Dictionaries
Communities with gateway
OpenAIRE Connect image
Other Communities
Funded by
European Lexicographic Infrastructure
  • Funder: European Commission (EC)
  • Project Code: 731015
  • Funding stream: H2020 | RIA
Validated by funder
Any information missing or wrong?Report an Issue