publication . Article . 2012

Consistent modelling of heterogeneous lexical structures

Laurent Romary; Werner Wegstein;
Open Access English
  • Published: 05 Nov 2012
  • Publisher: HAL CCSD
  • Country: France
Abstract
Our paper outlines a proposal for the consistent modeling of heterogeneous lexical structures in semasiological dictionaries, based on the element structures described in detail in chapter 9 (Dictionaries) of the TEI Guidelines. The core of our proposal describes a system of relatively autonomous lexical “crystals” that can, within the constraints of the relevant element’s definition, be combined to form complex structures for the description of morphological form, grammatical information, etymology, word-formation, and meaning for a lexical structure. The encoding structures we suggest guarantee sustainability and support re-usability and interoperability of data. This paper presents case studies of encoding dictionary entries in order to illustrate our concepts and test their usability. We comment on encoding issues involving <entry>, <form>, <etym>, and on refinements to the internal content of <sense>.
Persistent Identifiers
Subjects
free text keywords: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL], dictionary encoding, semasiological dictionary, entry, form, sense, Samuel Johnson, Computer software, QA76.75-76.765, dictionary encoding, semasiological dictionary, entry, form, sense, Samuel Johnson, Dictionary of the English Language, Encoding (memory), Natural language processing, computer.software_genre, computer, Interoperability, Structure (mathematical logic), Element (category theory), Artificial intelligence, business.industry, business, Lexical chain, Lexical choice, Usability, Lemma (morphology), Computer science
Communities
Communities with gateway
OpenAIRE Connect image
Other Communities
  • DARIAH EU
Any information missing or wrong?Report an Issue