- home
- Search
223 Research products, page 1 of 23
Loading
- Publication . 2021Open Access EnglishAuthors:Bowers, Jack; Herold, Axel; Romary, Laurent; Tasovac, Toma;Bowers, Jack; Herold, Axel; Romary, Laurent; Tasovac, Toma;Publisher: HAL CCSDCountry: France
The present paper describes the etymological component of the TEI Lex-0 initiative which aims at defining a terser subset of the TEI guidelines for the representation of etymological features in dictionary entries. Going beyond the basic provision of etymological mechanisms in the TEI guidelines, TEI Lex-0 Etym proposes a systematic representation of etymological and cognate descriptions by means of embedded constructs based on the (for etymologies) and (for etymons and cognates) elements. In particular, given that all the potential contents of etymons are highly analogous to those of dictionary entries in general, the contents presented herein heavily re-use many of the corresponding features and constraints introduced in other components of the TEI Lex-0 to the encoding of etymologies and etymons. The TEI Lex-0 Etym model is also closely aligned to ISO 24613-3 on modelling etymological data and the corresponding TEI serialisation available in ISO 24613-4.
- Publication . Other literature type . Part of book or chapter of book . Book . 2020Open Access EnglishAuthors:Edmond, Jennifer; Romary, Laurent;Edmond, Jennifer; Romary, Laurent;Publisher: Open Book PublishersCountry: France
Introduction The scholarly monograph has been compared to the Hapsburg monarchy in that it seems to have been in decline forever! It was in 2002 that Stephen Greenblatt, in his role as president of the US Modern Language Association, urged his membership to recognise what he called a ‘crisis in scholarly publication’. It is easy to forget now that this crisis, as he then saw it, had nothing to do with the rise of digital technologies, e-publishing, or open access. Indeed, it puts his words in...
- Publication . 2019Open AccessAuthors:Angela Cossu;Angela Cossu;Country: France
International audience
- Publication . 2013Open Access FrenchAuthors:Savonnet, Marinette;Savonnet, Marinette;Publisher: HAL CCSDCountry: France
Les Systèmes d'Information Scientifique (SIS) sont des Systèmes d'Information (SI) dont le but est de produire de la connaissance et non pas de gérer ou contrôler une activité de production de biens ou de services comme les SI d'entreprise. Les SIS se caractérisent par des domaines de recherche fortement collaboratifs impliquant des équipes pluridisciplinaires et le plus souvent géographiquement éloignées, ils manipulent des données aux structures très variables dans le temps qui vont au-delà de la simple hétérogénéité : nuages de points issus de scanner 3D, modèles numériques de terrain, cartographie, publications, données issues de spectromètre de masse ou de technique de thermoluminescence, données attributaires en très grand volume, etc. Ainsi, contrairement aux bases de données d'entreprise qui sont modélisées avec des structures établies par l'activité qu'elles supportent, les données scientifiques ne peuvent pas se contenter de schémas de données pré-definis puisque la structure des données évolue rapidement de concert avec l'évolution de la connaissance. La gestion de données scientifiques nécessite une architecture de SIS ayant un niveau d'extensibilité plus élevé que dans un SI d'entreprise. Afin de supporter l'extensibilité tout en contrôlant la qualité des données mais aussi l'interopérabilité, nous proposons une architecture de SIS reposant sur : - des données référentielles fortement structurées, identifiables lors de la phase d'analyse et amenées à évoluer rarement ; - des données complémentaires multi-modèles (matricielles, cartographiques, nuages de points 3D, documentaires, etc.). Pour établir les liens entre les données complémentaires et les données référentielles, nous avons utilisé un unique paradigme, l'annotation sémantique. Nous avons proposé un modèle formel d'annotation à base ontologique pour construire des annotations sémantiques dont la cohérence et la consistance peuvent être contrôlées par une ontologie et des règles. Dans ce cadre, les annotations offrent ainsi une contextualisation des données qui permet de vérifier leur cohérence, par rapport à la connaissance du domaine. Nous avons dressé les grandes lignes d'une sémantique du processus d'annotation par analogie avec la sémantique des langages de programmation. Nous avons validé notre proposition, à travers deux collaborations pluridisciplinaires : - le projet ANR CARE (Corpus Architecturae Religiosae Europeae - IV-X saec. ANR-07- CORP-011) dans le domaine de l'archéologie. Son objectif était de développer un corpus numérique de documents multimédia sur l'évolution des monuments religieux du IVe au XIe siècle (http://care.tge-adonis.fr). Un assistant d'annotation a été développé pour assurer la qualité des annotations par rapport à la connaissance représentée dans l'ontologie. Ce projet a donné lieu au développement d'une extension sémantique pour MediaWiki ; - le projet eClims dans le domaine de la protéomique clinique. eClims est un composant clinique d'un LIMS (Laboratory Information Management System) développé pour la plate-forme de protéomique CLIPP. eClims met en oeuvre un outil d'intégration basé sur le couplage entre des modèles représentant les sources et le système protéomique, et des ontologies utilisées comme médiatrices entre ces derniers. Les différents contrôles que nous mettons en place garantissent la validité des domaines de valeurs, la complétude, la consistance des données et leur cohérence. Le stockage des annotations est assuré par une Base de Données orientées colonnes associée à une Base de Données relationnelles.
- Publication . Article . Conference object . Preprint . 2016Open Access EnglishAuthors:Grefenstette, Gregory; Muchemi, Lawrence;Grefenstette, Gregory; Muchemi, Lawrence;Country: France
International audience; Current research in lifelog data has not paid enough attention to analysis of cognitive activities in comparison to physical activities. We argue that as we look into the future, wearable devices are going to be cheaper and more prevalent and textual data will play a more significant role. Data captured by lifelogging devices will increasingly include speech and text, potentially useful in analysis of intellectual activities. Analyzing what a person hears, reads, and sees, we should be able to measure the extent of cognitive activity devoted to a certain topic or subject by a learner. Test-based lifelog records can benefit from semantic analysis tools developed for natural language processing. We show how semantic analysis of such text data can be achieved through the use of taxonomic subject facets and how these facets might be useful in quantifying cognitive activity devoted to various topics in a person's day. We are currently developing a method to automatically create taxonomic topic vocabularies that can be applied to this detection of intellectual activity.
Average popularityAverage popularity In bottom 99%Average influencePopularity: Citation-based measure reflecting the current impact.Average influence In bottom 99%Influence: Citation-based measure reflecting the total impact.add Add to ORCIDPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product. - Publication . Other literature type . Conference object . 2015Open Access EnglishAuthors:Boukhelifa, Nadia; Giannisakis, Emmanouil; Dimara, Evanthia; Willett, Wesley; Fekete, Jean-Daniel;Boukhelifa, Nadia; Giannisakis, Emmanouil; Dimara, Evanthia; Willett, Wesley; Fekete, Jean-Daniel;Publisher: HAL CCSDCountry: FranceProject: EC | CENDARI (284432)
International audience; In this paper we describe the development and evaluation of a visual analytics tool to support historical research. Historians continuously gather data related to their scholarly research from archival visits and background search. Organising and making sense of all this data can be challenging as many historians continue to rely on analog or basic digital tools. We built an integrated note-taking environment for historians which unifies a set of func-tionalities we identified as important for historical research including editing, tagging, searching, sharing and visualization. Our approach was to involve users from the initial stage of brainstorming and requirement analysis through to design, implementation and evaluation. We report on the process and results of our work, and conclude by reflecting on our own experience in conducting user-centered visual analytics design for digital humanities.
- Publication . Conference object . 2012EnglishAuthors:Bel, Bernard;Bel, Bernard;Publisher: HAL CCSDCountry: FranceProject: ANR | ORTOLANG (ANR-11-EQPX-0032)
In 2008, a pilot project initiated by TGE Adonis, a large research infrastructure, brought together designers of data repositories, archivists and system engineers to set up collaborative oral/linguistic resource centres in France. This paper discusses challenging issues addressed by this team when implementing an Open Archival Information System (OAIS) bundled with an institutional archive. After the completion of the pilot project, the Speech & Language Data Repository (SLDR) underwent development for the systematic management of access rights in compliance with the French Heritage code. Its framework claims to be applicable to other systems worldwide, which would facilitate interoperability between protected repositories equipped with transfer of authentication techniques (Single Sign-On).
- Publication . Article . Other literature type . Conference object . 2020Open AccessAuthors:Stefan Bornhofen; Marten Düring;Stefan Bornhofen; Marten Düring;Publisher: Springer Science and Business Media LLCCountry: FranceProject: ANR | BLIZAAR (ANR-15-CE23-0002)
AbstractThe paper presents Intergraph, a graph-based visual analytics technical demonstrator for the exploration and study of content in historical document collections. The designed prototype is motivated by a practical use case on a corpus of circa 15.000 digitized resources about European integration since 1945. The corpus allowed generating a dynamic multilayer network which represents different kinds of named entities appearing and co-appearing in the collections. To our knowledge, Intergraph is one of the first interactive tools to visualize dynamic multilayer graphs for collections of digitized historical sources. Graph visualization and interaction methods have been designed based on user requirements for content exploration by non-technical users without a strong background in network science, and to compensate for common flaws with the annotation of named entities. Users work with self-selected subsets of the overall data by interacting with a scene of small graphs which can be added, altered and compared. This allows an interest-driven navigation in the corpus and the discovery of the interconnections of its entities across time.
Average popularityAverage popularity In bottom 99%Average influencePopularity: Citation-based measure reflecting the current impact.Average influence In bottom 99%Influence: Citation-based measure reflecting the total impact.add Add to ORCIDPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product. - Publication . Report . 2019EnglishAuthors:Szprot, Jakub; Arpagaus, Brigitte; Ciula, Arianna; Clivaz, Claire; Gabay, Simon; Honegger, Matthieu; Hughes, Lorna; Immenhauser, Beat; Jakeman, Neil; Lhotak, Martin; +8 moreSzprot, Jakub; Arpagaus, Brigitte; Ciula, Arianna; Clivaz, Claire; Gabay, Simon; Honegger, Matthieu; Hughes, Lorna; Immenhauser, Beat; Jakeman, Neil; Lhotak, Martin; Romanova, Natasha; Ros, Salvador; Schulthess, Sara; Tahko, Tuuli; Tolonen, Mikko; Erdinast Vulcan, Daphna; Willa, Pierre; Zehavi, Ora;Publisher: HAL CCSDCountry: FranceProject: EC | DESIR (731081)
This report provides information about activities and progress towards establishing DARIAH membership in six countries: the Czech Republic, Finland, Israel, Spain, Switzerland, and the UK, which took place between July and December 2019. Previous activities were described in detail in the D3.2 - Regularly Monitor Country-Specific Progress in Enabling New DARIAH Membership. During the project lifetime, the Czech Republic joined DARIAH ERIC; in other countries, collaboration with DARIAH has been greatly strengthened and significant progress regarding DARIAH membership has been achieved. The report also outlines the next steps in the accession processes, building on the results of the DESIR project.
- FrenchAuthors:Pouyllau, Stéphane;Pouyllau, Stéphane;Publisher: HAL CCSDCountry: FranceProject: EC | HaS-DARIAH (675570)
International audience; Le web sémantique et l'ouverture des données (publications, archives, référentiels) dans les sciences humaines et sociales (SHS) ont permis la création outils de recherche nouveaux permettant à la fois de créer des portails documentaires (autour de moteurs de recherche sémantiques) et des applications pouvant être embarquées dans des sites web. ISIDORE, l'accès unifié aux données, publications et informations des SHS entre dans cette catégorie. Lancé en 2010 par le CNRS et actuellement développé par Huma-Num, ISIDORE propose, outre un moteur de recherche sur le web, une collection d'applications permettant de l'utiliser de façon « embarqué » pour des usages aux plus près des besoins des enseignants-chercheurs et des étudiants.
223 Research products, page 1 of 23
Loading
- Publication . 2021Open Access EnglishAuthors:Bowers, Jack; Herold, Axel; Romary, Laurent; Tasovac, Toma;Bowers, Jack; Herold, Axel; Romary, Laurent; Tasovac, Toma;Publisher: HAL CCSDCountry: France
The present paper describes the etymological component of the TEI Lex-0 initiative which aims at defining a terser subset of the TEI guidelines for the representation of etymological features in dictionary entries. Going beyond the basic provision of etymological mechanisms in the TEI guidelines, TEI Lex-0 Etym proposes a systematic representation of etymological and cognate descriptions by means of embedded constructs based on the (for etymologies) and (for etymons and cognates) elements. In particular, given that all the potential contents of etymons are highly analogous to those of dictionary entries in general, the contents presented herein heavily re-use many of the corresponding features and constraints introduced in other components of the TEI Lex-0 to the encoding of etymologies and etymons. The TEI Lex-0 Etym model is also closely aligned to ISO 24613-3 on modelling etymological data and the corresponding TEI serialisation available in ISO 24613-4.
- Publication . Other literature type . Part of book or chapter of book . Book . 2020Open Access EnglishAuthors:Edmond, Jennifer; Romary, Laurent;Edmond, Jennifer; Romary, Laurent;Publisher: Open Book PublishersCountry: France
Introduction The scholarly monograph has been compared to the Hapsburg monarchy in that it seems to have been in decline forever! It was in 2002 that Stephen Greenblatt, in his role as president of the US Modern Language Association, urged his membership to recognise what he called a ‘crisis in scholarly publication’. It is easy to forget now that this crisis, as he then saw it, had nothing to do with the rise of digital technologies, e-publishing, or open access. Indeed, it puts his words in...
- Publication . 2019Open AccessAuthors:Angela Cossu;Angela Cossu;Country: France
International audience
- Publication . 2013Open Access FrenchAuthors:Savonnet, Marinette;Savonnet, Marinette;Publisher: HAL CCSDCountry: France
Les Systèmes d'Information Scientifique (SIS) sont des Systèmes d'Information (SI) dont le but est de produire de la connaissance et non pas de gérer ou contrôler une activité de production de biens ou de services comme les SI d'entreprise. Les SIS se caractérisent par des domaines de recherche fortement collaboratifs impliquant des équipes pluridisciplinaires et le plus souvent géographiquement éloignées, ils manipulent des données aux structures très variables dans le temps qui vont au-delà de la simple hétérogénéité : nuages de points issus de scanner 3D, modèles numériques de terrain, cartographie, publications, données issues de spectromètre de masse ou de technique de thermoluminescence, données attributaires en très grand volume, etc. Ainsi, contrairement aux bases de données d'entreprise qui sont modélisées avec des structures établies par l'activité qu'elles supportent, les données scientifiques ne peuvent pas se contenter de schémas de données pré-definis puisque la structure des données évolue rapidement de concert avec l'évolution de la connaissance. La gestion de données scientifiques nécessite une architecture de SIS ayant un niveau d'extensibilité plus élevé que dans un SI d'entreprise. Afin de supporter l'extensibilité tout en contrôlant la qualité des données mais aussi l'interopérabilité, nous proposons une architecture de SIS reposant sur : - des données référentielles fortement structurées, identifiables lors de la phase d'analyse et amenées à évoluer rarement ; - des données complémentaires multi-modèles (matricielles, cartographiques, nuages de points 3D, documentaires, etc.). Pour établir les liens entre les données complémentaires et les données référentielles, nous avons utilisé un unique paradigme, l'annotation sémantique. Nous avons proposé un modèle formel d'annotation à base ontologique pour construire des annotations sémantiques dont la cohérence et la consistance peuvent être contrôlées par une ontologie et des règles. Dans ce cadre, les annotations offrent ainsi une contextualisation des données qui permet de vérifier leur cohérence, par rapport à la connaissance du domaine. Nous avons dressé les grandes lignes d'une sémantique du processus d'annotation par analogie avec la sémantique des langages de programmation. Nous avons validé notre proposition, à travers deux collaborations pluridisciplinaires : - le projet ANR CARE (Corpus Architecturae Religiosae Europeae - IV-X saec. ANR-07- CORP-011) dans le domaine de l'archéologie. Son objectif était de développer un corpus numérique de documents multimédia sur l'évolution des monuments religieux du IVe au XIe siècle (http://care.tge-adonis.fr). Un assistant d'annotation a été développé pour assurer la qualité des annotations par rapport à la connaissance représentée dans l'ontologie. Ce projet a donné lieu au développement d'une extension sémantique pour MediaWiki ; - le projet eClims dans le domaine de la protéomique clinique. eClims est un composant clinique d'un LIMS (Laboratory Information Management System) développé pour la plate-forme de protéomique CLIPP. eClims met en oeuvre un outil d'intégration basé sur le couplage entre des modèles représentant les sources et le système protéomique, et des ontologies utilisées comme médiatrices entre ces derniers. Les différents contrôles que nous mettons en place garantissent la validité des domaines de valeurs, la complétude, la consistance des données et leur cohérence. Le stockage des annotations est assuré par une Base de Données orientées colonnes associée à une Base de Données relationnelles.
- Publication . Article . Conference object . Preprint . 2016Open Access EnglishAuthors:Grefenstette, Gregory; Muchemi, Lawrence;Grefenstette, Gregory; Muchemi, Lawrence;Country: France
International audience; Current research in lifelog data has not paid enough attention to analysis of cognitive activities in comparison to physical activities. We argue that as we look into the future, wearable devices are going to be cheaper and more prevalent and textual data will play a more significant role. Data captured by lifelogging devices will increasingly include speech and text, potentially useful in analysis of intellectual activities. Analyzing what a person hears, reads, and sees, we should be able to measure the extent of cognitive activity devoted to a certain topic or subject by a learner. Test-based lifelog records can benefit from semantic analysis tools developed for natural language processing. We show how semantic analysis of such text data can be achieved through the use of taxonomic subject facets and how these facets might be useful in quantifying cognitive activity devoted to various topics in a person's day. We are currently developing a method to automatically create taxonomic topic vocabularies that can be applied to this detection of intellectual activity.
Average popularityAverage popularity In bottom 99%Average influencePopularity: Citation-based measure reflecting the current impact.Average influence In bottom 99%Influence: Citation-based measure reflecting the total impact.add Add to ORCIDPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product. - Publication . Other literature type . Conference object . 2015Open Access EnglishAuthors:Boukhelifa, Nadia; Giannisakis, Emmanouil; Dimara, Evanthia; Willett, Wesley; Fekete, Jean-Daniel;Boukhelifa, Nadia; Giannisakis, Emmanouil; Dimara, Evanthia; Willett, Wesley; Fekete, Jean-Daniel;Publisher: HAL CCSDCountry: FranceProject: EC | CENDARI (284432)
International audience; In this paper we describe the development and evaluation of a visual analytics tool to support historical research. Historians continuously gather data related to their scholarly research from archival visits and background search. Organising and making sense of all this data can be challenging as many historians continue to rely on analog or basic digital tools. We built an integrated note-taking environment for historians which unifies a set of func-tionalities we identified as important for historical research including editing, tagging, searching, sharing and visualization. Our approach was to involve users from the initial stage of brainstorming and requirement analysis through to design, implementation and evaluation. We report on the process and results of our work, and conclude by reflecting on our own experience in conducting user-centered visual analytics design for digital humanities.
- Publication . Conference object . 2012EnglishAuthors:Bel, Bernard;Bel, Bernard;Publisher: HAL CCSDCountry: FranceProject: ANR | ORTOLANG (ANR-11-EQPX-0032)
In 2008, a pilot project initiated by TGE Adonis, a large research infrastructure, brought together designers of data repositories, archivists and system engineers to set up collaborative oral/linguistic resource centres in France. This paper discusses challenging issues addressed by this team when implementing an Open Archival Information System (OAIS) bundled with an institutional archive. After the completion of the pilot project, the Speech & Language Data Repository (SLDR) underwent development for the systematic management of access rights in compliance with the French Heritage code. Its framework claims to be applicable to other systems worldwide, which would facilitate interoperability between protected repositories equipped with transfer of authentication techniques (Single Sign-On).
- Publication . Article . Other literature type . Conference object . 2020Open AccessAuthors:Stefan Bornhofen; Marten Düring;Stefan Bornhofen; Marten Düring;Publisher: Springer Science and Business Media LLCCountry: FranceProject: ANR | BLIZAAR (ANR-15-CE23-0002)
AbstractThe paper presents Intergraph, a graph-based visual analytics technical demonstrator for the exploration and study of content in historical document collections. The designed prototype is motivated by a practical use case on a corpus of circa 15.000 digitized resources about European integration since 1945. The corpus allowed generating a dynamic multilayer network which represents different kinds of named entities appearing and co-appearing in the collections. To our knowledge, Intergraph is one of the first interactive tools to visualize dynamic multilayer graphs for collections of digitized historical sources. Graph visualization and interaction methods have been designed based on user requirements for content exploration by non-technical users without a strong background in network science, and to compensate for common flaws with the annotation of named entities. Users work with self-selected subsets of the overall data by interacting with a scene of small graphs which can be added, altered and compared. This allows an interest-driven navigation in the corpus and the discovery of the interconnections of its entities across time.
Average popularityAverage popularity In bottom 99%Average influencePopularity: Citation-based measure reflecting the current impact.Average influence In bottom 99%Influence: Citation-based measure reflecting the total impact.add Add to ORCIDPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product. - Publication . Report . 2019EnglishAuthors:Szprot, Jakub; Arpagaus, Brigitte; Ciula, Arianna; Clivaz, Claire; Gabay, Simon; Honegger, Matthieu; Hughes, Lorna; Immenhauser, Beat; Jakeman, Neil; Lhotak, Martin; +8 moreSzprot, Jakub; Arpagaus, Brigitte; Ciula, Arianna; Clivaz, Claire; Gabay, Simon; Honegger, Matthieu; Hughes, Lorna; Immenhauser, Beat; Jakeman, Neil; Lhotak, Martin; Romanova, Natasha; Ros, Salvador; Schulthess, Sara; Tahko, Tuuli; Tolonen, Mikko; Erdinast Vulcan, Daphna; Willa, Pierre; Zehavi, Ora;Publisher: HAL CCSDCountry: FranceProject: EC | DESIR (731081)
This report provides information about activities and progress towards establishing DARIAH membership in six countries: the Czech Republic, Finland, Israel, Spain, Switzerland, and the UK, which took place between July and December 2019. Previous activities were described in detail in the D3.2 - Regularly Monitor Country-Specific Progress in Enabling New DARIAH Membership. During the project lifetime, the Czech Republic joined DARIAH ERIC; in other countries, collaboration with DARIAH has been greatly strengthened and significant progress regarding DARIAH membership has been achieved. The report also outlines the next steps in the accession processes, building on the results of the DESIR project.
- FrenchAuthors:Pouyllau, Stéphane;Pouyllau, Stéphane;Publisher: HAL CCSDCountry: FranceProject: EC | HaS-DARIAH (675570)
International audience; Le web sémantique et l'ouverture des données (publications, archives, référentiels) dans les sciences humaines et sociales (SHS) ont permis la création outils de recherche nouveaux permettant à la fois de créer des portails documentaires (autour de moteurs de recherche sémantiques) et des applications pouvant être embarquées dans des sites web. ISIDORE, l'accès unifié aux données, publications et informations des SHS entre dans cette catégorie. Lancé en 2010 par le CNRS et actuellement développé par Huma-Num, ISIDORE propose, outre un moteur de recherche sur le web, une collection d'applications permettant de l'utiliser de façon « embarqué » pour des usages aux plus près des besoins des enseignants-chercheurs et des étudiants.