Advanced search in Research products
Research products
arrow_drop_down
Searching FieldsTerms
Any field
arrow_drop_down
includes
arrow_drop_down
Include:
The following results are related to DARIAH EU. Are you interested to view more results? Visit OpenAIRE - Explore.
20 Research products, page 1 of 2

  • DARIAH EU
  • Publications
  • 2017-2021
  • Other literature type
  • Report
  • European Commission
  • EU
  • Hyper Article en Ligne

10
arrow_drop_down
Date (most recent)
arrow_drop_down
  • Publication . Report . 2020
    English
    Authors: 
    Bertrand, Loïc; Anglos, Demetrios; Castillejo, Marta; Charbonnel, Bénédicte; David, Sophie; de Clercq, Hilde; Dubray, Fanny; Spring, Marika;
    Publisher: HAL CCSD
    Country: France
    Project: EC | E-RIHS PP (739503)

    The study and preservation of tangible cultural and natural heritage is a global challenge for science and society at large. The European Research Infrastructure for Heritage Science (E-RIHS) will play a leading role in research on the interpretation, preservation, documentation and management of heritage. As an interdisciplinary infrastructure, E-RIHS will interconnect knowledge and methodologies to address key scientific questions in the field of heritage as a whole. The infrastructure is built on ten core pillars. It will provide a structured and unified input of large-scale instruments, portable devices, physical and digital archives. Its implementation will focus on scientific excellence, interdisciplinarity and cooperation. In doing so, it will offer unprecedented research opportunities to a wide range of interdisciplinary scientific communities.

  • Publication . Preprint . Conference object . Contribution for newspaper or weekly magazine . Article . 2020
    Open Access English
    Authors: 
    Rehm, Georg; Marheinecke, Katrin; Hegele, Stefanie; Piperidis, Stelios; Bontcheva, Kalina; Hajic, Jan; Choukri, Khalid; Vasiljevs, Andrejs; Backfried, Gerhard; Prinz, Christoph; +37 more
    Publisher: European Language Resources Association
    Countries: Denmark, France
    Project: EC | ELG (825627), EC | BDVe (732630), SFI | ADAPT: Centre for Digital... (13/RC/2106), EC | AI4EU (825619), FCT | PINFRA/22117/2016 (PINFRA/22117/2016), EC | X5gon (761758)

    Multilingualism is a cultural cornerstone of Europe and firmly anchored in the European treaties including full language equality. However, language barriers impacting business, cross-lingual and cross-cultural communication are still omnipresent. Language Technologies (LTs) are a powerful means to break down these barriers. While the last decade has seen various initiatives that created a multitude of approaches and technologies tailored to Europe's specific needs, there is still an immense level of fragmentation. At the same time, AI has become an increasingly important concept in the European Information and Communication Technology area. For a few years now, AI, including many opportunities, synergies but also misconceptions, has been overshadowing every other topic. We present an overview of the European LT landscape, describing funding programmes, activities, actions and challenges in the different countries with regard to LT, including the current state of play in industry and the LT market. We present a brief overview of the main LT-related activities on the EU level in the last ten years and develop strategic guidance with regard to four key dimensions. Proceedings of the 12th Language Resources and Evaluation Conference (LREC 2020). To appear

  • Open Access English
    Authors: 
    Khan, Fahad; Romary, Laurent; Salgado, Ana; Bowers, Jack; Khemakhem, Mohamed; Tasovac, Toma;
    Publisher: HAL CCSD
    Country: France
    Project: EC | ELEXIS (731015)

    Due to COVID19 pandemic, the 12th edition is cancelled. The LREC 2020 Proceedings are available at http://www.lrec-conf.org/proceedings/lrec2020/index.html; International audience; In this article, we will introduce two of the new parts of the new multi-part version of the Lexical Markup Framework (LMF) ISO standard, namely Part 3 of the standard (ISO 24613-3), which deals with etymological and diachronic data, and Part 4 (ISO 24613-4), which consists of a TEI serialisation of all of the prior parts of the model. We will demonstrate the use of both standards by describing the LMF encoding of a small number of examples taken from a sample conversion of the reference Portuguese dictionary Grande Dicionário Houaiss da Língua Portuguesa, part of a broader experiment comprising the analysis of different, heterogeneously encoded, Portuguese lexical resources. We present the examples in the Unified Modelling Language (UML) and also in a couple of cases in TEI.

  • Publication . Report . 2019
    Open Access English
    Authors: 
    Raciti, Marco; Chambers, Sally;
    Publisher: HAL CCSD
    Country: France
    Project: EC | DESIR (731081)

    The DESIR Final Event took place in Zagreb on 6-7 November 2019, alongside the DARIAH General Assembly. The DESIR Final Event brought together the whole consortium and involved several Bodies of the ERIC to maximise the impact of the event. The aim of the meeting was two-fold. First, it was to opportunity to summarise the activities conducted throughout the project and showcase results towards the DARIAH community. Secondly, it proposed a wider discussion over sustainability and funding models with National Representatives prior to the General Assembly.Overall, 57 participants attended the event. The schedule has been structured to present the outcomes of each Work Package. As the project DESIR was divided into an evolving 6-dimensional process, main results can be summarised as follows:Dissemination and Innovation: the aim of this Work Package was to ensure a continuous dissemination of DARIAH service improvements, activities and research results within the relevant communities, academic stakeholders and other interested parties. Furthermore, the Work Package was in charge of organising main dissemination activities during the project and the DARIAH Beyond Europe workshop series (Stanford University, Library of Congress, National Library of Australia).Growth: this Work Package was concerned with the enlargement of the DARIAH membership targeting six countries (Czech Republic, Finland, Israel, Spain, Switzerland, United Kingdom). Membership could only be achieved in Czech Republic, the application was submitted and ratified by the General Assembly in November 2019. Nevertheless, in other countries, collaboration with DARIAH has been greatly strengthened and significant progress regarding DARIAH membership has been achieved.Technology: the Work Package delivered three demonstrators focused on bibliographical metadata. The demonstrators show the usage of tools for bibliographical metadata in various stages of the research process, e.g. extraction of entities, the collection and sorting of citations, visualisation of selected aspects of the data. These demonstrators were built with the involvement of the community of researchers through two Code Sprints organised in Berlin.Robustness: the aim of this Work Package was to strengthen the organisational structure of DARIAH. The activities undertaken had a multifaceted nature: the relationship with the user community was improved through the implementation of a centralised helpdesk; preliminary work towards the Marketplace was conducted; the EURISE Network (European Research Infrastructure Software Engineers' Network) was established with CLARIN and CESSDA; finally a renewed business plan building on existing strategic documents was developed.Trust: this Work Package contributed to DARIAH’s long-term sustainability by measuring acceptance and impact of DARIAH in new cross-disciplinary communities and core groups through a survey, in-depth qualitative interviews and strategic meetings. The analysedinformation collected from each source led to a recommendations and community engagement tool developed to share and translate the recommendations dataset for a broader audience.Teaching: the work carried out during the course of the project led to the implementation of a discovery framework and hosting platform for DARIAH learning resources, DARIAH-Campus. The platform was officially launched during the event. The goal of DARIAH-Campus is to widen access to open, inclusive, high-quality learning materials that aim to enhance creativity, skills, technology and knowledge in the digitally-enabled arts and humanities.

  • Publication . Other literature type . Conference object . 2019
    Open Access English
    Authors: 
    Marlet , Olivier; Francart, Thomas; Markhoff, Béatrice; Rodier, Xavier;
    Publisher: HAL CCSD
    Country: France
    Project: EC | ARIADNEplus (823914)

    International audience; CIDOC CRM is an ontology intended to facilitate the integration, mediation and interchange of heterogeneous cultural heritage information. The Semantic Web with its Linked Open Data cloud enables scholars and cultural institutions to publish their data in RDF, using CIDOC CRM as an interlingua that enables a semantically consistent re-interpretation of their data. Nowadays more and more projects have done the task of mapping legacy datasets to CIDOC CRM, and successful Extract-Transform-Load data-integration processes have been performed in this way. A next step is enabling people and applications to actually dynamically explore autonomous datasets using the semantic mediation offered by CIDOC CRM. This is the purpose of OpenArchaeo, a tool for querying archaeological datasets on the LOD cloud. We present its main features: the principles behind its user friendly query interface and its SPARQL Endpoint for programs, together with its overall architecture designed to be extendable and scalable, for handling transparent interconnections with evolving distributed sources while achieving good efficiency.

  • English
    Authors: 
    Romary, Laurent; Biabiany, Damien; Klaus Illmayer; Puren, Marie; Riondet, Charles; Seillier, Dorian; Tadjou, Lionel;
    Publisher: HAL CCSD
    Country: France
    Project: EC | PARTHENOS (654119)

    International audience

  • Open Access English
    Authors: 
    Raciti, Marco; Gabay, Simon; Moranville, Yoann; Jorge, Maria Do Rosário; Fernandes, João;
    Publisher: HAL CCSD
    Country: France
    Project: EC | DESIR (731081)

    International audience; Europe has a long and rich tradition as a centre of research and teaching in the arts and humanities. However, the huge digital transformation that affects the arts and humanities research landscape all over the world requires that we set up sustainable research infrastructures, new and refined techniques, state-of-the-art methods and an expanded skills base. Responding to these challenges, the Digital Research Infrastructure for Arts and Humanities (DARIAH) was launched as a pan-European network and research infrastructure. After expansion and consolidation, which involved DARIAH’s inclusion in the ESFRI roadmap, DARIAH became a European Research Infrastructure Consortium (ERIC) in 2014. The Horizon 2020 funded project DESIR (DARIAH ERIC Sustainability Refined) sets out to strengthen the sustainability of DARIAH and help establish it as a reliable long-term partner within our communities. Sustaining existing digital expertise, tools, resources in Europe in the context of DESIR involves a goal-oriented set of measures in order to first, maintain, expand and develop DARIAH in its capacities as an organisation and technical research infrastructure; secondly, to engage its members further, as well as measure and increase their trust in DARIAH; thirdly, to expand the network in order to integrate new regions and communities. The DESIR consortium is composed of core DARIAH members, representatives from potential new DARIAH members and external technical experts. The sustainability of a research infrastructure is the capacity to remain operative, effective and competitive over its expected lifetime. In DESIR, this definition is translated into an evolving 6-dimensional process, divided into the following challenges:•Dissemination•Growth•Technology•Robustness•Trust•EducationWith our poster, we would like to show how the project helps sustaining DARIAH. Within DESIR, dissemination is the ability to communicate DARIAH’s strategy and benefits effectively within the DARIAH community and in new areas, spreading out to new communities. Through the international workshops held at Stanford University and at the Library of Congress, DARIAH has been introduced to many non-European DH scholars. These events were an important first step to foster international cooperation between US and European colleagues as well as a catalyst for ongoing collaborations in the future. A third workshop took place in Canberra at the Australian Research Data Commons in March 2019.DARIAH has currently 17 members from all over Europe. Nevertheless, efforts should be made to include as many countries as possible to bring in and scale, to a European level, even more state-of-the-art DH activities.Six candidates ready for building strong national consortia have been identified, enabling a substantial expansion of DARIAH’s country coverage. Additionally, thematic workshops are organised in each country as well as tailored training measures.DESIR widens the research infrastructure in core areas which are vital for DARIAH’s sustainability but are not yet covered by the existing set-up. As DARIAH expands across Europe, continuously enhancing and further developing the ERIC exceeds DARIAH’s internal technological capacities. Two notable results were achieved so far: firstly, the publication of a technical reference as a result of a workshop organised in October 2017 with CESSDA and CLARIN. It’s a collection of basic guidelines and references for development and maintenance of infrastructure services within DARIAH and beyond, addressing an ongoing issue for research infrastructures, namely software sustainability. Secondly, the organisation of a Code Sprint, focusing on bibliographical and citation metadata, which helped shaping DARIAH’s profile in four technology areas (visualisation, text analytic services, entity-based search and scholarly content management). Another Code sprint is expected to take place in Summer 2019.Another output is the implementation of a centralized helpdesk. This helpdesk is hosted by CLARIN-D and the solution of integration within the existing DARIAH website was the creation of a WordPress plugin. This plugin is used to connect our website with the OTRS server and allows the creation of issues easily by users unfamiliar with OTRS.Sustaining a research infrastructure involves also two important aspects: trust and education. For DARIAH, it is crucial to increase trust and confidence from its users. In DESIR we develop recommendations and strategies accordingly, targeting new cross-disciplinary communities, based on the results of a survey and interviews addressed to the scientific community, with different levels of approach - national, institutional and individual.In addition, education is a key area and the project contributes to the ongoing discussions about the role and modalities of training and education in the development, consolidation and sustainability of digital research infrastructures. We believe that investing time and efforts into training and educating users is a way of securing the social sustainability of a research infrastructure.

  • English
    Authors: 
    Puren, Marie; Vernus, Pierre;
    Publisher: HAL CCSD
    Country: France
    Project: EC | SILKNOW (769504)

    International audience

  • Publication . Part of book or chapter of book . 2019
    Open Access English
    Authors: 
    Gelati, Francesco;
    Publisher: HAL CCSD
    Project: EC | EHRI (654164)

    The European Holocaust Research Infrastructure (EHRI) portal website aims to aggregate digitally available archival descriptions concerning the Holocaust. This portal is actually a meta-catalogue, or an information aggregator, whose biggest goal is to have up-to-date information by means of building sustainable data pipelines between EHRI and its content providers. Just like in similar archival information aggregators (e.g. Archives Portal Europe or Monasterium), the XML-based metadata standard Encoded Archival Description (EAD) plays a key role. The article presents how EADs are imported into the portal, mainly thanks to the Open Archive Initiative protocols.

  • Open Access
    Authors: 
    Romary, Laurent; Tasovac, Toma;
    Publisher: Zenodo
    Country: France
    Project: EC | ELEXIS (731015)

    International audience; Achieving consistent encoding within a given community of practice has been a recurrent issue for the TEI Guidelines. The topic is of particular importance for lexical data if we think of the potential wealth of content we could gain from pooling together the information available in the variety of highly structured, historical and contemporary lexical resources. Still, the encoding possibilities offered by the Dictionaries Chapter in the Guidelines are too numerous and too flexible to guarantee sufficient interoperability and a coherent model for searching, visualising or enriching multiple lexical resources.Following the spirit of TEI Analytics [Zillig, 2009], developed in the context of the MONK project, TEI Lex-0 aims at establishing a target format to facilitate the interoperability of heterogeneously encoded lexical resources. This is important both in the context of building lexical infrastructures as such [Ermolaev and Tasovac, 2012] and in the context of developing generic TEI-aware tools such as dictionary viewers and profilers. The format itself should not necessarily be one which is used for editing or managing individual resources, but one to which they can be univocally transformed to be queried, visualised, or mined in a uniform way. We are also aiming to stay as aligned as possible with the TEI subset developed in conjunction with the revision of the ISO LMF (Lexical Markup Framework) standard so that coherent design guidelines can be provided to the community (cf. [Romary, 2015]).The paper will provide an overview of the various domains covered by TEI Lex- 0 and the main decisions that were taken over the last 18 months: constraining the general structure of a lexical entry; offering mechanisms to overcome the limits of when used in retro-digitized dictionaries (by allowing, for instance, and as children of ); systematizing the representation of morpho-syntactic information [Bański et al., 2017]; providing a strict -based encoding of sense-related information; deprecating ; dealing with internal and external references in dictionary entries, providing more advanced encodings of etymology (see submission by Bowers, Herold and Romary); as well as defining technical constraints on the systematic use of @xml:id at different levels of the dictionary microstructure. The activity of the group has already lead to changes in the Guidelines in response to specific GitHub tickets.

Advanced search in Research products
Research products
arrow_drop_down
Searching FieldsTerms
Any field
arrow_drop_down
includes
arrow_drop_down
Include:
The following results are related to DARIAH EU. Are you interested to view more results? Visit OpenAIRE - Explore.
20 Research products, page 1 of 2
  • Publication . Report . 2020
    English
    Authors: 
    Bertrand, Loïc; Anglos, Demetrios; Castillejo, Marta; Charbonnel, Bénédicte; David, Sophie; de Clercq, Hilde; Dubray, Fanny; Spring, Marika;
    Publisher: HAL CCSD
    Country: France
    Project: EC | E-RIHS PP (739503)

    The study and preservation of tangible cultural and natural heritage is a global challenge for science and society at large. The European Research Infrastructure for Heritage Science (E-RIHS) will play a leading role in research on the interpretation, preservation, documentation and management of heritage. As an interdisciplinary infrastructure, E-RIHS will interconnect knowledge and methodologies to address key scientific questions in the field of heritage as a whole. The infrastructure is built on ten core pillars. It will provide a structured and unified input of large-scale instruments, portable devices, physical and digital archives. Its implementation will focus on scientific excellence, interdisciplinarity and cooperation. In doing so, it will offer unprecedented research opportunities to a wide range of interdisciplinary scientific communities.

  • Publication . Preprint . Conference object . Contribution for newspaper or weekly magazine . Article . 2020
    Open Access English
    Authors: 
    Rehm, Georg; Marheinecke, Katrin; Hegele, Stefanie; Piperidis, Stelios; Bontcheva, Kalina; Hajic, Jan; Choukri, Khalid; Vasiljevs, Andrejs; Backfried, Gerhard; Prinz, Christoph; +37 more
    Publisher: European Language Resources Association
    Countries: Denmark, France
    Project: EC | ELG (825627), EC | BDVe (732630), SFI | ADAPT: Centre for Digital... (13/RC/2106), EC | AI4EU (825619), FCT | PINFRA/22117/2016 (PINFRA/22117/2016), EC | X5gon (761758)

    Multilingualism is a cultural cornerstone of Europe and firmly anchored in the European treaties including full language equality. However, language barriers impacting business, cross-lingual and cross-cultural communication are still omnipresent. Language Technologies (LTs) are a powerful means to break down these barriers. While the last decade has seen various initiatives that created a multitude of approaches and technologies tailored to Europe's specific needs, there is still an immense level of fragmentation. At the same time, AI has become an increasingly important concept in the European Information and Communication Technology area. For a few years now, AI, including many opportunities, synergies but also misconceptions, has been overshadowing every other topic. We present an overview of the European LT landscape, describing funding programmes, activities, actions and challenges in the different countries with regard to LT, including the current state of play in industry and the LT market. We present a brief overview of the main LT-related activities on the EU level in the last ten years and develop strategic guidance with regard to four key dimensions. Proceedings of the 12th Language Resources and Evaluation Conference (LREC 2020). To appear

  • Open Access English
    Authors: 
    Khan, Fahad; Romary, Laurent; Salgado, Ana; Bowers, Jack; Khemakhem, Mohamed; Tasovac, Toma;
    Publisher: HAL CCSD
    Country: France
    Project: EC | ELEXIS (731015)

    Due to COVID19 pandemic, the 12th edition is cancelled. The LREC 2020 Proceedings are available at http://www.lrec-conf.org/proceedings/lrec2020/index.html; International audience; In this article, we will introduce two of the new parts of the new multi-part version of the Lexical Markup Framework (LMF) ISO standard, namely Part 3 of the standard (ISO 24613-3), which deals with etymological and diachronic data, and Part 4 (ISO 24613-4), which consists of a TEI serialisation of all of the prior parts of the model. We will demonstrate the use of both standards by describing the LMF encoding of a small number of examples taken from a sample conversion of the reference Portuguese dictionary Grande Dicionário Houaiss da Língua Portuguesa, part of a broader experiment comprising the analysis of different, heterogeneously encoded, Portuguese lexical resources. We present the examples in the Unified Modelling Language (UML) and also in a couple of cases in TEI.

  • Publication . Report . 2019
    Open Access English
    Authors: 
    Raciti, Marco; Chambers, Sally;
    Publisher: HAL CCSD
    Country: France
    Project: EC | DESIR (731081)

    The DESIR Final Event took place in Zagreb on 6-7 November 2019, alongside the DARIAH General Assembly. The DESIR Final Event brought together the whole consortium and involved several Bodies of the ERIC to maximise the impact of the event. The aim of the meeting was two-fold. First, it was to opportunity to summarise the activities conducted throughout the project and showcase results towards the DARIAH community. Secondly, it proposed a wider discussion over sustainability and funding models with National Representatives prior to the General Assembly.Overall, 57 participants attended the event. The schedule has been structured to present the outcomes of each Work Package. As the project DESIR was divided into an evolving 6-dimensional process, main results can be summarised as follows:Dissemination and Innovation: the aim of this Work Package was to ensure a continuous dissemination of DARIAH service improvements, activities and research results within the relevant communities, academic stakeholders and other interested parties. Furthermore, the Work Package was in charge of organising main dissemination activities during the project and the DARIAH Beyond Europe workshop series (Stanford University, Library of Congress, National Library of Australia).Growth: this Work Package was concerned with the enlargement of the DARIAH membership targeting six countries (Czech Republic, Finland, Israel, Spain, Switzerland, United Kingdom). Membership could only be achieved in Czech Republic, the application was submitted and ratified by the General Assembly in November 2019. Nevertheless, in other countries, collaboration with DARIAH has been greatly strengthened and significant progress regarding DARIAH membership has been achieved.Technology: the Work Package delivered three demonstrators focused on bibliographical metadata. The demonstrators show the usage of tools for bibliographical metadata in various stages of the research process, e.g. extraction of entities, the collection and sorting of citations, visualisation of selected aspects of the data. These demonstrators were built with the involvement of the community of researchers through two Code Sprints organised in Berlin.Robustness: the aim of this Work Package was to strengthen the organisational structure of DARIAH. The activities undertaken had a multifaceted nature: the relationship with the user community was improved through the implementation of a centralised helpdesk; preliminary work towards the Marketplace was conducted; the EURISE Network (European Research Infrastructure Software Engineers' Network) was established with CLARIN and CESSDA; finally a renewed business plan building on existing strategic documents was developed.Trust: this Work Package contributed to DARIAH’s long-term sustainability by measuring acceptance and impact of DARIAH in new cross-disciplinary communities and core groups through a survey, in-depth qualitative interviews and strategic meetings. The analysedinformation collected from each source led to a recommendations and community engagement tool developed to share and translate the recommendations dataset for a broader audience.Teaching: the work carried out during the course of the project led to the implementation of a discovery framework and hosting platform for DARIAH learning resources, DARIAH-Campus. The platform was officially launched during the event. The goal of DARIAH-Campus is to widen access to open, inclusive, high-quality learning materials that aim to enhance creativity, skills, technology and knowledge in the digitally-enabled arts and humanities.

  • Publication . Other literature type . Conference object . 2019
    Open Access English
    Authors: 
    Marlet , Olivier; Francart, Thomas; Markhoff, Béatrice; Rodier, Xavier;
    Publisher: HAL CCSD
    Country: France
    Project: EC | ARIADNEplus (823914)

    International audience; CIDOC CRM is an ontology intended to facilitate the integration, mediation and interchange of heterogeneous cultural heritage information. The Semantic Web with its Linked Open Data cloud enables scholars and cultural institutions to publish their data in RDF, using CIDOC CRM as an interlingua that enables a semantically consistent re-interpretation of their data. Nowadays more and more projects have done the task of mapping legacy datasets to CIDOC CRM, and successful Extract-Transform-Load data-integration processes have been performed in this way. A next step is enabling people and applications to actually dynamically explore autonomous datasets using the semantic mediation offered by CIDOC CRM. This is the purpose of OpenArchaeo, a tool for querying archaeological datasets on the LOD cloud. We present its main features: the principles behind its user friendly query interface and its SPARQL Endpoint for programs, together with its overall architecture designed to be extendable and scalable, for handling transparent interconnections with evolving distributed sources while achieving good efficiency.

  • English
    Authors: 
    Romary, Laurent; Biabiany, Damien; Klaus Illmayer; Puren, Marie; Riondet, Charles; Seillier, Dorian; Tadjou, Lionel;
    Publisher: HAL CCSD
    Country: France
    Project: EC | PARTHENOS (654119)

    International audience

  • Open Access English
    Authors: 
    Raciti, Marco; Gabay, Simon; Moranville, Yoann; Jorge, Maria Do Rosário; Fernandes, João;
    Publisher: HAL CCSD
    Country: France
    Project: EC | DESIR (731081)

    International audience; Europe has a long and rich tradition as a centre of research and teaching in the arts and humanities. However, the huge digital transformation that affects the arts and humanities research landscape all over the world requires that we set up sustainable research infrastructures, new and refined techniques, state-of-the-art methods and an expanded skills base. Responding to these challenges, the Digital Research Infrastructure for Arts and Humanities (DARIAH) was launched as a pan-European network and research infrastructure. After expansion and consolidation, which involved DARIAH’s inclusion in the ESFRI roadmap, DARIAH became a European Research Infrastructure Consortium (ERIC) in 2014. The Horizon 2020 funded project DESIR (DARIAH ERIC Sustainability Refined) sets out to strengthen the sustainability of DARIAH and help establish it as a reliable long-term partner within our communities. Sustaining existing digital expertise, tools, resources in Europe in the context of DESIR involves a goal-oriented set of measures in order to first, maintain, expand and develop DARIAH in its capacities as an organisation and technical research infrastructure; secondly, to engage its members further, as well as measure and increase their trust in DARIAH; thirdly, to expand the network in order to integrate new regions and communities. The DESIR consortium is composed of core DARIAH members, representatives from potential new DARIAH members and external technical experts. The sustainability of a research infrastructure is the capacity to remain operative, effective and competitive over its expected lifetime. In DESIR, this definition is translated into an evolving 6-dimensional process, divided into the following challenges:•Dissemination•Growth•Technology•Robustness•Trust•EducationWith our poster, we would like to show how the project helps sustaining DARIAH. Within DESIR, dissemination is the ability to communicate DARIAH’s strategy and benefits effectively within the DARIAH community and in new areas, spreading out to new communities. Through the international workshops held at Stanford University and at the Library of Congress, DARIAH has been introduced to many non-European DH scholars. These events were an important first step to foster international cooperation between US and European colleagues as well as a catalyst for ongoing collaborations in the future. A third workshop took place in Canberra at the Australian Research Data Commons in March 2019.DARIAH has currently 17 members from all over Europe. Nevertheless, efforts should be made to include as many countries as possible to bring in and scale, to a European level, even more state-of-the-art DH activities.Six candidates ready for building strong national consortia have been identified, enabling a substantial expansion of DARIAH’s country coverage. Additionally, thematic workshops are organised in each country as well as tailored training measures.DESIR widens the research infrastructure in core areas which are vital for DARIAH’s sustainability but are not yet covered by the existing set-up. As DARIAH expands across Europe, continuously enhancing and further developing the ERIC exceeds DARIAH’s internal technological capacities. Two notable results were achieved so far: firstly, the publication of a technical reference as a result of a workshop organised in October 2017 with CESSDA and CLARIN. It’s a collection of basic guidelines and references for development and maintenance of infrastructure services within DARIAH and beyond, addressing an ongoing issue for research infrastructures, namely software sustainability. Secondly, the organisation of a Code Sprint, focusing on bibliographical and citation metadata, which helped shaping DARIAH’s profile in four technology areas (visualisation, text analytic services, entity-based search and scholarly content management). Another Code sprint is expected to take place in Summer 2019.Another output is the implementation of a centralized helpdesk. This helpdesk is hosted by CLARIN-D and the solution of integration within the existing DARIAH website was the creation of a WordPress plugin. This plugin is used to connect our website with the OTRS server and allows the creation of issues easily by users unfamiliar with OTRS.Sustaining a research infrastructure involves also two important aspects: trust and education. For DARIAH, it is crucial to increase trust and confidence from its users. In DESIR we develop recommendations and strategies accordingly, targeting new cross-disciplinary communities, based on the results of a survey and interviews addressed to the scientific community, with different levels of approach - national, institutional and individual.In addition, education is a key area and the project contributes to the ongoing discussions about the role and modalities of training and education in the development, consolidation and sustainability of digital research infrastructures. We believe that investing time and efforts into training and educating users is a way of securing the social sustainability of a research infrastructure.

  • English
    Authors: 
    Puren, Marie; Vernus, Pierre;
    Publisher: HAL CCSD
    Country: France
    Project: EC | SILKNOW (769504)

    International audience

  • Publication . Part of book or chapter of book . 2019
    Open Access English
    Authors: 
    Gelati, Francesco;
    Publisher: HAL CCSD
    Project: EC | EHRI (654164)

    The European Holocaust Research Infrastructure (EHRI) portal website aims to aggregate digitally available archival descriptions concerning the Holocaust. This portal is actually a meta-catalogue, or an information aggregator, whose biggest goal is to have up-to-date information by means of building sustainable data pipelines between EHRI and its content providers. Just like in similar archival information aggregators (e.g. Archives Portal Europe or Monasterium), the XML-based metadata standard Encoded Archival Description (EAD) plays a key role. The article presents how EADs are imported into the portal, mainly thanks to the Open Archive Initiative protocols.

  • Open Access
    Authors: 
    Romary, Laurent; Tasovac, Toma;
    Publisher: Zenodo
    Country: France
    Project: EC | ELEXIS (731015)

    International audience; Achieving consistent encoding within a given community of practice has been a recurrent issue for the TEI Guidelines. The topic is of particular importance for lexical data if we think of the potential wealth of content we could gain from pooling together the information available in the variety of highly structured, historical and contemporary lexical resources. Still, the encoding possibilities offered by the Dictionaries Chapter in the Guidelines are too numerous and too flexible to guarantee sufficient interoperability and a coherent model for searching, visualising or enriching multiple lexical resources.Following the spirit of TEI Analytics [Zillig, 2009], developed in the context of the MONK project, TEI Lex-0 aims at establishing a target format to facilitate the interoperability of heterogeneously encoded lexical resources. This is important both in the context of building lexical infrastructures as such [Ermolaev and Tasovac, 2012] and in the context of developing generic TEI-aware tools such as dictionary viewers and profilers. The format itself should not necessarily be one which is used for editing or managing individual resources, but one to which they can be univocally transformed to be queried, visualised, or mined in a uniform way. We are also aiming to stay as aligned as possible with the TEI subset developed in conjunction with the revision of the ISO LMF (Lexical Markup Framework) standard so that coherent design guidelines can be provided to the community (cf. [Romary, 2015]).The paper will provide an overview of the various domains covered by TEI Lex- 0 and the main decisions that were taken over the last 18 months: constraining the general structure of a lexical entry; offering mechanisms to overcome the limits of when used in retro-digitized dictionaries (by allowing, for instance, and as children of ); systematizing the representation of morpho-syntactic information [Bański et al., 2017]; providing a strict -based encoding of sense-related information; deprecating ; dealing with internal and external references in dictionary entries, providing more advanced encodings of etymology (see submission by Bowers, Herold and Romary); as well as defining technical constraints on the systematic use of @xml:id at different levels of the dictionary microstructure. The activity of the group has already lead to changes in the Guidelines in response to specific GitHub tickets.