33 Research products, page 1 of 4
- Publication . Report . 2019 . English
  Authors: Szprot, Jakub; Arpagaus, Brigitte; Ciula, Arianna; Clivaz, Claire; Gabay, Simon; Honegger, Matthieu; Hughes, Lorna; Immenhauser, Beat; Jakeman, Neil; Lhotak, Martin; Romanova, Natasha; Ros, Salvador; Schulthess, Sara; Tahko, Tuuli; Tolonen, Mikko; Erdinast Vulcan, Daphna; Willa, Pierre; Zehavi, Ora
  Publisher: HAL CCSD
  Country: France
  Project: EC | DESIR (731081)
This report describes the activities carried out between July and December 2019, and the progress made, towards establishing DARIAH membership in six countries: the Czech Republic, Finland, Israel, Spain, Switzerland, and the UK. Previous activities were described in detail in D3.2 - Regularly Monitor Country-Specific Progress in Enabling New DARIAH Membership. During the project lifetime, the Czech Republic joined DARIAH ERIC; in the other countries, collaboration with DARIAH has been greatly strengthened and significant progress towards DARIAH membership has been achieved. The report also outlines the next steps in the accession processes, building on the results of the DESIR project.
- Publication . Report . 2019 . English
  Authors: Tahko, Tuuli; Zehavi, Ora; Lhotak, Martin; Romanova, Natasha; Clivaz, Claire; Ros, Salvador; Raciti, Marco
  Publisher: HAL CCSD
  Country: France
  Project: EC | DESIR (731081), EC | Locus Ludi (741520)
The DESIR project sets out to strengthen the sustainability of DARIAH and firmly establish it as a long-term leader and partner within arts and humanities communities. The project was designed to address six core infrastructural sustainability dimensions, one of which was dedicated to training and education, which is also one of the four pillars identified in the DARIAH Strategic Plan 2019-2026. In the framework of Work Package 7: Teaching, DESIR organised dedicated workshops in the six DARIAH accession countries (Czech Republic, Finland, Israel, Spain, Switzerland and the United Kingdom) to introduce them to the DARIAH infrastructure and related services, and to develop methodological research skills. The topic of each workshop was decided by the accession countries' representatives according to the training needs of the national communities of researchers in the (Digital) Humanities. Training topics varied greatly: some workshops aimed to introduce participants to specific methodological research skills, while other events focused on the infrastructural role of training and education. The workshops organised in the context of Work Package 7: Teaching are listed below:
• CZECH REPUBLIC: “A series of fall tutorials 2019 organized by LINDAT/CLARIAHCZ, tutorial #3 on TEI Training”, 28 November 2019, Prague;
• FINLAND: “Reuse & sustainability: Open Science and social sciences and humanities research infrastructures”, 23 October 2019, Helsinki;
• ISRAEL: “Introduction to Text Encoding and Digital Editions”, 24 October 2019, Haifa;
• SPAIN: “DESIR Workshop: Digital Tools, Shared Data, and Research Dissemination”, 3 July 2019, Madrid;
• SWITZERLAND: “Sharing the Experience: Workflows for the Digital Humanities”, 5-6 December 2019, Neuchâtel;
• UNITED KINGDOM: “Research Software Engineering for Digital Humanities: Role of Training in Sustaining Expertise”, 9 December, London.
- Publication . Article . 2019 . Open Access . English
  Authors: Van der Eycken, Johan; Gheldof, Tom; Styven, Dorien; Depoortere, Rolande
  Publisher: HAL CCSD
  Countries: Belgium, France
This article shows that metadata plays a central role in our society and concludes that through collaborative work, it is possible to pool solutions and to establish relationships of cooperation, both at the level of practical tool development and with regard to sharing and creating knowledge and know-how. Published in: ABB: Archives et Bibliothèques de Belgique - Archief- en Bibliotheekwezen in België, vol. 106, pp. 135-144.
- Publication . Preprint . Article . 2019 . Open Access . English
  Authors: Rizza, Ettore; Chardonnens, Anne; Van Hooland, Seth
  Publisher: HAL CCSD
  Countries: France, Belgium
More and more cultural institutions use Linked Data principles to share and connect their collection metadata. In the archival field, initiatives are emerging to exploit data contained in archival descriptions and to adapt encoding standards to the semantic web. In this context, online authority files can be used to enrich metadata. However, relying on a decentralized network of knowledge bases such as Wikidata, DBpedia or even VIAF has its own difficulties. This paper aims to offer a critical view of these linked authority files by adopting a close-reading approach. Through a practical case study, we intend to identify and illustrate the possibilities and limits of RDF triples compared to institutions' less structured metadata. Comment: DARIAH workshop "Trust and Understanding: the value of metadata in a digitally joined-up world" (14/05/2018, Brussels); preprint of the submission to the journal "Archives et Bibliothèques de Belgique".
- Publication . Conference object . 2019 . English
  Authors: Bassett, Sheena; Wessels, Leon; Krauwer, Steven; Maegaard, Bente; Hollander, Hella; Admiraal, Femmy; Romary, Laurent; Uiterwaal, Frank
  Publisher: HAL CCSD
  Country: France
  Project: EC | PARTHENOS (654119)
Several Research Infrastructures (RIs) exist in the Humanities and Social Sciences, some of which, such as CLARIN, DARIAH and CESSDA, address specific areas of interest, i.e. linguistic studies, digital humanities and social science data archives. RIs are also unique in their scope and application, largely tailored to their specific community needs. However, commonalities do exist, and it is recognised that benefits are to be gained from these, such as efficient use of resources, enabling multi-disciplinary research and sharing good practices. As such, a bridging project, PARTHENOS, has worked closely with CLARIN and DARIAH as well as ARIADNE (archaeology), CENDARI (history), EHRI (Holocaust studies) and E-RIHS (heritage science) to identify, develop and promote these commonalities. In this paper, we present some specific examples of cross-discipline and trans-border applications arising from joint RI collaboration, allowing for entirely new avenues of research.
- Publication . Other literature type . Conference object . 2019 . Open Access . English
  Authors: Lamé, M.; Pittet, P.; Ponchio, F.; Markhoff, B.; Sanfilippo, Emilio Maria
  Publisher: HAL CCSD
  Countries: France, Italy
In this paper, we present Heterotoki, an online communication-driven decision support system for aligning terms from one dataset with terms of another dataset (whether or not it is a standardized controlled vocabulary). Heterotoki differs from existing proposals in that it takes place at the interface with humans, inviting the experts to commit to their definitions, so as to either agree to validate the mapping or propose some enrichment to the terminologies. More precisely, unlike most existing proposals that support terminology alignment, Heterotoki sustains the negotiation of meaning through semantic coordination support built into its interface design. This negotiation involves the domain experts who produced the multiple datasets.
- Publication . Other literature type . Conference object . 2019 . Open Access . English
  Authors: Marlet, Olivier; Francart, Thomas; Markhoff, Béatrice; Rodier, Xavier
  Publisher: HAL CCSD
  Country: France
  Project: EC | ARIADNEplus (823914)
CIDOC CRM is an ontology intended to facilitate the integration, mediation and interchange of heterogeneous cultural heritage information. The Semantic Web, with its Linked Open Data (LOD) cloud, enables scholars and cultural institutions to publish their data in RDF, using CIDOC CRM as an interlingua that enables a semantically consistent re-interpretation of their data. More and more projects have mapped legacy datasets to CIDOC CRM, and successful Extract-Transform-Load data-integration processes have been performed in this way. A next step is enabling people and applications to dynamically explore autonomous datasets using the semantic mediation offered by CIDOC CRM. This is the purpose of OpenArchaeo, a tool for querying archaeological datasets on the LOD cloud. We present its main features: the principles behind its user-friendly query interface and its SPARQL endpoint for programs, together with its overall architecture, designed to be extendable and scalable, for handling transparent interconnections with evolving distributed sources while achieving good efficiency.
- Publication . 2019 . English
  Authors: Romary, Laurent; Biabiany, Damien; Illmayer, Klaus; Puren, Marie; Riondet, Charles; Seillier, Dorian; Tadjou, Lionel
  Publisher: HAL CCSD
  Country: France
  Project: EC | PARTHENOS (654119)
- Publication . Article . 2021 . Open Access . English
  Authors: Frank Uiterwaal; Franco Niccolucci; Sheena Bassett; Steven Krauwer; Hella Hollander; Femmy Admiraal; Laurent Romary; George Bruseker; Carlo Meghini; Jennifer Edmond; Mark Hedges
  Publisher: Edinburgh University Press for the Association for History and Computing, Edinburgh, United Kingdom
  Countries: France, Italy, Netherlands
  Project: EC | PARTHENOS (654119)
This article has been accepted for publication by EUP in the IJHAC: International Journal of Humanities and Arts Computing (https://www.euppublishing.com/loi/ijhac). Since the first ESFRI roadmap in 2006, multiple humanities Research Infrastructures (RIs) have been set up all over the European continent, supporting archaeologists (ARIADNE), linguists (CLARIN-ERIC), Holocaust researchers (EHRI), cultural heritage specialists (IPERION-CH) and others. These examples only scratch the surface of the breadth of research communities that have benefited from close cooperation in the European Research Area. While each field developed discipline-specific services over the years, common themes can also be distinguished. All humanities RIs address, to varying degrees, questions around research data management, the use of standards and the desired interoperability of data across disciplinary boundaries. This article sheds light on how the cluster project PARTHENOS developed pooled services and shared solutions for its audience of humanities researchers, RI managers and policymakers. At a time when the convergence of existing infrastructure is becoming ever more important – with the construction of a European Open Science Cloud as an audacious, ultimate goal – we hope that our experiences inform future work and provide inspiration on how to exploit synergies in interdisciplinary, transnational, scientific cooperation.
- Publication . Article . 2020 . Open Access . English
  Authors: Luca Foppiano; Laurent Romary
  Publisher: HAL CCSD
  Country: France
  Project: EC | HIRMEOS (731102)
This paper presents an attempt to provide a generic named-entity recognition and disambiguation module (NERD) called entity-fishing as a stable online service that demonstrates the possible delivery of sustainable technical services within DARIAH, the European digital research infrastructure for the arts and humanities. Deployed as part of the national infrastructure Huma-Num in France, this service provides an efficient state-of-the-art implementation coupled with standardised interfaces, allowing easy deployment in a variety of potential digital humanities contexts. The topics of accessibility and sustainability have long been discussed in the attempt to provide some best practices in the widely fragmented ecosystem of the DARIAH research infrastructure. The history of entity-fishing has been mentioned as an example of good practice: initially developed in the context of the FP7 CENDARI project, it was well received by the user community and continued to be developed within the H2020 HIRMEOS project, where several open access publishers have integrated the service into their collections of published monographs as a means to enhance retrieval and access. entity-fishing implements entity extraction as well as disambiguation against Wikipedia and Wikidata entries. The service is accessible through a REST API, which allows easy and seamless integration, a language-independent and stable convention, and a widely used service-oriented architecture (SOA) design. Input and output data are carried by a query data model with a defined structure, providing flexibility to support the processing of partially annotated text or the distribution of text over several queries. The interface implements a variety of functionalities, such as language recognition, sentence segmentation and modules for accessing and looking up concepts in the knowledge base.
The API itself integrates more advanced contextual parametrisation and ranked outputs, allowing for resilient integration in various possible use cases. The entity-fishing API has been used as a concrete use case to draft the experimental stand-off proposal, which has been submitted for integration into the TEI guidelines. The representation is also compliant with the Web Annotation Data Model (WADM). In this paper we aim to describe the functionalities of the service as a reference contribution to the subject of web-based NERD services. In order to cover all aspects, the architecture is presented from two complementary viewpoints. First, we discuss the system from the data angle, detailing the workflow from input to output and unpacking each building block in the processing flow. Secondly, with a more academic approach, we provide a transversal schema of the different components, taking into account non-functional requirements in order to facilitate the discovery of bottlenecks, hotspots and weaknesses. The aim here is to give a description of the tool and, at the same time, a technical software engineering analysis that will help the reader understand our choice of the resources allocated in the infrastructure. Thanks to the work of millions of volunteers, Wikipedia has today reached a level of stability and completeness that leaves no usable alternative on the market (considering also the licence aspect). The launch of Wikidata in 2012 completed the picture with a complementary language-independent meta-model, which is becoming the scientific reference for many disciplines. After providing an introduction to Wikipedia and Wikidata, we describe the knowledge base: the data organisation, the entity-fishing process to exploit it and the way it is built from nightly dumps using an offline process. We conclude the paper by presenting our solution for the service deployment: which resources were allocated and how.
The service has been in production since Q3 of 2017 and has been used extensively by the H2020 HIRMEOS partners during the integration with the publishing platforms. We have striven to provide the best performance with the minimum amount of resources. Thanks to the Huma-Num infrastructure, we still have the possibility to scale up the infrastructure as needed, for example to support an increase in demand or a temporary need to process a large backlog of documents. In the long term, thanks to this sustainable environment, we are planning to keep delivering the service far beyond the end of the H2020 HIRMEOS project.
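The abstract above mentions a query data model that lets a client spread a long text over several queries. The following is only a minimal offline sketch of that idea, not the service's actual client code: the helper names `split_text` and `build_queries` are hypothetical, and the payload fields (`text`, `language`, `entities`) are assumptions chosen for illustration that should be checked against the entity-fishing documentation before use. No network request is made.

```python
import json
import re
from typing import Dict, List


def split_text(text: str, max_chars: int = 200) -> List[str]:
    """Greedily pack whole sentences into chunks of roughly max_chars.

    A single sentence longer than max_chars is kept whole rather than cut.
    """
    # Naive sentence split: break on whitespace that follows ., ! or ?
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    chunks: List[str] = []
    current = ""
    for sentence in sentences:
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            # Adding this sentence would overflow: close the current chunk.
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


def build_queries(text: str, lang: str = "en", max_chars: int = 200) -> List[Dict]:
    """Build one query payload per chunk (field names are assumed, see lead-in)."""
    return [
        {"text": chunk, "language": {"lang": lang}, "entities": []}
        for chunk in split_text(text, max_chars)
    ]


if __name__ == "__main__":
    sample = (
        "Ada Lovelace was born in London. "
        "She worked with Charles Babbage. Their letters survive."
    )
    for query in build_queries(sample, max_chars=40):
        print(json.dumps(query))
```

Splitting on sentence boundaries keeps each mention inside a single query; a real client would also need to shift the character offsets returned for each chunk back to positions in the full text.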