Article, 2023 (embargo end date: 01 Jan 2023). Country: Spain. Publisher: arXiv. Funded by: EC | CLS INFRA, EC | LyrAIcs. Authors: Benito-Santos, Alejandro; Ghajari, Adrián; Hernández, Pedro; Fresno, Victor; Ros, Salvador; González-Blanco, Elena. Handle: 10045/137147
In this paper, we present a new dataset and benchmark tailored to the task of semantic similarity in song lyrics. Our dataset, originally consisting of 2775 pairs of Spanish songs, was annotated in a collective annotation experiment by 63 native annotators. After collecting and refining the data to ensure a high degree of consensus and data integrity, we obtained 676 high-quality annotated pairs, which were used to evaluate the performance of various state-of-the-art monolingual and multilingual language models. Consequently, we established baseline results that we hope will be useful to the community in future academic and industrial applications in this context.
This research has been carried out in the framework of the grant LyrAIcs (grant agreement ID: 964009), funded by ERC-POCLS, and of the grant CLS INFRA (reference 101004984), funded by H2020-INFRAIA-2020-1. It has also received funding from the project ISL: Intelligent Systems for Learning (GID2016-39) in the call PID 22/23, and from FAIRTRANSNLP-DIAGNOSIS: Measuring and quantifying bias and fairness in NLP systems, grant PID2021-124361OB-C32, funded by MCIN/AEI/10.13039/501100011033 and by ERDF, EU A way of making Europe. Alejandro Benito-Santos acknowledges support from the postdoctoral grant "Margarita Salas", awarded by the Spanish Ministry of Universities.
Data sources: arXiv.org e-Print Archive (other literature type, preprint, 2023); Repositorio Institucional de la Universidad de Alicante (article, 2023); Recolector de Ciencia Abierta, RECOLECTA (article, 2023). Full text: https://doi.org/10.26342/2023-71-12. DOI: 10.48550/arxiv.2306.01325. Access route: green. Citations: 0; popularity: average; influence: average; impulse: average. Views: 7; downloads: 10.

Article, 2021. Countries: Italy, France, Netherlands. Publisher: Edinburgh University Press. Funded by: EC | PARTHENOS. Authors: Frank Uiterwaal; Franco Niccolucci; Sheena Bassett; Steven Krauwer; Hella Hollander; Femmy Admiraal; Laurent Romary; George Bruseker; Carlo Meghini; Jennifer Edmond; Mark Hedges.
Since the first ESFRI roadmap in 2006, multiple humanities Research Infrastructures (RIs) have been set up all over the European continent, supporting archaeologists (ARIADNE), linguists (CLARIN-ERIC), Holocaust researchers (EHRI), cultural heritage specialists (IPERION-CH) and others. These examples only scratch the surface of the breadth of research communities that have benefited from close cooperation in the European Research Area. While each field developed discipline-specific services over the years, common themes can also be distinguished. All humanities RIs address, in varying degrees, questions around research data management, the use of standards and the desired interoperability of data across disciplinary boundaries. This article sheds light on how the cluster project PARTHENOS developed pooled services and shared solutions for its audience of humanities researchers, RI managers and policymakers. At a time when the convergence of existing infrastructure is becoming ever more important (with the construction of a European Open Science Cloud as an audacious, ultimate goal), we hope that our experiences inform future work and provide inspiration on how to exploit synergies in interdisciplinary, transnational, scientific cooperation.
This article has been accepted for publication by EUP in the IJHAC: International Journal of Humanities and Arts Computing (https://www.euppublishing.com/loi/ijhac). Audience: international.
Data sources: International Journal of Humanities and Arts Computing (article, 2021, peer-reviewed; license: EUP TDM); International Journal of Humanities and Arts Computing (article; license: CC BY; data source: UnpayWall); Hal-Diderot (article, 2021; license: CC BY). Full text: https://hal.inria.fr/hal-03402145/document. DOI: 10.3366/ijhac.2021.0264. Access route: hybrid. Citations: 2; popularity: average; influence: average; impulse: average. Views: 2.

Report, 2021. Country: France. Language: English. Publisher: HAL CCSD. Funded by: EC | ELEXIS. Authors: Tasovac, Toma; Romary, Laurent; Tóth-Czifra, Erzsébet; Marinski, Irena.
OpenAIRE identifier: od_______177::1ddda313a51a3c3a9b2012f453ecf1f7. Citations: 0; popularity: average; influence: average; impulse: average.

Article, 2020. Country: France. Publisher: Japanese Association for Digital Humanities. Funded by: EC | HIRMEOS. Authors: Foppiano, Luca; Romary, Laurent.
This paper presents an attempt to provide a generic named-entity recognition and disambiguation (NERD) module called entity-fishing as a stable online service that demonstrates the possible delivery of sustainable technical services within DARIAH, the European digital research infrastructure for the arts and humanities. Deployed as part of the national infrastructure Huma-Num in France, this service provides an efficient state-of-the-art implementation coupled with standardised interfaces, allowing easy deployment in a variety of potential digital humanities contexts. Initially developed in the context of the FP7 EU project CENDARI, the software was well received by the user community and was developed further within the H2020 HIRMEOS project, where several open access publishers have integrated the service into their collections of published monographs as a means to enhance retrieval and access.
entity-fishing implements entity extraction as well as disambiguation against Wikipedia and Wikidata entries. The service is accessible through a REST API, which allows easier and seamless integration, follows a language-independent and stable convention, and relies on a widely used service-oriented architecture (SOA) design. Input and output data are carried by a query data model with a defined structure, providing the flexibility to support the processing of partially annotated text or the distribution of text over several queries. The interface implements a variety of functionalities, such as language recognition, sentence segmentation and modules for accessing and looking up concepts in the knowledge base. The API itself integrates more advanced contextual parametrisation and ranked outputs, allowing for resilient integration in various possible use cases. The entity-fishing API has been used as a concrete use case to draft the experimental stand-off proposal, which has been submitted for integration into the TEI guidelines. The representation is also compliant with the Web Annotation Data Model (WADM).
In this paper we aim to describe the functionalities of the service as a reference contribution to the subject of web-based NERD services. We detail the workflow from input to output and unpack each building block in the processing flow. In addition, taking a more academic approach, we provide a transversal schema of the different components that takes non-functional requirements into account, in order to facilitate the discovery of bottlenecks, hotspots and weaknesses. We also describe the underlying knowledge base, which is built from Wikipedia and Wikidata content. We conclude the paper by presenting our solution for the service deployment: which resources were allocated and how. The service has been in production since Q3 of 2017 and has been used extensively by the H2020 HIRMEOS partners during the integration with the publishing platforms. Audience: international.
Data sources: HAL Descartes; INRIA a CCSD electronic archive server; Mémoires en Sciences de l'Information et de la Communication (article, 2020; license: CC BY); HAL Descartes; HAL-Pasteur; HAL-Inserm; Mémoires en Sciences de l'Information et de la Communication; Hal-Diderot (article, conference object, 2020, 2018; license: CC BY); Journal of the Japanese Association for Digital Humanities (article, 2020, peer-reviewed; data source: Crossref). Full text: https://hal.inria.fr/hal-01812100v2/document. DOI: 10.17928/jjadh.5.1_22. Access route: gold. Citations: 4; popularity: top 10%; influence: average; impulse: average.

Article, conference object, 2019. Publisher: ACM. Funded by: EC | ALEXANDRIA, EC | AFEL, EC | DESIR. Authors: Hube, Christoph; Fetahu, Besnik.
Biased language commonly occurs around topics of a controversial nature, stirring disagreement between the parties involved in a discussion. This is because, for language and its use, and specifically the understanding and use of phrases, stances are cohesive within particular groups. However, such cohesiveness does not hold across groups. In collaborative environments or environments where impartial language is desired (e.g. Wikipedia, news media), statements and the language therein should represent the involved parties equally and be neutrally phrased. Biased language is introduced through the presence of inflammatory words or phrases, or statements that may be incorrect or one-sided, thus violating such consensus. In this work, we focus on the specific case of phrasing bias, which may be introduced through specific inflammatory words or phrases in a statement. For this purpose, we propose an approach that relies on recurrent neural networks to capture the inter-dependencies between the words in a phrase that introduce bias. We perform a thorough experimental evaluation, in which we show the advantages of a neural-based approach over competitors that rely on word lexicons and other hand-crafted features in detecting biased language. We are able to distinguish biased statements with a precision of P=0.92, thus significantly outperforming baseline models with an improvement of over 30%. Finally, we release the largest corpus of statements annotated for biased language.
Comment: The Twelfth ACM International Conference on Web Search and Data Mining, February 11-15, 2019, Melbourne, VIC, Australia
Data sources: arXiv.org e-Print Archive (other literature type, preprint, 2018); https://doi.org/10.1145/328960... (conference object, 2019, peer-reviewed; license: ACM Copyright Policies; data source: Crossref); https://doi.org/10.48550/arxiv... (article, 2018; license: arXiv Non-Exclusive Distribution; data source: Datacite). DOI: 10.1145/3289600.3291018. Access route: green. Citations: 28; popularity: top 10%; influence: top 10%; impulse: top 10%.

Article, 2018. Country: France. Publisher: Springer Science and Business Media LLC. Funded by: EC | PARTHENOS, EC | EHRI, EC | EHRI. Authors: Laurent Romary; Charles Riondet.
This article tackles the issue of integrating heterogeneous archival sources in one single data repository, namely the EHRI portal, whose aim is to support Holocaust research by providing online access to information about dispersed sources relating to the Holocaust (http://portal.ehri-project.eu). In this case, the problem at hand is to combine data coming from a network of archives in order to create an interoperable data space which can be used to search for, retrieve and disseminate content in the context of archival-based research. The central aspect of the work described in this paper is the assessment of the role of the Encoded Archival Description (EAD) standard as the basis for achieving the tasks described above. We have worked out a strategy for defining specific customizations of EAD that can be used at various stages of the process of integrating heterogeneous sources. We have developed a methodology based on a specification and customization method inspired by the extensive experience of the Text Encoding Initiative (TEI) community. In the TEI framework, one can model specific subsets or extensions of the TEI guidelines while maintaining both the technical (XML schemas) and editorial (documentation) content within a single framework. This work leads us to anticipate that the method we have developed may be of wider interest within similar environments and also, we believe, for the future maintenance of the EAD standard.
Special thanks to Annelies van Nispen (NIOD) and Hector Martinez Alonso (ALMAnaCH) for their help, and to Lou Burnard (TEI) for his wise comments. Audience: international.
Data sources: OpenAIRE; Archival Science (other literature type, article, 2018, peer-reviewed; license: Springer TDM); Hyper Article en Ligne - Sciences de l'Homme et de la Société; Hal-Diderot (article, 2018; license: CC BY); HAL Descartes; INRIA a CCSD electronic archive server; Mémoires en Sciences de l'Information et de la Communication (article, 2018; license: CC BY). Full text: https://hal.inria.fr/hal-01737568v2/document. DOI: 10.1007/s10502-018-9290-y. Access route: hybrid. Citations: 2; popularity: average; influence: top 10%; impulse: average.

Article, 2018. Countries: France, United Kingdom, Germany. Language: English. Publisher: HAL CCSD. Funded by: EC | CENDARI. Authors: Nadia Boukhelifa; Michael Bryant; Natasa Bulatovic; Ivan Čukić; Jean-Daniel Fekete; Milica Knežević; Jörg Lehmann; David I. Stuart; Carsten Thiel. DOI: 10.1145/3092906
The CENDARI infrastructure is a research-supporting platform designed to provide tools for transnational historical research, focusing on two topics: medieval culture and World War I. It offers end users modern Web-based tools that rely on a sophisticated infrastructure to collect, enrich, annotate, and search through large document corpora. Supporting researchers in their daily work is a novel concern for infrastructures. We describe how we gathered requirements through multiple methods to understand historians' needs and derive an abstract workflow to support them. We then outline the tools that we have built, tying their technical descriptions to the user requirements. The main tools are the note-taking environment and its faceted search capabilities; the data integration platform, including the Data API, which supports semantic enrichment through entity recognition; and the environment supporting the software development processes throughout the project to keep both technical partners and researchers in the loop. The outcomes are the technical results, the new resources developed and gathered, and the research workflow that has been described and documented. Audience: international.
Data sources: arXiv.org e-Print Archive (other literature type, preprint, 2016); Publikationenserver der Georg-August-Universität Göttingen (article, 2020); Journal on Computing and Cultural Heritage (article, 2018, peer-reviewed; license: ACM Copyright Policies; data source: Crossref); Hal-Diderot (article, 2018; license: CC BY); HAL - UPEC / UPEM; HAL-Pasteur; HAL-Inserm (article, 2018; license: CC BY). Full text: https://hal.inria.fr/hal-01523102v2/document. Access routes: green, hybrid.

Preprint, 2017. Country: France. Language: French. Publisher: HAL CCSD. Funded by: EC | EHRI. Authors: Vanden Daelen, Veerle; Edmond, Jennifer; Links, Petra; Priddy, Mike; Reijnhoudt, Linda; Tollar, Václav; van Nispen, Annelies; Hauwaert, Charlotte; Riondet, Charles.
Data sources: Hyper Article en Ligne; Hyper Article en Ligne - Sciences de l'Homme et de la Société; Hal-Diderot (other literature type, preprint, 2017). Full text: https://hal.inria.fr/hal-01632366/document. OpenAIRE identifier: od______2592::9fd3a1a786b0416aeaad80ab0d5ab200. Citations: 0; popularity: average; influence: average; impulse: average.

Conference object, 2015. Country: France. Language: English. Publisher: Eurographics Digital Library. Funded by: EC | CENDARI. Authors: Boukhelifa, Nadia; Giannisakis, Emmanouil; Dimara, Evanthia; Willett, Wesley; Fekete, Jean-Daniel.
In this paper we describe the development and evaluation of a visual analytics tool to support historical research. Historians continuously gather data related to their scholarly research from archival visits and background search. Organising and making sense of all this data can be challenging, as many historians continue to rely on analog or basic digital tools. We built an integrated note-taking environment for historians which unifies a set of functionalities we identified as important for historical research, including editing, tagging, searching, sharing and visualization. Our approach was to involve users from the initial stage of brainstorming and requirement analysis through to design, implementation and evaluation. We report on the process and results of our work, and conclude by reflecting on our own experience in conducting user-centered visual analytics design for digital humanities. Audience: international.
ProdInra arrow_drop_down Hal-DiderotConference object . 2015License: CC BY SAFull-Text: https://hal.inria.fr/hal-01156527/documentData sources: Hal-DiderotHAL Descartes; INRIA a CCSD electronic archive server; Mémoires en Sciences de l'Information et de la CommunicationConference object . 2015License: CC BY SAAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od______3379::2978ac0808120e015f4adf816ff96365&type=result"></script>'); --> </script>
For further information contact us at helpdesk@openaire.eu0 citations 0 popularity Average influence Average impulse Average Powered by BIP!more_vert ProdInra arrow_drop_down Hal-DiderotConference object . 2015License: CC BY SAFull-Text: https://hal.inria.fr/hal-01156527/documentData sources: Hal-DiderotHAL Descartes; INRIA a CCSD electronic archive server; Mémoires en Sciences de l'Information et de la CommunicationConference object . 2015License: CC BY SAAll Research productsarrow_drop_down <script type="text/javascript"> <!-- document.write('<div id="oa_widget"></div>'); document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=od______3379::2978ac0808120e015f4adf816ff96365&type=result"></script>'); --> </script>
Publication . Article . 2023 . Spain
Embargo end date: 01 Jan 2023
Publisher: arXiv
Funded by: EC | CLS INFRA, EC | LyrAIcs
Authors: Benito-Santos, Alejandro; Ghajari, Adrián; Hernández, Pedro; Fresno, Victor; Ros, Salvador; González-Blanco, Elena
handle: 10045/137147
In this paper, we present a new dataset and benchmark tailored to the task of semantic similarity in song lyrics. Our dataset, originally consisting of 2775 pairs of Spanish songs, was annotated in a collective annotation experiment by 63 native annotators. After collecting and refining the data to ensure a high degree of consensus and data integrity, we obtained 676 high-quality annotated pairs that were used to evaluate the performance of various state-of-the-art monolingual and multilingual language models. Consequently, we established baseline results that we hope will be useful to the community in all future academic and industrial applications conducted in this context.
This research has been carried out in the framework of the Grant LyrAIcs, Grant agreement ID: 964009, funded by ERC-POCLS, and in the framework of the Grant CLS INFRA, reference 101004984, funded by H2020-INFRAIA-2020-1. It has also received funding from the project ISL: Intelligent Systems for Learning (GID2016-39) in the call PID 22/23, and from FAIRTRANSNLP-DIAGNOSIS: Measuring and quantifying bias and fairness in NLP systems, grant PID2021-124361OB-C32, funded by MCIN/AEI/10.13039/501100011033 and by ERDF, EU A way of making Europe. Alejandro Benito-Santos acknowledges support from the postdoctoral grant "Margarita Salas", awarded by the Spanish Ministry of Universities.
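The abstract above does not say which metric was used to compare language models against the annotated pairs. As a minimal sketch only, assuming the common practice of rank correlation between human similarity ratings and model similarity scores, the comparison could look like the following. The function names and the toy scores are hypothetical and are not taken from the paper.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next sorted value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def spearman(x, y):
    """Spearman correlation: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical toy data: one human rating and one model score per song pair.
human_scores = [0.0, 1.0, 2.0, 3.0, 4.0]
model_scores = [0.10, 0.35, 0.30, 0.80, 0.95]
rho = spearman(human_scores, model_scores)
```

A rank-based measure is used in this sketch because it only asks the model to order the pairs as the annotators did, not to reproduce their rating scale.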
Data sources:
- arXiv.org e-Print Archive: Other literature type . Preprint . 2023
- Repositorio Institucional de la Universidad de Alicante: Article . 2023
- Recolector de Ciencia Abierta, RECOLECTA: Article . 2023 . Full-Text: https://doi.org/10.26342/2023-71-12
doi: 10.48550/arxiv.2306.01325
Access routes: Green
Citations: 0 . Popularity: Average . Influence: Average . Impulse: Average . Views: 7 . Downloads: 10
Publication . Article . 2021 . Italy, France, Netherlands
Publisher: Edinburgh University Press
Funded by: EC | PARTHENOS
Authors: Frank Uiterwaal; Franco Niccolucci; Sheena Bassett; Steven Krauwer; Hella Hollander; Femmy Admiraal; Laurent Romary; George Bruseker; Carlo Meghini; Jennifer Edmond; Mark Hedges
Since the first ESFRI roadmap in 2006, multiple humanities Research Infrastructures (RIs) have been set up all over the European continent, supporting archaeologists (ARIADNE), linguists (CLARIN-ERIC), Holocaust researchers (EHRI), cultural heritage specialists (IPERION-CH) and others. These examples only scratch the surface of the breadth of research communities that have benefited from close cooperation in the European Research Area.
While each field developed discipline-specific services over the years, common themes can also be distinguished. All humanities RIs address, to varying degrees, questions around research data management, the use of standards and the desired interoperability of data across disciplinary boundaries.
This article sheds light on how the cluster project PARTHENOS developed pooled services and shared solutions for its audience of humanities researchers, RI managers and policymakers. At a time when the convergence of existing infrastructure is becoming ever more important, with the construction of a European Open Science Cloud as an audacious, ultimate goal, we hope that our experiences inform future work and provide inspiration on how to exploit synergies in interdisciplinary, transnational, scientific cooperation.
This article has been accepted for publication by EUP in the IJHAC: International Journal of Humanities and Arts Computing (https://www.euppublishing.com/loi/ijhac)
International audience
Data sources:
- International Journal of Humanities and Arts Computing: Article . 2021 . Peer-reviewed . License: EUP TDM
- International Journal of Humanities and Arts Computing: Article . License: CC BY . Data sources: UnpayWall
- Hal-Diderot: Article . 2021 . License: CC BY . Full-Text: https://hal.inria.fr/hal-03402145/document
doi: 10.3366/ijhac.2021.0264
Access routes: hybrid
Citations: 2 . Popularity: Average . Influence: Average . Impulse: Average . Views: 2
Publication . Report . 2021 . France . English
Publisher: HAL CCSD
Funded by: EC | ELEXIS
Authors: Tasovac, Toma; Romary, Laurent; Tóth-Czifra, Erzsébet; Marinski, Irena
Citations: 0 . Popularity: Average . Influence: Average . Impulse: Average
Publication . Article . 2020 . France
Publisher: Japanese Association for Digital Humanities
Funded by: EC | HIRMEOS
Authors: Foppiano, Luca; Romary, Laurent
This paper presents an attempt to provide a generic named-entity recognition and disambiguation module (NERD) called entity-fishing as a stable online service that demonstrates the possible delivery of sustainable technical services within DARIAH, the European digital research infrastructure for the arts and humanities. Deployed as part of the national infrastructure Huma-Num in France, this service provides an efficient state-of-the-art implementation coupled with standardised interfaces allowing easy deployment in a variety of potential digital humanities contexts. Initially developed in the context of the FP7 EU project CENDARI, the software was well received by the user community and continued to be developed within the H2020 HIRMEOS project, where several open access publishers have integrated the service into their collections of published monographs as a means to enhance retrieval and access. entity-fishing implements entity extraction as well as disambiguation against Wikipedia and Wikidata entries. The service is accessible through a REST API, which allows easy and seamless integration, relies on language-independent and stable conventions, and follows a widely used service-oriented architecture (SOA) design. Input and output data are carried by a query data model with a defined structure, providing flexibility to support the processing of partially annotated text or the distribution of a text over several queries. The interface implements a variety of functionalities, such as language recognition, sentence segmentation and modules for accessing and looking up concepts in the knowledge base.
The API itself integrates more advanced contextual parametrisation or ranked outputs, allowing for resilient integration in various possible use cases. The entity-fishing API has been used as a concrete use case to draft the experimental stand-off proposal, which has been submitted for integration into the TEI guidelines. The representation is also compliant with the Web Annotation Data Model (WADM). In this paper we aim to describe the functionalities of the service as a reference contribution to the subject of web-based NERD services. We detail the workflow from input to output and unpack each building block in the processing flow. In addition, with a more academic approach, we provide a transversal schema of the different components, taking into account non-functional requirements in order to facilitate the discovery of bottlenecks, hotspots and weaknesses. We also describe the underlying knowledge base, which is set up on the basis of Wikipedia and Wikidata content. We conclude the paper by presenting our solution for the service deployment: which resources were allocated, and how. The service has been in production since Q3 of 2017, and has been extensively used by the H2020 HIRMEOS partners during the integration with the publishing platforms.
International audience
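To make the idea of a structured query data model more concrete, here is a minimal sketch of how a client might assemble queries before sending them to such a service. The field names used below (`text`, `language`, `entities`, `rawName`, `offsetStart`, `offsetEnd`) are assumptions made for illustration and should be checked against the service documentation. No endpoint is called, and the helper functions are hypothetical.

```python
import json


def build_query(text, lang="en", known_entities=None):
    """Assemble one query as a plain dict.

    Field names are illustrative assumptions, not a confirmed schema.
    """
    return {
        "text": text,
        "language": {"lang": lang},
        # Mentions the client has already annotated (partially annotated text).
        "entities": list(known_entities or []),
    }


def split_into_queries(paragraphs, lang="en"):
    """Spread a long text over several queries, one per non-empty paragraph."""
    return [build_query(p, lang) for p in paragraphs if p.strip()]


# A mention identified by the client, located by character offsets in the text.
sentence = "Austria invaded and fought the Serbian army."
pre_annotated = {"rawName": "Austria", "offsetStart": 0, "offsetEnd": 7}

query = build_query(sentence, known_entities=[pre_annotated])
payload = json.dumps(query, ensure_ascii=False)  # body a client would submit
```

Keeping pre-annotated mentions as offsets into the original text means the service response can be aligned with the source document without re-tokenising it, which is one plausible reason for a stand-off style of representation.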
Data sources:
- HAL Descartes; INRIA a CCSD electronic archive server; Mémoires en Sciences de l'Information et de la Communication: Article . 2020 . License: CC BY
- HAL Descartes; HAL-Pasteur; HAL-Inserm; Mémoires en Sciences de l'Information et de la Communication; Hal-Diderot: Article . Conference object . 2020 . 2018 . License: CC BY . Full-Text: https://hal.inria.fr/hal-01812100v2/document
- Journal of the Japanese Association for Digital Humanities: Article . 2020 . Peer-reviewed . Data sources: Crossref
doi: 10.17928/jjadh.5.1_22
Access routes: gold
Citations: 4 . Popularity: Top 10% . Influence: Average . Impulse: Average
Publication . Article, Conference object . 2019
Publisher: ACM
Funded by: EC | ALEXANDRIA, EC | AFEL, EC | DESIR
Authors: Hube, Christoph; Fetahu, Besnik
Biased language commonly occurs around topics of a controversial nature, stirring disagreement between the different parties involved in a discussion. This is because, for language and its use, and specifically the understanding and use of phrases, stances are cohesive within particular groups. However, such cohesiveness does not hold across groups. In collaborative environments or environments where impartial language is desired (e.g. Wikipedia, news media), statements and the language therein should represent the involved parties equally and be neutrally phrased. Biased language is introduced through the presence of inflammatory words or phrases, or statements that may be incorrect or one-sided, thus violating such consensus. In this work, we focus on the specific case of phrasing bias, which may be introduced through specific inflammatory words or phrases in a statement. For this purpose, we propose an approach that relies on recurrent neural networks in order to capture the inter-dependencies between words in a phrase that introduce bias. We perform a thorough experimental evaluation, where we show the advantages of a neural approach over competitors that rely on word lexicons and other hand-crafted features in detecting biased language. We are able to distinguish biased statements with a precision of P=0.92, thus significantly outperforming baseline models with an improvement of over 30%. Finally, we release the largest corpus of statements annotated for biased language.
Comment: The Twelfth ACM International Conference on Web Search and Data Mining, February 11-15, 2019, Melbourne, VIC, Australia
Data sources:
- arXiv.org e-Print Archive: Other literature type . Preprint . 2018
- https://doi.org/10.1145/328960...: Conference object . 2019 . Peer-reviewed . License: ACM Copyright Policies . Data sources: Crossref
- https://doi.org/10.48550/arxiv...: Article . 2018 . License: arXiv Non-Exclusive Distribution . Data sources: Datacite
doi: 10.1145/3289600.3291018
Access routes: Green
Citations: 28 . Popularity: Top 10% . Influence: Top 10% . Impulse: Top 10%
Publication . Article . 2018 . France
Publisher: Springer Science and Business Media LLC
Funded by: EC | PARTHENOS, EC | EHRI, EC | EHRI
Authors: Laurent Romary; Charles Riondet
This article tackles the issue of integrating heterogeneous archival sources in one single data repository, namely the EHRI portal, whose aim is to support Holocaust research by providing online access to information about dispersed sources relating to the Holocaust (http://portal.ehri-project.eu). In this case, the problem at hand is to combine data coming from a network of archives in order to create an interoperable data space which can be used to search for, retrieve and disseminate content in the context of archival-based research. The central aspect of the work described in this paper is the assessment of the role of the Encoded Archival Description (EAD) standard as the basis for achieving the tasks described above. We have worked out a strategy for defining specific customizations of EAD that could be used at various stages of the process of integrating heterogeneous sources. We have developed a methodology based on a specification and customization method inspired by the extensive experience of the Text Encoding Initiative (TEI) community. In the TEI framework, one has the possibility to model specific subsets or extensions of the TEI guidelines while maintaining both the technical (XML schemas) and editorial (documentation) content within a single framework. This work leads us to anticipate that the method we have developed may be of wider interest within similar environments and, we believe, for the future maintenance of the EAD standard.
Special thanks to Annelies van Nispen (NIOD) and Hector Martinez Alonso (ALMAnaCH) for their help, and to Lou Burnard (TEI) for his wise comments.
International audience
Data sources:
- OpenAIRE; Archival Science: Other literature type . Article . 2018 . Peer-reviewed . License: Springer TDM
- Hyper Article en Ligne - Sciences de l'Homme et de la Société; Hal-Diderot: Article . 2018 . License: CC BY . Full-Text: https://hal.inria.fr/hal-01737568v2/document
- HAL Descartes; INRIA a CCSD electronic archive server; Mémoires en Sciences de l'Information et de la Communication: Article . 2018 . License: CC BY
doi: 10.1007/s10502-018-9290-y
Access routes: hybrid
Citations: 2 . Popularity: Average . Influence: Top 10% . Impulse: Average
Publication . Article . 2018 . France, United Kingdom, Germany . English
Publisher: HAL CCSD
Funded by: EC | CENDARI
Authors: Nadia Boukhelifa; Michael Bryant; Natasa Bulatovic; Ivan Čukić; Jean-Daniel Fekete; Milica Knežević; Jörg Lehmann; David I. Stuart; Carsten Thiel
doi: 10.1145/3092906
The CENDARI infrastructure is a research-supporting platform designed to provide tools for transnational historical research, focusing on two topics: medieval culture and World War I. It exposes to end users modern Web-based tools relying on a sophisticated infrastructure to collect, enrich, annotate, and search through large document corpora. Supporting researchers in their daily work is a novel concern for infrastructures. We describe how we gathered requirements through multiple methods to understand historians' needs and derive an abstract workflow to support them. We then outline the tools that we have built, tying their technical descriptions to the user requirements. The main tools are the note-taking environment and its faceted search capabilities; the data integration platform, including the Data API, supporting semantic enrichment through entity recognition; and the environment supporting the software development processes throughout the project to keep both technical partners and researchers in the loop. The outcomes are technical, together with new resources that were developed and gathered and a research workflow that has been described and documented.
International audience
Data sources:
- OpenAIRE
- arXiv.org e-Print Archive: Other literature type . Preprint . 2016
- Publikationenserver der Georg-August-Universität Göttingen: Article . 2020
- Journal on Computing and Cultural Heritage: Article . 2018 . Peer-reviewed . License: ACM Copyright Policies . Data sources: Crossref
- Hal-Diderot: Article . 2018 . License: CC BY . Full-Text: https://hal.inria.fr/hal-01523102v2/document
- HAL - UPEC / UPEM; HAL-Pasteur; HAL-Inserm: Article . 2018 . License: CC BY . Full-Text: https://hal.inria.fr/hal-01523102v2/document
Access routes: Green, hybrid
Publication . Preprint . 2017 . France . French
Publisher: HAL CCSD
Funded by: EC | EHRI
Authors: Vanden Daelen, Veerle; Edmond, Jennifer; Links, Petra; Priddy, Mike; Reijnhoudt, Linda; Tollar, Václav; van Nispen, Annelies; Hauwaert, Charlotte; Riondet, Charles
Data sources:
- HAL Descartes; INRIA...
- Hyper Article en Ligne; Hyper Article en Ligne - Sciences de l'Homme et de la Société; Hal-Diderot: Other literature type . Preprint . 2017 . Full-Text: https://hal.inria.fr/hal-01632366/document
Citations: 0 . Popularity: Average . Influence: Average . Impulse: Average
Publication . Conference object . 2015 . France . English
Publisher: Eurographics Digital Library
Funded by: EC | CENDARI
Authors: Boukhelifa, Nadia; Giannisakis, Emmanouil; Dimara, Evanthia; Willett, Wesley; Fekete, Jean-Daniel
In this paper we describe the development and evaluation of a visual analytics tool to support historical research. Historians continuously gather data related to their scholarly research from archival visits and background search. Organising and making sense of all this data can be challenging, as many historians continue to rely on analog or basic digital tools. We built an integrated note-taking environment for historians which unifies a set of functionalities we identified as important for historical research, including editing, tagging, searching, sharing and visualization. Our approach was to involve users from the initial stage of brainstorming and requirement analysis through to design, implementation and evaluation. We report on the process and results of our work, and conclude by reflecting on our own experience in conducting user-centered visual analytics design for digital humanities.
International audience
Data sources: ProdInra
Hal-Diderot: Conference object . 2015 . License: CC BY SA . Full-Text: https://hal.inria.fr/hal-01156527/document . Data sources: Hal-Diderot
HAL Descartes; INRIA a CCSD electronic archive server; Mémoires en Sciences de l'Information et de la Communication: Conference object . 2015 . License: CC BY SA