Advanced search in Research products
Research products
arrow_drop_down
Searching FieldsTerms
Any field
arrow_drop_down
includes
arrow_drop_down
Include:
The following results are related to DARIAH EU. Are you interested to view more results? Visit OpenAIRE - Explore.
20 Research products, page 1 of 2

  • DARIAH EU
  • Publications
  • Other research products
  • 2013-2022
  • Open Access
  • European Commission
  • EC|FP7
  • DARIAH EU

10
arrow_drop_down
Date (most recent)
arrow_drop_down
  • Publication . Article . Conference object . Preprint . 2018
    Open Access English
    Authors: 
    Christoph Hube; Besnik Fetahu;
    Project: EC | AFEL (687916), EC | DESIR (731081), EC | ALEXANDRIA (339233)

    Biased language commonly occurs around topics which are of controversial nature, thus, stirring disagreement between the different involved parties of a discussion. This is due to the fact that for language and its use, specifically, the understanding and use of phrases, the stances are cohesive within the particular groups. However, such cohesiveness does not hold across groups. In collaborative environments or environments where impartial language is desired (e.g. Wikipedia, news media), statements and the language therein should represent equally the involved parties and be neutrally phrased. Biased language is introduced through the presence of inflammatory words or phrases, or statements that may be incorrect or one-sided, thus violating such consensus. In this work, we focus on the specific case of phrasing bias, which may be introduced through specific inflammatory words or phrases in a statement. For this purpose, we propose an approach that relies on a recurrent neural networks in order to capture the inter-dependencies between words in a phrase that introduced bias. We perform a thorough experimental evaluation, where we show the advantages of a neural based approach over competitors that rely on word lexicons and other hand-crafted features in detecting biased language. We are able to distinguish biased statements with a precision of P=0.92, thus significantly outperforming baseline models with an improvement of over 30%. Finally, we release the largest corpus of statements annotated for biased language. The Twelfth ACM International Conference on Web Search and Data Mining, February 11--15, 2019, Melbourne, VIC, Australia

  • Open Access English
    Authors: 
    Laurent Romary; Charles Riondet;
    Publisher: HAL CCSD
    Country: France
    Project: EC | EHRI (654164), EC | PARTHENOS (654119), EC | EHRI (261873)

    This article tackles the issue of integrating heterogeneous archival sources in one single data repository, namely the European Holocaust Research Infrastructure (EHRI) portal, whose aim is to support Holocaust research by providing online access to information about dispersed sources relating to the Holocaust (http://portal.ehri-project.eu). In this case, the problem at hand is to combine data coming from a network of archives in order to create an interoperable data space which can be used to search for, retrieve and disseminate content in the context of archival-based research. The scholarly purpose has specific consequences on our task. It assumes that the information made available to the researcher is as close as possible to the originating source in order to guarantee that the ensuing analysis can be deemed reliable. In the EHRI network of archives, as already observed in the case of the EU Cendari project, one cannot but face heterogeneity. The EHRI portal brings together descriptions from more than 1900 institutions. Each archive comes with a whole range of idiosyncrasies corresponding to the way it has been set up and evolved over time. Cataloging practices may also differ. Even the degree of digitization may range from the absence of a digital catalogue to the provision of a full-fledged online catalogue with all the necessary APIs for anyone to query and extract content. There is indeed a contrast here with the global endeavour at the international level to develop and promote standards for the description of archival content as a whole. Nonetheless, in a project like EHRI, standards should play a central role. They are necessary for many tasks related to the integration and exploitation of the aggregated content, namely: ● Being able to compare the content of the various sources, thus being able to develop quality-checking processes; ● Defining of an integrated repository infrastructure where the content of the various archival sources can be reliably hosted; ● Querying and re-using content in a seamless way; ● Deploying tools that have been developed independently of the specificities of the information sources, for instance in order to visualise or mine the resulting pool of information. The central aspect of the work described in this paper is the assessment of the role of the EAD (Encoded Archival Description) standard as the basis for achieving the tasks described above. We have worked out how we could develop a real strategy of defining specific customization of EAD that could be used at various stages of the process of integrating heterogeneous sources. While doing so, we have developed a methodology based on a specification and customization method inspired from the extensive experience of the Text Encoding Initiative (TEI) community. In the TEI framework, as we show in section 1, one has the possibility to model specific subsets or extensions of the TEI guidelines while maintaining both the technical (XML schemas) and editorial (documentation) content within a single framework. This work has led us quite far in anticipating that the method we have developed may be of a wider interest within similar environments, but also, as we believe, for the future maintenance of the EAD standard. Finally this work, successfully tested and implemented in the framework of EHRI [Riondet 2017], can be seen as part of the wider endeavour of European research infrastructures in the humanities such as CLARIN and DARIAH to provide support for researchers to integrate the use of standards in their scholarly practices. This is the reason why the general workflow studied here has been introduced as a use case in the umbrella infrastructure project PARTHENOS which aims, among other things, at disseminating information and resources about methodological and technical standards in the humanities.

  • Publication . Article . Preprint . 2018
    Open Access English
    Authors: 
    Nadia Boukhelifa; Michael Bryant; Natasa Bulatovic; Ivan Čukić; Jean-Daniel Fekete; Milica Knežević; Jörg Lehmann; David I. Stuart; Carsten Thiel;
    Publisher: HAL CCSD
    Countries: United Kingdom, France
    Project: EC | CENDARI (284432)

    International audience; The CENDARI infrastructure is a research-supporting platform designed to provide tools for transnational historical research, focusing on two topics: medieval culture and World War I. It exposes to the end users modern Web-based tools relying on a sophisticated infrastructure to collect, enrich, annotate, and search through large document corpora. Supporting researchers in their daily work is a novel concern for infrastructures. We describe how we gathered requirements through multiple methods to understand historians' needs and derive an abstract workflow to support them. We then outline the tools that we have built, tying their technical descriptions to the user requirements. The main tools are the note-taking environment and its faceted search capabilities; the data integration platform including the Data API, supporting semantic enrichment through entity recognition; and the environment supporting the software development processes throughout the project to keep both technical partners and researchers in the loop. The outcomes are technical together with new resources developed and gathered, and the research workflow that has been described and documented.

  • Other research product . Other ORP type . 2017
    Open Access English
    Authors: 
    Hollander, H.S.; Geser, Guntram;
    Publisher: ARIADNE
    Country: Netherlands
    Project: EC | ARIADNE (313193)

    This report addresses the ARIADNE project objectives and the framework of the impact evaluation, summarises the overall results, and gives recommendations for ARIADNE and other stakeholders with a focus on potential further advances.

  • Open Access Hungarian
    Authors: 
    unknown;
    Publisher: ELTE BTK Könyvtár- és Információtudományi Intézet
    Country: Hungary
    Project: EC | COLLMOT (227878)
  • Open Access
    Authors: 
    Monica Monachini; Francesca Frontini;
    Countries: Italy, France
    Project: EC | CLARIN (212230)

    National audience; Il 1° ottobre 2015 il MIUR firma l'adesione dell'Italia a CLARIN-ERIC, l'infrastruttura di ricerca che offre risorse e tecnologie linguistiche dedicate al settore delle scienze del linguaggio e delle scienze umane e sociali. Questo articolo intende fornire alla comunità italiana una ampia panoramica di CLARIN, la sua missione, i suoi pilastri, i servizi, la sua organizzazione tecnica ed amministrativa e la struttura di governance, sia a livello europeo che locale. Viene introdotto il network italiano, con il primo centro nazionale ILC4CLARIN, ospitato ed in via di sviluppo presso l'ILC-CNR, le funzionalità, le risorse ed i servizi offerti; viene presentato infine il primo nucleo del consorzio nazionale CLARIN-IT, illustrando i criteri di costituzione, le attività previste e le prospettive future.

  • Publication . Other literature type . Article . 2016
    Open Access
    Authors: 
    Birgit Schmidt; Astrid Orth; Gwen Franck; Iryna Kuchma; Petr Knoth; Jose Carvalho;
    Publisher: MDPI AG
    Countries: Germany, Portugal
    Project: EC | FOSTER (612425)

    Open science refers to all things open in research and scholarly communication: from publications and research data to code, models and methods as well as quality evaluation based on open peer review. However, getting started with implementing open science might not be as straightforward for all stakeholders. For example, what do research funders expect in terms of open access to publications and/or research data? Where and how to publish research data? How to ensure that research results are reproducible? These are all legitimate questions and, in particular, early career researchers may benefit from additional guidance and training. In this paper we review the activities of the European-funded FOSTER project which organized and supported a wide range of targeted trainings for open science, based on face-to-face events and on a growing suite of e-learning courses. This article reviews the approach and experiences gained from the first two years of the project. The FOSTER project has received funding from the European Union’s Seventh Framework Programme for research, technological development and demonstration under grant agreement no 612425. The authors gratefully acknowledge the contributions of all project partners to the design and implementation of the FOSTER project.

  • Publication . Other literature type . Other ORP type . 2016
    Open Access English
    Authors: 
    Lehmann, Jörg; Morselli, Francesca;
    Publisher: HAL CCSD
    Project: EC | CENDARI (284432)

    CENDARI Archival Research Guide; The subject “Science and Technology in the First World War” has so far been treated from the perspective on inventions and the development of new weapons, and often the focus has been on the topic of chemical warfare at the expense of other important dimensions. The approach of this Archival Research Guide, by contrast, comes from the social sciences and focuses on the establishment of relevant scientific, military and governmental bodies and on the personal networks established during the war. By examining these institutions and networks country by country, comparisons between them can be drawn, enabling further research with regard to the sociology of institutions. By pointing to the connections and channels of exchange between the nations and institutions under consideration, this approach opens up a transnational perspective and supports the paradoxal insight that transnational ties can dissolve national obstacles while simultaneously strengthening the nation-states themselves. On the individual level, the ARG takes the role of intellectuals into account, for whom scientific objectivity / neutrality and patriotic commitment seemed to have been no contradiction. It is remarkable that the First World War led to the establishment of several institutions aiming at funding science through the state, most notably in the case of France (CNRS), the U.S. (NACA/NASA) and Russia (KEPS).

  • Open Access English
    Authors: 
    Edmond, Jennifer; Morselli, Francesca;
    Publisher: CENDARI Project
    Country: Netherlands
    Project: EC | CENDARI (284432)

    The CENDARI project committed in its original description of work to the production of a Business Plan (Deliverable 2.4), outlining a method statement for ensuring that the outputs of the CENDARI project would be available to future users. The project team decided to rename this deliverable the CENDARI “Sustainability Plan” and this document sets out the plan which is based upon a multidimensional conceptualisation of what CENDARI is and the value of its assets, as well as on the fundamental understanding of a digital project as useless if it does not ‘evolve and involve.’ This document presents not only the actions that have been and will continue to be taken by the CENDARI team, but also interrogates the topic of sustainability of research infrastructures

  • Open Access English
    Authors: 
    Dorn, Amelie; Villalva, Aline; Giouli, Voula; Blanck, Wiebke; Kovalenko, Kira; Wandl-Vogt, Eveline;
    Publisher: Tbilisi University Press
    Country: Portugal
    Project: EC | ATHEME (613465)

    In this paper we present a case study on colour and emotion terms and their cultural references in the framework of the COST European Network of e-Lexicography (ENeL), working towards Pan-European lexicography. We take an initial use case of red in connection with emotions (anger) and look at its roots across different European languages, including Russian. Our data model offers the possibility of connecting these fields in the context of digital lexicography using markup for etymological information with description standards like ONTOLEX or TEI. This is particularly relevant for using and displaying such data on the European Dictionary Portal, potentially offering access to detailed diachronic and synchronic lexicographic knowledge across a variety of languages.

Advanced search in Research products
Research products
arrow_drop_down
Searching FieldsTerms
Any field
arrow_drop_down
includes
arrow_drop_down
Include:
The following results are related to DARIAH EU. Are you interested to view more results? Visit OpenAIRE - Explore.
20 Research products, page 1 of 2
  • Publication . Article . Conference object . Preprint . 2018
    Open Access English
    Authors: 
    Christoph Hube; Besnik Fetahu;
    Project: EC | AFEL (687916), EC | DESIR (731081), EC | ALEXANDRIA (339233)

    Biased language commonly occurs around topics which are of controversial nature, thus, stirring disagreement between the different involved parties of a discussion. This is due to the fact that for language and its use, specifically, the understanding and use of phrases, the stances are cohesive within the particular groups. However, such cohesiveness does not hold across groups. In collaborative environments or environments where impartial language is desired (e.g. Wikipedia, news media), statements and the language therein should represent equally the involved parties and be neutrally phrased. Biased language is introduced through the presence of inflammatory words or phrases, or statements that may be incorrect or one-sided, thus violating such consensus. In this work, we focus on the specific case of phrasing bias, which may be introduced through specific inflammatory words or phrases in a statement. For this purpose, we propose an approach that relies on a recurrent neural networks in order to capture the inter-dependencies between words in a phrase that introduced bias. We perform a thorough experimental evaluation, where we show the advantages of a neural based approach over competitors that rely on word lexicons and other hand-crafted features in detecting biased language. We are able to distinguish biased statements with a precision of P=0.92, thus significantly outperforming baseline models with an improvement of over 30%. Finally, we release the largest corpus of statements annotated for biased language. The Twelfth ACM International Conference on Web Search and Data Mining, February 11--15, 2019, Melbourne, VIC, Australia

  • Open Access English
    Authors: 
    Laurent Romary; Charles Riondet;
    Publisher: HAL CCSD
    Country: France
    Project: EC | EHRI (654164), EC | PARTHENOS (654119), EC | EHRI (261873)

    This article tackles the issue of integrating heterogeneous archival sources in one single data repository, namely the European Holocaust Research Infrastructure (EHRI) portal, whose aim is to support Holocaust research by providing online access to information about dispersed sources relating to the Holocaust (http://portal.ehri-project.eu). In this case, the problem at hand is to combine data coming from a network of archives in order to create an interoperable data space which can be used to search for, retrieve and disseminate content in the context of archival-based research. The scholarly purpose has specific consequences on our task. It assumes that the information made available to the researcher is as close as possible to the originating source in order to guarantee that the ensuing analysis can be deemed reliable. In the EHRI network of archives, as already observed in the case of the EU Cendari project, one cannot but face heterogeneity. The EHRI portal brings together descriptions from more than 1900 institutions. Each archive comes with a whole range of idiosyncrasies corresponding to the way it has been set up and evolved over time. Cataloging practices may also differ. Even the degree of digitization may range from the absence of a digital catalogue to the provision of a full-fledged online catalogue with all the necessary APIs for anyone to query and extract content. There is indeed a contrast here with the global endeavour at the international level to develop and promote standards for the description of archival content as a whole. Nonetheless, in a project like EHRI, standards should play a central role. They are necessary for many tasks related to the integration and exploitation of the aggregated content, namely: ● Being able to compare the content of the various sources, thus being able to develop quality-checking processes; ● Defining of an integrated repository infrastructure where the content of the various archival sources can be reliably hosted; ● Querying and re-using content in a seamless way; ● Deploying tools that have been developed independently of the specificities of the information sources, for instance in order to visualise or mine the resulting pool of information. The central aspect of the work described in this paper is the assessment of the role of the EAD (Encoded Archival Description) standard as the basis for achieving the tasks described above. We have worked out how we could develop a real strategy of defining specific customization of EAD that could be used at various stages of the process of integrating heterogeneous sources. While doing so, we have developed a methodology based on a specification and customization method inspired from the extensive experience of the Text Encoding Initiative (TEI) community. In the TEI framework, as we show in section 1, one has the possibility to model specific subsets or extensions of the TEI guidelines while maintaining both the technical (XML schemas) and editorial (documentation) content within a single framework. This work has led us quite far in anticipating that the method we have developed may be of a wider interest within similar environments, but also, as we believe, for the future maintenance of the EAD standard. Finally this work, successfully tested and implemented in the framework of EHRI [Riondet 2017], can be seen as part of the wider endeavour of European research infrastructures in the humanities such as CLARIN and DARIAH to provide support for researchers to integrate the use of standards in their scholarly practices. This is the reason why the general workflow studied here has been introduced as a use case in the umbrella infrastructure project PARTHENOS which aims, among other things, at disseminating information and resources about methodological and technical standards in the humanities.

  • Publication . Article . Preprint . 2018
    Open Access English
    Authors: 
    Nadia Boukhelifa; Michael Bryant; Natasa Bulatovic; Ivan Čukić; Jean-Daniel Fekete; Milica Knežević; Jörg Lehmann; David I. Stuart; Carsten Thiel;
    Publisher: HAL CCSD
    Countries: United Kingdom, France
    Project: EC | CENDARI (284432)

    International audience; The CENDARI infrastructure is a research-supporting platform designed to provide tools for transnational historical research, focusing on two topics: medieval culture and World War I. It exposes to the end users modern Web-based tools relying on a sophisticated infrastructure to collect, enrich, annotate, and search through large document corpora. Supporting researchers in their daily work is a novel concern for infrastructures. We describe how we gathered requirements through multiple methods to understand historians' needs and derive an abstract workflow to support them. We then outline the tools that we have built, tying their technical descriptions to the user requirements. The main tools are the note-taking environment and its faceted search capabilities; the data integration platform including the Data API, supporting semantic enrichment through entity recognition; and the environment supporting the software development processes throughout the project to keep both technical partners and researchers in the loop. The outcomes are technical together with new resources developed and gathered, and the research workflow that has been described and documented.

  • Other research product . Other ORP type . 2017
    Open Access English
    Authors: 
    Hollander, H.S.; Geser, Guntram;
    Publisher: ARIADNE
    Country: Netherlands
    Project: EC | ARIADNE (313193)

    This report addresses the ARIADNE project objectives and the framework of the impact evaluation, summarises the overall results, and gives recommendations for ARIADNE and other stakeholders with a focus on potential further advances.

  • Open Access Hungarian
    Authors: 
    unknown;
    Publisher: ELTE BTK Könyvtár- és Információtudományi Intézet
    Country: Hungary
    Project: EC | COLLMOT (227878)
  • Open Access
    Authors: 
    Monica Monachini; Francesca Frontini;
    Countries: Italy, France
    Project: EC | CLARIN (212230)

    National audience; Il 1° ottobre 2015 il MIUR firma l'adesione dell'Italia a CLARIN-ERIC, l'infrastruttura di ricerca che offre risorse e tecnologie linguistiche dedicate al settore delle scienze del linguaggio e delle scienze umane e sociali. Questo articolo intende fornire alla comunità italiana una ampia panoramica di CLARIN, la sua missione, i suoi pilastri, i servizi, la sua organizzazione tecnica ed amministrativa e la struttura di governance, sia a livello europeo che locale. Viene introdotto il network italiano, con il primo centro nazionale ILC4CLARIN, ospitato ed in via di sviluppo presso l'ILC-CNR, le funzionalità, le risorse ed i servizi offerti; viene presentato infine il primo nucleo del consorzio nazionale CLARIN-IT, illustrando i criteri di costituzione, le attività previste e le prospettive future.

  • Publication . Other literature type . Article . 2016
    Open Access
    Authors: 
    Birgit Schmidt; Astrid Orth; Gwen Franck; Iryna Kuchma; Petr Knoth; Jose Carvalho;
    Publisher: MDPI AG
    Countries: Germany, Portugal
    Project: EC | FOSTER (612425)

    Open science refers to all things open in research and scholarly communication: from publications and research data to code, models and methods as well as quality evaluation based on open peer review. However, getting started with implementing open science might not be as straightforward for all stakeholders. For example, what do research funders expect in terms of open access to publications and/or research data? Where and how to publish research data? How to ensure that research results are reproducible? These are all legitimate questions and, in particular, early career researchers may benefit from additional guidance and training. In this paper we review the activities of the European-funded FOSTER project which organized and supported a wide range of targeted trainings for open science, based on face-to-face events and on a growing suite of e-learning courses. This article reviews the approach and experiences gained from the first two years of the project. The FOSTER project has received funding from the European Union’s Seventh Framework Programme for research, technological development and demonstration under grant agreement no 612425. The authors gratefully acknowledge the contributions of all project partners to the design and implementation of the FOSTER project.

  • Publication . Other literature type . Other ORP type . 2016
    Open Access English
    Authors: 
    Lehmann, Jörg; Morselli, Francesca;
    Publisher: HAL CCSD
    Project: EC | CENDARI (284432)

    CENDARI Archival Research Guide; The subject “Science and Technology in the First World War” has so far been treated from the perspective on inventions and the development of new weapons, and often the focus has been on the topic of chemical warfare at the expense of other important dimensions. The approach of this Archival Research Guide, by contrast, comes from the social sciences and focuses on the establishment of relevant scientific, military and governmental bodies and on the personal networks established during the war. By examining these institutions and networks country by country, comparisons between them can be drawn, enabling further research with regard to the sociology of institutions. By pointing to the connections and channels of exchange between the nations and institutions under consideration, this approach opens up a transnational perspective and supports the paradoxal insight that transnational ties can dissolve national obstacles while simultaneously strengthening the nation-states themselves. On the individual level, the ARG takes the role of intellectuals into account, for whom scientific objectivity / neutrality and patriotic commitment seemed to have been no contradiction. It is remarkable that the First World War led to the establishment of several institutions aiming at funding science through the state, most notably in the case of France (CNRS), the U.S. (NACA/NASA) and Russia (KEPS).

  • Open Access English
    Authors: 
    Edmond, Jennifer; Morselli, Francesca;
    Publisher: CENDARI Project
    Country: Netherlands
    Project: EC | CENDARI (284432)

    The CENDARI project committed in its original description of work to the production of a Business Plan (Deliverable 2.4), outlining a method statement for ensuring that the outputs of the CENDARI project would be available to future users. The project team decided to rename this deliverable the CENDARI “Sustainability Plan” and this document sets out the plan which is based upon a multidimensional conceptualisation of what CENDARI is and the value of its assets, as well as on the fundamental understanding of a digital project as useless if it does not ‘evolve and involve.’ This document presents not only the actions that have been and will continue to be taken by the CENDARI team, but also interrogates the topic of sustainability of research infrastructures

  • Open Access English
    Authors: 
    Dorn, Amelie; Villalva, Aline; Giouli, Voula; Blanck, Wiebke; Kovalenko, Kira; Wandl-Vogt, Eveline;
    Publisher: Tbilisi University Press
    Country: Portugal
    Project: EC | ATHEME (613465)

    In this paper we present a case study on colour and emotion terms and their cultural references in the framework of the COST European Network of e-Lexicography (ENeL), working towards Pan-European lexicography. We take an initial use case of red in connection with emotions (anger) and look at its roots across different European languages, including Russian. Our data model offers the possibility of connecting these fields in the context of digital lexicography using markup for etymological information with description standards like ONTOLEX or TEI. This is particularly relevant for using and displaying such data on the European Dictionary Portal, potentially offering access to detailed diachronic and synchronic lexicographic knowledge across a variety of languages.