Advanced search in Research products
Research products
arrow_drop_down
Searching FieldsTerms
Any field
arrow_drop_down
includes
arrow_drop_down
Include:
The following results are related to DARIAH EU. Are you interested to view more results? Visit OpenAIRE - Explore.
3 Research products, page 1 of 1

  • DARIAH EU
  • Publications
  • Research data
  • Research software
  • Other research products
  • 2013-2022
  • Preprint
  • DE
  • English
  • DARIAH EU

Relevance
arrow_drop_down
  • Publication . Article . Preprint . 2018
    Open Access English
    Authors: 
    Nadia Boukhelifa; Michael Bryant; Natasa Bulatovic; Ivan Čukić; Jean-Daniel Fekete; Milica Knežević; Jörg Lehmann; David I. Stuart; Carsten Thiel;
    Publisher: HAL CCSD
    Countries: France, United Kingdom
    Project: EC | CENDARI (284432)

    International audience; The CENDARI infrastructure is a research-supporting platform designed to provide tools for transnational historical research, focusing on two topics: medieval culture and World War I. It exposes to the end users modern Web-based tools relying on a sophisticated infrastructure to collect, enrich, annotate, and search through large document corpora. Supporting researchers in their daily work is a novel concern for infrastructures. We describe how we gathered requirements through multiple methods to understand historians' needs and derive an abstract workflow to support them. We then outline the tools that we have built, tying their technical descriptions to the user requirements. The main tools are the note-taking environment and its faceted search capabilities; the data integration platform including the Data API, supporting semantic enrichment through entity recognition; and the environment supporting the software development processes throughout the project to keep both technical partners and researchers in the loop. The outcomes are technical together with new resources developed and gathered, and the research workflow that has been described and documented.

  • Open Access English
    Authors: 
    Jacobs, Arthur M.;

    This paper describes a corpus of about 3000 English literary texts with about 250 million words extracted from the Gutenberg project that span a range of genres from both fiction and non-fiction written by more than 130 authors (e.g., Darwin, Dickens, Shakespeare). Quantitative Narrative Analysis (QNA) is used to explore a cleaned subcorpus, the Gutenberg English Poetry Corpus (GEPC) which comprises over 100 poetic texts with around 2 million words from about 50 authors (e.g., Keats, Joyce, Wordsworth). Some exemplary QNA studies show author similarities based on latent semantic analysis, significant topics for each author or various text-analytic metrics for George Eliot's poem 'How Lisa Loved the King' and James Joyce's 'Chamber Music', concerning e.g. lexical diversity or sentiment analysis. The GEPC is particularly suited for research in Digital Humanities, Natural Language Processing or Neurocognitive Poetics, e.g. as training and test corpus, or for stimulus development and control. 27 pages, 4 figures

  • Open Access English
    Authors: 
    Gimena del Rio Riande; Erzsébet Tóth-Czifra; Ulrike Wuttke; Yoann Moranville;
    Publisher: Preprints

    The digital transformation has initiated a paradigm shift in research and scholarly communication practices towards a more open scholarly culture. Although this transformation is slowly happening in the Digital Humanities field, open is not yet default. The article introduces the OpenMethods metablog, a community platform that highlights open research methods, tools, and practices within the context of the Digital Humanities by republishing open access content around methods and tools in various formats and languages. It also describes the platform’s technical infrastructure based on its requirements and main functionalities, and especially the collaborative content sourcing and editorial workflows. The article concludes with a discussion of the potentials of the OpenMethods metablog to overcome barriers towards open practices by focusing on inclusive, community sourced information based around opening up research processes and the challenges that need to be overcome to achieve its goals.

Advanced search in Research products
Research products
arrow_drop_down
Searching FieldsTerms
Any field
arrow_drop_down
includes
arrow_drop_down
Include:
The following results are related to DARIAH EU. Are you interested to view more results? Visit OpenAIRE - Explore.
3 Research products, page 1 of 1
  • Publication . Article . Preprint . 2018
    Open Access English
    Authors: 
    Nadia Boukhelifa; Michael Bryant; Natasa Bulatovic; Ivan Čukić; Jean-Daniel Fekete; Milica Knežević; Jörg Lehmann; David I. Stuart; Carsten Thiel;
    Publisher: HAL CCSD
    Countries: France, United Kingdom
    Project: EC | CENDARI (284432)

    International audience; The CENDARI infrastructure is a research-supporting platform designed to provide tools for transnational historical research, focusing on two topics: medieval culture and World War I. It exposes to the end users modern Web-based tools relying on a sophisticated infrastructure to collect, enrich, annotate, and search through large document corpora. Supporting researchers in their daily work is a novel concern for infrastructures. We describe how we gathered requirements through multiple methods to understand historians' needs and derive an abstract workflow to support them. We then outline the tools that we have built, tying their technical descriptions to the user requirements. The main tools are the note-taking environment and its faceted search capabilities; the data integration platform including the Data API, supporting semantic enrichment through entity recognition; and the environment supporting the software development processes throughout the project to keep both technical partners and researchers in the loop. The outcomes are technical together with new resources developed and gathered, and the research workflow that has been described and documented.

  • Open Access English
    Authors: 
    Jacobs, Arthur M.;

    This paper describes a corpus of about 3000 English literary texts with about 250 million words extracted from the Gutenberg project that span a range of genres from both fiction and non-fiction written by more than 130 authors (e.g., Darwin, Dickens, Shakespeare). Quantitative Narrative Analysis (QNA) is used to explore a cleaned subcorpus, the Gutenberg English Poetry Corpus (GEPC) which comprises over 100 poetic texts with around 2 million words from about 50 authors (e.g., Keats, Joyce, Wordsworth). Some exemplary QNA studies show author similarities based on latent semantic analysis, significant topics for each author or various text-analytic metrics for George Eliot's poem 'How Lisa Loved the King' and James Joyce's 'Chamber Music', concerning e.g. lexical diversity or sentiment analysis. The GEPC is particularly suited for research in Digital Humanities, Natural Language Processing or Neurocognitive Poetics, e.g. as training and test corpus, or for stimulus development and control. 27 pages, 4 figures

  • Open Access English
    Authors: 
    Gimena del Rio Riande; Erzsébet Tóth-Czifra; Ulrike Wuttke; Yoann Moranville;
    Publisher: Preprints

    The digital transformation has initiated a paradigm shift in research and scholarly communication practices towards a more open scholarly culture. Although this transformation is slowly happening in the Digital Humanities field, open is not yet default. The article introduces the OpenMethods metablog, a community platform that highlights open research methods, tools, and practices within the context of the Digital Humanities by republishing open access content around methods and tools in various formats and languages. It also describes the platform’s technical infrastructure based on its requirements and main functionalities, and especially the collaborative content sourcing and editorial workflows. The article concludes with a discussion of the potentials of the OpenMethods metablog to overcome barriers towards open practices by focusing on inclusive, community sourced information based around opening up research processes and the challenges that need to be overcome to achieve its goals.