Advanced search in Research products
Research products
arrow_drop_down
Searching FieldsTerms
Any field
arrow_drop_down
includes
arrow_drop_down
Include:
The following results are related to DARIAH EU. Are you interested to view more results? Visit OpenAIRE - Explore.
11 Research products, page 1 of 2

  • DARIAH EU
  • Publications
  • Research software
  • 2012-2021
  • Open Access
  • BE
  • IE
  • Mémoires en Sciences de l'Information et de la Communication
  • Lirias

10
arrow_drop_down
Date (most recent)
arrow_drop_down
  • Publication . Preprint . Conference object . Contribution for newspaper or weekly magazine . Article . 2020
    Open Access English
    Authors: 
    Rehm, Georg; Marheinecke, Katrin; Hegele, Stefanie; Piperidis, Stelios; Bontcheva, Kalina; Hajic, Jan; Choukri, Khalid; Vasiljevs, Andrejs; Backfried, Gerhard; Prinz, Christoph; +37 more
    Publisher: Zenodo
    Countries: France, Denmark
    Project: EC | X5gon (761758), SFI | ADAPT: Centre for Digital... (13/RC/2106), FCT | PINFRA/22117/2016 (PINFRA/22117/2016), EC | AI4EU (825619), EC | ELG (825627), EC | BDVe (732630)

    Multilingualism is a cultural cornerstone of Europe and firmly anchored in the European treaties including full language equality. However, language barriers impacting business, cross-lingual and cross-cultural communication are still omnipresent. Language Technologies (LTs) are a powerful means to break down these barriers. While the last decade has seen various initiatives that created a multitude of approaches and technologies tailored to Europe's specific needs, there is still an immense level of fragmentation. At the same time, AI has become an increasingly important concept in the European Information and Communication Technology area. For a few years now, AI, including many opportunities, synergies but also misconceptions, has been overshadowing every other topic. We present an overview of the European LT landscape, describing funding programmes, activities, actions and challenges in the different countries with regard to LT, including the current state of play in industry and the LT market. We present a brief overview of the main LT-related activities on the EU level in the last ten years and develop strategic guidance with regard to four key dimensions. Proceedings of the 12th Language Resources and Evaluation Conference (LREC 2020). To appear

  • Open Access English
    Authors: 
    Nicolas L; Verena Lyding; Borg C; Forascu C; Fort K; Zdravkova K; Kosem I; Čibej J; Š, Arhar Holdt; Millour A; +8 more
    Publisher: HAL CCSD
    Countries: France, France, Ireland

    International audience; We introduce in this paper a generic approach to combine implicit crowdsourcing and language learning in order to mass-produce language resources (LRs) for any language for which a crowd of language learners can be involved. We present the approach by explaining its core paradigm that consists in pairing specific types of LRs with specific exercises, by detailing both its strengths and challenges, and by discussing how much these challenges have been addressed at present. Accordingly, we also report on ongoing proof-of-concept efforts aiming at developing the first prototypical implementation of the approach in order to correct and extend an LR called ConceptNet based on the input crowdsourced from language learners. We then present an international network called the European Network for Combining Language Learning with Crowdsourcing Techniques (enetCollect) that provides the context to accelerate the implementation of the generic approach. Finally, we exemplify how it can be used in several language learning scenarios to produce a multitude of NLP resources and how it can therefore alleviate the long-standing NLP issue of the lack of LRs.

  • Open Access English
    Authors: 
    Raciti, Marco; Chambers, Sally;
    Publisher: HAL CCSD
    Country: France
    Project: EC | DESIR (731081)

    The DESIR Final Event took place in Zagreb on 6-7 November 2019, alongside the DARIAH General Assembly. The DESIR Final Event brought together the whole consortium and involved several Bodies of the ERIC to maximise the impact of the event. The aim of the meeting was two-fold. First, it was to opportunity to summarise the activities conducted throughout the project and showcase results towards the DARIAH community. Secondly, it proposed a wider discussion over sustainability and funding models with National Representatives prior to the General Assembly.Overall, 57 participants attended the event. The schedule has been structured to present the outcomes of each Work Package. As the project DESIR was divided into an evolving 6-dimensional process, main results can be summarised as follows:Dissemination and Innovation: the aim of this Work Package was to ensure a continuous dissemination of DARIAH service improvements, activities and research results within the relevant communities, academic stakeholders and other interested parties. Furthermore, the Work Package was in charge of organising main dissemination activities during the project and the DARIAH Beyond Europe workshop series (Stanford University, Library of Congress, National Library of Australia).Growth: this Work Package was concerned with the enlargement of the DARIAH membership targeting six countries (Czech Republic, Finland, Israel, Spain, Switzerland, United Kingdom). Membership could only be achieved in Czech Republic, the application was submitted and ratified by the General Assembly in November 2019. Nevertheless, in other countries, collaboration with DARIAH has been greatly strengthened and significant progress regarding DARIAH membership has been achieved.Technology: the Work Package delivered three demonstrators focused on bibliographical metadata. The demonstrators show the usage of tools for bibliographical metadata in various stages of the research process, e.g. extraction of entities, the collection and sorting of citations, visualisation of selected aspects of the data. These demonstrators were built with the involvement of the community of researchers through two Code Sprints organised in Berlin.Robustness: the aim of this Work Package was to strengthen the organisational structure of DARIAH. The activities undertaken had a multifaceted nature: the relationship with the user community was improved through the implementation of a centralised helpdesk; preliminary work towards the Marketplace was conducted; the EURISE Network (European Research Infrastructure Software Engineers' Network) was established with CLARIN and CESSDA; finally a renewed business plan building on existing strategic documents was developed.Trust: this Work Package contributed to DARIAH’s long-term sustainability by measuring acceptance and impact of DARIAH in new cross-disciplinary communities and core groups through a survey, in-depth qualitative interviews and strategic meetings. The analysedinformation collected from each source led to a recommendations and community engagement tool developed to share and translate the recommendations dataset for a broader audience.Teaching: the work carried out during the course of the project led to the implementation of a discovery framework and hosting platform for DARIAH learning resources, DARIAH-Campus. The platform was officially launched during the event. The goal of DARIAH-Campus is to widen access to open, inclusive, high-quality learning materials that aim to enhance creativity, skills, technology and knowledge in the digitally-enabled arts and humanities.

  • Open Access English
    Authors: 
    Van Der Eycken, Johan; Styven, Dorien; Gheldof, Tom; Depoortere, Rolande;
    Publisher: HAL CCSD
    Countries: Belgium, France

    This article shows that metadata plays a central role in our society and concludes that through collaborative work, it is possible to pool solutions and to establish relationships of cooperation, both at the level of practical tool development and with regard to sharing and creating knowledge and know-how. ispartof: ABB: Archives et Bibliothèques de Belgique - Archief- en Bibliotheekwezen in België vol:106 pages:135-144 status: published

  • Open Access English
    Authors: 
    Rizza, Ettore; Chardonnens, Anne; Van Hooland, Seth;
    Publisher: HAL CCSD
    Countries: France, Belgium

    More and more cultural institutions use Linked Data principles to share and connect their collection metadata. In the archival field, initiatives emerge to exploit data contained in archival descriptions and adapt encoding standards to the semantic web. In this context, online authority files can be used to enrich metadata. However, relying on a decentralized network of knowledge bases such as Wikidata, DBpedia or even Viaf has its own difficulties. This paper aims to offer a critical view of these linked authority files by adopting a close-reading approach. Through a practical case study, we intend to identify and illustrate the possibilities and limits of RDF triples compared to institutions' less structured metadata. Workshop "Dariah "Trust and Understanding: the value of metadata in a digitally joined-up world" (14/05/2018, Brussels), preprint of the submission to the journal "Archives et Biblioth\`eques de Belgique"

  • Open Access
    Authors: 
    Depoortere, Rolande; Gheldof, Tom; Styven, Dorien; Van der Eycken, Johan;
    Publisher: Koninklijke Bibliotheek van België
    Countries: France, Belgium

    Contemporary researchers are beginning to explore the possibilities and opportunities of digital humanities, but encounter major obstacles regarding (meta)data1. Many archival institutions lack the technology or the skills to process (meta)data, let alone share it. Different types of standards coexist and sometimes conflict with each other, while implementing the same standards often leads to slight differences which decrease interoperability. Storing, securing and making born-digital or digitized information available in a sustainable manner is a major challenge. Concepts such as metadata, Meta Information, Linked Open Data (LOD), Big Data… are on the rise, but their meaning and content — let alone their implications in terms of social impact — are seldom questioned. Archival institutions thus face a myriad of challenges when sharing (meta)data with the scientific community and when planning to preserve information for future generations while maintaining data authenticity2 . ispartof: Archief- en Bibliotheekwezen in België vol:106 pages:5-13 status: published

  • Publication . Article . 2019
    Open Access English
    Authors: 
    Truyen, Frederik;
    Publisher: HAL CCSD
    Country: Belgium

    pdf: https://hal.archives-ouvertes.fr/hal-02125052/file/TRUYEN_20190425_Trust_and_Understanding_vpub_125_136.pdf hal_id: hal-02125052 hal_version: v1 ispartof: ABB: Archives et Bibliothèques de Belgique - Archief- en Bibliotheekwezen in België vol:106 pages:123-134 status: published

  • Publication . Conference object . 2017
    Open Access English
    Authors: 
    Joke Daems; Sally Chambers; Zere, Tecle; Christophe Verbruggen;
    Publisher: HAL CCSD
    Countries: Belgium, France

    International audience; The digital text platform is part of the Flemish contribution to DARIAH Belgium (DARIAH = Digital Research Infrastructure for the Arts and Humanities). The goal is to create a platform for the collaborative management and discovery of digitised textual collections that allows digital humanities researchers to prepare their corpora (consisting of, for example, digitised newspapers and books) for textual analysis. The platform will enable researchers to browse and search the digitised collections compiled, cleaned, enriched and managed by the researchers themselves. Once the relevant research sub-corpus has been compiled, data export tools, using standardised open formats (such as XML, JSON, .csv, .txt, etc.) will enable researchers to export sub-corpus for analysis with existing digital text analysis tools such as MALLET, (http://mallet.cs.umass.edu/topics.php) for topic modelling, VOYANT (http://voyant-tools.org) for data visualisation or AntConC (http://www.laurenceanthony.net/software/antconc/) for concordance and textual analysis.The platform has been conceived as part of a larger and modular virtual research environment service infrastructure (http://www.ghentcdh.ugent.be/projects/dariah-vl_vre.si). In a previous phase, possible frameworks and content management systems were tested, notably Islandora (a digital asset management system based on Fedora Commons and Drupal), but also Mediawiki and Omeka.One of the main challenges of the envisaged new platform is the possibility to integrate a wider variety of possible textual data streams (including a scan workflow). In addition, user-friendliness, scalability, adherence to standards and facilitating the interoperability of data are key issues to be addressed. The platform will build on the existing IIIF format, the International Image Interoperability Framework. This format is used by some of the most important libraries and cultural heritage institutions in the world, therefore providing access to enormous collections of digital objects. As the name suggests, IIIF is mainly focused on displaying and annotating images. However, we fully endorse the IIIF-community’s vision to develop an overarching interoperability framework for other data types, including all kinds of textual data. Benefits of the format include the interoperability, the ease of sharing images and annotations without the need to exchange files, and its support for multilingual data. In the months leading up to the conference, we will evaluate the existing IIIFpowered digital libraries and research projects and how they deal with practices of co-creation, data cleaning and enrichment of (structural) metadata. OCR improvement will become vital, as digital textual analysis can only be performed well on high-quality textual data. A related challenge will be combining the various input formats and converting them to different output formats required for analysis. In our poster, we will present a summary of our experiences with and technical assessment of our previous Islandora installation, in addition to our survey of the existing corpus management solutions. As a way of conclusion, we will introduce the envisioned new version of the platform.

  • Publication . Part of book or chapter of book . Conference object . Other literature type . 2015
    Open Access English
    Authors: 
    Veerle Vanden Daelen; Jennifer Edmond; Petra Links; Mike Priddy; Linda Reijnhoudt; Vaclav Tollar; Annelies Van Nipsen;
    Publisher: HAL CCSD
    Countries: Belgium, France, France
    Project: EC | CENDARI (284432), EC | EHRI (654164), EC | EHRI (261873)

    International audience; One of the funded project proposals under DARIAH’s Open Humanities call 2015 was “Open History: Sustainable digital publishing of archival catalogues of twentieth-century history archives”. Based on the experiences of the Collaborative EuropeaN Digital Archival Research Infrastructure (CENDARI) and the European Holocaust Research Infrastructure (EHRI), the main goal of the “Open History” project was to enhance the dialogue between (meta-)data providers and research infrastructures. Integrating archival descriptions – when they were already available – held at a wide variety of twentieth-century history archives (from classic archives to memorial sites, libraries and private archives) into research infrastructures has proven to be a major challenge, which could not be done without some degree of limited to extensive pre-processing or other preparatory work. The “Open History” project organized two workshops and developed two tools: an easily accessible and general article on why the practice of standardization and sharing is important and how this can be achieved; and a model which provides checklists for self-analyses of archival institutions. The text that follows is the article we have developed. It intentionally remains at a general level, without much jargon, so that it can be easily read by those who are non-archivists or non-IT. Hence, we hope it will be easy to understand for both those who are describing the sources at various archives (with or without IT or archival sciences degrees), as well as decision-makers (directors and advisory boards) who wish to understand the benefits of investing in standardization and sharing of data. It is important to note is that this text is a first step, not a static, final result. Not all aspects about standardization and publication of (meta-)data are discussed, nor are updates or feedback mechanisms for annotations and comments discussed. The idea is that this text can be used in full or in part and that it will include further chapters and section updates as time goes by and as other communities begin using it. Some archives will read through much of these and see confirmation of what they have already been implementing; others – especially the smaller institutions, such as private memory institutions – will find this a low-key and hands-on introduction to help them in their efforts.

  • Open Access English
    Authors: 
    Vanholsbeeck, Marc; Engels, Tim; Istenič Starčič, Andreja;
    Publisher: HAL CCSD
    Countries: Belgium, France

    info:eu-repo/semantics/inPress

Advanced search in Research products
Research products
arrow_drop_down
Searching FieldsTerms
Any field
arrow_drop_down
includes
arrow_drop_down
Include:
The following results are related to DARIAH EU. Are you interested to view more results? Visit OpenAIRE - Explore.
11 Research products, page 1 of 2
  • Publication . Preprint . Conference object . Contribution for newspaper or weekly magazine . Article . 2020
    Open Access English
    Authors: 
    Rehm, Georg; Marheinecke, Katrin; Hegele, Stefanie; Piperidis, Stelios; Bontcheva, Kalina; Hajic, Jan; Choukri, Khalid; Vasiljevs, Andrejs; Backfried, Gerhard; Prinz, Christoph; +37 more
    Publisher: Zenodo
    Countries: France, Denmark
    Project: EC | X5gon (761758), SFI | ADAPT: Centre for Digital... (13/RC/2106), FCT | PINFRA/22117/2016 (PINFRA/22117/2016), EC | AI4EU (825619), EC | ELG (825627), EC | BDVe (732630)

    Multilingualism is a cultural cornerstone of Europe and firmly anchored in the European treaties including full language equality. However, language barriers impacting business, cross-lingual and cross-cultural communication are still omnipresent. Language Technologies (LTs) are a powerful means to break down these barriers. While the last decade has seen various initiatives that created a multitude of approaches and technologies tailored to Europe's specific needs, there is still an immense level of fragmentation. At the same time, AI has become an increasingly important concept in the European Information and Communication Technology area. For a few years now, AI, including many opportunities, synergies but also misconceptions, has been overshadowing every other topic. We present an overview of the European LT landscape, describing funding programmes, activities, actions and challenges in the different countries with regard to LT, including the current state of play in industry and the LT market. We present a brief overview of the main LT-related activities on the EU level in the last ten years and develop strategic guidance with regard to four key dimensions. Proceedings of the 12th Language Resources and Evaluation Conference (LREC 2020). To appear

  • Open Access English
    Authors: 
    Nicolas L; Verena Lyding; Borg C; Forascu C; Fort K; Zdravkova K; Kosem I; Čibej J; Š, Arhar Holdt; Millour A; +8 more
    Publisher: HAL CCSD
    Countries: France, France, Ireland

    International audience; We introduce in this paper a generic approach to combine implicit crowdsourcing and language learning in order to mass-produce language resources (LRs) for any language for which a crowd of language learners can be involved. We present the approach by explaining its core paradigm that consists in pairing specific types of LRs with specific exercises, by detailing both its strengths and challenges, and by discussing how much these challenges have been addressed at present. Accordingly, we also report on ongoing proof-of-concept efforts aiming at developing the first prototypical implementation of the approach in order to correct and extend an LR called ConceptNet based on the input crowdsourced from language learners. We then present an international network called the European Network for Combining Language Learning with Crowdsourcing Techniques (enetCollect) that provides the context to accelerate the implementation of the generic approach. Finally, we exemplify how it can be used in several language learning scenarios to produce a multitude of NLP resources and how it can therefore alleviate the long-standing NLP issue of the lack of LRs.

  • Open Access English
    Authors: 
    Raciti, Marco; Chambers, Sally;
    Publisher: HAL CCSD
    Country: France
    Project: EC | DESIR (731081)

    The DESIR Final Event took place in Zagreb on 6-7 November 2019, alongside the DARIAH General Assembly. The DESIR Final Event brought together the whole consortium and involved several Bodies of the ERIC to maximise the impact of the event. The aim of the meeting was two-fold. First, it was to opportunity to summarise the activities conducted throughout the project and showcase results towards the DARIAH community. Secondly, it proposed a wider discussion over sustainability and funding models with National Representatives prior to the General Assembly.Overall, 57 participants attended the event. The schedule has been structured to present the outcomes of each Work Package. As the project DESIR was divided into an evolving 6-dimensional process, main results can be summarised as follows:Dissemination and Innovation: the aim of this Work Package was to ensure a continuous dissemination of DARIAH service improvements, activities and research results within the relevant communities, academic stakeholders and other interested parties. Furthermore, the Work Package was in charge of organising main dissemination activities during the project and the DARIAH Beyond Europe workshop series (Stanford University, Library of Congress, National Library of Australia).Growth: this Work Package was concerned with the enlargement of the DARIAH membership targeting six countries (Czech Republic, Finland, Israel, Spain, Switzerland, United Kingdom). Membership could only be achieved in Czech Republic, the application was submitted and ratified by the General Assembly in November 2019. Nevertheless, in other countries, collaboration with DARIAH has been greatly strengthened and significant progress regarding DARIAH membership has been achieved.Technology: the Work Package delivered three demonstrators focused on bibliographical metadata. The demonstrators show the usage of tools for bibliographical metadata in various stages of the research process, e.g. extraction of entities, the collection and sorting of citations, visualisation of selected aspects of the data. These demonstrators were built with the involvement of the community of researchers through two Code Sprints organised in Berlin.Robustness: the aim of this Work Package was to strengthen the organisational structure of DARIAH. The activities undertaken had a multifaceted nature: the relationship with the user community was improved through the implementation of a centralised helpdesk; preliminary work towards the Marketplace was conducted; the EURISE Network (European Research Infrastructure Software Engineers' Network) was established with CLARIN and CESSDA; finally a renewed business plan building on existing strategic documents was developed.Trust: this Work Package contributed to DARIAH’s long-term sustainability by measuring acceptance and impact of DARIAH in new cross-disciplinary communities and core groups through a survey, in-depth qualitative interviews and strategic meetings. The analysedinformation collected from each source led to a recommendations and community engagement tool developed to share and translate the recommendations dataset for a broader audience.Teaching: the work carried out during the course of the project led to the implementation of a discovery framework and hosting platform for DARIAH learning resources, DARIAH-Campus. The platform was officially launched during the event. The goal of DARIAH-Campus is to widen access to open, inclusive, high-quality learning materials that aim to enhance creativity, skills, technology and knowledge in the digitally-enabled arts and humanities.

  • Open Access English
    Authors: 
    Van Der Eycken, Johan; Styven, Dorien; Gheldof, Tom; Depoortere, Rolande;
    Publisher: HAL CCSD
    Countries: Belgium, France

    This article shows that metadata plays a central role in our society and concludes that through collaborative work, it is possible to pool solutions and to establish relationships of cooperation, both at the level of practical tool development and with regard to sharing and creating knowledge and know-how. ispartof: ABB: Archives et Bibliothèques de Belgique - Archief- en Bibliotheekwezen in België vol:106 pages:135-144 status: published

  • Open Access English
    Authors: 
    Rizza, Ettore; Chardonnens, Anne; Van Hooland, Seth;
    Publisher: HAL CCSD
    Countries: France, Belgium

    More and more cultural institutions use Linked Data principles to share and connect their collection metadata. In the archival field, initiatives emerge to exploit data contained in archival descriptions and adapt encoding standards to the semantic web. In this context, online authority files can be used to enrich metadata. However, relying on a decentralized network of knowledge bases such as Wikidata, DBpedia or even Viaf has its own difficulties. This paper aims to offer a critical view of these linked authority files by adopting a close-reading approach. Through a practical case study, we intend to identify and illustrate the possibilities and limits of RDF triples compared to institutions' less structured metadata. Workshop "Dariah "Trust and Understanding: the value of metadata in a digitally joined-up world" (14/05/2018, Brussels), preprint of the submission to the journal "Archives et Biblioth\`eques de Belgique"

  • Open Access
    Authors: 
    Depoortere, Rolande; Gheldof, Tom; Styven, Dorien; Van der Eycken, Johan;
    Publisher: Koninklijke Bibliotheek van België
    Countries: France, Belgium

    Contemporary researchers are beginning to explore the possibilities and opportunities of digital humanities, but encounter major obstacles regarding (meta)data1. Many archival institutions lack the technology or the skills to process (meta)data, let alone share it. Different types of standards coexist and sometimes conflict with each other, while implementing the same standards often leads to slight differences which decrease interoperability. Storing, securing and making born-digital or digitized information available in a sustainable manner is a major challenge. Concepts such as metadata, Meta Information, Linked Open Data (LOD), Big Data… are on the rise, but their meaning and content — let alone their implications in terms of social impact — are seldom questioned. Archival institutions thus face a myriad of challenges when sharing (meta)data with the scientific community and when planning to preserve information for future generations while maintaining data authenticity2 . ispartof: Archief- en Bibliotheekwezen in België vol:106 pages:5-13 status: published

  • Publication . Article . 2019
    Open Access English
    Authors: 
    Truyen, Frederik;
    Publisher: HAL CCSD
    Country: Belgium

    pdf: https://hal.archives-ouvertes.fr/hal-02125052/file/TRUYEN_20190425_Trust_and_Understanding_vpub_125_136.pdf hal_id: hal-02125052 hal_version: v1 ispartof: ABB: Archives et Bibliothèques de Belgique - Archief- en Bibliotheekwezen in België vol:106 pages:123-134 status: published

  • Publication . Conference object . 2017
    Open Access English
    Authors: 
    Joke Daems; Sally Chambers; Zere, Tecle; Christophe Verbruggen;
    Publisher: HAL CCSD
    Countries: Belgium, France

    International audience; The digital text platform is part of the Flemish contribution to DARIAH Belgium (DARIAH = Digital Research Infrastructure for the Arts and Humanities). The goal is to create a platform for the collaborative management and discovery of digitised textual collections that allows digital humanities researchers to prepare their corpora (consisting of, for example, digitised newspapers and books) for textual analysis. The platform will enable researchers to browse and search the digitised collections compiled, cleaned, enriched and managed by the researchers themselves. Once the relevant research sub-corpus has been compiled, data export tools, using standardised open formats (such as XML, JSON, .csv, .txt, etc.) will enable researchers to export sub-corpus for analysis with existing digital text analysis tools such as MALLET, (http://mallet.cs.umass.edu/topics.php) for topic modelling, VOYANT (http://voyant-tools.org) for data visualisation or AntConC (http://www.laurenceanthony.net/software/antconc/) for concordance and textual analysis.The platform has been conceived as part of a larger and modular virtual research environment service infrastructure (http://www.ghentcdh.ugent.be/projects/dariah-vl_vre.si). In a previous phase, possible frameworks and content management systems were tested, notably Islandora (a digital asset management system based on Fedora Commons and Drupal), but also Mediawiki and Omeka.One of the main challenges of the envisaged new platform is the possibility to integrate a wider variety of possible textual data streams (including a scan workflow). In addition, user-friendliness, scalability, adherence to standards and facilitating the interoperability of data are key issues to be addressed. The platform will build on the existing IIIF format, the International Image Interoperability Framework. This format is used by some of the most important libraries and cultural heritage institutions in the world, therefore providing access to enormous collections of digital objects. As the name suggests, IIIF is mainly focused on displaying and annotating images. However, we fully endorse the IIIF-community’s vision to develop an overarching interoperability framework for other data types, including all kinds of textual data. Benefits of the format include the interoperability, the ease of sharing images and annotations without the need to exchange files, and its support for multilingual data. In the months leading up to the conference, we will evaluate the existing IIIFpowered digital libraries and research projects and how they deal with practices of co-creation, data cleaning and enrichment of (structural) metadata. OCR improvement will become vital, as digital textual analysis can only be performed well on high-quality textual data. A related challenge will be combining the various input formats and converting them to different output formats required for analysis. In our poster, we will present a summary of our experiences with and technical assessment of our previous Islandora installation, in addition to our survey of the existing corpus management solutions. As a way of conclusion, we will introduce the envisioned new version of the platform.

  • Publication . Part of book or chapter of book . Conference object . Other literature type . 2015
    Open Access English
    Authors: 
    Veerle Vanden Daelen; Jennifer Edmond; Petra Links; Mike Priddy; Linda Reijnhoudt; Vaclav Tollar; Annelies Van Nipsen;
    Publisher: HAL CCSD
    Countries: Belgium, France, France
    Project: EC | CENDARI (284432), EC | EHRI (654164), EC | EHRI (261873)

    International audience; One of the funded project proposals under DARIAH’s Open Humanities call 2015 was “Open History: Sustainable digital publishing of archival catalogues of twentieth-century history archives”. Based on the experiences of the Collaborative EuropeaN Digital Archival Research Infrastructure (CENDARI) and the European Holocaust Research Infrastructure (EHRI), the main goal of the “Open History” project was to enhance the dialogue between (meta-)data providers and research infrastructures. Integrating archival descriptions – when they were already available – held at a wide variety of twentieth-century history archives (from classic archives to memorial sites, libraries and private archives) into research infrastructures has proven to be a major challenge, which could not be done without some degree of limited to extensive pre-processing or other preparatory work. The “Open History” project organized two workshops and developed two tools: an easily accessible and general article on why the practice of standardization and sharing is important and how this can be achieved; and a model which provides checklists for self-analyses of archival institutions. The text that follows is the article we have developed. It intentionally remains at a general level, without much jargon, so that it can be easily read by those who are non-archivists or non-IT. Hence, we hope it will be easy to understand for both those who are describing the sources at various archives (with or without IT or archival sciences degrees), as well as decision-makers (directors and advisory boards) who wish to understand the benefits of investing in standardization and sharing of data. It is important to note is that this text is a first step, not a static, final result. Not all aspects about standardization and publication of (meta-)data are discussed, nor are updates or feedback mechanisms for annotations and comments discussed. The idea is that this text can be used in full or in part and that it will include further chapters and section updates as time goes by and as other communities begin using it. Some archives will read through much of these and see confirmation of what they have already been implementing; others – especially the smaller institutions, such as private memory institutions – will find this a low-key and hands-on introduction to help them in their efforts.

  • Open Access English
    Authors: 
    Vanholsbeeck, Marc; Engels, Tim; Istenič Starčič, Andreja;
    Publisher: HAL CCSD
    Countries: Belgium, France

    info:eu-repo/semantics/inPress