Advanced search in Research products
Research products
arrow_drop_down
Searching FieldsTerms
Any field
arrow_drop_down
includes
arrow_drop_down
The following results are related to DARIAH EU. Are you interested to view more results? Visit OpenAIRE - Explore.
90 Research products

  • DARIAH EU
  • Publications
  • Research data
  • FR
  • English
  • Hal-Diderot
  • Mémoires en Sciences de l'Information et de la Communication
  • Hyper Article en Ligne - Sciences de l'Homme et de la Société
  • Hyper Article en Ligne

10
arrow_drop_down
Relevance
arrow_drop_down
  • Authors: Romary, Laurent; Puren, Marie;

    International audience; Le projet européen Iperion regroupe un ensemble d'acteurs européens offrant des services d'infrastructure pour l'étude du patrimoine matériel sous la forme d'équipements fixes ou mobiles. Ces différents services génèrent potentiellement de grandes quantités de données qu'il est nécessaire de gérer et documenter. En particulier, il semble utile de travailler à la constitution d'un réservoir de telles données qui soit consultable par une large communauté de chercheurs, notamment en sciences humaines. On peut ainsi penser au rôle que peuvent jouer des analyses précises d'une oeuvre pour un historien des arts qui souhaite étudier l'évolution de la technique d'un peintre par exemple. La mise en place d'une telle infrastructure de données réutilisables dans le domaine du patrimoine matériel se heurte cependant à plusieurs difficultés que nous essayons de réduire au sein du projet Iperion. Tout d'abord, il n'est pas nécessairement dans la culture du déploiement des équipements eux-mêmes d'envisager une réutilisation large des données. Le scénario de base est souvent celui d'un chercheur qui va conduire une analyse ciblée d'un objet patrimonial, pour ensuite exploiter lui-même les résultats correspondants et passer à l'analyse suivante, sans se préoccuper d'une réutilisation des données produites. Ensuite, du point de vue des formats de données, on observe l'absence de réels standards de représentation communs aux différents types d'équipements. On se retrouve ainsi à devoir gérer des données propriétaires qui dépendent principalement des constructeurs des équipements. Enfin, se posele problème complexe des droits d'utilisation qui combinent un ensemble de difficultés liées au statut des oeuvres elles-mêmes, aux règles régissant l'équipement, mais aussi à la volonté de partage du chercheur qui a effectué le recueil initial des données. Dans ce cadre, notre objectif est de mettre en place une démarche d'analyse de l'état des lieux et de proposition de principes communs de gestion des données au sein du projet. Il s'agirait ainsi de préfigurer une charte de gestion des données applicable à la future infrastructure européenne E-RIHS, en collaboration avec l'infrastructure numérique DARIAH en sciences humaines. Nous avons ainsi recueilli les réponses des différents partenaires du projet concernant à la fois les modes de gestion des équipements, et le statut des jeux de données disponibles. La variété des réponses obtenues montre déjà que seules des recommandations génériques pourront être produites à l'échelle européenne, et nous esquisserons quelques propositions dans ce sens. Laurent Romary est directeur de recherche à Inria où il mène des recherches dans le domaine des humanités numériques et plus particulièrement sur la modélisation et la représentation de données en sciences humaines et sociales. Depuis plusieurs années, il a contribué à la définition des politiques d'information scientifique du CNRS, de la société Max Planck et d'Inria, où il a contribué notamment à la définition d'une obligation de dépôt en archives ouvertes dans HAL. Il a aussi participé de longue date à la définition et à l'évolution des directives de la TEI (Text Encoding Initiative), notamment comme membre, mais aussi comme président du conseil technique de la TEI, et préside le comité 37 de l'ISO (Organisation international de normalisation). Il dirige l'infrastructure Européenne DARIAH pour le développement de méthodes numériques en sciences humaines et sociales. https://cv.archives-ouvertes.fr/laurentromary

    0
    citations0
    popularityAverage
    influenceAverage
    impulseAverage
    BIP!Powered by BIP!
    more_vert
  • Authors: Saarti, Jarmo; Ropponen, Jari; Soivanen, Satu;

    International audience; The Karjala database contains digitized demographic data of the parish registers from the regions ceded to the Soviet Union in 1944. The objectives of the digitization project have been to promote access to digitized records for scientific research and genealogy as well as encouraging research on the people of the ceded Karelia region. The main sources for the database have been catechetical lists, lists of children, and registers of vital statistics (registers of births, marriages, migrations and deaths) that are available in Digital Archives of the National Archives of Finland from the period of 1681 – 1949. The data in the database amounts to about 10.3 million entries, but only data older than 100 years is published openly on the Internet. According to decisions by the Finnish data protection authorities, the Personal Data Act is applied to personal registers less than 100 years old. The digitization process is still going on; it has been calculated that there are 1.2 million entries still to be processed. The database is available to users via https://katiha.mamk.fi/. At present, there are about 6.5 million file entries available on the Internet, each presenting data about one individual, e.g. names, the date of birth and death, the cause of death, age, gender, marital status, occupation, residence, migration, the parish. The Karjala database can be exploited for diverse research purposes; it improves access to the church records that are sometimes very difficult to read. Information in the database can be utilized for historical research, medical genetics, social sciences, and family and onomastics. The database is can be utilized for clarifying family structures, migratory patterns or child mortality. The database also offers excellent opportunities for interdisciplinary research. Our presentation will describe the digitization process management of old, handwritten documents that consist of non-structured data from a historical period that contains varied linguistic material: several languages from a historical period where nations, states and languages were still evolving, different calendars and spelling rules etc. We will also introduce our plans to use text recognition technology so that the handwritten documents such as the Karjala database will be incorporated into the international READ project network http://read.transkribus.eu/network/. We will also discuss the challenges encountered in this type of heterogeneous data and the possibilities for more defined and structured data management that could enable the automated use of the database. We will also include in our presentation a description of the evolution of the different phases of the database, emphasizing the evolution of the database and its linkage with internet technologies e.g. how they have either hindered or enabled the digitization project.

    0
    citations0
    popularityAverage
    influenceAverage
    impulseAverage
    BIP!Powered by BIP!
    more_vert
  • Authors: Hinrichs, Erhard;

    International audience; CLARIN-D stand on sustainability in a national and international context

    0
    citations0
    popularityAverage
    influenceAverage
    impulseAverage
    BIP!Powered by BIP!
    more_vert
  • Authors: Zhang, Jing; Lin, Jiaping; Xiao, Peng;

    International audience

    0
    citations0
    popularityAverage
    influenceAverage
    impulseAverage
    BIP!Powered by BIP!
    more_vert
  • Authors: Romary, Laurent; Seillier, Dorian; Tóth-Czifra, Erzsébet;

    A defining feature of data and data workflows in the arts and humanities domain is their dependence on cultural heritage sources hosted and curated in museums, libraries, galleries and archives. A major difficulty when scholars interact with heritage data is that the nature of the cooperation between researchers and Cultural Heritage Institutions and the researchers working in CHIs (henceforth CHIs) is often constrained by structural and legal challenges but even more by uncertainties as to the expectations of both parties.This recognition led several European organizations such as APEF, CLARIN, Europeana, E-RIHS to come together and join forces under the governance of DARIAH to set up principles and mechanisms for improving the conditions for the use and re-use of cultural heritage data issued by cultural heritage institutions and studied and enriched by researchers. As a first step of this joint effort is the Heritage Data Reuse Charter (https://datacharter.hypotheses.org/) establishes 6 basic principles for improving the use and re-use of cultural heritage resources by researchers and , to help all the relevant actors to work together to connect and improve access to heritage data. These are: Reciprocity, Interoperability, Citability, Openness, Stewardship and Trustworthiness.As a further step in translating these principles to actual data workflows the survey below serves as a template to frame exchanges around cultural heritage data by enabling both Cultural Heritage Institutions, infrastructure providers and researchers and to clarify their goals at the beginning and the project, to specify access to data, provenance information, preferred citation standards, hosting responsibilities etc. on the basis of which the parties can arrive at mutual reuse agreements that could serve as a starting point for a FAIR-by-construction data management, right from the project planning/application phase. In practice, the survey below can be flexibly applied in platform-independent ways in exchange protocols between Cultural Heritage Institutions and researchers, Institutions who sign the Charter could use it (and expect to use such surveys) in their own exchange protocols. Another direction of future developments is to set up a platform dedicated to such exchanges. On the other hand, researchers are encouraged to contact the CHIs during the initial stages of their project in order to explain their plans and figure details of transaction together. This mutual declaration can later be a powerful component in their Data Management Plans as it shows evidence for responsible and fair conduct of cultural heritage data, and fair (but also FAIR) research data management practices that are based on partnership with the holding institution. As enclosing a Research Data Management Plan to grant applications is becoming a more and more common requirement among research funders, we need to raise the funders’ awareness to the fact that such bi- or trilateral agreements and data reuse declarations among researchers, CHIs and infrastructure providers are crucial domain-specific components of FAIR data management.

    0
    citations0
    popularityAverage
    influenceAverage
    impulseAverage
    BIP!Powered by BIP!
    more_vert
  • Authors: Engelhardt, Claudia; Leone, Claudio; Larrousse, Nicolas; Montoliu, Delphine; +5 Authors
    0
    citations0
    popularityAverage
    influenceAverage
    impulseAverage
    BIP!Powered by BIP!
    more_vert
  • image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
    Authors: Larrousse, Nicolas; Gray, Edward J.; Concordia, Cesare;

    If citation is a common practice for publications, it is relatively new for data especially in SSH. This paper will present the work carried out during the SSHOC project about data citation in general and more precisely how to make them actionable. The metaphor of a travel journal of an expedition seemed appropriate to us to present this work carried out during the SSHOC project. The first part was to study this terra incognita by making an inventory of citation practices (https://doi.org/10.5281/zenodo.3595965). To summarize, we discovered that in the research communities we investigated, practices were seldom standardized and were very diverse, generally producing citations that could not be processed by machines: in other words they were not “actionable”. This led us to develop a sort of guide necessary to journey through this new, uncharted territory in the form of a set of recommendations ( https://doi.org/10.5281/zenodo.5361717) to build citations in SSH. So as not to reinvent the wheel, we based these recommendations on existing principles created by Force11 ( https://doi.org/10.25490/a97f-egyk) by adapting them to the specific characteristics of the SSH data. These recommendations were validated by a committee of experts from different backgrounds and structures (RDA participants, CODATA director, OpenAire Engineers etc.) during a round table (https://www.sshopencloud.eu/news/roundtable-experts-data-citation) and in a parallel review process. Then we decided to analyze the resources available in this new territory, that is, the repositories that are so crucial to be able to cite data. We carried out an analysis of 85 repositories against 7 quality criteria based on the recommendations which ensure continuity with the work mentioned above: PID from “Unique Identification & Persistence” Landing page from “Access” Structured metadata from “Importance & Credit and Attribution” Cite as from “Evidence, Specificity & Verifiability” Versioning from “Specificity and Verifiability” Standardized vocabularies from “Interoperability and Flexibility” Links to publications from “Importance” The results of this survey (https://doi.org/10.5281/zenodo.5603306) are encouraging - even if there is room for improvement, particularly in the use of Persistent Identifiers. Importantly, the presence of a landing page in almost all cases allowed us to build up a test sample made up of a very diverse dataset from those repositories for which we want to build standardized and actionable citations. In parallel we developed a tool in order to “harvest” the resources found in this new land so as to better understand them and also be able to explain them to others. We developed a prototype composed of three components: a harvester which grabs information about a dataset and normalizes it an API to disseminate the metadata of the citation thereby making it actionable a citation viewer for human purposes For the first iteration to populate this prototype, we used the dataset collected during our survey of repositories and we are going to gradually add more datasets from various sources. This prototype is primarily designed to implement what we called “actionability” to a citation and provide a ready-to-use citation in various citation formats. Starting from the PID of a dataset, the prototype attempts to aggregate metadata from different sources: the repository of the dataset, the PID Registration Agency and a number of Knowledge Graphs. For instance, while metadata associated with a DOI (Digital Object Identifier) are limited and those provided by a handle are even more scarce, it is possible to get more information from a landing page and thus enrich the citation. We also used another indirect approach to gather additional information by using a registry of repositories (RE3Data https://www.re3data.org/) which provides, among other things, information on the available APIs available for a specific repository. Thus the prototype can give a unified view of information about datasets coming from different sources. For researchers, it thus avoids cumbersome work on how to cite a dataset or get information about its provenance. In return, it makes a researcher aware of the importance of properly documenting a dataset and depositing it in a “good” repository. This paper will present in greater detail what we learned at each step of this expedition and how a research project can take advantage of a good citation system to enhance the visibility of the output. We will also introduce the potential uses based on the information provided by the prototype such as the possibility of associating a specific tool to process data or the use of this information as a base to build data papers.

    image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ISTI Open Portalarrow_drop_down
    image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
    ISTI Open Portal
    Conference object . 2022
    Data sources: ISTI Open Portal
    image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
    ZENODO; CNR ExploRA
    Other literature type . Conference object . 2022
    License: CC BY
    image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
    ZENODO
    Presentation . 2022
    License: CC BY
    Data sources: Datacite
    addClaim

    This Research product is the result of merged Research products in OpenAIRE.

    You have already added works in your ORCID record related to the merged Research product.
    0
    citations0
    popularityAverage
    influenceAverage
    impulseAverage
    BIP!Powered by BIP!
    more_vert
      image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ISTI Open Portalarrow_drop_down
      image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
      ISTI Open Portal
      Conference object . 2022
      Data sources: ISTI Open Portal
      image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
      ZENODO; CNR ExploRA
      Other literature type . Conference object . 2022
      License: CC BY
      image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
      ZENODO
      Presentation . 2022
      License: CC BY
      Data sources: Datacite
      addClaim

      This Research product is the result of merged Research products in OpenAIRE.

      You have already added works in your ORCID record related to the merged Research product.
  • image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
    Authors: Di Pretoro, Emmanuel; Geeraert, Friedel;

    International audience

    image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ Hyper Article en Lig...arrow_drop_down
    image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
    image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
    Hal-Diderot
    Other literature type . 2019
    Data sources: Hal-Diderot
    0
    citations0
    popularityAverage
    influenceAverage
    impulseAverage
    BIP!Powered by BIP!
    more_vert
      image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ Hyper Article en Lig...arrow_drop_down
      image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
      image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
      Hal-Diderot
      Other literature type . 2019
      Data sources: Hal-Diderot
  • Authors: Wissik, Tanja; Edmond, Jennifer; Fischer, Frank; de Jong, Franciska; +5 Authors

    The digital humanities (DH) enrich the traditional fields of the humanities with new practices, approaches and methods. Since the turn of the millennium, the necessary skills to realise these new possibilities have been taught in summer schools, workshops and other alternative formats. In the meantime, a growing number of Bachelor's and Master's programmes in digital humanities have been launched worldwide. The DH Course Registry, which is the focus of this article, was created to provide an overview of the growing range of courses on offer worldwide. Its mission is to gather the rich offerings of different courses and to provide an up-to-date picture of the teaching and training opportunities in the field of DH. The article provides a general introduction to this emerging area of research and introduces the two European infrastructures CLARIN and DARIAH, which jointly operate the DH Course Registry. A short history of the Registry is accompanied by a description of the data model and the data curation workflow. Current data, available through the API of the Registry, is evaluated to quantitatively map the international landscape of DH teaching.Preprint of a publication for LibraryTribune (China) (accepted)

    0
    citations0
    popularityAverage
    influenceAverage
    impulseAverage
    BIP!Powered by BIP!
    more_vert
  • Authors: Buddenbohm, Stefan; de Jong, Maaike; Priddy, Mike; Moranville, Yoann; +1 Authors
    0
    citations0
    popularityAverage
    influenceAverage
    impulseAverage
    BIP!Powered by BIP!
    more_vert
Advanced search in Research products
Research products
arrow_drop_down
Searching FieldsTerms
Any field
arrow_drop_down
includes
arrow_drop_down
The following results are related to DARIAH EU. Are you interested to view more results? Visit OpenAIRE - Explore.
90 Research products
  • Authors: Romary, Laurent; Puren, Marie;

    International audience; Le projet européen Iperion regroupe un ensemble d'acteurs européens offrant des services d'infrastructure pour l'étude du patrimoine matériel sous la forme d'équipements fixes ou mobiles. Ces différents services génèrent potentiellement de grandes quantités de données qu'il est nécessaire de gérer et documenter. En particulier, il semble utile de travailler à la constitution d'un réservoir de telles données qui soit consultable par une large communauté de chercheurs, notamment en sciences humaines. On peut ainsi penser au rôle que peuvent jouer des analyses précises d'une oeuvre pour un historien des arts qui souhaite étudier l'évolution de la technique d'un peintre par exemple. La mise en place d'une telle infrastructure de données réutilisables dans le domaine du patrimoine matériel se heurte cependant à plusieurs difficultés que nous essayons de réduire au sein du projet Iperion. Tout d'abord, il n'est pas nécessairement dans la culture du déploiement des équipements eux-mêmes d'envisager une réutilisation large des données. Le scénario de base est souvent celui d'un chercheur qui va conduire une analyse ciblée d'un objet patrimonial, pour ensuite exploiter lui-même les résultats correspondants et passer à l'analyse suivante, sans se préoccuper d'une réutilisation des données produites. Ensuite, du point de vue des formats de données, on observe l'absence de réels standards de représentation communs aux différents types d'équipements. On se retrouve ainsi à devoir gérer des données propriétaires qui dépendent principalement des constructeurs des équipements. Enfin, se posele problème complexe des droits d'utilisation qui combinent un ensemble de difficultés liées au statut des oeuvres elles-mêmes, aux règles régissant l'équipement, mais aussi à la volonté de partage du chercheur qui a effectué le recueil initial des données. Dans ce cadre, notre objectif est de mettre en place une démarche d'analyse de l'état des lieux et de proposition de principes communs de gestion des données au sein du projet. Il s'agirait ainsi de préfigurer une charte de gestion des données applicable à la future infrastructure européenne E-RIHS, en collaboration avec l'infrastructure numérique DARIAH en sciences humaines. Nous avons ainsi recueilli les réponses des différents partenaires du projet concernant à la fois les modes de gestion des équipements, et le statut des jeux de données disponibles. La variété des réponses obtenues montre déjà que seules des recommandations génériques pourront être produites à l'échelle européenne, et nous esquisserons quelques propositions dans ce sens. Laurent Romary est directeur de recherche à Inria où il mène des recherches dans le domaine des humanités numériques et plus particulièrement sur la modélisation et la représentation de données en sciences humaines et sociales. Depuis plusieurs années, il a contribué à la définition des politiques d'information scientifique du CNRS, de la société Max Planck et d'Inria, où il a contribué notamment à la définition d'une obligation de dépôt en archives ouvertes dans HAL. Il a aussi participé de longue date à la définition et à l'évolution des directives de la TEI (Text Encoding Initiative), notamment comme membre, mais aussi comme président du conseil technique de la TEI, et préside le comité 37 de l'ISO (Organisation international de normalisation). Il dirige l'infrastructure Européenne DARIAH pour le développement de méthodes numériques en sciences humaines et sociales. https://cv.archives-ouvertes.fr/laurentromary

    0
    citations0
    popularityAverage
    influenceAverage
    impulseAverage
    BIP!Powered by BIP!
    more_vert
  • Authors: Saarti, Jarmo; Ropponen, Jari; Soivanen, Satu;

    International audience; The Karjala database contains digitized demographic data of the parish registers from the regions ceded to the Soviet Union in 1944. The objectives of the digitization project have been to promote access to digitized records for scientific research and genealogy as well as encouraging research on the people of the ceded Karelia region. The main sources for the database have been catechetical lists, lists of children, and registers of vital statistics (registers of births, marriages, migrations and deaths) that are available in Digital Archives of the National Archives of Finland from the period of 1681 – 1949. The data in the database amounts to about 10.3 million entries, but only data older than 100 years is published openly on the Internet. According to decisions by the Finnish data protection authorities, the Personal Data Act is applied to personal registers less than 100 years old. The digitization process is still going on; it has been calculated that there are 1.2 million entries still to be processed. The database is available to users via https://katiha.mamk.fi/. At present, there are about 6.5 million file entries available on the Internet, each presenting data about one individual, e.g. names, the date of birth and death, the cause of death, age, gender, marital status, occupation, residence, migration, the parish. The Karjala database can be exploited for diverse research purposes; it improves access to the church records that are sometimes very difficult to read. Information in the database can be utilized for historical research, medical genetics, social sciences, and family and onomastics. The database is can be utilized for clarifying family structures, migratory patterns or child mortality. The database also offers excellent opportunities for interdisciplinary research. Our presentation will describe the digitization process management of old, handwritten documents that consist of non-structured data from a historical period that contains varied linguistic material: several languages from a historical period where nations, states and languages were still evolving, different calendars and spelling rules etc. We will also introduce our plans to use text recognition technology so that the handwritten documents such as the Karjala database will be incorporated into the international READ project network http://read.transkribus.eu/network/. We will also discuss the challenges encountered in this type of heterogeneous data and the possibilities for more defined and structured data management that could enable the automated use of the database. We will also include in our presentation a description of the evolution of the different phases of the database, emphasizing the evolution of the database and its linkage with internet technologies e.g. how they have either hindered or enabled the digitization project.

    0
    citations0
    popularityAverage
    influenceAverage
    impulseAverage
    BIP!Powered by BIP!
    more_vert
  • Authors: Hinrichs, Erhard;

    International audience; CLARIN-D stand on sustainability in a national and international context

    0
    citations0
    popularityAverage
    influenceAverage
    impulseAverage
    BIP!Powered by BIP!
    more_vert
  • Authors: Zhang, Jing; Lin, Jiaping; Xiao, Peng;

    International audience

    0
    citations0
    popularityAverage
    influenceAverage
    impulseAverage
    BIP!Powered by BIP!
    more_vert
  • Authors: Romary, Laurent; Seillier, Dorian; Tóth-Czifra, Erzsébet;

    A defining feature of data and data workflows in the arts and humanities domain is their dependence on cultural heritage sources hosted and curated in museums, libraries, galleries and archives. A major difficulty when scholars interact with heritage data is that the nature of the cooperation between researchers and Cultural Heritage Institutions and the researchers working in CHIs (henceforth CHIs) is often constrained by structural and legal challenges but even more by uncertainties as to the expectations of both parties.This recognition led several European organizations such as APEF, CLARIN, Europeana, E-RIHS to come together and join forces under the governance of DARIAH to set up principles and mechanisms for improving the conditions for the use and re-use of cultural heritage data issued by cultural heritage institutions and studied and enriched by researchers. As a first step of this joint effort is the Heritage Data Reuse Charter (https://datacharter.hypotheses.org/) establishes 6 basic principles for improving the use and re-use of cultural heritage resources by researchers and , to help all the relevant actors to work together to connect and improve access to heritage data. These are: Reciprocity, Interoperability, Citability, Openness, Stewardship and Trustworthiness.As a further step in translating these principles to actual data workflows the survey below serves as a template to frame exchanges around cultural heritage data by enabling both Cultural Heritage Institutions, infrastructure providers and researchers and to clarify their goals at the beginning and the project, to specify access to data, provenance information, preferred citation standards, hosting responsibilities etc. on the basis of which the parties can arrive at mutual reuse agreements that could serve as a starting point for a FAIR-by-construction data management, right from the project planning/application phase. In practice, the survey below can be flexibly applied in platform-independent ways in exchange protocols between Cultural Heritage Institutions and researchers, Institutions who sign the Charter could use it (and expect to use such surveys) in their own exchange protocols. Another direction of future developments is to set up a platform dedicated to such exchanges. On the other hand, researchers are encouraged to contact the CHIs during the initial stages of their project in order to explain their plans and figure details of transaction together. This mutual declaration can later be a powerful component in their Data Management Plans as it shows evidence for responsible and fair conduct of cultural heritage data, and fair (but also FAIR) research data management practices that are based on partnership with the holding institution. As enclosing a Research Data Management Plan to grant applications is becoming a more and more common requirement among research funders, we need to raise the funders’ awareness to the fact that such bi- or trilateral agreements and data reuse declarations among researchers, CHIs and infrastructure providers are crucial domain-specific components of FAIR data management.

    0
    citations0
    popularityAverage
    influenceAverage
    impulseAverage
    BIP!Powered by BIP!
    more_vert
  • Authors: Engelhardt, Claudia; Leone, Claudio; Larrousse, Nicolas; Montoliu, Delphine; +5 Authors
    0
    citations0
    popularityAverage
    influenceAverage
    impulseAverage
    BIP!Powered by BIP!
    more_vert
  • image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
    Authors: Larrousse, Nicolas; Gray, Edward J.; Concordia, Cesare;

    If citation is a common practice for publications, it is relatively new for data especially in SSH. This paper will present the work carried out during the SSHOC project about data citation in general and more precisely how to make them actionable. The metaphor of a travel journal of an expedition seemed appropriate to us to present this work carried out during the SSHOC project. The first part was to study this terra incognita by making an inventory of citation practices (https://doi.org/10.5281/zenodo.3595965). To summarize, we discovered that in the research communities we investigated, practices were seldom standardized and were very diverse, generally producing citations that could not be processed by machines: in other words they were not “actionable”. This led us to develop a sort of guide necessary to journey through this new, uncharted territory in the form of a set of recommendations ( https://doi.org/10.5281/zenodo.5361717) to build citations in SSH. So as not to reinvent the wheel, we based these recommendations on existing principles created by Force11 ( https://doi.org/10.25490/a97f-egyk) by adapting them to the specific characteristics of the SSH data. These recommendations were validated by a committee of experts from different backgrounds and structures (RDA participants, CODATA director, OpenAire Engineers etc.) during a round table (https://www.sshopencloud.eu/news/roundtable-experts-data-citation) and in a parallel review process. Then we decided to analyze the resources available in this new territory, that is, the repositories that are so crucial to be able to cite data. We carried out an analysis of 85 repositories against 7 quality criteria based on the recommendations which ensure continuity with the work mentioned above: PID from “Unique Identification & Persistence” Landing page from “Access” Structured metadata from “Importance & Credit and Attribution” Cite as from “Evidence, Specificity & Verifiability” Versioning from “Specificity and Verifiability” Standardized vocabularies from “Interoperability and Flexibility” Links to publications from “Importance” The results of this survey (https://doi.org/10.5281/zenodo.5603306) are encouraging - even if there is room for improvement, particularly in the use of Persistent Identifiers. Importantly, the presence of a landing page in almost all cases allowed us to build up a test sample made up of a very diverse dataset from those repositories for which we want to build standardized and actionable citations. In parallel we developed a tool in order to “harvest” the resources found in this new land so as to better understand them and also be able to explain them to others. We developed a prototype composed of three components: a harvester which grabs information about a dataset and normalizes it an API to disseminate the metadata of the citation thereby making it actionable a citation viewer for human purposes For the first iteration to populate this prototype, we used the dataset collected during our survey of repositories and we are going to gradually add more datasets from various sources. This prototype is primarily designed to implement what we called “actionability” to a citation and provide a ready-to-use citation in various citation formats. Starting from the PID of a dataset, the prototype attempts to aggregate metadata from different sources: the repository of the dataset, the PID Registration Agency and a number of Knowledge Graphs. For instance, while metadata associated with a DOI (Digital Object Identifier) are limited and those provided by a handle are even more scarce, it is possible to get more information from a landing page and thus enrich the citation. We also used another indirect approach to gather additional information by using a registry of repositories (RE3Data https://www.re3data.org/) which provides, among other things, information on the available APIs available for a specific repository. Thus the prototype can give a unified view of information about datasets coming from different sources. For researchers, it thus avoids cumbersome work on how to cite a dataset or get information about its provenance. In return, it makes a researcher aware of the importance of properly documenting a dataset and depositing it in a “good” repository. This paper will present in greater detail what we learned at each step of this expedition and how a research project can take advantage of a good citation system to enhance the visibility of the output. We will also introduce the potential uses based on the information provided by the prototype such as the possibility of associating a specific tool to process data or the use of this information as a base to build data papers.

    image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ISTI Open Portalarrow_drop_down
    image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
    ISTI Open Portal
    Conference object . 2022
    Data sources: ISTI Open Portal
    image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
    ZENODO; CNR ExploRA
    Other literature type . Conference object . 2022
    License: CC BY
    image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
    ZENODO
    Presentation . 2022
    License: CC BY
    Data sources: Datacite
    addClaim

    This Research product is the result of merged Research products in OpenAIRE.

    You have already added works in your ORCID record related to the merged Research product.
    0
    citations0
    popularityAverage
    influenceAverage
    impulseAverage
    BIP!Powered by BIP!
    more_vert
      image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ISTI Open Portalarrow_drop_down
      image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
      ISTI Open Portal
      Conference object . 2022
      Data sources: ISTI Open Portal
      image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
      ZENODO; CNR ExploRA
      Other literature type . Conference object . 2022
      License: CC BY
      image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
      ZENODO
      Presentation . 2022
      License: CC BY
      Data sources: Datacite
      addClaim

      This Research product is the result of merged Research products in OpenAIRE.

      You have already added works in your ORCID record related to the merged Research product.