International audience; The aim of the talk is to present the methodology used to reorganise the PACTOLS thesaurus of Frantiq, launched within the framework of the MASA consortium. PACTOLS is a multilingual and open repository about archaeology from Prehistory to the present and for Classics. It is organized into six micro-thesauri at the root of its name (Peuples, Anthroponymes, Chronologie, Toponymes, Oeuvres, Lieux, Sujets). The goal is to turn it into a tool interoperable with information systems beyond its original documentary purpose, and usable by archaeologists as a repository for managing scientific data. During the talk, we will describe the choice of tools, the organisation of work within the steering group and the collaborations with specialists for the upgrading and development of the vocabulary while showing the strengths and limitations of some experiments. Above all, it will show how the introduction of the conceptual categories of the BackBone Thesaurus of DARIAH, modelled on the CIDOC-CRM ontology, through a progressive deconstruction/reconstruction process, eventually had an impact on all micro-thesauri and questioned the organisation of knowledge proposed so far.
This report provides information about activities and progress towards establishing DARIAH membership in six countries: the Czech Republic, Finland, Israel, Spain, Switzerland, and the UK, which took place between July and December 2019. Previous activities were described in detail in the D3.2 - Regularly Monitor Country-Specific Progress in Enabling New DARIAH Membership. During the project lifetime, the Czech Republic joined DARIAH ERIC; in other countries, collaboration with DARIAH has been greatly strengthened and significant progress regarding DARIAH membership has been achieved. The report also outlines the next steps in the accession processes, building on the results of the DESIR project.
The European Holocaust Research Infrastructure (EHRI) started in October 2010 to build on a network that connects both people (Holocaust researchers, archivists, curators, librarians and digital humanists) and dispersed Holocaust source material and collections. EHRI’s aim is to make sources visible in a systematic way in order to counteract the fragmentation of the sources and to reveal interconnections. EHRI focuses on archive and collection descriptions, which are available through the EHRI Portal. EHRI is currently in its second phase and is on the ESFRI Roadmap for a more sustainable future. EHRI has developed a set of controlled vocabularies that serves both as a retrieval and cataloguing tool for the multilingual and highly heterogeneous data of the EHRI Portal. These vocabularies were partly implemented in the first phase of the project. In the current phase of EHRI, the vocabularies are undergoing quality improvement: improving and enriching the existing terms, adding new terms, disambiguating and removing mistakes (deduplication, merging, adding multilingual labels, consistency checks, multiple parent relations, etc.) and increasing their coverage. In the EHRI Portal, the subject terms are currently not available to the public, as they are used only for retrieval purposes.
International audience; In this paper we describe the development and evaluation of a visual analytics tool to support historical research. Historians continuously gather data related to their scholarly research from archival visits and background search. Organising and making sense of all this data can be challenging as many historians continue to rely on analog or basic digital tools. We built an integrated note-taking environment for historians which unifies a set of functionalities we identified as important for historical research including editing, tagging, searching, sharing and visualization. Our approach was to involve users from the initial stage of brainstorming and requirement analysis through to design, implementation and evaluation. We report on the process and results of our work, and conclude by reflecting on our own experience in conducting user-centered visual analytics design for digital humanities.
The paper presents Intergraph, a graph-based visual analytics technical demonstrator for the exploration and study of content in historical document collections. The designed prototype is motivated by a practical use case on a corpus of circa 15,000 digitized resources about European integration since 1945. The corpus allowed generating a dynamic multilayer network which represents different kinds of named entities appearing and co-appearing in the collections. To our knowledge, Intergraph is one of the first interactive tools to visualize dynamic multilayer graphs for collections of digitized historical sources. Graph visualization and interaction methods have been designed based on user requirements for content exploration by non-technical users without a strong background in network science, and to compensate for common flaws with the annotation of named entities. Users work with self-selected subsets of the overall data by interacting with a scene of small graphs which can be added, altered and compared. This allows an interest-driven navigation in the corpus and the discovery of the interconnections of its entities across time.
Project: FWF | Arabic in the Middle Atla... (P 21722)
International audience; Academic dictionary writing is making greater and greater use of the TEI Guidelines’ dictionary module. And as increasing numbers of TEI dictionaries become available, there is an ever more palpable need to work towards greater interoperability among dictionary writing systems and other language resources that are needed by dictionaries and dictionary tools. In particular this holds true for the crucial role that statistical data obtained from language resources play in lexicographic workflow—a role that also has to be reflected in the model of the data produced in these workflows. Presenting a range of current projects, the authors address two main questions in this area: How can the relationship between a dictionary and other language resources be conceptualized, irrespective of whether they are used in the production of the dictionary or to enrich existing lexicographic data? And how can this be documented using the TEI Guidelines? Discussing a variety of options, this paper proposes a customization of the TEI dictionary module that tries to respond to the emerging requirements in an environment of increasingly intertwined language resources.
The project will deliver training materials for the digital arts and humanities in different languages and make them available via an online e-learning platform. This report elaborates on the implementation of such a platform. It describes the main user scenarios, it collects user and technical requirements, defines the data model and functional specification and explores technical solutions. The report has been created within WP 4 Infrastructure Development with input from all partners, especially WP 2 (user requirements). As a first step, we performed desktop research on what kind of solutions and projects on portals for training materials exist and what kind of systems they are using. The second step was an evaluation of different tools. There are different evaluation methods and criteria for e-learning systems (e.g. Kurilovas & Dagiene 2009). We chose to start from user requirements and a mapping of the user requirements from WP2 to functionalities available in existing systems. Finally, we determined which solution would suit the requirements and other circumstances within the project best.
Project: EC | Locus Ludi (741520), EC | DESIR (731081)
The DESIR project sets out to strengthen the sustainability of DARIAH and firmly establish it as a long-term leader and partner within arts and humanities communities. The project was designed to address six core infrastructural sustainability dimensions and one of these was dedicated to training and education, which is also one of the four pillars identified in the DARIAH Strategic Plan 2019-2026. In the framework of Work Package 7: Teaching, DESIR organised dedicated workshops in the six DARIAH accession countries (Czech Republic, Finland, Israel, Spain, Switzerland and the United Kingdom) to introduce them to the DARIAH infrastructure and related services, and to develop methodological research skills. The topic of each workshop was decided by the accession countries' representatives according to the training needs of the national communities of researchers in the (Digital) Humanities. Training topics varied greatly: some workshops aimed to introduce participants to specific methodological research skills, while other events took a different approach and focused on the infrastructural role of training and education. The workshops organised in the context of Work Package 7: Teaching are listed below: • CZECH REPUBLIC: “A series of fall tutorials 2019 organized by LINDAT/CLARIAHCZ, tutorial #3 on TEI Training”, November 28, 2019, Prague; • FINLAND: “Reuse & sustainability: Open Science and social sciences and humanities research infrastructures”, 23 October 2019, Helsinki; • ISRAEL: “Introduction to Text Encoding and Digital Editions”, 24 October 2019, Haifa; • SPAIN: “DESIR Workshop: Digital Tools, Shared Data, and Research Dissemination”, 3 July 2019, Madrid; • SWITZERLAND: “Sharing the Experience: Workflows for the Digital Humanities”, 5-6 December 2019, Neuchâtel; • UNITED KINGDOM: “Research Software Engineering for Digital Humanities: Role of Training in Sustaining Expertise”, 9 December, London.
International audience; The panel presents results and ongoing work from corpus projects in which TEI-P5 has been adopted for the representation and linguistic annotation of genres of social media and computer-mediated communication (CMC). It relates to the work of the TEI-SIG “computer-mediated communication” which is developing TEI models for the representation of CMC genres and testing these models for a broad range of genres (ranging from “text-only” genres such as chat and SMS to multimodal genres such as learning environments and Second Life) and in corpus building initiatives for various European languages. The goal of the panel is to give an overview of models and practices in representing CMC in TEI, using German and French CMC corpora as examples. Documentation and ODD files of the schemas developed by the group will be made available in the TEI wiki and announced via the TEI mailing list before the conference so that everybody who is interested in participating in the discussion can examine the CMC models in advance. The discussion in the panel shall serve as an opportunity for collecting feedback on these models and schema drafts from a broader community within the TEI that is interested in adapting TEI-P5 for the representation of new (digital) genres. This feedback will be taken into consideration when revising the models and – as a next step after the conference – preparing feature requests for adapting the TEI for CMC.
To support the digital evolution within Social Sciences and Humanities research, it is necessary to stabilize knowledge on standards and good research practices. The goal of the Standardization Survival Kit (SSK), developed within the PARTHENOS project, is to accompany researchers along this route, giving access to standards and best practices in a meaningful way, through the mediation of research scenarios. A research scenario is a (digital) workflow practiced by researchers that can be repeatedly applied to a task and that helps to gain material or insights in view of a research question. These scenarios are at the core of the SSK, as they embed resources with contextual information and relevant examples on standardized processes and methods in a research context. The SSK is an open tool where users are able to publish new scenarios or adapt existing ones. These scenarios can be seen as a living memory of what should be the best research practices in a given community, made accessible and reusable for other researchers.