My research focusses on the automatic extraction of canonical references from publications in Classics. Such references are the standard way of citing classical texts and are found in great numbers throughout monographs, journal articles and commentaries. In chapters 1 and 2 I argue for the importance of canonical citations and for the need to capture them automatically. Their importance and function is to signal text passages that are studied and discussed, often in relation to one another as can be seen in parallel passages found in modern commentaries. Scholars in the field have long been exploiting this kind of information by manually creating indexes of cited passages, the so-called indices locorum. However, the challenge we now face is find new ways of indexing and retrieving information contained in the growing volume of digital archives and libraries. Chapters 3 and 4 look at how this problem can be tackled by translating the extraction of canonical citations into a computationally solvable problem. The approach I developed consists of treating the extraction of such citations as a problem of named entity extraction. This problem can be solved with some degree of accuracy by applying and adapting methods of Natural Language Processing. In this part of the dissertation I discuss the implementation of this approach as a working prototype and an evaluation of its performance. Once canonical references have been extracted from texts, the web of relations between documents that they create can be represented as a network. This network can then be searched, manipulated, visualised and analysed in various ways. In chapter 5 I focus specifically on how this network can be leveraged to search through bodies of secondary literature. Finally in chapter 6 I discuss how my work opens up new research perspectives in terms of visualisation, analysis and the application of such automatically extracted citation networks.
The article is available in open access on the publisher website.; International audience; By examining the case of the French translation of the expression "digital humanities" (DH), this article argues that cultural diversity and multilingualism could be fostered in digital culture. If other languages have been invited and forced to welcome this English phrase, its translations have to be studied since they could potentially have strong epistemological backwash-effects on it. Through an historical etymological inquiry, it can be demonstrated that the use of the outmoded French word humanités is the most significant element in the two French expressions humanités numériques or humanités digitales. This single word opens up a specific space for humanist approaches within the open-ended digital approaches. On this base, the encounter between Humanities and hard sciences can be reconsidered, as it happens already in two examples of new DH masters in French-speaking countries. To my late mother, who read so many books aloud to me, building my cultural memory of the forgotten meanings of words By examining the case of the French translation of the expression "digital humanities" (DH), this article argues that cultural diversity and multilingualism could be fostered in digital culture. At first glance, the international success of this expression seems to contradict this statement: isn't it a clear example of English language domination over other Western and non-Western languages? Used in written form for Lost in translation? The odyssey of 'digital humanities' in French 27 Studia UBB Digitalia Volume 62, No. 1, 2017 the first time in 2004 (Kirschenbaum 56), tirelessly discussed in DH conferences and works, "DH" has quickly been used in professorship titles, in undergraduate and postgraduate degrees, or to qualify centers, laboratories, and research projects (Clivaz "Common Era" 41). If other languages have been invited and forced to welcome this expression, its translations have to be studied since they could potentially have strong epistemological backwash-effects on it. French is an example worth examining: it can be demonstrated that the use of the outmoded French word humanités is the most significant element in the two French expressions humanités numériques or humanités digitales. This single word opens up a specific space for humanist approaches within the open-ended digital approaches. The introduction below aims to present the specific impact of a study of the phrase "digital humanities" and its translations within the general problematic of the phrase's definition. The second part of this article summarizes the main progressions and arguments in the discussions surrounding humanités numériques (humanities computing) and humanités digitales (digital humanities) in the French-speaking sphere. The third section examines the historical epistemology of humanités while the final section considers the resulting confrontation between the humanities and the 'hard' sciences: this underlines their potential synergy and the proper role of the humanities.
International audience; This article presents the challenges of developing Humanities research in a digital environment in relation to a New Testament test-case: the MARK16 project. The first section argues that virtual research environments (VREs) have become an excellent milieu in which to develop a digitized research project based on collaborative work. The second section presents an overview of VREs and digital projects on the New Testament. The third section demonstrates the ways in which the MARK16 project participates in the development of VREs and fosters new modes of engaging material in digitized NT research. Preamble The research question of this paper is simultaneously simple and boundless: does it matter if we practice Humanities research in a digital culture rather than in traditional print cultures? And what does the answer to this question mean for New Testament research in particular? Such abyssal questions are fundamental and should at least be considered when a scholar is planning a research project in biblical studies, theology, or religious studies. Indeed, the number of digital research projects are increasing at the international, European, and national levels.1 Such questions closely accompanied the preparatory phase of the SNSF PRIMA grant MARK16, a five-year project supported by the Swiss National Foundation.2 These interrogations are deeply embedded within the opening phase of the project and will remain so throughout, as MARK16 aims to build a new Digital Humanities research model. This will be based on a test case that is well known in New Testament textual criticism (NTTC): the ending of the Gospel according to Mark. Consequently, this article explores the epistemological digital turn in the Humanities and relates it to the MARK16 project, hoping to inspire further research and engagement in NTTC and New Testament studies. The first section outlines some challenges for digital research, pointing to the fact that virtual research environments (VREs) seem to be the main emergent digital milieu in which this work occurs. The second section presents an overview of VREs in New Testament and Early Christian research, and the third discusses the challenges presented by MARK16 in building a new Humanities research model around a NTTC test case.
International audience; This paper describes the workflow of the Grammateus project, from gathering data on Greek documentary papyri to the creation of a web application. The first stage is the selection of a corpus and the choice of metadata to record: papyrology specialists gather data from printed editions, existing online resources and digital facsimiles. In the next step, this data is transformed into the EpiDoc standard of XML TEI encoding, to facilitate its reuse by others, and processed for HTML display. We also reuse existing text transcriptions available on . Since these transcriptions may be regularly updated by the scholarly community, we aim to access them dynamically. Although the transcriptions follow the EpiDoc guidelines, the wide diversity of the papyri as well as small inconsistencies in encoding make data reuse challenging. Currently, our data is available on an institutional GitLab repository, and we will archive our final dataset according to the FAIR principles.
International audience; The CENDARI infrastructure is a research-supporting platform designed to provide tools for transnational historical research, focusing on two topics: medieval culture and World War I. It exposes to the end users modern Web-based tools relying on a sophisticated infrastructure to collect, enrich, annotate, and search through large document corpora. Supporting researchers in their daily work is a novel concern for infrastructures. We describe how we gathered requirements through multiple methods to understand historians' needs and derive an abstract workflow to support them. We then outline the tools that we have built, tying their technical descriptions to the user requirements. The main tools are the note-taking environment and its faceted search capabilities; the data integration platform including the Data API, supporting semantic enrichment through entity recognition; and the environment supporting the software development processes throughout the project to keep both technical partners and researchers in the loop. The outcomes are technical together with new resources developed and gathered, and the research workflow that has been described and documented.
International audience; Defining digital humanities might be an endless debate if we stick to the discussion about the boundaries of this concept as an academic “discipline”. In an attempt to concretely identify this field and its actors, this paper shows that it is possible to analyse them through Twitter, a social media widely used by this “community of practice”. Based on a network analysis of 2,500 users identified as members of this movement, the visualisation of the “who’s following who?” graph allows us to highlight the structure of the network’s relationships, and identify users whose position is particular. Specifically, we show that linguistic groups are key factors to explain clustering within a network whose characteristics look similar to a small world.
For many libraries, mass digitisation has become routine. Digitisation centres are available in many places and there is a wealth of online platforms for the presentation of a wide variety of different media. Current projects from ETH Library reveal the directions in which the enormous potential harboured in these platforms and the millions of digital copies already produced may evolve. Research partnerships play just as important a role here as active user participation and intensified outreach. HAL Archive