Professionally edited open access online encyclopaedias enable a systemic and reliable orientation within the ever-increasing amount of data and information on the Internet. Providing access to scientifically verified information, they represent an important part of the research and didactic infrastructure. This paper demonstrates the activities of Croatia’s Miroslav Krleža Institute of Lexicography aimed at exploring the new encyclopaedic concept in the digital age. The Institute’s digital transformation is shown, which involves the digitisation and online publishing of archival editions, publishing of the permanently updated online general encyclopaedia, and the transformation of specialised encyclopedias to the encyclopaedic portals. Encyclopaedic portals could represent a new concept of encyclopaedias in the digital realm by serving as platforms for data networking and sharing, a sort of ‘junction points’ that connect diverse digital content on a specific topic. Institute’s publicly available repository of encyclopaedic knowledge enables the linking to the digital data and collections of other research and cultural institutions; therefore the collaborative projects aimed at reinforcing digital research and cultural infrastructure will be described. Thanks to the properties of the digital media and increasing connectivity, a closer collaboration Towards a New Concept of Open Access Online Encyclopaedia : A Case Study from... between professionally edited online encyclopaedias across Europe (and beyond) is enabled. This paper elaborates a range of initiatives seeking to build connections across individual European and North American national encyclopaedias, focusing on the role that Croatian encyclopaedistics plays in this endeavour.
The European Holocaust Research Infrastructure (EHRI) started in October 2010 to build on a network that connects both people (Holocaust researchers, archivists, curators, librarians and digital humanists) and dispersed Holocaust source material and collections. EHRI’s aim is making sources visible in a systematic way in order to counteract the fragmentation of the sources and to reveal interconnections. EHRI focuses on Archive and collection descriptions, which are available through the EHRI Portal. EHRI is currently in its second phase and is on the ESFRI Roadmap2 for a more sustainable future. EHRI has developed a set of controlled vocabularies that serves both as a retrieval and cataloguing tool for the multilingual and highly heterogeneous data of the EHRI portal. These vocabularies were partly implemented in the first phase of the project. In the current phase of EHRI the vocabularies are in the process of quality improvement improve and enrich the existing terms, add new terms, disambiguate and remove the mistakes (deduplication, merging, adding multilingual labels, consistency checks, multiple parent relations, etc.) and increase their coverage. In the EHRI portal the subject terms are currently not available for the public, as they are used only for retrieval purposes.
International audience; In this paper we describe the development and evaluation of a visual analytics tool to support historical research. Historians continuously gather data related to their scholarly research from archival visits and background search. Organising and making sense of all this data can be challenging as many historians continue to rely on analog or basic digital tools. We built an integrated note-taking environment for historians which unifies a set of func-tionalities we identified as important for historical research including editing, tagging, searching, sharing and visualization. Our approach was to involve users from the initial stage of brainstorming and requirement analysis through to design, implementation and evaluation. We report on the process and results of our work, and conclude by reflecting on our own experience in conducting user-centered visual analytics design for digital humanities.
AbstractThe paper presents Intergraph, a graph-based visual analytics technical demonstrator for the exploration and study of content in historical document collections. The designed prototype is motivated by a practical use case on a corpus of circa 15.000 digitized resources about European integration since 1945. The corpus allowed generating a dynamic multilayer network which represents different kinds of named entities appearing and co-appearing in the collections. To our knowledge, Intergraph is one of the first interactive tools to visualize dynamic multilayer graphs for collections of digitized historical sources. Graph visualization and interaction methods have been designed based on user requirements for content exploration by non-technical users without a strong background in network science, and to compensate for common flaws with the annotation of named entities. Users work with self-selected subsets of the overall data by interacting with a scene of small graphs which can be added, altered and compared. This allows an interest-driven navigation in the corpus and the discovery of the interconnections of its entities across time.
Project: FWF | Arabic in the Middle Atla... (P 21722)
International audience; Academic dictionary writing is making greater and greater use of the TEI Guidelines’ dictionary module. And as increasing numbers of TEI dictionaries become available, there is an ever more palpable need to work towards greater interoperability among dictionary writing systems and other language resources that are needed by dictionaries and dictionary tools. In particular this holds true for the crucial role that statistical data obtained from language resources play in lexicographic workflow—a role that also has to be reflected in the model of the data produced in these workflows. Presenting a range of current projects, the authors address two main questions in this area: How can the relationship between a dictionary and other language resources be conceptualized, irrespective of whether they are used in the production of the dictionary or to enrich existing lexicographic data? And how can this be documented using the TEI Guidelines? Discussing a variety of options, this paper proposes a customization of the TEI dictionary module that tries to respond to the emerging requirements in an environment of increasingly intertwined language resources.
This article describes how virtual research environments (VREs) offer new opportunities for researchers to analyse open data and to obtain new insights for policy making. Although various VRE-related initiatives are under development, there is a lack of insight into how VREs support collaborative open data analysis by researchers and how this might be improved, ultimately leading to input for policy making to solve societal issues. This article clarifies in which ways VREs support researchers in open data analysis. Seven cases presenting different modes of researcher support for open data analysis were investigated and compared. Four types of support were identified: 1) ‘Figure it out yourself', 2) ‘Leading users by the hand', 3) ‘Training to provide the basics' and 4) ‘Learning from peers'. The author provides recommendations to improve the support of researchers' open data analysis and to subsequently obtain new insights for policy making to solve societal challenges.
In this article, we propose a Category Theory approach to (syntactic) interoperability between linguistic tools. The resulting category consists of textual documents, including any linguistic annotations, NLP tools that analyze texts and add additional linguistic information, and format converters. Format converters are necessary to make the tools both able to read and to produce different output formats, which is the key to interoperability. The idea behind this document is the parallelism between the concepts of composition and associativity in Category Theory with the NLP pipelines. We show how pipelines of linguistic tools can be modeled into the conceptual framework of Category Theory and we successfully apply this method to two real-life examples. Paper submitted to Applied Category Theory 2020 and accepted for Virtual Poster Session
Miriam Baglioni; Alessia Bardi; Argiro Kokogiannaki; Paolo Manghi; Katerina Iatropoulou; Pedro Príncipe; André Vieira; Lars Holm Nielsen; Harry Dimitropoulos; Ioannis Foufoulas; +7 more
Miriam Baglioni; Alessia Bardi; Argiro Kokogiannaki; Paolo Manghi; Katerina Iatropoulou; Pedro Príncipe; André Vieira; Lars Holm Nielsen; Harry Dimitropoulos; Ioannis Foufoulas; Natalia Manola; Claudio Atzori; Sandro La Bruzzo; Emma Lazzeri; Michele Artini; Michele De Bonis; Andrea Dell’Amico;
Despite the hype, the effective implementation of Open Science is hindered by several cultural and technical barriers. Researchers embraced digital science, use “digital laboratories” (e.g. research infrastructures, thematic services) to conduct their research and publish research data, but practices and tools are still far from achieving the expectations of transparency and reproducibility of Open Science. The places where science is performed and the places where science is published are still regarded as different realms. Publishing is still a post-experimental, tedious, manual process, too often limited to articles, in some contexts semantically linked to datasets, rarely to software, generally disregarding digital representations of experiments. In this work we present the OpenAIRE Research Community Dashboard (RCD), designed to overcome some of these barriers for a given research community, minimizing the technical efforts and without renouncing any of the community services or practices. The RCD flanks digital laboratories of research communities with scholarly communication tools for discovering and publishing interlinked scientific products such as literature, datasets, and software. The benefits of the RCD are show-cased by means of two real-case scenarios: the European Marine Science community and the European Plate Observing System (EPOS) research infrastructure. This work is partly funded by the OpenAIRE-Advance H2020 project (grant number: 777541; call: H2020-EINFRA-2017) and the OpenAIREConnect H2020 project (grant number: 731011; call: H2020-EINFRA-2016-1). Moreover, we would like to thank our colleagues Michele Manunta, Francesco Casu, and Claudio De Luca (Institute for the Electromagnetic Sensing of the Environment, CNR, Italy) for their work on the EPOS infrastructure RCD; and Stephane Pesant (University of Bremen, Germany) his work on the European Marine Science RCD. First Online 30 August 2019