Summary information

Study title

Mapping manuscript migrations knowledge graph 500-1500

Creator

Burrows, T, University of Oxford
Page, K, University of Oxford
Koho, M, Aalto University
Tuominen, J, Aalto University
Lewis, D, University of Oxford
Ikkala, E, Aalto University
Velios, A, University of the Arts London
Hyvönen, E, Aalto University
Ransom, L, University of Pennsylvania
Brix, A, Institut de recherche et d'histoire des textes
Wijsman, H, Institut de recherche et d'histoire des textes
Thomson, E, University of Pennsylvania
Fraas, M, University of Pennsylvania
Emery, D, University of Pennsylvania
Morrison, A, University of Oxford
Myking, S, Institut de recherche et d'histoire des textes

Study number / PID

854544 (UKDA)

10.5255/UKDA-SN-854544 (DOI)

Data access

Open

Series

Not available

Abstract

The Mapping Manuscript Migrations (MMM) project was funded from 2017 to 2020 by the Digging into Data Challenge of the Trans-Atlantic Platform. The project partners were the University of Oxford, the University of Pennsylvania, Aalto University, and the Institut de recherche et d'histoire des textes. The project's goal was to bring together data from different sources relating to the history and provenance of medieval and Renaissance manuscripts, enabling large-scale browsing and searching through a semantic Web portal as well as by direct access to the data. Three separate datasets covering more than 200,000 manuscripts, were combined into a unified knowledge graph, using Linked Open Data technologies. This approach includes a unified data model which is based on the CIDOC-CRM and FRBRoo ontologies, as well as more than 20 million RDF triples. Overlapping vocabularies for persons, places, and organizations in the source datasets were reconciled against identifiers from VIAF, GeoNames, and the Getty Thesaurus of Geographical Names. Works and manuscripts were reconciled by semi-automatic matching techniques based on string similarities. The three source datasets were: (1) Schoenberg Database of Manuscripts from the Schoenberg Institute for Manuscript Studies, University of Pennsylvania; (2) Bibale database from the Institut de recherche et d'histoire des textes (IRHT-CNRS, Paris) and (3) Medieval Manuscripts in Oxford Libraries catalogue from the Bodleian Libraries, University of Oxford. To test and demonstrate its usefulness, the MMM Knowledge Graph is in use in the MMM Semantic Portal. Based on the Sampo-UI software developed at Aalto University, the portal enables browsing, searching, and filtering across the project's triple store, together with map-based visualizations of the results.Hundreds of thousands of European pre-modern manuscripts have survived until the present day. As the result of changes in their ownership over the centuries, they are now spread...
Read more

Topics

Methodology

Data collection period

01/07/2017 - 31/08/2020

Country

United Kingdom, United States, France

Time dimension

Not available

Analysis unit

Text unit

Universe

Not available

Sampling procedure

Not available

Kind of data

Text

Data collection mode

The Mapping Manuscript Migrations (MMM) project transformed three separate datasets into a unified knowledge graph: Schoenberg Database of Manuscripts (relational database); Bibale (relational database ); and Medieval Manuscripts in Oxford Libraries (XML documents in Text Encoding Initiative format). Each source dataset was transformed into RDF (Resource Description Framework) triples, and mapped to the MMM Data Model, which combined elements from the CIDOC-CRM and FRBRoo ontologies. Overlapping vocabularies were reconciled using two methods: (1) automatic reconciliation using references to external authoritative Linked Open Data identifiers, and (2) semi-automatic reconciliation using expert review of possible matches identified by string similarity.The combined data were then loaded to a public triple store, and made available through a SPARQL endpoint and a semantic portal interface using the Sampo-UI software.

Funding information

Grant number

ES/R003971/1

Access

Publisher

UK Data Service

Publication year

2021

Terms of data access

The Data Collection is available from an external repository. Access is available via Related Resources.

Related publications

Not available