Introduction
The digital representation of cultural heritage collections provides new access for researchers and the general public alike. This is because digitized images can be viewed at any time and from any location via mobile devices. The use of structured metadata enables users to rapidly discover whole collections or individual objects in detail and in depth. This is in stark contrast to the conventional perception and use of collections. The precarious and valuable holdings require special protection for good reasons, but this means that the originals are difficult and sometimes impossible for the public to access. Moreover, only a select few individuals are granted the opportunity to access and explore them. The digitization of the collections is one aspect of the process; the findability of the digital objects is another. The establishment and application of standards is driving semantic networking in the digital world forward and is also indispensable for the provision and dissemination of knowledge about our cultural heritage. The emergence of the Semantic Web and Linked Open Data offers unique opportunities to achieve this comprehensive interconnectivity. The utilization of identifiers to reference all entities (persons, processes, objects, etc.) within a corpus and the application of standardized ontologies to represent the links between these entities and their properties, enables the creation of knowledge graphs that provide their users with a multitude of possibilities for querying, reusing, and the discovering of known or previously unknown information and cultural goods.
In this paper, we introduce the GESAH Graphic Arts Ontology (GESAH GAO), including its content, metrics, and implementation. Created during the DFG-funded GESAH project (2019–2023),Footnote 1 the GESAH GAO was designed in the Web Ontology Language (OWL)Footnote 2 to contribute to the knowledge graph with the digital collection of single sheet prints and drawings from the Albrecht Haupt Collection. The OWL is one of the technologies recommended by the W3C for handling data on the Web.Footnote 3 It incorporates elements from the Resource Description Framework (RDF) and RDF Schema (RDFS). However, in comparison with these standards, OWL offers more functions in terms of expressivity. It allows defining more complex classes (e.g. owl:unionOf) and types of properties (e.g. owl:FunctionalProperty). The axioms can be interpreted by reasoners, enabling the deduction of additional information from the data described by the ontology. Moreover, it provides methods for matching or mismatching classes (owl:equivalentClass/owl:disjointWith), properties (owl:equivalentProperty) and individuals (owl:sameAs/owl:differentFrom) from different ontologies and datasets, thereby enhancing the interoperability of data described by OWL.Footnote 4
The Albrecht Haupt Collection
The Albrecht Haupt Collection is a historic collection of national and international importance in the fields of architecture, art history, garden history and others. It is owned by the TIB – Leibniz Information Center for Science and Technology (TIB) in Hanover, Germany.Footnote 5 In 1901, architect and university lecturer Albrecht Haupt (1852–1932) bequeathed his extensive private collection of books, prints, drawings and travel sketches to what was then the library of Hanover College of Technology. With a passion for travel and collecting, Haupt devoted more than 25 years of his life compiling this collection.Footnote 6 In the GESAH project, our initial objective was to conduct an investigation into only one subset of the collection such as the individual graphic sheets (Grafische Einzelblätter der Sammlung des Architekten Albrecht Haupt), which had rarely been studied prior to the start of the GESAH project.
With just over 6,200 sheets, this relatively compact collection of prints and drawings dating from the sixteenth to the nineteenth centuries is characterized by its remarkable thematic range. While architecture and figurative drawings predominate within the approximately 1,000 hand drawings, the collection of prints presents itself as a panopticon of all genres and themes. In addition to architectural prints, the figurative prints of the great and small early European masters are represented as well as extensive documentation of wall and ceiling paintings of the Baroque era or neoclassical interiors. The collection also includes ornaments, portraits, vedute and, from the field of applied graphics, bookplates, seals, calligraphies, initials and much more. The drawings and prints also offer a comprehensive insight into European architecture and art as well as arts and crafts in chronological and geographical terms. They date from the end of the fifteenth century to the end of the nineteenth century and were created in Germany, Italy, France, the Netherlands, Flanders, Switzerland, Great Britain, Spain and – in a few cases – Russia. The collection includes a wide variety of drawing and printing techniques of different times. In addition to the obvious pleasure Albrecht Haupt derived from his collection, he employed it in two distinct ways: On the one hand, he utilized the material for his studies in preparation for his own creations in architecture, interior design or in restoration projects he carried out. On the other hand, he also utilized it for teaching purposes. Haupt was convinced that an understanding of the past was not only the best foundation for creative invention, but also essential. This position was highly controversial at the time and was seen differently by his contemporaries with more modern views (to make a long story short). Nevertheless, thanks to Haupt’s conservatism, we are able to appreciate the works of art today and still learn from and about them.
The Ontology
The GESAH GAO was developed with the objective of adequately cataloguing the Albrecht Haupt Collection and publishing it on the World Wide Web. It enables the description of the complex process of creating a work of art and events related to it in the past or in the future.
Related Works
One of the established reference models in Digital Humanities for describing works of art is CIDOC Conceptual Reference Model (CRM).Footnote 7 CIDOC CRM (and its owl-variant the Erlangen CRMFootnote 8) were too extensive to be used in a Vitro-based application.Footnote 9 They serve to describe processes and manifestations from the beginning of the existence of an entity to its end, which are researched in very different disciplines, including archeology, musicology, theatre, art and architecture, and others. As a result, they are very large and complex.
Another data model is the Europeana Data Model,Footnote 10 which does not offer the required complexity for the event centric approach and primarily employs Dublin Core properties to describe agents participating in events, including properties such as e.g. dc:creator and dc:contributor. Dublin CoreFootnote 11 is not based on OWL and therefore a direct reuse of the components of the Europeana Data Model would not match the requirements of a Vitro-based application.
Ontology scope
At the beginning of the ontology development process, competency questions were created. These questions shape the scope of the ontology and facilitate its evaluation. The following questions illustrate various approaches to the corpus. The ontology development is considered successful if the questions can be answered by data described by the ontology.
Selection of competency questions:
1. Which German prints from before 1600 are part of the collection?
2. Which Italian architectural drawings can be found in the collection?
3. Which prints were issued in Antwerp?
4. Which drawings were formerly attributed to Parmigianino?
5. Which drawings and prints have a collector’s mark?
6. Which drawings and prints were kept in the Collection of Hippolyte Destailleur (Paris)?
7. Which drawings were kept under the former/current inventory no. (e.g. Albrecht Haupt’s folder numbered e.g. ‘XXXIIIa’)?
8. Which drawings and prints depict the subject ‘Adam and Eve’?
Therefore, the ontology is designed to serve the inventory, classification, and structured description of works on paper. It should provide a means of describing changing attributions to artists, complex events (such as the multi-phase creation of a work of art or its provenance history), as well as of recording inscriptions, collector’s marks and other features in a structured manner. The ontology should facilitate easy access to the material for the public, enable scholarly indexing by the specialist community and curatorial documentation by the preserving institution. To define structure, fields, and relations our domain experts made extensive use of Knaus et al. 2019 (LIDO-Handbuch für Graphik)Footnote 12 and the so-called “Gemeinsamer Feldkatalog”.Footnote 13
Ontology metrics
The ontology was developed using the open-source ontology editor Protégé.Footnote 14 Its design was conducted in an iterative process involving continuous input from domain experts, and it was constructed in OWL DL.Footnote 15 The ontology comprises 273 declaration axioms and contains 80 named classes, 131 properties, 47 data properties, and 21 annotation properties. It includes incorporated classes and properties from other ontologies and vocabularies. Complex class expressions such as domain unions were used for the domain definition of several object and data type properties.
The Basic Formal Ontology (BFO)Footnote 16 was employed as an upper-level ontology, with several of its classes incorporated to represent fundamental concepts such as entity, role, quality, and so forth. The Friend of a Friend Ontology (FOAF)Footnote 17 was reused to describe persons, organizations, and groups of people. Additionally, the class skos:Concept from the Simple Knowledge Organization System (SKOS) vocabularyFootnote 18 is widely utilized in the GESAH Ontology as well. Particular basic classes like vivo:Relationship, vivo:DateTimeValue, vivo:DateTimeInterval including related properties such as the object property vivo:dateTimeInterval were integrated from the VIVO Ontology.Footnote 19 This allowed us to reuse the existing structure of forms and display configurations based on these classes for our purposes.
It should be noted that a significant share of the relationships between cultural objects have equivalences in the Lightweight Information Describing Objects (LIDO) Terminology,Footnote 20 although these are not declared as equivalent via axioms. All possible references to corresponding entities in LIDO, CIDOC CRM, Art & Architecture ThesaurusFootnote 21, ICONCLASSFootnote 22 and other vocabularies were integrated via rdfs:seeAlso.
Application of the GESAH GAO on the Albrecht Haupt Collection
The GESAH GAO can be employed to describe works on paper. These are not necessarily limited to drawings and prints; due to the composition of the Albrecht Haupt Collection, other classes such as photomechanical prints, printed (text) materials, manuscripts, musical notations and book illumination are also covered.Footnote 23 We differentiate between the work (cultural object) and its digital representation (digital image), which are described separately and linked to each other.
A cultural object is described using various relationships and properties. Firstly, it is identified by its title and inventory number. If available, catalogue raisonné entries and/or authority data of works of art assist with unambiguous identification. The works are classified by object type, geographical and stylistic categorization and assignment to an object category.
Other features that serve to precisely describe the object are the state of the printing plate or the edition, dimensions, watermarks, collector’s marks and the state of preservation. Any documentation of the work is included in the fields on secondary literature and exhibitions. Eventually, the current holding institution and location are listed, as well as the licenses under which the digitized image and the metadata are made available.
Titles, dimensions, watermarks, collector’s marks, catalogue raisonné entries, exhibitions and licenses are complex description objects consisting of context nodes. This structure allows us not only to store the metadata as texts or simple relationships, but also to classify them.
Events in the history of a work are modelled as activities, from its creation, through changes of ownership (provenance), to current preservation measures. For instance, a print is first conceived as a design during the act of creation (fig. 1) before it is engraved into the printing plate during the production activity, and subsequently issued by a publisher (edition activity). These activities also encompass the used techniques and materials. We understand an activity as a specific action occurring at a particular time and place, carried out by a person or organization occupying a specific role at that moment, the type of which is to be determined. The nature of the attribution to this person or organization can be expressed by the attribution type, e.g. traditional, attributed to or rejected. This is important in cases where there is uncertainty as to the identity of the artist or in cases where the objective is to register both former and current states of research. Therefore, several roles can be linked to the same creation or production activity with different attribution types.

Fig. 1. Visualization of the activity gesah:Creation with object properties and data properties.
A distinct feature of the GESAH GAO is the fact that inscriptions on the works themselves as well as on their mounts and backing boards can also be recorded in a structured form, namely as an activity (fig. 2). This means that the inscriptions can be assigned to individuals and located in time. Furthermore, they can also be differentiated according to their position and classified in terms of content by assigning attributes.

Fig. 2. Visualization of the activity gesah:Inscription with object properties and data properties.
The inscriptions on the works provide valuable information about the history of the collection. Old inventory numbers of previous owners, lot numbers from auctions, attributions, dates or identifications of the pictorial subjects were noted on the sheets. These details reveal much about the provenance of the works and also about the history of their research. Handwritings can be attributed to individuals (fig. 3). Not only Albrecht Haupt himself and the long-time curator Lieselotte Vossnack, but also some prominent visitors to the collection, such as Heinrich Geissler (1927–1990), noted and even signed attributions made on the mounts.Footnote 24 In combination with archive material, the structured digital recording of this information allows for historical reconstructions of the perception and scientific study of the collection.

Fig. 3. Oriental King, accompanied by soldiers, pen and wash, heightened with white on paper, inv. kl D Z 63: 7 (https://sah.tib.eu/display/n766) with attribution by Heinrich Geissler to Johann Wolfgang Baumgartner and traditional attribution to Johann Evangelist Holzer by an unknown hand on the drawing itself at lower right, by Albrecht Haupt on the support at left and Lieselotte Vossnack at lower right.
The subject of the image is recorded using keywords. In the case of people, places, events and real objects, they are recorded separately and stored in standardized data. For architectural representations, the method of representation is also recorded, such as whether it is an orthogonal or a perspective representation, a view or a plan.
The digital images constitute a distinct class. They are assigned their own titles and are grouped in barcode collections. They are given additional information such as old and new folder numbers (reference to the scanned object in the collection), date of scanning and license. The property is_digital_representation of is used to establish the relationship between the image and the cultural object data record.
Technical implementation in TIB SAH digital
The GESAH GAO was implemented and utilized in TIB SAH digital.Footnote 25 The open-source software Vitro was selected as a software for the implementation. Vitro is a web application with integrated ontology, instance editor and customizable public browsing. Vitro is most prominently used as the core of the community-based research information system VIVO,Footnote 26 but it is used in any other areas such as for a philosophical lexiconFootnote 27, as a linked-data based cataloguing toolFootnote 28 and more.Footnote 29 Vitro requires formalized ontologies and vocabularies according to the Web Ontology Language specification. Additionally, it is capable of handling the Simple Knowledge Organisation System.Footnote 30
In this environment, domain experts catalogue drawings, prints and other works manually with the help of specially developed entry forms; they facilitate and speed up the indexing process and also prevent incorrect entries (figs. 4–5). In addition, the public has access to the entire collection, including works that have only been partially catalogued. The collection search function enables users to locate objects by employing both full-text search and predefined filters which are organized into groups for ease of use. Preview images, brief information and a sorting function facilitate the rapid identification of the object searched for or provide a convenient overview.

Fig. 4. Screenshot of TIB SAH digital, the profile of the cultural object https://sah.tib.eu/individual/n5672 with detail of an inscription.

Fig. 5. Screenshot of TIB SAH digital with the custom entry form for inscriptions.
Reuse
The GESAH GAO has been published on GitHub under the Attribution 4.0 International license and is freely available for reuse.Footnote 31 The latest release is version v2024-11-25, with the next release being in planning. The ontology can also be found on the TIB Terminology Service.Footnote 32 A data export was made possible via the LIDO exchange format to the Graphikportal, hosted by Deutsches Dokumentationszentrum für Kunstgeschichte – Bildarchiv Foto Marburg, which acts as a subject-specific aggregator.Footnote 33 This was done in order to foster the dissemination of the collection, the underlying ontology and its software implementation. A mapping to LIDO via XSLT has been created and is released on GitHub,Footnote 34 as well as the public endpoint for harvesting.Footnote 35
Conclusion
It can be argued whether the creation of a local ontology should be favoured over the reuse of an existing one. With the GESAH GAO, we have a data model which is tailored to the needs of domain experts and fits the GESAH collection. Although the ontology was created to be used for this specific collection, it can also be reused by organizations with similar collections.
As the ontology is employed in its implementation by TIB SAH digital and is actively utilized, it is undergoing constant further development. Currently the ontology is being expanded in order to include further holdings such as bound graphic collections and 6000 travel sketches by Albrecht Haupt.Footnote 36 In addition to the necessary extensions in terms of content, genuine ontological improvements are also being sought. There are several controlled vocabularies, containing individuals for assigning as role types, attribution types, materials, techniques etc. These controlled vocabularies are going to be published as well. Furthermore, it is conceivable to map the GESAH GAO to CIDOC CRM and Linked ArtFootnote 37 in the future, thus enabling our flow into the Knowledge Graph of the worldwide Cultural Heritage and to be used in completely different contexts.
Acknowledgments
We would like to thank Susanne Arndt, Ina Blümel, Brian Lowe, Simon Paulus and Hedda Saemann for feedback during the development process of the ontology.
Funding
The work on this publication was partly funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) in the project Digitisation and scientific indexing of the graphic holdings of the architect Albrecht Haupt’s collection (GESAH/GESAH+), project number 424739761.