Academia.eduAcademia.edu
CHAPTER 9 Jeffrey P. Emanuel EC TE D Harvard University, Cambridge, MA, United States PR OO F Stitching Together Technology for the Digital Humanities With the International Image Interoperability Framework (IIIF) 9.1 INTRODUCTION UN CO RR Compartmentalization of information, communication, and effort are an inevitability in an organization of almost any size. Despite wide recognition of the risks they pose, and repeated commitments to cross-divisional collaboration, silos develop quickly, and once in place, it is difficult to effectively work across them with consistent success[1–4]. Similarly, digital material held by li-braries and museums around the world can become trapped in silos, with users afforded very little choice about how they will access and interact with individual objects, and very little consistency in these methods across institutions and repositories. A clear, focused emphasis on collaboration and interoperability, between both institutions and technical approaches, can provide significant relief from this conundrum, and can provide value for organizational members and technological end-users alike. Harvard University is an example of a large organization, with a complex internal structure, that can make interdivisional (and, in some cases, intradivisional) collaboration difficult, particularly on projects of larger scope. Over the last five years, though, a significant digital humanities project has provided several groups at Harvard with an opportunity to work together toward a shared goal: the implementation of a standard delivery framework for visual material in repositories across the university, Digital Humanities, Libraries, and Partnerships ISBN 978-0-08-102023-4, DOI: 10.1016/B978-0-08-102023-4.00009-4 © 2017. 1 2 Digital Humanities, Libraries, and Partnerships PR OO F and the development of an image viewer for internal and external (library, museum, research, and teaching) image display. Despite the difficulty of pulling numerous disparate groups together on a large, multiyear project, the participants in the International Image Interoperability Framework (IIIF) and Mirador projects found tremendous success, and were able to forge a project- and standards-based alliance that was greater than the sum of its individual parts. This chapter will describe that partnership, and its success, with an emphasis on the IIIF and Mirador projects that it was built around. 9.2 THE INTERNATIONAL IMAGE INTEROPERABILITY FRAMEWORK UN CO RR EC TE D Library and museum holdings are being digitized and made available as pixels at an ever-increasing rate. This serves both access and preservation needs, while also increasing the potential for deeper interaction with digital visual material by the end user. However, efforts to share content across institutions, and even across repositories within an institution, continue to encounter numerous obstacles. Aside from rights and permissions, the most significant obstacle may be the differences in the handling of storage, management, and delivery of digital material across repositories, which frequently use separate frameworks, formats, and applications. This is a widely recognized issue, but efforts to resolve it have all too often fallen victim to what has been called “the not-invented-here syndrome”: the conviction that “you and I will collaborate just fine if you adopt my system and abandon yours.”1 For example, a library has an image in a database that a scholar wants to see. This is a simple undertaking, but the difficulty is compounded when that scholar also wants to access an image from another repository. Because of differences in storage and delivery across institutions, that scholar may have to use an entirely different application to view and interact with the second image, thus creating a very different experience. Because of this, the easiest way for people to access, curate, and view images in the environment of their choice has been to download the image and make a local copy. Add this up over the course of time and increasing viewer counts, and the result is tens, hundreds, thousands, or 1 Waters [5], p. 14. Stitching Together Technology for the Digital Humanities 3 CO RR EC TE D PR OO F more copies of individual images, potentially living on hard drives all over the world, in countless different resolutions and formats, and with wide variations in the associated metadata. This is clearly a problem—but how can it be solved? IIIF was designed to address this barrier to access and sharing of content by “promot[ing] the building of a global and interoperable framework by which image based resources could be easily shared and reused across institutions using any combination of image servers or client viewing software.”2 Questions that governed the development of IIIF included: • What if deep zoom were standard and fast? • What if you could compare across sites, hosts, and institutions? • What if you could collect items that belong together physically—not just intellectually (e.g., the miniature from a medieval folio was physically extracted, and lives at a different library than the folio itself)? • What if you could access images using the viewer of your choice, instead of a certain one tied to a particular repository? A community-developed and -maintained protocol for standardized image retrieval, IIIF was created in an effort to make affirmative answers to these questions possible, by collaboratively producing both interoperable technology and a common framework for image delivery. You can think of the countless locally-stored copies of an image as one of a countless number of identical water bottles, filled and shipped to the end user, where it will potentially be isolated from the source and from others forever. The impetus for IIIF was an effort to alter that method of image consumption[7], p. 5: UN The [IIIF] community set out to create pathways for images to flow through systems from their source, rather than trying to get all repositories and service providers to build everything the same way. Digital image content generated and served up by a range of different image servers could be standardized as output (both the individual image and the structure of an image grouping) for re-use…and could flow, rather than be handed off like plastic water bottles. 2 Snydman, Sanderson, and Cramer [6], p. 17. 4 Digital Humanities, Libraries, and Partnerships PR OO F It is axiomatic that academic and cultural heritage institutions are not going to drop their own tools and development efforts en masse in favor of uniformly adopting yet another one-size-fits-all third-party tool. The growing IIIF effort addresses this reality by focusing on common APIs, eschewing the quixotic quest for universal tool adoption in favor of creating a common method of content description and delivery. An API, or an Application Programming Interface, is at its most basic a means for applications to communicate: EC TE D An API is a software intermediary that allows two applications to talk to each other. Each time you use the Facebook app, send an instant message, or check the weather on your phone, you’re using an API. When you use one of these applications, the application on your phone connects to the Internet and sends data to a server. The server then retrieves that data, interprets it, performs the necessary actions and sends it back to your phone. The application then interprets that data and presents you with the information you wanted in a readable way. This is what an API is—this all happens via an API.3 UN CO RR IIIF currently consists of two APIs: image and presentation. The Image API simply brings the pixels, via a structure specifying the image’s source (URL), region (its specific location as a part of the original image), size (zoom percentage), rotation (in degrees), quality (color, grey, or black and white), and format (file type). The Presentation API then brings just enough metadata to drive a remote viewing experience, from the repository name and logo to the table of contents (if applicable) and image thumbnails. This method has several advantages over its download-and-view predecessor. The first is simultaneously the most simple and the most important: no images are leaving their home repositories, and no new copies or versions of them are being created. Instead, the image is accessed virtually and displayed within the end user’s image viewer, with the Image and Presentation API requests specifying just what portion of the original image should be displayed, and how. In this way, images from any repository in the world can be accessed, viewed, and interacted 3 https://www.mulesoft.com/resources/api/what-is-an-api Stitching Together Technology for the Digital Humanities 5 4 5 6 7 UN CO RR EC TE D PR OO F with as though they were stored locally—both individually, and in comparison with other images from the same or from completely different repositories (Fig. 9.1). In other words, circling back to the water bottle analogy, instead of handing off water bottles, users can fill the glass of their choice from the same tap. The number of potential sources is constantly growing: worldwide, over 100 institutions have exposed some or all of their collections via IIIF thus far, for a total of between 335,000,000 and 1,000,000,000 shared images.4 As this community grows, the world’s digital archives are made even more available and accessible to scholars both within and across repositories. A further advantage of IIIF is that the user is not locked in to using a single particular viewer. Many applications now feature IIIF compatibility, including (but not limited to) the Universal Viewer, Leaflet, OpenSeaDragon, and Omeka. Another IIIF-compliant viewer, used for visual material by Harvard and several other institutions, is an open source JavaScript application called Mirador.5 Primarily developed by Harvard and Stanford Universities, Mirador is a “multirepository, configurable, extensible, and easy-to-integrate viewer and annotation creation and comparison environment for IIIF resources, ranging from deep-zooming artwork to complex objects.”6 It “provides a tiling windowed environment for comparing multiple image-based resources, synchronized structural and visual navigation of content using OpenSeadragon, Open Annotation-compliant annotation creation and viewing on deep-zoomable canvases, metadata display, book reading, and bookmarking.”7 In briefer terms, Mirador currently provides scholars who work with digitized objects with a responsive, high-resolution viewer for digital objects [8]. Images from any repository that have been made accessible via the IIIF APIs can be viewed individually or together as a slideshow, while books can be viewed as individual pages or displayed in two-page view, complete with “opening” to emulate what has historically been one of the most important experiential aspects of the codex ([9], p. 51). OpenSeadragon’s deep zooming functionality allows large objects to be viewed with ease, while the layout of the viewer can be altered by adding http://iiif.io/community http://iiif.harvard.edu/mirador http://projectmirador.org/ http://projectmirador.org/; openannotation.org/ https://openseadragon.github.io/; http://www. Digital Humanities, Libraries, and Partnerships PR OO F 6 EC TE D Figure 9.1 Side-by-side comparison of digitized statues from two repositories, the Harvard Art Museums (left) and the Yale Center for British Art (right), accessed via IIIF API and displayed in the Mirador viewer. CO RR and removing “slots,” creating a grid of separate images so that multiple items from one or more repositories, or multiple aspects of a single item, can be examined in a single window (Fig. 9.2). UN Figure 9.2 A medieval Book of Hours (MS Richardson 7, Houghton Library, Harvard University) displayed in the left slot in Mirador’s “Book View,” with page thumbnails and table of contents included. In the slot at right is the denoted portion of the illumination on folio 15 recto, displayed at full zoom level (highlighting added). Stitching Together Technology for the Digital Humanities 7 9.3 ONE FRAMEWORK TO RULE THEM ALL: IIIF AT HARVARD UNIVERSITY 9.3.1 Risks EC TE D PR OO F Within Harvard, a multiorganizational effort to adopt IIIF has resulted in digitized content from multiple sources being exposed via a shared API, from the Harvard Art Museums and Library, the Faculty of Arts and Sciences (FAS), and the University’s massive open online course provider, HarvardX (Fig 9.3) [10,11]. The number of objects being made available is remarkable: the library has now exposed over 195,000 books and manuscripts, and over 14,000,000 total images, while the Art Museums have exposed over 250,000 art objects as IIIF manifests. In addition, image collections used in residential courses within the FAS and image content contained in HarvardX-produced learning experiences is also accessed via IIIF API and displayed in Mirador. Further, by sharing knowledge, expertise, digital content, and Mirador, multiple “heads” have sprouted: an image viewer and annotation application for HarvardX courses, a new Harvard Library Viewer, faculty image collections embedded in the Canvas course platform, and walls of images in the Harvard Art Museums, all making use of IIIF and instances of the Mirador viewer. 8 9 UN CO RR Such an undertaking was not without risk, of course, and in many ways, the undertaking on the planning, coordination, and development side mirrored, in human form, the aforementioned technical challenges that led to IIIF in the first place. The solution was similar, as well: greater coordination and interoperability between component parts being key. With its 12 schools, 70 libraries,8 and independent networks of museums,9 the University has a complex internal structure, which can result in communication gaps and replication of effort even in the smallest under http://asklib.hcl.harvard.edu/faq/82186 The major museum networks are the Harvard Art Museums (http://www. harvardartmuseums.org/), made up of the Fogg, Busch-Reisinger, and Arthur M. Sackler Museums, and the Harvard Museums of Science and Culture (hmsc.harvard.edu), which consists of the Harvard Museum of Natural History, the Peabody Museum of Archaeology and Ethnology, the Semitic Museum, and the Collection of Historical Scientific Instruments. Digital Humanities, Libraries, and Partnerships PR OO F 8 EC TE D Figure 9.3 Multiple uses of Mirador at Harvard, including library exhibits, library page delivery, residential and online learning, and museums, all utilizing the IIIF APIs to access images. UN CO RR taking. One of the risks identified in committing both to adopting IIIF and to helping drive the development of Mirador was the large number of organizations involved (both internally and externally), and the multiple handoffs in development that could result. An additional risk was the externally collaborative nature of the program, and the fact that Harvard did not have control over the IIIF and Mirador road maps and resources. IIIF and Mirador were “born” at Stanford University, which utilized funding from the Andrew W. Mellon foundation to finance work on both the IIIF specifications and the Mirador application. Further, the number of people acting in project management roles greatly outweighed the number of developers dedicated to the project. The sole software engineer working on IIIF and Mirador on a full-time basis was located at HarvardX, while management of the project extended from HarvardX to the Libraries—via the Preservation Services division and Library Technology Services, which is organizationally a part of Harvard University Information Technology (HUIT)—to the Arts and Humanities Research Computing group, which is associated with both HUIT and the FAS’ Arts and Humanities Division. This had the potential to create a “too many cooks in the kitchen” situation in every phase of the project, from requirements gathering, to development, to implementation. Stitching Together Technology for the Digital Humanities 9 9.3.2 Benefits EC TE D PR OO F Fortunately, the “too many cooks” issue resolved itself with surprising ease: the HarvardX software engineer hired to work on the project rapidly ascended to a position of product ownership within the university, and leadership within the broader IIIF consortium. This allowed the project management team to focus less on management and more on coordination and support, sharing use cases and project while acting as a united front to drive prioritization and development. This also allowed Harvard to move quickly on time-sensitive needs. For example, when outside assistance was needed to update the coordinate system used in OpenSeaDragon, the image display technology used in Mirador, it was funds provided by Academic Technology at Harvard that allowed the contract with an OSD developer to be finalized and executed in a timely fashion. UN CO RR On the other hand, the potential benefits of IIIF for the university were clear. This approach could open Harvard’s library, museum, and course-based digital content for reuse over the web, while also allowing the university to reuse digital content from peer institutions. Use cases spanned the continuum from teaching, to research, to library and museum exhibits. For example, a “virtual manuscript” could be compiled using pages held by both Harvard and Yale; Harvard and British Library copies of the same work could be compared in great detail within the same viewer, and online exhibits—or virtual companions to physical exhibits—could be created using material from multiple institutions. The exhibit-based research case was exemplified by the 2016 project “Beyond Words: Illuminated Manuscripts in Boston Area Collections,” an exhibit that spanned three institutions: Harvard’s Houghton Library, Boston College’s McMullen Museum, and the Isabella Stewart Gardener Museum. “Beyond Words” featured more than 260 manuscripts and printed books from 19 Boston-area collections, dating from the 9th to the 17th centuries (http://beyondwords2016.org). Co-led by Harvard’s faculty champion of IIIF and Mirador, Jeffrey Hamburger (Kuno Francke Professor of German Art and Culture), the exhibit was set up with both material and virtual components: physical codices on display at each exhibit location were accompanied by tablets running Mirador, thus allowing visitors to appreciate the manuscripts’ materiality 10 Digital Humanities, Libraries, and Partnerships CO RR EC TE D PR OO F and construction, while also being able to page through each, and to appreciate images and text more clearly though OpenSeaDragon’s deepzooming functionality. For the Harvard Library, IIIF provided further advantages. Firstly, the API-based framework offered a more efficient and standardized way to deliver visual material from the Digital Repository, and allowed Harvard scholars easier access to other institutions’ collections. Secondly, Mirador offered the library the much-needed opportunity to replace its outdated Page Delivery Service (PDS), which had been used for years to deliver digital material despite its lack of suitability to noncodex formats (whether codices or another physical form, like scrolls, the PDS uniformly delivered visual material as paged objects which had to be clicked through for viewing). In teaching and learning, on the other hand, IIIF and Mirador offered a highly interactive viewing experience for visual material while affording faculty and course staff the opportunity to dynamically access library and museum material, and their own collections, without having to replicate images at every turn. The Art Museums’ attraction to IIIF came from its ability to help meet three particular goals: enhancing the public’s desire to view the physical objects in the museums’ galleries, expanding image comparison capability in the museums’ digital tours platform, and serving as a study in the effort to demonstrate that museum data can be interoperable. As IIIF standards further mature, the Art Museums anticipate using this framework to deliver three-dimensional objects, both for viewing and for virtual reconstruction, as well as complex living documents like curatorial object files and archives. In this way, the tablet approach utilized in the “Beyond Words” exhibit can likewise be applied to physical art exhibits. 9.4 CONCLUSION AND NEXT STEPS UN Despite its breadth and complex internal structure, several organizations within Harvard have found IIIF and the Mirador viewer to be a standard around which they can rally in collaborative and communicative fashion. From the Library to the Art Museums to the classroom—both residential and virtual—Harvard has embraced the shared-API approach that IIIF offers for image delivery and retrieval, and the openness that comes with being part of a hundred-institution consortium dedicated to the sharing of digital visual material. Stitching Together Technology for the Digital Humanities 11 REFERENCES EC TE D PR OO F The potential next step of this process for the University, which has just completed the requirements-gathering phase, is to construct an IIIF-compliant scholars’ workspace. This workspace would allow scholars to take full advantage of IIIF and Mirador, while affording them the ability to collect, store, share, annotate, and arrange high-resolution digital images from multiple repositories worldwide for both teaching and research [12]. Drawing on the remarkable interoperability afforded by IIIF and the deep interaction that Mirador brings to the study of digital visual material, this workspace would provide users with the equivalent of a digital office or library carrel holding personal images, as well as references to, and notes on, as many web-accessible digital works as their research or class projects require. UN CO RR [1] R.M. Kanter, Collaborative advantage: the art of alliances, Harvard Business Rev. 4 (1994) 96–108. [2] A. Kezar, Redesigning for collaboration within higher education institutions: an exploration into the developmental process, Res. Higher Educat. 46 (7) (2005) 831–860. [3] L. Marchese, How the ‘silo effect’ is hurting cross team collaboration. <https:// blog.trello.com/tips-to-improve-cross-team-collaboration>, 2016, May 10 (accessed 1.03.17). [4] R. Schaubroeck, F. Holsztejn Tarczewski, R. Theunissen, Making collaboration across functions a reality. McKinsey Quarterly, March 2016. <http://www. mckinsey.com/business-functions/organization/our-insights/ making-collaboration-across-functions-a-reality>, 2016 (accessed 1.03.17). [5] D.J. Waters, Digital humanities and the changing ecology of scholarly communications., Int. J. Human. Arts Comput. 7 (2013) 13–28. [6] S. Snydman, R. Sanderson, T. Cramer, The International Image Interoperability Framework (IIIF): A community and technology approach for web-based images. Paper presented at the conference Archiving 2015, Los Angeles, CA, May 19–22. <http://purl.stanford.edu/df650pk4327>, 2015 (accessed 30.09.17). [7] W. Ying, J. Shulman, Bottled or tap? A map for integrating the International Image Framework (IIIF) into Shared Shelf and ArtStor, D-Lib 21 (7/8) (2015) 1–12. [8] V.J. Harward, J. Hamburger, J.P. Emanuel, R. Singhal, R. Stern, Oculus: Using open APIs to share Harvard’s digitized books and manuscripts. Paper presented at the 4th Annual Harvard University IT Summit, Cambridge, MA, June 4, 2014. [9] J.F. Hamburger, Openings, in: G.C. Kratzmann, J.F. Hamburger, R. Minion (Eds.), Imagination, Books & Community in Medieval Europe, Macmillan, South Yarra, 2009, pp. 51–133. [10] R. Stern, J.P. Emanuel, V.J. Harward, R. Singhal, J. Steward, IIIF as an enabler to interoperability within a single institution. Paper presented at the conference IIIF New York 2016: Access to the World’s Images, New York, NY, May 9–11, 2016. [11] J.P. Emanuel, A. Barrett, J. Steward, R. Stern, One framework to rule them all: The unifying impact of IIIF on teaching, research, museums, and libraries at Harvard. Paper presented at the annual New Media Consortium Summer Conference, Boston, MA, June 13–15, 2017. 12 Digital Humanities, Libraries, and Partnerships FURTHER READING PR OO F [12] V.J. Harward, J.P. Emanuel, Planning Phase for a IIIF-Compliant Scholars’ Workspace for Visual Material. Harvard University ITCRB Grant Proposal (funded FY2016), 2016. UN CO RR EC TE D Emanuel, J.P., Morse, C.M., Hollis, L., 2016. The new interactive: Reimagining visual collections as immersive environments. VRA Bulletin 43 (2), 1–16.