Roster of Contributed Talks:
- Andrew Berger
- Ben Fino-Radin
- Bonnie Gordon
- Christopher (Cal) Lee
- Mark Leggott
- Jenny Mitcham
- Courtney Mumma
- Mike Shallcross and Max Eckard
- Kari Smith
- Nathan Tallman
- Bradley Westbrook
- Kam Woods
Implementing Archivematica at the Computer History Museum
Andrew Berger, Computer History Museum
The Computer History Museum has recently implemented Archivematica as part of its digital repository. This presentation will give an overview of the implementation process, with particular attention to software selection and testing, resource allocation and staffing, and workfow development. I will also briefly discuss the use of other open source tools to carry out the steps of preparing packages for submission to Archivematica.
Assessment, Acquisition, Ingest, and Beyond: MoMA’s Open-Source Stack
Ben Fino-Radin, Associate Media Conservator, MoMA
When setting out to build a digital repository to house it’s digital collections, the Museum of Modern Art began with several years worth of information gathering with internal stakeholders in order to rigorously document the needs and requirements of facilitating digital collections care, management, and preservation for time-based media and born-digital artworks. After the production of a comprehensive requirements document, the decision was made to investigate the potential for existing open-source systems and tools that could fulfill these established needs and requirements. Long-term viability and sustainability were at the core of every aspect of the project from the interdepartmental and interdisciplinary team of advisors at MoMA that steered the project, to the fundamental involvement of outside field experts, and the decision to leverage and adapt existing, open-source software. It was understood that the adoption popular off-the-shelf, free and open source software would be a much greater boon to this mission of sustainability than the adoption (or creation) of monolithic proprietary solutions. MoMA found that Archivematica, an existing FLOSS (Free, Libre, Open Source) system fulfilled a great deal of the museum’s documented requirements – this system facilitated the ingest process, creating Archival Information Packages that would enable long-term preservation of the collection – however it left requirements focused on repository management (reporting, curation, access, and analysis) unfulfilled. To meet these remaining requirements, MoMA worked with the creators of Archivematica to build a new free and open source system called.
This presentation will offer a look at how Binder facilitates effective repository management, how MoMA has been leveraging the unique APIs offered by Archivematica and Binder to more effectively manage it’s workflows and activities, and how the museum has integrated these tools with existing proprietary systems deployed internally at MoMA.
Note: A demonstration of Binder, is available here: https://www.youtube.com/watch?v=TelwvLkt84 and technical documentation may be found here: http://binder.readthedocs.org/en/latest/
Integration of Open-Source Systems at the Rockefeller Archive Center
Bonnie Gordon, Rockefeller Archive Center
There will be a discussion and brief demonstration of how the Rockefeller Archive Center’s digital preservation system (Archivematica) is integrated with its collections management system (ArchivesSpace). This Archivematica/ArchivesSpace integration was developed for the RAC by Artefactual and allows for Archivematica to pass rights information and technical metadata to ArchivesSpace through automated processes.
- The demonstration will cover
- An overview of integration settings available in Archivematica
- Creating PREMIS rights statements during ingest
- Matching digital access objects in Archivematica to archival description in ArchivesSpace
- The resulting resource record in ArchivesSpace with DAOs attached
This discussion will focus on how this integration fits into our processing workflows. Our descriptive and rights information is managed and stored in ArchivesSpace, and this integration allows us to automate the creation of PREMIS rights statements added during the ingest process in Archivematica in ArchivesSpace. Additionally, there will be a discussion of the issues
and challenges we’ve encountered. These include the shortcomings of a process whose requirements focused on digitized materials and the challenges of developing open source software with a third party vendor.
Demonstration of the BitCurator Environment
Christopher (Cal) Lee, University of North Carolina
This demonstration will highlight several key steps in a digital curation workflow that incorporates digital forensics tools and methods. Using the open-source BitCurator environment, I will demonstrate several discrete tasks, how they can feed into each other, and considerations related to incorporating them into a larger set of curation practices within collecting institutions. A strong emphasis will be placed on features of the software that have been added or enhanced over the past year, including mounting and exporting of files from forensically packaged disk images, identification of duplicate files, generation of PREMIS metadata and initial steps toward redaction of potentially sensitive information.
Digital Preservation with the Islandora Framework
Mark Leggott, University of Prince Edward Island
I will present on a couple of case studies for institutions using Islandora/Fedora as preservation tools, including:
– The University of PEI, which has built an Islandora module called Vault, which provides integration with the preservation services from DuraSpace via DuraCloud. UPEI has also developed a substantial backend workflow using the Taverna system, which allows for the creation of standards-based workflows that can be shared with other Taverna users. Most of UPEI’s digitization efforts, including object transformations, are orchestrated by this system.
– METRO New York, which has developed an integration between Islandora and DPLA.
– Simon Fraser University, which has developed a number of preservation-focused modules for Islandora, including checksumming and PREMIS features.
– The University of Saskatchewan, which worked with discoverygarden and Artefactual to integrate Islandora with Archivematica.
The presentation would also discuss Islandora’s support for the OAIS Reference model.
“Filling the Digital Preservation Gap” with Archivematica
Jenny Mitcham, University of York
The Universities of Hull and York are currently working a project (funded by Jisc) to investigate how we can use the open source digital preservation solution Archivematica to “fill the digital preservation gap” in our current infrastructures for managing digital data. Neither institution is a stranger to open source software, both having established digital repositories using Fedora and now Hydra. The Borthwick Archives at York have also very recently adopted AtoM as their archival management system.
Both institutions have been investigating how they might add digital preservation functionality into their systems for digital data and Archivematica seems to fit this gap well. Our Jisc funded project looks specifically at how we manage the research data that academics at our Universities produce, though both institutions also have other use cases in mind for Archivematica.
This case study will discuss the findings of phase 1 of our project and cover some of the basics of how Archivematica can help preserve research data. Phase 2 of the project is now underway and we are working with Artefactual Systems to enhance Archivematica. These developments include some work to help Archivematica integrate with our own Fedora repositories, though these developments are not specific to fedora and should also help others with alternative repository systems in place.
During phase 2 we are also looking in more detail at exactly how we want to fit the different elements of our preservation and access systems together and how we envisage data flowing through them. This implementation plan will help inform the work that we hope will be funded next year as we attempt to get our proof of concepts up and running.
Changing your Preservation Planning Rules in Archivematica
Courtney Mumma, Internet Archive (on behalf of Artefactual Systems)
Archivematica uses a database of rules called the Format Policy Registry (FPR), which describes commands to execute on various file formats for digital preservation activities such as identification, characterization and normalization. There is a vision to make the FPR a shared resource where individual institutions can share their local preservation planning rules and commands through a shared web application. Until that vision is realized, local changes can be made on an installation-by-installation basis. In this demonstration, This demonstration will show how to write custom rules and commands in Archivematica by creating a new rule and command to extract attachments from an mbox email archive.
ArchivesSpace-Archivematica-DSpace Workflow Integration Project
Mike Shallcross and Max Eckard (Bentley Historical Library, University of Michigan)
The Bentley Historical Library collects and preserves unique materials related to the University of Michigan and the state as a whole. These holdings include more than 20 TB of digital content, with extensive web archives, born-digital archives, and digitized collections of print, photographic, and audio-visual materials. As part of its mission, the Bentley Historical Library is committed to ensuring the preservation and accessibility of this content over the long-term by implementing professional best practices and standards in its workflows and infrastructure.
In recent years, ArchivesSpace and Archivematica have emerged as two of the most exciting open source platforms related to digital archives. The former manages accessions and collections and provides a framework for entering descriptive, administrative, rights, and other metadata. The latter ingests digital content and prepares information packages for long-term preservation and access. In April 2014, the Bentley Historical Library received a $355,000 grant from the Andrew W. Mellon Foundation to integrate these two systems in an end-to-end workflow that will include the automated deposit of content into a DSpace repository.
- The goals of this project include:
Integrate open-source software systems to largely automate digital archives workflows from accession through the deposit of Archival Information Packages (AIPs) in a repository.
- Develop a new Appraisal and Arrangement feature in Archivematica so archivists may review content, create intellectual arrangements, and associate materials with archival objects from ArchivesSpace.
- Facilitate the creation and reuse of metadata among platforms, including archival description, administrative information, and PREMIS rights.
- Ensure that new features and functionality are modular so that other institutions may adopt some or all project deliverables.
- Share all code and documentation with the archives and digital preservation communities.
At the workshop panelists will give an introduction to the project, demonstrate a prototype of the new Appraisal tab in Archivematica which will be integrated with a live instance of ArchivesSpace, and outline future development efforts.
Inefficiencies with Ingest
Kari Smith, MIT Libraries
This presentation will discuss challenges that MIT Libraries is having using open-source digital archives and preservation tools for preparing files for ingest. In particular, trying to combine QC task automation through the use of BitCurator, Archivematica, and external tools is proving challenging and leading to greater inefficiencies than expected. The presentation will outline the issues, describe the use case, and suggest some ways forward while soliciting ideas from the participants.
Digital Preservation Workflows/Mapping OAIS
Nathan Tallman, University of Cincinnati Libraries
The University of Cincinnati Libraries presently publishes digital content in multiple platforms: hydra/fedora, DSPace, LUNA, HTML, and filesystems. We are working towards storing all digital content in hydra/fedora repositories, but aren’t there yet. Knowing tempus fugit where digital preservation is concerned, we must act now or accept an increased risk. The challenge lies in creating uniform and standard Archival Information Packages from these disparate sources and then logistical management storing the content locally and in an external preservation repository. At UC Libraries, we tested the waters with Bash scripts for processing DSpace content but now realize this approach won’t scale. We are actively exploring the use of Archivematica to be a central conduit through which all of our content flows. Through its pipelines and workflows, Archivematica seems well suited to create standard AIPs, but we will need to introduce some customizations for required by our preservation repository. Archivematica’s integration with ATOM may help in the logistical management as well, though I admit concern to having to keep too many duplicate copies locally (costs!). I welcome the opportunity to discuss these workflows with other preservation professionals and to possibly find better solutions.
ArchivesSpace: Supporting Digital Curation
Brad Westbrook, ArchivesSpace
As postulated in the Bentley Library’s to build an Archivematica > ArchivesSpace > DSpace workflow (described elsewhere on this page), the value of the ArchivesSpace application to data curation is as an application for enhancing descriptions of born digital content, managing collections of born digital content, and supporting discovery of those contents through the ArchivesSpace public interface and exportable metadata documents that can be shared with other access systems. Referencing a collection of born digital astronomy research, this presentation will illustrate how ArchivesSpace can be used to describe and arrange items in the collection and how those materials can be delivered via ArchivesSpace. The presentation will suggest some of the kinds of collection management functions that are available in the ArchivesSpace application.
Kam Woods, University of North Carolina
The BitCurator Access project is developing software to assist libraries, archives, and museums in providing web-based and local access to born-digital materials held on disk images. In this presentation we will describe and demonstrate bca-webtools, a web-based application that allows users to browse file systems contained in raw and forensically-packaged disk images, download items of interest, generate exportable metadata, and search the indexed contents of certain file types. We will discuss ongoing work in expanding this tool to provide interactive mechanisms for administrators to identify and mask out or redact file-level items containing private and sensitive data. Finally, we will show how application can be deployed easily as a local server in a VM, or remotely using common cloud infrastructure providers.