ADDING NEW CONTENT TYPES TO A LARGE-SCALE SHARED DIGITAL REPOSITORY: Paper - iPres 2010 - Vienna (PHAIDRA - o:185242)
You are here: University of Vienna PHAIDRA Detail o:185242
Title
ADDING NEW CONTENT TYPES TO A LARGE-SCALE SHARED DIGITAL REPOSITORY
Subtitle (en)
Paper - iPres 2010 - Vienna
Language
English
Description (en)
HathiTrust is a collaboration of universities working together to establish a repository that archives and shares their digitized collections. Initially, the Submission Information Packages (SIPs) deposited into HathiTrust were extremely uniform, being constituted primarily of books digitized by Google. HathiTrust’s ingest validation processes were correspondingly highly regular, designed to ensure that these SIPs met agreedupon qualities and specifications. As HathiTrust has expanded to include materials digitized from other sources, SIPs have become more varied in their content and specifications, introducing the need to make adjustments to ingest and validation routines. One of the primary sources of new SIPs is the Internet Archive, which has digitized a large number of public domain materials owned by HathiTrust partners. Many of the technical, structural, and descriptive characteristics of materials digitized by the Internet Archive did not match previously developed standards for materials in HathiTrust. A variety of solutions were developed to transform these materials into HathiTrust-compatible AIPs and ingest them into the repository. The process of developing these solutions provides an example to other organizations that would like to add new types of materials to their repository, but are uncertain of the issues that may arise, or how these issues can be addressed.
Keywords (en)
iPRES
Author of the digital object
Shane  Beers
Jeremy  York
Andrew  Mardesich
Format
application/pdf
Size
167.0 kB
Licence Selected
GPLv3
Conferences
Conference 2010
Type of publication
Article in collected edition
Content
Details
Object type
PDFDocument
Format
application/pdf
Created
27.09.2012 01:02:08
Metadata