Automated Risk Assessment
of File Formats
Hannah Frost & Nancy Hoebelheinrich
Stanford University
Digital Library Federation
Spring Forum 2006
Austin, Texas
April 11, 2006
Stanford Digital Repository
As many of you probably know, Stanford was involved in the Archive Ingest and Handling Test, a project of NDIIPP in 2004-05.
We collaborated with LC, Harvard, Old Dominion, and Johns Hopkins on ingesting and disseminating the 9/11 archive collected by George Mason Univ.

That project has been covered here at DLF forums in the past, and the Dec 2005 issue of D-Lib Magazine was devoted to the subject. Official project reports are available from the NDIIPP web site.

So we are not here to report on that project again.

Nancy and I do want to share with this audience some of the specifics of a methodology that our team developed in the course of the AIHT project to automatically assess files in a digital collection for risks associated with their long-term preservation. I’ll be discussing the theory and concepts underlying the method and walking through the process.

Nancy will discuss the metadata implications of the assessment process, how we recorded the assessment in METS using PREMIS, and some of the issues regarding managing metadata over time.