•Observations
–JHOVE can process 97% of the 57,000 files
•ASCII/UTF-8, HTML, JPEG, WAV, TIF, PDF, GIF, AIFF,
XML
–The PREMIS event model is very flexible, but it is
difficult to determine
the best way to capture provenance metadata
–Data manipulation issues:
•You can FTP 13GB as one file in 3 hours; to FTP it as
57,000 files takes 35+
hours
•Some FTP clients do not like 0 length files
•Some ZIP tools have a file size limitation
•Some network appliance file servers have a file size
limitation
–The data does not include any infected files!