27
What we are learning and documenting
•Costs and processes of digitizing paper and building a statistical digital library:
•Scanning requirements (TIFFs, PDF/a, etc)
•OCR of Spanish text
•OCR of numbers into spreadsheets (zoned scanning)
•Quality assessment
•Is it less expensive to key in the tables? What is the ‘tipping point’?
•