Day 2 at CML

Hours: 7.5 Total Hours: 15

Back to work on the City of Columbus collection. I fixed a few more propagation issues, and three incorrect identifiers from my first day, and got those uploaded and verified first thing. I then went on to add 1867-8 through 1874-5 to the collection. A few years are missing from the physical collection, so there are a couple gaps in the digital set.

I came across a few new issues on my second day. First, there were a few volumes with uncropped images right in the middle. Upon investigation, I found that they had been cropped and saved, sometimes twice, so I cleaned up the extra files before importing again. Fortunately, the OCR process identifies files that are too big, and warns us about it, so that these oversights can be found and corrected quickly. I did find one set, in 1874-5 that had not been cropped, so I created cropped JPG images, and TIFF archival copies before moving on. I also discovered that the Approve/Index process was not taking as long as I thought it was, the index page was simply not reloading itself in a timely manner.

Once I got into the rhythm, I found a more efficient way to complete the process. I would start one object, and when I got to the point of uploading it to the server, I started importing the next object. Once the first set was uploaded, I would start the approval/indexing process, while I completed the metadata on the second and started it uploading. At which point the aprroval and indexing was complete on the first, so I verified it and started importing the next object. This kept my flow a little smoother, but creating the PDFs still took a lot longer than any other part of the process.

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.