Scanning typewritten text
We have a zillion dockets that need entering into a very simple ExCel spreadsheet:
-Date commenced (eight digits,typewritten)
-Author (typewritten)
-subject matter(typewritten)
-location(typewritten)
-docket number (four digits, handwritten)
-date finalised(eight digits, handwritten)
All the dockets are 50 to 75 years old. They have some "foxing" but are otherwise very well archived.
We could go on simply typing directly into ExCel, but the fact is it will take four of us at least 20 to 30 years to finish at the current rate
We have access to a good Canon scanner with standard OCR.
Any ideas how we can tweak this to improve accuracy:
-Is PDF converter software better to use than OCR,here?
-Will tweaking the contrast, resolution etc and using overlays decrease the error rate?
John B