I've tried 3 different OCR apps, including one that seems to be tops (Abbyy Fine Reader Pro), and find the job too much. The software is easy to use, but the conversion is far too time consuming at my skill level.
See the attached pdf (38 pages of typed text). I have it and 5 others that need to be converted but am finding it too overwhelming.
If there's someone out there who can convert it to [accurate] text (.doc, .txt, .html, etc.), please give us a bid.
We are a nonprofit organization on a limited budget. I am eager to present the best bid to the group to see if we can get these indexes put into text format.
... otherwise I'm left spending hours on it. I can't help but guess that some OCR guru out there can do it in minutes...
Just as an aside, OCR software is not needed here. What you would need is PDF converter software. OCR is used when you have a scanned image from hard copy that you want to convert to editable text.
Zamzar is not an OCR program, just a file conversion program. It did not convert the old typed text in the pdf to text; it just put the pdf "image" into a .doc and .html file.
My purpose is to get the pdf file to actual text...
Thanks, though, seahawk. I do have other uses for a quick and easy file conversion app.
Just as an aside, OCR software is not needed here. What you would need is PDF converter software. OCR is used when you have a scanned image from hard copy that you want to convert to editable text.
? Ocr can be used for a pdf file or scanned image... and in most cases the software I use does well... but research has shown me that even good ocr software does poorly with old typewriter text, which is far more irregular than computer typed text.
The software I use does a perfect job when the scanned image or pdf is of computer text... but it just hates the old typewriter text...
I'll keep looking... and look too at "pdf converter" apps to see if there's a difference.
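On whether a "pdf converter" would behave any differently from OCR: that depends on whether the pdf holds real text or just page images. Below is a rough sketch of a way to guess, using nothing but the Python standard library. The function name `looks_image_only` is made up for illustration, and the check is only a crude heuristic (it assumes the pdf's objects are not packed into compressed object streams, and a scan that already carries a hidden OCR text layer will show up as "has text"). A proper PDF library would be more reliable.

```python
def looks_image_only(pdf_bytes: bytes) -> bool:
    """Crude guess: True if the PDF seems to hold page images but no fonts."""
    # Text drawn on a page needs a font resource, so a file with no /Font
    # entries most likely has no text layer to extract.
    has_font = b"/Font" in pdf_bytes
    # Scanned pages are stored as image objects, marked /Subtype /Image.
    has_image = b"/Image" in pdf_bytes
    return has_image and not has_font
```

To try it, read the file in binary mode (`open(path, "rb").read()`) and pass the bytes in. If it returns True, a plain converter will most likely just carry the picture across, and OCR is the only route to editable text.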
I know that OCR can be used on PDFs that are the result of scanned hard copy. But it's still a matter of the source being a scanned image.
Typewriter text, if clean, is usually read well. The copy you have is not that clean. I used Paperport to convert to plain text. It's not 100% but pretty good.
What a bummer. I'd have thought that ocr software has "come a long way" since I purchased Abbyy Fine Reader... but the latest review at pcmag is from 2002 and the latest from cnet is from 2000! Gosh. Surprising...
Still hunting :)
PS. Wikipedia needs updating. The latest entry says modern OCR software is 99% accurate on typewritten text. In my experience with old typewritten pages, that is very wrong.
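For anyone who wants to put a number on claims like "99% accurate" or "pretty good", here is a minimal sketch, assuming you hand-type one sample page as a reference and compare the OCR output against it. It uses `difflib` from the Python standard library; the function name `char_similarity` is my own. Note that the score is difflib's similarity ratio, which is close to but not the same thing as a formal character error rate.

```python
import difflib


def char_similarity(ocr_text: str, reference_text: str) -> float:
    """Return a 0..1 similarity between OCR output and a hand-typed reference."""
    # Collapse runs of whitespace so differences in line wrapping
    # are not counted as OCR errors.
    ocr = " ".join(ocr_text.split())
    ref = " ".join(reference_text.split())
    if not ocr and not ref:
        return 1.0
    # autojunk=False stops difflib from ignoring very frequent characters
    # in long texts, which would distort the score.
    matcher = difflib.SequenceMatcher(None, ref, ocr, autojunk=False)
    return matcher.ratio()
```

Running this on the same sample page for each OCR package gives a like-for-like comparison, which is more useful than going by the vendor's figure.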
Ok, OCR packages work on a graphical image file. If it's already text, there is no need to convert. Many scan programs will save the scanned image as a pdf. So the key is the source of the pdf.
Like I said, your problem is not so much the typewriter typeface, but the crispness of the scanned image. That's what's causing the errors.
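If crispness is the problem, one thing sometimes tried before OCR is to binarize the scan, that is, force every pixel to pure black or pure white so faint or smudgy strokes stand out. The sketch below uses Otsu's method to pick the cutoff. This is a swapped-in technique that nobody in this thread has tested on the attached pages, and the function names are only for illustration. It assumes you already have the page as a flat list of grayscale values from 0 (black) to 255 (white).

```python
def otsu_threshold(pixels):
    """Pick a grayscale cutoff (0..255) that best separates ink from paper."""
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1
    total = sum(hist)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_score = 0, -1.0
    dark_count, dark_sum = 0, 0
    for t in range(256):
        dark_count += hist[t]
        dark_sum += t * hist[t]
        if dark_count == 0:
            continue  # no pixels at or below t yet
        light_count = total - dark_count
        if light_count == 0:
            break  # every pixel is already in the dark group
        dark_mean = dark_sum / dark_count
        light_mean = (sum_all - dark_sum) / light_count
        # Between-class variance (up to a constant factor): larger means
        # the dark and light groups are better separated.
        score = dark_count * light_count * (dark_mean - light_mean) ** 2
        if score > best_score:
            best_score, best_t = score, t
    return best_t


def binarize(pixels, threshold):
    """Map pixels at or below the threshold to black (0), the rest to white (255)."""
    return [0 if p <= threshold else 255 for p in pixels]
```

The cleaned-up image would then be fed to the OCR package in place of the original scan. Whether it helps depends on the scan; badly broken or filled-in typewriter characters may not improve.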
That's right. In using the supposedly best ocr software out there (Abbyy Fine Reader) I scan the typewritten document, in ultra high quality, to pdf or jpg but the results are the same: Poor.
Yes, the "crispness" is key. Typewriter text is notoriously "uncrisp"... which is the issue.
Can you suggest an app that will turn the attached (sample in both pdf and jpg format) typed text to editable text?
Tiff is no better. After digging through the help, and checking forums I see that many are in the same boat as I: With old typewritten text, there is much "training" needed for the software to convert it to editable text.
I spent 2 hours "training" it and only got to page 6 of 38...
So maybe that's the best that can be done with today's technology...