Free Republic
Browse · Search
Bloggers & Personal
Topics · Post Article

To: Kleon
OCR doesn't necessarily mean the resulting document is searchable. The software "reads" text and other elements, but what it does to them is up to the user. Making the document searchable doesn't make much sense unless the text is clear and crisp to begin with, which isn't the case here.

If the application is normally used for the searching of an existing database of image files for the purpose of making copies or fabricating new replacement documents, it is reasonable to believe someone might make use of the text searching feature. What is not reasonable, is to believe there is any good explanation for changing pixel size and dynamic range from one character to another. The only reasonable conclusion is that the output document accurately represents the input image files.

290 posted on 07/21/2011 8:41:25 AM PDT by DiogenesLamp (The TAIL of Hawaiian Bureaucracy WAGS the DOG of Constitutional Law.)
[ Post Reply | Private Reply | To 283 | View Replies ]


To: DiogenesLamp
What is not reasonable, is to believe there is any good explanation for changing pixel size and dynamic range from one character to another.

That's to be expected when running an enhanced scan on documents that aren't uniformly clear. Some characters aren't recognized and get rendered along with the background.

For example, the same thing can be seen in this PDF:


292 posted on 07/21/2011 10:33:07 AM PDT by Kleon
[ Post Reply | Private Reply | To 290 | View Replies ]

Free Republic
Browse · Search
Bloggers & Personal
Topics · Post Article


FreeRepublic, LLC, PO BOX 9771, FRESNO, CA 93794
FreeRepublic.com is powered by software copyright 2000-2008 John Robinson