Free Republic

To: WhiskeyX
The absence of font table information indicates no OCR scan occurred, because an OCR scan creates a font table and exports it to the PDF file.

If the goal were either to make the text in the PDF searchable or to replace it with computer fonts (in other words, a full OCR process), then this would be the case. But like I said, that wouldn't make sense with a document like this. The software here was used to detect text blocks and enhance them accordingly.

111 posted on 08/03/2011 8:34:55 AM PDT by Kleon
[ To 81 ]


To: Kleon

“The software here was used to detect text blocks and enhance accordingly.” - K

And just what software was that?

You do realize that these results have not been replicated in spite of many efforts?


116 posted on 08/03/2011 9:02:41 AM PDT by Triple (Socialism denies people the right to the fruits of their labor, and is as abhorrent as slavery)
[ To 111 ]

To: Kleon

OCR software exists to create the font table, which is how searchable text, or any other form of text, is produced for export to another application. Either you have OCR scan software creating the font table, or you don’t have OCR software or an OCR scan. To have an OCR scan, a font table must be created and exported to the PDF; otherwise it is by definition not an OCR scan.
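
For anyone who wants to check the font table question on a file of their own, here is a rough sketch, not a definitive test. It only looks for the /Font name token in the raw bytes of a PDF. The function name and the sample byte strings are made up for illustration. Font dictionaries can also sit inside compressed object streams, so a False result does not prove that no fonts are present.

```python
import re

# Matches the PDF name token /Font, but not longer names such as
# /FontDescriptor or /FontFile2.
FONT_NAME = re.compile(rb"/Font(?![A-Za-z0-9])")


def mentions_font_resources(pdf_bytes: bytes) -> bool:
    """Return True if the raw bytes contain a /Font name token.

    Assumption: the relevant dictionaries are stored uncompressed.
    Compressed object streams are not inspected here.
    """
    return FONT_NAME.search(pdf_bytes) is not None


# Hypothetical page dictionaries, written by hand for this example.
with_fonts = b"<< /Type /Page /Resources << /Font << /F1 5 0 R >> >> >>"
image_only = b"<< /Type /Page /Resources << /XObject << /Im0 7 0 R >> >> >>"

print(mentions_font_resources(with_fonts))
print(mentions_font_resources(image_only))
```

To try it on a real file, read the file in binary mode and pass the bytes to the function. Treat the answer as a hint only, for the reason given above.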


117 posted on 08/03/2011 9:09:17 AM PDT by WhiskeyX
[ To 111 ]



FreeRepublic, LLC, PO BOX 9771, FRESNO, CA 93794
FreeRepublic.com is powered by software copyright 2000-2008 John Robinson