Free Republic
Browse · Search
General/Chat
Topics · Post Article

To: agere_contra

Why can’t they import Word or PDF files instead of scanning hard copies?

Rhetorical question.

I do not expect you to have an answer.


20 posted on 11/26/2020 12:56:31 PM PST by E. Pluribus Unum (You are in far more danger from an authoritarian government than you are from a seasonal virus.)
[ Post Reply | Private Reply | To 1 | View Replies ]


To: E. Pluribus Unum

In our situation, the documents were so old and were not in pdf. They were scanned to pdf. Not to be confused with pdf native documents, of course.


22 posted on 11/26/2020 1:01:03 PM PST by dhs12345
[ Post Reply | Private Reply | To 20 | View Replies ]

To: E. Pluribus Unum

Hi Pluribus - yes you’re correct: DOCx or PDF files can be imported directly into text. And both are usually searchable as individual files (PDF’s might not be).

But sometimes those docs may contain true handwriting.

Imagine for instance an affadavit containing the scan of a hand-marked up work order for the repair of a Georgia toilet. That might need to be scanned for text, or it be ok to leave it as a scan, depending on an organisations workflow.

Case in point. I once had the job of creating a database using text that I OCR-ed from hand-written work-orders scanned to doc files.

These work-orders detailed railway incidents and remedial work on those incidents. The orders obviously had massive potential significance due to legal liability. My work made them searchable.

Data collection tools are far more digitised these days, but not everybody can wave a tablet or a phone and gather all they need from a crash, a bridge-strike location, the site of a leak etc.

Handwritten forms are still a big deal. I can certainly imagine a large organisation bound by Government regulations retaining a safe catch-all way of doing things. At least as a fall-back.

Schools, Hospitals, Nuclear facilities, Courts - the risk of missing a reference by not OCR-ing everything might be enormous.

And you always have the clean PDFs to work from if you need to. I guess that OCR-ed text is for searching on, not for presenting to a Judge.


26 posted on 11/26/2020 3:08:29 PM PST by agere_contra (Please pray for Pope Benedict XVI)
[ Post Reply | Private Reply | To 20 | View Replies ]

Free Republic
Browse · Search
General/Chat
Topics · Post Article


FreeRepublic, LLC, PO BOX 9771, FRESNO, CA 93794
FreeRepublic.com is powered by software copyright 2000-2008 John Robinson