Topic: convert to searchable PDF Pages that link to <a href="https://ozoneasylum.com/backlink?for=29041" title="Pages that link to Topic: convert to searchable PDF" rel="nofollow" >Topic: convert to searchable PDF\

 
Author Thread
CPrompt
Maniac (V) Inmate

From: there...no..there.....
Insane since: May 2001

IP logged posted posted 03-12-2007 15:36 Edit Quote

Does anyone have a suggestion on how to convert either a TIFF or PDF to a searchable PDF?

I know Acrobat Pro can do it but I just need something for one large document and then never use again. If there is something free for linux, that would be great. If not...maybe a trial version of something that won't watermark?

Thanks in advance!

Later,

C:\

White Hawk
Maniac (V) Inmate

From: zero divided.
Insane since: May 2004

IP logged posted posted 03-12-2007 17:14 Edit Quote

Probably not quite what you're looking for, but I found examples like this one (uses OCR on images to make text-searchable PDFs) on VeryPDF.com ...

Okay, so they're not necessarily free, and probably not linux-specific, but that particular one comes with a 30-day money-back thingymajig. Perhaps you'll even decide to keep it. Who knows?

____
ZZZZZZZZZZZZZZZZZZzzz.....
Microsoft have discovered a cure for cancer! They'll release it as soon as they find a way to stop anyone using it...

reisio
Paranoid (IV) Inmate

From: Florida
Insane since: Mar 2005

IP logged posted posted 03-13-2007 01:30 Edit Quote

gocr, ocrad, clara, ocre

I think it was one of gocr or ocrad that was decent.

CPrompt
Maniac (V) Inmate

From: there...no..there.....
Insane since: May 2001

IP logged posted posted 03-13-2007 22:37 Edit Quote

reisio,

have any idea how I can take the output from gocr and make it a searchable pdf? From what I can see of gocr, it makes an TeX HTML XML and ASCII files.

Later,

C:\

ugghhh...gocr only takes PAM, PPM, PGM or PBM files. I have TIFF or PDF only.

(Edited by CPrompt on 03-13-2007 22:45)

reisio
Paranoid (IV) Inmate

From: Florida
Insane since: Mar 2005

IP logged posted posted 03-14-2007 00:17 Edit Quote

TIFF or PDF? Why?

Last I checked you can search text...more easily than you can a PDF, even.

Any decent image converter can handle those formats.

CPrompt
Maniac (V) Inmate

From: there...no..there.....
Insane since: May 2001

IP logged posted posted 03-14-2007 03:02 Edit Quote
quote:

reisio said:

TIFF or PDF? Why?




I am trying to figure out a different solution a problem one of our clients is having. The hard copies of the documents are scanned and they can only be TIFF or PDF. Limitations of the scanners. After that, the document needs to be turned into a searchable PDF. For some reason, these documents are not ocring. I've never had a problem with ocr before. If it came to a document that it couldn't ocr, the software would tell you and move on to the next page. It just craps out all over itself though.

At any rate, I have tried a *bunch* of software to try to convert these documents to searchable pdf's. Nothing can do some of them. Nature of ocr though.

thanks for the help!

Later,

C:\

reisio
Paranoid (IV) Inmate

From: Florida
Insane since: Mar 2005

IP logged posted posted 03-14-2007 03:47 Edit Quote

We use ABBYY FineReader at work.



Post Reply
 
Your User Name:
Your Password:
Login Options:
 
Your Text:
Loading...
Options:


« BackwardsOnwards »

Show Forum Drop Down Menu