General discussion


OCR scan images embedded in a PDF

By thomson9 ·
I have a project I'm working on that involves ~1200 pages of PDF documents. Embedded in the PDF's are scanned images of papers, articles, and lectures from the late 1800's.

I'm looking for a Windows software product that will scan the images inside the PDF files using OCR and output to either text or a Word document so I can do searches on the contents without having to pull out the 1200 or so images into separate graphics files for OCR scanning.

This is a personal project so the budget is tiny. Free would be good. I don't have Acrobat to separate out the layers so that's not an option.

Any help will be greatly appreciated.

This conversation is currently closed to new comments.

Thread display: Collapse - | Expand +

All Comments

Related Discussions

Related Forums