I'm working on a project that involves ~1200 pages of PDF documents. Embedded in the PDFs are scanned images of papers, articles, and lectures from the late 1800s.
I'm looking for Windows software that will run OCR on the images inside the PDF files and output either plain text or a Word document, so I can search the contents without first extracting the 1200 or so images into separate graphics files.
This is a personal project, so the budget is tiny. Free would be good. I don't have Acrobat to separate out the layers, so that's not an option.
Any help will be greatly appreciated.
OCR scan images embedded in a PDF