[X4U] Mac OCR
Randy B. Singer
randy at macattorney.com
Wed Jul 28 12:07:01 PDT 2004
Michael Winter said:
>If I understand what you're asking, it all depends on how the pdf is
>made. For most pdf's, the text is actually text (ASCII or equivalent
>characters) and can be selected. There are some pdf's however, where
>what looks like text is actually an image of some text (what you get if
>someone scans in a page of text). You can't select that "text" because
>its really an image, you need to use OCR software to turn it into
>actual text.
Exactly. In short, a PDF of a graphic of a text document has no text in
it to copy out of it.
Another reason why you may not be able to copy the text out of a PDF is
because it was created with security set to keep you from doing this.
Most folks don't know that you can use the text selection tool in the
free Adobe Acrobat Reader and copy and paste the text out of any PDF with
text in it that isn't security protected.
You can also convert PDF's to text using:
PDF2Office $99
http://www.recosoft.com/products/pdf2office/index.htm
PDFtotext (free) (does not retain formatting)
http://q41.de/downloads/pdftotext_en/
OmniPage Pro for OS X will convert any PDF, even a PDF of a graphic with
text in it, into a wordprocessing document, and it will retain all
formatting, and you can then annotate or alter the document in your word
processor.
Randy B. Singer
Co-Author of: The Macintosh Bible (4th, 5th and 6th editions)
Routine OS X Maintenance and Generic Troubleshooting
http://www.macattorney.com/ts.html
More information about the X4U
mailing list