|
⇤ ← Revision 1 as of 2007-08-15 10:29:50
Size: 222
Comment:
|
← Revision 2 as of 2009-09-20 21:47:55 ⇥
Size: 222
Comment: converted to 1.6 markup
|
| No differences found! | |
Extracting text from a PDF document
In the event that you are going to index the content of a PDF, a good place to look first is a Java library called PDFBox http://www.pdfbox.org/userguide/text_extraction.html