Extracting text from a PDF document
In the event that you are going to index the content of a PDF, a good place to look first is a Java library called PDFBox http://www.pdfbox.org/userguide/text_extraction.html
In the event that you are going to index the content of a PDF, a good place to look first is a Java library called PDFBox http://www.pdfbox.org/userguide/text_extraction.html
PDF (last edited 2009-09-20 21:47:55 by localhost)