Notes on Specific Parsers
PDFParser (Apache PDFBox)
Microsoft Office Parsers (Apache POI)
SQLite Parser
TesseractOCRParser