List of 3rd party parser plugins
These are 3rd party parser plugins which cannot be included due to licensing incompatibiliy. To install a plugin, download it according to instructions below and drop the jar(s) on your classpath. Tika will auto detect the plugin.
Microsoft TNEF / LZFU
This is a MS compression format used for compressed RTF, email attachments (like WINMAIL.DAT) and more. The parser is available from a github fork of the JTNEF project.
(Tika 0.10 includes a TNEF parser as standard now, which may be sufficient)
Install instructions:
git clone http://github.com/jukka/jtnef.git jtnef
- cd jtnef
- mvn package
- cp target/jtnef-*.jar $SOMEWHERE_ON_CLASS_PATH
Ogg Vorbis and FLAC
This parser extracts metadata from Ogg Vorbis and FLAC audio files.
While the library and parser are all available under the Apache License, the parser isn't included in Tika yet as there has yet to be a release of the library.
Installation instructions:
git clone http://github.com/Gagravarr/VorbisJava
cd VorbisJava
- mvn package
- cp core/target/vorbis-java-core*SNAPSHOT.jar $SOMEWHERE_ON_CLASS_PATH
- cp tika/target/vorbis-java-tika*SNAPSHOT.jar $SOMEWHERE_ON_CLASS_PATH
Your plugin
<Your description here>
Install instructions:
<Your instructions here>