This page has been superseded by the Solr Ref Guide section Uploading Data with Solr Cell using Apache Tika.

If you'd like to share information about how you use this feature, you can edit this page to add tips and tricks.

/* cwikimigrated */

a.k.a the SolrCell project!

Community Tips

Committer Notes

Upgrading Tika

  • Run ant clean clean-jars from project root or /solr
  • Update solr/CHANGES.txt and solr/NOTICE.txt as well as relevant LICENSE/NOTICE files in solr/licenses/
  • Edit solr/contrib/extraction/ivy.xml according to instructions inline the file
  • ant jar-checksums - to generate new sha1 files for the jars
  • ant test - see that Ivy downloads the new dependencies
  • Commit

Additional Resources

What's in a Name

Grant was writing the javadocs for the code and needed an entry for the <title> tag and wrote out "Solr Content Extraction Library", since the contrib directory is named "extraction". This then lead to an "acronym": Solr CEL which then gets mashed to: Solr Cell. Hence, the project name is "Solr Cell". It's also appropriate because a Solar Cell's job is to convert the raw energy of the Sun to electricity, and this contrib's module is responsible for converting the "raw" content of a document to something usable by Solr. http://en.wikipedia.org/wiki/Solar_cell

  • No labels