Differences between revisions 22 and 23
Revision 22 as of 2007-06-24 02:40:53
Size: 8986
Editor: Peter W
Comment:
Revision 23 as of 2009-09-20 21:47:38
Size: 9018
Editor: localhost
Comment: converted to 1.6 markup
Line 5: Line 5:
 ''URL:'' [http://www.onjava.com/pub/a/onjava/2007/05/24/using-the-lucene-query-parser-without-lucene.html]  ''URL:'' [[http://www.onjava.com/pub/a/onjava/2007/05/24/using-the-lucene-query-parser-without-lucene.html]]
Line 12: Line 12:
 ''URL:'' [http://sourceforge.net/projects/lius]  ''URL:'' [[http://sourceforge.net/projects/lius]]
Line 27: Line 27:
 ''URL:'' [http://www.amazon.co.jp/exec/obidos/ASIN/4774127809]  ''URL:'' [[http://www.amazon.co.jp/exec/obidos/ASIN/4774127809]]
Line 46: Line 46:
 ''URL:'' [http://www.searchblox.com]  ''URL:'' [[http://www.searchblox.com]]
Line 62: Line 62:
 ''URL:'' [http://www.bibl.ulaval.ca/lius/index.en.html]  ''URL:'' [[http://www.bibl.ulaval.ca/lius/index.en.html]]
Line 76: Line 76:
 ''URL:'' [http://www.bibl.ulaval.ca/lius/index.en.html]  ''URL:'' [[http://www.bibl.ulaval.ca/lius/index.en.html]]
Line 91: Line 91:
 ''URL:'' [http://codecrawler.sourceforge.net]  ''URL:'' [[http://codecrawler.sourceforge.net]]
Line 98: Line 98:
 ''URL:'' [http://www.bibl.ulaval.ca/lius/index.en.html]  ''URL:'' [[http://www.bibl.ulaval.ca/lius/index.en.html]]
Line 108: Line 108:
 ''URL:'' [http://www.searchblox.com]  ''URL:'' [[http://www.searchblox.com]]
Line 126: Line 126:
 ''URL:'' [http://ppinew.mnis.com/jdbcdirectory]  ''URL:'' [[http://ppinew.mnis.com/jdbcdirectory]]
Line 140: Line 140:
 ''URL:'' [http://www.searchblox.com]  ''URL:'' [[http://www.searchblox.com]]
Line 151: Line 151:
 ''URL:'' [http://www.textmining.org/]  ''URL:'' [[http://www.textmining.org/]]
Line 160: Line 160:
 ''URL:'' [http://limo.sourceforge.net/]  ''URL:'' [[http://limo.sourceforge.net/]]
Line 176: Line 176:
 ''URL:'' [http://www.getopt.org/luke]  ''URL:'' [[http://www.getopt.org/luke]]
Line 196: Line 196:
 ''URL:'' [http://jakarta.apache.org/lucene]  ''URL:'' [[http://jakarta.apache.org/lucene]]
Line 198: Line 198:
 [http://cvs.apache.org/viewcvs.cgi/*checkout*/jakarta-lucene/CHANGES.txt?rev=1.65]  [[http://cvs.apache.org/viewcvs.cgi/*checkout*/jakarta-lucene/CHANGES.txt?rev=1.65]]

"Using the Lucene Query Parser Without Lucene"

Discusses a unique method of using QueryParser.


Lius Version 1.0-RC1


Lius 1.0-RC1 is now available.

This new version:

  • Lucene 2.0
  • Major modifications in the API
  • Improved performance while indexing and searching
  • Bug fixes


"Introduction to Apache Lucene" is published

The first Japanese "Apache Lucene" book.


Lucene sub-project "Lucy" proposed

  • Date: 13 May 2006

"A shared core engine for Lucene, written in C, with Perl and Ruby bindings."

Details at LucyProposal.


SearchBlox J2EE Content Search Software Version 3.1

SearchBlox Software has released Version 3.1 of its J2EE Content Search Software. SearchBlox delivers out-of-the-box search functionality for quick and easy integration with websites, applications, intranets and portals. SearchBlox uses the Lucene Search API and incorporates integrated HTTP/HTTPS, File System and Feed (RSS/Atom) crawlers, support for various document formats including HTML, Word, PDF, PowerPoint and Excel, support for indexing and searching content in 30 languages and customizable search results, all controlled from a browser-based Admin Console. SearchBlox is available as a Web Archive (WAR) and has been tested with all major Java Application Servers. It is also available as a standalone application for Windows and Mac OS X.

Main features in this release:

  • REST API (Free and Enterprise Editions Only) for indexing and deleting custom content. The built-in browser-based SearchBlox Development Environment provides developers with an easy-to-use interface to develop and test using the REST API.

  • Spelling Suggestions based on the indexed content in the collection
  • Support for selective indexing of content within HTML documents using <noindex> </noindex> or <!--stopindex--> <!--startindex--> tags

  • Support for JDK 1.5.


Lius Version 0.4 LGPL

LIUS is a Java indexing framework based on the Jakarta Lucene project. It adds indexing support to Lucene for many file formats, including MS Word, MS Excel, MS PowerPoint, RTF, PDF, XML, HTML, TXT, OpenOffice, MP3, vCard, the LaTeX suite and JavaBeans. It also provides search and update capabilities. Search results are returned as a Java list or as a JDOM or DOM XML document.

This new version:

  • OpenOffice 2 indexing
  • vCard and LaTeX indexing
  • UTF-8 accent remover analyzer
  • Bug fixes


Lius (Lucene Index Update and Search) Version 0.3.3

LIUS is a Java indexing framework based on the Jakarta Lucene project. It adds indexing support to Lucene for many file formats, including MS Word, MS Excel, MS PowerPoint, RTF, PDF, XML, HTML, TXT, the OpenOffice suite and JavaBeans. It also provides search and update capabilities. Search results are returned as a Java list or as a JDOM or DOM XML document.

This new version:

  • Uses log4j as the logging system
  • Highlights search results
  • MP3 indexing
  • Uses caching for multiple configuration files
  • Bug fixes


CodeCrawler 2005

CodeCrawler is a smart, web-based search engine specifically built for use by developers for searching source code. It combines ease of use, superb performance, and intelligent search capabilities in order to increase developer productivity and reduce source code learning time. It uses Lucene technology.


Lius (Lucene Index Update and Search) Version 0.3.1

LIUS is a Java indexing framework based on the Jakarta Lucene project. It adds indexing support to Lucene for many file formats, including MS Word, MS Excel, MS PowerPoint, RTF, PDF, XML, HTML, TXT, the OpenOffice suite and JavaBeans. It also provides search and update capabilities. Search results are returned as a Java list or as a JDOM or DOM XML document. The new version allows boosting documents and fields through the configuration files and supports new languages.


SearchBlox J2EE Search Component Version 2.0

SearchBlox is a J2EE Search Component that delivers out-of-the-box search functionality for fast and easy implementation in your websites, applications, intranets and portals. SearchBlox uses the Lucene Search API and incorporates integrated HTTP/HTTPS and File System crawlers, support for various document formats including HTML, Word, PDF, PowerPoint and Excel, support for indexing and searching content in 18 languages and customizable search results, all controlled from a browser-based Admin Console. SearchBlox is available as a Web Archive (WAR) and is deployable on any Servlet 2.3/JSP 1.2 compliant server.

Main features in this release:

  • Advanced Search: search by file format, language, keyword occurrence and modified date
  • Keyword-in-Context Display: search results are displayed with areas of content where the keyword occurs
  • Upgrade to Lucene 1.4.2
  • Performance and stability improvements
  • Bug fixes


JDBCDirectory version 0.05

  • Date: 7 May 2004 URL: http://ppinew.mnis.com/jdbcdirectory

    • Tested with MySQL and open source drivers.
    Some issues that I just thought of that aren't mentioned...
    • Pooling prepared statements on the connection is a must for good performance under the current code (see test code).
    • The first search is always really slow as everything initializes and the cache fills ;) so don't let that discourage you.


SearchBlox J2EE Search Component Version 1.2

  • Date: 16 February 2004 URL: http://www.searchblox.com

    SearchBlox is a J2EE Search Component that delivers out-of-the-box search functionality for quick integration with your websites, applications, intranets and portals. SearchBlox uses the Lucene Search API and incorporates integrated HTTP and File System crawlers, support for various document formats, support for indexing and searching content in 17 languages and customizable search results, all controlled from a browser-based Admin Console.

    Comment from a user: I installed SearchBlox on my Tomcat 4. It runs and produces nice results. Still, after a few hours it crashes the server. My opinion, as of 25 August 2004, is that SearchBlox is interesting and looks promising but still lacks stability, apparently a matter of maturity. Product developers: I encourage you to improve this product, hold on.


Word Document text extractor released


LIMO new release (v0.3)

  • Date: 22 January 2004 URL: http://limo.sourceforge.net/

    There's a new release of LIMO available! This new version:

    • includes lucene-1.3-final.jar
    • fixes a bug with index loading
    • detects when the index changes and auto-refreshes the information (as proposed by Jakob Flierl)
    • uses CSS for easier customisation (as proposed by E Hatcher)
    • escapes HTML code in the value of the fields (as proposed by E Hatcher)


Luke, Lucene index browser and diagnostic tool

  • Date: 17 January 2004 URL: http://www.getopt.org/luke

    Luke is a Lucene index browser and diagnostic tool, available under the Apache License. Please see the link above for more details, binaries, sources and the Java WebStart version.

    Changes in v. 0.45:

    • Add more details to the Overview panel.
    • Add support for undeleting all deleted documents.
    • Add Boost column to Document view.
    • Use nicer formatting for numbers in the Explain window.
    • Fix for not updating the parsed query view when pressing Search.
    • Fix the JNLP file to require J2SE 1.3+.
    • By popular demand, add a single self-contained JAR to the binary distribution.
    • Minor restructuring to increase reuse.


Lucene 1.3 Final Released

LatestNews (last edited 2009-09-20 21:47:38 by localhost)