Lucene is a gem in the open-source world‹-a highly scalable, fast search engine. It delivers performance and is disarmingly easy to use. Lucene in Action is the authoritative guide to Lucene. It describes how to index your data, including types you definitely need to know such as MS Word, PDF, HTML, and XML. It introduces you to searching, sorting, filtering, and highlighting search results.Lucene powers search in surprising places‹-in discussion groups at Fortune 100 companies, in commercial issue trackers, in email search from Microsoft, in the Nutch web search engine (that scales to billions of pages). It is used by diverse companies including Akamai, Overture, Technorati, HotJobs, Epiphany, FedEx, Mayo Clinic, MIT, New Scientist Magazine, and many others. Adding search to your application can be easy. With many reusable examples and good advice on best practices, Lucene in Action shows you how.What's Inside- How to integrate Lucene into your applications- Ready-to-use framework for rich document handling- Case studies including Nutch, TheServerSide, jGuru, etc.- Lucene ports to Perl, Python, C#/.Net, and C++- Sorting, filtering, term vectors, multiple, and remote index searching- The new SpanQuery family, extending query parser, hit collecting- Performance testing and tuning- Lucene add-ons (hit highlighting, synonym lookup, and others)
This book is #1 on my list of books I'd recommend to other programmers. I don't know how else to elaborate on how important I think the idaes in this book are.
Lucene is the masterpiece of open-source search. It's at the heart of every enterprise-level search engine such as ElasticSearch. There was also a second edition since this one. But at this time, this text did the job pretty well, covering all major aspects of Lucene. It' not an outstanding book either, but a classic "In Action" : the explanations are clear and most of the features are illustrated by examples, especially the case study of the last chapter. Overall, very decent. Ma note de lecture en Français ici
The book is now (2009) outdated to the point of being useless. But it was a disappointed even before, there's precious little of useful information on how to use Lucene beyond the freely available chapters (that are, to be fair, quite useful, Lucene's API documentation being what it is). The 2nd edition coming out this year may be better.
Working in Search space, Lucene has been one of the open sourced fundamental search concepts we read in Information Retrieval space. More fancier engines embellish this engine, but if you want to go to the basics, this is a pretty solid book to sit and read.
Lucene is search Framework - the cool idea. Eric has explained the framework and also explained the surrounding tools required to develop the application using Lucene or developing application in search domain very nicely.
This is one of my favorite book - i come back to read this book again and again.
I think the book would serve as a good start for those who want to learn Lucene from scratch. It contains clear examples and intuitive explanations for major definitions like k-means, fuzzy k-means. For those who have real interest in mining text, I suggest "Taming Text" as a good companion for the book
Very good, detailed book about Lucene, providing different levels of details - from quick start to detailed discussion of Lucene's internals, contrib modules and success stories. Highly recommended for all, who are planning to use Lucene