Web Search Software
Apache Nutch 1.1 Released
Apache Nutch 1.1 is out now.
Apache Nutch is an extensible framework built on Hadoop, Lucene/Solr and Tika, for building out large-scale, web-based search.
This version upgrades to Tika version 0.7, Hadoop version 0.20.2 and Lucene version 3.0.1. The RTF and MP3 parse plugins have been removed, and the Crawl class can now call either Solr or Lucene Indexer.
0 Comments