Apache Nutch
Nutch is open source web-search software. It builds on Lucene and Solr, adding web-specifics, such as a crawler, a link-graph database, parsers for HTML and other document formats, etc.
tags:Available versions
| Version | Type | Download |
|---|---|---|
| 2.0-dev | Binary (350 KB) | |
| 1.4 | release | Binary (530 KB) |
| 1.3 | release | Binary (523 KB) |
Stats