Repository

Artifacts/Jars

Popular Tags

ajax analysis annotations ant apache api archetype aspect asynchronously beans binding bpm build buildsystem bytecode cache cms codecoverage codehaus collections concurrency container database directory distributed doc eclipse ejb esb format framework graph graphics hadoop hibernate html http ide imap io jbi jdbc jdo jini jms jmx jndi jsf jsp language logging mail maven metadata microsoft mock net osgi parser pdf persistence plugin pool portal portlet query regexp rmi rpc rss ruleengine scheduling scm scripting security server servlet soa soap socket spring ssh svg swt system taglib template testing transaction ui web webdav webframework webserver webservice workflow xml xquery xslt

[See All Tags]
home » org.apache.nutch » nutch » 2.0-dev

Apache Nutch

Nutch is open source web-search software. It builds on Lucene and Solr, adding web-specifics, such as a crawler, a link-graph database, parsers for HTML and other document formats, etc.



Artifact Download (JAR) (350 KB)
POM File View
HomePage http://maven.apache.org
Organization
Issue Tracker

This artifact depends on ...

Group Artifact Version
com.healthmarketscience.sqlbuilder sqlbuilder 2.0.6
com.ibm.icu icu4j 4.0.1
commons-codec commons-codec 1.3
commons-collections commons-collections 3.1
commons-httpclient commons-httpclient 3.1
commons-lang commons-lang 2.4
junit junit 3.8.1
log4j log4j 1.2.15
org.apache.hadoop avro 1.3.2
org.apache.hadoop hadoop-core 0.20.2
org.apache.hadoop hadoop-test 0.20.2
org.apache.lucene lucene-core 3.0.2
org.apache.lucene lucene-misc 3.0.2
org.apache.solr solr-solrj 1.4.1
org.apache.tika tika-core 0.7
org.apache.tika tika-parsers 0.7
org.gora gora-core 0.1
org.gora gora-sql 0.1
org.hsqldb hsqldb 2.0.0
org.jdom jdom 1.1
org.mortbay.jetty jetty 6.1.22
org.mortbay.jetty jetty-client 6.1.22
org.mortbay.jetty jetty-util 6.1.22
org.slf4j slf4j-log4j12 1.5.11
oro oro 2.0.8
xerces xercesImpl 2.6.2
xerces xmlParserAPIs 2.6.2

Licenses

License URL
The Apache Software License, Version 2.0 http://www.apache.org/licenses/LICENSE-2.0.txt

Developers

Name Email Developer Id Roles Organization
Andrzej Bialecki ab<at>apache.org ab
Chris A. Mattmann mattmann<at>apache.org mattmann
Dennis Kubes kubes<at>apache.org kubes
Dogacan G��ney dogacan<at>apache.org dogacan
Julien Nioche jnioche<at>apache.org jnioche
Otis Gospodneti�� otis<at>apache.org otis
Sami Siren siren<at>apache.org siren

Source Control

Connection http://svn.apache.org/viewvc/nutch
Developer Connection
Tag HEAD
URL http://svn.apache.org/viewvc/nutch

Packages

org.apache.nutch.crawl
org.apache.nutch.fetcher
org.apache.nutch.html
org.apache.nutch.indexer
org.apache.nutch.indexer.solr
org.apache.nutch.metadata
org.apache.nutch.net
org.apache.nutch.net.protocols
org.apache.nutch.parse
org.apache.nutch.plugin
org.apache.nutch.protocol
org.apache.nutch.scoring
org.apache.nutch.storage
org.apache.nutch.tools
org.apache.nutch.tools.arc
org.apache.nutch.tools.proxy
org.apache.nutch.util
org.apache.nutch.util.domain