The HTML Parser is a Java implementation of the HTML5 parsing algorithm. It is designed to work as a drop-in replacement for the XML parser in applications that already process XHTML 1.x content with an XML parser and that use SAX, DOM, or XOM to interface with that parser.
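To illustrate what "drop-in replacement" can look like at the SAX level, here is a minimal sketch of application code written purely against the standard `org.xml.sax.XMLReader` interface. The class `LinkExtractor` and its methods are hypothetical names chosen for this example. The sketch runs with the JDK's built-in XML parser. The commented-out line shows where the HTML parser would be substituted, assuming the library's `nu.validator.htmlparser.sax.HtmlParser` class (which implements `XMLReader`) is on the classpath.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.XMLReader;
import org.xml.sax.helpers.DefaultHandler;

public class LinkExtractor {

    /** Collects the href value of every <a> element reported through SAX. */
    static class LinkHandler extends DefaultHandler {
        final List<String> links = new ArrayList<>();

        @Override
        public void startElement(String uri, String localName, String qName, Attributes atts) {
            if ("a".equals(localName)) {
                String href = atts.getValue("href");
                if (href != null) {
                    links.add(href);
                }
            }
        }
    }

    /** Depends only on the XMLReader interface, so the parser behind it can be swapped. */
    public static List<String> extractLinks(XMLReader reader, String markup) throws Exception {
        LinkHandler handler = new LinkHandler();
        reader.setContentHandler(handler);
        reader.parse(new InputSource(new StringReader(markup)));
        return handler.links;
    }

    /** Builds the JDK's namespace-aware XML parser, as an XHTML-only application might. */
    public static XMLReader newXmlReader() throws Exception {
        SAXParserFactory factory = SAXParserFactory.newInstance();
        factory.setNamespaceAware(true);
        return factory.newSAXParser().getXMLReader();
    }

    public static void main(String[] args) throws Exception {
        String xhtml = "<html xmlns=\"http://www.w3.org/1999/xhtml\"><body>"
                + "<p><a href=\"https://example.org/\">Example</a></p></body></html>";

        XMLReader reader = newXmlReader();
        // Assumed swap (requires the htmlparser jar, not exercised here):
        // XMLReader reader = new nu.validator.htmlparser.sax.HtmlParser();

        System.out.println(extractLinks(reader, xhtml));
    }
}
```

Because `extractLinks` never names a concrete parser class, switching parsers would in principle only change the line that constructs the reader. The same idea applies to the DOM and XOM interfaces mentioned above, though those are not shown in this sketch.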

License: BSD 2-clause, MIT
Categories: HTML Parsers
Date: Jun 07, 2012
Files: pom (9 KB), bundle (288 KB)
Ranking: #5432 in MvnRepository
Used By: 65 artifacts

Compile Dependencies (3)

Category/License  Group / Artifact                                Version  Updates
I18N Lib          ICU » icu4j (optional)
MPL 1.1           net.sourceforge.jchardet » jchardet (optional)  1.0
LGPL 2.1          xom » xom (optional)

Test Dependencies (1)

Category/License  Group / Artifact                        Version  Updates
LGPL 2.1          com.sdicons.jsontools » jsontools-core  1.4      1.7


Name           Email               Dev Id    Roles  Organization
Henri Sivonen  hsivonen<at>iki.fi  hsivonen