The HTML Parser is an implementation of the HTML5 parsing algorithm in Java for applications. The parser is designed to work as a drop-in replacement for the XML parser in applications that already support XHTML 1.x content with an XML parser and use SAX, DOM or XOM to interface with the parser.

CategoriesHTML Parsers
DateSep 26, 2008
Filespom (4 KB)  jar (257 KB)  View All
Ranking#6148 in MvnRepository (See Top Artifacts)
#8 in HTML Parsers
Used By67 artifacts

Note: There is a new version for this artifact

New Version1.4

Compile Dependencies (3)

Category/License Group / ArtifactVersionUpdates
I18N Lib
ICU » icu4j (optional) 3.874.1

MPL 1.1
net.sourceforge.jchardet » jchardet (optional) 1.0

LGPL 2.1
xom » xom (optional)

Test Dependencies (1)

Category/License Group / ArtifactVersionUpdates

LGPL 2.1
com.sdicons.jsontools » jsontools-core 1.41.7


NameEmailDev IdRolesOrganization
Henri Sivonenhsivonen<at>iki.fihsivonen