HtmlCleaner is an HTML parser written in Java. It transforms dirty HTML to well-formed XML following the same rules that most web-browsers use.

LicenseBSD
CategoriesHTML Parsers
Tagsbundleparserdomhtmlosgi
HomePage http://htmlcleaner.sourceforge.net/
Ranking#3403 in MvnRepository (See Top Artifacts)
#7 in HTML Parsers
Used By147 artifacts