HTML Parsers
Sort by:Popular

Java HTML/XML parsers suite
Last Release on Jun 8, 2022
A fork of http://hg.mozilla.org/projects/htmlparser/ used by the Nu Html Checker.
Last Release on Jun 30, 2020
NekoHtml is the Html parser used by HtmlUnit.
Last Release on Oct 30, 2025
Powerful, fast and easy to use HTML and XML parser for Java
Last Release on Jul 30, 2023
Provides HTML parsing functionality
Last Release on Feb 8, 2021
Apache Tika HTML Parser Module
Last Release on Sep 11, 2025
HTML Lexer is the low level lexical analyzer.
Last Release on Apr 24, 2011
A HTML parser for Clojure.
Last Release on May 1, 2012
Jericho HTML Parser is a simple but powerful java library allowing analysis and manipulation of parts of an HTML document, including some common server-side tags, while reproducing verbatim any unrecognised or invalid HTML. It also provides high-level HTML form manipulation functions.
Last Release on Oct 6, 2006

flexmark-java extension to convert HTML to Markdown
Last Release on Jan 23, 2020