html
Sort by:Popular

jsoup is a Java library that simplifies working with real-world HTML and XML. It offers an easy-to-use API for URL fetching, data parsing, extraction, and manipulation using DOM API methods, CSS, and xpath selectors. jsoup implements the WHATWG HTML5 specification, and parses HTML to the same DOM as modern browsers.
Last Release on Aug 24, 2025
A headless browser intended for use in testing web-based applications.
Last Release on Jan 22, 2023

Relocated → org.htmlunit » htmlunit
An HTML parser and tag balancer.
Last Release on Apr 17, 2015
WebDriver compatible driver for HtmlUnit headless browser
Last Release on Oct 5, 2023
A kotlinx.html library provides DSL to build HTML to Writer/Appendable or DOM at JVM and browser (or other JavaScript engine) for better Kotlin programming for Web.
Last Release on Jan 15, 2025
TagSoup is a SAX-compliant parser written in Java that, instead of parsing well-formed or valid XML, parses HTML as it is found in the wild: poor, nasty and brutish, though quite often far from short. TagSoup is designed for people who have to process this stuff using some semblance of a rational application design. By providing a SAX interface, it allows standard XML tools to be applied to even the worst HTML. TagSoup also includes a command-line processor that reads HTML files and can generate either ...
Last Release on Aug 22, 2011
Neko HTML
Last Release on Mar 23, 2008

Selenium HtmlUnit Driver
Last Release on Feb 11, 2016

ScalaTags is a small XML/HTML construction library for Scala.
Last Release on Apr 15, 2024
JTidy is a Java port of HTML Tidy, a HTML syntax checker and pretty printer. Like its non-Java cousin, JTidy can be used as a tool for cleaning up malformed and faulty HTML. In addition, JTidy provides a DOM interface to the document that is being processed, which effectively makes you able to use JTidy as a DOM parser for real-world HTML.
Last Release on Jul 20, 2010