HTML Parsers
Sort by:Popular

TagSoup
Last Release on Nov 8, 2005

Comtoo Parser: Plugins: HTML Parser
Last Release on May 7, 2014
A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. First and foremost it aims to be a testing lib, but it can also be used to scrape websites in a convenient fashion.
Last Release on Jul 17, 2022
The development version of the Jericho HTML parser.
Last Release on Dec 17, 2013
A rewrapping of the validator.nu html parser for use in OSGi containers.
Last Release on Jun 29, 2012
JSilver is a pure-Java implementation of Clearsilver. This package includes only the streamhtmlparser part.
Last Release on Feb 28, 2014
HTML-parser provides a parser for HTML 5 that produces HTML 5 document object model. It aims to be a Java-implementation of http://www.w3.org/TR/html5/. It is for use in the server. It does not implement features that are relevant in the client, like event handling. It is for use from javascript, via Java's scripting library.
Last Release on Apr 29, 2024
Daisy HTML Cleaner
Last Release on Apr 4, 2006
Apache's commons-io library.
Last Release on Jul 19, 2013
Simple HTML parser/selector library built on Jsoup
Last Release on Jan 30, 2015