jsoup is a Java library that simplifies working with real-world HTML and XML. It offers an easy-to-use API for URL fetching, data parsing, extraction, and manipulation using DOM API methods, CSS, and xpath selectors. jsoup implements the WHATWG HTML5 specification, and parses HTML to the same DOM as modern browsers.

LicenseMIT
CategoriesHTML Parsers
Tagsjsouphtmlparserdom
HomePage https://jsoup.org/
Ranking#134 in MvnRepository (See Top Artifacts)
#1 in HTML Parsers
Used By4,306 artifacts