jsoup is a Java library that simplifies working with real-world HTML and XML. It offers an easy-to-use API for URL fetching, data parsing, extraction, and manipulation using DOM API methods, CSS, and xpath selectors. jsoup implements the WHATWG HTML5 specification, and parses HTML to the same DOM as modern browsers.
License | MIT |
---|---|
Categories | HTML Parsers |
Tags | jsouphtmlparserdom |
Organization | Jonathan Hedley |
HomePage | https://jsoup.org/ |
Date | Jun 11, 2017 |
Files | pom (7 KB) jar (347 KB) View All |
Repositories | CentralHortonworksXceptance |
Ranking | #133 in MvnRepository (See Top Artifacts) #1 in HTML Parsers |
Used By | 3,948 artifacts |
Vulnerabilities | Direct vulnerabilities: CVE-2022-36033 CVE-2021-37714 Vulnerabilities from dependencies: CVE-2022-25647 CVE-2020-15250 |
Compile Dependencies (0)
Category/License | Group / Artifact | Version | Updates |
---|
Test Dependencies (2)
Category/License | Group / Artifact | Version | Updates | |
---|---|---|---|---|
JSON Lib Apache 2.0 | com.google.code.gson » gson1 vulnerability | 2.7 | 2.11.0 | |
Testing EPL 2.0 | junit » junit1 vulnerability | 4.12 | 5.11.2 |
Licenses
License | URL |
---|---|
The MIT License | https://jsoup.org/license |
Developers
Name | Dev Id | Roles | Organization | |
---|---|---|---|---|
Jonathan Hedley | jonathan<at>hedley.net | jhy | Lead Developer |
Related Books
Jsoup: Questions and Answers (2015) by George Duckett | |
Instant jsoup How-to (2013) by Pete Houston |