Java library for parsing various datasets: ENRON email dataset, Wikipedia web pages, DBLP papers, Reuters news ...

License: MIT
Tags: dataset
HomePage: https://github.com/tdebatty/java-datasets
Date: Jan 12, 2016
Files: pom (5 KB), jar (142 KB)
Repositories: Central, Sonatype, Public
Ranking: #118707 in MvnRepository
Used By: 3 artifacts
Vulnerabilities from dependencies: CVE-2022-36033, CVE-2021-37714

Note: There is a new version for this artifact

New Version: 0.14


Compile Dependencies (1)

Category | License | Group / Artifact | Version | Updates
HTML Parser | MIT | org.jsoup » jsoup (2 vulnerabilities) | 1.8.3 | 1.18.3

Test Dependencies (1)

Category | License | Group / Artifact | Version | Updates
Testing | EPL 2.0 | junit » junit | 3.8.1 | 5.11.3

Developers

Name: Thibault Debatty
Email: thibault<at>debatty.info
Organization: debatty.info