Content extraction simplified! Retrieve text, data and metadata from binary documents using Tika and similar toolkits

LicenseApache 2.0
HomePage https://opensextant.github.io/XText
DateJul 24, 2019
Filespom (19 KB)  jar (110 KB)  View All
RepositoriesCentral
Ranking#676807 in MvnRepository (See Top Artifacts)
VulnerabilitiesVulnerabilities from dependencies:
CVE-2024-47554
CVE-2024-25710
CVE-2023-6378
View 17 more ...

Note: There is a new version for this artifact

New Version3.7.1


Compile Dependencies (24)

Category/License Group / ArtifactVersionUpdates
Logging
EPL 1.0LGPL 2.1
ch.qos.logback » logback-classic1 vulnerability 1.2.31.5.12
I18N Lib
com.ibm.icu » icu4j 63.176.1

Apache 2.0
com.pff » java-libpst 0.8.10.9.3
Mail Client
EDL 1.0EPL 2.0GPL
com.sun.mail » javax.mail 1.5.12.0.1
Base64
Apache 2.0
commons-codec » commons-codec 1.121.17.1
I/O
Apache 2.0
commons-io » commons-io2 vulnerabilities 2.62.18.0

Apache 2.0
de.l3s.boilerpipe » boilerpipe 1.1.0
CLI Parser
gnu.getopt » java-getopt 1.0.13
Java Spec
EDL 1.0
javax.activation » activation 1.12.1.3
Date/Time
Apache 2.0
joda-time » joda-time 2.10.12.13.0
HTML Parser
ApacheEPL 1.0LGPL
net.htmlparser.jericho » jericho-html 3.4
Core Utils
Apache 2.0
org.apache.commons » commons-lang3 3.93.17.0
String Utils
Apache 2.0
org.apache.commons » commons-text1 vulnerability 1.71.12.0
Compression
Apache 2.0
org.apache.commons » commons-compress6 vulnerabilities 1.181.27.1
HTTP Clients
Apache 2.0
org.apache.httpcomponents » httpclient1 vulnerability 4.5.55.4.1
HTTP Clients
Apache 2.0
org.apache.httpcomponents » httpcore 4.4.95.3.1
HTTP Clients
Apache 2.0
org.apache.httpcomponents » httpmime 4.5.55.4.1
PDF Lib
Apache 2.0
org.apache.pdfbox » pdfbox4 vulnerabilities 2.0.153.0.3

Apache 2.0
org.apache.tika » tika-core3 vulnerabilities 1.213.0.0

Apache 2.0
org.apache.tika » tika-parsers1 vulnerability 1.213.0.0

BSD 2-clause
org.jodd » jodd-json 5.0.106.0.3

Apache 2.0
org.opensextant » opensextant-xponents-core 3.23.7.3
Logging
MIT
org.slf4j » slf4j-api 1.7.252.0.16
XML Processing
ApacheW3C
xml-apis » xml-apis 1.4.012.0.2

Runtime Dependencies (4)

Test Dependencies (1)

Category/License Group / ArtifactVersionUpdates
Testing
EPL 2.0
junit » junit1 vulnerability 4.125.11.3

Managed Dependencies (3)

Developers

NameEmailDev IdRolesOrganization
Marc Ubaldinoubaldino<at>mitre.orgLeadMITRE