The pdf2txt-scienceparse subproject implements an interface to the allenai/science-parser converters.

License: Apache 2.0
Tags: pdf
Organization: Computational Language Understanding (CLU) Lab
HomePage: https://github.com/clulab/pdf2txt
Date: Mar 31, 2022
Files: pom (8 KB), jar (18 KB)
Repositories: Central, AutonomX Rel, Clulab
Ranking: #300044 in MvnRepository
Used By: 1 artifact
Scala Target: Scala 2.11
Vulnerabilities from dependencies: CVE-2024-47554, CVE-2024-30172, CVE-2024-30171, and 30 more

Note: There is a new version for this artifact

New Version: 1.1.5


Compile Dependencies (34)

| Category | License | Group » Artifact | Vulnerabilities | Version | Updates |
|---|---|---|---|---|---|
| Logging | EPL 1.0, LGPL 2.1 | ch.qos.logback » logback-classic | 1 | 1.2.3 | 1.5.16 |
| Logging | EPL 1.0, LGPL 2.1 | ch.qos.logback » logback-core | 4 | 1.2.3 | 1.5.16 |
| Logging | EPL 1.0, LGPL 2.1 | ch.qos.logback » logback-classic | 2 | 1.1.7 | 1.5.16 |
| S3 Client | Apache 2.0 | com.amazonaws » aws-java-sdk-s3 | 1 | 1.11.213 | 2.30.5 |
| S3 Client | Apache 2.0 | com.amazonaws » aws-java-sdk-s3 | 1 | 1.10.29 | 2.30.5 |
| CLI Parser | MIT | com.github.scopt » scopt_2.11 | - | 3.7.1 | 4.1.0 |
| Collections | EDL 1.0, EPL 1.0 | com.goldmansachs » gs-collections | - | 6.1.0 | 11.1.0 |
| Core Utils | Apache 2.0 | com.google.guava » guava | 3 | 18.0 | 33.4.0-jre |
| Config | Apache 2.0 | com.typesafe » config | - | 1.2.1 | 1.4.3 |
| Config | Apache 2.0 | com.typesafe » config | - | 1.3.0 | 1.4.3 |
| I/O | Apache 2.0 | commons-io » commons-io | 2 | 2.4 | 2.18.0 |
| Off-Heap Lib | Apache 2.0 | de.ruedigermoeller » fst | - | 2.47 | 3.0.4-jdk17 |
| JSON Lib | BUSL 1.1 | io.spray » spray-json_2.11 | - | 1.3.5 | 10.7.0 |
| Date/Time | Apache 2.0 | joda-time » joda-time | - | 2.3 | 2.13.0 |
| Logging | Apache 2.0 | log4j » log4j | 5 | 1.2.17 | 2.24.3 |
| CSV | Apache 2.0 | net.sf.opencsv » opencsv | - | 2.1 | 5.10 |
| Core Utils | Apache 2.0 | org.apache.commons » commons-lang3 | - | 3.4 | 3.17.0 |
| Core Utils | Apache 2.0 | org.apache.commons » commons-lang3 | - | 3.9 | 3.17.0 |
| Math Lib | Apache 2.0 | org.apache.commons » commons-math3 | - | 3.6.1 | 4.0-beta1 |
| Natural Lang Proc | Apache 2.0 | org.apache.opennlp » opennlp-tools | 1 | 1.7.2 | 2.5.3 |
| PDF Lib | Apache 2.0 | org.apache.pdfbox » pdfbox | 6 | 2.0.9 | 3.0.4 |
| Font Lib | Apache 2.0 | org.apache.pdfbox » fontbox | - | 2.0.9 | 3.0.4 |
| PDF Lib | Apache 2.0 | org.apache.pdfbox » pdfbox | 6 | 2.0.1 | 3.0.4 |
| Font Lib | Apache 2.0 | org.apache.pdfbox » fontbox | - | 2.0.1 | 3.0.4 |
| - | Apache 2.0 | org.apache.thrift » libfb303 | - | 0.9.3 | - |
| Encryption Lib | BouncyCastle | org.bouncycastle » bcprov-jdk15on | 7 | 1.54 | 1.80 |
| - | BouncyCastle | org.bouncycastle » bcmail-jdk15on | - | 1.54 | 1.80 |
| Encryption Lib | BouncyCastle | org.bouncycastle » bcpkix-jdk15on | - | 1.54 | 1.80 |
| - | Apache 2.0 | org.clulab » pdf2txt-common_2.11 | - | 1.0.0 | 1.1.5 |
| HTML Parser | MIT | org.jsoup » jsoup | 3 | 1.8.1 | 1.18.3 |
| CodeGen | MIT | org.projectlombok » lombok | - | 1.16.20 | 1.18.36 |
| JVM Languages | Apache 2.0 | org.scala-lang » scala-library | - | 2.11.12 | 3.6.3 |
| - | Apache 2.0 | org.scala-lang.modules » scala-java8-compat_2.11 | - | 0.8.0 | 1.0.2 |
| HTTP Clients | Apache 2.0 | org.scalaj » scalaj-http_2.11 | - | 2.3.0 | 2.4.2 |

Licenses

| License | URL |
|---|---|
| Apache License, Version 2.0 | http://www.apache.org/licenses/LICENSE-2.0.html |

Developers

| Name | Email | Dev Id | Roles | Organization |
|---|---|---|---|---|
| Mihai Surdeanu | msurdeanu<at>email.arizona.edu | mihai.surdeanu | - | - |