The pdf2txt-scienceparse subproject implements an interface to the allenai/science-parser converters.

LicenseApache 2.0
Tagspdf
Organization Computational Language Understanding (CLU) Lab
HomePage https://github.com/clulab/pdf2txt
DateOct 30, 2023
Filespom (8 KB)  jar (26 KB)  View All
RepositoriesCentralClulab
Ranking#283667 in MvnRepository (See Top Artifacts)
Used By1 artifacts
Scala TargetScala 2.12 (View all targets)
VulnerabilitiesVulnerabilities from dependencies:
CVE-2024-47554
CVE-2024-30172
CVE-2024-30171
View 30 more ...


Compile Dependencies (35)

Category/License Group / ArtifactVersionUpdates
Logging
EPL 1.0LGPL 2.1
ch.qos.logback » logback-classic1 vulnerability 1.2.31.5.16
Logging
EPL 1.0LGPL 2.1
ch.qos.logback » logback-core4 vulnerabilities 1.2.31.5.16
Logging
EPL 1.0LGPL 2.1
ch.qos.logback » logback-classic2 vulnerabilities 1.1.71.5.16
S3 Client
Apache 2.0
com.amazonaws » aws-java-sdk-s31 vulnerability 1.11.2132.29.50
S3 Client
Apache 2.0
com.amazonaws » aws-java-sdk-s31 vulnerability 1.10.292.29.50
CLI Parser
MIT
com.github.scopt » scopt_2.12 3.7.14.1.0
Collections
EDL 1.0EPL 1.0
com.goldmansachs » gs-collections 6.1.011.1.0
Core Utils
Apache 2.0
com.google.guava » guava3 vulnerabilities 18.033.4.0-jre
JSON Lib
MIT
com.lihaoyi » ujson_2.12 2.0.04.1.0
Config
Apache 2.0
com.typesafe » config 1.2.11.4.3
Config
Apache 2.0
com.typesafe » config 1.3.01.4.3
I/O
Apache 2.0
commons-io » commons-io2 vulnerabilities 2.42.18.0
Off-Heap Lib
Apache 2.0
de.ruedigermoeller » fst 2.473.0.4-jdk17
JSON Lib
BUSL 1.1
io.spray » spray-json_2.12 1.3.510.7.0
Date/Time
Apache 2.0
joda-time » joda-time 2.32.13.0
Logging
Apache 2.0
log4j » log4j5 vulnerabilities 1.2.172.24.3
CSV
Apache 2.0
net.sf.opencsv » opencsv 2.15.10
Core Utils
Apache 2.0
org.apache.commons » commons-lang3 3.43.17.0
Core Utils
Apache 2.0
org.apache.commons » commons-lang3 3.93.17.0
Math Lib
Apache 2.0
org.apache.commons » commons-math3 3.6.14.0-beta1
Natural Lang Proc
Apache 2.0
org.apache.opennlp » opennlp-tools1 vulnerability 1.7.22.5.3
PDF Lib
Apache 2.0
org.apache.pdfbox » pdfbox6 vulnerabilities 2.0.93.0.3
Font Lib
Apache 2.0
org.apache.pdfbox » fontbox 2.0.93.0.3
PDF Lib
Apache 2.0
org.apache.pdfbox » pdfbox6 vulnerabilities 2.0.13.0.3
Font Lib
Apache 2.0
org.apache.pdfbox » fontbox 2.0.13.0.3

Apache 2.0
org.apache.thrift » libfb303 0.9.3
Encryption Lib
BouncyCastle
org.bouncycastle » bcprov-jdk15on7 vulnerabilities 1.541.79

BouncyCastle
org.bouncycastle » bcmail-jdk15on 1.541.79
Encryption Lib
BouncyCastle
org.bouncycastle » bcpkix-jdk15on 1.541.79

Apache 2.0
org.clulab » pdf2txt-common_2.12 1.1.5
HTML Parser
MIT
org.jsoup » jsoup3 vulnerabilities 1.8.11.18.3
CodeGen
MIT
org.projectlombok » lombok 1.16.201.18.36
JVM Languages
Apache 2.0
org.scala-lang » scala-library 2.12.153.6.2

Apache 2.0
org.scala-lang.modules » scala-java8-compat_2.12 0.8.01.0.2
HTTP Clients
Apache 2.0
org.scalaj » scalaj-http_2.12 2.3.02.4.2

Licenses

LicenseURL
Apache License, Version 2.0 http://www.apache.org/licenses/LICENSE-2.0.html

Developers

NameEmailDev IdRolesOrganization
Mihai Surdeanumsurdeanu<at>email.arizona.edumihai.surdeanu