jWeb1T is an open-source Java tool for efficiently searching n-gram data in the Web 1T 5-gram corpus format. It uses binary search to find an n-gram and return its frequency count in logarithmic time. Because the corpus is split across many files, a simple index is used to locate the file that contains a given n-gram.
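The lookup described above can be illustrated with a minimal sketch. This is not jWeb1T's actual API: the class `NgramLookupSketch` and its methods `addFile` and `frequency` are hypothetical names. It assumes each line has the form `n-gram<TAB>count`, that lines are sorted in `String.compareTo` order, and it keeps every "file" in memory as a list of lines instead of reading from disk.

```java
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Illustrative sketch only; names and structure are assumptions, not jWeb1T's API.
class NgramLookupSketch {

    // Hypothetical index: first n-gram of each file -> that file's sorted lines.
    private final TreeMap<String, List<String>> index = new TreeMap<>();

    // Registers one corpus file, given as its sorted "ngram<TAB>count" lines.
    public void addFile(List<String> sortedLines) {
        if (!sortedLines.isEmpty()) {
            index.put(ngramOf(sortedLines.get(0)), sortedLines);
        }
    }

    // Returns the n-gram part of a line (everything before the tab).
    private static String ngramOf(String line) {
        int tab = line.indexOf('\t');
        return tab < 0 ? line : line.substring(0, tab);
    }

    // Returns the frequency count of the n-gram, or 0 if it is not present.
    public long frequency(String ngram) {
        // Step 1: the index picks the only file whose range can contain the n-gram,
        // i.e. the file with the greatest first n-gram that is <= the query.
        Map.Entry<String, List<String>> entry = index.floorEntry(ngram);
        if (entry == null) {
            return 0L; // query sorts before the first n-gram of every file
        }
        List<String> lines = entry.getValue();

        // Step 2: binary search inside that file, O(log n) comparisons.
        int lo = 0;
        int hi = lines.size() - 1;
        while (lo <= hi) {
            int mid = (lo + hi) >>> 1;
            String line = lines.get(mid);
            int cmp = ngramOf(line).compareTo(ngram);
            if (cmp == 0) {
                // Assumes a well-formed line with a numeric count after the tab.
                return Long.parseLong(line.substring(line.indexOf('\t') + 1).trim());
            } else if (cmp < 0) {
                lo = mid + 1;
            } else {
                hi = mid - 1;
            }
        }
        return 0L;
    }
}
```

In this sketch a caller would register each file once with `addFile` and then call `frequency("the cat")` for each query. A disk-based variant would presumably store file names in the index and seek within the file rather than holding all lines in memory.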
License | Apache 2.0 |
---|---|
HomePage | http://code.google.com/p/jweb1t/ |
Date | (Feb 07, 2012) |
Files | pom (13 KB), jar (22 KB) |
Repositories | Central, Geomajas, Sonatype |
Ranking | #49326 in MvnRepository |
Used By | 6 artifacts |
Vulnerabilities | Vulnerabilities from dependencies: CVE-2021-29425 CVE-2020-15250 |
Compile Dependencies (4)
Category/License | Group / Artifact | Version | Updates | |
---|---|---|---|---|
I/O Apache 2.0 | commons-io » commons-io (1 vulnerability) | 2.1 | 2.11.0 | |
Core Utils Apache 2.0 | commons-lang » commons-lang | 2.6 | 3.12.0 | |
Logging Bridge Apache 2.0 | commons-logging » commons-logging-api | 1.1 | 1.2 | |
 | pcj » pcj | 1.2 | ✔ | |
Test Dependencies (2)
Category/License | Group / Artifact | Version | Updates | |
---|---|---|---|---|
Logging Apache 2.0 | commons-logging » commons-logging | 1.1.1 | 1.2 | |
Testing EPL 2.0 | junit » junit (1 vulnerability) | 4.10 | 5.8.2 | |
Licenses
License | URL |
---|---|
The Apache Software License, Version 2.0 | http://www.apache.org/licenses/LICENSE-2.0.txt |