Indexed Artifacts (28.1M)

Popular Categories

jWeb1T is an open source Java tool for efficiently searching n-gram data in the Web 1T 5-gram corpus format. It is based on a binary search algorithm that finds the n-grams and returns their frequency counts in logarithmic time. As the corpus is stored in many files a simple index is used to retrieve the files containing the n-grams.

LicenseApache 2.0
Used By6 artifacts

1.4.0Central6Oct, 2017
1.3.0Central3Jun, 2013
1.2.1Central3Feb, 2012
1.2.0Central 0 Feb, 2012