Indexed Artifacts (28.1M)

Popular Categories

Group: Googlecode JWEB1T

Sort: popular | newest

1. JWeb1T6 usages

com.googlecode.jweb1t » com.googlecode.jweb1tApache

jWeb1T is an open source Java tool for efficiently searching n-gram data in the Web 1T 5-gram corpus format. It is based on a binary search algorithm that finds the n-grams and returns their frequency counts in logarithmic time. As the corpus is stored in many files a simple index is used to retrieve the files containing the n-grams.
Last Release on Oct 3, 2017