Group: Debatty

Sort: popular | newest
Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity...
Last Release on May 12, 2020

2. Java LSH11 usages

info.debatty » java-lshMIT

A Java implementation of Locality Sensitive Hashing (LSH)
Last Release on Jun 24, 2019
Group Debatty Mark

4. Java SpamSum4 usages

info.debatty » java-spamsumMIT

A Java implementation of SpamSum / SSDeep / Context Triggered Piecewise Hashing
Last Release on Aug 11, 2016

5. Java Graphs4 usages

info.debatty » java-graphsMIT

Algorithms that build k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent,...
Last Release on Nov 23, 2017
Implementation of aggregation operators like WA, OWA and WOWA
Last Release on Oct 29, 2020

7. Java Datasets3 usages

info.debatty » java-datasetsMIT

Java library for parsing various datasets: ENRON email dataset, Wikipedia web pages, DBLP papers, Reuters news ...
Last Release on Sep 7, 2017
Spark implementation of various k-medoids clustering algorithms.
Last Release on Apr 17, 2018

9. Spark KNN Graphs

info.debatty » spark-knn-graphsMIT

Spark algorithms for building k-nn graphs
Last Release on Sep 16, 2017
Maven plugin to publish on sparkpackages.org
Last Release on Jun 3, 2017

11. Spark Nndescent

info.debatty » spark-nndescentMIT

Spark implementation of NN-Descent algorithm for building k-nn graphs, based on the paper "Efficient K-Nearest Neighbor Graph Construction for Generic Similarity Measures" by Dong et al.
Last Release on May 18, 2015
Maven plugin to publish on sparkpackages.org
Last Release on Dec 3, 2015

13. Jinu

info.debatty » jinuMIT

Easy evaluation and comparison of Java algorithms.
Last Release on Dec 4, 2017