Group: Debatty
Sort:
popular
|
newest
1. Java String Similarity58 usages
info.debatty » java-string-similarityMIT
Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity...
Last Release on May 12, 2020
4. Java SpamSum4 usages
info.debatty » java-spamsumMIT
A Java implementation of SpamSum / SSDeep / Context Triggered Piecewise Hashing
Last Release on Aug 11, 2016
5. Java Graphs4 usages
info.debatty » java-graphsMIT
Algorithms that build k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent,...
Last Release on Nov 23, 2017
6. Java Aggregation4 usages
info.debatty » java-aggregationMIT
Implementation of aggregation operators like WA, OWA and WOWA
Last Release on Oct 29, 2020
7. Java Datasets3 usages
info.debatty » java-datasetsMIT
Java library for parsing various datasets: ENRON email dataset, Wikipedia web pages, DBLP papers, Reuters news ...
Last Release on Sep 7, 2017
8. Spark Kmedoids1 usages
info.debatty » spark-kmedoidsMIT
Spark implementation of various k-medoids clustering algorithms.
Last Release on Apr 17, 2018
9. Spark KNN Graphs
info.debatty » spark-knn-graphsMIT
Spark algorithms for building k-nn graphs
Last Release on Sep 16, 2017
10. Sparkpackages Maven Plugin
info.debatty » sparkpackages-maven-pluginMIT
Maven plugin to publish on sparkpackages.org
Last Release on Jun 3, 2017
11. Spark Nndescent
info.debatty » spark-nndescentMIT
Spark implementation of NN-Descent algorithm for building k-nn graphs, based on the paper "Efficient K-Nearest Neighbor Graph Construction for Generic Similarity Measures" by Dong et al.
Last Release on May 18, 2015
12. Sparkpackage Maven Plugin
info.debatty » sparkpackage-maven-pluginMIT
Maven plugin to publish on sparkpackages.org
Last Release on Dec 3, 2015