2 results
-
derrickburns/generalized-kmeans-clustering 1.2.2
Production-ready K-Means clustering for Apache Spark with pluggable Bregman divergences (KL, Itakura-Saito, L1, etc). 6 algorithms, 740 tests, cross-version persistence. Drop-in replacement for MLlib with mathematically correct distance functions for probability distributions, spectral data, and count data.
Scala versions: 2.10 -
vickumar1981/stringdistance 1.2.7
A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more..
Scala versions: 2.13 2.12 2.11dice-coefficient soundex-algorithm fuzzy-matching jaccard jaro-distance cosine-similarity-scores string-similarity jaro-winkler hamming-distance jaro-winkler-distance jaro soundex sorensen-dice-distance hacktoberfest levenshtein cosine-similarity jaccard-similarity ngram longest-common-subsequence levenshtein-distance