Spark library for generalized K-Means clustering. Supports general Bregman divergences. Suitable for clustering probabilistic data, time series data, high-dimensional data, and very large data sets.
- spark
- similarity-search
- spark-mllib
- itakura-saito-divergence
- kullback-leibler-divergence
- clustering
- entropy
- embeddings
- k-means
- bregman-divergence
- euclidean-distance
- cosine-similarity
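
Of the measures named in the topics above, squared Euclidean distance, Kullback-Leibler divergence, and Itakura-Saito divergence are all Bregman divergences. The following is a minimal sketch in plain Python of how K-Means generalizes to them. It is not this library's Scala API: every function name here (`squared_euclidean`, `generalized_kl`, `itakura_saito`, `kmeans`) is hypothetical and chosen for illustration only. The sketch relies on a known property of Bregman divergences: when the divergence is measured from a point to its center, the arithmetic mean of a cluster is the center that minimizes the total divergence, so the usual "assign, then average" loop still applies.

```python
import math


def squared_euclidean(p, q):
    # Bregman divergence generated by F(x) = sum(x_i^2).
    return sum((a - b) ** 2 for a, b in zip(p, q))


def generalized_kl(p, q):
    # Generalized Kullback-Leibler divergence, generated by
    # F(x) = sum(x_i * log(x_i)). Assumes strictly positive coordinates.
    return sum(a * math.log(a / b) - a + b for a, b in zip(p, q))


def itakura_saito(p, q):
    # Itakura-Saito divergence, generated by F(x) = -sum(log(x_i)).
    # Assumes strictly positive coordinates.
    return sum(a / b - math.log(a / b) - 1 for a, b in zip(p, q))


def mean(points):
    # Coordinate-wise arithmetic mean of a non-empty list of points.
    n = len(points)
    return [sum(col) / n for col in zip(*points)]


def kmeans(points, centers, divergence, iterations=10):
    # Lloyd-style iteration with a pluggable divergence.
    # The divergence is always evaluated as divergence(point, center).
    for _ in range(iterations):
        clusters = [[] for _ in centers]
        for p in points:
            best = min(range(len(centers)),
                       key=lambda i: divergence(p, centers[i]))
            clusters[best].append(p)
        # An empty cluster keeps its previous center.
        centers = [mean(c) if c else centers[i]
                   for i, c in enumerate(clusters)]
    return centers


points = [[1.0, 1.0], [1.2, 0.8], [5.0, 5.0], [5.2, 4.8]]
initial = [[1.0, 1.0], [5.0, 5.0]]
for d in (squared_euclidean, generalized_kl, itakura_saito):
    print(d.__name__, kmeans(points, initial, d))
```

Only the assignment step depends on the chosen divergence; the center update stays the plain mean. A distributed implementation would perform the same two steps over partitioned data.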
Scala versions:
2.10
massivedatascience-clusterer
Found 6 versions