Production-ready K-Means clustering for Apache Spark with pluggable Bregman divergences (KL, Itakura-Saito, L1, etc). 6 algorithms, 740 tests, cross-version persistence. Drop-in replacement for MLlib with mathematically correct distance functions for probability distributions, spectral data, and count data.
- euclidean-distance
 - bregman-divergence
 - cosine-similarity
 - itakura-saito-divergence
 - embeddings
 - similarity-search
 - k-means
 - clustering
 - kullback-leibler-divergence
 - spark
 - entropy
 - spark-mllib
 
              Scala versions:
              
                2.10
              
            
          
    
          
    
          
    
          
    
          
        massivedatascience-clusterer 1.0-9f86973fa49d924861cc0129027b4b2aaa1196a7
Group ID:
     com.massivedatascience 
  Artifact ID:
     massivedatascience-clusterer_2.10 
  Version:
     1.0-9f86973fa49d924861cc0129027b4b2aaa1196a7 
  Release Date:
     Apr 20, 2015 
  Licenses:
    
  libraryDependencies += "com.massivedatascience" %% "massivedatascience-clusterer" % "1.0-9f86973fa49d924861cc0129027b4b2aaa1196a7"
resolvers += Resolver.bintrayRepo("derrickburns", "maven")
          
        ivy"com.massivedatascience::massivedatascience-clusterer:1.0-9f86973fa49d924861cc0129027b4b2aaa1196a7"
MavenRepository("https://dl.bintray.com/derrickburns/maven")
          
        //> using dep "com.massivedatascience::massivedatascience-clusterer:1.0-9f86973fa49d924861cc0129027b4b2aaa1196a7"
import $ivy.`com.massivedatascience::massivedatascience-clusterer:1.0-9f86973fa49d924861cc0129027b4b2aaa1196a7` import ammonite._, Resolvers._ val res = Resolver.Http( "Bintray derrickburns maven", "Some(https://dl.bintray.com/derrickburns/maven)", IvyPattern, false) interp.resolvers() = interp.resolvers() :+ res
<dependency> <groupId>com.massivedatascience</groupId> <artifactId>massivedatascience-clusterer_2.10</artifactId> <version>1.0-9f86973fa49d924861cc0129027b4b2aaa1196a7</version> </dependency>
compile group: 'com.massivedatascience', name: 'massivedatascience-clusterer_2.10', version: '1.0-9f86973fa49d924861cc0129027b4b2aaa1196a7'