Spark SQS Amazon queue receiver
AMQP data source for dstream (Spark Streaming)
A Scala based Spark Publish/Subscribe NATS Connector
Scala-based DSLink implementation for Apache Spark
Divolte default Avro schema to use as external dependency
A library for reading public search results from Reddit using Spark Streaming.
Impatient fork of Ammonite
A neural network library which trained by Spark RDD instances.
sparkml extend library implements calculation algorithm
Scala library for scraping metadata from specified URLs (e.g. OpenGraph)
Microsoft Machine Learning for Apache Spark
C4E, a Scala or Spark library for local and distributed Clustering.
Approximate Nearest Neighbors in Spark
Spark ML Lib serving library
Export spark ml SparseVectors as numpy csr matrix
Offline Recommender System Evaluation for Spark
An implementation of DBSCAN runing on top of Apache Spark
This project generalizes the Spark MLLIB Batch and Streaming K-Means clusterers in every practical way.