Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
Eclipse Deeplearning4j, ND4J, DataVec and more - deep learning & linear algebra for Java/Scala with GPUs + Spark
PredictionIO, a machine learning server for developers and ML engineers. Built on Apache Spark, HBase and Spray.
Statistical Machine Intelligence & Learning Engine
REST job server for Apache Spark
Microsoft Machine Learning for Apache Spark
Base classes to use when writing tests with Spark
The Programming Language Designed For Big Data and AI
MLeap: Deploy Spark Pipelines to Production
Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster
Integration of TensorFlow with other open-source frameworks
Expressive types for Spark.
Simplifying robust end-to-end machine learning on Apache Spark.
An open source framework for building data analytic applications.
Serverless proxy for Spark cluster
Scala Library/REPL for Machine Learning Research
Zen aims to provide the largest scale and the most efficient machine learning platform on top of Spark, including but not limited to logistic regression, latent dirichilet allocation, factorization machines and DNN.
An open-source toolkit for large-scale genomic analysis
Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark
Extended datasource support for Spark/Hadoop on Aliyun E-MapReduce.