Apache Spark - A unified analytics engine for large-scale data processing
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
Eclipse Deeplearning4j, ND4J, DataVec and more - deep learning & linear algebra for Java/Scala with GPUs + Spark
PredictionIO, a machine learning server for developers and ML engineers. Built on Apache Spark, HBase and Spray.
Statistical Machine Intelligence & Learning Engine
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
REST job server for Apache Spark
Abstract Algebra for Scala
Streaming MapReduce with Scalding and Storm
Microsoft Machine Learning for Apache Spark
Fast, Scientific and Numerical Computing for the JVM (NDArrays)
Mirror of Apache Mahout
Base classes to use when writing tests with Spark
The Programming Language Designed For Big Data and AI
GeoTrellis is a geographic data processing engine for high performance applications.
MLeap: Deploy Spark Pipelines to Production
Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster
Integration of TensorFlow with other open-source frameworks
CSV Data Source for Apache Spark 1.x