Apache Spark - A unified analytics engine for large-scale data processing
The Programming Language Designed For Big Data and AI
REST job server for Apache Spark
Eclipse Deeplearning4j, ND4J, DataVec and more - deep learning & linear algebra for Java/Scala with GPUs + Spark
Base classes to use when writing tests with Spark
Project SnappyData - memory-optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on a single machine, Hadoop, Spark, Flink and DataFlow
PredictionIO, a machine learning server for developers and ML engineers. Built on Apache Spark, HBase and Spray.
Livy is an open-source REST interface for interacting with Apache Spark from anywhere
Serverless proxy for Spark clusters
C# and F# language binding and extensions to Apache Spark
SANSA RDF Library
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
An open-source framework for building data analytics applications.
Upserts, Deletes And Incremental Processing on Big Data.
MLeap: Deploy Spark Pipelines to Production
Upserts And Incremental Processing on Big Data
Integration of TensorFlow with other open-source frameworks
Global-scale event sourcing and event collaboration with causal consistency
A connector for Spark that allows reading from and writing to a Redis cluster