Apache Spark - A unified analytics engine for large-scale data processing
PredictionIO, a machine learning server for developers and ML engineers. Built on Apache Spark, HBase and Spray.
Async Scala-Akka-Netty based Load Test Tool
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Mirror of Apache Mahout
A command-line tool to apply templates defined on GitHub
The Pants Build System
MLeap: Deploy Spark Pipelines to Production
GeoTrellis is a geographic data processing engine for high performance applications.
Project SnappyData - memory-optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster
A Thrift parser/generator
Quasar Analytics is a general-purpose compiler for translating data processing and analytics over semi-structured data into efficient plans that run 100% in the target infrastructure.
Refactoring and linting tool for Scala
Simplifying robust end-to-end machine learning on Apache Spark.
Scala at your command
scalaxb is an XML data binding tool for Scala.