Apache Spark - A unified analytics engine for large-scale data processing
Apache Flink
PredictionIO, a machine learning server for developers and ML engineers. Built on Apache Spark, HBase and Spray.
A Scala API for Cascading
Compile-time Language Integrated Queries for Scala
A Scala API for Apache Beam and Google Cloud Dataflow.
Streaming MapReduce with Scalding and Storm
Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster
Reversible conversions between types
Scala extensions for the Kryo serialization library
A Scala feature transformation library for data science and machine learning
Tranquility helps you send real-time event streams to Druid and handles partitioning, replication, service discovery, and schema rollover, seamlessly and without downtime.
Thin Scala wrapper around Kafka Streams Java API
Scala extensions for Storm
Artificial Neural Networks for Scala
A lightweight workflow definition library
CodeFeedr core infrastructure
Scaliper is a scala microbenchmarking toolkit based on Google Caliper.