-
mongodb/mongo-spark
The MongoDB Spark Connector
Scala (JVM): 2.10 2.11 2.12 -
frees-io/freestyle
A cohesive & pragmatic framework of FP centric Scala libraries
Scala (JVM): 2.11 2.12Scala.js: 0.6 -
cdapio/cdap
An open source framework for building data analytic applications.
Scala (JVM): 2.11 -
yotpoltd/metorikku
A simplified, lightweight ETL Framework based on Apache Spark
Scala (JVM): 2.11 2.12 -
lucacanali/sparkmeasure
This is the development repository of SparkMeasure, a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark task metrics data.
Scala (JVM): 2.11 2.12 -
deeplearning4j/scalnet
A Scala wrapper for Deeplearning4j, inspired by Keras. Scala + DL + Spark + GPUs
Scala (JVM): 2.10 2.11 -
housepower/clickhouse-native-jdbc
ClickHouse Native Protocol JDBC implementation
Scala (JVM): 2.11 2.12 -
lightbend/cloudflow
Cloudflow enables users to quickly develop, orchestrate, and operate distributed streaming applications on Kubernetes.
Scala (JVM): 2.12 2.13Sbt: 1.0 -
mrpowers/spark-fast-tests
Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)
Scala (JVM): 2.11 2.12 -
deeplearning4j/datavec
ETL Library for Machine Learning - data pipelines, data munging and wrangling
Scala (JVM): 2.10 2.11 -
sparklinedata/spark-druid-olap
Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit.ly/2oBJSpP) an Integrated BI platform on Apache Spark.
Scala (JVM): 2.10