Mirror of Apache Spark
REST job server for Apache Spark
Large-scale event processing with Akka Persistence and Apache Spark
ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark and Parquet. Apache 2 licensed.
The missing Matplotlib for Scala + Spark
CSV data source for Spark SQL and DataFrames
Spark library for easy MongoDB access
Avro support for Spark SQL and DataFrames
Redshift data source for Spark
Simplifying robust end-to-end machine learning on Apache Spark.
Scala client for Amazon Kinesis. Also supports writing to Kinesis from Apache Spark or Spark Streaming.
An efficient updatable key-value store for Apache Spark
Base classes to use when writing tests with Spark