An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
REST job server for Apache Spark
Base classes to use when writing tests with Spark
Project SnappyData - memory-optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster
Livy is an open source REST interface for interacting with Apache Spark from anywhere
C# and F# language bindings and extensions to Apache Spark
Redshift data source for Apache Spark
Serverless proxy for Spark cluster
An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform (http://bit.ly/2oBJSpP), an integrated BI platform on Apache Spark.
Boilerplate framework to use Spark and ZIO together.
Snowflake Data Source for Apache Spark.
Mirror of Apache Livy (Incubating)
A simple Spark-powered ETL framework that just works 🍺
A framework for writing Spark 2.x applications in a pretty way
SANSA RDF Library
Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.
Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines and apply best practices.
Spark connector for SFTP