Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark
CSV Data Source for Apache Spark 1.x
A connector for MemSQL and Spark
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Lightweight Scala kernel for Jupyter / IPython 3
An open-source toolkit for large-scale genomic analysis
Avro SerDe for Apache Spark structured APIs.
Tranquility helps you send real-time event streams to Druid and handles partitioning, replication, service discovery, and schema rollover, seamlessly and without downtime.
Snowflake Data Source for Apache Spark.
Quasar Analytics is a general-purpose compiler for translating data processing and analytics over semi-structured data into efficient plans that run 100% in the target infrastructure.
Spark RDD with Lucene's query and entity linkage capabilities
Avro Data Source for Apache Spark
Essential Spark extensions and helper methods ✨😲
This is a library for SQL optimizing/rewriting including Materialized View rewrite
Geospatial Raster support for Spark DataFrames
Boiler plate framework to use Spark and ZIO together.
Essential Building Blocks for Scala
Calliope is a library integrating Cassandra and Spark framework.
MLeap allows for easily putting Spark ML pipelines into production
A COBOL parser and Mainframe/EBCDIC data source for Apache Spark