DataStax Spark Cassandra Connector
The Programming Language Designed For Big Data and AI
MLeap: Deploy Spark Pipelines to Production
Base classes to use when writing tests with Spark
Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster
GeoMesa is a suite of tools for working with big geo-spatial data in a distributed fashion.
Easy access to big things. Library for Apache Spark extending and improving its capabilities
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
A library for querying Binlog with Apache Spark structure streaming, for Spark SQL , DataFrames and [MLSQL](https://www.mlsql.tech).
Extended datasource implementation for Spark/Hadoop on Aliyun E-MapReduce.
An open-source toolkit for large-scale genomic analysis
MLeap allows for easily putting Spark ML pipelines into production
Showcase for IoT Platform Blog
A library based on delta for Spark and MLSQL
This is a library for SQL optimizing/rewriting including Materialized View rewrite
This library is an ongoing effort towards bringing the data exchanging ability between Java/Scala and Python. PyJava introduces Apache Arrow as the exchanging data format.
YugabyteDB Spark Connector for YCQL, based on the DataStax Connector
A library provides a more easy way to describe DataFrame schema for Spark and [MLSQL](http://www.mlsql.tech).
A library for starting service in Executor when startup.