The Programming Language Designed For Big Data and AI
Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster
Extended datasource support for Spark/Hadoop on Aliyun E-MapReduce.
A library for querying Binlog with Apache Spark structure streaming, for Spark SQL , DataFrames and [MLSQL](https://www.mlsql.tech).
Avro SerDe for Apache Spark structured APIs.
A library based on delta for Spark and MLSQL
This library is an ongoing effort towards bringing the data exchanging ability between Java/Scala and Python. PyJava introduces Apache Arrow as the exchanging data format.
Spline agent for Apache Spark
Basic framework utilities to quickly start writing production ready Apache Spark applications
Executable Apache Spark Tools: Format Converter & SQL Processor
Kafka offset committer for structured streaming query
A library provides a more easy way to describe DataFrame schema for Spark and [MLSQL](http://www.mlsql.tech).
A library for starting service in Executor when startup.
Provides the DebeziumTransform stage
Testing Hadoop related apps in a single JVM
Provides KafkaExtract, KafkaLoad and KafkaCommitExecute stages