Global-scale event sourcing and event collaboration with causal consistency
Quasar Analytics is a general-purpose compiler for translating data processing and analytics over semi-structured data into efficient plans that run 100% in the target infrastructure.
A simple library for creating complex neural networks
The MongoDB Spark Connector
Avro Data Source for Apache Spark
Simplifying robust end-to-end machine learning on Apache Spark.
Redshift data source for Apache Spark
A Scala feature transformation library for data science and machine learning
This is the development repository of SparkMeasure, a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark task metrics data.
Spark library for easy MongoDB access
An open source framework for building data analytic applications.
Spark RDD to read, write and delete from HBase
Large-scale event processing with Akka Persistence and Apache Spark
Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)
Serverless proxy for Spark cluster
An efficient updatable key-value store for Apache Spark
Mirror of Apache Bahir
Spark RDD to read and write from HBase
A tool for monitoring and tuning Spark jobs for efficiency.
ETL Library for Machine Learning - data pipelines, data munging and wrangling