Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Tranquility helps you send real-time event streams to Druid and handles partitioning, replication, service discovery, and schema rollover, seamlessly and without downtime.
The Official Couchbase Spark Connector
Quasar Analytics is a general-purpose compiler for translating data processing and analytics over semi-structured data into efficient plans that run 100% in the target infrastructure.
This project is used to capture machine learning pipelines created on top of Spark as OK
Java library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs
Write your Spark data to Kafka seamlessly
Avro SerDe for Apache Spark structured APIs.
An open-source toolkit for large-scale genomic analysis
Spark RDD with Lucene's query and entity linkage capabilities
Avro Data Source for Apache Spark
Snowflake Data Source for Apache Spark.
Calliope is a library that integrates Cassandra with the Spark framework.
Geospatial Raster support for Spark DataFrames
A library for SQL optimization and rewriting, including materialized view rewriting
MLeap makes it easy to put Spark ML pipelines into production
Essential Building Blocks for Scala
A COBOL parser and Mainframe/EBCDIC data source for Apache Spark
Distributed Matrix Library