Basic framework utilities to quickly start writing production ready Apache Spark applications
Use Scala API to read/write data from different databases,HBase,MySQL,etc.
This project aims to make writing Spark applications easier by abstracting the effort to assemble the driver into reusable steps and pipelines.
A library for reading social data from Instagram using Spark Streaming.
Executable Apache Spark Tools: Format Converter & SQL Processor
A library for reading social data from Facebook using Spark Streaming.
Unoffical sink for cassandra for spark structured streaming
Creating reusable workflows for Apache Spark
Divolte default Avro schema to use as external dependency
A JDBC streaming source for Spark
Online latent state estimation with Spark
Spark connector for Ryft ONE
A library for reading public web news results from Bing Custom Search using Spark Streaming.
DIS SDK for SparkStreaming
A library for reading public search results from Reddit using Spark Streaming.
Spark Connector for Alibaba Log Service
A monadic design pattern that can be used to construct data processing pipeline. It also provides several monads implemented using Apache Spark.
A Scala based Spark Publish/Subscribe NATS Connector
A library to query heterogeneous data sources uniformly using SPARQL