Composable, Testable library in Scala for writing ETL jobs in Spark with BigQuery support
Native Spark OSM PBF data source
A refreshing treatment for all quality control ailments. Apache 2 licensed.
The Lucius REST API based on Spark-Jobserver
Data Quality Monitoring Tool
Nested array transformation helper extensions for Apache Spark
A Play Module for running Livy Job, that runs code on remote Spark Session.
A Random Walk Engine for Apache Spark
A library for reading public web news results from Bing Custom Search using Spark Streaming.
A Scala based Spark Publish/Subscribe NATS Connector
A library for reading public search results from Reddit using Spark Streaming.
DIS SDK for SparkStreaming
Apache Spark Extensions