Redshift data source for Apache Spark
REST job server for Apache Spark
Serializers (input and output) for the phone call-related models
Mirror of Apache livy (Incubating)
Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌
Generation of a few sample data sets. For instance, feature set derived from CDR
Library for building data products
Optics for Spark DataFrames
API enabling switching between Spark execution engine and local fast implementation based on Scala collections.
Livy is an open source REST interface for interacting with Apache Spark from anywhere
Provides KafkaExtract, KafkaLoad and KafkaCommitExecute stages
Calliope is a library integrating Cassandra and Spark framework.
Collection of Spark SQL Helper : udf, udaf, …
A framework for writing Spark 2.x applications in a pretty way
Base classes to use when writing tests with Spark