Redshift data source for Apache Spark
REST job server for Apache Spark
Serializers (input and output) for the phone call-related models
Mirror of Apache Livy (Incubating)
Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌
Generation of a few sample data sets, for instance a feature set derived from CDR
Library for building data products
A Cluster Computing System for Processing Large-Scale Spatial Data
Optics for Spark DataFrames
API for switching between the Spark execution engine and a fast local implementation based on Scala collections.
Provides KafkaExtract, KafkaLoad and KafkaCommitExecute stages
Calliope is a library integrating Cassandra with the Spark framework.
Collection of Spark SQL helpers: UDFs, UDAFs, …
A framework for writing Spark 2.x applications in a pretty way
Base classes to use when writing tests with Spark