Google BigQuery support for Spark, SQL, and DataFrames
A general Inference API based on two of the most popular Big Data processing engines: Apache Spark and Apache Flink
Spark MLlib wrapper for the Snowball framework
Run spark calculations from Ammonite
Fork of dmlc/xgboost for RAPIDS + XGBoost integration
SANSA Machine Learning Layer
Spark DataFrames for earth observation data
Executable Apache Spark Tools: Format Converter & SQL Processor
Profile and monitor your ML data pipeline end-to-end
Optics for Spark DataFrames
Spark based implementation of the Topological Mapper algorithm
The Almaren Framework provides a simplified consistent minimalistic layer over Apache Spark. While still allowing you to take advantage of native Apache Spark features. You can still combine it with standard Spark code.
Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌
Secondary sort and streaming reduce for Apache Spark
The LinkedIn Fairness Toolkit (LiFT) is a Scala/Spark library that enables the measurement of fairness in large scale machine learning workflows.
Routines and data structures for using isarn-sketches idiomatically in Apache Spark
A library you can include in your Spark job to validate the counters and perform operations on success. Goal is scala/java/python support.
Spline agent for Apache Spark
InfluxDB connector to Apache Spark on top of Chronicler
A Variant Caller, Distributed. Apache 2 licensed.