dllib is a distributed deep learning library running on Apache Spark
Bucketing and partitioning system for Parquet
A Spark package for retrieving data from Google Analytics
SANSA Query Layer
A Spark datasource for the HadoopOffice library
Calliope is a library integrating Cassandra with the Spark framework.
A library for SQL optimization and rewriting, including materialized view rewriting
Apache Spark OpenCPU Executor (ROSE)
SparkSQL utils for ScalaPB
Scala client for Amazon Kinesis. Also provides the ability to write to Kinesis from Apache Spark or Spark Streaming.
An extension to the amazing Spark framework for better functional programming.
Distributed execution of bioinformatics tools on Apache Spark. Apache 2 licensed.
Spark-Transformers: Library for exporting Apache Spark MLlib models to use them in any Java application with no other dependencies.
Fork of dmlc/xgboost for RAPIDS + XGBoost integration
A general Inference API based on two of the most popular Big Data processing engines: Apache Spark and Apache Flink
SANSA Stack OWL (Web Ontology Language) API
SANSA Machine Learning Layer
Spark DataFrames for earth observation data