REST job server for Apache Spark
ETL Library for Machine Learning - data pipelines, data munging and wrangling
SANSA RDF Library
A Scala wrapper for Deeplearning4j, inspired by Keras. Scala + DL + Spark + GPUs
Apache Spark Data Source for ROOT File Format
Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.
Generate Scala case classes from a Spark DataFrame schema
Scala-Spark port of https://github.com/bmabey/pyLDAvis for Apache Spark LDA Topic Modelling Visualisation
A Scala feature transformation library for data science and machine learning
A general Inference API based on two of the most popular Big Data processing engines: Apache Spark and Apache Flink
MLeap: Deploy Spark Pipelines to Production
Sparkling Water provides H2O functionality inside a Spark cluster
Expressive types for Spark.
A Unified Integration Platform - Cask Data Application Platform (CDAP)
GeoTrellis for PySpark
SparklingGraph provides an easy-to-use set of features for processing large-scale graphs using Spark and GraphX.
Micro Spark REST API
An RPC framework leveraging the Spark RPC module
Scala client for Amazon Kinesis. Also provides the ability to write to Kinesis from Apache Spark or Spark Streaming.
Geospatial Raster support for Spark DataFrames