-
tupol/spark-utils 0.6.2
Basic framework utilities to quickly start writing production ready Apache Spark applications
Scala versions: 2.12 -
joomcode/trace-analysis 0.1.1
Library for performance bottleneck detection and optimization efficiency prediction
Scala versions: 2.13 2.12 -
agile-lab-dev/darwin 1.2.2
Avro Schema Evolution made easy
Scala versions: 2.13 2.12 2.11 2.10 -
music-of-the-ainur/almaren-framework 0.9.11-3.4
The Almaren Framework provides a simplified consistent minimalistic layer over Apache Spark. While still allowing you to take advantage of native Apache Spark features. You can still combine it with standard Spark code.
Scala versions: 2.13 -
sansa-stack/archived-sansa-query 0.7.1
SANSA Query Layer
Scala versions: 2.11 -
indix/sparkplug 0.2.0
Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌
Scala versions: 2.10 -
sansa-stack/archived-sansa-inference 0.7.1
A general Inference API based on two of the most popular Big Data processing engines: Apache Spark and Apache Flink
Scala versions: 2.11 -
agile-lab-dev/wasp 2.35.0
WASP is a framework to build complex real time big data applications. It relies on a kind of Kappa/Lambda architecture mainly leveraging Kafka and Spark. If you need to ingest huge amount of heterogeneous data and analyze them through complex pipelines, this is the framework for you.
Scala versions: 2.12 2.11 -
fsanaulla/chronicler 0.7.2
Scala toolchain for InfluxDB
Scala versions: 2.13 2.12 2.11 -
sansa-stack/archived-sansa-owl 0.7.1
SANSA Stack OWL (Web Ontology Language) API
Scala versions: 2.11 -
weaviate/spark-connector 1.3.3
Weaviate connector for Apache Spark
Scala versions: 2.13 2.12 -
locationtech/rasterframes 0.11.1
Geospatial Raster support for Spark DataFrames
Scala versions: 2.12 -
timgent/data-flare 3.2.0_0.1.14
Data quality control tool built on spark and deequ
Scala versions: 2.12 -
absaoss/pramen 1.10.1
Resilient data pipeline framework running on Apache Spark
Scala versions: 2.13 2.12 2.11 -
whylabs/whylogs-java 0.1.3
Profile and monitor your ML data pipeline end-to-end
Scala versions: 2.12