-
whylabs/whylogs-java 0.1.3
Profile and monitor your ML data pipeline end-to-end
Scala versions: 2.12 -
isarn/isarn-sketches-spark 0.6.0-sp3.2
Routines and data structures for using isarn-sketches idiomatically in Apache Spark
Scala versions: 2.12 -
arcizon/spark-filetransfer 0.3.0
API for reading and writing data via various file transfer protocols from Apache Spark.
Scala versions: 2.12 2.11 -
florentf9/sparkml-som 0.2
:sparkles: Spark ML implementation of SOM algorithm (Kohonen self-organizing map)
Scala versions: 2.11 -
romans-weapon/spear-framework 3.1.1-3.0
Rapid ETL/ELT-connectors/pipeline development leveraged on top of Apache Spark
Scala versions: 2.12 -
s22s/pre-lt-raster-frames 0.6.1
Spark DataFrames for earth observation data
Scala versions: 2.11 -
qubole/streaminglens 0.5.3
Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines
Scala versions: 2.11 -
qubole/s3-sqs-connector 0.5.1
A library for reading data from Amzon S3 with optimised listing using Amazon SQS using Spark SQL Streaming ( or Structured streaming).
Scala versions: 2.11 -
getsentry/sentry-spark 0.0.1-alpha04
Apache Spark Sentry Integration
Scala versions: 2.11 -
piotr-kalanski/data-quality-monitoring 0.3.8
Data Quality Monitoring Tool
Scala versions: 2.11 -
jtnystrom/discount 3.0.1
Very large scale k-mer counting and analysis on Apache Spark.
Scala versions: 2.13 2.12 -
qubole/spark-state-store 1.0.0
Rocksdb state storage implementation for Structured Streaming.
Scala versions: 2.11 -
data-tools/big-data-types 1.4.1
A library to transform Scala product types and Schemes from different systems into other Schemes. Any implemented type automatically gets methods to convert it into the rest of the types and vice versa. E.g: a Spark Schema can be transformed into a BigQuery table.
Scala versions: 3.x 2.13 2.12