Spark Structured Streaming State Tools
Offline Recommender System Evaluation for Spark
A spark package to approximate the diameter of large graphs
Functional, Composable library in Scala based on ZIO for writing ETL jobs in AWS and GCP https://tharwaninitin.github.io/etlflow/site/
Scala-Spark port of https://github.com/bmabey/pyLDAvis for Apache Spark LDA Topic Modelling Visualisation
A library for Spark DataFrame using MinIO Select API
Online latent state estimation with Spark
A library for reading social data from Instagram using Spark Streaming.
Fuzzy matching function in spark (https://spark-packages.org/package/itspawanbhardwaj/spark-fuzzy-matching)
Spark connector for BigQuery
A JDBC streaming source for Spark
A library for reading social data from Facebook using Spark Streaming.
A package for dealing with crowdsourced big data.
Scala implementation of Histogrammar, with optional front-ends and back-ends as separate Maven projects.
Power a Spark Stream from anywhere in your Akka Stream Flow
Apache Spark Data Source for ROOT File Format
A Spark datasource for the HadoopCryptoLedger library
NetFlow data source for Spark SQL and DataFrames