-
yotpoltd/metorikku
A simplified, lightweight ETL Framework based on Apache Spark
Scala versions: 2.12 2.11 -
mrpowers/spark-fast-tests
Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)
Scala versions: 2.13 2.12 2.11 -
groupon/sparklint
A tool for monitoring and tuning Spark jobs for efficiency.
Scala versions: 2.11 2.10 -
lightbend/cloudflow
Cloudflow enables users to quickly develop, orchestrate, and operate distributed streaming applications on Kubernetes.
Scala versions: 2.13 2.12sbt plugins: 1.0 -
neo4j-contrib/neo4j-spark-connector
Neo4j Connector for Apache Spark, which provides bi-directional read/write access to Neo4j from Spark, using the Spark DataSource APIs
Scala versions: 2.13 2.12 -
hbutani/spark-druid-olap
Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit.ly/2oBJSpP) an Integrated BI platform on Apache Spark.
Scala versions: 2.10 -
azure/azure-event-hubs-spark
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Scala versions: 2.12 2.11 2.10 -
absaoss/abris
Avro SerDe for Apache Spark structured APIs.
Scala versions: 2.12 2.11 -
linkedin/isolation-forest
A Spark/Scala implementation of the isolation forest unsupervised outlier detection algorithm.
Scala versions: 2.13 2.12 2.11