-
hydrospheredata/mist
Serverless proxy for Spark cluster
Scala (JVM): 2.10 2.11 2.12Sbt: 2.11 -
uosdmlab/spark-nkp
Natural Korean Processor for Apache Spark
Scala (JVM): 2.11 -
theastrolab/spark3d
Spark extension for processing large-scale 3D data sets: Astrophysics, High Energy Physics, Meteorology, …
Scala (JVM): 2.11 -
datasystemslab/geospark
A Cluster Computing System for Processing Large-Scale Spatial Data
Scala (JVM): -
seznam/euphoria
Euphoria is an open source Java API for creating unified big-data processing flows. It provides an engine independent programming model which can express both batch and stream transformations.
Scala (JVM): -
coxautomotivedatasolutions/vegalite4s
Vega-Lite4s is a small library over the comprehensive Vega-Lite Javascript visualisation library, allowing you to create beautiful Vega-Lite visualisations in Scala
Scala (JVM): 2.11 2.12 -
azure/azure-cosmosdb-spark
Apache Spark Connector for Azure Cosmos DB
Scala (JVM):Sbt: 2.10 2.11 -
tupol/spark-utils
Basic framework utilities to quickly start writing production ready Apache Spark applications
Scala (JVM): 2.11 2.12 -
seahrh/spark-util
Utility for common use cases and bug workarounds in Apache Spark 2
Scala (JVM): 2.11 -
chitralverma/sparkml-extensions
Scala (JVM): 2.11 -
tupol/spark-tools
Executable Apache Spark Tools: Format Converter & SQL Processor
Scala (JVM): 2.11 -
flipkart-incubator/spark-transformers
Spark-Transformers: Library for exporting Apache Spark MLLIB models in to use them in any Java application with no other dependencies.
Scala (JVM): 2.10 2.11 -
liquidsvm/liquidsvm
Support vector machines (SVMs) and related kernel-based learning algorithms are a well-known class of machine learning algorithms, for non-parametric classification and regression. liquidSVM is an implementation of SVMs whose key features are: fully integrated hyper-parameter selection, extreme speed on both small and large data sets, full flexibility for experts, and inclusion of a variety of different learning scenarios: multi-class classification, ROC, and Neyman-Pearson learning, and least-squares, quantile, and expectile regression.
Scala (JVM): 2.11 -
bizreach/aws-kinesis-scala
Scala client for Amazon Kinesis. Also provides write to Kinesis capability for Apache Spark or Spark Streaming.
Scala (JVM): 2.11 2.12 2.13 -
lucacanali/sparkmeasure
This is the development repository of SparkMeasure, a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark workload metrics data.
Scala (JVM): 2.11 2.12 -
intel-analytics/analytics-zoo
Distributed Tensorflow, Keras and BigDL on Apache Spark
Scala (JVM):Sbt: 1.6 2.1 2.2 2.3 2.4 -
microsoft/mobius
C# and F# language binding and extensions to Apache Spark
Scala (JVM): 2.10 2.11 -
emcecs/spark-ecs-connector
ECS connector for Apache Spark
Scala (JVM): 2.11