- aliyun/aliyun-emapreduce-datasources 2.2.0
Extended datasource support for Spark/Hadoop on Aliyun E-MapReduce.
Scala versions: 2.11
- microsoft/mobius 2.0.200
C# and F# language binding and extensions to Apache Spark
Scala versions: 2.11
- helgeho/archivespark 3.0
An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.
Scala versions: 2.11
- chermenin/spark-states 0.2
Custom state store providers for Apache Spark
Scala versions: 2.12 2.11
- galliaproject/gallia-core 0.6.1
A schema-aware Scala library for data transformation
Scala versions: 3.x 2.13 2.12
- sansa-stack/sansa-stack 0.9.5
Big Data RDF Processing and Analytics Stack built on Apache Spark and Apache Jena http://sansa-stack.github.io/SANSA-Stack/
Scala versions: 2.12
- simplexspatial/osm4scala 1.0
Scala and Spark library focused on reading OpenStreetMap Pbf files.
Scala versions: 2.11 2.10
- jelmerk/hnswlib 1.2.1
Java library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs
Scala versions: 2.13 2.12 2.11
- swoop-inc/spark-records 3.0.1
Bulletproof Apache Spark jobs with fast root cause analysis of failures.
Scala versions: 2.12
- cerndb/sparkplugins 0.4
Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow running custom code on the executors as they are initialized. This also allows extending the Spark metrics system with user-provided monitoring probes.
Scala versions: 2.13 2.12