Deeplearning4j, ND4J, DataVec and more - deep learning & linear algebra for Java/Scala with GPUs + Spark - From Skymind
Compile-time Language Integrated Queries for Scala
A cohesive & pragmatic framework of FP centric Scala libraries
A Scala feature transformation library for data science and machine learning
REST job server for Apache Spark
NetFlow data source for Spark SQL and DataFrames
Spark extension for processing large-scale 3D data sets, such as astrophysical or high energy physics data.
PageRank in Spark
Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit.ly/2oBJSpP) an Integrated BI platform on Apache Spark.
A Scala based Spark Publish/Subscribe NATS Connector
Natural Korean Processor for Apache Spark
Writing application logic for Spark jobs that can be unit-tested without a SparkContext
Spark ML Lib serving library
Google Spreadsheets datasource for SparkSQL and DataFrames
A library for downloading dataframes from S3 compatible object storage using Select API.
Sparkling Water provides H2O functionality inside Spark cluster
ETL Library for Machine Learning - data pipelines, data munging and wrangling
Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive
dllib is a distributed deep learning library running on Apache Spark
Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.