Abstract Algebra for Scala
DataStax Spark Cassandra Connector
Elasticsearch real-time search and analytics natively integrated with Hadoop
ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.
Avro Data Source for Apache Spark
GeoMesa is a suite of tools for working with big geo-spatial data in a distributed fashion.
CSV Data Source for Apache Spark 1.x
Expressive types for Spark.
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
XML data source for Spark SQL and DataFrames
Redshift data source for Apache Spark
Eclipse Deeplearning4j, ND4J, DataVec and more - deep learning & linear algebra for Java/Scala with GPUs + Spark
BigDL: Distributed Deep Learning Library for Apache Spark
REST job server for Apache Spark
General utility code used across BDG products. Apache 2 licensed.
MLeap: Deploy Spark Pipelines to Production
Spark library for easy MongoDB access
SANSA RDF Library
General Vectorization Library for Machine Learning Tools
Impatient fork of Ammonite