Compile-time Language Integrated Queries for Scala
DataStax Spark Cassandra Connector
ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.
Avro Data Source for Apache Spark
Expressive types for Spark.
CSV Data Source for Apache Spark 1.x
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Redshift data source for Apache Spark
Eclipse Deeplearning4j, ND4J, DataVec and more - deep learning & linear algebra for Java/Scala with GPUs + Spark
Lightweight Scala kernel for Jupyter / IPython 3
REST job server for Apache Spark
General utility code used across BDG products. Apache 2 licensed.
SANSA RDF Library
Impatient fork of Ammonite
Spark library for easy MongoDB access
General Vectorization Lib for Machine Learning Tools
ETL Library for Machine Learning - data pipelines, data munging and wrangling
Distributed Matrix Library
Apache Spark Connector for Azure Cosmos DB