Avro Data Source for Apache Spark
CSV Data Source for Apache Spark 1.x
Expressive types for Spark.
Redshift data source for Apache Spark
REST job server for Apache Spark
General utility code used across BDG products. Apache 2 licensed.
Impatient fork of Ammonite
Spark library for easy MongoDB access
General Vectorization Lib for Machine Learning Tools
ETL Library for Machine Learning - data pipelines, data munging and wrangling
Distributed Matrix Library
Miscellaneous functionality for manipulating Apache Spark RDDs.
Utilities for writing tests that use Apache Spark.
low-level helpers for Apache Spark libraries and tests
The Official Couchbase Spark Connector
A Cluster Computing System for Processing Large-Scale Spatial Data
An ADAM extension library for loading .vcf files annotated with SnpEff and SnpSift.
Mirror of Apache Bahir
Basic framework utilities to quickly start writing production ready Apache Spark applications
Utilities for representing genomic loci and reference-genomes