Miscellaneous functionality for manipulating Apache Spark RDDs.
Load genomic BAM files using Apache Spark
type-classes for structural manipulation of algebraic data types
Utilities for writing tests that use Apache Spark.
SBT plugins for publishing to Maven Central, shading and managing dependencies, reporting to Coveralls from TravisCI, and more
low-level helpers for Apache Spark libraries and tests
Enrichment-methods for Scala collections (Iterators, Iterables, Arrays)
Spark-based implementation of pDC3, a linear-time parallel suffix-array-construction algorithm
Libraries for console/file I/O, processing/formatting sizes in bytes, etc.
Math and statistics utilities
Library for representing and working with genomic-sequencing reads.
Utilities for representing genomic loci and reference-genomes
General (non-omics) code used across BDG products. Apache 2 licensed.
Helpers for creating command-line applications
A genomics processing engine and specialized file format built using Apache Avro, Apache Spark and Parquet. Apache 2 licensed.
Stand-alone utility for filtering a BAM file to specific genomic regions, using Apache Spark.