A Scala API for Apache Beam and Google Cloud Dataflow.
A Scala feature transformation library for data science and machine learning
A tool for data sampling, data generation, and data diffing
Google BigQuery support for Spark, SQL, and DataFrames
Scala Aggregators used for ML Model metrics monitoring
A collection of Magnolia add-on modules
A lightweight workflow definition library
DBeam extracts SQL tables using JDBC and Apache Beam
Runs JVM closures in Docker containers on Kubernetes
Provides compile-time derivation of conversions between Scala case classes and Tensorflow Example protocol buffers
Community-supported add-ons for Scio