A tool for data sampling, data generation, and data diffing
Big Data Toolkit for the JVM
Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.
Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
Read SparkSQL parquet file as RDD[Protobuf]
Scala macros for generating Parquet schema projections and filter predicates
A set of stream connectors and integrations for Monix. 🔛
A collection of Apache Parquet add-on modules