Scala macros for generating Parquet schema projections and filter predicates
Schema registry for CSV, TSV, JSON, Avro, and Parquet schemas. Supports schema inference and a GraphQL API.
Read Spark SQL Parquet files as RDD[Protobuf]
Big Data Toolkit for the JVM
Read and write Parquet in Scala. Use Scala classes as the schema. No need to start a cluster.
A tool for data sampling, data generation, and data diffing