Scala macros for generating Parquet schema projections and filter predicates
Big Data Toolkit for the JVM
Scala Parquet reader. Spark is no longer needed just to read Parquet files.
Schema registry for CSV, TSV, JSON, Avro, and Parquet schemas. Supports schema inference and a GraphQL API.
A tool for data sampling, data generation, and data diffing
Read Spark SQL Parquet files as RDD[Protobuf]