Read SparkSQL parquet file as RDD[Protobuf]
Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.
Big Data Toolkit for the JVM
Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
A tool for data sampling, data generation, and data diffing
Scala macros for generating Parquet schema projections and filter predicates