Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌
Export spark ml SparseVectors as numpy csr matrix
Scala utils for anything and everything
Asynchronous crawler utils