timgent / spark-data-quality

Data quality control tool built on spark and deequ

Version Matrix

Maven Central javadoc Maven Central javadoc Build Status

Spark Data Quality

A data quality library built with Spark, to give you ultimate flexibility and power in ensuring your data is of high quality.

Checkout the full documentation here:
https://timgent.github.io/spark-data-quality