#Deequ can define unit tests for #data -
find errors early, before the data gets fed to eg #machinelearning algorithms
https://github.com/awslabs/deequ

#apache20 #tech

awslabs/deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets. - awslabs/deequ