How well do you think you know your data, #dataengineers and #datascientists? You might want to profile it more.
I've been working with the #Python package #ydata-profiling. It has some issues, but once I got it working, I found some surprising details about a dataset that I thought I already knew quite well. #pyspark
https://marcel-jan.eu/datablog/2025/04/24/profiling-data-with-ydata-in-pyspark/
Profiling data with ydata in PySpark | Expedition Data