Statistics for Scalable and Holistic Qualitative Data Cleaning