One doc tagged with "near duplicates"

Intrinsic Self-Supervision for Data Quality Audits

- Introduces a dataset cleaning framework that I was able to install and use in under two minutes to clean some sample datasets. I would need to test further to fully evaluate its effectiveness for my domain.