CleanLab / CleanVision
Created: 26 Jan 2023, 06:03 PM | Modified: =dateformat(this.file.mtime,"dd MMM yyyy, hh:mm a")
Tags: knowledge,
https://github.com/cleanlab/cleanlab
cleanlab automatically detects problems in a ML dataset. This data-centric AI package facilitatesmachine learning with messy, real-world databy providingclean labels for robust training and flagging errors in your data.
From <https://github.com/cleanlab/cleanlab>

Automatic Error Detection for Image/Text Tagging and Multi-label Datasets
From <https://cleanlab.ai/blog/>
https://github.com/cleanlab/cleanvision

CleanVision automatically detects potential issues in image datasets like images that are: blurry, under/over-exposed, (near) duplicates, etc. This data-centric AI package is a quick first step for any computer vision project to find problems in the dataset, which you want to address before applying machine learning. CleanVision is super simple — run the same couple lines of Python code to audit any image dataset!