CleanLab / CleanVision


Created: 26 Jan 2023, 06:03 PM | Modified: =dateformat(this.file.mtime,"dd MMM yyyy, hh:mm a") Tags: knowledge,


https://github.com/cleanlab/cleanlab

cleanlab automatically detects problems in a ML dataset. This data-centric AI package facilitatesmachine learning with messy, real-world databy providingclean labels for robust training and flagging errors in your data.

From <https://github.com/cleanlab/cleanlab>

Automatic Error Detection for Image/Text Tagging and Multi-label Datasets

Automatic Error Detection for Image/Text Tagging and Multi-label Datasets

From <https://cleanlab.ai/blog/>

https://github.com/cleanlab/cleanvision

Screen Shot 2023-03-10 at 10 23 33 AM

CleanVision automatically detects potential issues in image datasets like images that are: blurry, under/over-exposed, (near) duplicates, etc. This data-centric AI package is a quick first step for any computer vision project to find problems in the dataset, which you want to address before applying machine learning. CleanVision is super simple — run the same couple lines of Python code to audit any image dataset!

From <https://github.com/cleanlab/cleanvision>