We often deal with label errors in datasets, but no common framework exists to support machine learning research and benchmarking with label noise. Announcing cleanlab: a Python package for finding label errors in datasets and learning with noisy labels. cleanlab...
This post overviews the paper Confident Learning: Estimating Uncertainty in Dataset Labels authored by Curtis G. Northcutt, Lu Jiang, and Isaac L. Chuang.
03/07/2019 This post is in the all-time highest ranked posts on Reddit in the r/MachineLearning forum. 03/21/2019 Updates: Amazon links added for all parts. Added Blower-style GPU, faster/cheaper M.2 SSD, and other options. 04/16/2019 Update: A better build is available...