Distilling Effective Supervision from Severe Label Noise

Tomas Pfister

Abstract

Collecting large-scale data with clean labels for supervised training of neural networks is practically challenging. Although noisy labels are usually cheap to acquire, existing methods degrade substantially under label noise. This paper targets the challenge of robust training in high label noise regimes. The key insight is to leverage a small trusted set to estimate exemplar weights and pseudo labels for the noisy data, so that the noisy data can be reused for supervised training. We present a holistic framework for training deep neural networks in a way that is highly robust to label noise. Our method sets a new state of the art on various types of label noise and achieves excellent performance on large-scale datasets with real-world label noise. For instance, on CIFAR100 with a 40% uniform noise ratio and only 10 trusted labeled examples per class, our method achieves 80.2±0.3% classification accuracy, where the error rate is only 1.4% higher than that of a neural network trained without label noise. Moreover, when the noise ratio is increased to 80%, our method still maintains a high accuracy of 75.5±0.2%, compared to the previous best accuracy of 48.2%. Source code is available at https://github.com/google-research/google-research/tree/master/ieg
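To make the experimental setting in the abstract concrete, the following is a minimal sketch of how a small trusted set and uniformly corrupted labels might be constructed. It is not the authors' code: the helper name `make_noisy_split` is hypothetical, the synthetic label array stands in for real CIFAR100 labels (100 classes, 500 training images each), and "uniform noise" is assumed here to mean that a corrupted label is replaced by a class drawn uniformly from the other classes. Other definitions of uniform noise exist.

```python
import numpy as np


def make_noisy_split(labels, num_classes, noise_ratio, trusted_per_class, seed=0):
    """Hypothetical helper: reserve a clean trusted set, corrupt the rest."""
    rng = np.random.default_rng(seed)
    labels = np.asarray(labels)

    # Keep `trusted_per_class` examples of every class with their clean labels.
    trusted_idx = np.concatenate([
        rng.choice(np.flatnonzero(labels == c), size=trusted_per_class, replace=False)
        for c in range(num_classes)
    ])
    noisy_idx = np.setdiff1d(np.arange(len(labels)), trusted_idx)

    # Assumption: each remaining label is corrupted with probability
    # `noise_ratio`, and a corrupted label moves to a different class
    # chosen uniformly at random.
    noisy_labels = labels.copy()
    flip_idx = noisy_idx[rng.random(len(noisy_idx)) < noise_ratio]
    offsets = rng.integers(1, num_classes, size=len(flip_idx))
    noisy_labels[flip_idx] = (labels[flip_idx] + offsets) % num_classes
    return trusted_idx, noisy_idx, noisy_labels


# Synthetic stand-in for CIFAR100 training labels: 100 classes x 500 images.
labels = np.repeat(np.arange(100), 500)
trusted_idx, noisy_idx, noisy_labels = make_noisy_split(
    labels, num_classes=100, noise_ratio=0.4, trusted_per_class=10
)
observed_noise = np.mean(noisy_labels[noisy_idx] != labels[noisy_idx])
print(len(trusted_idx), len(noisy_idx), round(float(observed_noise), 3))
```

Under these assumptions, the trusted set holds 1,000 clean examples and the remaining 49,000 carry labels of which roughly 40% are wrong. Setting `noise_ratio=0.8` gives the harder regime mentioned in the abstract.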
