ARNet: Automatic Refinement Network for Noisy Partial Label Learning

要約

部分ラベル学習 (PLL) は、各サンプルが一連の候補ラベルに関連付けられている典型的な弱教師あり学習です。
PLL の基本的な前提は、グラウンドトゥルースラベルが候補セットに存在する必要があるということです。
ただし、この仮定は、アノテーターの専門外の判断により満たされない場合があり、PLL の実際の適用が制限されます。
このホワイトペーパーでは、この仮定を緩和し、グラウンドトゥルースラベルが候補セットに存在しない可能性がある、より一般的な問題であるノイズの多い PLL に焦点を当てます。
この困難な問題に対処するために、「Automatic Refinement Network (ARNet)」と呼ばれる新しいフレームワークをさらに提案します。
私たちの方法は、複数のラウンドで構成されています。
各ラウンドでは、2 つの主要なモジュール、つまり、ノイズの多いサンプルの検出とラベルの修正を通じて、ノイズの多いサンプルを浄化します。
これらのモジュールのパフォーマンスを保証するために、ウォームアップトレーニングから開始し、適切な修正エポックを自動的に選択します。
一方、データ拡張を活用して、ARNet の予測エラーをさらに減らします。
理論的分析を通じて、私たちの方法がデータセットのノイズレベルを低減し、最終的にベイズ最適分類器に近づくことができることを証明します。
ARNet の有効性を検証するために、複数のベンチマークデータセットで実験を行います。
実験結果は、ARNet がノイズの多い PLL における既存の最先端のアプローチよりも優れていることを示しています。
私たちのコードはまもなく公開されます。

要約(オリジナル)

Partial label learning (PLL) is a typical weakly supervised learning, where each sample is associated with a set of candidate labels. The basic assumption of PLL is that the ground-truth label must reside in the candidate set. However, this assumption may not be satisfied due to the unprofessional judgment of the annotators, thus limiting the practical application of PLL. In this paper, we relax this assumption and focus on a more general problem, noisy PLL, where the ground-truth label may not exist in the candidate set. To address this challenging problem, we further propose a novel framework called ‘Automatic Refinement Network (ARNet)’. Our method consists of multiple rounds. In each round, we purify the noisy samples through two key modules, i.e., noisy sample detection and label correction. To guarantee the performance of these modules, we start with warm-up training and automatically select the appropriate correction epoch. Meanwhile, we exploit data augmentation to further reduce prediction errors in ARNet. Through theoretical analysis, we prove that our method is able to reduce the noise level of the dataset and eventually approximate the Bayes optimal classifier. To verify the effectiveness of ARNet, we conduct experiments on multiple benchmark datasets. Experimental results demonstrate that our ARNet is superior to existing state-of-the-art approaches in noisy PLL. Our code will be made public soon.

arxiv情報

著者	Zheng Lian,Mingyu Xu,Lan Chen,Licai Sun,Bin Liu,Jianhua Tao
発行日	2022-11-10 04:24:21+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

ARNet: Automatic Refinement Network for Noisy Partial Label Learning

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー