Black-Box Batch Active Learning for Regression

要約

バッチアクティブラーニングは、最初はラベル付けされていない大規模なデータセットで機械学習モデルを効率的にトレーニングするための一般的なアプローチであり、データポイントのバッチのラベルを繰り返し取得します。
ただし、最近の多くのバッチアクティブラーニング手法は、微分可能なパラメトリックモデルに限定されたホワイトボックスアプローチです。モデルの埋め込みまたは 1 次および 2 次導関数に基づく取得関数を使用して、ラベルのないポイントをスコアリングします。
このホワイトペーパーでは、ホワイトボックスアプローチの拡張として、回帰タスクのブラックボックスバッチアクティブラーニングを提案します。
このアプローチは、通常のディープラーニングモデルやベイジアンディープラーニングモデル、ランダムフォレストなどの微分不可能なモデルなど、幅広い機械学習モデルと互換性があります。
これはベイジアンの原則に根ざしており、最近のカーネルベースのアプローチを利用しています。
重要なことは、私たちの方法はモデル予測のみに依存していることです。
これにより、既存の最先端のホワイトボックスバッチアクティブラーニング手法 (BADGE、BAIT、LCMD) を幅広くブラックボックスモデルに拡張することができます。
回帰データセットの広範な実験的評価を通じてアプローチの有効性を実証し、ディープラーニングモデルのホワイトボックスアプローチと比較して驚くほど強力なパフォーマンスを達成しました。

要約(オリジナル)

Batch active learning is a popular approach for efficiently training machine learning models on large, initially unlabelled datasets, which repeatedly acquires labels for a batch of data points. However, many recent batch active learning methods are white-box approaches limited to differentiable parametric models: they score unlabeled points using acquisition functions based on model embeddings or first- and second-order derivatives. In this paper, we propose black-box batch active learning for regression tasks as an extension of white-box approaches. This approach is compatible with a wide range of machine learning models including regular and Bayesian deep learning models and non-differentiable models such as random forests. It is rooted in Bayesian principles and utilizes recent kernel-based approaches. Importantly, our method only relies on model predictions. This allows us to extend a wide range of existing state-of-the-art white-box batch active learning methods (BADGE, BAIT, LCMD) to black-box models. We demonstrate the effectiveness of our approach through extensive experimental evaluations on regression datasets, achieving surprisingly strong performance compared to white-box approaches for deep learning models.

arxiv情報

著者	Andreas Kirsch
発行日	2023-02-17 16:35:47+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Black-Box Batch Active Learning for Regression

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー