From Vision to Sound: Advancing Audio Anomaly Detection with Vision-Based Algorithms

要約

視覚異常検出（VAD）の最近の進歩により、事前に訓練された特徴抽出器によって生成された埋め込みを活用する洗練されたアルゴリズムが導入されました。
これらの開発に触発されて、オーディオ異常検出（AAD）の問題に対処するために、このようなアルゴリズムのオーディオドメインへの適応を調査します。
主に異常なサンプルを分類するほとんどの既存のAADメソッドとは異なり、私たちのアプローチは、スペクトログラム内の異常のきめ細かい時間周波数局在化を導入し、説明可能性を大幅に改善します。
この機能により、アノマリーがどこでいつ発生するかをより正確に理解することで、結果をエンドユーザーにとってより実用的にします。
産業および環境のベンチマークに関するアプローチを評価し、オーディオ信号の異常の検出におけるVAD技術の有効性を実証します。
さらに、局所的な異常識別を可能にすることにより、説明可能性を向上させ、オーディオの異常検出システムをより解釈可能かつ実用的にします。

要約(オリジナル)

Recent advances in Visual Anomaly Detection (VAD) have introduced sophisticated algorithms leveraging embeddings generated by pre-trained feature extractors. Inspired by these developments, we investigate the adaptation of such algorithms to the audio domain to address the problem of Audio Anomaly Detection (AAD). Unlike most existing AAD methods, which primarily classify anomalous samples, our approach introduces fine-grained temporal-frequency localization of anomalies within the spectrogram, significantly improving explainability. This capability enables a more precise understanding of where and when anomalies occur, making the results more actionable for end users. We evaluate our approach on industrial and environmental benchmarks, demonstrating the effectiveness of VAD techniques in detecting anomalies in audio signals. Moreover, they improve explainability by enabling localized anomaly identification, making audio anomaly detection systems more interpretable and practical.

arxiv情報

著者	Manuel Barusco,Francesco Borsatti,Davide Dalle Pezze,Francesco Paissan,Elisabetta Farella,Gian Antonio Susto
発行日	2025-02-25 16:22:42+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

From Vision to Sound: Advancing Audio Anomaly Detection with Vision-Based Algorithms

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー