Extending DNN-based Multiplicative Masking to Deep Subband Filtering for Improved Dereverberation

要約

この論文では、時間-周波数ドメインでの音声復元のために、ディープニューラルネットワークベースの乗法マスカーをディープサブバンドフィルターに拡張するためのスキームを提示します。
結果として得られる方法は、時間-周波数領域でマスクを提供する任意の深層ニューラルネットワークに一般的に適用できますが、必要なトレーニング可能なパラメーターはわずかであり、最先端のニューラルネットワークでは無視できるほどの計算オーバーヘッドが必要です。
結果として得られるディープサブバンドフィルタリングスキームが、残響除去の乗算マスキングよりも優れていることを示しますが、ノイズ除去のパフォーマンスは実質的に同じままです。
これは、時間-周波数領域での深いサブバンドフィルタリングが残響除去の文献でよく想定されるサブバンド近似に適合するためであると主張しますが、乗法マスキングはノイズ除去で一般的に使用される狭帯域近似に対応します。

要約(オリジナル)

In this paper, we present a scheme for extending deep neural network-based multiplicative maskers to deep subband filters for speech restoration in the time-frequency domain. The resulting method can be generically applied to any deep neural network providing masks in the time-frequency domain, while requiring only few more trainable parameters and a computational overhead that is negligible for state-of-the-art neural networks. We demonstrate that the resulting deep subband filtering scheme outperforms multiplicative masking for dereverberation, while leaving the denoising performance virtually the same. We argue that this is because deep subband filtering in the time-frequency domain fits the subband approximation often assumed in the dereverberation literature, whereas multiplicative masking corresponds to the narrowband approximation generally employed in denoising.

arxiv情報

著者	Jean-Marie Lemercier,Julian Tobergte,Timo Gerkmann
発行日	2023-03-01 14:10:21+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Extending DNN-based Multiplicative Masking to Deep Subband Filtering for Improved Dereverberation

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー