MADI: Inter-domain Matching and Intra-domain Discrimination for Cross-domain Speech Recognition

要約

通常、エンドツーエンドの自動音声認識 (ASR) は、ドメインシフトにより新しいドメインに適用されると、パフォーマンスが低下します。
教師なしドメイン適応 (UDA) は、ソースからターゲットドメインに知識を転送することにより、ラベル付けされていないターゲットドメインのパフォーマンスを向上させることを目的としています。
転送可能性を改善するために、既存の UDA アプローチは主に、モデルの識別可能性を無視しながら、ソースドメインとターゲットドメインの分布をグローバルおよび/またはローカルで一致させることに重点を置いています。
この論文では、ドメイン間マッチングとドメイン内識別（MADI）を介したASRの新しいUDAアプローチを提案します。これは、ドメイン間マッチングによるモデルの転送可能性とドメイン内対照識別による識別可能性を同時に改善します。
Libri-Adapt データセットの評価は、私たちのアプローチの有効性を示しています。
MADI は、クロスデバイスおよびクロス環境 ASR の相対ワードエラー率 (WER) をそれぞれ 17.7% および 22.8% 削減します。

要約(オリジナル)

End-to-end automatic speech recognition (ASR) usually suffers from performance degradation when applied to a new domain due to domain shift. Unsupervised domain adaptation (UDA) aims to improve the performance on the unlabeled target domain by transferring knowledge from the source to the target domain. To improve transferability, existing UDA approaches mainly focus on matching the distributions of the source and target domains globally and/or locally, while ignoring the model discriminability. In this paper, we propose a novel UDA approach for ASR via inter-domain MAtching and intra-domain DIscrimination (MADI), which improves the model transferability by fine-grained inter-domain matching and discriminability by intra-domain contrastive discrimination simultaneously. Evaluations on the Libri-Adapt dataset demonstrate the effectiveness of our approach. MADI reduces the relative word error rate (WER) on cross-device and cross-environment ASR by 17.7% and 22.8%, respectively.

arxiv情報

著者	Jiaming Zhou,Shiwan Zhao,Ning Jiang,Guoqing Zhao,Yong Qin
発行日	2023-02-22 09:11:06+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

MADI: Inter-domain Matching and Intra-domain Discrimination for Cross-domain Speech Recognition

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー