「eess.AS」カテゴリーアーカイブ

Certification of Speaker Recognition Models to Additive Perturbations

投稿日: 2024年4月30日作成者: jarxiv

要約話者認識テクノロジーは、パーソナル仮想アシスタントから安全なアクセスシス … 続きを読む →

カテゴリー: cs.AI, cs.SD, eess.AS | コメントを受け付けていません

A Semi-Automatic Approach to Create Large Gender- and Age-Balanced Speaker Corpora: Usefulness of Speaker Diarization & Identification

投稿日: 2024年4月29日作成者: jarxiv

要約この論文では、32 のカテゴリ (2 つの性別、4 つの年齢層、4 つの録 … 続きを読む →

カテゴリー: cs.CL, cs.DL, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

The LuViRA Dataset: Synchronized Vision, Radio, and Audio Sensors for Indoor Localization

投稿日: 2024年4月29日作成者: jarxiv

要約私たちは、正確かつ堅牢な屋内位置特定のための同期された多感覚データセット、 … 続きを読む →

カテゴリー: cs.CV, cs.SD, eess.AS, eess.SP | コメントを受け付けていません

Audio-Visual Person Verification based on Recursive Fusion of Joint Cross-Attention

投稿日: 2024年4月29日作成者: jarxiv

要約顔と声が互いに密接に関連しているため、視聴覚融合を使用した個人または身元確 … 続きを読む →

カテゴリー: cs.CV, cs.SD, eess.AS | コメントを受け付けていません

Automatic Speech Recognition System-Independent Word Error Rate Estimation

投稿日: 2024年4月29日作成者: jarxiv

要約単語誤り率 (WER) は、自動音声認識 (ASR) システムによって生成 … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Developing Acoustic Models for Automatic Speech Recognition in Swedish

投稿日: 2024年4月26日作成者: jarxiv

要約この論文は、訓練可能なシステムを使用した自動連続音声認識に関するものです。 … 続きを読む →

カテゴリー: 68T10, cs.AI, cs.SD, eess.AS, I.2.0 | コメントを受け付けていません

ActiveRIR: Active Audio-Visual Exploration for Acoustic Environment Modeling

投稿日: 2024年4月26日作成者: jarxiv

要約環境音響モデルは、特定の音源/受信機の場所において、音が屋内環境の物理的特 … 続きを読む →

カテゴリー: cs.CV, cs.RO, cs.SD, eess.AS | コメントを受け付けていません

U2++ MoE: Scaling 4.7x parameters with minimal impact on RTF

投稿日: 2024年4月26日作成者: jarxiv

要約 Scale は自然言語処理の新たな境地を切り開きましたが、それには高いコス … 続きを読む →

カテゴリー: cs.CL, eess.AS, I.2.7 | コメントを受け付けていません

STaR: Distilling Speech Temporal Relation for Lightweight Speech Self-Supervised Learning Models

投稿日: 2024年4月26日作成者: jarxiv

要約 Transformer ベースの音声自己教師あり学習 (SSL) モデルは … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Automatic Speech Recognition System-Independent Word Error Rate Estimatio

投稿日: 2024年4月26日作成者: jarxiv

要約単語誤り率 (WER) は、自動音声認識 (ASR) システムによって生成 … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

「eess.AS」カテゴリーアーカイブ

Certification of Speaker Recognition Models to Additive Perturbations

A Semi-Automatic Approach to Create Large Gender- and Age-Balanced Speaker Corpora: Usefulness of Speaker Diarization & Identification

The LuViRA Dataset: Synchronized Vision, Radio, and Audio Sensors for Indoor Localization

Audio-Visual Person Verification based on Recursive Fusion of Joint Cross-Attention

Automatic Speech Recognition System-Independent Word Error Rate Estimation

Developing Acoustic Models for Automatic Speech Recognition in Swedish

ActiveRIR: Active Audio-Visual Exploration for Acoustic Environment Modeling

U2++ MoE: Scaling 4.7x parameters with minimal impact on RTF

STaR: Distilling Speech Temporal Relation for Lightweight Speech Self-Supervised Learning Models

Automatic Speech Recognition System-Independent Word Error Rate Estimatio

最近の投稿

最近のコメント

アーカイブ

カテゴリー