「cs.SD」カテゴリーアーカイブ

Cross-speaker Emotion Transfer by Manipulating Speech Style Latents

投稿日: 2023年3月16日作成者: jarxiv

要約近年、感情的なテキスト読み上げはかなりの進歩を遂げています。ただし、大量 … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Once-for-All Sequence Compression for Self-Supervised Speech Models

投稿日: 2023年3月16日作成者: jarxiv

要約時間軸に沿ったシーケンスの長さは、多くの場合、音声処理における計算の支配的 … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech

投稿日: 2023年3月16日作成者: jarxiv

要約この論文では、テキスト音声合成 (TTS) モデルのための大規模な多言語音 … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Leveraging Pretrained Representations with Task-related Keywords for Alzheimer’s Disease Detection

投稿日: 2023年3月15日作成者: jarxiv

要約世界人口の急速な高齢化に伴い、アルツハイマー病 (AD) は特に高齢者に顕 … 続きを読む →

カテゴリー: cs.LG, cs.SD, eess.AS, q-bio.QM | コメントを受け付けていません

A Hierarchical Regression Chain Framework for Affective Vocal Burst Recognition

投稿日: 2023年3月15日作成者: jarxiv

要約非言語発声による感情シグナリングの一般的な方法として、ボーカルバースト … 続きを読む →

カテゴリー: cs.LG, cs.SD, eess.AS, eess.SP | コメントを受け付けていません

I3D: Transformer architectures with input-dependent dynamic depth for speech recognition

投稿日: 2023年3月15日作成者: jarxiv

要約 Transformer ベースのエンドツーエンドの音声認識は、大きな成功を … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Cross-lingual Alzheimer’s Disease detection based on paralinguistic and pre-trained features

投稿日: 2023年3月15日作成者: jarxiv

要約 ICASSP-SPGC-2023 ADReSS-M チャレンジタスクへの … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

QI-TTS: Questioning Intonation Control for Emotional Speech Synthesis

投稿日: 2023年3月15日作成者: jarxiv

要約最近の表現力豊かなテキスト読み上げ (TTS) モデルは、感情的なスピーチ … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Dynamic Alignment Mask CTC: Improved Mask-CTC with Aligned Cross Entropy

投稿日: 2023年3月15日作成者: jarxiv

要約すべてのターゲットトークンを並行して予測するため、非自己回帰モデルは、従 … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Improving CTC-based ASR Models with Gated Interlayer Collaboration

投稿日: 2023年3月15日作成者: jarxiv

要約通常、外部言語モデルを使用しない CTC ベースの自動音声認識 (ASR) … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

「cs.SD」カテゴリーアーカイブ

Cross-speaker Emotion Transfer by Manipulating Speech Style Latents

Once-for-All Sequence Compression for Self-Supervised Speech Models

Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech

Leveraging Pretrained Representations with Task-related Keywords for Alzheimer’s Disease Detection

A Hierarchical Regression Chain Framework for Affective Vocal Burst Recognition

I3D: Transformer architectures with input-dependent dynamic depth for speech recognition

Cross-lingual Alzheimer’s Disease detection based on paralinguistic and pre-trained features

QI-TTS: Questioning Intonation Control for Emotional Speech Synthesis

Dynamic Alignment Mask CTC: Improved Mask-CTC with Aligned Cross Entropy

Improving CTC-based ASR Models with Gated Interlayer Collaboration

最近の投稿

最近のコメント

アーカイブ

カテゴリー