「eess.AS」カテゴリーアーカイブ

End-to-end Spoken Language Understanding with Tree-constrained Pointer Generator

投稿日: 2023年3月16日作成者: jarxiv

要約エンドツーエンドの音声言語理解 (SLU) には、ロングテールワード … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Chat with the Environment: Interactive Multimodal Perception using Large Language Models

投稿日: 2023年3月16日作成者: jarxiv

要約複雑な世界でロボットの動作をプログラミングするには、器用な低レベルのスキル … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.RO, cs.SD, eess.AS | コメントを受け付けていません

Cross-speaker Emotion Transfer by Manipulating Speech Style Latents

投稿日: 2023年3月16日作成者: jarxiv

要約近年、感情的なテキスト読み上げはかなりの進歩を遂げています。ただし、大量 … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Once-for-All Sequence Compression for Self-Supervised Speech Models

投稿日: 2023年3月16日作成者: jarxiv

要約時間軸に沿ったシーケンスの長さは、多くの場合、音声処理における計算の支配的 … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech

投稿日: 2023年3月16日作成者: jarxiv

要約この論文では、テキスト音声合成 (TTS) モデルのための大規模な多言語音 … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Cascading and Direct Approaches to Unsupervised Constituency Parsing on Spoken Sentences

投稿日: 2023年3月16日作成者: jarxiv

要約教師なし構文解析に関する過去の作業は、記述された形式に限定されています。 … 続きを読む →

カテゴリー: cs.CL, eess.AS | コメントを受け付けていません

Leveraging Pretrained Representations with Task-related Keywords for Alzheimer’s Disease Detection

投稿日: 2023年3月15日作成者: jarxiv

要約世界人口の急速な高齢化に伴い、アルツハイマー病 (AD) は特に高齢者に顕 … 続きを読む →

カテゴリー: cs.LG, cs.SD, eess.AS, q-bio.QM | コメントを受け付けていません

A Hierarchical Regression Chain Framework for Affective Vocal Burst Recognition

投稿日: 2023年3月15日作成者: jarxiv

要約非言語発声による感情シグナリングの一般的な方法として、ボーカルバースト … 続きを読む →

カテゴリー: cs.LG, cs.SD, eess.AS, eess.SP | コメントを受け付けていません

I3D: Transformer architectures with input-dependent dynamic depth for speech recognition

投稿日: 2023年3月15日作成者: jarxiv

要約 Transformer ベースのエンドツーエンドの音声認識は、大きな成功を … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Cross-lingual Alzheimer’s Disease detection based on paralinguistic and pre-trained features

投稿日: 2023年3月15日作成者: jarxiv

要約 ICASSP-SPGC-2023 ADReSS-M チャレンジタスクへの … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

「eess.AS」カテゴリーアーカイブ

End-to-end Spoken Language Understanding with Tree-constrained Pointer Generator

Chat with the Environment: Interactive Multimodal Perception using Large Language Models

Cross-speaker Emotion Transfer by Manipulating Speech Style Latents

Once-for-All Sequence Compression for Self-Supervised Speech Models

Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech

Cascading and Direct Approaches to Unsupervised Constituency Parsing on Spoken Sentences

Leveraging Pretrained Representations with Task-related Keywords for Alzheimer’s Disease Detection

A Hierarchical Regression Chain Framework for Affective Vocal Burst Recognition

I3D: Transformer architectures with input-dependent dynamic depth for speech recognition

Cross-lingual Alzheimer’s Disease detection based on paralinguistic and pre-trained features

最近の投稿

最近のコメント

アーカイブ

カテゴリー