「cs.SD」カテゴリーアーカイブ

WavLLM: Towards Robust and Adaptive Speech Large Language Model

投稿日: 2024年8月15日作成者: jarxiv

要約大規模言語モデル (LLM) の最近の進歩は、自然言語処理の分野に革命をも … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.SD, eess.AS | コメントを受け付けていません

PeriodWave: Multi-Period Flow Matching for High-Fidelity Waveform Generation

投稿日: 2024年8月15日作成者: jarxiv

要約最近、さまざまな配布外シナリオを条件としたユニバーサル波形生成タスクが研究 … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.SD, eess.AS, eess.SP | コメントを受け付けていません

Robust online reconstruction of continuous-time signals from a lean spike train ensemble code

投稿日: 2024年8月15日作成者: jarxiv

要約動物の感覚刺激はニューロンによってスパイク列に符号化され、スパース性、エネ … 続きを読む →

カテゴリー: cs.AI, cs.NE, cs.SD, eess.AS | コメントを受け付けていません

Integrating Representational Gestures into Automatically Generated Embodied Explanations and its Effects on Understanding and Interaction Quality

投稿日: 2024年8月15日作成者: jarxiv

要約人間の対話において、ジェスチャは、会話のリズムをマークしたり、重要な要素を … 続きを読む →

カテゴリー: cs.CV, cs.GR, cs.HC, cs.SD, eess.AS | コメントを受け付けていません

Exploring the anatomy of articulation rate in spontaneous English speech: relationships between utterance length effects and social factors

投稿日: 2024年8月14日作成者: jarxiv

要約発話速度は、性別、年齢、方言などの社会的カテゴリーによって異なる一方、発話 … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Temporal Variability and Multi-Viewed Self-Supervised Representations to Tackle the ASVspoof5 Deepfake Challenge

投稿日: 2024年8月14日作成者: jarxiv

要約 ASVspoof シリーズの第 5 版である ASVspoof5 は、世界 … 続きを読む →

カテゴリー: cs.AI, cs.SD, eess.AS | コメントを受け付けていません

Neural Speech and Audio Coding

投稿日: 2024年8月14日作成者: jarxiv

要約この論文では、ニューラル音声およびオーディオコーディングシステムの領域内で … 続きを読む →

カテゴリー: cs.AI, cs.SD, eess.AS, eess.SP | コメントを受け付けていません

PSM: Learning Probabilistic Embeddings for Multi-scale Zero-Shot Soundscape Mapping

投稿日: 2024年8月14日作成者: jarxiv

要約サウンドスケープは、人がその場所で知覚する音響環境によって定義されます。 … 続きを読む →

カテゴリー: cs.CV, cs.SD, eess.AS | コメントを受け付けていません

Controlling Surprisal in Music Generation via Information Content Curve Matching

投稿日: 2024年8月13日作成者: jarxiv

要約近年、音楽生成システムの品質と社会の関心が高まっており、これらのシステムを … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Enhancing Dialogue Speech Recognition with Robust Contextual Awareness via Noise Representation Learning

投稿日: 2024年8月13日作成者: jarxiv

要約最近の対話システムはターンベースの音声対話に依存しており、正確な自動音声認 … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

「cs.SD」カテゴリーアーカイブ

WavLLM: Towards Robust and Adaptive Speech Large Language Model

PeriodWave: Multi-Period Flow Matching for High-Fidelity Waveform Generation

Robust online reconstruction of continuous-time signals from a lean spike train ensemble code

Integrating Representational Gestures into Automatically Generated Embodied Explanations and its Effects on Understanding and Interaction Quality

Exploring the anatomy of articulation rate in spontaneous English speech: relationships between utterance length effects and social factors

Temporal Variability and Multi-Viewed Self-Supervised Representations to Tackle the ASVspoof5 Deepfake Challenge

Neural Speech and Audio Coding

PSM: Learning Probabilistic Embeddings for Multi-scale Zero-Shot Soundscape Mapping

Controlling Surprisal in Music Generation via Information Content Curve Matching

Enhancing Dialogue Speech Recognition with Robust Contextual Awareness via Noise Representation Learning

最近の投稿

最近のコメント

アーカイブ

カテゴリー