Category Archives: cs.SD

Applications of Artificial Intelligence for Cross-language Intelligibility Assessment of Dysarthric Speech

Posted on May 7, 2025 by jarxiv

Abstract: Purpose: Speech intelligibility is a key outcome in the assessment and management of dysarthria, but …

Categories: cs.CL, cs.SD, eess.AS

SepALM: Audio Language Models Are Error Correctors for Robust Speech Separation

Posted on May 7, 2025 by jarxiv

Abstract: Modern speech separation techniques handle long mixed audio waveforms well, but noisy …

Categories: cs.CL, cs.SD, eess.AS

Music for All: Representational Bias and Cross-Cultural Adaptability of Music Generation Models

Posted on May 7, 2025 by jarxiv

Abstract: The emergence of music language models has greatly improved the automatic music generation capabilities of AI systems …

Categories: cs.AI, cs.CL, cs.LG, cs.MM, cs.SD

Bemba Speech Translation: Exploring a Low-Resource African Language

Posted on May 6, 2025 by jarxiv

Abstract: In this paper, the International Conference on Spoken Language Translation (IWSLT 2025) low-resource language …

Categories: cs.CL, cs.SD, eess.AS

Automatic Proficiency Assessment in L2 English Learners

Posted on May 6, 2025 by jarxiv

Abstract: Second-language (L2) English proficiency is usually, by English teachers or expert raters, …

Categories: cs.CL, cs.SD, eess.AS

fastabx: A library for efficient computation of ABX discriminability

Posted on May 6, 2025 by jarxiv

Abstract: A high-performance Python library for building ABX discrimination tasks, Fast …

Categories: cs.CL, cs.SD, eess.AS

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

Posted on May 6, 2025 by jarxiv

Abstract: Real-time, intelligent, and natural speech interaction, for next-generation human-computer …

Categories: cs.AI, cs.CL, cs.SD, eess.AS

Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play

Posted on May 6, 2025 by jarxiv

Abstract: A voice AI agent that blends seamlessly into daily life is autonomous, real-time …

Categories: cs.AI, cs.CL, cs.SD

FolAI: Synchronized Foley Sound Generation with Semantic and Temporal Alignment

Posted on May 6, 2025 by jarxiv

Abstract: Traditional sound design workflows, such as Foley sound design, …

Categories: cs.CV, cs.LG, cs.MM, cs.SD, eess.AS

How much to Dereverberate? Low-Latency Single-Channel Speech Enhancement in Distant Microphone Scenarios

Posted on May 5, 2025 by jarxiv

Abstract: Within speech enhancement (SE), which improves signal intelligibility and quality, dereverberation is an important subtask …

Categories: cs.LG, cs.SD, eess.AS, I.5.1
