「eess.AS」カテゴリーアーカイブ

Automatic Proficiency Assessment in L2 English Learners

投稿日: 2025年5月6日作成者: jarxiv

要約英語の第二言語能力（L2）は通常、英語の教師または専門家の評価者によって知 … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

fastabx: A library for efficient computation of ABX discriminability

投稿日: 2025年5月6日作成者: jarxiv

要約 ABX差別タスクを構築するための高性能PythonライブラリであるFast … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

投稿日: 2025年5月6日作成者: jarxiv

要約リアルタイムでインテリジェントかつ自然な音声対話は、次世代の人間とコンピュ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.SD, eess.AS | コメントを受け付けていません

FolAI: Synchronized Foley Sound Generation with Semantic and Temporal Alignment

投稿日: 2025年5月6日作成者: jarxiv

要約従来のサウンドデザインワークフローは、フォーリーサウンドデザインのように、 … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.MM, cs.SD, eess.AS | コメントを受け付けていません

How much to Dereverberate? Low-Latency Single-Channel Speech Enhancement in Distant Microphone Scenarios

投稿日: 2025年5月5日作成者: jarxiv

要約残響除去は、信号の明瞭度と品質を向上させる音声強調（SE）の重要なサブタス … 続きを読む →

カテゴリー: cs.LG, cs.SD, eess.AS, I.5.1 | コメントを受け付けていません

REFFLY: Melody-Constrained Lyrics Editing Model

投稿日: 2025年5月5日作成者: jarxiv

要約メロディから歌詞への自動生成（M2L）は、与えられたメロディに沿った歌詞を … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

CAV-MAE Sync: Improving Contrastive Audio-Visual Mask Autoencoders via Fine-Grained Alignment

投稿日: 2025年5月5日作成者: jarxiv

要約オーディオビジュアル学習における最近の進歩は、モダリティを超えた表現の学習 … 続きを読む →

カテゴリー: cs.CV, cs.MM, cs.SD, eess.AS | コメントを受け付けていません

FlowDubber: Movie Dubbing with LLM-based Semantic-aware Learning and Flow Matching based Voice Enhancing

投稿日: 2025年5月5日作成者: jarxiv

要約ムービーダビングは、与えられた短い参照音声のボーカルの音色を維持しながら、 … 続きを読む →

カテゴリー: cs.CV, cs.MM, cs.SD, eess.AS | コメントを受け付けていません

Voice Cloning: Comprehensive Survey

投稿日: 2025年5月2日作成者: jarxiv

要約音声クローニングは、今日のデジタルの世界で急速に進歩しており、多くの研究者 … 続きを読む →

カテゴリー: cs.AI, cs.SD, eess.AS | コメントを受け付けていません

Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis

投稿日: 2025年5月1日作成者: jarxiv

要約拡散モデルの最近の進歩により、微妙な表現と鮮やかなヘッドの動きを備えたトー … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

「eess.AS」カテゴリーアーカイブ

Automatic Proficiency Assessment in L2 English Learners

fastabx: A library for efficient computation of ABX discriminability

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

FolAI: Synchronized Foley Sound Generation with Semantic and Temporal Alignment

How much to Dereverberate? Low-Latency Single-Channel Speech Enhancement in Distant Microphone Scenarios

REFFLY: Melody-Constrained Lyrics Editing Model

CAV-MAE Sync: Improving Contrastive Audio-Visual Mask Autoencoders via Fine-Grained Alignment

FlowDubber: Movie Dubbing with LLM-based Semantic-aware Learning and Flow Matching based Voice Enhancing

Voice Cloning: Comprehensive Survey

Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis

最近の投稿

最近のコメント

アーカイブ

カテゴリー