「cs.SD」カテゴリーアーカイブ

SSPS: Self-Supervised Positive Sampling for Robust Self-Supervised Speaker Verification

投稿日: 2025年5月21日作成者: jarxiv

要約自己学習学習（SSL）は、スピーカー検証（SV）のかなりの進歩をもたらしま … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

SAKURA: On the Multi-hop Reasoning of Large Audio-Language Models Based on Speech and Audio Information

投稿日: 2025年5月20日作成者: jarxiv

要約大規模なオーディオ言語モデル（LALMS）は、スピーチ、オーディオなどのマ … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Machine Learning Approaches to Vocal Register Classification in Contemporary Male Pop Music

投稿日: 2025年5月19日作成者: jarxiv

要約すべての経験レベルの歌手にとって、技術的なレパートリーを学ぶ際の最も困難な … 続きを読む →

カテゴリー: cs.LG, cs.SD, eess.AS | コメントを受け付けていません

Audio Turing Test: Benchmarking the Human-likeness of Large Language Model-based Text-to-Speech Systems in Chinese

投稿日: 2025年5月19日作成者: jarxiv

要約大規模な言語モデル（LLMS）の最近の進歩により、テキストからスピーチ（T … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.HC, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

LegoSLM: Connecting LLM with Speech Encoder using CTC Posteriors

投稿日: 2025年5月19日作成者: jarxiv

要約最近、大規模な事前訓練を受けた音声エンコーダと大規模な言語モデル（LLM） … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

ImprovNet — Generating Controllable Musical Improvisations with Iterative Corruption Refinement

投稿日: 2025年5月19日作成者: jarxiv

要約 Deep Learningがさまざまなドメインにまたがるスタイル転送におけ … 続きを読む →

カテゴリー: cs.AI, cs.SD, eess.AS | コメントを受け付けていません

On the Role of Speech Data in Reducing Toxicity Detection Bias

投稿日: 2025年5月19日作成者: jarxiv

要約テキスト毒性検出システムは、人口統計グループに言及しているサンプルに不均衡 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

Seeing Sound, Hearing Sight: Uncovering Modality Bias and Conflict of AI models in Sound Localization

投稿日: 2025年5月19日作成者: jarxiv

要約犬の樹皮を聞いて、駐車した車を見るためだけに音に向かって曲がると想像してく … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.MM, cs.SD, eess.AS | コメントを受け付けていません

Learning Nonlinear Dynamics in Physical Modelling Synthesis using Neural Ordinary Differential Equations

投稿日: 2025年5月16日作成者: jarxiv

要約モーダル合成方法は、分散された音楽システムをモデル化するための長年のアプロ … 続きを読む →

カテゴリー: cs.LG, cs.SD, eess.AS, physics.comp-ph | コメントを受け付けていません

Deconstructing Jazz Piano Style Using Machine Learning

投稿日: 2025年5月15日作成者: jarxiv

要約芸術的なスタイルは何世紀にもわたって研究されてきましたが、機械学習の最近の … 続きを読む →

カテゴリー: cs.IR, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

「cs.SD」カテゴリーアーカイブ

SSPS: Self-Supervised Positive Sampling for Robust Self-Supervised Speaker Verification

SAKURA: On the Multi-hop Reasoning of Large Audio-Language Models Based on Speech and Audio Information

Machine Learning Approaches to Vocal Register Classification in Contemporary Male Pop Music

Audio Turing Test: Benchmarking the Human-likeness of Large Language Model-based Text-to-Speech Systems in Chinese

LegoSLM: Connecting LLM with Speech Encoder using CTC Posteriors

ImprovNet — Generating Controllable Musical Improvisations with Iterative Corruption Refinement

On the Role of Speech Data in Reducing Toxicity Detection Bias

Seeing Sound, Hearing Sight: Uncovering Modality Bias and Conflict of AI models in Sound Localization

Learning Nonlinear Dynamics in Physical Modelling Synthesis using Neural Ordinary Differential Equations

Deconstructing Jazz Piano Style Using Machine Learning

最近の投稿

最近のコメント

アーカイブ

カテゴリー