「cs.SD」カテゴリーアーカイブ

Enhancement of a Text-Independent Speaker Verification System by using Feature Combination and Parallel-Structure Classifiers

投稿日: 2024年1月29日作成者: jarxiv

要約話者検証 (SV) システムには、主に特徴抽出と分類という 2 つの個別の … 続きを読む →

カテゴリー: cs.LG, cs.SD, eess.AS | コメントを受け付けていません

Multiple output samples per input in a single-output Gaussian process

投稿日: 2024年1月29日作成者: jarxiv

要約標準のガウスプロセス (GP) では、トレーニングセット内の入力ごとに … 続きを読む →

カテゴリー: cs.CL, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

UNIT-DSR: Dysarthric Speech Reconstruction System Using Speech Unit Normalization

投稿日: 2024年1月29日作成者: jarxiv

要約構音障害音声再構成 (DSR) システムは、構音障害のある音声を正常な音声 … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Comparison of parameters of vowel sounds of russian and english languages

投稿日: 2024年1月29日作成者: jarxiv

要約多言語音声認識システムでは、言語が事前にわかっていないにもかかわらず、信号 … 続きを読む →

カテゴリー: 68T10, cs.CL, cs.SD, eess.AS, H.2.8 | コメントを受け付けていません

Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion

投稿日: 2024年1月29日作成者: jarxiv

要約我々は、神経音響モデルと大規模言語モデル（LLM）を融合することにより、音 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

Disentanglement in a GAN for Unconditional Speech Synthesis

投稿日: 2024年1月26日作成者: jarxiv

要約明示的な条件付けをせずに、潜在空間から直接リアルな音声を合成できるモデルを … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion

投稿日: 2024年1月26日作成者: jarxiv

要約オーディオビジュアル音声分離は、音声認識、日記化、シーン分析、支援技術など … 続きを読む →

カテゴリー: cs.AI, cs.SD, eess.AS | コメントを受け付けていません

HyperSound: Generating Implicit Neural Representations of Audio Signals with Hypernetworks

投稿日: 2024年1月26日作成者: jarxiv

要約暗黙的ニューラル表現 (INR) は急速に成長している研究分野であり、マル … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.NE, cs.SD, eess.AS | コメントを受け付けていません

SpeechGPT-Gen: Scaling Chain-of-Information Speech Generation

投稿日: 2024年1月26日作成者: jarxiv

要約効果的な音声モデリングの恩恵を受けて、現在の音声大規模言語モデル (SLL … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Understanding Self-Supervised Learning of Speech Representation via Invariance and Redundancy Reduction

投稿日: 2024年1月25日作成者: jarxiv

要約自己教師あり学習 (SSL) は、ラベルのないデータから柔軟な音声表現を学 … 続きを読む →

カテゴリー: cs.LG, cs.SD, eess.AS | コメントを受け付けていません

「cs.SD」カテゴリーアーカイブ

Enhancement of a Text-Independent Speaker Verification System by using Feature Combination and Parallel-Structure Classifiers

Multiple output samples per input in a single-output Gaussian process

UNIT-DSR: Dysarthric Speech Reconstruction System Using Speech Unit Normalization

Comparison of parameters of vowel sounds of russian and english languages

Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion

Disentanglement in a GAN for Unconditional Speech Synthesis

TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion

HyperSound: Generating Implicit Neural Representations of Audio Signals with Hypernetworks

SpeechGPT-Gen: Scaling Chain-of-Information Speech Generation

Understanding Self-Supervised Learning of Speech Representation via Invariance and Redundancy Reduction

最近の投稿

最近のコメント

アーカイブ

カテゴリー