「cs.SD」カテゴリーアーカイブ

Learning to Dub Movies via Hierarchical Prosody Models

投稿日: 2023年4月5日作成者: jarxiv

要約タイトル：階層的プロソディモデルに基づく映画の吹き替え学習要約：・映画 … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Designing and Evaluating Speech Emotion Recognition Systems: A reality check case study with IEMOCAP

投稿日: 2023年4月4日作成者: jarxiv

要約【タイトル】IEMOCAPを用いた発話感情認識システムの設計と評価：現実的 … 続きを読む →

カテゴリー: cs.LG, cs.SD, eess.AS | コメントを受け付けていません

Practical Conformer: Optimizing size, speed and flops of Conformer for on-Device and cloud ASR

投稿日: 2023年4月4日作成者: jarxiv

要約タイトル：実用的なConformer：オンデバイスおよびクラウドASRのた … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Lego-Features: Exporting modular encoder features for streaming and deliberation ASR

投稿日: 2023年4月4日作成者: jarxiv

要約タイトル： Lego-Features：ストリーミングおよび審議 ASR … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Multilingual Word Error Rate Estimation: e-WER3

投稿日: 2023年4月4日作成者: jarxiv

要約タイトル：Multilingual Word Error Rate Est … 続きを読む →

カテゴリー: cs.CL, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

SIG-VC: A Speaker Information Guided Zero-shot Voice Conversion System for Both Human Beings and Machines

投稿日: 2023年4月4日作成者: jarxiv

要約タイトル：SIG-VC：人間と機械の両方のためのスピーカー情報ガイド付きゼ … 続きを読む →

カテゴリー: cs.AI, cs.SD, eess.AS | コメントを受け付けていません

Unsupervised Anomaly Detection and Localization of Machine Audio: A GAN-based Approach

投稿日: 2023年4月3日作成者: jarxiv

要約タイトル：機械音声の非教師あり異常検出と位置特定：GANベースのアプローチ … 続きを読む →

カテゴリー: cs.LG, cs.SD, eess.AS | コメントを受け付けていません

On Batching Variable Size Inputs for Training End-to-End Speech Enhancement Systems

投稿日: 2023年4月3日作成者: jarxiv

要約タイトル：エンドツーエンド音声強化システムのトレーニングにおける可変サイズ … 続きを読む →

カテゴリー: cs.LG, cs.SD, eess.AS | コメントを受け付けていません

Dialog act guided contextual adapter for personalized speech recognition

投稿日: 2023年4月3日作成者: jarxiv

要約タイトル：パーソナライズド音声認識のためのダイアログアクト誘導コンテキスト … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Exploiting prompt learning with pre-trained language models for Alzheimer’s Disease detection

投稿日: 2023年4月3日作成者: jarxiv

要約タイトル：プレトレーニングされた言語モデルを用いたアルツハイマー病の検出に … 続きを読む →

カテゴリー: cs.CL, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

「cs.SD」カテゴリーアーカイブ

Learning to Dub Movies via Hierarchical Prosody Models

Designing and Evaluating Speech Emotion Recognition Systems: A reality check case study with IEMOCAP

Practical Conformer: Optimizing size, speed and flops of Conformer for on-Device and cloud ASR

Lego-Features: Exporting modular encoder features for streaming and deliberation ASR

Multilingual Word Error Rate Estimation: e-WER3

SIG-VC: A Speaker Information Guided Zero-shot Voice Conversion System for Both Human Beings and Machines

Unsupervised Anomaly Detection and Localization of Machine Audio: A GAN-based Approach

On Batching Variable Size Inputs for Training End-to-End Speech Enhancement Systems

Dialog act guided contextual adapter for personalized speech recognition

Exploiting prompt learning with pre-trained language models for Alzheimer’s Disease detection

最近の投稿

最近のコメント

アーカイブ

カテゴリー