"cs.SD" Category Archive

Novel-View Acoustic Synthesis from 3D Reconstructed Rooms

Posted on October 24, 2023 by jarxiv

Summary: We combine blind audio recordings with 3D scene information for novel …

Categories: cs.CV, cs.SD, eess.AS

Definition-independent Formalization of Soundscapes: Towards a Formal Methodology

Posted on October 23, 2023 by jarxiv

Summary: Soundscapes have been studied by researchers from a variety of disciplines, each …

Categories: cs.CV, cs.SD, eess.AS

Two-Stage Triplet Loss Training with Curriculum Augmentation for Audio-Visual Retrieval

Posted on October 23, 2023 by jarxiv

Summary: Cross-modal retrieval models leverage the potential of triplet loss optimization for robust embedding …

Categories: cs.CV, cs.IR, cs.MM, cs.SD, eess.AS

Audio Editing with Non-Rigid Text Prompts

Posted on October 20, 2023 by jarxiv

Summary: In this paper, we explore audio editing with non-rigid text edits …

Categories: cs.LG, cs.SD, eess.AS

Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale

Posted on October 20, 2023 by jarxiv

Summary: Large-scale generative models such as GPT and DALL-E have revolutionized the research community …

Categories: cs.CL, cs.LG, cs.SD, eess.AS

EmoDiarize: Speaker Diarization and Emotion Identification from Speech Signals using Convolutional Neural Networks

Posted on October 20, 2023 by jarxiv

Summary: In the era of advanced artificial intelligence and human-computer interaction, identifying emotions in spoken language …

Categories: cs.CL, cs.SD, eess.AS

Reinforcement Learning and Bandits for Speech and Language Processing: Tutorial, Review and Outlook

Posted on October 20, 2023 by jarxiv

Summary: In recent years, reinforcement learning and bandits, across healthcare, finance, recommendation systems, …

Categories: cs.AI, cs.CL, cs.LG, cs.SD, eess.AS

Analysis and Detection of Pathological Voice using Glottal Source Features

Posted on October 18, 2023 by jarxiv

Summary: Automatic detection of voice pathology enables objective assessment and early diagnostic intervention …

Categories: cs.CL, cs.LG, cs.SD, eess.AS, eess.SP

Wav2vec-based Detection and Severity Level Classification of Dysarthria from Speech

Posted on October 18, 2023 by jarxiv

Summary: Automatic detection and severity level classification of dysarthria directly from acoustic speech signals …

Categories: cs.CL, cs.LG, cs.SD, eess.AS, eess.SP

The Interpreter Understands Your Meaning: End-to-end Spoken Language Understanding Aided by Speech Translation

Posted on October 18, 2023 by jarxiv

Summary: End-to-end spoken language understanding (SLU), with the current … for text and speech …

Categories: cs.CL, cs.SD, eess.AS
