「cs.SD」カテゴリーアーカイブ

RealImpact: A Dataset of Impact Sound Fields for Real Objects

投稿日: 2023年6月19日作成者: jarxiv

要約物体は、さまざまな摂動、環境条件、リスナーに対する姿勢の下で独特の音を出し … 続きを読む →

カテゴリー: cs.CV, cs.GR, cs.SD, eess.AS | コメントを受け付けていません

Few-shot bioacoustic event detection at the DCASE 2023 challenge

投稿日: 2023年6月16日作成者: jarxiv

要約フューショット生体音響イベント検出では、対象クラスの少数の例のみにアクセス … 続きを読む →

カテゴリー: cs.LG, cs.SD, eess.AS | コメントを受け付けていません

Pushing the Limits of Unsupervised Unit Discovery for SSL Speech Representation

投稿日: 2023年6月16日作成者: jarxiv

要約音声基礎モデルの自己教師あり学習 (SSL) の優れた一般化能力が大きな注 … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

KIT’s Multilingual Speech Translation System for IWSLT 2023

投稿日: 2023年6月16日作成者: jarxiv

要約既存の音声翻訳ベンチマークの多くは、高品質の録音条件でのネイティブ英語の音 … 続きを読む →

カテゴリー: cs.CL, cs.SD | コメントを受け付けていません

Inconsistency Ranking-based Noisy Label Detection for High-quality Data

投稿日: 2023年6月16日作成者: jarxiv

要約ディープラーニングを成功させるには、注釈付きの高品質で大量のデータが必要で … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language

投稿日: 2023年6月16日作成者: jarxiv

要約現在の自己教師あり学習アルゴリズムはモダリティ固有であることが多く、大量の … 続きを読む →

カテゴリー: cs.CL, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

Audio Tagging on an Embedded Hardware Platform

投稿日: 2023年6月16日作成者: jarxiv

要約畳み込みニューラルネットワーク (CNN) は、さまざまな音声分類タスク … 続きを読む →

カテゴリー: cs.AI, cs.SD, cs.SY, eess.AS, eess.SY | コメントを受け付けていません

ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications

投稿日: 2023年6月16日作成者: jarxiv

要約パーソナルアシスタント、自動音声認識装置、対話理解システムは、相互接続さ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Unsupervised speech enhancement with deep dynamical generative speech and noise models

投稿日: 2023年6月14日作成者: jarxiv

要約この研究は、クリーン音声モデルとして動的変分オートエンコーダ (DVAE) … 続きを読む →

カテゴリー: cs.LG, cs.SD, eess.AS | コメントを受け付けていません

Modality Adaption or Regularization? A Case Study on End-to-End Speech Translation

投稿日: 2023年6月14日作成者: jarxiv

要約事前トレーニングと微調整は、エンドツーエンド音声翻訳 (E2E ST) に … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

「cs.SD」カテゴリーアーカイブ

RealImpact: A Dataset of Impact Sound Fields for Real Objects

Few-shot bioacoustic event detection at the DCASE 2023 challenge

Pushing the Limits of Unsupervised Unit Discovery for SSL Speech Representation

KIT’s Multilingual Speech Translation System for IWSLT 2023

Inconsistency Ranking-based Noisy Label Detection for High-quality Data

Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language

Audio Tagging on an Embedded Hardware Platform

ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications

Unsupervised speech enhancement with deep dynamical generative speech and noise models

Modality Adaption or Regularization? A Case Study on End-to-End Speech Translation

最近の投稿

最近のコメント

アーカイブ

カテゴリー