"cs.SD" Category Archive

Enhancing Suicide Risk Assessment: A Speech-Based Automated Approach in Emergency Medicine

Posted: April 19, 2024 | Author: jarxiv

Abstract: Specialized psychiatric assessment in emergency departments and care for patients at risk of suicidal tendencies …

Categories: cs.CL, cs.SD, eess.AS, I.2

Vesper: A Compact and Effective Pretrained Model for Speech Emotion Recognition

Posted: April 19, 2024 | Author: jarxiv

Abstract: In this paper, general large-scale pretrained models (PTMs) for the speech emotion recognition task …

Categories: cs.CL, cs.SD, eess.AS

Simultaneous Interpretation Corpus Construction by Large Language Models in Distant Language Pair

Posted: April 19, 2024 | Author: jarxiv

Abstract: In simultaneous machine translation (SiMT) systems, using simultaneous interpretation (SI) corpora …

Categories: cs.AI, cs.CL, cs.LG, cs.SD, eess.AS

Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey

Posted: April 19, 2024 | Author: jarxiv

Abstract: With recent advances in deep learning (DL), for automatic speech recognition (ASR) …

Categories: cs.AI, cs.SD, eess.AS, eess.SP

Dynamic Modality and View Selection for Multimodal Emotion Recognition with Missing Modalities

Posted: April 19, 2024 | Author: jarxiv

Abstract: The study of human emotions has traditionally been a cornerstone of fields such as psychology and neuroscience, but artificial …

Categories: cs.CV, cs.LG, cs.SD, eess.AS

The LuViRA Dataset: Measurement Description

Posted: April 18, 2024 | Author: jarxiv

Abstract: To evaluate localization algorithms that use vision, audio, and radio sensors, a data …

Categories: cs.CV, cs.SD, eess.AS, eess.SP

Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization

Posted: April 17, 2024 | Author: jarxiv

Abstract: Generative multimodal content, for artists and media personnel, …

Categories: cs.AI, cs.CL, cs.SD, eess.AS

Anatomy of Industrial Scale Multilingual ASR

Posted: April 17, 2024 | Author: jarxiv

Abstract: In this paper, serving a variety of application needs, a large-scale multilingual …

Categories: cs.CL, cs.LG, cs.SD, eess.AS

Llama-VITS: Enhancing TTS Synthesis with Semantic Awareness

Posted: April 15, 2024 | Author: jarxiv

Abstract: With recent advances in natural language processing (NLP), large language models (LLMs) …

Categories: cs.CL, cs.SD, eess.AS

Differentiable All-pole Filters for Time-varying Audio Systems

Posted: April 15, 2024 | Author: jarxiv

Abstract: Infinite impulse response filters, in audio effects, synthesizers, and the like, …

Categories: cs.LG, cs.SD, eess.AS

