「eess.AS」カテゴリーアーカイブ

Enhancing Suicide Risk Assessment: A Speech-Based Automated Approach in Emergency Medicine

投稿日: 2024年4月19日作成者: jarxiv

要約救急部門での専門的な精神医学的評価と自殺傾向のリスクのある患者へのケアへの … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS, I.2 | コメントを受け付けていません

Vesper: A Compact and Effective Pretrained Model for Speech Emotion Recognition

投稿日: 2024年4月19日作成者: jarxiv

要約この論文では、一般的な大規模事前学習モデル (PTM) を音声感情認識タス … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Visually grounded few-shot word learning in low-resource settings

投稿日: 2024年4月19日作成者: jarxiv

要約我々は、ほんの数個の単語と画像の例のペアから新しい単語とその視覚的描写を学 … 続きを読む →

カテゴリー: cs.CL, eess.AS | コメントを受け付けていません

Simultaneous Interpretation Corpus Construction by Large Language Models in Distant Language Pair

投稿日: 2024年4月19日作成者: jarxiv

要約同時機械翻訳 (SiMT) システムでは、同時通訳 (SI) コーパスを使 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey

投稿日: 2024年4月19日作成者: jarxiv

要約深層学習 (DL) の最近の進歩により、自動音声認識 (ASR) にとって … 続きを読む →

カテゴリー: cs.AI, cs.SD, eess.AS, eess.SP | コメントを受け付けていません

Dynamic Modality and View Selection for Multimodal Emotion Recognition with Missing Modalities

投稿日: 2024年4月19日作成者: jarxiv

要約人間の感情の研究は、伝統的に心理学や神経科学などの分野の基礎でしたが、人工 … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

The LuViRA Dataset: Measurement Description

投稿日: 2024年4月18日作成者: jarxiv

要約視覚、音声、無線センサーを利用した位置特定アルゴリズムを評価するためのデー … 続きを読む →

カテゴリー: cs.CV, cs.SD, eess.AS, eess.SP | コメントを受け付けていません

Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization

投稿日: 2024年4月17日作成者: jarxiv

要約ジェネレーティブマルチモーダルコンテンツは、アーティストやメディア担当 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Anatomy of Industrial Scale Multilingual ASR

投稿日: 2024年4月17日作成者: jarxiv

要約このペーパーでは、さまざまなアプリケーションニーズに対応する大規模な多言 … 続きを読む →

カテゴリー: cs.CL, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

A Large-Scale Evaluation of Speech Foundation Models

投稿日: 2024年4月16日作成者: jarxiv

要約基盤モデルパラダイムは、共有基盤モデルを活用して、さまざまなタスクに対し … 続きを読む →

カテゴリー: cs.CL, eess.AS, eess.SP | コメントを受け付けていません

「eess.AS」カテゴリーアーカイブ

Enhancing Suicide Risk Assessment: A Speech-Based Automated Approach in Emergency Medicine

Vesper: A Compact and Effective Pretrained Model for Speech Emotion Recognition

Visually grounded few-shot word learning in low-resource settings

Simultaneous Interpretation Corpus Construction by Large Language Models in Distant Language Pair

Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey

Dynamic Modality and View Selection for Multimodal Emotion Recognition with Missing Modalities

The LuViRA Dataset: Measurement Description

Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization

Anatomy of Industrial Scale Multilingual ASR

A Large-Scale Evaluation of Speech Foundation Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー