「eess.AS」カテゴリーアーカイブ

Is one brick enough to break the wall of spoken dialogue state tracking?

投稿日: 2023年12月6日作成者: jarxiv

要約タスク指向対話 (TOD) システムでは、ユーザーのニーズに対するシステム … 続きを読む →

カテゴリー: cs.AI, cs.CL, eess.AS, eess.SP | コメントを受け付けていません

Iterative autoregression: a novel trick to improve your low-latency speech enhancement model

投稿日: 2023年12月6日作成者: jarxiv

要約ストリーミングモデルは、リアルタイム音声強調ツールの重要なコンポーネント … 続きを読む →

カテゴリー: cs.AI, cs.SD, eess.AS | コメントを受け付けていません

Building Ears for Robots: Machine Hearing in the Age of Autonomy

投稿日: 2023年12月6日作成者: jarxiv

要約この研究では、ロボット聴覚システムの重要性を調査し、多様で不確実な環境で動 … 続きを読む →

カテゴリー: cs.RO, cs.SD, eess.AS | コメントを受け付けていません

Building Ears for Robots: Machine Hearing in the Age of Autonomy

投稿日: 2023年12月5日作成者: jarxiv

要約不確実な環境におけるフィールドロボットの増加により、ロボットの聴覚システム … 続きを読む →

カテゴリー: cs.RO, cs.SD, eess.AS | コメントを受け付けていません

Efficient Deep Speech Understanding at the Edge

投稿日: 2023年12月5日作成者: jarxiv

要約現代の音声理解（SU）では、ストリーミング音声入力の取り込みを含む高度なパ … 続きを読む →

カテゴリー: cs.CL, cs.LG, eess.AS | コメントを受け付けていません

Exploring the Viability of Synthetic Audio Data for Audio-Based Dialogue State Tracking

投稿日: 2023年12月5日作成者: jarxiv

要約対話状態の追跡は、タスク指向の対話システムにおいて情報を抽出する上で重要な … 続きを読む →

カテゴリー: cs.AI, cs.SD, eess.AS | コメントを受け付けていません

H_eval: A new hybrid evaluation metric for automatic speech recognition tasks

投稿日: 2023年12月4日作成者: jarxiv

要約自動音声認識(ASR)システムの評価指標としての単語誤り率(WER)の欠点 … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

Unified Segment-to-Segment Framework for Simultaneous Sequence Generation

投稿日: 2023年12月1日作成者: jarxiv

要約同時シーケンス生成は、ストリーミング音声認識、同時機械翻訳、同時音声翻訳な … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.SD, eess.AS | コメントを受け付けていません

CoDi-2: In-Context, Interleaved, and Interactive Any-to-Any Generation

投稿日: 2023年12月1日作成者: jarxiv

要約 CoDi-2 は、複雑なマルチモーダルのインターリーブ命令に従い、コンテキ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

End-to-end Joint Rich and Normalized ASR with a limited amount of rich training data

投稿日: 2023年11月30日作成者: jarxiv

要約句読点や大文字を含む場合と含まない場合の両方の文字起こしを生成する、統合リ … 続きを読む →

カテゴリー: cs.CL, cs.SD, eess.AS | コメントを受け付けていません

「eess.AS」カテゴリーアーカイブ

Is one brick enough to break the wall of spoken dialogue state tracking?

Iterative autoregression: a novel trick to improve your low-latency speech enhancement model

Building Ears for Robots: Machine Hearing in the Age of Autonomy

Building Ears for Robots: Machine Hearing in the Age of Autonomy

Efficient Deep Speech Understanding at the Edge

Exploring the Viability of Synthetic Audio Data for Audio-Based Dialogue State Tracking

H_eval: A new hybrid evaluation metric for automatic speech recognition tasks

Unified Segment-to-Segment Framework for Simultaneous Sequence Generation

CoDi-2: In-Context, Interleaved, and Interactive Any-to-Any Generation

End-to-end Joint Rich and Normalized ASR with a limited amount of rich training data

最近の投稿

最近のコメント

アーカイブ

カテゴリー