「cs.AI」カテゴリーアーカイブ

MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization

投稿日: 2025年1月6日作成者: jarxiv

要約近年、音楽タグ付け、楽器分類、キー検出など、様々な音楽インフォマティクス理 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

Speech Retrieval-Augmented Generation without Automatic Speech Recognition

投稿日: 2025年1月6日作成者: jarxiv

要約音声データに対する質問応答の一般的なアプローチの1つは、まず自動音声認識（ … 続きを読む →

カテゴリー: cs.AI, cs.CL, eess.AS | コメントを受け付けていません

Predicate Invention from Pixels via Pretrained Vision-Language Models

投稿日: 2025年1月5日作成者: jarxiv

要約我々の目的は、画像という形の生のセンサー入力が与えられた、変動が激しく、組 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Design Optimizer for Soft Growing Robot Manipulators in Three-Dimensional Environments

投稿日: 2025年1月5日作成者: jarxiv

要約ソフトグローイングロボットは、散らかった環境や危険な環境でのナビゲーション … 続きを読む →

カテゴリー: cs.AI, cs.NE, cs.RO | コメントを受け付けていません

H-Net: A Multitask Architecture for Simultaneous 3D Force Estimation and Stereo Semantic Segmentation in Intracardiac Catheters

投稿日: 2025年1月5日作成者: jarxiv

要約カテーテル治療の成功率は、外科医に提供される感覚データと密接な関係がある。 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO, eess.IV | コメントを受け付けていません

MSC-Bench: Benchmarking and Analyzing Multi-Sensor Corruption for Driving Perception

投稿日: 2025年1月5日作成者: jarxiv

要約マルチセンサーフュージョンモデルは、自律走行知覚、特に3D物体検出やHDマ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

Symmetries-enhanced Multi-Agent Reinforcement Learning

投稿日: 2025年1月5日作成者: jarxiv

要約マルチエージェント強化学習は、エージェントが複雑で協調的な行動を学習するた … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.MA, cs.RO, math.RT | コメントを受け付けていません

Risks of Cultural Erasure in Large Language Models

投稿日: 2025年1月5日作成者: jarxiv

要約大規模な言語モデルは、検索、オンライン教育、旅行計画など、社会的知識の生産 … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization

投稿日: 2025年1月5日作成者: jarxiv

要約近年、音楽タグ付け、楽器分類、キー検出など、様々な音楽理解タスクにおいて、 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

TED: Turn Emphasis with Dialogue Feature Attention for Emotion Recognition in Conversation

投稿日: 2025年1月5日作成者: jarxiv

要約会話における感情認識（ERC）は、複数ターンの文脈をモデル化する手法によっ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization

Speech Retrieval-Augmented Generation without Automatic Speech Recognition

Predicate Invention from Pixels via Pretrained Vision-Language Models

Design Optimizer for Soft Growing Robot Manipulators in Three-Dimensional Environments

H-Net: A Multitask Architecture for Simultaneous 3D Force Estimation and Stereo Semantic Segmentation in Intracardiac Catheters

MSC-Bench: Benchmarking and Analyzing Multi-Sensor Corruption for Driving Perception

Symmetries-enhanced Multi-Agent Reinforcement Learning

Risks of Cultural Erasure in Large Language Models

MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization

TED: Turn Emphasis with Dialogue Feature Attention for Emotion Recognition in Conversation

最近の投稿

最近のコメント

アーカイブ

カテゴリー