「cs.AI」カテゴリーアーカイブ

AugMapNet: Improving Spatial Latent Structure via BEV Grid Augmentation for Enhanced Vectorized Online HD Map Construction

投稿日: 2025年3月18日作成者: jarxiv

要約自律運転には、レーンや横断歩道などのインフラストラクチャ要素を理解する必要 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing

投稿日: 2025年3月18日作成者: jarxiv

要約要素レベルの視覚操作はデジタルコンテンツの作成に不可欠ですが、現在の拡散ベ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.MM | コメントを受け付けていません

Humanoid Policy ~ Human Policy

投稿日: 2025年3月18日作成者: jarxiv

要約さまざまなデータを使用したヒューマノイドロボットのトレーニング操作ポリシー … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning

投稿日: 2025年3月18日作成者: jarxiv

要約独自の時間的次元を備えたビデオは、回答が視覚的で解釈可能な証拠に直接リンク … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Training Directional Locomotion for Quadrupedal Low-Cost Robotic Systems via Deep Reinforcement Learning

投稿日: 2025年3月17日作成者: jarxiv

要約この作業では、現実の世界の低コスト四足動物ロボットの方向移動のディープ補強 … 続きを読む →

カテゴリー: cs.AI, cs.RO | コメントを受け付けていません

Low-cost Real-world Implementation of the Swing-up Pendulum for Deep Reinforcement Learning Experiments

投稿日: 2025年3月17日作成者: jarxiv

要約 Deep Rehnection Learning（DRL）は仮想ドメインと … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO, cs.SY, eess.SY | コメントを受け付けていません

MoMa-Kitchen: A 100K+ Benchmark for Affordance-Grounded Last-Mile Navigation in Mobile Manipulation

投稿日: 2025年3月17日作成者: jarxiv

要約モバイル操作では、ナビゲーションと操作はしばしば別々の問題として扱われ、そ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

EmbodiedVSR: Dynamic Scene Graph-Guided Chain-of-Thought Reasoning for Visual Spatial Tasks

投稿日: 2025年3月17日作成者: jarxiv

要約マルチモーダルの大手言語モデル（MLLM）は、具体化された知性を画期的に進 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

Virtual Guidance as a Mid-level Representation for Navigation with Augmented Reality

投稿日: 2025年3月17日作成者: jarxiv

要約自律的なナビゲーションのコンテキストでは、特にナビゲーション情報がビジョン … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

LIX: Implicitly Infusing Spatial Geometric Prior Knowledge into Visual Semantic Segmentation for Autonomous Driving

投稿日: 2025年3月17日作成者: jarxiv

要約視覚セマンティックセグメンテーションのために二重エンコーダを使用してデータ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

AugMapNet: Improving Spatial Latent Structure via BEV Grid Augmentation for Enhanced Vectorized Online HD Map Construction

BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing

Humanoid Policy ~ Human Policy

VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning

Training Directional Locomotion for Quadrupedal Low-Cost Robotic Systems via Deep Reinforcement Learning

Low-cost Real-world Implementation of the Swing-up Pendulum for Deep Reinforcement Learning Experiments

MoMa-Kitchen: A 100K+ Benchmark for Affordance-Grounded Last-Mile Navigation in Mobile Manipulation

EmbodiedVSR: Dynamic Scene Graph-Guided Chain-of-Thought Reasoning for Visual Spatial Tasks

Virtual Guidance as a Mid-level Representation for Navigation with Augmented Reality

LIX: Implicitly Infusing Spatial Geometric Prior Knowledge into Visual Semantic Segmentation for Autonomous Driving

最近の投稿

最近のコメント

アーカイブ

カテゴリー