Category archive: "cs.CV"

ContRail: A Framework for Realistic Railway Image Synthesis using ControlNet

Posted: December 11, 2024 | Author: jarxiv

Abstract: Deep learning, owing to its exceptional effectiveness and its applicability to many fields, is widely …

Categories: cs.AI, cs.CV

[MASK] is All You Need

Posted: December 11, 2024 | Author: jarxiv

Abstract: In generative models, next-set prediction-based masked generative models and next-noise prediction-based …

Categories: cs.AI, cs.CV

Robust Bayesian Scene Reconstruction by Leveraging Retrieval-Augmented Priors

Posted: December 10, 2024 | Author: jarxiv

Abstract: For many downstream robotics tasks, constructing 3D representations of object geometry …

Categories: cs.CV, cs.RO

Benchmarking Vision, Language, & Action Models on Robotic Learning Tasks

Posted: December 10, 2024 | Author: jarxiv

Abstract: Vision-language-action (VLA) models are a promising direction for developing general-purpose robotic systems …

Categories: cs.CV, cs.LG, cs.RO

GVDepth: Zero-Shot Monocular Depth Estimation for Ground Vehicles based on Probabilistic Cue Fusion

Posted: December 10, 2024 | Author: jarxiv

Abstract: Generalizing metric monocular depth estimation presents a major challenge because of its ill-posed nature …

Categories: cs.AI, cs.CV, cs.RO

Self-supervised cost of transport estimation for multimodal path planning

Posted: December 10, 2024 | Author: jarxiv

Abstract: For autonomous robots operating in real-world environments, how best to move through their surroundings …

Categories: cs.CV, cs.LG, cs.RO

One-Shot Real-to-Sim via End-to-End Differentiable Simulation and Rendering

Posted: December 10, 2024 | Author: jarxiv

Abstract: Identifying predictive world models for robots in novel environments from sparse online observations …

Categories: cs.CV, cs.GR, cs.RO

AgentAlign: Misalignment-Adapted Multi-Agent Perception for Resilient Inter-Agent Sensor Correlations

Posted: December 10, 2024 | Author: jarxiv

Abstract: Cooperative perception, for connected autonomous vehicles (CAVs) and smart infrastructure …

Categories: cs.CV, cs.RO

Uni-NaVid: A Video-based Vision-Language-Action Model for Unifying Embodied Navigation Tasks

Posted: December 10, 2024 | Author: jarxiv

Abstract: A practical navigation agent, whether following instructions, searching for objects, …

Categories: cs.CV, cs.RO

Vision-Based Deep Reinforcement Learning of UAV Autonomous Navigation Using Privileged Information

Posted: December 10, 2024 | Author: jarxiv

Abstract: For efficient autonomous navigation and obstacle avoidance in complex, unknown environments, U …

Categories: cs.CV, cs.LG, cs.RO
