「cs.CV」カテゴリーアーカイブ

DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving

投稿日: 2025年3月25日作成者: jarxiv

要約最近、拡散モデルは、マルチモードアクション分布をモデル化できるロボットポリ … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Humanoid Policy ~ Human Policy

投稿日: 2025年3月25日作成者: jarxiv

要約さまざまなデータを使用したヒューマノイドロボットのトレーニング操作ポリシー … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

CarPlanner: Consistent Auto-regressive Trajectory Planning for Large-scale Reinforcement Learning in Autonomous Driving

投稿日: 2025年3月25日作成者: jarxiv

要約軌道計画は、自律的な運転に不可欠であり、複雑な環境での安全で効率的なナビゲ … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.RO | コメントを受け付けていません

ETAP: Event-based Tracking of Any Point

投稿日: 2025年3月25日作成者: jarxiv

要約任意のポイント（TAP）を追跡すると、最近、モーション推定パラダイムが個々 … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Kalib: Easy Hand-Eye Calibration with Reference Point Tracking

投稿日: 2025年3月25日作成者: jarxiv

要約ハンドアイキャリブレーションは、カメラとロボット間の変換を推定することを目 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Robust Tube-based Control Strategy for Vision-guided Autonomous Vehicles

投稿日: 2025年3月25日作成者: jarxiv

要約自律車両の堅牢な制御戦略は、システムの安定性を改善し、ライディングの快適さ … 続きを読む →

カテゴリー: cs.CV, cs.RO, cs.SY, eess.SY | コメントを受け付けていません

DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers

投稿日: 2025年3月25日作成者: jarxiv

要約最近のマルチティーチャー蒸留方法により、複数の基礎モデルのエンコーダーが単 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Beyond Training: Dynamic Token Merging for Zero-Shot Video Understanding

投稿日: 2025年3月25日作成者: jarxiv

要約マルチモーダル大手言語モデル（MLLM）の最近の進歩により、ビデオ理解のた … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Global-Local Tree Search for Language Guided 3D Scene Generation

投稿日: 2025年3月25日作成者: jarxiv

要約 GPT-4などの大きなビジョン言語モデル（VLM）は、さまざまな分野で顕著 … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

LLM Post-Training: A Deep Dive into Reasoning Large Language Models

投稿日: 2025年3月25日作成者: jarxiv

要約大規模な言語モデル（LLM）は、自然言語処理環境を変え、多様なアプリケーシ … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving

Humanoid Policy ~ Human Policy

CarPlanner: Consistent Auto-regressive Trajectory Planning for Large-scale Reinforcement Learning in Autonomous Driving

ETAP: Event-based Tracking of Any Point

Kalib: Easy Hand-Eye Calibration with Reference Point Tracking

Robust Tube-based Control Strategy for Vision-guided Autonomous Vehicles

DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers

Beyond Training: Dynamic Token Merging for Zero-Shot Video Understanding

Global-Local Tree Search for Language Guided 3D Scene Generation

LLM Post-Training: A Deep Dive into Reasoning Large Language Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー