「cs.CV」カテゴリーアーカイブ

CODEI: Resource-Efficient Task-Driven Co-Design of Perception and Decision Making for Mobile Robots Applied to Autonomous Vehicles

投稿日: 2025年3月14日作成者: jarxiv

要約このペーパーでは、安全性、効率、コスト、エネルギー、計算要件、重量などのリ … 続きを読む →

カテゴリー: cs.AI, cs.AR, cs.CV, cs.RO, cs.SY, eess.SY, I.2.10 | コメントを受け付けていません

ReVLA: Reverting Visual Domain Limitation of Robotic Foundation Models

投稿日: 2025年3月14日作成者: jarxiv

要約大規模な言語モデルの最近の進歩と大規模なロボットデータセットへのアクセスは … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

6D Object Pose Tracking in Internet Videos for Robotic Manipulation

投稿日: 2025年3月14日作成者: jarxiv

要約インターネットの指導ビデオから操作されたオブジェクトの一時的に一貫した6D … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

LUMOS: Language-Conditioned Imitation Learning with World Models

投稿日: 2025年3月14日作成者: jarxiv

要約ロボット工学のための言語条件付きマルチタスク模倣学習フレームワークであるL … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Finetuning Generative Trajectory Model with Reinforcement Learning from Human Feedback

投稿日: 2025年3月14日作成者: jarxiv

要約動的環境での自律運転には、人間のような適応軌道を生成することが不可欠です。 … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.RO | コメントを受け付けていません

OODD: Test-time Out-of-Distribution Detection with Dynamic Dictionary

投稿日: 2025年3月14日作成者: jarxiv

要約特にテスト時のOODサンプルがトレーニングの外れ値と大きく異なる場合、ディ … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Streaming Generation of Co-Speech Gestures via Accelerated Rolling Diffusion

投稿日: 2025年3月14日作成者: jarxiv

要約リアルタイムでの共同スピーチジェスチャーを生成するには、時間的一貫性と効率 … 続きを読む →

カテゴリー: cs.CV, cs.HC, cs.LG | コメントを受け付けていません

Video Super-Resolution: All You Need is a Video Diffusion Model

投稿日: 2025年3月14日作成者: jarxiv

要約潜在空間に無条件のビデオ生成モデルを備えた拡散後サンプリングフレームワーク … 続きを読む →

カテゴリー: cs.CV, cs.LG, eess.IV | コメントを受け付けていません

VisualPRM: An Effective Process Reward Model for Multimodal Reasoning

投稿日: 2025年3月14日作成者: jarxiv

要約 8Bパラメーターを備えた高度なマルチモーダルプロセス報酬モデル（PRM）で … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

OSMa-Bench: Evaluating Open Semantic Mapping Under Varying Lighting Conditions

投稿日: 2025年3月14日作成者: jarxiv

要約オープンセマンティックマッピング（OSM）は、セマンティックセグメンテーシ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.RO | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

CODEI: Resource-Efficient Task-Driven Co-Design of Perception and Decision Making for Mobile Robots Applied to Autonomous Vehicles

ReVLA: Reverting Visual Domain Limitation of Robotic Foundation Models

6D Object Pose Tracking in Internet Videos for Robotic Manipulation

LUMOS: Language-Conditioned Imitation Learning with World Models

Finetuning Generative Trajectory Model with Reinforcement Learning from Human Feedback

OODD: Test-time Out-of-Distribution Detection with Dynamic Dictionary

Streaming Generation of Co-Speech Gestures via Accelerated Rolling Diffusion

Video Super-Resolution: All You Need is a Video Diffusion Model

VisualPRM: An Effective Process Reward Model for Multimodal Reasoning

OSMa-Bench: Evaluating Open Semantic Mapping Under Varying Lighting Conditions

最近の投稿

最近のコメント

アーカイブ

カテゴリー