Category archive: cs.CV

Real-time Seafloor Segmentation and Mapping

Posted: April 16, 2025 | Author: jarxiv

Abstract: Posidonia oceanica meadows, for their survival and conservation, …

Categories: cs.CV, cs.RO

ReasonDrive: Efficient Visual Question Answering for Autonomous Vehicles with Reasoning-Enhanced Small Vision-Language Models

Posted: April 16, 2025 | Author: jarxiv

Abstract: Vision-language models (VLMs) show promise for autonomous driving, but often …

Categories: cs.CV, cs.LG, cs.RO

SeeTree — A modular, open-source system for tree detection and orchard localization

Posted: April 16, 2025 | Author: jarxiv

Abstract: Accurate localization is a key functional requirement for precision orchard management. However, …

Categories: cs.CV, cs.RO

LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving

Posted: April 16, 2025 | Author: jarxiv

Abstract: Existing learning-based autonomous driving (AD) systems, with respect to understanding high-level information and rare …

Categories: cs.AI, cs.CL, cs.CV, cs.LG, cs.RO

ZeroGrasp: Zero-Shot Shape Reconstruction Enabled Robotic Grasping

Posted: April 16, 2025 | Author: jarxiv

Abstract: Robotic grasping is a fundamental capability of embodied systems. Many methods, scene …

Categories: cs.CV, cs.RO

Reasoning in visual navigation of end-to-end trained agents: a dynamical systems approach

Posted: April 16, 2025 | Author: jarxiv

Abstract: Thanks to advances in embodied AI, end-to-end trained agents …

Categories: cs.CV, cs.LG, cs.RO

Acquisition of high-quality images for camera calibration in robotics applications via speech prompts

Posted: April 16, 2025 | Author: jarxiv

Abstract: Accurate intrinsic and extrinsic camera calibration, for vision-reliant robotic …

Categories: cs.CV, cs.RO

Using LLMs as prompt modifier to avoid biases in AI image generators

Posted: April 16, 2025 | Author: jarxiv

Abstract: In this study, by modifying user prompts, text-to-image …

Categories: cs.CL, cs.CV, cs.CY

What Is a Good Caption? A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Thoroughness

Posted: April 16, 2025 | Author: jarxiv

Abstract: Visual captioning benchmarks, modern multimodal large language models (M …

Categories: cs.CL, cs.CV, cs.LG

CAP-Net: A Unified Network for 6D Pose and Size Estimation of Categorical Articulated Parts from a Single RGB-D Image

Posted: April 16, 2025 | Author: jarxiv

Abstract: This paper, for articulated objects in robotic manipulation tasks, category-level …

Categories: cs.CV, cs.RO
