「cs.CV」カテゴリーアーカイブ

HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model

投稿日: 2025年3月18日作成者: jarxiv

要約一般的な推論のための視覚言語モデル（VLM）の最近の進歩により、視覚言語ア … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

OptiPMB: Enhancing 3D Multi-Object Tracking with Optimized Poisson Multi-Bernoulli Filtering

投稿日: 2025年3月18日作成者: jarxiv

要約複雑な環境での堅牢な知覚、ナビゲーション、および計画を可能にするため、自律 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Exploring 3D Activity Reasoning and Planning: From Implicit Human Intentions to Route-Aware Planning

投稿日: 2025年3月18日作成者: jarxiv

要約 3Dアクティビティの推論と計画は、マルチモーダル学習の最近の進歩のおかげで … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Believing is Seeing: Unobserved Object Detection using Generative Models

投稿日: 2025年3月18日作成者: jarxiv

要約画像には見えないが、カメラの近くにあるオブジェクトは検出できますか？この … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

Vision-based automatic fruit counting with UAV

投稿日: 2025年3月18日作成者: jarxiv

要約賢い農業のために無人航空機（UAV）の使用がますます人気が高まっています。 … 続きを読む →

カテゴリー: cs.CV, cs.RO, eess.IV | コメントを受け付けていません

Free-form language-based robotic reasoning and grasping

投稿日: 2025年3月18日作成者: jarxiv

要約人間の指示に基づいて散らかったビンからロボット把握を実行することは、自由形 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

Multi-Platform Teach-and-Repeat Navigation by Visual Place Recognition Based on Deep-Learned Local Features

投稿日: 2025年3月18日作成者: jarxiv

要約均一で可変的な環境は、モバイルロボットナビゲーションにおける安定した視覚的 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

3D Hierarchical Panoptic Segmentation in Real Orchard Environments Across Different Sensors

投稿日: 2025年3月18日作成者: jarxiv

要約作物の収穫量の推定は、正確な作物収量の推定値が農民の収穫または精度の介入に … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Dense Policy: Bidirectional Autoregressive Learning of Actions

投稿日: 2025年3月18日作成者: jarxiv

要約主流の視覚運動ポリシーは、主に全体的なアクション予測の生成モデルに依存して … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Mind the Gap: Confidence Discrepancy Can Guide Federated Semi-Supervised Learning Across Pseudo-Mismatch

投稿日: 2025年3月18日作成者: jarxiv

要約 Federated Semi-Supervised Learning（FS … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model

OptiPMB: Enhancing 3D Multi-Object Tracking with Optimized Poisson Multi-Bernoulli Filtering

Exploring 3D Activity Reasoning and Planning: From Implicit Human Intentions to Route-Aware Planning

Believing is Seeing: Unobserved Object Detection using Generative Models

Vision-based automatic fruit counting with UAV

Free-form language-based robotic reasoning and grasping

Multi-Platform Teach-and-Repeat Navigation by Visual Place Recognition Based on Deep-Learned Local Features

3D Hierarchical Panoptic Segmentation in Real Orchard Environments Across Different Sensors

Dense Policy: Bidirectional Autoregressive Learning of Actions

Mind the Gap: Confidence Discrepancy Can Guide Federated Semi-Supervised Learning Across Pseudo-Mismatch

最近の投稿

最近のコメント

アーカイブ

カテゴリー