「cs.CV」カテゴリーアーカイブ

A Review on Geometry and Surface Inspection in 3D Concrete Printing

投稿日: 2025年3月11日作成者: jarxiv

要約建設中の添加剤の使用（AMC）の使用の大幅な成長を考えると、従来の製造され … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

SOGS: Second-Order Anchor for Advanced 3D Gaussian Splatting

投稿日: 2025年3月11日作成者: jarxiv

要約アンカーベースの3Dガウススプラッティング（3D-GS）は、3Dガウス予測 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

VLRMBench: A Comprehensive and Challenging Benchmark for Vision-Language Reward Models

投稿日: 2025年3月11日作成者: jarxiv

要約大規模な視覚言語モデル（LVLMS）は、マルチモーダルタスクで強力なパフォ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Chameleon: Fast-slow Neuro-symbolic Lane Topology Extraction

投稿日: 2025年3月11日作成者: jarxiv

要約レーントポロジ抽出には、車線と交通要素を検出し、その関係を決定することが含 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

LLaVA-RadZ: Can Multimodal Large Language Models Effectively Tackle Zero-shot Radiology Recognition?

投稿日: 2025年3月11日作成者: jarxiv

要約最近、マルチモーダル大規模モデル（MLLM）は、さまざまなビジョン言語タス … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Vulnerabilities in AI-generated Image Detection: The Challenge of Adversarial Attacks

投稿日: 2025年3月11日作成者: jarxiv

要約特にGANおよび拡散モデルの出現による画像統合の最近の進歩は、偽情報の普及 … 続きを読む →

カテゴリー: cs.CR, cs.CV | コメントを受け付けていません

GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous Driving

投稿日: 2025年3月11日作成者: jarxiv

要約高品質のマルチモーダル軌道を生成するためのエンドツーエンドの自律運転方法で … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

PseudoTouch: Efficiently Imaging the Surface Feel of Objects for Robotic Manipulation

投稿日: 2025年3月10日作成者: jarxiv

要約触覚センシングは、人間の器用な操作に不可欠ですが、ロボット工学では広く使用 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

METDrive: Multi-modal End-to-end Autonomous Driving with Temporal Guidance

投稿日: 2025年3月10日作成者: jarxiv

要約マルチモーダルエンドツーエンドの自律運転は、最近の研究で有望な進歩を示して … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Bridging Text and Vision: A Multi-View Text-Vision Registration Approach for Cross-Modal Place Recognition

投稿日: 2025年3月10日作成者: jarxiv

要約モバイルロボットは、場所を正確に識別し、パッケージ配信などのタスクを実行す … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

A Review on Geometry and Surface Inspection in 3D Concrete Printing

SOGS: Second-Order Anchor for Advanced 3D Gaussian Splatting

VLRMBench: A Comprehensive and Challenging Benchmark for Vision-Language Reward Models

Chameleon: Fast-slow Neuro-symbolic Lane Topology Extraction

LLaVA-RadZ: Can Multimodal Large Language Models Effectively Tackle Zero-shot Radiology Recognition?

Vulnerabilities in AI-generated Image Detection: The Challenge of Adversarial Attacks

GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous Driving

PseudoTouch: Efficiently Imaging the Surface Feel of Objects for Robotic Manipulation

METDrive: Multi-modal End-to-end Autonomous Driving with Temporal Guidance

Bridging Text and Vision: A Multi-View Text-Vision Registration Approach for Cross-Modal Place Recognition

最近の投稿

最近のコメント

アーカイブ

カテゴリー