「cs.CV」カテゴリーアーカイブ

MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models

投稿日: 2025年4月8日作成者: jarxiv

要約既存のMLLMベンチマークは、次のために統一されたMLLM（U-MLLM） … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

ZFusion: An Effective Fuser of Camera and 4D Radar for 3D Object Perception in Autonomous Driving

投稿日: 2025年4月8日作成者: jarxiv

要約信頼できる3Dオブジェクトの知覚は、自律運転に不可欠です。すべての気象条 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

HumanDreamer-X: Photorealistic Single-image Human Avatars Reconstruction via Gaussian Restoration

投稿日: 2025年4月7日作成者: jarxiv

要約シングルイメージの人間の再構築は、デジタルヒューマンモデリングアプリケーシ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Floxels: Fast Unsupervised Voxel Based Scene Flow Estimation

投稿日: 2025年4月7日作成者: jarxiv

要約シーンフロー推定は、ロバストな動的物体検出、自動ラベリング、センサー同期な … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Towards Mobile Sensing with Event Cameras on High-agility Resource-constrained Devices: A Survey

投稿日: 2025年4月7日作成者: jarxiv

要約モバイル機器アプリケーションの複雑化に伴い、これらの機器は高い俊敏性を目指 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

FoundationStereo: Zero-Shot Stereo Matching

投稿日: 2025年4月7日作成者: jarxiv

要約ディープステレオマッチングでは、ドメインごとの微調整により、ベンチマークデ … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.RO | コメントを受け付けていません

GraphSeg: Segmented 3D Representations via Graph Edge Addition and Contraction

投稿日: 2025年4月7日作成者: jarxiv

要約非構造化環境で動作するロボットは、多くの場合、正確で一貫性のあるオブジェク … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO, cs.SY, eess.SY | コメントを受け付けていません

Can DeepSeek Reason Like a Surgeon? An Empirical Evaluation for Vision-Language Understanding in Robotic-Assisted Surgery

投稿日: 2025年4月7日作成者: jarxiv

要約 DeepSeekモデルは、その効率的な学習パラダイムと強力な推論能力により … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.RO | コメントを受け付けていません

Real-Time Roadway Obstacle Detection for Electric Scooters Using Deep Learning and Multi-Sensor Fusion

投稿日: 2025年4月7日作成者: jarxiv

要約都市部における電動スクーター（eスクーター）の普及は、その小さな車輪、サス … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning

投稿日: 2025年4月7日作成者: jarxiv

要約コンパクトで情報量の多い3Dシーン表現を構築することは、特に長時間に渡る複 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models

ZFusion: An Effective Fuser of Camera and 4D Radar for 3D Object Perception in Autonomous Driving

HumanDreamer-X: Photorealistic Single-image Human Avatars Reconstruction via Gaussian Restoration

Floxels: Fast Unsupervised Voxel Based Scene Flow Estimation

Towards Mobile Sensing with Event Cameras on High-agility Resource-constrained Devices: A Survey

FoundationStereo: Zero-Shot Stereo Matching

GraphSeg: Segmented 3D Representations via Graph Edge Addition and Contraction

Can DeepSeek Reason Like a Surgeon? An Empirical Evaluation for Vision-Language Understanding in Robotic-Assisted Surgery

Real-Time Roadway Obstacle Detection for Electric Scooters Using Deep Learning and Multi-Sensor Fusion

3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning

最近の投稿

最近のコメント

アーカイブ

カテゴリー