「cs.CV」カテゴリーアーカイブ

LMPOcc: 3D Semantic Occupancy Prediction Utilizing Long-Term Memory Prior from Historical Traversals

投稿日: 2025年6月11日作成者: jarxiv

要約ビジョンベースの3Dセマンティック占有率予測は、自律的な運転に重要であり、 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

EVA: An Embodied World Model for Future Video Anticipation

投稿日: 2025年6月11日作成者: jarxiv

要約ビデオ生成モデルは、将来の状態をシミュレートする際に大きな進歩を遂げ、具体 … 続きを読む →

カテゴリー: cs.CV, cs.MM, cs.RO | コメントを受け付けていません

Adaptive path planning for efficient object search by UAVs in agricultural fields

投稿日: 2025年6月11日作成者: jarxiv

要約このペーパーでは、UAVを使用して農業分野でのオブジェクト検索の適応パスプ … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

PhyBlock: A Progressive Benchmark for Physical Understanding and Planning via 3D Block Assembly

投稿日: 2025年6月11日作成者: jarxiv

要約ビジョン言語モデル（VLM）は、具体化されたエージェントの推論と計画におい … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Enhancing Safety of Foundation Models for Visual Navigation through Collision Avoidance via Repulsive Estimation

投稿日: 2025年6月11日作成者: jarxiv

要約追加の範囲センサーを必要とせずに視覚ベースのナビゲーションの安全性を高める … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

A PDE-Based Image Dehazing Method via Atmospheric Scattering Theory

投稿日: 2025年6月11日作成者: jarxiv

要約このホワイトペーパーでは、シングルイメージの脱毛のための新しい部分微分方程 … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

Flow Diverse and Efficient: Learning Momentum Flow Matching via Stochastic Velocity Field Sampling

投稿日: 2025年6月11日作成者: jarxiv

要約最近、特にFlux 1.0やSD 3.0などの一連のRFモデルによって生成 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

CAD-Llama: Leveraging Large Language Models for Computer-Aided Design Parametric 3D Model Generation

投稿日: 2025年6月11日作成者: jarxiv

要約最近、大規模な言語モデル（LLM）が大幅に成功し、一般的なテキストを超えて … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

HunyuanVideo-HOMA: Generic Human-Object Interaction in Multimodal Driven Human Animation

投稿日: 2025年6月11日作成者: jarxiv

要約人間とオブジェクトの相互作用（HOI）ビデオ生成の重要な制限に対処するため … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

ZigzagPointMamba: Spatial-Semantic Mamba for Point Cloud Understanding

投稿日: 2025年6月11日作成者: jarxiv

要約 Pointmambaなどの状態空間モデル（SSM）は、線形の複雑さを伴うポ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

LMPOcc: 3D Semantic Occupancy Prediction Utilizing Long-Term Memory Prior from Historical Traversals

EVA: An Embodied World Model for Future Video Anticipation

Adaptive path planning for efficient object search by UAVs in agricultural fields

PhyBlock: A Progressive Benchmark for Physical Understanding and Planning via 3D Block Assembly

Enhancing Safety of Foundation Models for Visual Navigation through Collision Avoidance via Repulsive Estimation

A PDE-Based Image Dehazing Method via Atmospheric Scattering Theory

Flow Diverse and Efficient: Learning Momentum Flow Matching via Stochastic Velocity Field Sampling

CAD-Llama: Leveraging Large Language Models for Computer-Aided Design Parametric 3D Model Generation

HunyuanVideo-HOMA: Generic Human-Object Interaction in Multimodal Driven Human Animation

ZigzagPointMamba: Spatial-Semantic Mamba for Point Cloud Understanding

最近の投稿

最近のコメント

アーカイブ

カテゴリー