「cs.CV」カテゴリーアーカイブ

Active InSAR monitoring of building damage in Gaza during the Israel-Hamas War

投稿日: 2025年6月18日作成者: jarxiv

要約 2023年10月7日から始まるガザ地区の空中爆撃は、21世紀の最も激しい爆 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

SyncTalk++: High-Fidelity and Efficient Synchronized Talking Heads Synthesis Using Gaussian Splatting

投稿日: 2025年6月18日作成者: jarxiv

要約現実的で音声駆動型のトーキングヘッドビデオの統合において高い同期を達成する … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Cost-Aware Routing for Efficient Text-To-Image Generation

投稿日: 2025年6月18日作成者: jarxiv

要約拡散モデルは、反復的な除去プロセスを介して入力プロンプトの高忠実度画像を生 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Scaling-Up the Pretraining of the Earth Observation Foundation Model PhilEO to the MajorTOM Dataset

投稿日: 2025年6月18日作成者: jarxiv

要約今日、地球観測（EO）衛星は大量のデータを生成し、コペルニクスセンチネル2 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM

投稿日: 2025年6月18日作成者: jarxiv

要約マルチモーダル大手言語モデル（MLLM）はしばしば幻覚に苦しんでいます。 … 続きを読む →

カテゴリー: 68T45, cs.CL, cs.CV | コメントを受け付けていません

CDP: Towards Robust Autoregressive Visuomotor Policy Learning via Causal Diffusion

投稿日: 2025年6月18日作成者: jarxiv

要約拡散ポリシー（DP）により、ロボットはアクション拡散を通じて専門家のデモを … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Lecture Video Visual Objects (LVVO) Dataset: A Benchmark for Visual Object Detection in Educational Videos

投稿日: 2025年6月18日作成者: jarxiv

要約教育ビデオコンテンツでの視覚オブジェクト検出のための新しいベンチマークであ … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

VideoPDE: Unified Generative PDE Solving via Video Inpainting Diffusion Models

投稿日: 2025年6月18日作成者: jarxiv

要約ビデオインペインティング拡散トランスモデルを使用して、部分微分方程式（PD … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

SuperPoint-SLAM3: Augmenting ORB-SLAM3 with Deep Features, Adaptive NMS, and Learning-Based Loop Closure

投稿日: 2025年6月17日作成者: jarxiv

要約視覚的な同時ローカリゼーションとマッピング（SLAM）は、極端な視点、スケ … 続きを読む →

カテゴリー: cs.CV, cs.RO, I.2.10 | コメントを受け付けていません

A Novel ViDAR Device With Visual Inertial Encoder Odometry and Reinforcement Learning-Based Active SLAM Method

投稿日: 2025年6月17日作成者: jarxiv

要約同時ローカリゼーションとマッピング（SLAM）のためのマルチセンサー融合の … 続きを読む →

カテゴリー: 93C85, cs.CV, cs.RO, I.4 | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Active InSAR monitoring of building damage in Gaza during the Israel-Hamas War

SyncTalk++: High-Fidelity and Efficient Synchronized Talking Heads Synthesis Using Gaussian Splatting

Cost-Aware Routing for Efficient Text-To-Image Generation

Scaling-Up the Pretraining of the Earth Observation Foundation Model PhilEO to the MajorTOM Dataset

ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM

CDP: Towards Robust Autoregressive Visuomotor Policy Learning via Causal Diffusion

Lecture Video Visual Objects (LVVO) Dataset: A Benchmark for Visual Object Detection in Educational Videos

VideoPDE: Unified Generative PDE Solving via Video Inpainting Diffusion Models

SuperPoint-SLAM3: Augmenting ORB-SLAM3 with Deep Features, Adaptive NMS, and Learning-Based Loop Closure

A Novel ViDAR Device With Visual Inertial Encoder Odometry and Reinforcement Learning-Based Active SLAM Method

最近の投稿

最近のコメント

アーカイブ

カテゴリー