月別アーカイブ: 2025年4月

ActiveGS: Active Scene Reconstruction Using Gaussian Splatting

投稿日: 2025年4月9日作成者: jarxiv

要約ロボットアプリケーションは、多くの場合、シーンの再構成に依存して、ダウンス … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

投稿日: 2025年4月9日作成者: jarxiv

要約大規模なマルチモダリティモデル（LMM）は、視覚的理解と生成に大きな進歩を … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

HRMedSeg: Unlocking High-resolution Medical Image segmentation via Memory-efficient Attention Modeling

投稿日: 2025年4月9日作成者: jarxiv

要約高解像度のセグメンテーションは、医療画像からマイクロイメージング情報を抽出 … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

HiMoR: Monocular Deformable Gaussian Reconstruction with Hierarchical Motion Representation

投稿日: 2025年4月9日作成者: jarxiv

要約高品質の単眼のダイナミック3D再構成を達成できる3Dガウスプリミティブの新 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation

投稿日: 2025年4月9日作成者: jarxiv

要約パラメーター効率の高い微調整（PEFT）は、固有の機能を維持および解き放ち … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Retrieval-Based Interleaved Visual Chain-of-Thought in Real-World Driving Scenarios

投稿日: 2025年4月9日作成者: jarxiv

要約大規模な言語モデルの推論を促すチェーン（COT）は、テキストの手がかりと記 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Privacy Attacks on Image AutoRegressive Models

投稿日: 2025年4月9日作成者: jarxiv

要約画像の自己回帰生成は、画像の自己回帰モデル（IAR）が画像品質（FID：1 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance

投稿日: 2025年4月9日作成者: jarxiv

要約テキストからイメージ（T2I）拡散/フローモデルは、柔軟な視覚的な創造物を … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Monitoring Viewer Attention During Online Ads

投稿日: 2025年4月9日作成者: jarxiv

要約今日、ビデオ広告は多数のオンラインプラットフォームに広がり、世界中の何百万 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Transfer between Modalities with MetaQueries

投稿日: 2025年4月9日作成者: jarxiv

要約統一されたマルチモーダルモデルは、理解（テキスト出力）と生成（ピクセル出力 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

月別アーカイブ: 2025年4月

ActiveGS: Active Scene Reconstruction Using Gaussian Splatting

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

HRMedSeg: Unlocking High-resolution Medical Image segmentation via Memory-efficient Attention Modeling

HiMoR: Monocular Deformable Gaussian Reconstruction with Hierarchical Motion Representation

Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation

Retrieval-Based Interleaved Visual Chain-of-Thought in Real-World Driving Scenarios

Privacy Attacks on Image AutoRegressive Models

HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance

Monitoring Viewer Attention During Online Ads

Transfer between Modalities with MetaQueries

最近の投稿

最近のコメント

アーカイブ

カテゴリー