Monthly Archives: April 2025

Real-time Free-view Human Rendering from Sparse-view RGB Videos using Double Unprojected Textures

Posted on April 15, 2025 by jarxiv

Abstract: Real-time free-view human rendering from sparse-view RGB inputs …

Categories: cs.CV

LMFormer: Lane based Motion Prediction Transformer

Posted on April 15, 2025 by jarxiv

Abstract: Motion prediction plays an important role in autonomous driving. In this work, …

Categories: cs.CV, cs.LG

DiffMOD: Progressive Diffusion Point Denoising for Moving Object Detection in Remote Sensing

Posted on April 15, 2025 by jarxiv

Abstract: Moving object detection (MOD) in remote sensing, with low resolution, extremely small …

Categories: 68T10, cs.CV, I.4.8

Distilling Textual Priors from LLM to Efficient Image Fusion

Posted on April 15, 2025 by jarxiv

Abstract: Multi-modality image fusion: from multiple source inputs, a single comprehensive …

Categories: cs.CV

Zero-shot Autonomous Microscopy for Scalable and Intelligent Characterization of 2D Materials

Posted on April 15, 2025 by jarxiv

Abstract: In the characterization of atomic-scale materials, traditionally, months to years of specialized …

Categories: cond-mat.mes-hall, cond-mat.mtrl-sci, cs.AI, cs.CV, cs.LG

Multi-Level Embedding and Alignment Network with Consistency and Invariance Learning for Cross-View Geo-Localization

Posted on April 15, 2025 by jarxiv

Abstract: For Cross-View Geo-Localization (CVGL), …

Categories: cs.CV

Noise2Ghost: Self-supervised deep convolutional reconstruction for ghost imaging

Posted on April 15, 2025 by jarxiv

Abstract: We, among unsupervised methods, for noisy acquisitions, unparalleled reconstruction …

Categories: cs.CV, cs.LG, physics.data-an

VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model

Posted on April 15, 2025 by jarxiv

Abstract: Recently, with DeepSeek R1, reinforcement learning (RL), through a simple and effective …

Categories: cs.CL, cs.CV

ESCT3D: Efficient and Selectively Controllable Text-Driven 3D Content Generation with Gaussian Splatting

Posted on April 15, 2025 by jarxiv

Abstract: In recent years, significant progress has been made in text-driven 3D content generation …

Categories: cs.CV

Analysis of Attention in Video Diffusion Transformers

Posted on April 15, 2025 by jarxiv

Abstract: We conduct a detailed analysis of attention in video diffusion transformers (VDiTs), with many new findings …

Categories: cs.CV
