月別アーカイブ: 2025年4月

WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model

投稿日: 2025年4月14日作成者: jarxiv

要約 Video Variation Autoencoder（VAE）はビデオを … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Open-CD: A Comprehensive Toolbox for Change Detection

投稿日: 2025年4月14日作成者: jarxiv

要約 Open-CDを提示します。これは、関連するコンポーネントとモジュールと同 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Multi-head Ensemble of Smoothed Classifiers for Certified Robustness

投稿日: 2025年4月14日作成者: jarxiv

要約ランダム化スムージング（RS）は、認定された堅牢性のための有望な手法であり … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Embodied Image Captioning: Self-supervised Learning Agents for Spatially Coherent Image Descriptions

投稿日: 2025年4月14日作成者: jarxiv

要約一般的な環境を積極的に探索しながら、任意のオブジェクトを説明する際のエージ … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Datasets for Lane Detection in Autonomous Driving: A Comprehensive Review

投稿日: 2025年4月14日作成者: jarxiv

要約自動化された運転には正確な車線検出が不可欠であり、さまざまな道路シナリオで … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Digital Twin Catalog: A Large-Scale Photorealistic 3D Object Digital Twin Dataset

投稿日: 2025年4月14日作成者: jarxiv

要約デジタルツインカタログ（DTC）を紹介します。これは、新しい大規模なフォト … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.GR, cs.RO | コメントを受け付けていません

Discriminator-Free Direct Preference Optimization for Video Diffusion

投稿日: 2025年4月14日作成者: jarxiv

要約直接選好最適化（DPO）は、WIN/LOSITデータペアを通じてモデルを人 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Standing on the Shoulders of Giants: Reprogramming Visual-Language Model for General Deepfake Detection

投稿日: 2025年4月14日作成者: jarxiv

要約ディープフェイクの顔の急増は、私たちの日常生活に大きな潜在的な悪影響をもた … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

COP-GEN-Beta: Unified Generative Modelling of COPernicus Imagery Thumbnails

投稿日: 2025年4月14日作成者: jarxiv

要約リモートセンシングでは、同じシーンをキャプチャするさまざまなセンサーのマル … 続きを読む →

カテゴリー: cs.CV, cs.GR | コメントを受け付けていません

Proxy-Anchor and EVT-Driven Continual Learning Method for Generalized Category Discovery

投稿日: 2025年4月14日作成者: jarxiv

要約継続的な一般化されたカテゴリの発見が、以前に学んだカテゴリの壊滅的な忘却を … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

月別アーカイブ: 2025年4月

WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model

Open-CD: A Comprehensive Toolbox for Change Detection

Multi-head Ensemble of Smoothed Classifiers for Certified Robustness

Embodied Image Captioning: Self-supervised Learning Agents for Spatially Coherent Image Descriptions

Datasets for Lane Detection in Autonomous Driving: A Comprehensive Review

Digital Twin Catalog: A Large-Scale Photorealistic 3D Object Digital Twin Dataset

Discriminator-Free Direct Preference Optimization for Video Diffusion

Standing on the Shoulders of Giants: Reprogramming Visual-Language Model for General Deepfake Detection

COP-GEN-Beta: Unified Generative Modelling of COPernicus Imagery Thumbnails

Proxy-Anchor and EVT-Driven Continual Learning Method for Generalized Category Discovery

最近の投稿

最近のコメント

アーカイブ

カテゴリー