Monthly Archives: June 2024

Stochastic Conditional Diffusion Models for Robust Semantic Image Synthesis

Posted: June 4, 2024 | Author: jarxiv

Abstract: Semantic image synthesis (SIS) generates realistic images corresponding to semantic maps (labels) …

Category: cs.CV

Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization

Posted: June 4, 2024 | Author: jarxiv

Abstract: With recent advances in multimodal large language models (LLMs), image-text …

Categories: cs.CL, cs.CV

Scalable Wasserstein Gradient Flow for Generative Modeling through Unbalanced Optimal Transport

Posted: June 4, 2024 | Author: jarxiv

Abstract: Wasserstein Gradient Flow (WGF), Wass …

Categories: cs.CV, cs.LG

Portrait4D: Learning One-Shot 4D Head Avatar Synthesis using Synthetic Data

Posted: June 4, 2024 | Author: jarxiv

Abstract: Existing one-shot 4D head synthesis methods, usually with the help of 3DMM reconstruction, from monocular video …

Category: cs.CV

PLUG: Revisiting Amodal Segmentation with Foundation Model and Hierarchical Focus

Posted: June 4, 2024 | Author: jarxiv

Abstract: Aiming to predict the complete shape of partially occluded objects, amodal seg …

Category: cs.CV

Rethinking Efficient and Effective Point-based Networks for Event Camera Classification and Regression: EventMamba

Posted: June 4, 2024 | Author: jarxiv

Abstract: Inspired by biological systems, event cameras consume little power while, in their surroundings …

Category: cs.CV

Automatic Cranial Defect Reconstruction with Self-Supervised Deep Deformable Masked Autoencoders

Posted: June 4, 2024 | Author: jarxiv

Abstract: Every year, thousands of people suffer cranial injuries. For these people, reconstructive surgery …

Categories: cs.CV, eess.IV

DeCoF: Generated Video Detection via Frame Consistency: The First Benchmark Dataset

Posted: June 4, 2024 | Author: jarxiv

Abstract: As the quality of videos produced by advanced video generation methods improves, new secu …

Categories: cs.AI, cs.CV

Efficient Masked Autoencoders with Self-Consistency

Posted: June 4, 2024 | Author: jarxiv

Abstract: Inspired by masked language modeling (MLM) in natural language processing tasks, ma …

Category: cs.CV

DP-IQA: Utilizing Diffusion Prior for Blind Image Quality Assessment in the Wild

Posted: June 4, 2024 | Author: jarxiv

Abstract: Image quality assessment (IQA), across a range of applications, selects high-quality images …

Categories: cs.AI, cs.CV
