月別アーカイブ: 2023年7月

UGCANet: A Unified Global Context-Aware Transformer-based Network with Feature Alignment for Endoscopic Image Analysis

投稿日: 2023年7月13日作成者: jarxiv

要約消化管内視鏡検査は、カメラやその他の器具を備えた柔軟なチューブを使用して消 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Exposing the Fake: Effective Diffusion-Generated Images Detection

投稿日: 2023年7月13日作成者: jarxiv

要約画像合成は、ノイズ除去拡散確率モデル (DDPM) やテキストから画像への … 続きを読む →

カテゴリー: cs.CR, cs.CV, cs.LG | コメントを受け付けていません

Stochastic Light Field Holography

投稿日: 2023年7月13日作成者: jarxiv

要約ビジュアルチューリングテストは、ホログラフィックディスプレイのリアリ … 続きを読む →

カテゴリー: cs.CV, cs.GR, eess.IV, physics.optics | コメントを受け付けていません

MMBench: Is Your Multi-modal Model an All-around Player?

投稿日: 2023年7月13日作成者: jarxiv

要約大型の視覚言語モデルは最近目覚ましい進歩を遂げ、視覚情報に関する優れた認識 … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Improved Real-time Image Smoothing with Weak Structures Preserved and High-contrast Details Removed

投稿日: 2023年7月13日作成者: jarxiv

要約画像のスムージングは、ピクセル単位の勾配を減らして細部を滑らかにするこ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution

投稿日: 2023年7月13日作成者: jarxiv

要約コンピュータビジョンモデルで処理する前に画像のサイズを固定解像度に変更する … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Facial Reenactment Through a Personalized Generator

投稿日: 2023年7月13日作成者: jarxiv

要約近年、顔の再現における画像生成モデルの役割は着実に増加しています。このよ … 続きを読む →

カテゴリー: cs.CV, cs.GR, cs.LG | コメントを受け付けていません

Correlation-Aware Mutual Learning for Semi-supervised Medical Image Segmentation

投稿日: 2023年7月13日作成者: jarxiv

要約半教師あり学習は、大量のラベルなしデータを活用して追加情報を抽出できるため … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Deep Learning of Crystalline Defects from TEM images: A Solution for the Problem of ‘Never Enough Training Data’

投稿日: 2023年7月13日作成者: jarxiv

要約線状転位などの結晶欠陥は、多くの金属デバイスの性能と信頼性にとって重要な役 … 続きを読む →

カテゴリー: cond-mat.mtrl-sci, cs.CV | コメントを受け付けていません

Synthesizing Artistic Cinemagraphs from Text

投稿日: 2023年7月13日作成者: jarxiv

要約 Text2Cinemagraph は、テキストの説明からシネマグラフを作成 … 続きを読む →

カテゴリー: cs.CV, cs.GR, cs.LG | コメントを受け付けていません

月別アーカイブ: 2023年7月

UGCANet: A Unified Global Context-Aware Transformer-based Network with Feature Alignment for Endoscopic Image Analysis

Exposing the Fake: Effective Diffusion-Generated Images Detection

Stochastic Light Field Holography

MMBench: Is Your Multi-modal Model an All-around Player?

Improved Real-time Image Smoothing with Weak Structures Preserved and High-contrast Details Removed

Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution

Facial Reenactment Through a Personalized Generator

Correlation-Aware Mutual Learning for Semi-supervised Medical Image Segmentation

Deep Learning of Crystalline Defects from TEM images: A Solution for the Problem of ‘Never Enough Training Data’

Synthesizing Artistic Cinemagraphs from Text

最近の投稿

最近のコメント

アーカイブ

カテゴリー