Monthly Archives: January 2024

Energy-Calibrated VAE with Test Time Free Lunch

Posted: January 17, 2024 | Author: jarxiv

Summary: In this paper, to enhance variational autoencoders (VAEs), conditional …

Categories: cs.CV

Learning Explicit Contact for Implicit Reconstruction of Hand-held Objects from Monocular Images

Posted: January 17, 2024 | Author: jarxiv

Summary: Reconstructing hand-held objects from monocular RGB images is appealing …

Categories: cs.CV, cs.GR

End-to-End Optimized Image Compression with the Frequency-Oriented Transform

Posted: January 17, 2024 | Author: jarxiv

Summary: Image compression has become an important challenge in the era of information explosion. Deep learning methods …

Categories: cs.AI, cs.CV, cs.MM

ZeroShape: Regression-based Zero-shot Shape Reconstruction

Posted: January 17, 2024 | Author: jarxiv

Summary: We study the problem of single-image zero-shot 3D shape reconstruction. Recent work …

Categories: cs.CV

Adaptive Confidence Multi-View Hashing for Multimedia Retrieval

Posted: January 17, 2024 | Author: jarxiv

Summary: Multi-view hashing methods convert heterogeneous data from multiple views into binary hash …

Categories: cs.CV

FUSC: Fetal Ultrasound Semantic Clustering of Second Trimester Scans Using Deep Self-supervised Learning

Posted: January 17, 2024 | Author: jarxiv

Summary: Ultrasound is the primary imaging modality in clinical practice during pregnancy. Each year, 14… million …

Categories: cs.CV

Transcending the Limit of Local Window: Advanced Super-Resolution Transformer with Adaptive Token Dictionary

Posted: January 17, 2024 | Author: jarxiv

Summary: Single-image super-resolution reconstructs a high-resolution (HR) image from a low-resolution (LR) image …

Categories: cs.CV

ModelNet-O: A Large-Scale Synthetic Dataset for Occlusion-Aware Point Cloud Classification

Posted: January 17, 2024 | Author: jarxiv

Summary: Recently, 3D point cloud classification has made significant progress with the help of many datasets. …

Categories: cs.CV

Human vs. LMMs: Exploring the Discrepancy in Emoji Interpretation and Usage in Digital Communication

Posted: January 17, 2024 | Author: jarxiv

Summary: Leveraging large multimodal models (LMMs), especially social media …

Categories: cs.CV

DomainStudio: Fine-Tuning Diffusion Models for Domain-Driven Image Generation using Limited Data

Posted: January 17, 2024 | Author: jarxiv

Summary: Denoising diffusion probabilistic models (DDPMs), when trained on large amounts of data, …

Categories: cs.CV

