月別アーカイブ: 2025年2月

Action-based image editing guided by human instructions

投稿日: 2025年2月5日作成者: jarxiv

要約テキストベースの画像編集は、通常、人間の指示に基づいて入力画像の要素を挿入 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Field Matching: an Electrostatic Paradigm to Generate and Transfer Data

投稿日: 2025年2月5日作成者: jarxiv

要約我々は、静電場マッチング（EFM）を提案する。これは、生成モデリングと分配 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

MaintaAvatar: A Maintainable Avatar Based on Neural Radiance Fields by Continual Learning

投稿日: 2025年2月5日作成者: jarxiv

要約バーチャルデジタルアバターの生成は、コンピュータビジョンの分野において極め … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Covariances for Free: Exploiting Mean Distributions for Federated Learning with Pre-Trained Models

投稿日: 2025年2月5日作成者: jarxiv

要約事前に訓練されたモデルを使用することで、データの不均一性の影響を軽減し、連 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Video Latent Flow Matching: Optimal Polynomial Projections for Video Interpolation and Extrapolation

投稿日: 2025年2月5日作成者: jarxiv

要約本論文では、Video Latent Flow Matching (VLF … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

ControlFace: Harnessing Facial Parametric Control for Face Rigging

投稿日: 2025年2月5日作成者: jarxiv

要約ポーズ、表情、照明などの特定の制御を満たすための顔画像の操作は、顔のリギン … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

LV-XAttn: Distributed Cross-Attention for Long Visual Inputs in Multimodal Large Language Models

投稿日: 2025年2月5日作成者: jarxiv

要約クロスアテンションは、視覚情報を言語バックボーンに統合するために、マルチモ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.DC, cs.LG | コメントを受け付けていません

Extending SEEDS to a Supervoxel Algorithm for Medical Image Analysis

投稿日: 2025年2月5日作成者: jarxiv

要約この研究では、SEEDSスーパーピクセルアルゴリズムを2D画像から3Dボリ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

GAN-Based Architecture for Low-dose Computed Tomography Imaging Denoising

投稿日: 2025年2月5日作成者: jarxiv

要約 Generative Adversarial Networks（GAN）は … 続きを読む →

カテゴリー: cs.CV, cs.LG, eess.IV | コメントを受け付けていません

DCBM: Data-Efficient Visual Concept Bottleneck Models

投稿日: 2025年2月5日作成者: jarxiv

要約概念ボトルネックモデル（CBM）は、人間が理解可能な概念に基づいて予測を行 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

月別アーカイブ: 2025年2月

Action-based image editing guided by human instructions

Field Matching: an Electrostatic Paradigm to Generate and Transfer Data

MaintaAvatar: A Maintainable Avatar Based on Neural Radiance Fields by Continual Learning

Covariances for Free: Exploiting Mean Distributions for Federated Learning with Pre-Trained Models

Video Latent Flow Matching: Optimal Polynomial Projections for Video Interpolation and Extrapolation

ControlFace: Harnessing Facial Parametric Control for Face Rigging

LV-XAttn: Distributed Cross-Attention for Long Visual Inputs in Multimodal Large Language Models

Extending SEEDS to a Supervoxel Algorithm for Medical Image Analysis

GAN-Based Architecture for Low-dose Computed Tomography Imaging Denoising

DCBM: Data-Efficient Visual Concept Bottleneck Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー