「cs.CV」カテゴリーアーカイブ

A Narrative Review of Image Processing Techniques Related to Prostate Ultrasound

投稿日: 2024年10月8日作成者: jarxiv

要約前立腺がん（PCa）は男性の健康に重大な脅威をもたらしており、予後の改善と … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

Beyond FVD: Enhanced Evaluation Metrics for Video Generation Quality

投稿日: 2024年10月8日作成者: jarxiv

要約 Fr\’echet Video Distance (FVD) … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Finding Visual Task Vectors

投稿日: 2024年10月8日作成者: jarxiv

要約視覚的なプロンプトは、追加のトレーニングを行わずに、コンテキスト内の例を通 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Organizing Unstructured Image Collections using Natural Language

投稿日: 2024年10月8日作成者: jarxiv

要約非構造化ビジュアルデータをセマンティッククラスターに編成することは、コ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

The Dawn of Video Generation: Preliminary Explorations with SORA-like Models

投稿日: 2024年10月8日作成者: jarxiv

要約テキストからビデオ (T2V)、画像からビデオ (I2V)、およびビデオか … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

3D-free meets 3D priors: Novel View Synthesis from a Single Image with Pretrained Diffusion Guidance

投稿日: 2024年10月8日作成者: jarxiv

要約最近の 3D ノベルビュー合成 (NVS) 手法は、単一オブジェクト中心 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Generative Parameter-Efficient Fine-Tuning

投稿日: 2024年10月8日作成者: jarxiv

要約事前トレーニングされた Transformer バックボーンをダウンストリ … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

SimO Loss: Anchor-Free Contrastive Loss for Fine-Grained Supervised Contrastive Learning

投稿日: 2024年10月8日作成者: jarxiv

要約私たちが提案する類似性直交性 (SimO) 損失を活用した、新しいアンカー … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

DiffuseReg: Denoising Diffusion Model for Obtaining Deformation Fields in Unsupervised Deformable Image Registration

投稿日: 2024年10月8日作成者: jarxiv

要約変形可能な画像位置合わせは、さまざまなモダリティまたは時間からの医療画像を … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

LoTLIP: Improving Language-Image Pre-training for Long Text Understanding

投稿日: 2024年10月8日作成者: jarxiv

要約長いテキストを理解することは実際には大きな要求ですが、ほとんどの言語画像事 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

A Narrative Review of Image Processing Techniques Related to Prostate Ultrasound

Beyond FVD: Enhanced Evaluation Metrics for Video Generation Quality

Finding Visual Task Vectors

Organizing Unstructured Image Collections using Natural Language

The Dawn of Video Generation: Preliminary Explorations with SORA-like Models

3D-free meets 3D priors: Novel View Synthesis from a Single Image with Pretrained Diffusion Guidance

Generative Parameter-Efficient Fine-Tuning

SimO Loss: Anchor-Free Contrastive Loss for Fine-Grained Supervised Contrastive Learning

DiffuseReg: Denoising Diffusion Model for Obtaining Deformation Fields in Unsupervised Deformable Image Registration

LoTLIP: Improving Language-Image Pre-training for Long Text Understanding

最近の投稿

最近のコメント

アーカイブ

カテゴリー