「cs.CV」カテゴリーアーカイブ

MCL: Multi-view Enhanced Contrastive Learning for Chest X-ray Report Generation

投稿日: 2024年11月18日作成者: jarxiv

要約放射線科レポートは、治療戦略を計画し、医師と患者のコミュニケーションを強化 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

A Low-Resolution Image is Worth 1×1 Words: Enabling Fine Image Super-Resolution with Transformers and TaylorShift

投稿日: 2024年11月18日作成者: jarxiv

要約トランスベースの超解像度 (SR) モデルは、最近画像再構成の品質を向上さ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.MM | コメントを受け付けていません

ColorEdit: Training-free Image-Guided Color editing with diffusion model

投稿日: 2024年11月18日作成者: jarxiv

要約 Text-to-image (T2I) 拡散モデルは、優れた生成機能を備え … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

ScribbleVS: Scribble-Supervised Medical Image Segmentation via Dynamic Competitive Pseudo Label Selection

投稿日: 2024年11月18日作成者: jarxiv

要約臨床医学では、正確な画像セグメンテーションは臨床医に実質的なサポートを提供 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

DCD: Discriminative and Consistent Representation Distillation

投稿日: 2024年11月18日作成者: jarxiv

要約知識蒸留 (KD) は、大規模な教師モデルから小規模な生徒モデルに知識を伝 … 続きを読む →

カテゴリー: 68T07, cs.AI, cs.CV, I.2 | コメントを受け付けていません

Morpho-Aware Global Attention for Image Matting

投稿日: 2024年11月18日作成者: jarxiv

要約ビジョントランスフォーマー (ViT) と畳み込みニューラルネットワー … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Visual-Linguistic Agent: Towards Collaborative Contextual Object Reasoning

投稿日: 2024年11月18日作成者: jarxiv

要約マルチモーダル大規模言語モデル (MLLM) は、画像内の記述タスクには優 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

The Unreasonable Effectiveness of Guidance for Diffusion Models

投稿日: 2024年11月18日作成者: jarxiv

要約ガイダンスは、拡散モデルによって生成された画像の知覚品質を向上させるために … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Partial Scene Text Retrieval

投稿日: 2024年11月18日作成者: jarxiv

要約部分シーンテキスト取得のタスクには、画像ギャラリーからの特定のクエリテ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

CLCE: An Approach to Refining Cross-Entropy and Contrastive Learning for Optimized Learning Fusion

投稿日: 2024年11月18日作成者: jarxiv

要約最先端の事前トレーニング済み画像モデルは、主に 2 段階のアプローチを採用 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

MCL: Multi-view Enhanced Contrastive Learning for Chest X-ray Report Generation

A Low-Resolution Image is Worth 1×1 Words: Enabling Fine Image Super-Resolution with Transformers and TaylorShift

ColorEdit: Training-free Image-Guided Color editing with diffusion model

ScribbleVS: Scribble-Supervised Medical Image Segmentation via Dynamic Competitive Pseudo Label Selection

DCD: Discriminative and Consistent Representation Distillation

Morpho-Aware Global Attention for Image Matting

Visual-Linguistic Agent: Towards Collaborative Contextual Object Reasoning

The Unreasonable Effectiveness of Guidance for Diffusion Models

Partial Scene Text Retrieval

CLCE: An Approach to Refining Cross-Entropy and Contrastive Learning for Optimized Learning Fusion

最近の投稿

最近のコメント

アーカイブ

カテゴリー