Monthly Archives: January 2025

Learning Point Spread Function Invertibility Assessment for Image Deconvolution

Posted on January 28, 2025 by jarxiv

Abstract: Deep-learning (DL)-based image deconvolution (ID) …

Categories: 68T10, 94A08, cs.CV, eess.IV, I.4.5

VCRScore: Image captioning metric based on V&L Transformers, CLIP, and precision-recall

Posted on January 28, 2025 by jarxiv

Abstract: Image captioning has become an essential vision and language research task. …

Categories: 68Txx, cs.CL, cs.CV, I.4

BAG: Body-Aligned 3D Wearable Asset Generation

Posted on January 28, 2025 by jarxiv

Abstract: Although recent advances have shown remarkable progress in general 3D shape generation models, …

Categories: cs.AI, cs.CV, cs.GR

Text-driven Adaptation of Foundation Models for Few-shot Surgical Workflow Analysis

Posted on January 28, 2025 by jarxiv

Abstract: Purpose: Surgical workflow analysis is important for improving surgical efficiency and safety …

Categories: cs.AI, cs.CV

The Linear Attention Resurrection in Vision Transformer

Posted on January 28, 2025 by jarxiv

Abstract: Recently, Vision Transformers (ViTs) …

Categories: cs.AI, cs.CV

MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning

Posted on January 28, 2025 by jarxiv

Abstract: Image captioning, at the intersection of computer vision and natural language processing, …

Categories: cs.AI, cs.CV

UDBE: Unsupervised Diffusion-based Brightness Enhancement in Underwater Images

Posted on January 28, 2025 by jarxiv

Abstract: Activities in underwater environments are of paramount importance in several scenarios, and underwater images …

Categories: cs.AI, cs.CV, eess.IV

Automatic Calibration of a Multi-Camera System with Limited Overlapping Fields of View for 3D Surgical Scene Reconstruction

Posted on January 28, 2025 by jarxiv

Abstract: Purpose: The aim of this study is … used in 3D surgical scene reconstruction (3D-SSR) …

Categories: cs.CV

SPECIAL: Zero-shot Hyperspectral Image Classification With CLIP

Posted on January 28, 2025 by jarxiv

Abstract: Hyperspectral image (HSI) classification … each pixel of an HSI …

Categories: cs.CV

PDC-ViT : Source Camera Identification using Pixel Difference Convolution and Vision Transformer

Posted on January 28, 2025 by jarxiv

Abstract: Source camera identification … including critical cases such as terrorism, violence, and other criminal activities …

Categories: cs.CV
