「cs.CV」カテゴリーアーカイブ

Scene Understanding Enabled Semantic Communication with Open Channel Coding

投稿日: 2025年1月27日作成者: jarxiv

要約通信システムがシンボル送信から意味のある情報を伝えることに移行するにつれて … 続きを読む →

カテゴリー: cs.CV, eess.SP | コメントを受け付けていません

Training-Free Style and Content Transfer by Leveraging U-Net Skip Connections in Stable Diffusion 2.*

投稿日: 2025年1月27日作成者: jarxiv

要約拡散モデルを使用した画像生成は最近大きく進歩しましたが、その内部の潜在表現 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

CheapNVS: Real-Time On-Device Narrow-Baseline Novel View Synthesis

投稿日: 2025年1月27日作成者: jarxiv

要約シングルビューの新規ビュー合成 (NVS) は、その不適切な性質により悪名 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Trick-GS: A Balanced Bag of Tricks for Efficient Gaussian Splatting

投稿日: 2025年1月27日作成者: jarxiv

要約 3D再建のためのガウススプラッティング（GS）は、迅速なトレーニング、推論 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Rethinking Encoder-Decoder Flow Through Shared Structures

投稿日: 2025年1月27日作成者: jarxiv

要約密な予測タスクは、エンコーダーアーキテクチャの複雑さを増しているため、デコ … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Leveraging ChatGPT’s Multimodal Vision Capabilities to Rank Satellite Images by Poverty Level: Advancing Tools for Social Science Research

投稿日: 2025年1月27日作成者: jarxiv

要約この論文では、村レベルの貧困予測のために衛星画像を分析するための視覚機能を … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Large-scale and Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding

投稿日: 2025年1月27日作成者: jarxiv

要約人工知能 (AI) は、放射線科医を支援して医療画像の読影と診断の効率と精 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Bridging the Visual Gap: Fine-Tuning Multimodal Models with Knowledge-Adapted Captions

投稿日: 2025年1月27日作成者: jarxiv

要約最近の研究では、長く詳細な画像キャプションを使用したビジョン言語モデル ( … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.LG | コメントを受け付けていません

An Interpretable X-ray Style Transfer via Trainable Local Laplacian Filter

投稿日: 2025年1月27日作成者: jarxiv

要約放射線科医は、診断パフォーマンスをサポートするためにニーズに手動で調整され … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Visual Localization via Semantic Structures in Autonomous Photovoltaic Power Plant Inspection

投稿日: 2025年1月27日作成者: jarxiv

要約太陽光発電（PV）発電所のメンテナンスでは、サーマルカメラを搭載した無人航 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Scene Understanding Enabled Semantic Communication with Open Channel Coding

Training-Free Style and Content Transfer by Leveraging U-Net Skip Connections in Stable Diffusion 2.*

CheapNVS: Real-Time On-Device Narrow-Baseline Novel View Synthesis

Trick-GS: A Balanced Bag of Tricks for Efficient Gaussian Splatting

Rethinking Encoder-Decoder Flow Through Shared Structures

Leveraging ChatGPT’s Multimodal Vision Capabilities to Rank Satellite Images by Poverty Level: Advancing Tools for Social Science Research

Large-scale and Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding

Bridging the Visual Gap: Fine-Tuning Multimodal Models with Knowledge-Adapted Captions

An Interpretable X-ray Style Transfer via Trainable Local Laplacian Filter

Visual Localization via Semantic Structures in Autonomous Photovoltaic Power Plant Inspection

最近の投稿

最近のコメント

アーカイブ

カテゴリー