"cs.CV" Category Archive

WEM-GAN: Wavelet transform based facial expression manipulation

Posted: December 4, 2024 | Author: jarxiv

Abstract: Facial expression manipulation aims to change human facial expressions without affecting face recognition …

Categories: cs.AI, cs.CV, eess.IV

Multimodal Remote Sensing Scene Classification Using VLMs and Dual-Cross Attention Networks

Posted: December 4, 2024 | Author: jarxiv

Abstract: Remote sensing scene classification (RSSC), across various … in land use and resource management …

Categories: cs.CV

LiDAR-based Registration against Georeferenced Models for Globally Consistent Allocentric Maps

Posted: December 4, 2024 | Author: jarxiv

Abstract: In search and rescue (SAR) missions, modern unmanned aerial vehicles (UAVs) … personnel …

Categories: cs.CV, cs.RO

Grid-augmented vision: A simple yet effective approach for enhanced spatial understanding in multi-modal agents

Posted: December 4, 2024 | Author: jarxiv

Abstract: In object recognition and scene understanding, recent advances in multimodal models have shown impressive …

Categories: cs.CV

Comparative Analysis of Resource-Efficient CNN Architectures for Brain Tumor Classification

Posted: December 4, 2024 | Author: jarxiv

Abstract: For timely diagnosis and treatment planning, accurate classification of brain tumors in MRI images …

Categories: 92C55, cs.CV, eess.IV, I.2.10, I.4.8

Tomographic SAR Reconstruction for Forest Height Estimation

Posted: December 4, 2024 | Author: jarxiv

Abstract: In ecological and forestry applications, tree height estimation is an important … of biomass estimation …

Categories: cs.CV

Unveiling Concept Attribution in Diffusion Models

Posted: December 4, 2024 | Author: jarxiv

Abstract: Diffusion models have a remarkable ability to generate realistic, high-quality images from text prompts …

Categories: cs.CV, cs.LG

ShadowHack: Hacking Shadows via Luminance-Color Divide and Conquer

Posted: December 4, 2024 | Author: jarxiv

Abstract: Shadows introduce challenges to images such as reduced luminance, texture degradation, and color distortion, …

Categories: cs.CV

SJTU:Spatial judgments in multimodal models towards unified segmentation through coordinate detection

Posted: December 4, 2024 | Author: jarxiv

Abstract: Despite advances in vision-language understanding, for multimodal architectures, image segmentation …

Categories: cs.CV

Segmentation of Coronary Artery Stenosis in X-ray Angiography using Mamba Models

Posted: December 4, 2024 | Author: jarxiv

Abstract: Coronary artery disease is one of the leading causes of mortality worldwide. From X-ray images, coronary artery stenosis …

Categories: cs.AI, cs.CV, eess.IV
