「cs.CV」カテゴリーアーカイブ

On the Importance of Text Preprocessing for Multimodal Representation Learning and Pathology Report Generation

投稿日: 2025年2月28日作成者: jarxiv

要約病理学のビジョン言語モデルにより、マルチモーダルケースの検索と自動レポート … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

GHOST 2.0: generative high-fidelity one shot transfer of heads

投稿日: 2025年2月28日作成者: jarxiv

要約フェイススワッピングのタスクは最近、研究コミュニティで注目を集めていますが … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Pathology Report Generation and Multimodal Representation Learning for Cutaneous Melanocytic Lesions

投稿日: 2025年2月28日作成者: jarxiv

要約数百万のメラニン細胞皮膚病変が毎年病理学者によって検査されていますが、その … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

RetinaRegen: A Hybrid Model for Readability and Detail Restoration in Fundus Images

投稿日: 2025年2月28日作成者: jarxiv

要約眼底の画質は眼疾患を診断するために重要ですが、実際の状態はしばしばぼやけた … 続きを読む →

カテゴリー: cs.CV, cs.LG, eess.IV | コメントを受け付けていません

Autonomous Vision-Guided Resection of Central Airway Obstruction

投稿日: 2025年2月27日作成者: jarxiv

要約既存の気管腫瘍切除法は、効果的な気道クリアランスに必要な精度が欠けているこ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

QueryAdapter: Rapid Adaptation of Vision-Language Models in Response to Natural Language Queries

投稿日: 2025年2月27日作成者: jarxiv

要約視覚言語モデル（VLM）のトレーニングに使用される大規模なインターネットデ … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

MaskPlanner: Learning-Based Object-Centric Motion Generation from 3D Point Clouds

投稿日: 2025年2月27日作成者: jarxiv

要約オブジェクト中心のモーション生成（OCMG）は、ロボットスプレー塗装や溶接 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Attention-Guided Integration of CLIP and SAM for Precise Object Masking in Robotic Manipulation

投稿日: 2025年2月27日作成者: jarxiv

要約このペーパーでは、コンビニエンスストアのマスキング製品の特定のドメイン内で … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

Ground-level Viewpoint Vision-and-Language Navigation in Continuous Environments

投稿日: 2025年2月27日作成者: jarxiv

要約 Vision-and-Language Navigation（VLN）は、 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.RO | コメントを受け付けていません

SteeredMarigold: Steering Diffusion Towards Depth Completion of Largely Incomplete Depth Maps

投稿日: 2025年2月27日作成者: jarxiv

要約実際の環境で展開されたRGB-Dセンサーによってキャプチャされた深度マップ … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

On the Importance of Text Preprocessing for Multimodal Representation Learning and Pathology Report Generation

GHOST 2.0: generative high-fidelity one shot transfer of heads

Pathology Report Generation and Multimodal Representation Learning for Cutaneous Melanocytic Lesions

RetinaRegen: A Hybrid Model for Readability and Detail Restoration in Fundus Images

Autonomous Vision-Guided Resection of Central Airway Obstruction

QueryAdapter: Rapid Adaptation of Vision-Language Models in Response to Natural Language Queries

MaskPlanner: Learning-Based Object-Centric Motion Generation from 3D Point Clouds

Attention-Guided Integration of CLIP and SAM for Precise Object Masking in Robotic Manipulation

Ground-level Viewpoint Vision-and-Language Navigation in Continuous Environments

SteeredMarigold: Steering Diffusion Towards Depth Completion of Largely Incomplete Depth Maps

最近の投稿

最近のコメント

アーカイブ

カテゴリー