"cs.CV" Category Archive

Enforcing View-Consistency in Class-Agnostic 3D Segmentation Fields

Posted on April 4, 2025 by jarxiv

Abstract: For modeling 3D scenes from multiple images, radiance fields are a … Continue reading →

Categories: cs.CV, cs.GR | Comments are closed

Towards Computation- and Communication-efficient Computational Pathology

Posted on April 4, 2025 by jarxiv

Abstract: Although current computational pathology models perform well across a wide range of applications, … Continue reading →

Categories: cs.CV, eess.IV | Comments are closed

Adaptive Frequency Enhancement Network for Remote Sensing Image Semantic Segmentation

Posted on April 4, 2025 by jarxiv

Abstract: Semantic segmentation of high-resolution remote sensing images, land-use monitor … Continue reading →

Categories: cs.CV, eess.IV | Comments are closed

ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation

Posted on April 4, 2025 by jarxiv

Abstract: With recent advances in multimodal large language models (MLLMs), video understanding … Continue reading →

Categories: cs.CV | Comments are closed

A GAN-Enhanced Deep Learning Framework for Rooftop Detection from Historical Aerial Imagery

Posted on April 4, 2025 by jarxiv

Abstract: Accurately detecting rooftops from historical aerial imagery, long-term urban development and human … Continue reading →

Categories: cs.CV | Comments are closed

Understanding Depth and Height Perception in Large Visual-Language Models

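Posted on April 4, 2025 by jarxiv

Abstract: Geometric understanding, including depth and height perception, is fundamental to intelligence, and navigating environments … Continue reading →

Categories: cs.CV | Comments are closed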

BECAME: BayEsian Continual Learning with Adaptive Model MErging

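Posted on April 4, 2025 by jarxiv

Abstract: Continual learning (CL), learning incrementally across tasks while mitigating catastrophic forgetting, … Continue reading →

Categories: cs.CV, cs.LG | Comments are closed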

TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization

Posted on April 4, 2025 by jarxiv

Abstract: Synthesizing diverse and physically plausible human-scene interactions (HSI) … Continue reading →

Categories: cs.CV | Comments are closed

Learning Phase Distortion with Selective State Space Models for Video Turbulence Mitigation

Posted on April 4, 2025 by jarxiv

Abstract: Atmospheric turbulence is a major cause of image degradation in long-range imaging systems. Deep … Continue reading →

Categories: cs.CV, eess.IV | Comments are closed

Retrieving Semantics from the Deep: an RAG Solution for Gesture Synthesis

Posted on April 4, 2025 by jarxiv

Abstract: Nonverbal communication, helping to convey the meaning of speech, semantically rich gest … Continue reading →

Categories: cs.CV | Comments are closed
