Monthly Archives: April 2024

OpenStreetView-5M: The Many Roads to Global Visual Geolocation

Posted on: April 30, 2024 | Author: jarxiv

Summary: Determining the location of an image taken anywhere on Earth is a visually complex task …

Categories: cs.AI, cs.CV

A Multilevel Strategy to Improve People Tracking in a Real-World Scenario

Posted on: April 30, 2024 | Author: jarxiv

Summary: On January 8, 2023, the Palácio do Planalto, Brazil's presidential palace, …

Categories: cs.CV

Adaptive Input-image Normalization for Solving the Mode Collapse Problem in GAN-based X-ray Images

Posted on: April 30, 2024 | Author: jarxiv

Summary: Biomedical image datasets can be imbalanced because of the rarity of the targeted diseases …

Categories: cs.CV, cs.LG, eess.IV

Hide and Seek: How Does Watermarking Impact Face Recognition?

Posted on: April 30, 2024 | Author: jarxiv

Summary: Recent advances in generative models have revolutionized the synthesis of highly realistic images, such as face images …

Categories: cs.CV

IPixMatch: Boost Semi-supervised Semantic Segmentation with Inter-Pixel Relation

Posted on: April 30, 2024 | Author: jarxiv

Summary: For the effectiveness of deep learning, the scarcity of labeled data in real-world scenarios is a critical …

Categories: cs.AI, cs.CV, cs.LG

RSCaMa: Remote Sensing Image Change Captioning with State Space Model

Posted on: April 30, 2024 | Author: jarxiv

Summary: In Remote Sensing Image Change Captioning (RSICC), multi-temporal remote …

Categories: cs.CV

Amodal Ground Truth and Completion in the Wild

Posted on: April 30, 2024 | Author: jarxiv

Summary: This paper addresses amodal image segmentation, that is, the visible and invisible (occ …

Categories: cs.CV

Benchmarking the CoW with the TopCoW Challenge: Topology-Aware Anatomical Segmentation of the Circle of Willis for CTA and MRA

Posted on: April 30, 2024 | Author: jarxiv

Summary: The Circle of Willis (CoW) is an important arterial network connecting the major circulations of the brain …

Categories: cs.CV, cs.LG, q-bio.QM, q-bio.TO

Make-it-Real: Unleashing Large Multimodal Model’s Ability for Painting 3D Objects with Realistic Materials

Posted on: April 30, 2024 | Author: jarxiv

Summary: Physically realistic materials, across a wide range of applications and lighting conditions, …

Categories: cs.AI, cs.CL, cs.CV

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

Posted on: April 30, 2024 | Author: jarxiv

Summary: A diverse, large-scale multimodal, multiview video dataset and benchma …

Categories: cs.AI, cs.CV
