「cs.CV」カテゴリーアーカイブ

UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation

投稿日: 2024年11月14日作成者: jarxiv

要約このペーパーでは、Transformer アーキテクチャを使用した統合増分 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

V-LoL: A Diagnostic Dataset for Visual Logical Learning

投稿日: 2024年11月14日作成者: jarxiv

要約ビジュアル AI の最近の開発は成功を収めていますが、さまざまな欠点が依然 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

NavAgent: Multi-scale Urban Street View Fusion For UAV Embodied Vision-and-Language Navigation

投稿日: 2024年11月14日作成者: jarxiv

要約視覚と言語のナビゲーション (VLN) は、身体化されたインテリジェンスの … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Slender Object Scene Segmentation in Remote Sensing Image Based on Learnable Morphological Skeleton with Segment Anything Model

投稿日: 2024年11月14日作成者: jarxiv

要約形態学的手法は、小さな構造の詳細を捕捉して保存する能力があるため、リモート … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Generalized Pose Space Embeddings for Training In-the-Wild using Anaylis-by-Synthesis

投稿日: 2024年11月14日作成者: jarxiv

要約最新の姿勢推定モデルは、手動でラベル付けされた大規模なデータセットでトレー … 続きを読む →

カテゴリー: cs.CV, cs.HC | コメントを受け付けていません

LG-Gaze: Learning Geometry-aware Continuous Prompts for Language-Guided Gaze Estimation

投稿日: 2024年11月14日作成者: jarxiv

要約視線推定モデルの一般化能力は、特にトレーニングデータセットが限られている … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Extracting polygonal footprints in off-nadir images with Segment Anything Model

投稿日: 2024年11月14日作成者: jarxiv

要約オフナディア航空画像からの建物フットプリント抽出 (BFE) には、屋根の … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Zero-shot capability of SAM-family models for bone segmentation in CT scans

投稿日: 2024年11月14日作成者: jarxiv

要約 Segment Anything Model (SAM) および同様のモデ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Optimal Transport on the Lie Group of Roto-translations

投稿日: 2024年11月14日作成者: jarxiv

要約ロトトランスレーション群 SE2 は、画像データをこのリー群で定義された多 … 続きを読む →

カテゴリー: 62H35, 68T45, 68U10, 68U99, 90B06, cs.CV, math.DG, math.OC | コメントを受け付けていません

Towards More Accurate Fake Detection on Images Generated from Advanced Generative and Neural Rendering Models

投稿日: 2024年11月14日作成者: jarxiv

要約特にニューラルラディアンスフィールドや 3D ガウススプラッティング … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation

V-LoL: A Diagnostic Dataset for Visual Logical Learning

NavAgent: Multi-scale Urban Street View Fusion For UAV Embodied Vision-and-Language Navigation

Slender Object Scene Segmentation in Remote Sensing Image Based on Learnable Morphological Skeleton with Segment Anything Model

Generalized Pose Space Embeddings for Training In-the-Wild using Anaylis-by-Synthesis

LG-Gaze: Learning Geometry-aware Continuous Prompts for Language-Guided Gaze Estimation

Extracting polygonal footprints in off-nadir images with Segment Anything Model

Zero-shot capability of SAM-family models for bone segmentation in CT scans

Optimal Transport on the Lie Group of Roto-translations

Towards More Accurate Fake Detection on Images Generated from Advanced Generative and Neural Rendering Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー