"cs.CV" Category Archive

Shelf-Supervised Cross-Modal Pre-Training for 3D Object Detection

Posted: October 16, 2024 | Author: jarxiv

Abstract: For state-of-the-art 3D object detectors, large labeled datasets are often … Continue reading →

Categories: cs.CV, cs.LG, cs.RO

Leveraging Structure Knowledge and Deep Models for the Detection of Abnormal Handwritten Text

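Posted: October 16, 2024 | Author: jarxiv

Abstract: Currently, the disruption of the sequence structure of handwritten text is a main bottleneck limiting recognition tasks … Continue reading →

Categories: cs.CV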

Teaching AI Agents to Search with Reflective-MCTS and Exploratory Learning

Posted: October 16, 2024 | Author: jarxiv

Abstract: In automating complex multi-step decision-making tasks, autonomous agents offer great … Continue reading →

Categories: cs.CL, cs.CV

Evaluating Image Hallucination in Text-to-Image Generation with Question-Answering

Posted: October 16, 2024 | Author: jarxiv

Abstract: Text-to-Image (TTI) generation models have achieved remarkable success … Continue reading →

Categories: cs.AI, cs.CV

SurFhead: Affine Rig Blending for Geometrically Accurate 2D Gaussian Surfel Head Avatars

Posted: October 16, 2024 | Author: jarxiv

Abstract: Owing to recent advances in head avatar rendering using Gaussian primitives, … Continue reading →

Categories: cs.AI, cs.CV, cs.GR

Estimating the distribution of numerosity and non-numerical visual magnitudes in natural scenes using computer vision

Posted: October 16, 2024 | Author: jarxiv

Abstract: Humans, like many animal species, perceive the number of objects in a visual scene and … Continue reading →

Categories: cs.CV

A Survey of Low-shot Vision-Language Model Adaptation via Representer Theorem

Posted: October 16, 2024 | Author: jarxiv

Abstract: With the emergence of pre-trained vision-language foundation models, zero/few-shot ( … Continue reading →

Categories: cs.CV

Visual Fixation-Based Retinal Prosthetic Simulation

Posted: October 16, 2024 | Author: jarxiv

Abstract: In this study, inspired by the saccade mechanism, a visual-fixation-driven artificial … Continue reading →

Categories: cs.CV, cs.NE

Mitigating Backdoor Attack by Injecting Proactive Defensive Backdoor

Posted: October 16, 2024 | Author: jarxiv

Abstract: For machine learning models, data-poisoning backdoor attacks are a serious security … Continue reading →

Categories: cs.CR, cs.CV

POPoS: Improving Efficient and Robust Facial Landmark Detection with Parallel Optimal Position Search

Posted: October 16, 2024 | Author: jarxiv

Abstract: In facial landmark detection (FLD), achieving a balance between accuracy and efficiency is … Continue reading →

Categories: cs.CV
