Archive of posts by "jarxiv"

Unsupervised UAV 3D Trajectories Estimation with Sparse Point Clouds

Posted: January 3, 2025 | Author: jarxiv

Abstract: Compact UAV systems advance delivery and surveillance, yet their size …

Categories: cs.CV, cs.RO

EA-KD: Entropy-based Adaptive Knowledge Distillation

Posted: January 3, 2025 | Author: jarxiv

Abstract: With knowledge distillation (KD), transferring knowledge from a teacher's outputs or features …

Categories: cs.CV

MLVU: Benchmarking Multi-task Long Video Understanding

Posted: January 3, 2025 | Author: jarxiv

Abstract: Long Video Understanding (LVU) performance …

Categories: cs.AI, cs.CL, cs.CV

Prompt-Based Segmentation at Multiple Resolutions and Lighting Conditions using Segment Anything Model 2

Posted: January 3, 2025 | Author: jarxiv

Abstract: In this paper, the zero-shot, prompt-based Segment Anything …

Categories: cs.CV

Using a CNN Model to Assess Paintings’ Creativity

Posted: January 3, 2025 | Author: jarxiv

Abstract: Assessing artistic creativity has been a long-standing challenge for researchers, with traditional methods taking time …

Categories: cs.CV, cs.HC, cs.LG

Adaptive Prompt Tuning: Vision Guided Prompt Tuning with Cross-Attention for Fine-Grained Few-Shot Learning

Posted: January 3, 2025 | Author: jarxiv

Abstract: Few-shot fine-grained classification in computer vision, with limited data …

Categories: cs.AI, cs.CV, cs.LG

VAPO: Visibility-Aware Keypoint Localization for Efficient 6DoF Object Pose Estimation

Posted: January 3, 2025 | Author: jarxiv

Abstract: As for localizing predefined 3D keypoints in a 2D image, 6 …

Categories: cs.CV

Refining Skewed Perceptions in Vision-Language Models through Visual Representations

Posted: January 3, 2025 | Author: jarxiv

Abstract: Large vision-language models (VLMs) such as CLIP have become foundational, with various …

Categories: cs.CL, cs.CV

The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better

Posted: January 3, 2025 | Author: jarxiv

Abstract: Using text-to-image generative models, an unlimited number of images in a controllable way …

Categories: cs.CV

Region-Guided Attack on the Segment Anything Model (SAM)

Posted: January 3, 2025 | Author: jarxiv

Abstract: The Segment Anything Model (SAM), in image segmentation …

Categories: cs.AI, cs.CR, cs.CV
