「cs.CV」カテゴリーアーカイブ

Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models

投稿日: 2024年10月14日作成者: jarxiv

要約 Large Vision-Language Model (LVLM) は、 … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Bridge the Points: Graph-based Few-shot Segment Anything Semantically

投稿日: 2024年10月14日作成者: jarxiv

要約大規模な事前トレーニング技術の最近の進歩により、ビジョン基盤モデル、特にポ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Calibrated Cache Model for Few-Shot Vision-Language Model Adaptation

投稿日: 2024年10月14日作成者: jarxiv

要約キャッシュベースのアプローチは、ビジョン言語モデル (VLM) を適応させ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

LoTLIP: Improving Language-Image Pre-training for Long Text Understanding

投稿日: 2024年10月14日作成者: jarxiv

要約長いテキストを理解することは実際には大きな要求ですが、ほとんどの言語画像事 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Efficient Hyperparameter Importance Assessment for CNNs

投稿日: 2024年10月14日作成者: jarxiv

要約ハイパーパラメータの選択は機械学習パイプラインの重要な側面であり、モデルの … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Accurately Classifying Out-Of-Distribution Data in Facial Recognition

投稿日: 2024年10月14日作成者: jarxiv

要約標準的な分類理論では、テストセットとトレーニングセット内の画像の分布が … 続きを読む →

カテゴリー: cs.CV, cs.CY, cs.LG | コメントを受け付けていません

HyperPg — Prototypical Gaussians on the Hypersphere for Interpretable Deep Learning

投稿日: 2024年10月14日作成者: jarxiv

要約プロトタイプ学習手法は、ブラックボックスの深層学習モデルに代わる解釈可能な … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Zero-Shot Pupil Segmentation with SAM 2: A Case Study of Over 14 Million Images

投稿日: 2024年10月14日作成者: jarxiv

要約私たちは、視線推定および視線追跡技術の進歩における、視覚基盤モデルである … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.HC | コメントを受け付けていません

For a semiotic AI: Bridging computer vision and visual semiotics for computational observation of large scale facial image archives

投稿日: 2024年10月14日作成者: jarxiv

要約ソーシャルネットワークは、人間の顔や体の画像の認知的、感情的、実用的な価 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Match me if you can: Semi-Supervised Semantic Correspondence Learning with Unpaired Images

投稿日: 2024年10月14日作成者: jarxiv

要約セマンティック対応方法は、モデルの能力を最大化することを目的として、複雑な … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models

Bridge the Points: Graph-based Few-shot Segment Anything Semantically

Calibrated Cache Model for Few-Shot Vision-Language Model Adaptation

LoTLIP: Improving Language-Image Pre-training for Long Text Understanding

Efficient Hyperparameter Importance Assessment for CNNs

Accurately Classifying Out-Of-Distribution Data in Facial Recognition

HyperPg — Prototypical Gaussians on the Hypersphere for Interpretable Deep Learning

Zero-Shot Pupil Segmentation with SAM 2: A Case Study of Over 14 Million Images

For a semiotic AI: Bridging computer vision and visual semiotics for computational observation of large scale facial image archives

Match me if you can: Semi-Supervised Semantic Correspondence Learning with Unpaired Images

最近の投稿

最近のコメント

アーカイブ

カテゴリー