「cs.CV」カテゴリーアーカイブ

BiVLC: Extending Vision-Language Compositionality Evaluation with Text-to-Image Retrieval

投稿日: 2024年11月5日作成者: jarxiv

要約 SugarCrepe のような既存の視覚言語構成性 (VLC) ベンチマー … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.LG | コメントを受け付けていません

CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark

投稿日: 2024年11月5日作成者: jarxiv

要約大規模マルチモーダルモデル (LMM) の機能が進化し続けるにつれて、L … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images

投稿日: 2024年11月5日作成者: jarxiv

要約リモートセンシング画像は、農業、水資源、軍事、災害救援などの分野で、かけが … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Framer: Interactive Frame Interpolation

投稿日: 2024年11月5日作成者: jarxiv

要約私たちはインタラクティブなフレーム補間のための Framer を提案します … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Model Pairing Using Embedding Translation for Backdoor Attack Detection on Open-Set Classification Tasks

投稿日: 2024年11月5日作成者: jarxiv

要約バックドア攻撃により、攻撃者は機械学習アルゴリズムに特定の脆弱性を埋め込む … 続きを読む →

カテゴリー: cs.CR, cs.CV | コメントを受け付けていません

Advanced Vision Transformers and Open-Set Learning for Robust Mosquito Classification: A Novel Approach to Entomological Studies

投稿日: 2024年11月5日作成者: jarxiv

要約蚊関連疾患は世界の公衆衛生に重大な脅威をもたらしており、効果的な監視と制御 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

The evolution of volumetric video: A survey of smart transcoding and compression approaches

投稿日: 2024年11月5日作成者: jarxiv

要約 3 次元 (3D) 画像のキャプチャと表示であるボリュメトリックビデオは … 続きを読む →

カテゴリー: cs.CV, cs.GR, cs.HC | コメントを受け付けていません

FilterViT and DropoutViT: Lightweight Vision Transformer Models for Efficient Attention Mechanisms

投稿日: 2024年11月5日作成者: jarxiv

要約この研究では、MobileViT の拡張バージョンである FilterVi … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Differentially Private Integrated Decision Gradients (IDG-DP) for Radar-based Human Activity Recognition

投稿日: 2024年11月5日作成者: jarxiv

要約人間の動作分析は、医療モニタリングと病気の早期発見に大きな可能性をもたらし … 続きを読む →

カテゴリー: cs.AI, cs.CR, cs.CV, cs.LG | コメントを受け付けていません

Deep Learning on 3D Semantic Segmentation: A Detailed Review

投稿日: 2024年11月5日作成者: jarxiv

要約この論文では、3D セマンティックセグメンテーション (3DSS) にお … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

BiVLC: Extending Vision-Language Compositionality Evaluation with Text-to-Image Retrieval

CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark

SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images

Framer: Interactive Frame Interpolation

Model Pairing Using Embedding Translation for Backdoor Attack Detection on Open-Set Classification Tasks

Advanced Vision Transformers and Open-Set Learning for Robust Mosquito Classification: A Novel Approach to Entomological Studies

The evolution of volumetric video: A survey of smart transcoding and compression approaches

FilterViT and DropoutViT: Lightweight Vision Transformer Models for Efficient Attention Mechanisms

Differentially Private Integrated Decision Gradients (IDG-DP) for Radar-based Human Activity Recognition

Deep Learning on 3D Semantic Segmentation: A Detailed Review

最近の投稿

最近のコメント

アーカイブ

カテゴリー