「cs.CV」カテゴリーアーカイブ

Material Fingerprinting: Identifying and Predicting Perceptual Attributes of Material Appearance

投稿日: 2024年10月18日作成者: jarxiv

要約世界には多様な素材が豊富にあり、それぞれが独特の表面外観を持ち、それらの特 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Spatiotemporal Object Detection for Improved Aerial Vehicle Detection in Traffic Monitoring

投稿日: 2024年10月18日作成者: jarxiv

要約この研究では、時空間物体検出モデルの開発を通じて、UAV カメラを使用した … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

LoLDU: Low-Rank Adaptation via Lower-Diag-Upper Decomposition for Parameter-Efficient Fine-Tuning

投稿日: 2024年10月18日作成者: jarxiv

要約モデルの規模が急速に拡大したため、微調整のために大量の計算リソースが必要に … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Enhanced Prompt-leveraged Weakly Supervised Cancer Segmentation based on Segment Anything

投稿日: 2024年10月18日作成者: jarxiv

要約この研究は、効果的な病理学的画像分析のための教師あり学習を超えた新しいアプ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Granular Privacy Control for Geolocation with Vision Language Models

投稿日: 2024年10月18日作成者: jarxiv

要約ビジョン言語モデル (VLM) は、情報を求める質問に答える機能が急速に進 … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Stratified Domain Adaptation: A Progressive Self-Training Approach for Scene Text Recognition

投稿日: 2024年10月18日作成者: jarxiv

要約教師なしドメイン適応 (UDA) は、特にトレーニングデータとテストデ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Learning Action and Reasoning-Centric Image Editing from Videos and Simulations

投稿日: 2024年10月18日作成者: jarxiv

要約画像編集モデルは、オブジェクトの置換、属性やスタイルの変更、アクションや動 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Help Me Identify: Is an LLM+VQA System All We Need to Identify Visual Concepts?

投稿日: 2024年10月18日作成者: jarxiv

要約少量の視覚データから新しいオブジェクトについて学習し、新しいシナリオにおけ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

ActionCOMET: A Zero-shot Approach to Learn Image-specific Commonsense Concepts about Actions

投稿日: 2024年10月18日作成者: jarxiv

要約人間は、他の人間が実行しているさまざまな行動を（物理的に、またはビデオや画 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

DiRecNetV2: A Transformer-Enhanced Network for Aerial Disaster Recognition

投稿日: 2024年10月18日作成者: jarxiv

要約災害評価における航空画像処理のための無人航空機 (UAV) と人工知能 ( … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Material Fingerprinting: Identifying and Predicting Perceptual Attributes of Material Appearance

Spatiotemporal Object Detection for Improved Aerial Vehicle Detection in Traffic Monitoring

LoLDU: Low-Rank Adaptation via Lower-Diag-Upper Decomposition for Parameter-Efficient Fine-Tuning

Enhanced Prompt-leveraged Weakly Supervised Cancer Segmentation based on Segment Anything

Granular Privacy Control for Geolocation with Vision Language Models

Stratified Domain Adaptation: A Progressive Self-Training Approach for Scene Text Recognition

Learning Action and Reasoning-Centric Image Editing from Videos and Simulations

Help Me Identify: Is an LLM+VQA System All We Need to Identify Visual Concepts?

ActionCOMET: A Zero-shot Approach to Learn Image-specific Commonsense Concepts about Actions

DiRecNetV2: A Transformer-Enhanced Network for Aerial Disaster Recognition

最近の投稿

最近のコメント

アーカイブ

カテゴリー