「cs.CV」カテゴリーアーカイブ

Segmentation by Factorization: Unsupervised Semantic Segmentation for Pathology by Factorizing Foundation Model Features

投稿日: 2024年9月10日作成者: jarxiv

要約因数分解によるセグメンテーション (F-SEG) を紹介します。これは、事 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Boosting CNN-based Handwriting Recognition Systems with Learnable Relaxation Labeling

投稿日: 2024年9月10日作成者: jarxiv

要約手書き認識システムの主な課題は、長距離のコンテキスト依存関係を管理すること … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Referring Expression Generation in Visually Grounded Dialogue with Discourse-aware Comprehension Guiding

投稿日: 2024年9月10日作成者: jarxiv

要約我々は、差別的かつ談話に適した指示表現（RE）を生成することを目的とした、 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

The Influence of Faulty Labels in Data Sets on Human Pose Estimation

投稿日: 2024年9月10日作成者: jarxiv

要約この研究では、トレーニングデータの品質が人間姿勢推定 (HPE) におけ … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Robust Loss Functions for Object Grasping under Limited Ground Truth

投稿日: 2024年9月10日作成者: jarxiv

要約物体把握は、ロボットが環境を認識し、環境と十分に対話できるようにする重要な … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

X-InstructBLIP: A Framework for aligning X-Modal instruction-aware representations to LLMs and Emergent Cross-modal Reasoning

投稿日: 2024年9月10日作成者: jarxiv

要約最近の研究では、画像から言語への投影を学習し、大規模言語モデル (LLM) … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

ReL-SAR: Representation Learning for Skeleton Action Recognition with Convolutional Transformers and BYOL

投稿日: 2024年9月10日作成者: jarxiv

要約堅牢で一般化可能なスケルトンアクション認識特徴を抽出するには、通常、十分 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

TivNe-SLAM: Dynamic Mapping and Tracking via Time-Varying Neural Radiance Fields

投稿日: 2024年9月10日作成者: jarxiv

要約 Neural Radiance Fields (NeRF) を Simul … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Long-term Pre-training for Temporal Action Detection with Transformers

投稿日: 2024年9月10日作成者: jarxiv

要約時間的動作検出 (TAD) は困難ですが、現実世界のビデオアプリケーショ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

3D Lymphoma Segmentation on PET/CT Images via Multi-Scale Information Fusion with Cross-Attention

投稿日: 2024年9月10日作成者: jarxiv

要約背景: びまん性大細胞型 B 細胞リンパ腫 (DLBCL) 病変の正確なセ … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Segmentation by Factorization: Unsupervised Semantic Segmentation for Pathology by Factorizing Foundation Model Features

Boosting CNN-based Handwriting Recognition Systems with Learnable Relaxation Labeling

Referring Expression Generation in Visually Grounded Dialogue with Discourse-aware Comprehension Guiding

The Influence of Faulty Labels in Data Sets on Human Pose Estimation

Robust Loss Functions for Object Grasping under Limited Ground Truth

X-InstructBLIP: A Framework for aligning X-Modal instruction-aware representations to LLMs and Emergent Cross-modal Reasoning

ReL-SAR: Representation Learning for Skeleton Action Recognition with Convolutional Transformers and BYOL

TivNe-SLAM: Dynamic Mapping and Tracking via Time-Varying Neural Radiance Fields

Long-term Pre-training for Temporal Action Detection with Transformers

3D Lymphoma Segmentation on PET/CT Images via Multi-Scale Information Fusion with Cross-Attention

最近の投稿

最近のコメント

アーカイブ

カテゴリー