月別アーカイブ: 2024年4月

Advanced wood species identification based on multiple anatomical sections and using deep feature transfer and fusion

投稿日: 2024年4月15日作成者: jarxiv

要約近年、木材種の識別において多くの進歩が見られます。 DNA 分析、近赤外 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Enhancing Visual Question Answering through Question-Driven Image Captions as Prompts

投稿日: 2024年4月15日作成者: jarxiv

要約ビジュアル質問応答 (VQA) は、ビジョンと言語の内容についての理解、推 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

ProbMCL: Simple Probabilistic Contrastive Learning for Multi-label Visual Classification

投稿日: 2024年4月15日作成者: jarxiv

要約マルチラベル画像分類は、コンピュータービジョンや医療画像処理など、多くの … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Improving Referring Image Segmentation using Vision-Aware Text Features

投稿日: 2024年4月15日作成者: jarxiv

要約画像セグメンテーションの参照は、自然言語記述に基づいてピクセル単位のセグメ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

WonderJourney: Going from Anywhere to Everywhere

投稿日: 2024年4月15日作成者: jarxiv

要約永続的な 3D シーン生成のためのモジュール化されたフレームワークである … 続きを読む →

カテゴリー: cs.CV, cs.GR | コメントを受け付けていません

PromptSync: Bridging Domain Gaps in Vision-Language Models through Class-Aware Prototype Alignment and Discrimination

投稿日: 2024年4月15日作成者: jarxiv

要約 CLIP などのビジョン言語 (V-L) モデルのゼロショット一般化の可能 … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation

投稿日: 2024年4月15日作成者: jarxiv

要約オープン語彙オブジェクト検出 (OVOD) は、トレーニング時には表示され … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

FloCoDe: Unbiased Dynamic Scene Graph Generation with Temporal Consistency and Correlation Debiasing

投稿日: 2024年4月15日作成者: jarxiv

要約ビデオからの動的シーングラフ生成 (SGG) には、シーン全体のオブジェ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Automatic Quantification of Serial PET/CT Images for Pediatric Hodgkin Lymphoma Patients Using a Longitudinally-Aware Segmentation Network

投稿日: 2024年4月15日作成者: jarxiv

要約 $\textbf{目的}$: 中間治療スキャンにおける残存病変は多くの場合 … 続きを読む →

カテゴリー: cs.AI, cs.CV, physics.med-ph | コメントを受け付けていません

LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models

投稿日: 2024年4月15日作成者: jarxiv

要約大規模マルチモーダルモデル (LMM) は、ビジュアルエンコーダーと大 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

月別アーカイブ: 2024年4月

Advanced wood species identification based on multiple anatomical sections and using deep feature transfer and fusion

Enhancing Visual Question Answering through Question-Driven Image Captions as Prompts

ProbMCL: Simple Probabilistic Contrastive Learning for Multi-label Visual Classification

Improving Referring Image Segmentation using Vision-Aware Text Features

WonderJourney: Going from Anywhere to Everywhere

PromptSync: Bridging Domain Gaps in Vision-Language Models through Class-Aware Prototype Alignment and Discrimination

Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation

FloCoDe: Unbiased Dynamic Scene Graph Generation with Temporal Consistency and Correlation Debiasing

Automatic Quantification of Serial PET/CT Images for Pediatric Hodgkin Lymphoma Patients Using a Longitudinally-Aware Segmentation Network

LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー