"cs.AI" Category Archive

A Comprehensive Survey on Deep-Learning-based Vehicle Re-Identification: Models, Data Sets and Challenges

Posted: January 22, 2024 | Author: jarxiv

Abstract: Vehicle re-identification (ReID), over a distributed network of cameras spanning various traffic environments, …

Categories: cs.AI, cs.CV, cs.LG, eess.IV

IM-IAD: Industrial Image Anomaly Detection Benchmark in Manufacturing

Posted: January 22, 2024 | Author: jarxiv

Abstract: Image anomaly detection (IAD) is, in industrial manufacturing (IM), an emerging and important …

Categories: cs.AI, cs.CV

Weakly Supervised Gaussian Contrastive Grounding with Large Multimodal Models for Video Question Answering

Posted: January 22, 2024 | Author: jarxiv

Abstract: Video Question Answering (VideoQA) …

Categories: cs.AI, cs.CL, cs.CV

Q&A Prompts: Discovering Rich Visual Clues through Mining Question-Answer Prompts for VQA requiring Diverse World Knowledge

Posted: January 22, 2024 | Author: jarxiv

Abstract: With breakthrough advances in multimodal large language models, advanced reasoning abilities and world …

Categories: cs.AI, cs.CL, cs.CV

Learning to Visually Connect Actions and their Effects

Posted: January 22, 2024 | Author: jarxiv

Abstract: In this work, visually connecting actions and their effects in video understanding ( …

Categories: cs.AI, cs.CV, cs.LG, cs.RO

Understanding Video Transformers via Universal Concept Discovery

Posted: January 22, 2024 | Author: jarxiv

Abstract: This paper studies the problem of concept-based interpretability of video transformer representations …

Categories: cs.AI, cs.CV, cs.LG, cs.RO

Source-Free and Image-Only Unsupervised Domain Adaptation for Category Level Object Pose Estimation

Posted: January 22, 2024 | Author: jarxiv

Abstract: Without access to source-domain data or 3D annotations during adaptation, RG …

Categories: cs.AI, cs.CV

GBSD: Generative Bokeh with Stage Diffusion

Posted: January 22, 2024 | Author: jarxiv

Abstract: The bokeh effect is an artistic technique that blurs the out-of-focus areas of a photograph, and …

Categories: cs.AI, cs.CV

SCENES: Subpixel Correspondence Estimation With Epipolar Supervision

Posted: January 22, 2024 | Author: jarxiv

Abstract: Extracting point correspondences from two or more views of a scene, in computer …

Categories: cs.AI, cs.CV, cs.LG, cs.RO

Synthesizing Moving People with 3D Control

Posted: January 22, 2024 | Author: jarxiv

Abstract: In this paper, for a given target 3D motion sequence, a single …

Categories: cs.AI, cs.CV
