「cs.CV」カテゴリーアーカイブ

Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric

投稿日: 2024年11月26日作成者: jarxiv

要約 AI を活用したビデオ生成技術は近年大幅に進歩しました。ただし、人間の活 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Text-guided Image Restoration and Semantic Enhancement for Text-to-Image Person Retrieval

投稿日: 2024年11月26日作成者: jarxiv

要約テキストから画像への人物検索 (TIPR) の目的は、指定されたテキストの … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Imperceptible Adversarial Examples in the Physical World

投稿日: 2024年11月26日作成者: jarxiv

要約ディープラーニングベースのコンピュータービジョンモデルに対するデジタルドメ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

LegoPET: Hierarchical Feature Guided Conditional Diffusion for PET Image Reconstruction

投稿日: 2024年11月26日作成者: jarxiv

要約陽電子放射断層撮影法 (PET) は、生体内での機能的および生物学的プロセ … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

Word4Per: Zero-shot Composed Person Retrieval

投稿日: 2024年11月26日作成者: jarxiv

要約特定の人物の検索には大きな社会的利点とセキュリティ上の価値があり、多くの場 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.IR | コメントを受け付けていません

DocPedia: Unleashing the Power of Large Multimodal Model in the Frequency Domain for Versatile Document Understanding

投稿日: 2024年11月26日作成者: jarxiv

要約この研究では、最大 2,560$\times$2,560 の解像度で画像を … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Enhancing Multimodal Medical Image Classification using Cross-Graph Modal Contrastive Learning

投稿日: 2024年11月26日作成者: jarxiv

要約医療画像の分類は疾患診断の極めて重要な側面であり、多くの場合、ディープラー … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation

投稿日: 2024年11月26日作成者: jarxiv

要約ストーリーテリングビデオ生成 (SVG) は、入力テキストスクリプトで … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Edge Weight Prediction For Category-Agnostic Pose Estimation

投稿日: 2024年11月26日作成者: jarxiv

要約カテゴリ非依存ポーズ推定 (CAPE) は、1 つまたは少数の注釈付きサポ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Diffusion Features for Zero-Shot 6DoF Object Pose Estimation

投稿日: 2024年11月26日作成者: jarxiv

要約ゼロショットオブジェクトの姿勢推定により、オブジェクト固有のトレーニング … 続きを読む →

カテゴリー: 68T45, cs.CV, I.4.8 | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric

Text-guided Image Restoration and Semantic Enhancement for Text-to-Image Person Retrieval

Imperceptible Adversarial Examples in the Physical World

LegoPET: Hierarchical Feature Guided Conditional Diffusion for PET Image Reconstruction

Word4Per: Zero-shot Composed Person Retrieval

DocPedia: Unleashing the Power of Large Multimodal Model in the Frequency Domain for Versatile Document Understanding

Enhancing Multimodal Medical Image Classification using Cross-Graph Modal Contrastive Learning

DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation

Edge Weight Prediction For Category-Agnostic Pose Estimation

Diffusion Features for Zero-Shot 6DoF Object Pose Estimation

最近の投稿

最近のコメント

アーカイブ

カテゴリー