月別アーカイブ: 2023年6月

Lesion Detection on Leaves using Class Activation Maps

投稿日: 2023年6月26日作成者: jarxiv

要約植物の葉の病変の検出は、植物病理学および農業研究において重要なタスクです。 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Semi-Implicit Denoising Diffusion Models (SIDDMs)

投稿日: 2023年6月26日作成者: jarxiv

要約生成モデルの急増にもかかわらず、サンプルの多様性と品質を損なうことなく推論 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

First Place Solution to the CVPR’2023 AQTC Challenge: A Function-Interaction Centric Approach with Spatiotemporal Visual-Language Alignment

投稿日: 2023年6月26日作成者: jarxiv

要約アフォーダンス中心の質問主導型タスク完了 (AQTC) は、ビデオから知識 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

DiffInfinite: Large Mask-Image Synthesis via Parallel Random Patch Diffusion in Histopathology

投稿日: 2023年6月26日作成者: jarxiv

要約我々は、長距離相関構造情報を保存しながら任意に大きな組織学的画像を生成する … 続きを読む →

カテゴリー: cs.CV, cs.LG, eess.IV | コメントを受け付けていません

MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models

投稿日: 2023年6月26日作成者: jarxiv

要約マルチモーダル大規模言語モデル (MLLM) は、強力な LLM に依存し … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

3D VR Sketch Guided 3D Shape Prototyping and Exploration

投稿日: 2023年6月26日作成者: jarxiv

要約 3D 形状モデリングは多大な労力と時間がかかり、長年の専門知識が必要です。 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

PP-GAN : Style Transfer from Korean Portraits to ID Photos Using Landmark Extractor with GAN

投稿日: 2023年6月26日作成者: jarxiv

要約スタイル転送の目的は、別の画像のスタイルを転送しながら、画像のコンテンツを … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Towards Unseen Triples: Effective Text-Image-joint Learning for Scene Graph Generation

投稿日: 2023年6月26日作成者: jarxiv

要約シーングラフ生成 (SGG) は、画像内のオブジェクトとその接続を構造的 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Day2Dark: Pseudo-Supervised Activity Recognition beyond Silent Daylight

投稿日: 2023年6月26日作成者: jarxiv

要約この論文は、日中だけでなく暗闇での活動を認識することに努めています。私た … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Summarize the Past to Predict the Future: Natural Language Descriptions of Context Boost Multimodal Object Interaction

投稿日: 2023年6月26日作成者: jarxiv

要約私たちは自己中心的なビデオにおけるオブジェクトの相互作用の予測を研究してい … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

月別アーカイブ: 2023年6月

Lesion Detection on Leaves using Class Activation Maps

Semi-Implicit Denoising Diffusion Models (SIDDMs)

First Place Solution to the CVPR’2023 AQTC Challenge: A Function-Interaction Centric Approach with Spatiotemporal Visual-Language Alignment

DiffInfinite: Large Mask-Image Synthesis via Parallel Random Patch Diffusion in Histopathology

MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models

3D VR Sketch Guided 3D Shape Prototyping and Exploration

PP-GAN : Style Transfer from Korean Portraits to ID Photos Using Landmark Extractor with GAN

Towards Unseen Triples: Effective Text-Image-joint Learning for Scene Graph Generation

Day2Dark: Pseudo-Supervised Activity Recognition beyond Silent Daylight

Summarize the Past to Predict the Future: Natural Language Descriptions of Context Boost Multimodal Object Interaction

最近の投稿

最近のコメント

アーカイブ

カテゴリー