Category archive: cs.CV

Near, far: Patch-ordering enhances vision foundation models’ scene understanding

Posted: February 12, 2025 | Author: jarxiv

Abstract: We introduce NeCo: Patch Neighbor Consistency. …

Categories: cs.AI, cs.CV

Efficient Image-to-Image Diffusion Classifier for Adversarial Robustness

Posted: February 12, 2025 | Author: jarxiv

Abstract: Regarding diffusion models (DMs), DM-based defense methods offer strong defensive capability without adversarial training …

Categories: cs.CV

DSV: Exploiting Dynamic Sparsity to Accelerate Large-Scale Video DiT Training

Posted: February 12, 2025 | Author: jarxiv

Abstract: Diffusion Transformers (DiTs) show remarkable performance in modeling and generating high-quality videos …

Categories: cs.CV, cs.DC

YOLO Network For Defect Detection In Optical lenses

Posted: February 12, 2025 | Author: jarxiv

Abstract: Mass-produced optical lenses can exhibit defects that alter their scattering properties and compromise quality standards …

Categories: cs.CV

PlaySlot: Learning Inverse Latent Dynamics for Controllable Object-Centric Video Prediction and Planning

Posted: February 12, 2025 | Author: jarxiv

Abstract: Predicting future scene representations, so that robots can understand and interact with their environment, …

Categories: cs.CV, cs.RO

Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models

Posted: February 12, 2025 | Author: jarxiv

Abstract: Zero-shot anomaly detection (ZSAD) is an emerging anomaly detection (AD) paradigm. …

Categories: cs.CL, cs.CV

An Improved Optimal Proximal Gradient Algorithm for Non-Blind Image Deblurring

Posted: February 12, 2025 | Author: jarxiv

Abstract: Image deblurring is a central research area in image processing that improves image quality and …

Categories: cs.CV, math.OC

Generalized Least Squares Kernelized Tensor Factorization

Posted: February 12, 2025 | Author: jarxiv

Abstract: Completing multidimensional tensor-structured data with missing entries is …

Categories: cs.CV, cs.LG, stat.ML

CoS: Chain-of-Shot Prompting for Long Video Understanding

Posted: February 12, 2025 | Author: jarxiv

Abstract: Multimodal large language models (MLLMs), because they require an excessive number of visual tokens, …

Categories: cs.CV

OBI-Bench: Can LMMs Aid in Study of Ancient Script on Oracle Bones?

Posted: February 12, 2025 | Author: jarxiv

Abstract: Demanding expert-level domain knowledge and deliberate cognition, the whole-process Oracl …

Categories: cs.AI, cs.CV
