「cs.CV」カテゴリーアーカイブ

Socratic Planner: Self-QA-Based Zero-Shot Planning for Embodied Instruction Following

投稿日: 2025年3月27日作成者: jarxiv

要約次の具体化された命令（EIF）は、インタラクティブな環境でオブジェクトをナ … 続きを読む →

カテゴリー: (Primary), 68T45, 68T50, cs.AI, cs.CL, cs.CV, cs.RO | コメントを受け付けていません

VideoGEM: Training-free Action Grounding in Videos

投稿日: 2025年3月27日作成者: jarxiv

要約 Vision-Language Foundationモデルは、主に画像のオ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG | コメントを受け付けていません

VPO: Aligning Text-to-Video Generation Models with Prompt Optimization

投稿日: 2025年3月27日作成者: jarxiv

要約ビデオ生成モデルは、テキストからビデオへのタスクで顕著な進歩を達成していま … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.LG | コメントを受け付けていません

Contrastive Learning Guided Latent Diffusion Model for Image-to-Image Translation

投稿日: 2025年3月27日作成者: jarxiv

要約拡散モデルは、テキスト誘導画像翻訳のための多様で高品質の画像の合成において … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Towards End-to-End Neuromorphic Voxel-based 3D Object Reconstruction Without Physical Priors

投稿日: 2025年3月27日作成者: jarxiv

要約イベントカメラとも呼ばれる神経型カメラは、モーションブラーに苦しむことなく … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Fantastic Copyrighted Beasts and How (Not) to Generate Them

投稿日: 2025年3月27日作成者: jarxiv

要約最近の研究では、画像とビデオ生成モデルをトレーニングデータから著作権で保護 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.CY, cs.LG | コメントを受け付けていません

Bayesian Modeling of Zero-Shot Classifications for Urban Flood Detection

投稿日: 2025年3月27日作成者: jarxiv

要約ストリートビューまたはダッシュボードカメラから収集されたストリートシーンの … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Towards Efficient and General-Purpose Few-Shot Misclassification Detection for Vision-Language Models

投稿日: 2025年3月27日作成者: jarxiv

要約分類器による信頼できる予測は、セキュリティが高く、動的に変化する状況での展 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

GAIA-2: A Controllable Multi-View Generative World Model for Autonomous Driving

投稿日: 2025年3月27日作成者: jarxiv

要約生成モデルは、複雑な環境をシミュレートするためのスケーラブルで柔軟なパラダ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

PG-SAM: Prior-Guided SAM with Medical for Multi-organ Segmentation

投稿日: 2025年3月27日作成者: jarxiv

要約セグメントAnything Model（SAM）は、強力なゼロショット機能 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Socratic Planner: Self-QA-Based Zero-Shot Planning for Embodied Instruction Following

VideoGEM: Training-free Action Grounding in Videos

VPO: Aligning Text-to-Video Generation Models with Prompt Optimization

Contrastive Learning Guided Latent Diffusion Model for Image-to-Image Translation

Towards End-to-End Neuromorphic Voxel-based 3D Object Reconstruction Without Physical Priors

Fantastic Copyrighted Beasts and How (Not) to Generate Them

Bayesian Modeling of Zero-Shot Classifications for Urban Flood Detection

Towards Efficient and General-Purpose Few-Shot Misclassification Detection for Vision-Language Models

GAIA-2: A Controllable Multi-View Generative World Model for Autonomous Driving

PG-SAM: Prior-Guided SAM with Medical for Multi-organ Segmentation

最近の投稿

最近のコメント

アーカイブ

カテゴリー