「cs.CV」カテゴリーアーカイブ

Efficient Driving Behavior Narration and Reasoning on Edge Device Using Large Language Models

投稿日: 2024年10月1日作成者: jarxiv

要約強力な推論機能を備えたディープラーニングアーキテクチャは、自動運転技術の … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

World to Code: Multi-modal Data Generation via Self-Instructed Compositional Captioning and Filtering

投稿日: 2024年10月1日作成者: jarxiv

要約視覚言語モデル (VLM) の最近の進歩と、高品質のマルチモーダルアライ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

POMONAG: Pareto-Optimal Many-Objective Neural Architecture Generator

投稿日: 2024年10月1日作成者: jarxiv

要約 Neural Architecture Search (NAS) はニュー … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

COLLAGE: Collaborative Human-Agent Interaction Generation using Hierarchical Latent Diffusion and Language Models

投稿日: 2024年10月1日作成者: jarxiv

要約我々は、大規模言語モデル (LLM) と階層型モーション固有のベクトル量子 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.GR, cs.LG | コメントを受け付けていません

Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties

投稿日: 2024年10月1日作成者: jarxiv

要約大規模言語モデル (LLM) の最近の成功の背後にある主な理由は、その \ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

LaMMA-P: Generalizable Multi-Agent Long-Horizon Task Allocation and Planning with LM-Driven PDDL Planner

投稿日: 2024年10月1日作成者: jarxiv

要約言語モデル (LM) は自然言語を理解する強力な能力を備えており、人間の指 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.MA, cs.RO | コメントを受け付けていません

Continuously Improving Mobile Manipulation with Autonomous Real-World RL

投稿日: 2024年10月1日作成者: jarxiv

要約我々は、広範な機器や人間による監視なしでポリシーを学習できる、モバイル操作 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO, cs.SY, eess.SY | コメントを受け付けていません

VideoPatchCore: An Effective Method to Memorize Normality for Video Anomaly Detection

投稿日: 2024年10月1日作成者: jarxiv

要約ビデオ異常検出 (VAD) は、コンピュータービジョン内のビデオ分析と監 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

UniEmoX: Cross-modal Semantic-Guided Large-Scale Pretraining for Universal Scene Emotion Perception

投稿日: 2024年10月1日作成者: jarxiv

要約視覚的感情分析は、コンピュータービジョンと心理学の両方において重要な研究 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

DeRainGS: Gaussian Splatting for Enhanced Scene Reconstruction in Rainy Environments

投稿日: 2024年10月1日作成者: jarxiv

要約雨の悪条件下での再建は、視界の低下と視覚認識の歪みにより、重大な課題を引き … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Efficient Driving Behavior Narration and Reasoning on Edge Device Using Large Language Models

World to Code: Multi-modal Data Generation via Self-Instructed Compositional Captioning and Filtering

POMONAG: Pareto-Optimal Many-Objective Neural Architecture Generator

COLLAGE: Collaborative Human-Agent Interaction Generation using Hierarchical Latent Diffusion and Language Models

Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties

LaMMA-P: Generalizable Multi-Agent Long-Horizon Task Allocation and Planning with LM-Driven PDDL Planner

Continuously Improving Mobile Manipulation with Autonomous Real-World RL

VideoPatchCore: An Effective Method to Memorize Normality for Video Anomaly Detection

UniEmoX: Cross-modal Semantic-Guided Large-Scale Pretraining for Universal Scene Emotion Perception

DeRainGS: Gaussian Splatting for Enhanced Scene Reconstruction in Rainy Environments

最近の投稿

最近のコメント

アーカイブ

カテゴリー