「cs.CV」カテゴリーアーカイブ

Improving Vision-Language-Action Model with Online Reinforcement Learning

投稿日: 2025年1月29日作成者: jarxiv

要約最近の研究は、エキスパートロボットデータセットを使用した監視付き微調整（S … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Dream to Drive with Predictive Individual World Model

投稿日: 2025年1月29日作成者: jarxiv

要約道路利用者の意図が不明であるため、複雑な都市環境でリラクティブな運転行動を … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.RO | コメントを受け付けていません

SSF-PAN: Semantic Scene Flow-Based Perception for Autonomous Navigation in Traffic Scenarios

投稿日: 2025年1月29日作成者: jarxiv

要約複雑な交通シナリオでの車両の検出とローカリゼーションは、移動オブジェクトの … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

3D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow

投稿日: 2025年1月29日作成者: jarxiv

要約 3Dビジョンと空間的推論は、特に2D画像に基づいた従来の視覚的推論と比較し … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.RO | コメントを受け付けていません

AdaSemSeg: An Adaptive Few-shot Semantic Segmentation of Seismic Facies

投稿日: 2025年1月29日作成者: jarxiv

要約ディープラーニング方法を使用した地震画像の自動化された解釈は、トレーニング … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation

投稿日: 2025年1月29日作成者: jarxiv

要約テキストまたは単一の画像からの3Dコンテンツ生成における最近の進歩は、限ら … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Target-driven Self-Distillation for Partial Observed Trajectories Forecasting

投稿日: 2025年1月29日作成者: jarxiv

要約交通エージェントの将来の軌跡の正確な予測は、安全な自律運転を確保するために … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Beyond-Labels: Advancing Open-Vocabulary Segmentation With Vision-Language Models

投稿日: 2025年1月29日作成者: jarxiv

要約自己学習学習は、効果的に訓練された場合、多数の画像または言語処理の問題を解 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

FlexMotion: Lightweight, Physics-Aware, and Controllable Human Motion Generation

投稿日: 2025年1月29日作成者: jarxiv

要約軽量で制御可能で、身体的にもっともらしい人間の動きの合成は、アニメーション … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.GR, cs.LG | コメントを受け付けていません

Conterfactual Generative Zero-Shot Semantic Segmentation

投稿日: 2025年1月29日作成者: jarxiv

要約ゼロショット学習は、コンピュータービジョンの重要な部分です。古典的なダウ … 続きを読む →

カテゴリー: 68T07, cs.CV, I.2.10 | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Improving Vision-Language-Action Model with Online Reinforcement Learning

Dream to Drive with Predictive Individual World Model

SSF-PAN: Semantic Scene Flow-Based Perception for Autonomous Navigation in Traffic Scenarios

3D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow

AdaSemSeg: An Adaptive Few-shot Semantic Segmentation of Seismic Facies

DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation

Target-driven Self-Distillation for Partial Observed Trajectories Forecasting

Beyond-Labels: Advancing Open-Vocabulary Segmentation With Vision-Language Models

FlexMotion: Lightweight, Physics-Aware, and Controllable Human Motion Generation

Conterfactual Generative Zero-Shot Semantic Segmentation

最近の投稿

最近のコメント

アーカイブ

カテゴリー