「cs.CV」カテゴリーアーカイブ

PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models

投稿日: 2024年12月25日作成者: jarxiv

要約テキストまたは画像を 3D に変換するジェネレーターと 3D スキャナーで … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models

投稿日: 2024年12月25日作成者: jarxiv

要約私たちは、計算オーバーヘッドを大幅に削減しながら競争力のあるパフォーマンス … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders

投稿日: 2024年12月25日作成者: jarxiv

要約最近の 3D コンテンツ生成パイプラインは、拡散ベースの生成のために形状を … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning with Dense Labeling

投稿日: 2024年12月25日作成者: jarxiv

要約人手不足の深刻化により、さまざまな環境を支援する家庭用サービスロボット（D … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.RO | コメントを受け付けていません

Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models

投稿日: 2024年12月25日作成者: jarxiv

要約 Foundation Vision Language Models (VL … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding

投稿日: 2024年12月25日作成者: jarxiv

要約 3D シーンを理解するためにガウススプラッティングを知覚タスクに適用する … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

SOUS VIDE: Cooking Visual Drone Navigation Policies in a Gaussian Splatting Vacuum

投稿日: 2024年12月24日作成者: jarxiv

要約私たちは、エンドツーエンドの視覚的なドローンナビゲーションのための新しい … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.RO, cs.SY, eess.SY | コメントを受け付けていません

CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction

投稿日: 2024年12月24日作成者: jarxiv

要約ロボットの視覚運動ポリシー学習において、拡散ベースのモデルは、従来の自己回 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning with Dense Labeling

投稿日: 2024年12月24日作成者: jarxiv

要約人手不足の深刻化により、さまざまな環境を支援する家庭用サービスロボット（D … 続きを読む →

カテゴリー: cs.CL, cs.CV, cs.RO | コメントを受け付けていません

OLiDM: Object-aware LiDAR Diffusion Models for Autonomous Driving

投稿日: 2024年12月24日作成者: jarxiv

要約複雑なシナリオで自動運転の安全性を高めるために、LiDAR 点群データをシ … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models

Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models

Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders

Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning with Dense Labeling

Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models

LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding

SOUS VIDE: Cooking Visual Drone Navigation Policies in a Gaussian Splatting Vacuum

CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction

Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning with Dense Labeling

OLiDM: Object-aware LiDAR Diffusion Models for Autonomous Driving

最近の投稿

最近のコメント

アーカイブ

カテゴリー