月別アーカイブ: 2023年2月

Joint-MAE: 2D-3D Joint Masked Autoencoders for 3D Point Cloud Pre-training

投稿日: 2023年2月28日作成者: jarxiv

要約マスクオートエンコーダー (MAE) は、2D と 3D の両方のコンピ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

DualAfford: Learning Collaborative Visual Affordance for Dual-gripper Object Manipulation

投稿日: 2023年2月28日作成者: jarxiv

要約将来のホームアシスタントロボットにとって、人間の日常環境で多様な 3D … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Depth Perspective-aware Multiple Object Tracking

投稿日: 2023年2月28日作成者: jarxiv

要約このホワイトペーパーでは、複数オブジェクトトラッキング (MOT) に … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Subspace Diffusion Generative Models

投稿日: 2023年2月28日作成者: jarxiv

要約スコアベースのモデルは、高次元拡散プロセスを介してノイズをデータに (およ … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Image-based Pose Estimation and Shape Reconstruction for Robot Manipulators and Soft, Continuum Robots via Differentiable Rendering

投稿日: 2023年2月28日作成者: jarxiv

要約自律システムはセンサーに依存してモーションをキャプチャし、3D 世界でロー … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Knowledge-enhanced Pre-training for Auto-diagnosis of Chest Radiology Images

投稿日: 2023年2月28日作成者: jarxiv

要約自然言語理解と視覚認識における大規模データで事前トレーニングされたマルチモ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Language Is Not All You Need: Aligning Perception with Language Models

投稿日: 2023年2月28日作成者: jarxiv

要約言語、マルチモーダルな知覚、アクション、および世界モデリングの大きな収束は … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

Internet Explorer: Targeted Representation Learning on the Open Web

投稿日: 2023年2月28日作成者: jarxiv

要約最新のビジョンモデルは通常、大規模な静的データセットで事前にトレーニング … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, cs.NE, cs.RO | コメントを受け付けていません

LODE: Locally Conditioned Eikonal Implicit Scene Completion from Sparse LiDAR

投稿日: 2023年2月28日作成者: jarxiv

要約シーンの完成とは、複雑な 3D シーンの不完全な認識から高密度のシーン表現 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

SLAM Backends with Objects in Motion: A Unifying Framework and Tutorial

投稿日: 2023年2月28日作成者: jarxiv

要約 Simultaneous Localization and Mapping … 続きを読む →

カテゴリー: cs.RO, cs.SY, eess.SY | コメントを受け付けていません

月別アーカイブ: 2023年2月

Joint-MAE: 2D-3D Joint Masked Autoencoders for 3D Point Cloud Pre-training

DualAfford: Learning Collaborative Visual Affordance for Dual-gripper Object Manipulation

Depth Perspective-aware Multiple Object Tracking

Subspace Diffusion Generative Models

Image-based Pose Estimation and Shape Reconstruction for Robot Manipulators and Soft, Continuum Robots via Differentiable Rendering

Knowledge-enhanced Pre-training for Auto-diagnosis of Chest Radiology Images

Language Is Not All You Need: Aligning Perception with Language Models

Internet Explorer: Targeted Representation Learning on the Open Web

LODE: Locally Conditioned Eikonal Implicit Scene Completion from Sparse LiDAR

SLAM Backends with Objects in Motion: A Unifying Framework and Tutorial

最近の投稿

最近のコメント

アーカイブ

カテゴリー