Monthly Archives: September 2023

Hierarchical Attention and Graph Neural Networks: Toward Drift-Free Pose Estimation

Posted on: September 19, 2023 | Author: jarxiv

Abstract: The methods most commonly used to address 3D geometric registration are …

Categories: cs.CV, cs.LG, cs.RO

What is a Fair Diffusion Model? Designing Generative Text-To-Image Models to Incorporate Various Worldviews

Posted on: September 19, 2023 | Author: jarxiv

Abstract: Generative text-to-image (GTI) models …

Categories: cs.AI, cs.CV, cs.CY, cs.LG

End-to-End Learned Event- and Image-based Visual Odometry

Posted on: September 19, 2023 | Author: jarxiv

Abstract: Visual odometry (VO), particularly in settings such as planetary terrain where GPS …

Categories: cs.CV

vSHARP: variable Splitting Half-quadratic ADMM algorithm for Reconstruction of inverse-Problems

Posted on: September 19, 2023 | Author: jarxiv

Abstract: For medical imaging (MI) tasks such as accelerated parallel magnetic resonance imaging (MRI) …

Categories: cs.CV, eess.IV, physics.med-ph

RecolorNeRF: Layer Decomposed Radiance Fields for Efficient Color Editing of 3D Scenes

Posted on: September 19, 2023 | Author: jarxiv

Abstract: Radiance fields have gradually become a main representation of media. …

Categories: cs.CV, cs.GR

An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models

Posted on: September 19, 2023 | Author: jarxiv

Abstract: Recently, visual instruction tuning, with LLaVA, MiniGPT-4 …

Categories: cs.CL, cs.CV

Learning Human-Human Interactions in Images from Weak Textual Supervision

Posted on: September 19, 2023 | Author: jarxiv

Abstract: Interactions between humans are diverse and context-dependent, but previous work has …

Categories: cs.CL, cs.CV, cs.LG

GEDepth: Ground Embedding for Monocular Depth Estimation

Posted on: September 19, 2023 | Author: jarxiv

Abstract: Because the same 2D image can be projected from infinitely many 3D scenes, monocular …

Categories: cs.CV, cs.RO

General In-Hand Object Rotation with Vision and Touch

Posted on: September 19, 2023 | Author: jarxiv

Abstract: By leveraging multimodal sensory inputs, fingertip-based, multi-axis object …

Categories: cs.AI, cs.CV, cs.LG, cs.RO

MAVIS: Multi-Camera Augmented Visual-Inertial SLAM using SE2(3) Based Exact IMU Pre-integration

Posted on: September 19, 2023 | Author: jarxiv

Abstract: We …, called MAVIS, for partially overlapping multi-camera systems …

Categories: cs.RO
