「cs.CV」カテゴリーアーカイブ

Karyotype AI for Precision Oncology

投稿日: 2025年3月24日作成者: jarxiv

要約細胞分裂の中期段階の顕微鏡画像から直接血液がんを引き起こす染色体異常を正確 … 続きを読む →

カテゴリー: cs.CV, cs.LG, eess.IV, q-bio.QM | コメントを受け付けていません

Moto: Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos

投稿日: 2025年3月24日作成者: jarxiv

要約大規模なコーパスで事前に訓練された大規模な言語モデルの最近の開発は、微調整 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation

投稿日: 2025年3月24日作成者: jarxiv

要約動的3Dアセット生成のためのマルチビュービデオ拡散モデルであるStable … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass

投稿日: 2025年3月21日作成者: jarxiv

要約マルチビュー3D再構成は、特に多様な視点で正確でスケーラブルな表現を必要と … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.GR, cs.RO | コメントを受け付けていません

GASP: Unifying Geometric and Semantic Self-Supervised Pre-training for Autonomous Driving

投稿日: 2025年3月21日作成者: jarxiv

要約次のトークンの予測に基づいた自己監視の事前トレーニングにより、大規模な言語 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

AutoDrive-QA- Automated Generation of Multiple-Choice Questions for Autonomous Driving Datasets Using Large Vision-Language Models

投稿日: 2025年3月21日作成者: jarxiv

要約自律的な運転では、フリーフォームの応答には複雑なメトリックまたは主観的な人 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

UAS Visual Navigation in Large and Unseen Environments via a Meta Agent

投稿日: 2025年3月21日作成者: jarxiv

要約この作業の目的は、無人の航空システム（UAS）が大規模な都市環境でナビゲー … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Moto: Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos

投稿日: 2025年3月21日作成者: jarxiv

要約大規模なコーパスで事前に訓練された大規模な言語モデルの最近の開発は、微調整 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

MG-SLAM: Structure Gaussian Splatting SLAM with Manhattan World Hypothesis

投稿日: 2025年3月21日作成者: jarxiv

要約ガウスのスプラットスラムは、リアルタイムの再構築の効率と忠実度を改善する上 … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

GraspCoT: Integrating Physical Property Reasoning for 6-DoF Grasping under Flexible Language Instructions

投稿日: 2025年3月21日作成者: jarxiv

要約柔軟な指導ガイド付き6-DOFグラッピングは、実際のロボットシステムにとっ … 続きを読む →

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Karyotype AI for Precision Oncology

Moto: Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos

SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation

Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass

GASP: Unifying Geometric and Semantic Self-Supervised Pre-training for Autonomous Driving

AutoDrive-QA- Automated Generation of Multiple-Choice Questions for Autonomous Driving Datasets Using Large Vision-Language Models

UAS Visual Navigation in Large and Unseen Environments via a Meta Agent

Moto: Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos

MG-SLAM: Structure Gaussian Splatting SLAM with Manhattan World Hypothesis

GraspCoT: Integrating Physical Property Reasoning for 6-DoF Grasping under Flexible Language Instructions

最近の投稿

最近のコメント

アーカイブ

カテゴリー