Monthly Archive: April 2024

TOP-Nav: Legged Navigation Integrating Terrain, Obstacle and Proprioception Estimation

Posted on April 24, 2024 by jarxiv

Abstract: Legged navigation, typically in open-world, off-road, and challenging environments … Continue reading →

Categories: cs.AI, cs.CV, cs.RO, cs.SY, eess.SY | Comments closed

FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent

Posted on April 24, 2024 by jarxiv

Abstract: In this paper, accurate camera poses for video sequences, camera intrinsic … Continue reading →

Categories: cs.CV | Comments closed

Multi-Session SLAM with Differentiable Wide-Baseline Pose Optimization

Posted on April 24, 2024 by jarxiv

Abstract: We introduce a new system for multi-session SLAM. This system, a single … Continue reading →

Categories: cs.CV | Comments closed

TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting

Posted on April 24, 2024 by jarxiv

Abstract: Radiance fields, in synthesizing lifelike 3D talking heads, … Continue reading →

Categories: cs.CV | Comments closed

VideoXum: Cross-modal Visual and Textural Summarization of Videos

Posted on April 24, 2024 by jarxiv

Abstract: Video summarization extracts the most important information from a source video, and a summarized clip … Continue reading →

Categories: cs.CL, cs.CV | Comments closed

From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation

Posted on April 24, 2024 by jarxiv

Abstract: With recent advances in controllable human image generation, structural signals (e.g., pose, depth) … Continue reading →

Categories: cs.CV | Comments closed

Automatic Layout Planning for Visually-Rich Documents with Instruction-Following Models

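Posted on April 24, 2024 by jarxiv

Abstract: Recent advances in instruction-following models have made interactions between users and models more user-friendly … Continue reading →

Categories: cs.AI, cs.CL, cs.CV | Comments closed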

CT-GLIP: 3D Grounded Language-Image Pretraining with CT Scans and Radiology Reports for Full-Body Scenarios

Posted on April 24, 2024 by jarxiv

Abstract: Medical Vision-Language Pretraining ( … Continue reading →

Categories: cs.AI, cs.CL, cs.CV | Comments closed

Metric-guided Image Reconstruction Bounds via Conformal Prediction

Posted on April 24, 2024 by jarxiv

Abstract: With recent advances in machine learning, new imaging systems that address ill-posed problems … Continue reading →

Categories: cs.CV, cs.LG, eess.IV, physics.med-ph | Comments closed

Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance

Posted on April 24, 2024 by jarxiv

Abstract: Weakly supervised 3D object detection, with 2D labels and other lower annotation … Continue reading →

Categories: cs.CV | Comments closed
