「cs.CV」カテゴリーアーカイブ

Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

投稿日: 2024年10月10日作成者: jarxiv

要約最近の研究では、(生成) 拡散モデルのノイズ除去プロセスにより、モデル内に … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

ELMO: Enhanced Real-time LiDAR Motion Capture through Upsampling

投稿日: 2024年10月10日作成者: jarxiv

要約このペーパーでは、単一の LiDAR センサー用に設計されたリアルタイム … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.GR, cs.LG | コメントを受け付けていません

Bridge the Points: Graph-based Few-shot Segment Anything Semantically

投稿日: 2024年10月10日作成者: jarxiv

要約大規模な事前トレーニング技術の最近の進歩により、ビジョン基盤モデル、特にポ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference

投稿日: 2024年10月10日作成者: jarxiv

要約ビジョン言語モデル (VLM) では、ビジュアルトークンは、テキストト … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

The BRAVO Semantic Segmentation Challenge Results in UNCV2024

投稿日: 2024年10月10日作成者: jarxiv

要約私たちは、現実的な摂動や未知の配信外 (OOD) シナリオの下でセマンティ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Diagnosis of Malignant Lymphoma Cancer Using Hybrid Optimized Techniques Based on Dense Neural Networks

投稿日: 2024年10月10日作成者: jarxiv

要約リンパ腫の診断、特にサブタイプを区別することは効果的な治療に不可欠ですが、 … 続きを読む →

カテゴリー: cs.CV, cs.LG, eess.IV | コメントを受け付けていません

Adaptive High-Frequency Transformer for Diverse Wildlife Re-Identification

投稿日: 2024年10月10日作成者: jarxiv

要約 Wildlife ReID には、視覚テクノロジーを利用してさまざまなシナ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Structure-Centric Robust Monocular Depth Estimation via Knowledge Distillation

投稿日: 2024年10月10日作成者: jarxiv

要約自己教師あり学習によって可能になる単眼の奥行き推定は、コンピュータービジ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Jointly Generating Multi-view Consistent PBR Textures using Collaborative Control

投稿日: 2024年10月10日作成者: jarxiv

要約マルチビューの一貫性は、画像拡散モデルにとって依然として課題です。完全な … 続きを読む →

カテゴリー: cs.CV, cs.GR | コメントを受け付けていません

A Unified Generative Framework for Realistic Lidar Simulation in Autonomous Driving Systems

投稿日: 2024年10月10日作成者: jarxiv

要約知覚センサーのシミュレーションモデルは、自動運転システム (ADS) の … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.RO, eess.IV | コメントを受け付けていません

「cs.CV」カテゴリーアーカイブ

Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

ELMO: Enhanced Real-time LiDAR Motion Capture through Upsampling

Bridge the Points: Graph-based Few-shot Segment Anything Semantically

SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference

The BRAVO Semantic Segmentation Challenge Results in UNCV2024

Diagnosis of Malignant Lymphoma Cancer Using Hybrid Optimized Techniques Based on Dense Neural Networks

Adaptive High-Frequency Transformer for Diverse Wildlife Re-Identification

Structure-Centric Robust Monocular Depth Estimation via Knowledge Distillation

Jointly Generating Multi-view Consistent PBR Textures using Collaborative Control

A Unified Generative Framework for Realistic Lidar Simulation in Autonomous Driving Systems

最近の投稿

最近のコメント

アーカイブ

カテゴリー