"cs.CV" Category Archive

Weighted Ensemble Models Are Strong Continual Learners

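Posted on: December 12, 2024 | Author: jarxiv

Abstract: This work studies the problem of continual learning (CL). The goal involves the current task … Continue reading →

Categories: cs.AI, cs.CV, cs.LG | Comments are closed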

Orchestrating the Symphony of Prompt Distribution Learning for Human-Object Interaction Detection

Posted on: December 12, 2024 | Author: jarxiv

Abstract: Equipped with the common query-transformer architecture, human-obj … Continue reading →

Categories: cs.CV | Comments are closed

Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided by Text Information

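Posted on: December 12, 2024 | Author: jarxiv

Abstract: With the advance of large-scale language modeling techniques, visual encoders and large language mo … Continue reading →

Categories: cs.CV | Comments are closed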

Combining Neural Fields and Deformation Models for Non-Rigid 3D Motion Reconstruction from Partial Data

Posted on: December 12, 2024 | Author: jarxiv

Abstract: From partial, possibly unstructured observations of non-rigidly deforming shapes, we … Continue reading →

Categories: cs.CV | Comments are closed

INRetouch: Context Aware Implicit Neural Representation for Photography Retouching

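Posted on: December 12, 2024 | Author: jarxiv

Abstract: Professional photo editing remains challenging, and regarding the imaging pipeline, extensive … Continue reading →

Categories: cs.CV, eess.IV | Comments are closed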

Open-Canopy: Towards Very High Resolution Forest Monitoring

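Posted on: December 12, 2024 | Author: jarxiv

Abstract: Estimating canopy height and its changes at meter resolution from satellite imagery is an important … Continue reading →

Categories: cs.CV, eess.IV | Comments are closed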

Learning to Decouple the Lights for 3D Face Texture Modeling

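Posted on: December 12, 2024 | Author: jarxiv

Abstract: Existing work, on images in which faces are brightly lit and external occlusions are minimal, … Continue reading →

Categories: cs.CV | Comments are closed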

SenCLIP: Enhancing zero-shot land-use mapping for Sentinel-2 with ground-level prompting

Posted on: December 12, 2024 | Author: jarxiv

Abstract: Pre-trained vision-language models (VLMs) such as CLIP … Continue reading →

Categories: cs.CV | Comments are closed

Estimating the Number of HTTP/3 Responses in QUIC Using Deep Learning

Posted on: December 12, 2024 | Author: jarxiv

Abstract: QUIC is a new, increasingly used transport protocol, and … Continue reading →

Categories: cs.CV, cs.LG, cs.NI | Comments are closed

Improving Satellite Imagery Masking using Multi-task and Transfer Learning

Posted on: December 12, 2024 | Author: jarxiv

Abstract: In many remote sensing applications, for subsequent measurements, satellite ima … Continue reading →

Categories: cs.CV | Comments are closed
