投稿者「jarxiv」のアーカイブ

BTMTrack: Robust RGB-T Tracking via Dual-template Bridging and Temporal-Modal Candidate Elimination

投稿日: 2025年1月10日作成者: jarxiv

要約 RGB-T トラッキングは、RGB と熱赤外線 (TIR) モダリティの相 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Comparison Study: Glacier Calving Front Delineation in Synthetic Aperture Radar Images With Deep Learning

投稿日: 2025年1月10日作成者: jarxiv

要約海洋終端氷河の前線位置の変化は、氷の質量損失の指標であり、数値氷河モデルに … 続きを読む →

カテゴリー: cs.CV, cs.LG, I.4.6 | コメントを受け付けていません

Geometry Restoration and Dewarping of Camera-Captured Document Images

投稿日: 2025年1月10日作成者: jarxiv

要約この研究は、検出、セグメンテーション、ジオメトリ復元、歪み補正のアルゴリズ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Less is More: The Influence of Pruning on the Explainability of CNNs

投稿日: 2025年1月10日作成者: jarxiv

要約コンピュータービジョンにおける最新の畳み込みニューラルネットワーク ( … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Voxel-Aggregated Feature Synthesis: Efficient Dense Mapping for Simulated 3D Reasoning

投稿日: 2025年1月10日作成者: jarxiv

要約私たちは、最近の最先端 (SOTA) オープンセットマルチモデル 3D … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

JAQ: Joint Efficient Architecture Design and Low-Bit Quantization with Hardware-Software Co-Exploration

投稿日: 2025年1月10日作成者: jarxiv

要約ニューラルネットワークアーキテクチャ、量子化精度、およびハードウェア … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models

投稿日: 2025年1月10日作成者: jarxiv

要約近年、2D Vision-Language Model (VLM) は、画 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

CROPS: Model-Agnostic Training-Free Framework for Safe Image Synthesis with Latent Diffusion Models

投稿日: 2025年1月10日作成者: jarxiv

要約拡散モデルの進歩により、画像生成のパフォーマンスが大幅に向上しました。こ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

1-2-1: Renaissance of Single-Network Paradigm for Virtual Try-On

投稿日: 2025年1月10日作成者: jarxiv

要約仮想試着 (VTON) は、電子商取引において重要なツールとなっており、元 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Arc2Avatar: Generating Expressive 3D Avatars from a Single Image via ID Guidance

投稿日: 2025年1月10日作成者: jarxiv

要約マルチビュー設定内で詳細な 3D シーンを再構成する 3D ガウススプラ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

投稿者「jarxiv」のアーカイブ

BTMTrack: Robust RGB-T Tracking via Dual-template Bridging and Temporal-Modal Candidate Elimination

Comparison Study: Glacier Calving Front Delineation in Synthetic Aperture Radar Images With Deep Learning

Geometry Restoration and Dewarping of Camera-Captured Document Images

Less is More: The Influence of Pruning on the Explainability of CNNs

Voxel-Aggregated Feature Synthesis: Efficient Dense Mapping for Simulated 3D Reasoning

JAQ: Joint Efficient Architecture Design and Low-Bit Quantization with Hardware-Software Co-Exploration

GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models

CROPS: Model-Agnostic Training-Free Framework for Safe Image Synthesis with Latent Diffusion Models

1-2-1: Renaissance of Single-Network Paradigm for Virtual Try-On

Arc2Avatar: Generating Expressive 3D Avatars from a Single Image via ID Guidance

最近の投稿

最近のコメント

アーカイブ

カテゴリー