"cs.CV" Category Archive

Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projection

Posted: March 18, 2025 | Author: jarxiv

Abstract: Recent studies show that large vision-language models (LVLMs) are often prone to object hallucination …

Categories: cs.CV

LEAVS: An LLM-based Labeler for Abdominal CT Supervision

Posted: March 18, 2025 | Author: jarxiv

Abstract: Extracting structured labels from radiology reports has been adopted to create vision models …

Categories: cs.AI, cs.CV, eess.IV

STEP: Simultaneous Tracking and Estimation of Pose for Animals and Humans

Posted: March 18, 2025 | Author: jarxiv

Abstract: Aimed at simultaneous tracking and estimation of pose across diverse animal species and humans, our Transformer …

Categories: cs.CV

Spatio-Temporal Distortion Aware Omnidirectional Video Super-Resolution

Posted: March 18, 2025 | Author: jarxiv

Abstract: Omnidirectional video (ODV) provides an immersive visual experience and, in virtual reality and augmented reality, is widely …

Categories: cs.CV

TriDF: Triplane-Accelerated Density Fields for Few-Shot Remote Sensing Novel View Synthesis

Posted: March 18, 2025 | Author: jarxiv

Abstract: For urban planning and environmental monitoring, remote sensing novel view synthesis (NVS) is an important …

Categories: cs.CV

GuardSplat: Efficient and Robust Watermarking for 3D Gaussian Splatting

Posted: March 18, 2025 | Author: jarxiv

Abstract: 3D Gaussian Splatting (3DGS) has recently, in a variety of …

Categories: cs.CR, cs.CV

Parameter-free structure-texture image decomposition by unrolling

Posted: March 18, 2025 | Author: jarxiv

Abstract: In this work, to tackle the structure-texture image decomposition problem, a parameter-free …

Categories: 68U10, 90C26, cs.CV, cs.NA, eess.IV, math.NA

One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation

Posted: March 18, 2025 | Author: jarxiv

Abstract: Diffusion models for super-resolution (SR) produce high-quality visual results, but high …

Categories: cs.CV

Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning

Posted: March 18, 2025 | Author: jarxiv

Abstract: Recent advances in large language models (LLMs) have demonstrated enhanced reasoning capabilities, …

Categories: cs.AI, cs.CV, cs.LG

Structure-Activation Synergy: A Dual Efficiency Framework for Parameter-Memory Optimized Transfer Learning

Posted: March 18, 2025 | Author: jarxiv

Abstract: Parameter-efficient transfer learning (PETL) adapts large pre-trained models …

Categories: cs.CV
