投稿者「jarxiv」のアーカイブ

Towards a General-Purpose Zero-Shot Synthetic Low-Light Image and Video Pipeline

投稿日: 2025年4月17日作成者: jarxiv

要約低光の条件は、人間と機械の両方の注釈に大きな課題をもたらします。これによ … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

InfoNCE: Identifying the Gap Between Theory and Practice

投稿日: 2025年4月17日作成者: jarxiv

要約 Infonceの損失を介した対照学習に関する以前の理論は、特定の仮定の下で … 続きを読む →

カテゴリー: cs.CV, cs.LG, stat.ML | コメントを受け付けていません

SpiritSight Agent: Advanced GUI Agent with One Look

投稿日: 2025年4月17日作成者: jarxiv

要約グラフィカルユーザーインターフェイス（GUI）エージェントは、ヒューマンコ … 続きを読む →

カテゴリー: cs.CV, cs.HC, cs.RO | コメントを受け付けていません

A Semi-Self-Supervised Approach for Dense-Pattern Video Object Segmentation

投稿日: 2025年4月17日作成者: jarxiv

要約ビデオオブジェクトセグメンテーション（VOS） – ビデオの各 … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

CoMotion: Concurrent Multi-person 3D Motion

投稿日: 2025年4月17日作成者: jarxiv

要約単一の単眼カメラストリームから複数の人々の詳細な3Dポーズを検出および追跡 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Beyond Patches: Mining Interpretable Part-Prototypes for Explainable AI

投稿日: 2025年4月17日作成者: jarxiv

要約ディープラーニングは、マルチメディアシステムにかなりの進歩をもたらしました … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Modality-Independent Explainable Detection of Inaccurate Organ Segmentations Using Denoising Autoencoders

投稿日: 2025年4月17日作成者: jarxiv

要約放射線療法の計画では、臨床医によって検出されない場合、危険にさらされている … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

Towards Realistic Low-Light Image Enhancement via ISP Driven Data Modeling

投稿日: 2025年4月17日作成者: jarxiv

要約ディープニューラルネットワーク（DNNS）は、最近、低照度画像強化（LLI … 続きを読む →

カテゴリー: cs.CV, cs.MM | コメントを受け付けていません

Self-Supervised Enhancement of Forward-Looking Sonar Images: Bridging Cross-Modal Degradation Gaps through Feature Space Transformation and Multi-Frame Fusion

投稿日: 2025年4月17日作成者: jarxiv

要約正確な水中ターゲット検出には、前向きに見えるソナー画像を強化することが重要 … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

MMCLIP: Cross-modal Attention Masked Modelling for Medical Language-Image Pre-Training

投稿日: 2025年4月17日作成者: jarxiv

要約医療分野のビジョンと言語の事前トレーニング（VLP）は、画像テキストペアで … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

投稿者「jarxiv」のアーカイブ

Towards a General-Purpose Zero-Shot Synthetic Low-Light Image and Video Pipeline

InfoNCE: Identifying the Gap Between Theory and Practice

SpiritSight Agent: Advanced GUI Agent with One Look

A Semi-Self-Supervised Approach for Dense-Pattern Video Object Segmentation

CoMotion: Concurrent Multi-person 3D Motion

Beyond Patches: Mining Interpretable Part-Prototypes for Explainable AI

Modality-Independent Explainable Detection of Inaccurate Organ Segmentations Using Denoising Autoencoders

Towards Realistic Low-Light Image Enhancement via ISP Driven Data Modeling

Self-Supervised Enhancement of Forward-Looking Sonar Images: Bridging Cross-Modal Degradation Gaps through Feature Space Transformation and Multi-Frame Fusion

MMCLIP: Cross-modal Attention Masked Modelling for Medical Language-Image Pre-Training

最近の投稿

最近のコメント

アーカイブ

カテゴリー