月別アーカイブ: 2024年2月

Real-World Atmospheric Turbulence Correction via Domain Adaptation

投稿日: 2024年2月13日作成者: jarxiv

要約日常生活でよく見られる現象である大気の乱流は、主に地表の不均一な加熱によっ … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

Unsupervised Discovery of Object-Centric Neural Fields

投稿日: 2024年2月13日作成者: jarxiv

要約私たちは、単一の画像から 3D オブジェクト中心のシーン表現を推測すること … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Towards Calibrated Robust Fine-Tuning of Vision-Language Models

投稿日: 2024年2月13日作成者: jarxiv

要約堅牢な微調整は、配布外 (OOD) サンプルでのパフォーマンスを確保するこ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Exploring Perceptual Limitation of Multimodal Large Language Models

投稿日: 2024年2月13日作成者: jarxiv

要約マルチモーダル大規模言語モデル (MLLM) は最近、視覚的な質問に答える … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Make it more specific: A novel uncertainty based airway segmentation application on 3D U-Net and its variants

投稿日: 2024年2月13日作成者: jarxiv

要約最も正確な予測モデルを取得できるように、各医療セグメンテーションタスクは … 続きを読む →

カテゴリー: cs.CV, eess.IV | コメントを受け付けていません

Towards Perceiving Small Visual Details in Zero-shot Visual Question Answering with Multimodal LLMs

投稿日: 2024年2月13日作成者: jarxiv

要約マルチモーダル大規模言語モデル (MLLM) は最近、ビジュアル質問応答 … 続きを読む →

カテゴリー: cs.CL, cs.CV | コメントを受け付けていません

A Closer Look at the Robustness of Contrastive Language-Image Pre-Training (CLIP)

投稿日: 2024年2月13日作成者: jarxiv

要約 Contrastive Language-Image Pre-traini … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Context-aware Multi-Model Object Detection for Diversely Heterogeneous Compute Systems

投稿日: 2024年2月13日作成者: jarxiv

要約近年、ディープニューラルネットワーク (DNN) は、特に自律システム … 続きを読む →

カテゴリー: cs.CV, cs.LG, cs.RO | コメントを受け付けていません

An Empirical Study Into What Matters for Calibrating Vision-Language Models

投稿日: 2024年2月13日作成者: jarxiv

要約ビジョン言語モデル (VLM) は、ゼロショット認識の主要なアプローチとし … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Novel definition and quantitative analysis of branch structure with topological data analysis

投稿日: 2024年2月13日作成者: jarxiv

要約分岐ネットワーク構造は自然界に数多く存在しますが、既存の定量的手法は分岐構 … 続きを読む →

カテゴリー: cs.CG, cs.CV, math.AT, q-bio.QM | コメントを受け付けていません

月別アーカイブ: 2024年2月

Real-World Atmospheric Turbulence Correction via Domain Adaptation

Unsupervised Discovery of Object-Centric Neural Fields

Towards Calibrated Robust Fine-Tuning of Vision-Language Models

Exploring Perceptual Limitation of Multimodal Large Language Models

Make it more specific: A novel uncertainty based airway segmentation application on 3D U-Net and its variants

Towards Perceiving Small Visual Details in Zero-shot Visual Question Answering with Multimodal LLMs

A Closer Look at the Robustness of Contrastive Language-Image Pre-Training (CLIP)

Context-aware Multi-Model Object Detection for Diversely Heterogeneous Compute Systems

An Empirical Study Into What Matters for Calibrating Vision-Language Models

Novel definition and quantitative analysis of branch structure with topological data analysis

最近の投稿

最近のコメント

アーカイブ

カテゴリー