「cs.AI」カテゴリーアーカイブ

SeTAR: Out-of-Distribution Detection with Selective Low-Rank Approximation

投稿日: 2024年6月19日作成者: jarxiv

要約ニューラルネットワークを安全に展開するには、分布外 (OOD) の検出が … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

An Empirical Study on the Fairness of Foundation Models for Multi-Organ Image Segmentation

投稿日: 2024年6月19日作成者: jarxiv

要約 Segment Anything Model (SAM) などのセグメンテ … 続きを読む →

カテゴリー: cs.AI, cs.CV, eess.IV | コメントを受け付けていません

Probabilistic Conceptual Explainers: Trustworthy Conceptual Explanations for Vision Foundation Models

投稿日: 2024年6月19日作成者: jarxiv

要約ビジョントランスフォーマー (ViT) は、特に大規模な言語モデルと共同 … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, stat.ML | コメントを受け付けていません

Do More Details Always Introduce More Hallucinations in LVLM-based Image Captioning?

投稿日: 2024年6月19日作成者: jarxiv

要約 Large Vision-Language Model (LVLM) は、 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

The Lie Derivative for Measuring Learned Equivariance

投稿日: 2024年6月19日作成者: jarxiv

要約等分散により、モデルの予測がデータ内の重要な対称性を捉えていることが保証さ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, stat.ML | コメントを受け付けていません

Online-Adaptive Anomaly Detection for Defect Identification in Aircraft Assembly

投稿日: 2024年6月19日作成者: jarxiv

要約異常検出は、データ内の確立されたパターンからの逸脱を検出することを扱います … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention

投稿日: 2024年6月19日作成者: jarxiv

要約大規模視覚言語モデル (LVLM) は、さまざまなマルチモーダルタスクで … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Beyond Visual Appearances: Privacy-sensitive Objects Identification via Hybrid Graph Reasoning

投稿日: 2024年6月19日作成者: jarxiv

要約プライバシーに敏感なオブジェクト識別 (POI) タスクは、シーン内のプラ … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning

投稿日: 2024年6月19日作成者: jarxiv

要約大規模言語モデル (LLM) の進歩により、自然言語処理におけるアプリケー … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.CV | コメントを受け付けていません

Latent Intuitive Physics: Learning to Transfer Hidden Physics from A 3D Video

投稿日: 2024年6月19日作成者: jarxiv

要約単一の 3D ビデオから流体の隠れた特性を推測し、新しいシーンで観察された … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

「cs.AI」カテゴリーアーカイブ

SeTAR: Out-of-Distribution Detection with Selective Low-Rank Approximation

An Empirical Study on the Fairness of Foundation Models for Multi-Organ Image Segmentation

Probabilistic Conceptual Explainers: Trustworthy Conceptual Explanations for Vision Foundation Models

Do More Details Always Introduce More Hallucinations in LVLM-based Image Captioning?

The Lie Derivative for Measuring Learned Equivariance

Online-Adaptive Anomaly Detection for Defect Identification in Aircraft Assembly

AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention

Beyond Visual Appearances: Privacy-sensitive Objects Identification via Hybrid Graph Reasoning

Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning

Latent Intuitive Physics: Learning to Transfer Hidden Physics from A 3D Video

最近の投稿

最近のコメント

アーカイブ

カテゴリー