月別アーカイブ: 2024年6月

MedThink: Inducing Medical Large-scale Visual Language Models to Hallucinate Less by Thinking More

投稿日: 2024年6月19日作成者: jarxiv

要約 Large Vision Language Model (LVLM) をマ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Federated Learning with a Single Shared Image

投稿日: 2024年6月19日作成者: jarxiv

要約 Federated Learning (FL) を使用すると、プライベート … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Online Anchor-based Training for Image Classification Tasks

投稿日: 2024年6月19日作成者: jarxiv

要約この論文では、\textit{Online Anchor-based Tr … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Do More Details Always Introduce More Hallucinations in LVLM-based Image Captioning?

投稿日: 2024年6月19日作成者: jarxiv

要約 Large Vision-Language Model (LVLM) は、 … 続きを読む →

カテゴリー: cs.AI, cs.CV | コメントを受け付けていません

Disturbing Image Detection Using LMM-Elicited Emotion Embeddings

投稿日: 2024年6月19日作成者: jarxiv

要約この論文では、大規模マルチモーダルモデル (LMM) にエンコードされた … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models

投稿日: 2024年6月19日作成者: jarxiv

要約識別および生成事前トレーニングの最近の進歩により、強力な一般化機能を備えた … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Spatial Sequence Attention Network for Schizophrenia Classification from Structural Brain MR Images

投稿日: 2024年6月19日作成者: jarxiv

要約統合失調症は、個人の認知能力、行動、社会的相互作用に大きな影響を与える衰弱 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

The Lie Derivative for Measuring Learned Equivariance

投稿日: 2024年6月19日作成者: jarxiv

要約等分散により、モデルの予測がデータ内の重要な対称性を捉えていることが保証さ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, stat.ML | コメントを受け付けていません

Online-Adaptive Anomaly Detection for Defect Identification in Aircraft Assembly

投稿日: 2024年6月19日作成者: jarxiv

要約異常検出は、データ内の確立されたパターンからの逸脱を検出することを扱います … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.RO | コメントを受け付けていません

SUPER: Selfie Undistortion and Head Pose Editing with Identity Preservation

投稿日: 2024年6月19日作成者: jarxiv

要約近距離から撮影したセルフポートレートは、大きな歪みにより顔の特徴が奇形にな … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

月別アーカイブ: 2024年6月

MedThink: Inducing Medical Large-scale Visual Language Models to Hallucinate Less by Thinking More

Federated Learning with a Single Shared Image

Online Anchor-based Training for Image Classification Tasks

Do More Details Always Introduce More Hallucinations in LVLM-based Image Captioning?

Disturbing Image Detection Using LMM-Elicited Emotion Embeddings

GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models

Spatial Sequence Attention Network for Schizophrenia Classification from Structural Brain MR Images

The Lie Derivative for Measuring Learned Equivariance

Online-Adaptive Anomaly Detection for Defect Identification in Aircraft Assembly

SUPER: Selfie Undistortion and Head Pose Editing with Identity Preservation

最近の投稿

最近のコメント

アーカイブ

カテゴリー