投稿者「jarxiv」のアーカイブ

SHTOcc: Effective 3D Occupancy Prediction with Sparse Head and Tail Voxels

投稿日: 2025年5月29日作成者: jarxiv

要約 3D占有予測は、強力な幾何学的認識とオブジェクト認識能力のために、自律運転 … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Single Domain Generalization for Alzheimer’s Detection from 3D MRIs with Pseudo-Morphological Augmentations and Contrastive Learning

投稿日: 2025年5月29日作成者: jarxiv

要約 AlzheimerのMRISによる疾患検出は、現代の深い学習モデルのおかげ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

VisCRA: A Visual Chain Reasoning Attack for Jailbreaking Multimodal Large Language Models

投稿日: 2025年5月29日作成者: jarxiv

要約マルチモーダル大手言語モデル（MLRMS）の出現により、強化学習と考え方（ … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

A Closer Look at Multimodal Representation Collapse

投稿日: 2025年5月29日作成者: jarxiv

要約私たちは、モダリティ崩壊の基本的な理解を開発することを目指しています。これ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Understanding Adversarial Training with Energy-based Models

投稿日: 2025年5月29日作成者: jarxiv

要約エネルギーベースのモデル（EBM）フレームワークを使用して、分類器の敵対的 … 続きを読む →

カテゴリー: cs.CV, cs.LG | コメントを受け付けていません

Cascaded 3D Diffusion Models for Whole-body 3D 18-F FDG PET/CT synthesis from Demographics

投稿日: 2025年5月29日作成者: jarxiv

要約人口統計学的変数から直接高忠実度の3D PET/CTボリュームを合成するた … 続きを読む →

カテゴリー: cs.CV, cs.GR, eess.IV | コメントを受け付けていません

ProCrop: Learning Aesthetic Image Cropping from Professional Compositions

投稿日: 2025年5月29日作成者: jarxiv

要約画像のトリミングは、写真の視覚的な魅力と物語の影響を高めるために重要ですが … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Risk-Sensitive Conformal Prediction for Catheter Placement Detection in Chest X-rays

投稿日: 2025年5月29日作成者: jarxiv

要約この論文では、胸部X線でのカテーテルとラインの位置検出に対する新しいアプロ … 続きを読む →

カテゴリー: cs.CV, eess.IV, stat.AP | コメントを受け付けていません

The Meeseeks Mesh: Spatially Consistent 3D Adversarial Objects for BEV Detector

投稿日: 2025年5月29日作成者: jarxiv

要約 3Dオブジェクト検出は、自律駆動システムの重要なコンポーネントです。さま … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

Latent Beam Diffusion Models for Decoding Image Sequences

投稿日: 2025年5月29日作成者: jarxiv

要約拡散モデルは、テキストプロンプトから高品質の画像を生成することに優れていま … 続きを読む →

カテゴリー: cs.CV | コメントを受け付けていません

投稿者「jarxiv」のアーカイブ

SHTOcc: Effective 3D Occupancy Prediction with Sparse Head and Tail Voxels

Single Domain Generalization for Alzheimer’s Detection from 3D MRIs with Pseudo-Morphological Augmentations and Contrastive Learning

VisCRA: A Visual Chain Reasoning Attack for Jailbreaking Multimodal Large Language Models

A Closer Look at Multimodal Representation Collapse

Understanding Adversarial Training with Energy-based Models

Cascaded 3D Diffusion Models for Whole-body 3D 18-F FDG PET/CT synthesis from Demographics

ProCrop: Learning Aesthetic Image Cropping from Professional Compositions

Risk-Sensitive Conformal Prediction for Catheter Placement Detection in Chest X-rays

The Meeseeks Mesh: Spatially Consistent 3D Adversarial Objects for BEV Detector

Latent Beam Diffusion Models for Decoding Image Sequences

最近の投稿

最近のコメント

アーカイブ

カテゴリー