Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical Imaging

要約

この研究では、ケーススタディとしてADNIデータベースからの脳MRを使用したアルツハイマー病（AD）に焦点を当てた、効率的かつ意味のない監視されていない学習のための新しいエンコーダーデコーダー拡散ベースのフレームワークである潜在的な拡散自己エンコーダー（LDAE）を提示します。
イメージスペースで動作する従来の拡散自動エンコーダーとは異なり、LDAEは圧縮された潜在表現で拡散プロセスを適用し、計算効率を改善し、3D医療イメージング表現学習を扱いやすくします。
提案されたアプローチを検証するために、2つの重要な仮説を調査します。（i）LDAEは、ADおよび老化に関連する3D脳MRで意味のあるセマンティック表現を効果的にキャプチャし、（ii）LDAEは計算上効率的でありながら高品質の画像生成と再構築を達成します。
実験結果は両方の仮説をサポートします。（i）線形プローブの評価は、ADの有望な診断パフォーマンス（ROC-AUC：90％、ACC：84％）および年齢予測（MAE：4.1歳、RMSE：5.2歳）の有望な診断パフォーマンスを示しています。
（ii）学習したセマンティック表現は、属性操作を可能にし、解剖学的にもっともらしい修正をもたらします。
（iii）セマンティック補間実験では、6か月のギャップでSSIMが0.969（MSE：0.0019）の欠落スキャンの強い再構築を示しています。
長いギャップ（24か月）であっても、モデルは堅牢なパフォーマンス（SSIM> 0.93、MSE <0.004）を維持し、時間的進行の傾向をキャプチャする能力を示しています。（iv）従来の拡散自己エンコーダーと比較して、LDAEは推論のスループット（20倍高速）を大幅に増加させ、再構築品質も向上させます。これらの調査結果は、LDAEをスケーラブルな医療イメージングアプリケーションの有望なフレームワークとして位置付けており、医療画像分析の基礎モデルとして機能する可能性があります。 https://github.com/gabrielelozupone/ldaeで入手可能なコード

要約(オリジナル)

This study presents Latent Diffusion Autoencoder (LDAE), a novel encoder-decoder diffusion-based framework for efficient and meaningful unsupervised learning in medical imaging, focusing on Alzheimer disease (AD) using brain MR from the ADNI database as a case study. Unlike conventional diffusion autoencoders operating in image space, LDAE applies the diffusion process in a compressed latent representation, improving computational efficiency and making 3D medical imaging representation learning tractable. To validate the proposed approach, we explore two key hypotheses: (i) LDAE effectively captures meaningful semantic representations on 3D brain MR associated with AD and ageing, and (ii) LDAE achieves high-quality image generation and reconstruction while being computationally efficient. Experimental results support both hypotheses: (i) linear-probe evaluations demonstrate promising diagnostic performance for AD (ROC-AUC: 90%, ACC: 84%) and age prediction (MAE: 4.1 years, RMSE: 5.2 years); (ii) the learned semantic representations enable attribute manipulation, yielding anatomically plausible modifications; (iii) semantic interpolation experiments show strong reconstruction of missing scans, with SSIM of 0.969 (MSE: 0.0019) for a 6-month gap. Even for longer gaps (24 months), the model maintains robust performance (SSIM > 0.93, MSE < 0.004), indicating an ability to capture temporal progression trends; (iv) compared to conventional diffusion autoencoders, LDAE significantly increases inference throughput (20x faster) while also enhancing reconstruction quality. These findings position LDAE as a promising framework for scalable medical imaging applications, with the potential to serve as a foundation model for medical image analysis. Code available at https://github.com/GabrieleLozupone/LDAE

arxiv情報

著者	Gabriele Lozupone,Alessandro Bria,Francesco Fontanella,Frederick J. A. Meijer,Claudio De Stefano,Henkjan Huisman
発行日	2025-04-11 15:37:46+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical Imaging

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー