KaLDeX: Kalman Filter based Linear Deformable Cross Attention for Retina Vessel Segmentation

要約

背景と目的: 眼科画像処理の分野では、さまざまな眼疾患の診断と管理には正確な血管セグメンテーションが最も重要です。
現在の深層学習ベースの血管セグメンテーションモデルは人間の精度に匹敵しますが、ニューラルネットワークアプリケーションで微細な血管を正確にセグメンテーションするには依然として大きな課題に直面しています。
CNN モデルでは複数のダウンサンプリング操作が必要なため、高解像度画像の細部は必然的に失われます。
この研究の目的は、繊細で細い血管を捕捉する構造を設計することです。
方法: これらの問題に対処するために、UNet++ フレームワーク内に統合されたカルマンフィルターベースの線形変形可能クロスアテンション (LDCA) モジュールを活用した血管セグメンテーションのための新しいネットワーク (KaLDeX) を提案します。
私たちのアプローチは、カルマンフィルター (KF) ベースの線形変形可能畳み込み (LD) モジュールとクロスアテンション (CA) モジュールという 2 つの主要なコンポーネントに基づいています。
LD モジュールは、標準的なコンボリューションでは見落とされる可能性がある細い血管への焦点を適応的に調整するように設計されています。
CA モジュールは、LD モジュールの詳細な機能と UNet++ アーキテクチャの高レベルの機能を統合することにより、血管構造の全体的な理解を向上させます。
最後に、永続的な相同性に基づくトポロジカル損失関数を採用して、セグメンテーションのトポロジカルな連続性を制約します。
結果: 提案された方法は、網膜眼底画像データセット (DRIVE、CHASE_BD1、および STARE) および OCTA-500 データセットの 3 mm および 6 mm で評価され、97.25%、97.77%、97.85% の平均精度 (ACC) を達成しました。
、98.89%、98.21%でした。
結論: 経験的証拠は、私たちの方法がさまざまな血管セグメンテーションデータセットで現在の最良のモデルよりも優れていることを示しています。
私たちのソースコードは https://github.com/AIEyeSystem/KalDeX から入手できます。

要約(オリジナル)

Background and Objective: In the realm of ophthalmic imaging, accurate vascular segmentation is paramount for diagnosing and managing various eye diseases. Contemporary deep learning-based vascular segmentation models rival human accuracy but still face substantial challenges in accurately segmenting minuscule blood vessels in neural network applications. Due to the necessity of multiple downsampling operations in the CNN models, fine details from high-resolution images are inevitably lost. The objective of this study is to design a structure to capture the delicate and small blood vessels. Methods: To address these issues, we propose a novel network (KaLDeX) for vascular segmentation leveraging a Kalman filter based linear deformable cross attention (LDCA) module, integrated within a UNet++ framework. Our approach is based on two key components: Kalman filter (KF) based linear deformable convolution (LD) and cross-attention (CA) modules. The LD module is designed to adaptively adjust the focus on thin vessels that might be overlooked in standard convolution. The CA module improves the global understanding of vascular structures by aggregating the detailed features from the LD module with the high level features from the UNet++ architecture. Finally, we adopt a topological loss function based on persistent homology to constrain the topological continuity of the segmentation. Results: The proposed method is evaluated on retinal fundus image datasets (DRIVE, CHASE_BD1, and STARE) as well as the 3mm and 6mm of the OCTA-500 dataset, achieving an average accuracy (ACC) of 97.25%, 97.77%, 97.85%, 98.89%, and 98.21%, respectively. Conclusions: Empirical evidence shows that our method outperforms the current best models on different vessel segmentation datasets. Our source code is available at: https://github.com/AIEyeSystem/KalDeX.

arxiv情報

著者	Zhihao Zhao,Shahrooz Faghihroohi,Yinzheng Zhao,Junjie Yang,Shipeng Zhong,Kai Huang,Nassir Navab,Boyang Li,M. Ali Nasseri
発行日	2024-10-28 16:00:42+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

KaLDeX: Kalman Filter based Linear Deformable Cross Attention for Retina Vessel Segmentation

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー