Can Deep Learning Reliably Recognize Abnormality Patterns on Chest X-rays? A Multi-Reader Study Examining One Month of AI Implementation in Everyday Radiology Clinical Practice

要約

本研究では、胸部X線(CXR)上の7つの特定の放射線所見(無気肺(ATE)、圧密(CON)、胸水(EFF)、肺病変(LES)、皮下気腫(SCE)、心大(CMG)、気胸(PNO))を検出・局在化する、深層学習ベースの自動検出アルゴリズム(DLAD、ケアボット AI CXR)を開発しました。956枚のCXRを収集し、DLADの性能を、病院内で画像を評価した6人の放射線科医個人の性能と比較した。提案したDLADは、高い感度(ATE 1.000 (0.624-1.000), CON 0.864 (0.671-0.956), EFF 0.953 (0.887-0.983), LES 0.905 (0.715-0.978), SCE 1.000 (0.366-1.000),CMG 0.837(0.711-0.917),PNO 0.875(0.538-0.986)), 放射線科医と比較しても(LOWEST:ATE 0.000(0.000-0.376),CON 0.182(0.070-0.382),EFF 0.400(0.302-0.506),LES 0.238(0.103-0.448),SCE 0.000(0.000-0.634),CMG 0.347(0.228-0.486),PO 0.375(0.134-0.691), 最高:EATE 1.000(0.624-1.000)、con 0.864(0.671-0.956),eff 0.953(0.887-0.983),les 0.667(0.456-0.830),sce 1.000(0.366-1.000),cmg 0.980(0.896-0.999),pno 0.875(0.538-0.986)).本研究の結果は、提案されたDLADが意思決定支援システムとして日常臨床に組み込まれる可能性があり、若手や中堅の放射線技師に関連する偽陰性率を効果的に軽減することを実証しています。

要約(オリジナル)

In this study, we developed a deep-learning-based automatic detection algorithm (DLAD, Carebot AI CXR) to detect and localize seven specific radiological findings (atelectasis (ATE), consolidation (CON), pleural effusion (EFF), pulmonary lesion (LES), subcutaneous emphysema (SCE), cardiomegaly (CMG), pneumothorax (PNO)) on chest X-rays (CXR). We collected 956 CXRs and compared the performance of the DLAD with that of six individual radiologists who assessed the images in a hospital setting. The proposed DLAD achieved high sensitivity (ATE 1.000 (0.624-1.000), CON 0.864 (0.671-0.956), EFF 0.953 (0.887-0.983), LES 0.905 (0.715-0.978), SCE 1.000 (0.366-1.000), CMG 0.837 (0.711-0.917), PNO 0.875 (0.538-0.986)), even when compared to the radiologists (LOWEST: ATE 0.000 (0.000-0.376), CON 0.182 (0.070-0.382), EFF 0.400 (0.302-0.506), LES 0.238 (0.103-0.448), SCE 0.000 (0.000-0.634), CMG 0.347 (0.228-0.486), PNO 0.375 (0.134-0.691), HIGHEST: ATE 1.000 (0.624-1.000), CON 0.864 (0.671-0.956), EFF 0.953 (0.887-0.983), LES 0.667 (0.456-0.830), SCE 1.000 (0.366-1.000), CMG 0.980 (0.896-0.999), PNO 0.875 (0.538-0.986)). The findings of the study demonstrate that the suggested DLAD holds potential for integration into everyday clinical practice as a decision support system, effectively mitigating the false negative rate associated with junior and intermediate radiologists.

arxiv情報

著者 Daniel Kvak,Anna Chromcová,Petra Ovesná,Jakub Dandár,Marek Biroš,Robert Hrubý,Daniel Dufek,Marija Pajdaković
発行日 2023-06-02 14:18:54+00:00
arxivサイト arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, DeepL

カテゴリー: cs.CV, cs.LG, eess.IV パーマリンク