A Recent Survey of Vision Transformers for Medical Image Segmentation

要約

医療画像のセグメンテーションは、様々なヘルスケアアプリケーションにおいて重要な役割を果たし、正確な診断、治療計画、病気の監視を可能にする。近年、Vision Transformers（ViT）は、医療画像セグメンテーションにおける課題に対処するための有望な技術として浮上してきた。医用画像では通常、構造は高度に相互接続され、グローバルに分散している。ViTsはマルチスケール注意メカニズムを利用して、画像中の長距離関係をモデル化する。しかし、ViTには画像に関連した誘導バイアスや並進不変性がないため、性能に影響を与える可能性がある。最近、研究者たちは、画像の大域的な情報に加えて局所的な相関を捉えるために、ハイブリッド・ビジョン・トランスフォーマー（HVT）として知られる、CNNをアーキテクチャに組み込んだ様々なViTベースのアプローチを考え出した。本サーベイペーパーでは、医療画像セグメンテーションのためのViTsとHVTsにおける最近の進歩の詳細なレビューを提供する。また、ViTおよびHVTに基づく医用画像セグメンテーションアプローチの分類とともに、いくつかの医用画像モダリティにおけるリアルタイムアプリケーションの詳細な概要も示す。本調査は、研究者、医療従事者、学生にとって、ViTベースの医用画像セグメンテーションのための最先端のアプローチを理解するための貴重な資料となるであろう。

要約(オリジナル)

Medical image segmentation plays a crucial role in various healthcare applications, enabling accurate diagnosis, treatment planning, and disease monitoring. In recent years, Vision Transformers (ViTs) have emerged as a promising technique for addressing the challenges in medical image segmentation. In medical images, structures are usually highly interconnected and globally distributed. ViTs utilize their multi-scale attention mechanism to model the long-range relationships in the images. However, they do lack image-related inductive bias and translational invariance, potentially impacting their performance. Recently, researchers have come up with various ViT-based approaches that incorporate CNNs in their architectures, known as Hybrid Vision Transformers (HVTs) to capture local correlation in addition to the global information in the images. This survey paper provides a detailed review of the recent advancements in ViTs and HVTs for medical image segmentation. Along with the categorization of ViT and HVT-based medical image segmentation approaches we also present a detailed overview of their real-time applications in several medical image modalities. This survey may serve as a valuable resource for researchers, healthcare practitioners, and students in understanding the state-of-the-art approaches for ViT-based medical image segmentation.

arxiv情報

著者	Asifullah Khan,Zunaira Rauf,Abdul Rehman Khan,Saima Rathore,Saddam Hussain Khan,Sahar Shah,Umair Farooq,Hifsa Asif,Aqsa Asif,Umme Zahoora,Rafi Ullah Khalil,Suleman Qamar,Umme Hani Asif,Faiza Babar Khan,Abdul Majid,Jeonghwan Gwak
発行日	2023-12-01 14:54:44+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, DeepL

A Recent Survey of Vision Transformers for Medical Image Segmentation

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー