Vision Transformers in Medical Imaging: A Review

要約

Attention ベースのエンコーダー/デコーダーアーキテクチャで構成されるモデルである Transformer は、自然言語処理 (NLP) の分野で普及しており、最近ではコンピュータービジョン (CV) の分野にも影響を与えています。
コンピュータビジョンと医用画像の類似点は、トランスフォーマーがコンピュータビジョンに与える影響を医用画像に変換できるかどうかという研究者の間の疑問を再検討したものです。
この論文では、医用画像における変圧器の適用に関する包括的かつ最近のレビューを提供しようとします。
さまざまな畳み込みニューラルネットワーク (CNN) と比較したトランスフォーマーモデルの説明、画像モダリティに焦点を当てた医用画像の分類、セグメンテーション、登録、および再構成のためのトランスフォーマーベースのアプローチの詳細、最新のパフォーマンスの比較
アートトランスフォーマーアーキテクチャを、標準的な医療データセットで最高のパフォーマンスを発揮する CNN に変換します。

要約(オリジナル)

Transformer, a model comprising attention-based encoder-decoder architecture, have gained prevalence in the field of natural language processing (NLP) and recently influenced the computer vision (CV) space. The similarities between computer vision and medical imaging, reviewed the question among researchers if the impact of transformers on computer vision be translated to medical imaging? In this paper, we attempt to provide a comprehensive and recent review on the application of transformers in medical imaging by; describing the transformer model comparing it with a diversity of convolutional neural networks (CNNs), detailing the transformer based approaches for medical image classification, segmentation, registration and reconstruction with a focus on the image modality, comparing the performance of state-of-the-art transformer architectures to best performing CNNs on standard medical datasets.

arxiv情報

著者	Emerald U. Henry,Onyeka Emebob,Conrad Asotie Omonhinmin
発行日	2022-11-18 05:52:37+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Vision Transformers in Medical Imaging: A Review

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー