Scalable Whole Slide Image Representation Using K-Mean Clustering and Fisher Vector Aggregation

要約

スライド全体画像 (WSI) は、高解像度のギガピクセルサイズの画像であり、そのサイズと不均一性により、従来の機械学習モデルにとって重大な計算上の課題が生じます。この論文では、パッチベースを活用することにより、WSI 分類のためのスケーラブルで効率的な方法論を紹介します。
特徴抽出、クラスタリング、フィッシャーベクトルエンコーディング。
最初に、WSI は固定サイズのパッチに分割され、事前トレーニングされた畳み込みニューラルネットワーク (CNN) を使用して各パッチから深い特徴の埋め込みが抽出されます。
これらのパッチレベルの埋め込みは、K 平均法クラスタリングを使用してクラスタリングされます。各クラスタは、WSI の意味的に類似した領域を集約します。
各クラスターを効果的に要約するために、各クラスター内のパッチ埋め込みの分布をパラメトリックガウス混合モデル (GMM) としてモデル化することにより、フィッシャーベクトル表現が計算されます。
各クラスターのフィッシャーベクトルは高次元の特徴ベクトルに連結され、WSI 全体のコンパクトで有益な表現が作成されます。
この特徴ベクトルは、WSI の診断ラベルを予測するために分類器によって使用されます。
私たちの方法は、局所的および全体的な組織構造を捕捉し、大規模な WSI 分類に対して堅牢なパフォーマンスをもたらし、他のアプローチと比較して優れた精度と拡張性を実証します。

要約(オリジナル)

Whole slide images (WSIs) are high-resolution, gigapixel sized images that pose significant computational challenges for traditional machine learning models due to their size and heterogeneity.In this paper, we present a scalable and efficient methodology for WSI classification by leveraging patch-based feature extraction, clustering, and Fisher vector encoding. Initially, WSIs are divided into fixed size patches, and deep feature embeddings are extracted from each patch using a pre-trained convolutional neural network (CNN). These patch-level embeddings are subsequently clustered using K-means clustering, where each cluster aggregates semantically similar regions of the WSI. To effectively summarize each cluster, Fisher vector representations are computed by modeling the distribution of patch embeddings in each cluster as a parametric Gaussian mixture model (GMM). The Fisher vectors from each cluster are concatenated into a high-dimensional feature vector, creating a compact and informative representation of the entire WSI. This feature vector is then used by a classifier to predict the WSI’s diagnostic label. Our method captures local and global tissue structures and yields robust performance for large-scale WSI classification, demonstrating superior accuracy and scalability compared to other approaches.

arxiv情報

著者	Ravi Kant Gupta,Shounak Das,Ardhendu Sekhar,Amit Sethi
発行日	2025-01-21 12:22:15+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Scalable Whole Slide Image Representation Using K-Mean Clustering and Fisher Vector Aggregation

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー