A New Graph Node Classification Benchmark: Learning Structure from Histology Cell Graphs

要約

未調査のドメインでのノード分類のための新しいベンチマークデータセット、胎盤を紹介します: 胎盤組織学全体のスライド画像の細胞グラフから微細解剖学的組織構造を予測します。
この問題は、いくつかの理由から、グラフ学習にとって非常に困難です。
セルグラフが大きい (画像あたり 100 万ノードを超える)、ノードの特徴が多様である (11 種類のセルの 64 次元)、クラスラベルが不均衡である (データの 0.21% から 40.0% までの 9 つのクラス)、およびセルラーコミュニティ
さまざまなサイズ (単一構造で 11 ノードから 44,671 ノード) の不均一に分散した組織にクラスター化します。
ここでは、合計 2,395,747 ノードの 2 つの胎盤組織像からの 2 つのセルグラフで構成されるデータセットをリリースし、そのうち 799,745 にはグラウンドトゥルースラベルがあります。
7 つのスケーラブルなモデルの帰納的ベンチマーク結果を提示し、セルグラフの独自の品質が新しいグラフニューラルネットワークアーキテクチャの開発を促進するのにどのように役立つかを示します。

要約(オリジナル)

We introduce a new benchmark dataset, Placenta, for node classification in an underexplored domain: predicting microanatomical tissue structures from cell graphs in placenta histology whole slide images. This problem is uniquely challenging for graph learning for a few reasons. Cell graphs are large (>1 million nodes per image), node features are varied (64-dimensions of 11 types of cells), class labels are imbalanced (9 classes ranging from 0.21% of the data to 40.0%), and cellular communities cluster into heterogeneously distributed tissues of widely varying sizes (from 11 nodes to 44,671 nodes for a single structure). Here, we release a dataset consisting of two cell graphs from two placenta histology images totalling 2,395,747 nodes, 799,745 of which have ground truth labels. We present inductive benchmark results for 7 scalable models and show how the unique qualities of cell graphs can help drive the development of novel graph neural network architectures.

arxiv情報

著者	Claudia Vanea,Jonathan Campbell,Omri Dodi,Liis Salumäe,Karen Meir,Drorith Hochner-Celnikier,Hagit Hochner,Triin Laisk,Linda M. Ernst,Cecilia M. Lindgren,Christoffer Nellåker
発行日	2022-11-11 16:02:29+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

A New Graph Node Classification Benchmark: Learning Structure from Histology Cell Graphs

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー