AuToMATo: A Parameter-Free Persistence-Based Clustering Algorithm

要約

我々は、永続的な相同性に基づく新しいパラメータフリーのクラスタリングアルゴリズムである AutoToMATo を紹介します。
AuToMATo は、推定密度関数の有意なピークを有意でないピークから分離するために、既存の ToMATo クラスタリングアルゴリズムとブートストラップ手順を組み合わせます。
私たちは、他の多くの最先端のクラスタリングアルゴリズムに対して AuToMATo を徹底的に比較します。
AuToMATo は他のパラメーターを使用しないクラスタリングアルゴリズムと比べて優れているだけでなく、多くの場合、他のアルゴリズムで最適なパラメーターを選択した場合でも大幅に優れていることがわかりました。
AutoToMATo は、トポロジカルデータ分析、特にパラメーターを使用しないクラスタリングアルゴリズムを使用することが望ましいマッパーアルゴリズムでのアプリケーションによって動機付けられています。
実際、私たちは AutoToMATo が Mapper と併用すると良好なパフォーマンスを発揮するという証拠を提供しています。
最後に、standardscikit-learn アーキテクチャと完全に互換性のある Python での AuToMATo のオープンソース実装を提供します。

要約(オリジナル)

We present AuToMATo, a novel parameter-free clustering algorithm based on persistent homology. AuToMATo combines the existing ToMATo clustering algorithm with a bootstrapping procedure in order to separate significant peaks of an estimated density function from non-significant ones. We perform a thorough comparison of AuToMATo against many other state-of-the-art clustering algorithms. We find that not only that AuToMATo compares favorably against other parameter-free clustering algorithms, but in many instances also significantly outperforms even the best selection of parameters for other algorithms. AuToMATo is motivated by applications in topological data analysis, in particular the Mapper algorithm, where it is desirable to work with a parameter-free clustering algorithm. Indeed, we provide evidence that AuToMATo performs well when used with Mapper. Finally, we provide an open-source implementation of AuToMATo in Python that is fully compatible with the standardscikit-learn architecture.

arxiv情報

著者	Marius Huber,Sara Kalisnik,Patrick Schnider
発行日	2024-08-13 15:15:37+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

AuToMATo: A Parameter-Free Persistence-Based Clustering Algorithm

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー