Learning Representations for Clustering via Partial Information Discrimination and Cross-Level Interaction

要約

この論文では、PICI と呼ばれる新しい深層画像クラスタリングアプローチを紹介します。これは、共同学習フレームワークで部分情報識別とクロスレベルインタラクションを強制します。
特に、Transformer エンコーダをバックボーンとして利用し、それを通じて 2 つの並列拡張ビューによるマスクされた画像モデリングが定式化されます。
Transformer エンコーダによってマスクされた画像からクラストークンを導出した後、マスクされた画像再構成を介して自動エンコーダをトレーニングするための PISD モジュール、2 つのレベルの対照学習を採用するための PICD モジュールを含む 3 つの部分情報学習モジュールがさらに組み込まれています。
インスタンスレベルとクラスターレベルのサブスペース間の相互対話のための CLI モジュール。
6 つの現実世界の画像データセットに対して広範な実験が行われ、提案された PICI アプローチが最先端のディープクラスタリングアプローチよりも優れたクラスタリングパフォーマンスを実証しました。
ソースコードは https://github.com/Regan-Zhang/PICI で入手できます。

要約(オリジナル)

In this paper, we present a novel deep image clustering approach termed PICI, which enforces the partial information discrimination and the cross-level interaction in a joint learning framework. In particular, we leverage a Transformer encoder as the backbone, through which the masked image modeling with two paralleled augmented views is formulated. After deriving the class tokens from the masked images by the Transformer encoder, three partial information learning modules are further incorporated, including the PISD module for training the auto-encoder via masked image reconstruction, the PICD module for employing two levels of contrastive learning, and the CLI module for mutual interaction between the instance-level and cluster-level subspaces. Extensive experiments have been conducted on six real-world image datasets, which demononstrate the superior clustering performance of the proposed PICI approach over the state-of-the-art deep clustering approaches. The source code is available at https://github.com/Regan-Zhang/PICI.

arxiv情報

著者	Hai-Xin Zhang,Dong Huang,Hua-Bao Ling,Guang-Yu Zhang,Wei-jun Sun,Zi-hao Wen
発行日	2024-01-24 14:51:33+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Learning Representations for Clustering via Partial Information Discrimination and Cross-Level Interaction

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー