Federated Causal Discovery from Heterogeneous Data

要約

従来の因果関係発見手法は集中化されたデータに依存しており、現実世界の多くの状況におけるデータの分散化された性質と矛盾しています。
この矛盾は、連合因果関係発見 (FCD) アプローチの開発の動機となっています。
ただし、既存の FCD 手法は、識別可能な機能因果モデルまたは均一なデータ分布の潜在的に制限的な仮定によって制限される可能性があり、さまざまなシナリオでの適用可能性が狭まります。
この論文では、任意の因果モデルと異種データに対応することを試みる新しい FCD 手法を提案します。
まず、クライアントインデックスに対応するサロゲート変数を利用して、異なるクライアント間のデータの異質性を考慮します。
次に、因果骨格発見のための連合条件付き独立性テスト (FCIT) を開発し、因果方向を決定するための連合独立変更原則 (FICP) を確立します。
これらのアプローチには、データのプライバシーを保護するために、生データの代理として概要統計を構築することが含まれます。
ノンパラメトリックな特性により、FCIT と FICP は特定の関数形式について仮定を行わないため、任意の因果モデルの処理が容易になります。
私たちは、手法の有効性を示すために、合成データセットと実際のデータセットに対して広範な実験を実施しています。
コードは \url{https://github.com/lokali/FedCDH.git} で入手できます。

要約(オリジナル)

Conventional causal discovery methods rely on centralized data, which is inconsistent with the decentralized nature of data in many real-world situations. This discrepancy has motivated the development of federated causal discovery (FCD) approaches. However, existing FCD methods may be limited by their potentially restrictive assumptions of identifiable functional causal models or homogeneous data distributions, narrowing their applicability in diverse scenarios. In this paper, we propose a novel FCD method attempting to accommodate arbitrary causal models and heterogeneous data. We first utilize a surrogate variable corresponding to the client index to account for the data heterogeneity across different clients. We then develop a federated conditional independence test (FCIT) for causal skeleton discovery and establish a federated independent change principle (FICP) to determine causal directions. These approaches involve constructing summary statistics as a proxy of the raw data to protect data privacy. Owing to the nonparametric properties, FCIT and FICP make no assumption about particular functional forms, thereby facilitating the handling of arbitrary causal models. We conduct extensive experiments on synthetic and real datasets to show the efficacy of our method. The code is available at \url{https://github.com/lokali/FedCDH.git}.

arxiv情報

著者	Loka Li,Ignavier Ng,Gongxu Luo,Biwei Huang,Guangyi Chen,Tongliang Liu,Bin Gu,Kun Zhang
発行日	2024-02-20 18:53:53+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Federated Causal Discovery from Heterogeneous Data

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー