Towards Privacy-Preserving Relational Data Synthesis via Probabilistic Relational Models

要約

確率的リレーショナルモデルは、一次論理と確率的モデルを組み合わせる十分に確立された形式主義を提供し、それによってリレーショナルドメイン内のオブジェクト間の関係を表現できるようになります。
同時に、人工知能の分野では、さまざまな機械学習タスクのために、ますます大量のリレーショナルトレーニングデータが必要になります。
ただし、現実世界のデータを収集することは、プライバシーの問題、データ保護規制、高コストなどにより困難な場合が多いです。
これらの課題を軽減するには、合成データの生成が有望なアプローチです。
この論文では、確率的リレーショナルモデルを介して合成リレーショナルデータを生成する問題を解決します。
特に、リレーショナルデータベースから確率的リレーショナルモデルに移行するための本格的なパイプラインを提案します。これは、基礎となる確率分布から新しい合成リレーショナルデータポイントをサンプリングするために使用できます。
私たちが提案するパイプラインの一部として、与えられたリレーショナルデータベースから確率的リレーショナルモデルを構築する学習アルゴリズムを導入します。

要約(オリジナル)

Probabilistic relational models provide a well-established formalism to combine first-order logic and probabilistic models, thereby allowing to represent relationships between objects in a relational domain. At the same time, the field of artificial intelligence requires increasingly large amounts of relational training data for various machine learning tasks. Collecting real-world data, however, is often challenging due to privacy concerns, data protection regulations, high costs, and so on. To mitigate these challenges, the generation of synthetic data is a promising approach. In this paper, we solve the problem of generating synthetic relational data via probabilistic relational models. In particular, we propose a fully-fledged pipeline to go from relational database to probabilistic relational model, which can then be used to sample new synthetic relational data points from its underlying probability distribution. As part of our proposed pipeline, we introduce a learning algorithm to construct a probabilistic relational model from a given relational database.

arxiv情報

著者	Malte Luttermann,Ralf Möller,Mattis Hartwig
発行日	2024-10-02 17:01:58+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Towards Privacy-Preserving Relational Data Synthesis via Probabilistic Relational Models

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー