Quantum Generative Models for Image Generation: Insights from MNIST and MedMNIST

要約

量子生成モデルは、量子回路を活用してデータ生成能力を向上させることにより、機械学習における有望な新しい方向性を提供する。本研究では、変分量子回路を拡散ベースのモデルに統合したハイブリッド量子古典画像生成フレームワークを提案する。学習ダイナミクスと生成品質を向上させるために、2つの新しいノイズ戦略を導入する：内在する量子生成ノイズとテーラーメイドのノイズスケジューリング機構である。本手法は軽量なU-Netアーキテクチャ上に構築されており、量子層はその効果を分離するためにボトルネックモジュールに組み込まれている。MNISTデータセットとMedMNISTデータセットで我々のモデルを評価し、その実現可能性と性能を検証した。その結果、限られたデータ条件下（100枚以下の学習画像）では、量子モデルを用いた方が、同じアーキテクチャを用いた従来のモデルよりも、知覚品質と分布類似度が高い画像を生成できることが明らかになった。量子モデルはMNISTのようなグレースケールのデータでは優位性を示すが、PathMNISTのような複雑で色彩豊かなデータセットでは、その性能はより微妙なものとなる。これらの結果は、量子生成モデルの潜在的な可能性と現在の限界を浮き彫りにし、低リソース画像生成やバイオメディカル画像生成における今後の開発の基礎を築くものである。

要約(オリジナル)

Quantum generative models offer a promising new direction in machine learning by leveraging quantum circuits to enhance data generation capabilities. In this study, we propose a hybrid quantum-classical image generation framework that integrates variational quantum circuits into a diffusion-based model. To improve training dynamics and generation quality, we introduce two novel noise strategies: intrinsic quantum-generated noise and a tailored noise scheduling mechanism. Our method is built upon a lightweight U-Net architecture, with the quantum layer embedded in the bottleneck module to isolate its effect. We evaluate our model on MNIST and MedMNIST datasets to examine its feasibility and performance. Notably, our results reveal that under limited data conditions (fewer than 100 training images), the quantum-enhanced model generates images with higher perceptual quality and distributional similarity than its classical counterpart using the same architecture. While the quantum model shows advantages on grayscale data such as MNIST, its performance is more nuanced on complex, color-rich datasets like PathMNIST. These findings highlight both the potential and current limitations of quantum generative models and lay the groundwork for future developments in low-resource and biomedical image generation.

arxiv情報

著者	Chi-Sheng Chen,Wei An Hou,Hsiang-Wei Hu,Zhen-Sheng Cai
発行日	2025-04-03 17:40:26+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, DeepL

Quantum Generative Models for Image Generation: Insights from MNIST and MedMNIST

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー