- Research
- Open access
- Published:
A hybrid quantum-classical conditional generative adversarial network algorithm for human-centered paradigm in cloud
EURASIP Journal on Wireless Communications and Networking volume 2021, Article number: 37 (2021)
Abstract
As an emerging field that aims to bridge the gap between human activities and computing systems, human-centered computing (HCC) in cloud, edge, fog has had a huge impact on the artificial intelligence algorithms. The quantum generative adversarial network (QGAN) is considered to be one of the quantum machine learning algorithms with great application prospects, which also should be improved to conform to the human-centered paradigm. The generation process of QGAN is relatively random and the generated model does not conform to the human-centered concept, so it is not quite suitable for real scenarios. In order to solve these problems, a hybrid quantum-classical conditional generative adversarial network (QCGAN) algorithm is proposed, which is a knowledge-driven human–computer interaction computing mode that can be implemented in cloud. The purposes of stabilizing the generation process and realizing the interaction between human and computing process are achieved by inputting artificial conditional information in the generator and discriminator. The generator uses the parameterized quantum circuit with an all-to-all connected topology, which facilitates the tuning of network parameters during the training process. The discriminator uses the classical neural network, which effectively avoids the “input bottleneck” of quantum machine learning. Finally, the BAS training set is selected to conduct experiment on the quantum cloud computing platform. The result shows that the QCGAN algorithm can effectively converge to the Nash equilibrium point after training and perform human-centered classification generation tasks.
1 Introduction
With the development of wireless communications and networking, human-centered computing (HCC) in cloud, edge, and fog attempts to effectively integrate various computing elements related to humans [1, 2], which becomes a common focus of attention in the academic and industrial fields. Unlike other ordinary computing, HCC pays more attention to the status of human in computing technology and the interaction of humans with cyberspace and physical world [3]. Therefore, the design of HCC systems and algorithms needs to take into account the individual’s ability and subjective initiative [4, 5]. Among them, cloud computing uses a super-large-scale distributed computing method to adapt to the large number of examples and complex calculation requirements of current artificial intelligence (AI) algorithms, and it has become a computing method commonly sought [6, 7]. In the background of HCC computing and big data, there are many interesting and practical applications generating [8,9,10]. Privacy is also an important norm that computing models must pay attention to, especially related to privacy perception and privacy protection [11,12,13].
Quantum cloud computing allows users to test and develop their quantum programs on local personal computers, and run them on actual quantum devices, thereby reducing the distance between humans and the mysterious quantum [14]. Under the influence of the AI wave, many technology companies are committed to establishing quantum cloud computing platforms that enable users to implement quantum machine learning algorithms. Compared with the two major models of machine learning, the generative model and the discriminant model, the generative model is more capable of exerting human subjective initiative, so it has the potential to developed into the HCC paradigm. Therefore, we consider the very creative quantum generative adversarial network model as a breakthrough in HCC computing in cloud.
Generative adversarial network (GAN) [15] evaluates generative models through a set of adversarial neural network frameworks, which is a hot topic in recent years about generative machine learning algorithm. The GAN algorithm bases on game theory scenario, and the generator aims to learn the mapping from simple input distribution to complex training sample space by competing with discriminator. As the adversary, the discriminator should judge as accurately as possible whether the input data comes from the training set or the generator. Both participants of the game try to minimize their own loss, so that the adversarial network framework finally reaches the Nash equilibrium [16]. In recent years, GAN has been successfully used in the fields of the processing of image, audio, natural language etc., to achieve functions such as clear image generation [17, 18], video prediction [19], text summarization [20], and image generation of semantic [21]. Actually, it is difficult to ensure stable training of GAN in operation. Researchers use the relevant results obtained by deep learning to improve GAN, including methods such as designing new network structures [22], adding regular constraints [23], integrated learning [24], and improving optimization algorithms [25]. However, the improved algorithms above are not human-centered, because the rules learned by the GAN algorithm are implicit. It is difficult to generate data that meets specific requirements by changing the structure or input of a trained generator. In 2014, Mirza et al. proposed conditional generative adversarial network (CGAN) [26]. This method guides GAN to learn to sample from the conditional distribution by adding conditional constraints to the hidden variables of the input layer, so that the generative data can be guided by conditional inputs, thereby expanding the application scenarios of the GAN algorithm. In the construction, the setting of conditional constraints can make the subjective initiative of people play a role, so it can be regarded as an HCC algorithm. Based on the CGAN algorithm, many human-centered applications have been constructed, such as objects detection [27], medical images processing and synthesis [28, 29].
Quantum generative adversarial network (QGAN) is a data-driven quantum circuit machine learning algorithm which combine the classical GAN and quantum computing [30]. In 2018, Lloyd proposed the concept of QGAN [31], which analyzed the effectiveness of three different QGAN frameworks from a theoretical perspective, and demonstrated that quantum adversarial learning can also reach the Nash equilibrium point when the generative distribution can fit real distribution. In the same year, Pierre’s team discussed QGAN in more detail, by giving the general structure of the parameterized quantum circuit (PQC) as a generator and the estimation method of the parameter gradient when training the network [32]. In 2019, Hu et al. used quantum superconducting circuit physics experiments to prove the feasibility of QGAN on current noisy intermediate-scale quantum (NISQ) devices [33]. Additionally, the optimization of the quantum generator structure is also one of the research priorities. For example, using matrix product state [34] and tree tensor network [35] to construct PQCs as generator and discriminator of QGAN respectively, the convergence and robustness to noise of these methods are all verified through experiments on quantum hardware.
In terms of generating quantum data, the quantum supremacy means that classical information processors or neural networks sometimes cannot fit the data generated by quantum systems, and only quantum generator can complete such tasks. For the generation of classical data, the output of quantum generator can always meet the differentiable constraint. By sampling the output of quantum generator, classical discrete data can be obtained. In contrast, classical GAN cannot directly generate discrete data due to the influence of differentiable constraint. Therefore, as a complement to the classical GAN, QGAN with the ability to generate discrete data and the combination of other known variants of GAN and quantum mechanical mechanisms are of great research value.
Similar to classical GAN, QGAN also has the problem of uncontrollable training process and random generative output. However, in practical applications, the intent output obtained by changing the input is a more common situation, so QGAN is less practical. In order to solve the problem that the QGAN algorithm lacks human-oriented thinking, this paper proposes a hybrid quantum-classical scheme based on conditional generative adversarial network. Conditional constraints are added to the QGAN algorithm to guide the training process. This method has both the controllability of CGAN and the discrete data generation capability of QGAN. By analyzing the performance of different GAN, it is proved that this algorithm is better than the classical CGAN in terms of time complexity and algorithm functions. Through modeling and training experiments in cloud on classical data generation problem, the convergence of the model and the accuracy of the generative data verify the feasibility of applying quantum computing to the CGAN structure.
The rest of the paper is organized as follows. Section 2 describes the preliminaries about classical GAN and QGAN. Section 3 presents the design focus of QCGAN, including the method of designing the PQCs and estimating the parameter gradients. The performance analysis of QCGAN and the comparison with other related algorithms are in Sect. 4. In Sect. 5, experiments are performed in the quantum cloud computing platform to verify the feasibility of the proposed QCGAN algorithm. Section 6 summarizes what we find in this work and the prospects for future researches.
2 Principles of generative adversarial network algorithm
2.1 Generative adversarial network
The core idea of the classical GAN is to construct a zero-sum game between generator and discriminator. Through the adversarial learning strategy, generator and discriminator are alternately trained to obtain a better generative model. The structure and algorithm flowchart of GAN are shown in Fig. 1.
Specifically, the first step is to give training samples as generation target, assuming that the real data come from a fixed and unknown distribution \(p_{{\mathrm{real}}}\left( x \right)\). The generator is a neural network that can map low-dimensional distribution to high-dimensional space, and the discriminator is a neural network with classification function. The parameters of generator and discriminator are denoted as \({\overrightarrow{\theta }_{{\mathrm{G}}}}\) and \({\overrightarrow{\theta }_{\mathrm{D}}}\), respectively. The input of generator is a noise vector z, which is generally sampled from a normal distribution or a uniform distribution; \(x = G\left( {{{\overrightarrow{\theta }}_{\mathrm{G}}},z} \right)\) is the output of generator, which is transformed from the noise vector, and constitutes the generative distribution \({p_{\mathrm{G}}}\left( x \right)\). In the case of completing the ideal adversarial training, the discriminator will not be able to distinguish whether the input comes from the real distribution \({p_{{\mathrm{real}}}}\left( x \right)\) or the generative distribution \({p_{\mathrm{G}}}\left( x \right)\). Therefore, the goal of training generator is to make discriminator distinguish the output of generator as real data as much as possible. On the other hand, when training discriminator, its input contains real data \(x \sim {p_{{\mathrm{real}}}}\left( x \right)\) and the output of generator \(x \sim {p_{\mathrm{G}}}\left( x \right)\). At this time, the training goal is to accurately judge the two categories of input data. Combining these two aspects, the optimization of GAN can be described as the following minimax game problem
2.2 Conditional generative adversarial network
In view of the uncontrollable shortcoming of the training process of GAN, the CGAN algorithm adds conditional variables to the input of generator and discriminator at the same time to play a constraining and guiding role. The structure and flowchart of CGAN algorithm are shown in Fig. 2. The conditional variables y are generally known information with specific semantics, such as feature labels. Under the CGAN framework, the generator pays more attention to sample features that are closely related to conditional constraints, ignores other less relevant local features. Therefore, the addition of conditional variables can control the training process to generate higher quality data. The output of the generator can be regarded as sampling from the conditional distribution \({p_{\mathrm{G}}}\left( {x\left| y \right. } \right)\), so the objective function of CGAN can be rewritten on the basis of the original GAN as
CGAN needs to sample from the noise vector and the condition variable at the same time, so the set of reasonable conditional variable according to the generation target plays a crucial role in the generator’s ability to fit the real distribution. The most common method is to directly extract the conditional variables from the training data, so that generator and discriminator get some prior knowledge about the training set when they receive the input. For example, the category label is used as a conditional variable and attached to the input layer of the adversarial network [26]. At this time, CGAN can be regarded as an improvement of the unsupervised GAN model into a weakly supervised or a supervised model.
2.3 Quantum generative adversarial network
The QGAN is also a zero-sum game that constructed by generator and discriminator in principle. If one or more than one of the real data, the generator and the discriminator obey the quantum mechanism, the constructed algorithm scheme belongs to the QGAN concept. In general, the quantum data set is expressed in the form of a density matrix, which corresponds to the covariance matrix of the classical data set. Quantum generator and discriminator are composed of PQC. The selection, arrangement, and depth of quantum gates of PQC will affect the performance of it, so they are also the parts that can be optimized.
When QGAN is used for classical data generation tasks, if the goal of the generator is to reproduce statistical data on high-dimensional, QGAN with quantum generator has the potential to exponentially accelerate the convergence to Nash equilibrium [31]. Using classical neural networks as the discriminator in adversarial learning can avoid the input bottleneck of quantum machine learning, because it reduces the calculation and resource consumption of quantum state encoding when discriminate real classical data. Combining the above two aspects, the QCGAN algorithm proposed in this paper is based on the basic settings of the quantum generator and the classical discriminator to generate classical data. The structure and algorithm flowchart of this kind of QGAN algorithm are shown in Fig. 3.
3 Quantum conditional generative adversarial network algorithm
The QCGAN algorithm proposed in this paper is a generative adversarial network model which is suitable for fitting classical data distribution, whose generation process is controllable. The generator of QCGAN is constructed in the form of the parameterized quantum circuit, and the discriminator uses a classical neural network to complete the classification task. Different from the unconstrained QGAN algorithm, the QCGAN algorithm adds conditional variables to the input of both generator and discriminator to guide the training process. The basic flow of the algorithm can be summarized as follows (as shown in Fig. 4): the first step is to prepare classical samples and introduce appropriate conditional constraints according to the data characteristics as well as the goal of generation task. These two parts are combined to form the training data set of the network. The classical conditional constraints, which reflect the statistical characteristics of the training data set, are encoded into a entangled quantum state through a well-designed quantum circuit. The next step is to construct the PQC of the generator and the classical neural network of discriminator. Finally, the generative distribution and the real distribution are sampled separately and input these data to the discriminator for classification, and then an adversarial strategy is formulated for training. If the objective function converges, it means finding the best quantum generator. The output of the generator can be sampled to get a set of classical data, which is the result not only fits the target distribution but also meets the constraints.
3.1 Entangled state coding of conditional information and circuit design
For the quantum scheme of CGAN, an important topic is how to input the classical conditional variables into the quantum generator, which involves the quantum state encoding of the conditional variables and the circuit design for preparing this quantum state. In this paper, taking the representative category labels in the conditional variables as an example, the method of coding the entangled state of conditional information and designing corresponding circuit are explained in detail.
As shown in Fig. 4, the real data input to the discriminator are the data pairs \(\left( {x,y} \right)\) sampled from the classical training set, where y represents the conditional variable. The generator obtains the representation method of the conditional variables and the probability distribution of various samples in the training set through \(\left| {\mathrm{y}} \right\rangle\). Therefore, \(\left| {\mathrm{y}} \right\rangle\) is a quantum state entangled by m-categories conditional variables according to the probability distribution of real samples
where \(1/{\alpha _j} = {\left( {p\left( {x\left| {{y_j}} \right. } \right) } \right) ^{\mathrm{{ - }}1/2}}\), and \({1/{\alpha _j}}\) meets the normalization conditions: \({\sum \nolimits _{j = 1}^n {\left| {1/{\alpha _j}} \right| } ^2} = 1\).
The category labels of classical data samples used for machine learning tasks are generally coded by one-hot method. Assuming that three categories of data are generated, and the classical binary representations of three labels are: 001, 010, 100. Since the classical discriminator will perform classification processing on the generative distribution and the real distribution, it is most reasonable to use the same one-hot method to encode \(\left| {{\mathrm{{y}}_j}} \right\rangle\). It also happens to be similar in form to the quantum three-particle W state, \({\left| W \right\rangle _3} = 1/3\left( {\left| {001} \right\rangle + \left| {010} \right\rangle + \left| {100} \right\rangle } \right)\). When designing a quantum circuit to prepare \(\left| {\mathrm{y}} \right\rangle\), the quantum circuit of preparing a multi-particle W state can be used as a template, which reduces the complexity of circuit design to a certain extent.
Taking \(\left| y \right\rangle = {\left| W \right\rangle _3}\) as an example, where \(m=3\), \({\alpha _j} = \sqrt{3} \left( {j = 1,2,3} \right)\), which means that the training set contains three categories of uniformly distributed data. The specific preparation process of \({\left| W \right\rangle _3}\) can be divided into two steps, and the corresponding quantum circuit is shown in Fig. 5. The first step is to use a combination of single qubit rotation gates and CNOT gate. By adjusting the rotation angle, the qubits are prepared into a special state containing only three terms, i.e.,
According to the calculation rule of quantum circuit cascade, there is a equation
By solving this equation, the parameters \({\theta _1} = {\theta _3} = \mathrm{{0}}\mathrm{{.55357436}}, {\theta _2} = \mathrm{{ - 0}}\mathrm{{.36486383}}\) in the quantum circuit can be obtained. The second step is to select the quantum gates without parameters to design circuit. Firstly perform the NOT gate (i.e., Pauli X gate) on \(\left| {{Q_b}} \right\rangle\) and \(\left| {{Q_c}} \right\rangle\), then apply the Toffoli gate to set the \(\left| {{Q_a}} \right\rangle\) equal to \(\left| 1 \right\rangle\), when \(\left| {{Q_b}} \right\rangle\) and \(\left| {{Q_c}} \right\rangle\) equal to \(\left| 1 \right\rangle\). Finally, perform a NOT gate on \(\left| {{Q_b}} \right\rangle\) and \(\left| {{Q_c}} \right\rangle\) to restore the state at the end of the first step. After the above operations, the initial state \(\left| {000} \right\rangle\) can be evolved into \({\left| W \right\rangle _3}\).
Using the one-hot method to encode the conditional information in the quantum state requires relatively more quantum resources, but it can reduce the workload of converting the data into other encoding forms when the data is classically post-processed. When designing the circuit for preparing quantum state of the conditional information, as long as the fixed template is followed, the parameter value is obtained by changing the probability amplitude on the right end of Eq. 5, and the multi-class label information that meets any probability distribution can be expressed.
3.2 Circuit design of quantum generator
Quantum computing forms a quantum circuit through the arrangement and combination of wires and basic quantum gates, which act on the quantum state to achieve the evolution of the system. The so-called parameterized quantum circuit is to choose a combination of parameterized quantum rotation gates and other quantum logic gates to constitute the circuit. Single-qubit gates are used to realize qubit rotation, while multi-qubit gates mainly realize entanglement between qubits. Representing the quantum state and the quantum gate in the form of a vector and a unitary matrix, it means that the mathematical connotation of the quantum gate operation is linear transformation, which is similar to classical machine learning. In that, the roles of parameters in PQCs and classical neural networks are consistent.
Due to the unitary constraints of quantum gates, to generate N bits of data, \(N = {N_d} + {N_c}\) qubits resources are required, where \({N_d}\) channels process sample data and \({N_c}\) channels receive conditional information. For the quantum generator, the input \({\left| 0 \right\rangle ^{ \otimes {N_d}}}\left| y \right\rangle\) is converted into the final state \({\left| x \right\rangle _{\mathrm{G}}}\left| y \right\rangle\) after the \({L_{\mathrm{G}}}\) layers combination unitary operations, where the \({\left| x \right\rangle _{\mathrm{G}}}\) represents the generative distribution. Sampling the final state of the generator can collapse the quantum state to classical data. The quantum generator is realized by a PQC based on quantum gate computing mechanism, which is composed of rotation layers and entanglement layers alternately arranged. Due to the unitary nature of the quantum gate set, if the rotation layer and the entanglement layer alternately perform operations and form a sufficiently long layer sequence, any unitary transformation can be performed on the initial state in theory.
According to the decomposition theorem of single qubit unitary operation, a single rotation layer is composed of two \(R_z\) gates and one \(R_x\) gate arranged at intervals, that is \(\prod \nolimits _{i = 1}^N {R_z^{}\left( {\theta _{l,3}^i} \right) R_x^{}\left( {\theta _{l,2}^i} \right) R_z^{}\left( {\theta _{l,1}^i} \right) }\). The superscript i indicates that the quantum gate acts on the i-th qubit, and the subscript l indicates that the operations perform on the l-th layer. The matrix representations of \(R_x\) gate and \(R_z\) gate are
A single entanglement layer generally selects two-qubit controlled rotation gates (such as CRX, CRY, CRZ gate) and general two-qubits logic gates (such as CNOT gate) for permutation and combination. The arrangement of quantum gates is related to the connectivity among qubits, thus affecting the expressiveness and entanglement capabilities of PQCs. There are three common connection topologies among qubits: circle, star, and all-to-all connectivity [36, 37]. For circle or star connectivity, the entanglement between certain qubits will not occur in a single layer, which means that more layers are required to fit the distribution of complex targets. This phenomenon undoubtedly increases the difficulty of parameters optimization. All-to-all connectivity is an ideal topology structure among qubits. Although the number of parameters of a single-layer will exceed the other two methods, a shallow all-to-all connectivity quantum circuit can achieve better generative results and the computational overhead of algorithm is cheaper.
When designing the PQC of quantum generator, it is necessary to ensure that the qubits are fully connected. According to the above rules, the quantum generator circuit of QCGAN is shown in Fig. 6. The “XX” in the Fig. 6 represents an operation involving two qubits, where any one is the control qubit, and the other is the target qubit. When the control qubit is \(\left| 1 \right\rangle\) or \(\left| 0 \right\rangle\) (specified by the operation), the target qubit is operated accordingly. The \({N_c}\) qubits are only responsible for transmitting conditional information to the other \({N_d}\) qubits, and continuing to pass the conditional information to the discriminator in post-processing. Therefore, no rotation operation is performed on them, and they are only used as control qubits to affect the circuit for data generation.
3.3 Adversarial training strategy
The training of the QCGAN is a parameter optimization quantum algorithm with a feedback loop. The parameters of quantum generator and classical discriminator are denoted by \(\theta\) and \(\phi\), respectively. Similar to the classical CGAN, the objective function of QCGAN is
At the beginning of training, all parameters in quantum circuit and binary classification neural network are given random initial values. During the adversarial training process, the parameters of generator and discriminator are alternately optimized. The parameters of the quantum generator circuit are fixed first to optimize the parameters of the discriminator. The discriminator simultaneously judges the randomly sampled batch training data and the data sampled from the quantum generator. The output value of the discriminator represents the probability that the corresponding input comes from the real distribution, and the gradient is calculated in the direction of maximizing the objective function of discriminator to optimize the parameter \(\phi\). Modifying the parameters of discriminator and repeating the above optimization operations, so that discriminator can not only learn the characteristics of real data distribution but also have the ability to discriminate the data from generative distribution. Then the parameters of discriminator are fixed, and the input of discriminator is only the results of the generator sampling. The larger the output of the discriminator, the smaller the gap between the generative distribution and the previously learned real distribution. In that, the gradient is calculated according to the direction of maximizing the objective function of generator to optimize the parameters \(\theta\). The ability of generator to fit the true distribution is continuously improved by modifying the parameters and repeating the circuit on the quantum computing device. The alternate optimization of generator and discriminator parameters must be iteratively performed until generator can reconstruct the state distribution of the training set.
According to the above connotation of adversarial training, Eq. 6 is decomposed into the unsaturated maximization objective function that generator and discriminator obeys respectively,
During the training process, the gradient descent method is used to optimize the parameters. This method needs to calculate the gradient information \({\nabla _\theta }{V_{{G_\theta }}}\) and \({\nabla _\phi }{V_{{D_\phi }}}\). For classical neural networks, backpropagation can be used directly to calculate the gradient value of the objective function effectively. But for quantum devices, only the measurement results can be obtained, in that the output probability of discriminator cannot be directly accessed. Therefore, the gradient estimation of a parameterized quantum circuit needs to follow the theorem: for a circuit containing the parameter unitary gates \(U\left( \eta \right) = {e^{ - \frac{i}{2}\eta \Sigma }}\), the gradient of the expectation value of an observable B with respect to the parameter \(\eta\) reads
The \({\left\langle \right\rangle _{{\eta ^ \pm }}}\) in Eq. 8 represents expectation value of observable with respect to the output quantum wave function generated by the same circuit with parameter \({\eta ^ \pm } = \eta \pm \frac{2}{\pi }\) [38]. This is an unbiased estimation method for the gradient of PQC. According to this theorem, the gradient of the output of the discriminator with respect to the parameters \(\theta\) can be calculated
where \({\theta ^ \pm } = \theta \pm \frac{2}{\pi }{e^i}\) and \({e^i}\) represents the i-th unit vector in the parameter state space, i.e., \({\theta _i} \leftarrow {\theta _i} \pm \frac{2}{\pi }\). In order to estimate the gradient of each parameter, every single parameter needs to be optimized and then evaluated repeatedly. In the case of small-scale numerical simulation, the wave function can be used to directly calculate the expectation value. Another method is to calculate the probability distribution based on the wave function, and then sample the gradient for estimation [39].
4 Performance evaluation
In order to evaluate the performance of the algorithm proposed in this paper, the classical GAN [15] and CGAN [26], QGAN [31] and QCGAN are mainly compared from the perspectives of time complexity and algorithm function. The performance comparison of the four generative adversarial algorithms is shown in Table 1.
In the classical CGAN algorithm, the process of generator parameters optimization can be seen as performing gradient descent in the convex set of the normalized covariance matrix of the data set to fit the real distribution. Therefore, the time complexity of generating data that fit the N-dimensional classical distribution is \(O({N^2})\). In contrast, the time complexity of a quantum information processor to perform a linear transformation on an N-dimensional vector is O(N). Even if optimizing the each parameter needs to modify and execute the PQC twice, the calculation time complexity of QCGAN is still lower than that of CGAN when the same parameter optimization strategy is adopted (neglecting the time cost of preparing classical data into quantum states). On the other hand, the classical CGAN algorithm cannot directly generate discrete data due to the influence of differentiable constraints during parameter optimization, while QGAN can directly generate discrete data and also has the ability to generate continuous distribution [40]. In addition, the QCGAN algorithm proposed in this paper directly encodes classical data in quantum state, so its resource consumption is \({N_d} + {N_c}\) the same as classical CGAN (where \({N_d}\) is the resource consumption of generating target data, and \({N_c}\) is the conditional information resource consumption). While the resource consumption of unsupervised GAN and QGAN algorithms is N, which is equal to the generative target data size.
Compared with unconstrained QGAN, the input of conditional information brings prior knowledge about the training set to the model, turning unsupervised QGAN into a weakly supervised or supervised adversarial learning model, thereby achieving controllable data generation process. The learning results of unconstrained QGAN are more inclined to present the average state of all data in training set. But due to adding the conditional information, QCGAN will accordingly show an advantage in the fitness of the generated results to the real distribution. Moreover, the generator trained by QGAN is still purposelessly generated, which can only guarantee the authenticity of the generated data but cannot expand other functions. While QCGAN can achieve different purpose generation tasks by introducing different conditional information, which can fully reflect the subjective initiative of people and realize the interaction between people and algorithms. It can be considered that QCGAN is a human-centered algorithm. Therefore, from a functional perspective, the generators trained by QCGAN have more extensive application scenarios and higher efficiency.
5 Experimental
In this paper, the synthetic \(\hbox {BAS}(2, 2)\) (Bars and Stripes) data set is used for the experiments and analyses of the classical data classification generation task. The TensorFlow Quantum (TFQ), an open source quantum cloud computing platform for the rapid prototyping of hybrid quantum-classical models for classical or quantum data [41], is introduced to realize the simulation experiments.
5.1 BAS data set
The \(\hbox {BAS}(m,n)\) data is a composite image containing only horizontal bars or vertical stripes on a two-dimensional grid. For \(m \times n\)-pixel images, there are only \({2^m} + {2^n} - 2\) valid BAS images in all \({2^{m \times n}}\) cases. This defines the target probability distribution, where the probabilities for valid images are specified constants, and the probabilities for invalid images are zero. The generation goal of the experiment is the classical data of \(\hbox {BAS}(2, 2)\), which seem to be a insufficient challenging for quantum computers intuitively. However, the effective quantum state represented by the \(\hbox {BAS}(2,2)\) data set have a minimum entanglement entropy of \({S_{BAS\left( {2,2} \right) }} = \mathrm{{1}}\mathrm{{.25163}}\) and a maximum achievable entropy of \({S_{BAS\left( {2,2} \right) }} = \mathrm{{1}}.\mathrm{{79248}}\), which is the known maximum entanglement entropy of four-qubit states set [42]. Therefore, the data have rich entanglement properties and are very suitable as a generation target for quantum adversarial training.
The \(\hbox {BAS}(2,2)\) images in the training set are divided into three categories. The horizontal bar images and the vertical stripe images are respectively one category, and the image with pixel values of all 0 or all 1 is the other category. And the effective BAS images conform to the uniform distribution. According to the classification standard, the category labels are one-hot encoded and added to the basic data set as the conditional information. Hence the generator require 7 qubits resources, as processing the pixel information of BAS data requires 4 qubits, receiving conditional information requires 3 qubits.
5.2 Experimental setup
The codes synthesis 6000 samples to form the training set, including three categories of BAS data (a total of 6 valid images) that meet the above requirements and their category labels. During training, all data is out of order firstly, and then extracted by batch size. For the pre-training of the BAS data set, the discriminator and generator are alternately trained once in each iteration optimization. The batch size of each training is 40, and there are totally 100 epochs for iterative training. In each epoch, iterative training the network 150 times, so that the discriminator can traverse the entire training set. Considering that the improper setting of the learning rate will cause the network gradient to disappear/explode, setting the learning rate \(\times 0.1\) to reduce it every 10 epochs of training. The Adam (Adaptive Moment Estimation) optimizer provided by the open source library is introduced for both generator and discriminator, and the initial learning rate is set as 0.001.
Each epoch of training optimization completes, the output of generator is sampled to inspect the quality of the current generation distribution. The inspection mainly including three points:
-
1.
whether the generated pixel data constitutes a valid BAS image;
-
2.
whether the generated pixel data matches the conditional information;
-
3.
whether the generated all data conform to the uniform distribution.
Since the training process of the adversarial network is relatively unstable, if the comprehensive accuracy of the above three investigation points reaches the preset threshold of \(95\%\), the training process can be chosen to terminate early. If the threshold can not be reached all the training time, 100 epochs of alternate training are performed according to the preset settings, and then analyze the convergence of the objective function in the whole training process. After that, the adversarial network can be trained again after reasonable adjustments to the training strategy and hyperparameters, by summarizing the reasons for the unsatisfactory training results.
6 Results and discussion
In the simulation process, a series of comparative experiments are conducted on the performance of the generator using circle, star, and all-to-all connected quantum circuits firstly. The results verified the superiority of designing an all-to-all connected topology of the quantum generator in this scheme. According to the result of the comparative experiment, the PQC structure shown in Fig. 7 is used as the generator of QCGAN. The input \(\left| y \right\rangle\) of the generator is \({\left| W \right\rangle _3}\), which is prepared in advance with the circuit shown in Fig. 5.
The discriminator is classical so it is implemented using the classical deep learning framework, TensorFlow, which can form a hybrid quantum-classical model with TFQ. The discriminator has one input layer with dimension \(N_{{\mathrm{d}}} + N_{c} = 7\), one hidden layer made up of 4 neurons and one output neuron. Since the discriminator directly judges the expectation value of the generator output, the hidden layer selects the linear ReLU activation function.
As shown in Fig. 8, in the overall trend, the loss function value of the discriminator gradually decreases and the loss function value of the generator gradually increases. After training, both the losses of generator and discriminator converge to near the expected equilibrium point. As the epoch of training increases, the model gradually stabilizes and the relationship between generator and discriminator is more intense. So it shows in Fig. 8 that there is still a large oscillation around the expectation value after the convergence. This phenomenon is also related to the influence of noise on quantum systems which access through cloud platform.
After the pre-training of the BAS data set is completed, quantum generator result is sampled 10, 000 times to analyze the generative distribution. The probability distribution of the generated data is shown in Fig. 9a. It can be seen that most of the generated data fall in the six valid BAS mode images, and the three categories BAS images basically conform to the uniform distribution with \(97.15\%\) accuracy. Figure 9b visualizes the first 100 generative samples in the form of pixel maps of 1, 70 and 100 epoch, which shows that the quantum generator gradually has the ability to generate BAS(2, 2) images after pre-training.
The parameters of quantum gates in the optimal generator are extracted after pre-training, and then use the generator circuit shown in Fig. 7 to realize the task of generating classification images. The parameters of PQC in Fig. 5 are adjusted to set the input \(\left| y \right\rangle\) as \(\left| {001} \right\rangle\), and then sample the output \({\left| x \right\rangle _{\mathrm{G}}}\) of generator. The result shows that two kinds of horizontal stripe images meet the uniform distribution, which means that the quantum generator can generate data of multiple categories that meet the conditional constraints through the guidance of conditional information.
7 Conclusion
Combining the classical CGAN algorithm with quantum computing ideas, this paper proposes a quantum conditional generative adversarial network algorithm for human-centered paradigm, which is a general scheme suitable for fitting classical data distribution. This paper gives a detailed interpretation of our design focus, including the configuration design of PQC as the generator, the parameter gradient estimation method of adversarial training strategy as well as the specific steps of the algorithm’s cloud computing implementation.
The effect of the QCGAN algorithm is that by adding conditional constraints related to the training data set in the input layer, which effectively guides the network to generate data that meets specific requirements. This step increases the controllability of the generation process, but also more in line with the current human-centered requirements for machine learning algorithms. Compared with classical CGAN, the time complexity of the QCGAN algorithm proposed in this paper is lower, and it is more in line with the needs of actual application scenarios. Through experiments on the quantum cloud computing platform, the results show the QCGAN can generate the BAS data distribution effectively and the generator of QCGAN can output correct data guided by the conditional constraint in cloud.
Given that QGAN has the ability to generate discrete data and the potential to dig out data distributions that cannot be effectively summarized by classical calculations, QGAN and classical GAN are functionally complementary. Many known variants of GAN can generate very realistic images, audio, and video, in that the combination of these algorithms and quantum mechanics is undoubtedly the icing on the cake. Our future work will focus on the quantum schemes of some classical GAN variant algorithms and constructing quantum machine learning algorithms that conform to the HCC paradigm and the corresponding cloud computing implementation.
Availability of data and materials
The relevant analysis data used to support the findings of this study are included in the article.
Abbreviations
- QGAN:
-
Quantum generative adversarial network
- QCGAN:
-
Quantum conditional generative adversarial network
- NISQ:
-
Noisy intermediate-scale quantum
- CGAN:
-
Conditional generative adversarial network
- HCC:
-
Human-centered computing
- GAN:
-
Generative adversarial network
- PQC:
-
Parameterized quantum circuit
- TFQ:
-
TensorFlow quantum
- BAS:
-
Bars and stripes
References
S.P. Singh, A. Nayyar, R. Kumar et al., Fog computing: from architecture to edge computing and big data processing. J. Supercomput. 75, 2070–2105 (2019)
C. Zhou, A. Li, A. Hou, Z. Zhang, Z. Zhang, F. Wang, Modeling methodology for early warning of chronic heart failure based on real medical big data. Expert Syst. Appl. 151, 113361 (2020). https://doi.org/10.1016/j.eswa.2020.113361
Q. Liu, Y. Tian, J. Wu, T.G.W. Peng, Enabling verifiable and dynamic ranked search over outsourced data. IEEE Trans. Serv. Comput. (2019). https://doi.org/10.1109/TSC.2019.2922177
L. Qi, C. Hu, X. Zhang, R.K. Mohammad et al., Privacy-aware data fusion and prediction with spatial-temporal context for smart city industrial environment. IEEE Trans. Ind. Inf. (2020). https://doi.org/10.1109/TII.2020.3012157
L. Wang, X. Zhang, T. Wang, S. Wan, G. Srivastava et al., Diversified and scalable service recommendation with accuracy guarantee. IEEE Trans. Comput. Soc. Syst. (2020). https://doi.org/10.1109/TCSS.2020.3007812
X. Xu, R. Mo, F. Dai, W. Lin et al., Dynamic resource provisioning with fault tolerance for data-intensive meteorological workflows in cloud. IEEE Trans. Ind. Inf. 16(9), 6172–6181 (2020)
X. Xu, X. Zhang, M. Khan, W. Dou et al., A balanced virtual machine scheduling method for energy-performance trade-offs in cyber-physical cloud systems. Future Gener. Comput. Syst. 105, 789–799 (2020)
L. Wang, X. Zhang, R. Wang, C. Yan et al., Diversified service recommendation with high accuracy and efficiency. Knowl. Based Syst. 204, 106196 (2020). https://doi.org/10.1016/j.knosys.2020.106196
Y. Xu, J. Ren, Y. Zhang, C. Zhang, B. Shen et al., Blockchain empowered arbitrable data auditing scheme for network storage as a service. IEEE Trans. Serv. Comput. 13(2), 289–300 (2020)
J. Li, T. Cai, K. Deng, X. Wang et al., Community-diversified influence maximization in social networks. Inf. Syst. 92, 1–12 (2020)
L. Qi, X. Wang, X. Xu, W. Dou, S. Li, Privacy-aware cross-platform service recommendation based on enhanced locality-sensitive hashing. IEEE Trans. Netw. Sci. Eng. (2020). https://doi.org/10.1109/TNSE.2020.2969489
W. Zhong, X. Yin, X. Zhang, S. Li et al., Multi-dimensional quality-driven service recommendation with privacy-preservation in mobile edge environment. Comput. Commun. 157, 116–123 (2020)
Q. Liu, P. Hou, G. Wang, T. Peng, S. Zhang, Intelligent route planning on large road networks with efficiency and privacy. J. Parallel Distrib. Comput. 133, 93–106 (2019)
E.F. Dumitrescu, A.J. Mccaskey, G. Hagen et al., Cloud quantum computing of an atomic nucleus. Phys. Rev. Lett. 120(21), 210501 (2018)
I.J. Goodfellow, J. Pouget-Abadie, M. Mirza, et al., Generative adversarial nets. In: Proceedings of the 27th International Conference on Neural Information Processing Systems (NIPS), pp. 2672–2680 (2014)
K.F. Wang, C. Gou, Y.J. Duan et al., The research progress and outlook of generative adversarial network. Acta Auto. Sin. 43(3), 321–332 (2017)
C. Ledig, L. Theis, F. Huszar, et al., Photo-realistic single image super-resolution using a generative adversarial network. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 105–114 (2017)
O. Kupyn, V. Budzan, M. Mykhailych, et al.: Deblurgan: blind motion deblurring using conditional adversarial networks. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8183–8192 (2018)
L. Zhu, S. Kwong, Y. Zhang et al., Generative adversarial network-based intra prediction for video coding 22(1), 45–58 (2020)
R. Bhargava, G. Sharma, Y. Sharma, Deep text summarization using generative adversarial networks in Indian languages. Proc. Comput. Sci. 167, 147–153 (2020)
J. Johnson, A. Gupta, L. Fei-Fei. Image generation from scene graphs. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1219–1228 (2018)
A. Radford, L. Metz, S. Chintala, Unsupervised representation learning with deep convolutional generative adversarial networks. In: 4th International Conference on Learning Representations (ICLR) (2016)
S. Ioffe, C. Szegedy, Batch normalization:accelerating deep network training by reducing internal covariate shift. In: Proceedings of the 32nd International Conference on Machine Learning, pp. 448–456 (2015)
I. Tolstikhin, S. Gelly, O. Bousquet. Adagan: Boosting generative models. In: Proceedings of the 31st International Conference on Neural Information Processing Systems (NIPS), pp. 5430–5439 (2017)
M. Arjovsky, S. Chintala, L. Bottou, Wasserstein GAN (2017). arXiv:1701.07875
M. Mirza, S. Osindero, Conditional generative adversarial nets. Comput. Sci. 2014, 2672–2680 (2014)
D. Zhu, S. Xia, J. Zhao, Y. Zhou et al., Diverse sample generation with multi-branch conditional generative adversarial network for remote sensing objects detection. Neurocomputinge 381, 40–51 (2020)
S.U. Dar, M. Yurt, L. Karacan, A. Erdem, E. Erdem et al., Image synthesis in multi-contrast MRI with conditional generative adversarial networks. IEEE Trans. Med. Imaging 38(10), 2375–2388 (2019)
X. Yi, P. Babyn, Sharpness-aware low-dose CT denoising using conditional generative adversarial network. J. Digit. Imaging 31, 655–669 (2018)
B. Marcello, G. Delfina, P. Oscar, A generative modeling approach for benchmarking and training shallow quantum circuits. NPJ Quantum Inf. 5, 45–61 (2019)
L. Seth, W. Christian, Quantum generative adversarial learning. Phys. Rev. Lett. 121(4), 040502 (2018)
P.L. Dallaire-Demers, N. Killoran, Quantum generative adversarial networks. Phys. Rev. A 98(1), 012324 (2018)
L. Hu, S.H. Wu, W. Cai, Quantum generative adversarial learning in a superconducting quantum circuit. Sci. Adv. 5(1), 2761 (2019)
Z.Y. Han, J. Wang, H. Fan, L. Wang, P. Zhang, Unsupervised generative modeling using matrix product states. Phys. Rev. X 8(3), 31012–31025 (2017)
W. Huggins, P. Patil, B. Mitchell et al., Towards quantum machine learning with tensor networks. Quantum Sci. Technol. 4(2), 024001 (2019)
S. Sim, P.D. Johnson, A. Aspuru-Guzik, Expressibility and entangling capability of parameterized quantum circuits for hybrid quantum-classical algorithms. Adv. Quantum Technol. 12(2), 1 (2019)
D. Zhu, N.M. Linke, M. Benedetti et al., Training of quantum circuits on a hybrid quantum computer. Sci. Adv. 5(10), 9918 (2019)
J.G. Liu, L. Wang, Differentiable learning of quantum circuit born machines. Phys. Rev. A 98(6), 062324 (2018)
H. Situ, Z. He, Y. Wang et al., Quantum generative adversarial network for discrete data. Inf. Sci. 538, 193–208 (2020)
R. Jonathan, A.G. Alan. Variational quantum generators: generative adversarial quantum machine learning for continuous distributions (2019). arXiv:1901.00848
M. Broughton, G. Verdon, T. McCourt, A. Martinez, J. Yoo et al. TensorFlow quantum: a software framework for quantum machine learning (2020). arXiv:1811.04968v3
A. Higuchi, A. Sudbery, How entangled can two couples get? Phys. Lett. A 273, 213–217 (2000)
Acknowledgements
This work is supported by National Natural Science Foundation of China (Grant Nos. 62071240, 61802002); the Graduate Research and Practice Innovation Program of Jiangsu Province (Grant No. KYCX20_0969); the Natural Science Foundation of Jiangsu Higher Education Institutions of China under Grant No. 19KJB520028; the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD).
Author information
Authors and Affiliations
Contributions
WJL conducted research and discussion on the algorithm scheme and drafted the manuscript. YZ carried out the design and realization of the simulation experiment and completed the drafting of the manuscript. ZLD participated in the research and discussion of the algorithm scheme, and proposed some amendments to the first draft. JJZ and LT reviewed and revised the content of the manuscript. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Liu, W., Zhang, Y., Deng, Z. et al. A hybrid quantum-classical conditional generative adversarial network algorithm for human-centered paradigm in cloud. J Wireless Com Network 2021, 37 (2021). https://doi.org/10.1186/s13638-021-01898-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s13638-021-01898-3