Abstract
We propose a hybrid quantum-classical algorithm to compute approximate solutions of binary combinatorial problems. We employ a shallow-depth quantum circuit to implement a unitary and Hermitian operator that block-encodes the weighted maximum cut or the Ising Hamiltonian. Measuring the expectation of this operator on a variational quantum state yields the variational energy of the quantum system. The system is enforced to evolve toward the ground state of the problem Hamiltonian by optimizing a set of angles using normalized gradient descent. Experimentally, our algorithm outperforms the state-of-the-art quantum approximate optimization algorithm on random fully connected graphs and challenges D-Wave quantum annealers by producing good approximate solutions. Source code and data files are publicly available (https://github.com/nkuetemeli/UQMaxCutAndIsing).
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
1 Introduction
Quantum computing has emerged as a powerful computation paradigm taking advantage of principles of quantum mechanisms. It involves two computational models that fundamentally differ in their functioning: The adiabatic quantum model is best suited for optimization problems, typically cast in quadratic unconstrained binary optimization form. Commercial devices such as d-wave annealers [1] can already solve combinatorial problems in various fields such as computer vision [2, 3] and database engineering [4, 5]. The universal model of quantum computing, also referred to as the gate-based or circuit model, is more flexible and can potentially implement any classical operation, as shown by Bennett [6]. For selected problems such as Shor’s factoring [7] and Grover’s search [8] algorithms, strong theoretical convergence properties and drastic speed-up of the universal quantum computing model over classical counterparts could be proven. However, for problem sizes of practical interest, these algorithms still require more resources and run-time than existing universal devices can provide. In the era of noisy intermediate-scale devices, it is a challenging task to find real-world applications of universal quantum computing. Hybrid quantum-classical algorithms are therefore considered to be a promising way to obtain practical quantum supremacy.
The Ising model [9] describes a quantum mechanical system with \(n\in \mathbb {N}\) particles or spins that can be in two possible states, i.e., the spin \(s_i, i = 1, \ldots , n\) can be in the state \(\pm 1\) represented by \(s_i = (-1)^{q_i}, q_i \in \left\{ 0, 1\right\} \). Each spin \(s_i\) can interact with some external energy of strength \(\mathcal {C}_{ii}\) or with an adjacent spin \(s_j\) by a mutual interaction energy \(\mathcal {C}_{ij}\). The complete system can be modeled by a general un-directed n vertices graph \( \mathcal {G} = (\mathcal {S}, \mathcal {E}, \mathcal {C}) \) with \( \mathcal {S} = \left\{ s_1, \ldots , s_n\right\} \), \( \mathcal {E} \subseteq \mathcal {S} \times \mathcal {S} \) and a cost function \( \mathcal {C}: \mathcal {E} \rightarrow \mathbb {R}\) with \(\mathcal {C}(s_i, s_j):= \mathcal {C}_{ij}\) on \( \mathcal {E} \). For a quantum system in the state \({| q\rangle } = \bigotimes _{i=1}^n{| q_i\rangle }, q_i \in \left\{ 0, 1\right\} \), the Ising model describes the total energy of the system as being the expectation \(\langle \textbf{C}\rangle := \langle q | \textbf{C} | q\rangle \) of the \((2^n \times 2^n)\) Hamiltonian
where \(\textbf{Z}_k\) denotes the Pauli-\(\textbf{Z}\) operator acting on the kth particle of the system.
The problem is to
i.e., to search for the state \({| q^\star \rangle }\) of the system with the minimal energy according to the Ising model in Eq. (1). By setting \(\mathcal {C}_{ii} = 0\) for all i, the problem reduces to the so-called weighted maximum cut (maxcut) problem [10].
The observable \(\textbf{C}\) in Eq. (1) is a diagonal matrix and it is straightforward to verify that the expectation \(\langle \textbf{C}\rangle \) is lower-bounded by the smallest eigenvalue \(\mathcal {C}_{min}\) of \(\textbf{C}\). Indeed, measuring \(\textbf{C}\) on a quantum system prepared in the superposition state \({| \psi \rangle } = \sum _{q=0}^{2^n-1} \alpha _q {| q\rangle }\) yields
In other words, under all norm-1 vectors, \(\langle \textbf{C}\rangle \) reaches the minimum exactly for \({| \psi \rangle }\) being an eigenvector of \(\textbf{C}\) to the lowest eigenvalue, a ground state \({| q^\star \rangle }\) of \(\textbf{C}\). Computing \(\langle \textbf{C}\rangle \) for all eigenvectors of \(\textbf{C}\)—which amounts to computing the diagonal elements of \(\textbf{C}\)—is practically hard, as it amounts to brute-forcing the problem. Instead, in hybrid quantum-classical approaches, one aims for an efficient way to search for parameters \(\Theta \in \Gamma \) that solve the variational problem
where \(\Gamma \) is a suitable parameterization of the search space.
The Ising spin model is a powerful tool describing ferromagnetism in statistical mechanics [9] as well as many practically relevant np-problems [11]. maxcut is no less important. It finds practical applications in varying fields such as computer vision [12], data clustering [13], and communications network design [14]. Thus, determining solutions of the Ising spin model or the maxcut problem, even approximately, is of great practical interest.
Adiabatic quantum computation (aqc) relies on the adiabatic theorem [15, 16] and solves the Ising problem by performing an adiabatic evolution of the quantum system from the known and easily-prepared ground state of an initial Hamiltonian to that of the problem Hamiltonian. d-wave [1] quantum annealers are intermediate-scale devices implementing aqc. It is an established fact that the run-time of aqc algorithms scales inversely with the energy gap between the ground state and the first excited state of the system Hamiltonian [15,16,17]. This introduces the necessity of carefully designing the system Hamiltonian or to use spectral gap amplification techniques [18]. Another workaround is the conception of (hybrid) gate-based algorithms that use moderate quantum resources with outer-loop classical optimization [19].
Quantum approximate optimization algorithms (qaoa), firstly introduced by Fahri et al. [19] and widely discussed in the literature [20,21,22,23], mimic the aqc model, but use a low-depth variational circuit capable to run on near-term devices and benefiting from the maturity of classical optimization. qaoa is considered as one of the major candidates for dealing with realistic real-world applications at competitive performance using the universal model. However, in practice, optimizing qaoa parameters appears to be extremely difficult due to a non-convex objective [20, 22, 23]. Also, qaoa is threefold iterative and hence computationally expensive: First, the ansatz itself involves repetitive layers of gates; second, the classical optimization routine used around qaoa often needs to be iterative as the objective function is non-convex; third, evaluating this objective function requires a large number of quantum circuit measurements.
This work aims to facilitate combinatorial optimization on universal quantum computers. We first eliminate the repetitive layers of qaoa and propose an alternative, more effective encoding of the problem on universal quantum hardware. We present a new, easy to implement and low-depth variational circuit that block-encodes the maxcut or the Ising Hamiltonian in an Hermitian and unitary operator. Measuring this circuit reveals direct information about the loss function Eq. (5). We then derive an optimization routine based on gradient descent for the proposed variational circuit in order to drive the quantum system toward the ground state of the problem Hamiltonian. We experimentally validate the novel algorithm and find that it outperforms the state-of-the-art gate-based qaoa model for solving maxcut; on the Ising model, it also challenges the specialized d-wave annealers.
This paper is structured as follows: in Sect. 2 we recall how d-wave and qaoa solve combinatorial problems; in Sect. 3, we present the new variational quantum circuit and how its parameters are optimized; Sect. 4 provides experimental results and Sect. 5 concludes the work.
2 Related work
2.1 Adiabatic quantum computing
Adiabatic quantum computation (aqc) is an optimization paradigm which relies on the adiabatic theorem [15, 16] saying that the ground state of a quantum mechanical system is the solution of an optimization problem [24]. Theoretically, the evolution of a quantum system of \(n \in \mathbb {N}\) particles at a time \(t \in [0, T]\), typically re-scaled to \(s=t/T \in [0, 1]\), can be described by a Hamiltonian \(\textbf{H}(s)\) on a Hilbert space \(\mathcal {H} = \mathbb {C}^{\otimes 2^n}\), with the state of the system given by a unit vector \({| \psi (s)\rangle } \in \mathcal {H}\).
Industrial devices such as the d-wave quantum annealer [1] have already been proposed to solve binary combinatorial problems based on the aqc optimization principle. Their main idea consists in initializing a quantum system with a Hamiltonian \(\textbf{B} = \sum _{k=1}^n \textbf{X}_k\) whose ground state \({| +\rangle }^{\otimes n}\) is the perfect superposition state and easy to prepare. As above, \(\textbf{X}_k\) denotes the Pauli-\(\textbf{X}\) operator acting on the kth particle of the system. Then, another Hamiltonian \(\textbf{C}\), the problem Hamiltonian, is prepared as in Eq. (1). As the time s evolves from 0 to 1, the initial Hamiltonian \(\textbf{B}\) is transformed into the problem Hamiltonian \(\textbf{C}\), describing a time-dependent system Hamiltonian
where \(\mathcal {B}\) and \(\mathcal {C}\) with \(\lim _{s\rightarrow 1}\mathcal {B}(s) = 0\) and \(\lim _{s\rightarrow 1}\mathcal {C}(s) = 1\) are annealing functions. The evolution of the system generated by \(\textbf{H}(s)\) over the time \(s \in [0, 1]\) is governed by the Schrödinger equation; its solution defines a time-dependent unitary operator that transforms the ground state of \(\textbf{B}\) into the ground state of \(\textbf{C}\) with high probability if s varies sufficiently slowly [16]. The ground state of \(\textbf{C}\) is the solution of the optimization problem (2).
While being an efficient scheme for combinatorial optimization that has the potential to ultimately supercede classical computers, aqc has a caveat. It is known that the smaller the energy gap between the ground state and the first excited state of the adiabatic Hamiltonian \(\textbf{H}(s)\), the longer the required annealing time for guaranteeing the success of the optimization [15,16,17]. To overcome this, methods for universal quantum computers that take advantage of efficient classical optimization techniques have been proposed in the form of quantum approximate optimization algorithms [19].
2.2 Quantum approximate optimization algorithms
In quantum approximate optimization algorithms (qaoa, [19]), the problem (2) is relaxed to finding a state \( {| \psi \rangle } = \sum _{q} \alpha _q {| q\rangle } \) such that \(\langle \textbf{C}\rangle \) is minimized. qaoa solves the maxcut problem by trotterizing the evolution generated by Eq. (6). Its pseudo-code is recapitulated in Algorithm 1: First, the system is prepared in the perfect superposition state \({| +\rangle }^{\otimes n}\). Then, trotterization consists in repeatedly, say p times, applying to the state the unitaries \( \textbf{U}(\textbf{C}, \gamma _k):= \exp (-i \gamma _k \textbf{C}) \) and \( \textbf{U}(\textbf{B}, \beta _k):= \exp (-i \beta _k \textbf{B}) \) generated by the problem Hamiltonian \(\textbf{C}\) and the mixing Hamiltonian \(\textbf{B}\). The (small) time steps \(\gamma _k, \beta _k \in \mathbb {R}\), \(1\le k\le p\), are optimization parameters in qaoa. The resulting state is called \( {| \gamma , \beta \rangle } \):
Finally, the state \( {| \gamma , \beta \rangle } \) is measured in the computational basis and is used to evaluate the cost function \(\langle \textbf{C}\rangle := \langle \gamma , \beta |\textbf{C}|\gamma , \beta \rangle \) and if necessary its derivatives \(\nabla _{(\gamma , \beta )}\langle \textbf{C}\rangle \) and \(\nabla _{(\gamma , \beta )}^2\langle \textbf{C}\rangle \). A classical optimization routine is used to update the parameters \( (\gamma , \beta ):= (\gamma _1, \ldots , \gamma _p, \beta _1, \ldots , \beta _p) \).
For \(p \rightarrow \infty \), the results of [19] guarantee that there exist parameters \( (\gamma , \beta ) \) for which measuring \( {| \gamma , \beta \rangle } \) gives the desired ground state \({| q^\star \rangle }\) with high probability. However, the qaoa objective is difficult to optimize [20, 22, 23]. We believe that this is partially due to the fact that the qaoa ansatz encodes problem information in the argument of \(\exp \) as phases of the qubits, which is partially lost at the measurement. Another issue of qaoa is that its repetitive layers are still too expensive for running on current and near-term devices. In this work, we propose a quantum circuit that encodes the problem more effectively and does not require the repetitive layers of qaoa. Adhering to the promising concept of designing hybrid quantum algorithms, we embed the circuit in a classical optimization method to output the desired ground state \({| q^\star \rangle }\) with high probability.
3 Proposed universal quantum algorithm
Our method builds on the notion of block-encoding introduced in [25, 26] that allows to embed non-unitary matrices as the principal block of a unitary operator acting on the quantum system. Block-encoding is typically achieved by enlarging the Hilbert space of the quantum system.
Definition 1
(Block-encoding [25, 26]) Let \(a,n \in \mathbb {N}\) and \(m:=a+n\). The \(m\times m\) unitary \(\textbf{U}\) is said to be a block-encoding of the \(n\times n\) matrix \(\textbf{C}\) if there is \(\kappa \in (0,\infty )\) such that
By adding a single qubit to the system, our goal is to implement the \((2^{1+n}) \times (2^{1+n})\) unitary operator
for the Ising Hamiltonian \(\textbf{C}\) from Eq. (1) and a suitably chosen constant \( K \in \mathbb {R}\). As \(\textbf{C}\) is a diagonal matrix, the \(\sin \) and \(\cos \) functions directly apply to the diagonal elements [27, Section 2.1.8]. This allows to encode information about the problem as probability amplitudes of the qubits. Note that because \(\textbf{C}\) is Hermitian, \(\textbf{U}\) is Hermitian as well and thus can serve as measurement observable. We use the constant K to re-scale all entries of \(\textbf{C}\) to \([-\pi / 2, \pi / 2]\), where \(\sin \) is strictly increasing and invertible. Specially, Eq. (9) block-encodes a bijective transformation of \(\textbf{C}\). For reasons that will become clear in Sect. 3.1, this re-scaling also allows for an efficient implementation of \(\textbf{U}\). We stress that K should not be set to large since \(\lim _{K \rightarrow \infty } \textbf{U}(\textbf{C}, K) = \textbf{X}^{\otimes (1+n)}\), i.e., for large K the operator \(\textbf{U}\) behaves like a not-gate. In Sect. 3.2, we provide an optimization routine for the proposed circuit, we discuss its scalability and complexity in Sect. 3.3, and lastly we provide a suitable choice for K in Sect. 3.4.
3.1 Implementation
For a system prepared in the basis state \({| q\rangle }\), the cost of the cut q according to the Ising model in Eq. (1) is given by
Crucially, for a \((1+n)\)-qubit system prepared in the state \({| \hat{\psi }, \psi \rangle }:= {| \hat{\psi }\rangle }_c \otimes {| \psi \rangle }\), where \({| \hat{\psi }\rangle }_c = \alpha {| 0\rangle }_c + \beta {| 1\rangle }_c\) is a 1-qubit register that we call the cost qubit and \({| \psi \rangle }\) is the n-qubit working register, if the cost qubit \({| \hat{\psi }\rangle }_c\) is kept in the state \({| 0\rangle }_c\), it holds for \(\textbf{U}\) defined in Eq. (9) that
Thus, if K is chosen such that the entries of the scaled diagonal matrix \(\frac{\textbf{C}}{K}\) fit inside the monotone region of the sine, then minimizing \(\langle \textbf{U}\rangle \) with the cost qubit set to zero is equivalent to minimizing \(\langle \textbf{C}\rangle \).
Observe that \(\textbf{U}\) is a block matrix of diagonal matrices and applying it to the basis state \({| 0, q\rangle }\) gives the same result as applying to the cost qubit \({| 0\rangle }_c\) the controlled \((2 \times 2)\)-operator
As derived in “Appendix A,” this operator performs a reflection in the Bloch sphere of the cost qubit about the axis \(\vec n = (1/2) \left[ \begin{array}{lll}\cos \langle \hat{\textbf{C}}\rangle&0&\sin \langle \hat{\textbf{C}}\rangle \end{array}\right] ^\top \). Up to an irrelevant phase factor, \(\textbf{U}\) can be written as
The second equation holds because \(\arccos (\sin x) = \pi /2 - x\) for \(x\in [-\pi /2, \pi /2]\). The last equation is obtained by applying the identity \(R_y(\theta _1 + \theta _2) = R_y(\theta _2)\cdot R_y(\theta _1)\) for rotations in two dimensions, where \(\theta _1, \theta _2 \in \mathbb {R}\). By recursively applying this same identity to \(R_y (-2\langle \hat{\textbf{C}}\rangle )\), we find
As a result, the weighted sum of Pauli-\(\textbf{Z}\) operators naturally translates into a product of unitary transformations, which is very compatible with the gate-based model of quantum computing. As the basis state \({| q\rangle }\) is chosen arbitrarily, \(\langle \textbf{U}\rangle \) outputs sine transformed costs as derived in Eq. (11) for arbitrary states. For a system prepared in the basis state \({| \hat{\psi }, \psi \rangle } = {| 0, q\rangle }\), we can even recover the exact cost by \(\langle \textbf{C}\rangle = K \arcsin \langle \textbf{U}\rangle \).
The operator \(\textbf{U}\) can be efficiently implemented using the circuit given in Fig. 1:
-
We initialize the cost qubit in the state \({| \hat{\psi }\rangle }_c = R_y(\pi ) \cdot \textbf{Z}{| 0\rangle } = \textbf{X} {| 0\rangle }\).
-
For each weight \(\mathcal {C}_{ij}\) between two nodes \(q_i\) and \(q_j\), we rotate the cost qubit by \(R_y (-2\hat{\mathcal {C}}_{ij})\) if the corresponding working qubits are in the same state, and by \(R_y (2\hat{\mathcal {C}}_{ij}) = \textbf{X} \cdot R_y (-2\hat{\mathcal {C}}_{ij}) \cdot \textbf{X}\) if they are not.
-
For each unary weight \(\mathcal {C}_{ii}\) involving node \(q_i\), we rotate the cost qubit by \(R_y (-2\hat{\mathcal {C}}_{ii})\) if the corresponding working qubit is in the \({| 0\rangle }\)-state, and by \(R_y (2\hat{\mathcal {C}}_{ii}) = \textbf{X} \cdot R_y (-2\hat{\mathcal {C}}_{ii}) \cdot \textbf{X}\) if it is not.
Whenever the unary costs satisfy \(\mathcal {C}_{ii} = 0\) for all i, we refer to Eq. (14) as the Universal Quantum Maximum Cut (uq maxcut) model, else as the Universal Quantum Ising (uqising) model.
3.2 Optimization
3.2.1 Workflow
The overall workflow of our algorithm is presented in Fig. 2. The complete circuit consists of 3 registers: a 1-qubit register containing an ancilla qubit, another 1-qubit register for the cost qubit, and an n-qubit register for the working qubits encoding the variables of the problem.
First, the working qubits are rotated by a set of angles \(\Theta = (\theta _1, \ldots , \theta _n) \in \mathbb {R}^n\), constructing the ansatz
from a system previously prepared in the state \({| \psi \rangle }\). Here, the qubit \(q_i\), representing the ith node of the graph, is rotated by \(\theta _i\) around the y-axis. Next, a Hadamard sandwich involving a controlled version of the \(\textbf{U}\)-operator is applied to the ancilla qubit. Finally, according to the principle of implicit measurement of quantum computing [27, Section 4.4] stating that all qubits that are not measured at the end of a quantum circuit can be assumed to be measured, only the ancilla qubit is measured, leaving the cost and working qubits in the state \(\textbf{U} {| 0, \psi (\theta )\rangle } = {| 0\rangle }_c \otimes \sin (\hat{\textbf{C}}){| \psi (\theta )\rangle } + {| 1\rangle }_c \otimes \cos (\hat{\textbf{C}}){| \psi (\theta )\rangle }\).
Our goal is to solve the variational problem
i.e., to find a set of angles \(\Theta ^\star \) such that \({| \psi (\Theta ^\star )\rangle } = {| q^\star \rangle }\). The cost \(\mathcal {L} (\Theta )\) can be efficiently calculated by \(\mathcal {L} (\Theta ) = p(0) - p(1)\), where p(0) and p(1) are the probabilities of measuring the ancilla qubit in the \({| 0\rangle }\) and \({| 1\rangle }\) state, see “Appendix B” for the proof. This allows us to compute the expectation \(\langle \textbf{U}\rangle \) without having to measure \(\langle 0, \psi (\Theta )|\) needed for the scalar product. For the optimization, several approaches [28, 29] have fortunately been developed for evaluating or approximating gradients and improving optimization on quantum computers.
3.2.2 Parameter shift rule
The parameter shift rule [28] is a simple but exact method for evaluating the analytical gradient of a function given in the form of an expectation as in Eq. (16) on quantum hardware. For computing the partial derivative with respect to the parameter \(\theta _i\), it uses two function evaluations with the parameter \(\theta _i\) shifted by \(\pm \pi /2\):
where \(\Theta _{i\pm }:= (\theta _1, \ldots , \theta _{i-1}, \theta _i \pm \pi /2, \theta _{i+1}, \ldots , \theta _n)\). The parameter shift rule uses the same circuit as the function evaluation, but allows to compute the exact gradient. Its drawback is that it requires 2n function evaluations to compute the gradient of \(\mathcal {L}\).
3.2.3 Update rule
The circuits (uq maxcut, uqising) are optimized by normalized gradient descent with decreasing step size. Normalized gradient descent has recently been proposed by Suzuki et al. [30] as a powerful optimization strategy for variational quantum algorithms. Specifically, in [30] it is demonstrated experimentally that normalized gradient steps are more effective in escaping non-global minima than gradient steps. It is also known [31] that normalized gradient descent evades saddle points. At each iteration k, our update rule reads
The design of the update rule Eq. (18) is motivated by the following consideration: We know that we produce bit-strings by either flipping \({| q_i\rangle }\) or not, thus \(\theta _i^\star = \ell \pi , \ell \in \mathbb {Z}\). At iteration \(k=0\), the update rule Eq. (18) allows each \(\theta _i\) to get updated to \(\theta _i^{1} \leftarrow \theta _i^0 \pm \pi \cdot g_i/2\), where \( g_i \in [-1, 1]\) is the normalized contribution of \(\theta _i^0\) in the loss \(\mathcal {L} (\Theta )\). Subsequently, we let the step size decay exponentially to zero when approaching the maximum number of iterations \(k_{max}\). Given the noisy nature of quantum measurement, it is helpful that this frees us from the difficult task of determining a stopping condition for the algorithm.
3.3 Scalability and computational complexity
For a given graph \( \mathcal {G} = (\mathcal {S}, \mathcal {E}, \mathcal {C}) \), the circuit construction presented in Fig. 1 requires at most one not-gate, \(|\mathcal {E}|\) single-qubit rotation gates and \(4|\mathcal {E}| - 2|\mathcal {S}|\) cnot gates. Note that since the edges can be treated in arbitrary order, two consecutive cnot gates that have the same control qubit cancel each other out as their product is the identity, further reducing the number of required cnot gates.
The controlled-\(\textbf{U}(\textbf{C}, K)\) gate appearing in Fig. 2 and conditioned by the ancilla qubit \({| \cdot \rangle }_a\) can be fully decomposed into single qubit rotations and cnot gates without using any Toffoli gates. To see this, note that it can be expressed as
In particular, in order to control the complete \(\textbf{U}(\textbf{C}, K)\) gate, it suffices to control only the rotation gate \(R_y\).
Finally, the controlled rotation itself can be decomposed into two cnot gates and two single qubit rotations as
Table 1 recapitulates the main differences between uqising and a conventional qaoa of depth p. It shows that for \(p \ge 3\), qaoa requires more quantum resources than our uqising model. Further, physically mapping the qaoa ansatz onto the quantum hardware has to take into account a graph-dependent qubit connectivity, while our method, independently of the input graph, requires that one qubit (the cost qubit) is connected to all other qubits (ancilla and working qubits).
3.4 Impact of the K-rescaling
When applying the method, it is important to appropriately choose the constant \(K\in \mathbb {R}\) for re-scaling the costs \(\hat{\textbf{C}} = \textbf{C}/K\). The goal is to fix K such that \(\textrm{diag}(\hat{\textbf{C}}) \in [-\pi /2, \pi /2]^{2^n}\), as this guarantees that \(\sin (\textrm{diag}(\hat{\textbf{C}}))\) preserves the order of the original costs in \(\textrm{diag}(\textbf{C})\).
Although it is tempting to choose \(K \gg \mathcal {C}_{max}:= \max _k \textbf{C}_{kk}\), where \(\textbf{C}_{kk}\) denotes the kth diagonal element, recall that in Eq. (9) we have \(\lim _{K \rightarrow \infty } \textbf{U}(\textbf{C}, K) = \textbf{X}^{\otimes (n+1)}\), i.e., the larger K, the more shots are required to accurately measure the entries of \(\sin (\hat{\textbf{C}})\). Fortunately, in many problems an upper bound to the maximal cost \(\mathcal {C}_{max}\) can be computed from the original weights without knowing \(\textbf{C}\). For example, choosing
guarantees an error-free transformation of the initial problem (5) into the equivalent problem (16). As presented for 10-node graph examples in Fig. 3, the choice of smaller values for K entails that the approximation of \(\arccos (\sin x)\) by \(\pi /2 - x\), used in Eq. (13), is inaccurate, with more pronounced error in the largest absolute entries of \(\textrm{diag}(\hat{\textbf{C}})\). Notably, the solution of the transformed problem will only be a solution of the initial problem as long as \(\arg \min _k \hat{\textbf{C}}_{kk} = \arg \min _k \textbf{C}_{kk}\).
4 Experimental results
In order to validate the practical usefulness of uq maxcut and uqising, we benchmark against two state-of-the-art approaches for solving binary combinatorial optimization with quantum computing: qaoa for the gate-based model and d-wave solvers for the adiabatic model. Random graphs in the experiments are generated using the Python language package NetworkX [32]. The unary and quadratic edge weights are all randomly and uniformly chosen in the range [1, 10] and the graphs are all fully connected. The gate-based circuits in the experiments (uq maxcut, uqising, qaoa) are implemented in Python and simulated in a noise-free framework using the QisKit library and the IBM-QASM simulator [33]. For the adiabatic model, d-wave solvers that run on the actual quantum hardware are used. d-wave quantum annealers are made available through the leap quantum cloud service [34], and the d-wave quantum algorithms can be implemented in Python using the Ocean software [35]. We perform 1024 measurement shots for all gate-based algorithms. On d-wave [1], the experiment is run with the default annealing time of \(20\mu s\) and 50 sample reads on the advantage topology.
4.1 Benchmark metrics
We denote by \({| q^\star \rangle }\) the ground-truth global minimizer and by \({| \psi ^\star \rangle }\) the ground-state proposal of each method (uq maxcut/uqising, qaoa and d-wave). In the experiments, we adopt the following two metrics to evaluate the performance of the methods:
-
The approximation ratio
$$\begin{aligned} r(\psi ^\star ):= \dfrac{\langle \psi ^\star | \textbf{C} | \psi ^\star \rangle - \mathcal {C}_{max}}{\mathcal {C}_{min}- \mathcal {C}_{max}} \end{aligned}$$(23)informs about the quality of the result, i.e., how confident the method is with its solution proposal and how far the cost of this proposal is from the minimal cost \(\mathcal {C}_{min}:= \min _k \textbf{C}_{kk}\), cf. [36]. All the terms appearing in r are classically evaluated. It holds \(0 \le r(\psi ^\star ) \le 1\) and \(r(\psi ^\star )=1\) iff \(\psi ^\star = q^\star \).
-
The approximation index
$$\begin{aligned} i(\psi ^\star ):= \textbf{1}_{\langle q^\star | \textbf{C} | q^\star \rangle = \langle q_{max} | \textbf{C} | q_{max}\rangle } \end{aligned}$$(24)is a Boolean variable that indicates whether the most likely state \({| q_{max}\rangle }\) obtained with probability \(\left| \alpha _{max} \right| ^2\) is the desired state \({| q^\star \rangle }\). It holds \(i(\psi ^\star ) = 1\) if \(\langle q^\star | \textbf{C} | q^\star \rangle = \langle q_{max} | \textbf{C} | q_{max}\rangle \) and 0 otherwise. Note that this differs from the usual approach of d-wave, where the sampled state with the minimal energy is regarded as the best solution proposal.
4.2 Benchmarking UQMaxCut against QAOA and D-Wave
4.2.1 Symmetric solutions and entanglement
Before discussing the results on the maxcut problem, it is important to notice that for maxcut, solutions always exist in symmetric pairs. Specifically, for a basis state solution \({| q^\star \rangle } = \bigotimes _{i=1}^n{| q_i^\star \rangle }, q_i^\star \in \left\{ 0, 1\right\} \), the basis state \({| \bar{q}^\star \rangle }:= \bigotimes _{i=1}^n{| 1-q_i^\star \rangle }\) is also a solution. This feature can be enforced by introducing the entanglement circuit given in Fig. 4 after the rotation layer in Fig. 2. The circuit has the matrix representation
Also, this entanglement allows to optimize over \(n-1\) angles instead of n. Without entanglement our method just outputs one of the two possible solutions. In the experiments, uq maxcut is used without entanglement; if the algorithm outputs either one of the two solutions, we set the approximation index to 1.
4.2.2 Benchmark results
For the outer optimization algorithm of the uq maxcut circuit, we use normalized gradient descent as described in Section 3.2. Other optimization algorithms such as vanilla gradient descent or adaptive moment estimation (adam [37]) could be used as well. However, it proved difficult in the experiments to find suitable step sizes for those methods, so we leave them for future research.
The qaoa layers depth in the experiments is set to \(p = \lceil n/2 \rceil \) to allow for a fair comparison, as then both qaoa and uq maxcut optimize over approximately n real variables. We also attempted to optimize qaoa using the same optimizer as for uq maxcut, but the results were not competitive. Hence, we also show the results of qaoa when using a gradient-free optimizer; we used the cobyla solver [38] available in the scipy library [39].
The results are presented in Fig. 5 for fully connected graphs of \(n=3, 5\) and 10 nodes with strictly positive edge weights. For each n, the results are averaged over 20 graph instances and all algorithms are tested on the same instances. The angles for uq maxcut and qaoa are all initialized to 0.
The approximation ratio for the three methods (qaoa, d-wave, uq maxcut) is not adversely affected by the number of variables n, but the approximation index drops sharply as the size of the problem increases. The gradient-free Cobyla-optimized version of qaoa performs much better than qaoa with gradient descent. The latter is on average as good as d-wave regarding the approximation ratio. We conjecture that the gradient-based optimization of qaoa often gets trapped by saddle points of the qaoa loss function landscape. In contrast, uq maxcut clearly outperforms the two qaoa variants and can compete with d-wave in producing good approximate solutions. Furthermore, the approximation index demonstrates that it returns a global minimizer significantly more often than qaoa and less often than d-wave whose architecture is specifically designed to solve such problems.
4.3 Benchmarking UQIsing against D-Wave
For the Ising model, we benchmark the proposed uqising algorithm against the d-wave annealer, the adiabatic quantum computer specialized in solving this type of problems. The variational circuit for uqising is optimized in the same way as for uq maxcut, see Sect. 3.2, with the exception that the initial angles are set to \(\pi /2\) instead of 0. The results are depicted in Fig. 6.
Notable differences in performance between uqising and d-wave are consistent with the MaxCut experiments. Specifically, its high approximation ratio, similar to that of d-wave, indicates that uqising always produces either globally optimal solutions or extremely good approximations thereof. On the other hand, the approximation index shows that d-wave identifies a globally optimal solution more often than uqising.
5 Conclusion
We have presented a new low-depth quantum circuit to tackle two important combinatorial problems on universal quantum machines. The resulting Universal Quantum maxcut (uq maxcut) approach outperforms the state-of-the-art quantum approximate optimization algorithms (qaoa) by the lower depth, by the computed approximation ratios and by a higher probability of outputting optimal solutions. It also challenges the d-wave-quantum annealers that are specifically designed to solve such combinatorial problems; on the maxcut as well as on the Ising spin model, uq maxcut, respectively, uqising can compete with d-wave in producing globally optimal solutions.
We believe that the proposed approach enables the design of new methods for solving practically-sized problems on universal quantum machines. Inspired by the novel operator \(\textbf{U}\), future work should focus on designing fully universal algorithms without the classical outer optimization loop, replacing the latter with fully universal methods like for example Grover’s search [8].
Data Availability
The authors declare that the data supporting the findings of this study are random graphs that can be generated using the Python language package NetworkX [32].
Code Availability
Python source code for the algorithm and experiments is available at https://github.com/nkuetemeli/UQMaxCutAndIsing.
References
D-Wave Systems: QPU Solver Datasheet. https://docs.dwavesys.com/docs/latest/doc_qpu.html (2022)
Benkner, M.S., Lähner, Z., Golyanik, V., Wunderlich, C., Theobalt, C., Moeller, M.: Q-match: iterative shape matching via quantum annealing. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 7586–7596 (2021)
Kuete Meli, N., Mannel, F., Lellmann, J.: An iterative quantum approach for transformation estimation from point sets. In: Computer Vision and Pattern Recognition, pp. 529–537 (2022)
Groppe, S., Groppe, J.: Optimizing transaction schedules on universal quantum computers via code generation for Grover’s search algorithm. In: Proceedings of the 25th International Database Engineering & Applications Symposium, pp. 149–156 (2021)
Uotila, V.J.E.: Synergy between quantum computers and databases. In: Proceedings of the VLDB 2022 PhD Workshop Co-located with the 48th International Conference on Very Large Databases (VLDB 2022) (2022)
Bennett, C.H.: Logical reversibility of computation. IBM J. Res. Dev. 17(6), 525–532 (1973)
Shor, P.W.: Polynomial-time algorithms for prime factorization and discrete logarithms on a quantum computer. SIAM Rev. 41(2), 303–332 (1999)
Grover, L.K.: A fast quantum mechanical algorithm for database search. In: Proceedings of the Twenty-Eighth Annual ACM Symposium on Theory of Computing, pp. 212–219 (1996)
McCoy, B.M., Wu, T.T.: The Two-Dimensional Ising Model. Courier Corporation, Cambridge (2014)
Goemans, M.X., Williamson, D.P.: Improved approximation algorithms for maximum cut and satisfiability problems using semidefinite programming. J. ACM (JACM) 42(6), 1115–1145 (1995)
Lucas, A.: Ising formulations of many np problems. Front. Phys. 2, 5 (2014)
Kolmogorov, V., Zabin, R.: What energy functions can be minimized via graph cuts? IEEE Trans. Pattern Anal. Mach. Intell. 26(2), 147–159 (2004)
Ding, C.H., He, X., Zha, H., Gu, M., Simon, H.D.: A min-max cut algorithm for graph partitioning and data clustering. In: Proceedings 2001 IEEE International Conference on Data Mining, pp. 107–114 (2001)
Barahona, F., Grötschel, M., Jünger, M., Reinelt, G.: An application of combinatorial optimization to statistical physics and circuit layout design. Oper. Res. 36(3), 493–513 (1988)
Albash, T., Lidar, D.A.: Adiabatic quantum computation. Rev. Mod. Phys. 90(1), 25 (2018). https://doi.org/10.1103/revmodphys.90.015002
Jansen, S., Ruskai, M.-B., Seiler, R.: Bounds for the adiabatic approximation with applications to quantum computation. J. Math. Phys. 48(10), 102111 (2007). https://doi.org/10.1063/1.2798382
Zener, C.: Non-adiabatic crossing of energy levels. Proc. R. Soc. Lond. Ser. A Contain. Pap. Math. Phys. Character 137(833), 696–702 (1932)
Somma, R., Boixo, S.: Spectral gap amplification. SIAM J. Comput. 42(2), 593–610 (2013)
Farhi, E., Goldstone, J., Gutmann, S.: A quantum approximate optimization algorithm. arXiv preprint arXiv:1411.4028 (2014)
Guerreschi, G.G., Matsuura, A.Y.: QAOA for Max-Cut requires hundreds of qubits for quantum speed-up. Sci. Rep. 9(1), 1–7 (2019)
Hadfield, S., Wang, Z., O’Gorman, B., Rieffel, E.G., Venturelli, D., Biswas, R.: From the quantum approximate optimization algorithm to a quantum alternating operator ansatz. Algorithms 12(2), 34 (2019)
Herrman, R., Ostrowski, J., Humble, T.S., Siopsis, G.: Lower bounds on circuit depth of the quantum approximate optimization algorithm. Quantum Inf. Process. 20(2), 1–17 (2021)
Shaydulin, R., Alexeev, Y.: Evaluating quantum approximate optimization algorithm: a case study. In: 2019 Tenth International Green and Sustainable Computing Conference (IGSC), pp. 1–6 (2019)
Kadowaki, T., Nishimori, H.: Quantum annealing in the transverse Ising model. Phys. Rev. E 58, 5355–5363 (1998). https://doi.org/10.1103/PhysRevE.58.5355
Gilyén, A., Su, Y., Low, G.H., Wiebe, N.: Quantum singular value transformation and beyond: exponential improvements for quantum matrix arithmetics. In: Proceedings of the 51st Annual ACM SIGACT Symposium on Theory of Computing, pp. 193–204 (2019)
Camps, D., Beeumen, R.V.: Fable: Fast approximate quantum circuits for block-encodings. In: 2022 IEEE International Conference on Quantum Computing and Engineering (QCE), pp. 104–113 (2022). https://doi.org/10.1109/QCE53715.2022.00029
Nielsen, M.A., Chuang, I.L.: Quantum Computation and Quantum Information, 10th edn. Cambridge University Press, Cambridge (2010)
Mitarai, K., Negoro, M., Kitagawa, M., Fujii, K.: Quantum circuit learning. Phys. Rev. A 98(3), 032309 (2018)
Spall, J.C.: An overview of the simultaneous perturbation method for efficient optimization. Johns Hopkins APL Tech. Digest 19(4), 482–492 (1998)
Suzuki, Y., Yano, H., Raymond, R., Yamamoto, N.: Normalized gradient descent for variational quantum algorithms. In: 2021 IEEE International Conference on Quantum Computing and Engineering (QCE), pp. 1–9 (2021)
Murray, R., Swenson, B., Kar, S.: Revisiting normalized gradient descent: fast evasion of saddle points. IEEE Trans. Autom. Control 64(11), 4818–4824 (2019). https://doi.org/10.1109/TAC.2019.2914998
Hagberg, A., Swart, P., Chult, D.S.: Exploring network structure, dynamics, and function using networkX. Technical report, Los Alamos National Lab. (LANL), Los Alamos, NM (United States) (2008)
Cross, A.: The IBM Q experience and QISKit open-source quantum computing software. In: APS March Meeting Abstracts, vol. 2018, pp. 58–003 (2018)
D-Wave Systems: D-Wave Leap. https://www.dwavesys.com/take-leap (2021)
D-Wave Systems: D-Wave Ocean Software Documentation. https://docs.ocean.dwavesys.com/ (2021)
Willsch, M., Willsch, D., Jin, F., De Raedt, H., Michielsen, K.: Benchmarking the quantum approximate optimization algorithm. Quantum Inf. Process. 19(7), 1–24 (2020)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)
Powell, M.J.D.: A direct search optimization method that models the objective and constraint functions by linear interpolation. In: Advances in Optimization and Numerical Analysis, pp. 51–67. Springer, Dordrecht (1994)
Virtanen, P., Gommers, R., Oliphant, T.E., Haberland, M., Reddy, T., Cournapeau, D., Burovski, E., Peterson, P., Weckesser, W., Bright, J., et al.: Scipy 1.0: fundamental algorithms for scientific computing in python. Nat. Methods 17(3), 261–272 (2020)
Acknowledgements
The authors thank IBM and d-wave for the freely available cloud devices and simulators which helped testing the algorithms.
Funding
Open Access funding enabled and organized by Projekt DEAL. Open Access funding enabled and organized by Projekt DEAL. Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no financial or non-financial competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix A: Geometric interpretation of \(\textbf{U}_{2\times 2}\)
We want to show that the operator
is up to a phase factor a reflection in the Bloch sphere about the axis \(\vec n = (1/2) \left[ \begin{array}{lll}\cos \langle \hat{\textbf{C}}\rangle&0&\sin \langle \hat{\textbf{C}}\rangle \end{array}\right] ^\top \). More precisely, we show that
Proof For \(\sin \langle \hat{\textbf{C}}\rangle = \pm 1\), we have \(\textbf{U}_{2\times 2} = \pm \textbf{Z}\), so Eq. (A2) holds. In the remainder of the proof, we can therefore assume \(\sin \langle \hat{\textbf{C}}\rangle \ne \pm 1\). The matrix \(\textbf{U}_{2\times 2}\) has the spectral decomposition \(\textbf{U}_{2\times 2} = \textbf{X}^\top \varvec{\Lambda } \textbf{X}\), where \(\varvec{\Lambda } = \text {diag}([-1 \; 1])\) is the diagonal matrix of eigenvalues of \(\textbf{U}_{2\times 2}\) and \(\textbf{X} = \left[ \begin{array}{l}{| \psi _{\_}\rangle } \; | \; {| \psi _{+}\rangle }\end{array}\right] \) is the matrix containing its eigenvectors
Let \(\varvec{\Sigma } = \text {diag}([\pi \; 2\pi ])\), so \(\varvec{\Lambda } = \exp {(i\varvec{\Sigma })}\). The generalization of the exponential map to normal matrices [27, Section 2.1.8] implies that \( \textbf{U}_{2\times 2} = \exp (i \textbf{X}^\top \varvec{\Sigma } \textbf{X}) \). Now we express \(\textbf{X}^\top \varvec{\Sigma } \textbf{X}\) in the basis formed by the Pauli matrices:
We can thus rewrite \(\textbf{U}_{2\times 2}\) as \(\textbf{U}_{2\times 2} = \exp (i \alpha ) \exp (i \vec v \cdot \vec {\varvec{\Sigma }})\), where \(\alpha = 3\pi / 2\) and \(\vec v = (1/2) \left[ \begin{array}{lll}\pi \cos \langle \hat{\textbf{C}}\rangle&0&\pi \sin \langle \hat{\textbf{C}}\rangle \end{array}\right] ^\top \). The notation \(\vec v \cdot \vec {\varvec{\Sigma }}:= v_x \cdot \textbf{X} + v_y \cdot \textbf{Y} + v_z \cdot \textbf{Z}\) denotes a linear combination of Pauli matrices. With a slight change of variables \(\theta = -2 \Vert \vec {v} \Vert _2 = - \pi \) and \(\vec n = \vec v / \Vert \vec {v} \Vert _2 = \left[ \begin{array}{lll} \cos \langle \hat{\textbf{C}}\rangle&0&\sin \langle \hat{\textbf{C}}\rangle \end{array}\right] ^\top \), this is equivalent to \(\textbf{U}_{2\times 2} = \exp (i \alpha ) \exp (-i (\theta /2) \vec n \cdot \vec {\varvec{\Sigma }})\).
The global phase \(\exp (i \alpha )\) can be ignored in the implementation as it has no effect on the measurement. The exponential term \(\exp (-i (\theta /2) \vec n \cdot \vec {\varvec{\Sigma }})\) corresponds to a rotation of angle \(\theta \) of the Bloch sphere about the axis \(\vec {n}\). As \(\vec {n}\) lies in the xz-plane and we have \(\theta = -2 \Vert \vec {v} \Vert _2 = -\pi \), it follows that \(\textbf{U}_{2\times 2}\) is a reflection about \(\vec {n}\). Equivalently, Eq. (A2) holds.
Appendix B: Implicit measurement
Let \(\textbf{U}\) be a Hermitian and unitary operator, so \(\textbf{U}\) can be used both as quantum gate and measurement observable. According to the implicit measurement principle of quantum computation [27, Section 4.4], all qubits that are not measured at the end of a quantum circuit can be assumed to be measured. Specially, the measurement of the first qubit of the circuit in Fig. 7 performs a measurement of the observable \(\textbf{U}\) for a system prepared in the state \( {| \psi _{in}\rangle } \).
Proof As \(\textbf{U}\) is both Hermitian and unitary, its eigenvalues are both real and unitary, and therefore either \(+1\) or \(-1\). Then, the operators \(\textbf{P}_\pm = \dfrac{1}{2}(\textbf{I} \pm \textbf{U})\) are orthogonal projectors into the eigenspaces of \(\textbf{U}\) to the eigenvalues \(\pm 1\). Evidently, \(\textbf{U}\) can be expressed as \(\textbf{U}=\textbf{P}_+ - \textbf{P}_-\). Before measurement, the system is in the state
A measurement of the first qubit in the standard basis, with the measurement operators \( \textbf{P}_0 = {| 0\rangle }\langle 0|\otimes \textbf{I} \) and \( \textbf{P}_1 = {| 1\rangle } \langle 1| \otimes \textbf{I} \), yields the probabilities
and leaves the system in the post-measurement state
if the output 0 is observed, respectively,
if the output 1 is observed.
Thus, the second register of the circuit is in post-measurement state of the observable \(\textbf{U}\) on a trial state \( {| \psi _{in}\rangle } \). Its expectation is
which shows that we can evaluate the expectation \(\langle \textbf{U}\rangle \) by measuring a single qubit. This is a crucial feature of our algorithm, as it makes it possible to accurately approximate \(\langle \textbf{U}\rangle \) without having to sample \(\langle \psi _{in}|\) needed for the scalar product. Also, it allows to use a moderate number of measurements that is independent of the number of qubits in \( {| \psi _{in}\rangle } \).
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Kuete Meli, N., Mannel, F. & Lellmann, J. A universal quantum algorithm for weighted maximum cut and Ising problems. Quantum Inf Process 22, 279 (2023). https://doi.org/10.1007/s11128-023-04025-x
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11128-023-04025-x