Simple Mitigation of Global Depolarizing Errors in Quantum Simulations
←
→
Page content transcription
If your browser does not render page correctly, please read the page content below
Simple Mitigation of Global Depolarizing Errors in Quantum Simulations Joseph Vovrosh*,1 Kiran E. Khosla,1 Sean Greenaway,1 Christopher Self,1 M. S. Kim,1 and Johannes Knolle2, 3, 1 1 Blackett Laboratory, Imperial College London, London SW7 2AZ, United Kingdom 2 Department of Physics TQM, Technische Universität München, James-Franck-Straße 1, D-85748 Garching, Germany 3 Munich Center for Quantum Science and Technology (MCQST), 80799 Munich, Germany To get the best possible results from current quantum devices error mitigation is essential. In this work we present a simple but effective error mitigation technique based on the assumption that noise in a deep quantum circuit is well described by global depolarizing error channels. By measuring the errors directly on the device, we use an error model ansatz to infer error-free results from noisy data. We highlight the effectiveness of our mitigation via two examples of recent interest in quantum many-body physics: entanglement measurements and real time dynamics of confinement in quantum spin chains. Our technique enables us to get quantitative results from the IBM quantum computers showing signatures of confinement, i.e. we are able to extract the meson arXiv:2101.01690v2 [quant-ph] 28 Apr 2021 masses of the confined excitations which were previously out of reach. Additionally, we show the applicability of this mitigation protocol in a wider setting with numerical simulations of more general tasks using a realistic error model. Our protocol is device-independent, simply implementable and leads to large improvements in results if the global errors are well described by depolarization. INTRODUCTION Quantum computers are becoming large enough (∼ 50 − 100 qubits [1]) to, in principle, allow demonstrations of their ‘Quantum Advantage’ [2, 3]. However, the actual amount of entanglement that can be generated in current devices is constrained by noise and errors, limiting their ability to solve complex problems such as quantum simulation. To address this, various error mitigation strategies have recently been de- veloped to counteract noise and boost the fidelity of experi- mental results. Error mitigation differs from fault tolerance. Fault tolerant quantum computers will eventually be able to suppress errors by encoding quantum information in a redundantly large num- ber of qubits as error correcting codes [4, 5]. In this way they will be able to execute arbitrarily deep circuits by the repeated application of active error corrections. Unfortunately, these encodings cannot be used in current devices as they require smaller hardware errors and larger numbers of qubits than are currently available. In contrast, error mitigation strategies are applied to unencoded physical qubits. Rather than actively correcting errors, they aim to estimate what the effect of the error was and infer the error-free result. This current stage FIG. 1. Results from the IBM device Paris [6] of the second order in the development of quantum computers has been dubbed Rényi entanglement entropy. Quench dynamics before and after error mitigation of the second order Réyni entropy for the trans- the noisy intermediate-scale quantum (NISQ) era [1] and is verse field Ising model with varying longitudinal field strengths. expected to last for the foreseeable future. Here, J = 1, hx = 0.5 and n = 6. Note, 800 random unitaries In the past few years, error mitigation has been a thriving were used to collect this data giving an uncertainty in the purity of research direction as more and more quantum devices become δρ ∼ O(10−2 ). Clearly the mitigation allows results to go from available. Error mitigation strategies typically address mea- qualitative agreement to quantitative agreement with the results ob- surement errors [7–15] or the algorithms and gates employed tained numerically through exact diagonalization (ED). for digital quantum simulation [16–23]. While some of these techniques have proven useful for certain quantum algorithms, their general success is hampered by the fact that they either these methods train classical computers to predict the error rely on a high level of control of the quantum device itself, seen in a quantum device and use the results to infer the error- or are specifically designed for a given quantum simulation free quantum simulations. While successful, these methods problem, i.e. exploiting specific symmetries [24]. require a large increase in the classical computational over- One promising direction is to employ machine learning al- head and are somewhat uncontrolled. The latter drawback is gorithms [25–27] for error mitigation. Generally speaking, also true for popular protocols based on the idea of increas-
2 ing errors in the device systematically and then extrapolating supplementary material). Last but not least, the large improve- back to the zero error case [21, 28–34]. In general, how to ment in the quality of results on the IBM device motivate our tune the error rates varies from device to device and reliable choice of depolarizing errors a posteriori. fitting requires expert knowledge of the specific hardware. An n-single qubit depolarizing error channel can be mod- Here, we propose a new protocol for gate error mitiga- elled via tion combining a raft of desirable features: it is easily imple- X X n p mentable on any quantum device with little increase in work- E ⊗n (ρ) = (1 − p)n ρ + (1 − p)n−1 σαj ρσαj + ... 3 load, it is well rooted in a mathematical description of the α∈[x,y,z]j=1 errors, and it is suitable for any quantum algorithm of inter- (2) est. Furthermore, as we show via specific examples, it can where E is the error channel, p is the probability of an error lead to to large improvements in the performance of quantum occurring for each qubit (assumed to be equal for each qubit, devices, for example see Fig. 1 for Rényi entropy results. and for each Pauli error) and ‘...’ indicates higher-order terms The paper is organised as follows: First, we derive an ansatz corresponding to errors on multiple qubits [36]. One impor- for the density matrix resulting from the action of a noisy tant feature of this mathematical formulation is that the second quantum simulator with global depolarizing errors. Follow- term, which describes the depolarizing error, commutes with ing this, we explain how this ansatz can be used for a general any unitary operator. Consequently, the error on the ith qubit error mitigation protocol. We then demonstrate its effective- in a quantum circuit with purely depolarizing errors is ness by studying real time dynamics of the transverse field Ii Ising model (TFIM) with a longitudinal field. In that context, E i (ρ) = (1 − pi )ρ + p Tri [ρ] ⊗ (3) 2 some of the authors recently showed that signatures of con- in which ρ is the density matrix, Ii /2 is the maximally mixed finement and entanglement spreading can be observed on the (i.e. completely depolarized) state for the ith qubit, Tri is the IBM quantum computer [35]. With our new error mitigation partial trace over the ith qubit and pi is the error on the ith protocol we are able to measure the meson masses of confine- qubit. ment induced bound states directly on the IBM device and can Instead of dealing with all combinations of single qubit er- quantitatively measure entanglement spreading previously out rors, we approximate the total error channel of Eq. (2), under of reach. Furthermore, we corroborate these results with clas- the assumption of symmetric depolarization Eq. (3), as an ef- sical simulations of error models taken from an IBMQ device fective depolarizing channel on the entire quantum state in a more general setting to show the wide applicability of our mitigation scheme. Finally, we close with a discussion and I ⊗n ρ = (1 − ptot )ρexact + ptot (4) outline future applications. 2n where the effective total error probability Q is ptot . In princi- ple ptot is well approximated by i (1 − pi ), however, we ERROR MITIGATION PROTOCOL do not make that identification here. We will soon show that ptot can be measured directly on the device. The simplicity Error channels can be conveniently modelled through of this ansatz allows it to be easily calculated. It has already Kraus operators [36] which define a completely positive map been shown to be useful when mitigating measurement error on the density matrix [37], however we later demonstrate that it is powerful tool for mitigating global depolarizing errors which do not them- Ki ρKi† such that Ki† Ki = 1. X X ρ→ (1) selves originate from local depolarizing errors. Furthermore, i i we later numerically demonstrate that this ansatz does not rely on the assumption of single- and two-qubit depolarizing er- For a single qubit, choosing four Kraus operators to be pro- rors. portional to the Pauli operators (identity included) defines a The many partial traces over single qubits, which would depolarizing channel. The proportionality constants are re- have conserved some coherence in the remaining qubits, have lated to the individual error probabilities and must satisfy the been replaced by the maximally mixed state I ⊗n /2n over the identity constraint to preserve the trace of ρ. global quantum state, destroying all coherence. We stress that Here, we concentrate on such depolarizing errors which are even though this may not be a good approximation for a sin- inevitably present in any digital quantum simulator platform gle layer of qubit errors, it becomes a reasonable approxima- and can be treated without much specific knowledge about the tion for the error channel of a many layer, many qubit circuit. device performance (which also fluctuates over time). More- Eq. (4) is our basic ansatz for an effective error model after over, our focus on depolarizing errors has the advantage of a many-layered unitary circuit and ρexact is the exact density being treatable mathematically in a controlled way as detailed operator without noise (see Supplementary Material for more below. In fact, given a deep enough quantum circuit (with details). With this ansatz one can analytically calculate the sufficient qubits) for self-averaging of incoherent errors to oc- effect of errors on a measured observable, Ô, via cur, this depolarizing error model is a good approximation to h i p tot the physical errors in a device (more details are given in the hÔi = (1 − ptot ) Tr Ôρexact + n Tr Ô . (5) 2
3 Our approach estimates ptot in order to apply error mitiga- our quantum circuit approximates the time evolution opera- tion. We propose and test two approaches for finding ptot , the tor U (t) = exp{−iHt} where we wish to measure hO(t)i first based on estimating the purity of the final state and the for a range of times. We can tune the circuit such that second based on studying specific observables. To estimate t ∗ (Emax ) =
4 magnetization, which further simplifies with Tr[σ α ] = 0 in Eq. (5) to 1.00 hσiα i = (1 − ptot )hσiα iexact . (8) 0.95 The time dependence can be calculated by applying a quan- ED tum circuit from a Trotterisation of the time evolution oper- Mitigated Fit ator (for details of the implementation and quantum circuits NT = 6 see Refs. [24, 35]). Increasing the number of trotter steps, 0.90 NT = 5 hσ z i Mitigated Data Nt , leads to a deeper circuit and the ensuing increase in errors results in a peculiar dampening of the magnetization dynam- ics which can be removed via our error mitigation. In Fig. 0.85 2 we display the results of the local magnetization dynamics. Our mitigation technique not only allows us to measure the main oscillation frequency on the IBM device but in fact to do 0.80 so with quantitative agreement with the analytical results (the latter is explained in Ref. [35]). We note that an additional simplification can be used for the 0 1 2 3 local magnetization tJ for added efficiency — no additional mea- surement of Tr ρ2 was required because the result of hσ z i is known at tJ = 0 and thus ptot can be inferred from the FIG. 2. Results from the IBM machine before and after mitiga- corresponding measurement on the IBM device after running tion. Quench dynamics of local magnetization from Trotterized time the time evolution circuit for tJ ≈ 0. This big simplifica- evolution of the TFIM with longitudinal field before and after error mitigation. Here, J = 1, hx = 0.5, hx = 0.75 and n = 7. Data tion avoids the costly randomized measurement scheme and is shown for, Nt , the number of trotter steps Nt = 5, 6. Results go should be generally applicable for observables whose value from qualitative agreement to quantitative agreement. After fitting is known at tJ = 0. However, as this does not distinguish a cosine function (dashed green) to the mitigated data (green dots) correlated noise from the uncorrelated noise assumed in our the dominant frequency is clearly captured by the IBM device, for ansatz, it is possible for over-estimations in the mitigation to details see Supplementary Material. Note, more details of the circuit occur, resulting in non-physical results. An example of this is composition of the evolution operator can be found in [35] seen in Fig. 2 at time tJ = 0.1 in which the local magnetiza- tion is measured to be greater than unity. We have obtained a range of results with different number The confining potential between fermions induces non- of trotter steps. Under the assumption that (1 − ptot ) scales as ergodic behaviour [61], which manifests itself through per- (1 − pT )NT , where pT is the error in one trotter step, we can sistent oscillations in the local magnetization, hσiα i (α ∈ extrapolate the results back to the error-free case, see green {x, y, z}) and a slowing down of the entanglement spreading. data points in Fig. 2. We are then in a position to suppress the noise to a level which enables us to extract different meson masses on the IBM device by a basic fit of the main oscilla- Example 1: Measuring Meson Masses tion frequency. In Fig. 3 we show the data for the first three masses obtainable by starting from different initial states, see The frequencies of the oscillating magnetization can be insets. Here, 7 spins are mapped onto 5 qubits resulting in a mapped directly to the energy of the domain wall bound circuit with 5NT + 5 single qubit gates and 8NT CNOT gates states [48], which have a large overlap with chosen initial with NT = 5, 6. Remarkably, we find quantitative agreement states. These so called meson masses are defined as the energy between the mitigated results and the theoretical predictions difference between the lowest excited states and the ground for the scaling of the meson masses with the transverse and state. longitudinal fields [35]. Recent work using a trapped ion quantum simulator to sim- ulate the long-range TFIM model showed how, by choosing a variety of initial states, the meson masses can be measured Example 2: Entanglement Spreading through the persistent oscillations of local magnetization [62]. Previous attempts to perform a similar measurement on a dig- As a second example of we study the suppression of half ital quantum computer have failed for the short ranged TFIM, chain entanglement entropy spreading after the spin chain Eq. (7), because the results are too noisy to resolve the smaller quench. For hz = 0 the entanglement entropy is expected to amplitude of oscillations [35]. As our main result, we show increase linearly from the ballistic spreading of free fermionic that our new error mitigation enables us to obtain the meson excitations [64]. However, with a non-zero longitudinal field masses from the IBM device. this growth is suppressed in a characteristic fashion due to We are mainly interested in the time dependence of the local confinement [47]. To observe this we have implemented the
5 FIG. 3. Results from the IBM device Toronto [63] of meson masses. (a,b) The quench dynamics of the z-axis local magnetization is shown for different initial states. Here, J = 1, hx = 0.75, hx = 0.75 and n = 7. (a) Results calculated via exact diagonalizion. (b) Mitigated results from the IBM device. Here, clear dominant oscillations are extracted that quantitatively agree with the analytically derived values. (c) An illustration of how the masses are extracted from the analytically derived energy levels [35]. (d) A comparison of the masses obtained from the IBM device and the analytically derived values for varying hz showing this quantitative agreement. Note, more details of the circuit composition of the evolution operator can be found in [35] randomized measurement protocol of Ref. [40] on the IBM APPLICATIONS TO MORE GENERAL EXAMPLES device for measuring the second order Réyni entanglement entropy. In Ref. [35] two of us obtained qualitative agree- ment for entanglement dynamics of six spins compared to an To demonstrate that our method works more generally, we exact diagonalization (ED) calculation, but for a quantitative simulate example tasks of measuring the expectation value of agreement a large shift of the results was needed. different operators. We choose a layered, brickwork circuit consisting of a layer of single qubit rotations followed by a By using the ansatz in Eq. (4) we can see that the effect of layer of entangling CNOTs — the same circuit structure as global depolarizing errors on measurements of Tr ρ2A , where for Trotterized evolution (see Supplementary material). How- A is a subsystem in consideration, is ever, instead of time parameterized Trotter evolution, we sim- ply choose random single qubit angles, corresponding to dif- Tr ρ2A =(1 − ptot )2 Tr ρ2A,exact ferent states, and therefore different expectation values. ptot (1 − ptot ) ptot2 (9) In order to obtain a larger amount of data to support our + + nA . 2nA −1 2 mitigation technique, we numerically simulate the layered circuits with noisy gates. Qiskit’s circuit simulator allows one to simulate arbitrary gate-based noise models by speci- If ptot is known, Tr ρ2A,exact can be extracted and the second order Rényi entropy measurement is calculated via fying independent Kraus operators for each single- and two- qubit gates. Using this noisy simulator, we can directly test how gate-level non-depolarizing errors can result in an ef- S (2) (ρexact ) = − log2 (Tr ρ2A,exact ). (10) fective global circuit-level depolarizing error, and how well our mitigation technique works in the presence of local non- Fig. 1 shows how this mitigation protocol eliminates the er- depolarizing errors. To simulate a realistic noise model, ror in the second order Rényi entropy results. With our error we take the Kraus operators directly from the ibmq santiago mitigation protocol and Eq. (9) we now obtain quantitative backend [65]. This error model goes beyond the single gate agreement with ED results for six spins and can account for depolarizing assumption, by including (asymmetric) thermal the large shift of the results. relaxation.
6 In order to test local, non-local, single, and many Pauli- string operators, we have chosen the following operators: sin- gle Z operator (local, single term), TFIM Hamiltonian (local, many terms), Random Pauli string (non-local, single term), and the molecular Hamiltonian of a H4 (non-local, many terms). Here many ’terms refers’ to the number of non- commuting Pauli strings that must be measured to construct the expectation value, and locality refers to only containing single and two-qubit Pauli strings. Note the four hydrogen Hamiltonian (see Supplementary for details) is chosen as a simple molecular test case with a large Hilbert space, and where we can ignore complications from freezing out orbitals. Figure 4 shows unmitigated and mitigated expectation val- ues for each operator, and clearly demonstrates the added value of purity measurements (see Supplementary for simu- lation details). For expectation value mitigation (i.e. using a single expectation value to calibrate ptot ), a single extra cir- cuit is needed. For trace mitigation (using Eq. (9) to calibrate ptot ) uses 500 extra circuits to find ptot ; these are split into FIG. 4. Error mitigation simulations with non-depolarizing er- five groups to reduce the bias. Once ptot is calibrated for a rors. Unmitigated (black diamonds), trace mitigated (green circles) given circuit depth, this single value is used for error mitiga- and single operator mitigated (red circles) expectation values for dif- tion of the same circuit structure, but for different single qubit ferent operators, with the solid blue line showing perfect mitigation. parameter angles. The numerical simulation uses the general error model taken from ibmq santiago. Expectation values are evaluated with respect to pa- As the main results we find that our global depolarising rameterized brick-like circuits with (CNOT) depth 18 (see supp for assumption is an effective error channel for the whole circuit, details), and thirty random parameter points per plot. Operators are and, crucially, it does not require each single- and two-qubit (a) Hamiltonian of 4 linear-equidistant hydrogen atoms (6 qubits), errors to be themselves purely depolarising. (b) Single Z Pauli term (8 qubits), (c) TFIM Hamiltonian (6 qubits) and (d) Single XY ZIXX string (6 qubits). These operators are chosen to span local/global Pauli strings, and contain single/many DISCUSSION non-commuting decompositions. In this work we have proposed an error mitigation technique which is simple to implement but which retains the mathe- sufficient. This should be true generally for quantum circuits matical rigour of more complicated techniques. Our proto- which can be tuned to be close to the identity. However, this col is directly applicable to any quantum simulation whose scalable simplification for obtaining ptot potentially faces the measurements are basic expectation values, as exemplified by problem that it includes correlated errors which makes the ran- our results for the time evolution of the local magnetization domised measurements preferable as long as it is feasible. in spin chain dynamics. Though our error mitigation method The fact that our basic assumption of global depolarizing may not replace full state tomography, we expect it to be use- errors leads to such large improvements in results is in itself ful in measuring more complicated quantities beyond simple remarkable. The basic conclusion is that the total error on the observables, as corroborated by our results for the entangle- IBM device is close to a global depolarizing error at least for ment entropy. An interesting avenue of future research will our choice of problems. We note our error ansatz, Eq. (4) is be an application to variational quantum eigensolver (VQE) an approximation even for single qubit depolarizing errors, let problems [66] and other quantum circuits. alone the more complex channels that are no-doubt present in The computational overhead of implementing our protocol physical devices. Nevertheless the ansatz works remarkably is dominated by the cost of evaluating Tr ρ2 on the quan- well for correcting physical errors of large/deep circuits and tum device. The number of corresponding measurements is is a no-lose addition to quantum simulation protocols. We Nu Nm where Nu is the number of randomised unitaries and suggest this is because the depolarizing channel is a good ap- Nm is the number of random measurements. Within the ran- proximation per gate in a many gate quantum circuit, even if domised scheme of Ref. [39] it grows exponentially with the this approximation breaks down for single layers of gates. Of system size [40], which poses a potential problem for large course, our mitigation falls short of accounting for large co- quantum computers but is easily feasible for currently avail- herent or correlated errors and it will be a worthwhile endeav- able NISQ devices. Alas, in some cases the costly random- our to think about a controlled extension of our basic ansatz ized measurement scheme can be avoided entirely, i.e. in our Eq. (4) for the density matrix of a NISQ device. In particular, benchmark example of spin chain dynamics a single measure- whether we can extend it to incorporate other aspects of error ment of a local observable whose exact value is known was channels [33].
7 As an application of our error mitigation we have presented (2019). previously unobtainable quantitative results for confinement [17] S. McArdle, X. Yuan, and S. Benjamin, Physical review letters and entanglement dynamics of a quantum spin chain. We have 122, 180501 (2019). been able to extract the first meson masses of confinement in- [18] E. F. Dumitrescu, A. J. McCaskey, G. Hagen, G. R. Jansen, T. D. Morris, T. Papenbrock, R. C. Pooser, D. J. Dean, and duced bound states and observed the corresponding halting of P. Lougovski, Physical review letters 120, 210501 (2018). entanglement spreading. In that context, an ambitious next [19] J. R. McClean, M. E. Kimchi-Schwartz, J. Carter, and W. A. step would be to use the error mitigation to extend times that De Jong, Physical Review A 95, 042308 (2017). can be simulated on a quantum computer to probe the confine- [20] J. I. Colless, V. V. Ramasesh, D. Dahlen, M. S. Blok, ment induced slow-thermalisation [48] or to see meson scat- M. Kimchi-Schwartz, J. McClean, J. Carter, W. De Jong, and tering events that have recently been predicted [67–69]. I. Siddiqi, Physical Review X 8, 011021 (2018). [21] A. Kandala, K. Temme, A. D. Córcoles, A. Mezzacapo, J. M. In general, we expect that our simple error mitigation brings Chow, and J. M. Gambetta, Nature 567, 491 (2019). us closer to exploiting the quantum advantage of available [22] M. Otten and S. K. Gray, npj Quantum Information 5, 1 (2019). NISQ devices for real world practical applications. [23] M. Otten and S. K. Gray, Physical Review A 99, 012338 (2019). Acknowledgements.— We are grateful for discussions with [24] A. Smith, M. Kim, F. Pollmann, and J. Knolle, npj Quantum Information 5, 1 (2019). Hongzheng Zhao, Adam Smith and Peter Haynes. We ac- [25] A. Strikis, D. Qin, Y. Chen, S. C. Benjamin, and Y. Li, arXiv knowledge the Samsung Advanced Institute of Technology preprint arXiv:2005.07601 (2020). Global Research Partnership, travel support via the Imperial- [26] A. Zlokapa and A. Gheorghiu, arXiv preprint arXiv:2005.10811 TUM flagship partnership and the use of IBM Quantum ser- (2020). vices for this work. The views expressed are those of the au- [27] P. Czarnik, A. Arrasmith, P. J. Coles, and L. Cincio, arXiv thors, and do not reflect the official policy or position of IBM preprint arXiv:2005.10189 (2020). or the IBM Quantum team. This work is also supported by the [28] Y. Li and S. C. Benjamin, Physical Review X 7, 021050 (2017). [29] S. Endo, S. C. Benjamin, and Y. Li, Physical Review X 8, UK Hub in Quantum Computing and Simulation, part of the 031027 (2018). UK National Quantum Technologies Programme with fund- [30] K. Temme, S. Bravyi, and J. M. Gambetta, Physical review ing from UKRI EPSRC grant EP/T001062/1. letters 119, 180509 (2017). [31] E. Gustafson, Y. Meurice, and J. Unmuth-Yockey, Physical Re- view D 99, 094503 (2019). [32] T. Giurgica-Tiron, Y. Hindy, R. LaRose, A. Mari, and W. J. Zeng, arXiv preprint arXiv:2005.10921 (2020). [1] J. Preskill, Quantum 2, 79 (2018). [33] Z. Cai, arXiv preprint arXiv:2007.01265 (2020). [2] S. Boixo, S. V. Isakov, V. N. Smelyanskiy, R. Babbush, N. Ding, [34] A. He, B. Nachman, W. A. de Jong, and C. W. Bauer, arXiv Z. Jiang, M. J. Bremner, J. M. Martinis, and H. Neven, Nature preprint arXiv:2003.04941 (2020). Physics 14, 595 (2018). [35] J. Vovrosh and J. Knolle, arXiv:2001.03044 (2020). [3] F. Arute, K. Arya, R. Babbush, D. Bacon, J. C. Bardin, [36] M. A. Nielsen and I. Chuang, “Quantum computation and quan- R. Barends, R. Biswas, S. Boixo, F. G. Brandao, D. A. Buell, tum information,” (2002). et al., Nature 574, 505 (2019). [37] B. Vermersch, A. Elben, M. Dalmonte, J. I. Cirac, and P. Zoller, [4] A. M. Steane, Physical Review Letters 77, 793 (1996). Physical Review A 97, 023604 (2018). [5] P. W. Shor, Physical review A 52, R2493 (1995). [38] S. van Enk and C. Beenakker, Physical review letters 108, [6] “ibmq paris v1.6.23, ibm quantum team. retrieved from 110503 (2012). https://quantum-computing.ibm.com, (2020),”. [39] A. Elben, B. Vermersch, M. Dalmonte, J. I. Cirac, and P. Zoller, [7] “Qiskit: An open-source framework for quantum computing,” Physical review letters 120, 050406 (2018). (2019). [40] T. Brydges, A. Elben, P. Jurcevic, B. Vermersch, C. Maier, B. P. [8] M. S. Jattana, F. Jin, H. De Raedt, and K. Michielsen, Quantum Lanyon, P. Zoller, R. Blatt, and C. F. Roos, Science 364, 260 Information Processing 19, 1 (2020). (2019). [9] F. B. Maciejewski, Z. Zimborás, and M. Oszmaniec, Quantum [41] A. Elben, B. Vermersch, R. van Bijnen, C. Kokail, T. Brydges, 4, 257 (2020). C. Maier, M. K. Joshi, R. Blatt, C. F. Roos, and P. Zoller, Phys- [10] S. Bravyi, S. Sheldon, A. Kandala, D. C. Mckay, and J. M. ical Review Letters 124, 010504 (2020). Gambetta, arXiv preprint arXiv:2006.14044 (2020). [42] C. H. Bennett, D. P. DiVincenzo, J. A. Smolin, and W. K. Woot- [11] Y. Chen, M. Farahzad, S. Yoo, and T.-C. Wei, Physical Review ters, Physical Review A 54, 3824 (1996). A 100, 052315 (2019). [43] C. H. Bennett, G. Brassard, S. Popescu, B. Schumacher, J. A. [12] J. R. McClean, Z. Jiang, N. C. Rubin, R. Babbush, and Smolin, and W. K. Wootters, Physical review letters 76, 722 H. Neven, Nature Communications 11, 1 (2020). (1996). [13] X. Bonet-Monroig, R. Sagastizabal, M. Singh, and T. O’Brien, [44] E. Knill, D. Leibfried, R. Reichle, J. Britton, R. B. Blakestad, Physical Review A 98, 062339 (2018). J. D. Jost, C. Langer, R. Ozeri, S. Seidelin, and D. J. Wineland, [14] R. Sagastizabal, X. Bonet-Monroig, M. Singh, M. A. Rol, Physical Review A 77, 012307 (2008). C. Bultink, X. Fu, C. Price, V. Ostroukh, N. Muthusubrama- [45] J. Emerson, M. Silva, O. Moussa, C. Ryan, M. Laforest, nian, A. Bruno, et al., Physical Review A 100, 010302 (2019). J. Baugh, D. G. Cory, and R. Laflamme, Science 317, 1893 [15] M. C. Tran, Y. Su, D. Carney, and J. M. Taylor, arXiv preprint (2007). arXiv:2006.16248 (2020). [46] As errors on the quantum devices can vary over time (see Ref. [16] W. J. Huggins, J. McClean, N. Rubin, Z. Jiang, N. Wiebe, K. B. [24]), the determination of ptot has to be recalibrated accord- Whaley, and R. Babbush, arXiv preprint arXiv:1907.13117 ingly.
8 [47] M. Kormos, M. Collura, G. Takács, and P. Calabrese, Nature SUPPLEMENTARY INFORMATION Physics 13, 246 (2017). [48] F. Liu, R. Lundgren, P. Titum, G. Pagano, J. Zhang, C. Monroe, Justification of a global depolarizing Error Model and A. V. Gorshkov, arXiv preprint arXiv:1810.02365 (2018). [49] C. J. Turner, A. A. Michailidis, D. A. Abanin, M. Serbyn, and Z. Papić, Nature Physics 14, 745 (2018). Here we justify why such a simple global depolarizing error [50] S. Moudgalya, S. Rachel, B. A. Bernevig, and N. Regnault, model is applicable. For simplicity we consider single qubit Physical Review B 98, 235155 (2018). gate errors, modelled as depolarizing errors, before moving [51] W. W. Ho, S. Choi, H. Pichler, and M. D. Lukin, Physical re- onto multiqubit gates. The error channel for an operator act- view letters 122, 040603 (2019). ing only on qubit i is, Ei [ρ] = (1 − p)Ui ρUi† + pTri [ρ] ⊗ Ii /2 [52] D. K. Mark, C.-J. Lin, and O. I. Motrunich, Physical Review B 101, 195131 (2020). where Ui acts locally on qubit i. That is with probability [53] N. Shibata, N. Yoshioka, and H. Katsura, Physical Review Let- 1 − p the gate was successfully applied, and with probabil- ters 124, 180604 (2020). ity p the qubit suffered a completely depolarizing error, and [54] R. M. Nandkishore and M. Hermele, Annual Review of Con- ends up in the maximally mixed state I/2 = diag[1/2, 1/2], densed Matter Physics 10, 295 (2019). thereby destroying quantum correlations between qubit i and [55] S. Pai and M. Pretko, Physical Review Research 2, 013094 the reset of the state. This error model does not commute (2020). with entangling gates (e.g. consider the reduced state Tri [ρ] [56] P. Sala, T. Rakovszky, R. Verresen, M. Knap, and F. Pollmann, Physical Review X 10, 011047 (2020). depends on the entanglement between qubit i and the rest [57] A. Cervera-Lierta, Quantum 2, 114 (2018). of the state). While applying single gates to multiple differ- [58] A. Zhukov, S. Remizov, W. Pogosov, and Y. E. Lozovik, Quan- ent qubits commute, the resulting expression quickly becomes tum Information Processing 17, 223 (2018). difficult to track, Ei ◦ Ej [ρ] = (1 − pi )(1 − pj )Ui Uj ρUj† Ui† + [59] A. Francis, J. Freericks, and A. Kemper, arXiv preprint pi (1−pj )Tri [Uj ρUj† ]⊗Ii /2+pj (1−pi )Trj [Ui ρUi† ]⊗Ij /2+ arXiv:1909.05701 (2019). [60] A. Smith, B. Jobst, A. G. Green, and F. Pollmann, arXiv pi pj Trij [ρ] ⊗ Ii /2 ⊗ Ij /2. preprint arXiv:1910.05351 (2019). Our first step is to approximate a layer of single qubit gates [61] A. J. James, R. M. Konik, and N. J. Robinson, Physical review [70] as letters 122, 130603 (2019). [62] W. Tan, P. Becker, F. Liu, G. Pagano, K. Collins, A. De, L. Feng, Elayer = (1 − p0 )U ρU † + p0 I/2n , (11) H. Kaplan, A. Kyprianidis, R. Lundgren, et al., Nature Physics for n qubits. We don’t set (1 − p0 ) 6= Q , 1 (2021). i (1 − pi ), but [63] “ibmq toronto v1.1.6, ibm quantum team. retrieved from rather introduce an effective p0 to account for components in https://quantum-computing.ibm.com, (2020),”. the total sum that have some reasonable overlap with U ρU † . [64] M. Fagotti and P. Calabrese, Physical Review A 78, 010306 E.g. for each j, the state Trj [U ρU † ] ⊗ Ij /2 may have a (2008). large overlap with U ρU † , (and to a lesser extent Trij for [65] “ibmq santiago v1.3.19, ibm quantum team. retrieved from each i, j), and we’d like to include such contributions with- https://quantum-computing.ibm.com, (2021),”. out worrying about combinatorial multiplicity, or the state- [66] A. Kandala, A. Mezzacapo, K. Temme, M. Takita, M. Brink, J. M. Chow, and J. M. Gambetta, Nature 549, 242 (2017). dependent details of all the partial traces. We can follow the [67] F. M. Surace and A. Lerose, arXiv:2011.10583 (2020). same steps to arrive at Eq. (11) for entangling gates Uij once [68] P. I. Karpov, G.-Y. Zhu, M. P. Heller, and M. Heyl1, we note that the 2-qubit depolarizing error channel be writ- † arXiv:2011.11624 (2020). ten as Eij [ρ] = (1 − p)Uij ρUij + pTrij [ρ] ⊗ Iij /4, where [69] A. Milsted, J. Liu, J. Preskill, and G. Vidal, “Collisions of Iij /4 = diag[1, 1, 1, 1]/4 is the maximally mixed two qubit false-vacuum bubble walls in a quantum spin chain,” (2020), state. arXiv:2012.07243 [quant-ph]. We can construct the quantum channel for the full circuit. [70] Assuming many qubits, but not necessarily all qubits have gates applied. The non-unitary part of Eq. (11) commutes with itself over [71] J. R. McClean, N. C. Rubin, K. J. Sung, I. D. Kivlichan, different layers meaning the a full circuit error channel is ap- X. Bonet-Monroig, Y. Cao, C. Dai, E. S. Fried, C. Gidney, proximated as B. Gimby, P. Gokhale, T. Häner, T. Hardikar, V. Havlı́ček, † O. Higgott, C. Huang, J. Izaac, Z. Jiang, X. Liu, S. McArdle, Ecirc [ρ] = (1 − ptot )Ucirc ρUcirc + ptot I2⊗n /2n (12) M. Neeley, T. O’Brien, B. O’Gorman, I. Ozfidan, M. D. Radin, where (1 − ptot ) = l (1 − p0l ) (product over effective layer Q J. Romero, N. P. D. Sawaya, B. Senjean, K. Setia, S. Sim, D. S. Steiger, M. Steudtner, Q. Sun, W. Sun, D. Wang, F. Zhang, and p0 ’s) holds dueQ to commuting non-unitary terms. While this is R. Babbush, Quantum Sci. Technol. 5, 034014 (2020). not equal to g (1−pg ) (product over gate error probabilities), [72] S. Bravyi, J. M. Gambetta, A. Mezzacapo, and K. Temme, the latter is a good zeroth-order estimate of this term. arXiv (2017), 1701.08213. Finally we note the total map Eq. (12) unravels to an ef- fective error model for single (i.e. one or two qubit) gates, given by: Ei [ρ] = (1 − p00i )Ui ρUi† + p00i I2⊗n /2n . Note the all-or-nothing description of the map: either the gate is imple- mented perfectly, or the entire quantum state is destroyed, not just part of the state as is the case for Tri [ρ] ⊗ Ii /2 etc. While
9 this seems quite an unphysical model for a single gate, it is a to particle spin) are removed, reducing the number of qubits good model of the effective gate error per gate in a multi-gate by two. For the TFIM, the Hamiltonian was reduced to two circuit. non-commuting Pauli strings from which the total Hamilto- nian can be evaluated. Finally the single Z, and XY ZIXX (chosen randomly) operators were measured trivially. We do Cosine Fitting not directly account for State-Preparation and Measurement (SPAM) errors, but we note that we can (and do) include In order to extract the frequencies from the local magneti- SPAM errors as contributing to ptot in our channel ansatz. zation results presented in Fig. 2 and Fig. 3, a function that We numerically simulate circuits consisting of layers of pa- contained a singular cosine was used, namely, rameterized u3 gates, generic single-qubit rotation gates with 3 Euler angles, followed by an entangling layer. The circuit hσ z (t)i ∼ Ae−dt cos(ωt) + c1 t + c2 . (13) depth is defined by the number of entangling (CNOT) layers. There is a final u3 layer before measurement. The parameter- Here, A is the amplitude of the oscillations, D allows for de- ized circuit now creates a state ρ(θ), and varying θ changes the cay of the amplitude, ω is the oscillation frequency and c1 and state and corresponding operator expectation values. For the c2 give a time dependent shift. While this function can not numerical simulations we consider how our mitigation per- capture long term behaviour of hσ z i, it is able to extract out forms for different depth circuits, and for the four tasks shown the dominant oscillations for short time dynamics. in the main text. For each depth the calibration ptot must be done independently, as ptot is an effective parameter for the whole quantum channel and cannot simple be calculated a pri- Statistical Errors Introduced by the Error Mitigation ori for different depths. Calibration.— Two independent methods of calibrating The error when calculating ptot from Tr ρ2 is given by ptot are considered. Firstly, via single expectation value: di- rect simulation of a circuit with fixed parameters, which is δ Tr ρ2 then compared to the error free expectation value to find ptot . δptot = ±(1 − ptot ) , (14) 2(2n − Tr(ρ2 )) Secondly, via trace: from directly measuring Tr[ρ2 ] and esti- mating ptot via Eq. (9). where δptot is the errorin ptot and δ Tr ρ2 is the error in the For single expectation calibration, a single fixed parameter measurement of Tr ρ2 . Note that δ Tr ρ2 can be calculated circuit was used to find hOi (for each operator O considered). by resampling e.g. Jackknife or Bootstrap [35, 40]. Thus, This result was compared to the error free simulation of the when using the proposed error mitigation on an observable Ô exact same circuit to estimate ptot . To evaluate these expecta- this results additional uncertainty on top of the measurement tion values 40960 shots were used for each non-commuting errors. More precisely, term in the decomposition of the given operator into Pauli ! strings. For trace mitigation, an extra 500 circuits are used −n to estimate ptot using 8196 shots per circuit. This was broken δmeas hÔi δptot hÔi − 2 Tr Ô δhÔi = ± + , (15) up into five lots of 100 circuits to estimate ptot at five differ- 1 − ptot (1 − ptot )2 ent (random) circuit parameterizations. For each parameteri- zation, the method of Ref. [39] is used to find five independent where δhÔi is the error in the mitigated value of hÔi and estimates of ptot , which are then averaged to reduce the bias δmeas hÔi is the error in the measurement of hÔi. Note, as the in the estimate of ptot for the circuit. local magnetisation results presented in this paper bypassed Mitigation.— Once ptot is estimated either from the single the measurement of T r(ρ2 ), the error contributing to δptot is value or trace measurement, it is used to mitigate further simu- just shot noise. Thus, δhσiz i is of the on the order ∼ O(10−3 ) lated expectation values. New expectation values are obtained and is not displayed in Fig. 2 or Fig. 3. by varying the parameters in the circuit, while keeping the cir- cuit structure fixed. For each depth and each task, thirty dif- ferent (parameterized) circuits are simulated, and mitigation Numerical simulations applied. Figure 5 shows the mitigation results for different cir- cuit depths. While the full trace-estimation is robust showing All numerical simulations are performed using qiskit [7] a significant improvement in every case, the single expectation with the santaiago error model (as of 12t h March 2021). value mitigation is more variable. This is due to the vanishing For the hydrogen Hamiltonian, openfermion [71] was used expectation value of Pauli strings for random circuits, which to compute the orbital integrals (for a row of four H-atoms in turn increases the effect of shot noise when inverting Eq. 8. with 0.1 nm inter-atomic distance), and then map the prob- We note this shot noise problem is absent when calibrating a lem to qubits using the symmetry conserving Bravyi-Kitaev Pauli string (or single operator) with a (noise free) unit expec- transformation [72], finally the Z2 symmetries (corresponding tation value, as in the main text.
10 FIG. 5. Complete mitigation simulations with non-depolarizing errors. Scaling of our mitigation schemes (circles) with circuit depth for four different operator expectation values. Single operator (red circles) show some improvement, while trace mitigated (green circles) show a robust improvement over the unmitigated results. As in the main text, numerical simulation uses the ibmq santiago error model, and circuits are layers u3 and CNOT, with the depth equal to the number of entangling layers. Note there are some combinations (e.g. TFIM at depth 14 etc) where the single expectation value mitigation found ptot = 1, and therefore did not improve results. Likewise there are other combinations (e.g. TFIM depth 22) where the single expectation value mitigation is omitted because it gave an unphysical estimate of ptot < 0. Inset numbers show ptot as estimated for the trace estimation.
You can also read