research faculty summit 2018 - microsoft.com · [1] m. besta, th: slim fly: a cost effective...

20
Systems | Fueling future disruptions Research Faculty Summit 2018

Upload: others

Post on 30-May-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Research Faculty Summit 2018 - microsoft.com · [1] M. Besta, TH: Slim Fly: A Cost Effective Low-Diameter Network Topology, IEEE/ACM SC14, best student paper. Climate Sim. Deep Learning

Systems | Fueling future disruptions

ResearchFaculty Summit 2018

Page 2: Research Faculty Summit 2018 - microsoft.com · [1] M. Besta, TH: Slim Fly: A Cost Effective Low-Diameter Network Topology, IEEE/ACM SC14, best student paper. Climate Sim. Deep Learning

An HPC Systems Guy’s View of Quantum Computing

Torsten HoeflerETH Zurich, Switzerland (Professor)Microsoft Quantum, Redmond (Visiting Researcher)

Page 3: Research Faculty Summit 2018 - microsoft.com · [1] M. Besta, TH: Slim Fly: A Cost Effective Low-Diameter Network Topology, IEEE/ACM SC14, best student paper. Climate Sim. Deep Learning

Who is this guy and what is he doing here?

[1] M. Besta, TH: Slim Fly: A Cost Effective Low-Diameter Network Topology, IEEE/ACM SC14, best student paper

Climate Sim Deep Learning

1 professor, 6 scientific staff, 13 PhD students 6.5k staff, 20k students, focus on research

[1]

DAPPVector

Page 4: Research Faculty Summit 2018 - microsoft.com · [1] M. Besta, TH: Slim Fly: A Cost Effective Low-Diameter Network Topology, IEEE/ACM SC14, best student paper. Climate Sim. Deep Learning

What is a qubit and how do I get one?

One qubit can include a lot of information in 𝛼𝛼0 and 𝛼𝛼1but can only sample one bit while losing all

(encoding n bits takes Ω 𝑛𝑛 operations)

𝑛𝑛 qubits live in a vector space of 2𝑛𝑛complex numbers (all combinations + entanglement)

Ψn = �𝑖𝑖=0..2𝑛𝑛−1

𝛼𝛼𝑖𝑖|𝑖𝑖⟩ e.g., Ψ2 = 𝛼𝛼0 00 + 𝛼𝛼1 01 + 𝛼𝛼2 10 + 𝛼𝛼3|11⟩

“I don't like it, and I'm sorry I ever had anything to do with it.”Schrödinger (about the probability interpretation of quantum mechanics)

0 1= 10 = 0

1

01

10

Ψ = 𝛼𝛼010 + 𝛼𝛼1

01

𝛼𝛼0 2 + 𝛼𝛼1 2 = 1|Ψ⟩

= |1⟩

= |0⟩Ψ = 𝛼𝛼0 0 + 𝛼𝛼1|1⟩

For example: + = 12

0 + 12

|1⟩

|+⟩

Page 5: Research Faculty Summit 2018 - microsoft.com · [1] M. Besta, TH: Slim Fly: A Cost Effective Low-Diameter Network Topology, IEEE/ACM SC14, best student paper. Climate Sim. Deep Learning

Example: adding 2𝑛𝑛numbers in O(log 𝑛𝑛) time

+logarithmic

depth

linearwork

𝑎𝑎0

𝑎𝑎𝑛𝑛−1

𝑏𝑏0

𝑏𝑏𝑛𝑛−1

……

… 𝑠𝑠0

𝑠𝑠𝑛𝑛−1

𝑐𝑐

Reminder: Classical Adder

0

0

0

00

1

1

1 1

1

00

0

Quantum Adder

…|0⟩

|0⟩

…|0⟩

|0⟩𝑎𝑎[𝑛𝑛]

𝑏𝑏[𝑛𝑛]

…|0⟩

|0⟩2𝑛𝑛

|0⟩

+logarithmic

depth

linearwork

H

H

H

H

𝑎𝑎 =12𝑛𝑛

�𝑖𝑖=0..2𝑛𝑛−1

|𝑖𝑖⟩

𝑏𝑏 =12𝑛𝑛

�𝑖𝑖=0..2𝑛𝑛−1

|𝑖𝑖⟩

|Ψ𝑎𝑎+𝑏𝑏⟩

| ⋅⟩

| ⋅⟩

|0⟩

|0⟩

H

H𝑎𝑎 =

14

(|00⟩ + 01 + 10 + |11⟩)

|0⟩

|0⟩

H

H𝑏𝑏 =

14

(|00⟩ + 01 + 10 + |11⟩)

……

final adder state(entangled “probability distribution”)

print(a, a+b)(“measure”)

𝑎𝑎

𝑏𝑏

𝑎𝑎

𝑎𝑎 + 𝑏𝑏

𝑐𝑐

1

0

1

0

1

1

1

0

0

0

0

1

1

0

1

1

0

0

1

1

1

0

1

1

1

1

1

1

0

1

0

1

1

1

0

0

Page 6: Research Faculty Summit 2018 - microsoft.com · [1] M. Besta, TH: Slim Fly: A Cost Effective Low-Diameter Network Topology, IEEE/ACM SC14, best student paper. Climate Sim. Deep Learning

We add all 2𝑛𝑛numbers in parallel but only recover 𝑛𝑛 classical bits!

A Corollary to Holevo’s Theorem (1973): at most 𝒏𝒏 classical bits can be extracted from a quantum state with 𝒏𝒏 qubits even though that system requires 2𝑛𝑛 − 1 complex numbers to be represented!

My corollary: practical quantum algorithms read a linear-size input andmodify an exponential-size quantum state such that the correct (polynomialsize) output is likely to be measured.

Question: Are quantum algorithms good at solving problems where a solution is verifiable efficiently (polynomial time)? Answer: Kind of

Page 7: Research Faculty Summit 2018 - microsoft.com · [1] M. Besta, TH: Slim Fly: A Cost Effective Low-Diameter Network Topology, IEEE/ACM SC14, best student paper. Climate Sim. Deep Learning

• Even quantum computers may not solve NP-hard problems (limited by linearity of operators). But since quantum is at least as powerful as classic, we do not know!

• New complexity class: Bounded-error Quantum Polynomial time (BQP)

PSPACE

BQP

P

So quantum computers can solve NP-complete problems!?A problem is in NP if a solution can be verified deterministically in polynomial time.

NP

NP-complete

factoring, discrete logarithmNPI – e.g., graph isomorphism

?

Page 8: Research Faculty Summit 2018 - microsoft.com · [1] M. Besta, TH: Slim Fly: A Cost Effective Low-Diameter Network Topology, IEEE/ACM SC14, best student paper. Climate Sim. Deep Learning

Q error correction

Gate synthesis

Physical control

Logical layer

Hardware and software architecture for quantum computingqubits bits

Quantum circuits

Instruction stream + data

QEC Codes (Steane etc., 1:n mapping)

Control for QEC (varies with code)

Build Gates(T, Rotation,

multi-control, …)

Factory control and

qubit routing

Physical quantum

state

Analog pulse generators

abstractionQ# programming

languageQ intermediaterepresentation

Microcodedinstructions

Microcodedinstructions

Qubit control pulses

300K>100kW

77K(liquid

nitrogen)<1kW

4K(liquid helium)<10W

<0.1K(dilution

refrigeration)<0.01W

Classical Logic

qubits

SW

MW

MW

HW

Page 9: Research Faculty Summit 2018 - microsoft.com · [1] M. Besta, TH: Slim Fly: A Cost Effective Low-Diameter Network Topology, IEEE/ACM SC14, best student paper. Climate Sim. Deep Learning

Full Example: Grover’s search

• Task: find 𝑥𝑥 ∈ 𝐷𝐷 for which 𝑓𝑓 𝑥𝑥 = 𝑦𝑦(invert 𝑓𝑓(𝑥𝑥))• Classical requires 𝑂𝑂( 𝐷𝐷 ) queries • Quantum requires 𝑂𝑂 𝐷𝐷 queries

Q# code𝑓𝑓(𝑥𝑥)𝐷𝐷 → 𝐶𝐶↦

allocate ⌈log2 𝐷𝐷 ⌉qubits

12𝑛𝑛

0

|0 … 00⟩ |0 … 01⟩ |0 … 10⟩ |1 … 11⟩|1 … 10⟩⋯ ⋯|𝑥𝑥⟩|0 … 11⟩

1

⋯ ⋯

average

⋯ ⋯⋯ ⋯

Page 10: Research Faculty Summit 2018 - microsoft.com · [1] M. Besta, TH: Slim Fly: A Cost Effective Low-Diameter Network Topology, IEEE/ACM SC14, best student paper. Climate Sim. Deep Learning

Quadratic speedup? Grover on a real machine

1. Query complexity model – how algorithms are developed• 𝑇𝑇 = 𝜋𝜋

42𝑛𝑛 queries ( 𝐷𝐷 = 2𝑛𝑛 - represented by 𝑛𝑛 bits)

2. Express (oracle and diffusion operator) as n-bit unitary• Assuming 𝑂𝑂 n-bit operations for oracle!• 𝑇𝑇 = 𝑂𝑂 𝜋𝜋

42𝑛𝑛 n-bit operations - 𝑇𝑇𝑡𝑡 = 𝜋𝜋

42𝑛𝑛

3. Decompose unitary into two-bit (+arbitrary rotation) gates• 𝑇𝑇 = 𝑂𝑂2

𝜋𝜋4

2𝑛𝑛 ⋅ 2(𝑛𝑛 − 1) elementary operations - 𝑇𝑇𝑡𝑡 = 𝜋𝜋4

2𝑛𝑛 ⋅ 4(𝑛𝑛 − 1)4. Design approximate implementations in discrete gate set (using HTHT...)

• 𝑇𝑇 = 𝑂𝑂�2𝜋𝜋4

2𝑛𝑛 ⋅ 2(𝑛𝑛 − 1) discrete T gate operations - 𝑇𝑇𝑡𝑡 = 𝜋𝜋4

2𝑛𝑛 ⋅ 48(𝑛𝑛 − 1)5. Mapping to real hardware (swaps and teleport)

• Not to simple to model, depends on oracle – potentially Θ 2𝑛𝑛 slowdown6. Quantum error correction

• Not so simple, depends on quality of physical bits and circuit depth, huge constant slowdown

Performance estimates must be understood to be believed (inspired by Donald Knuth’s “An algorithm must be seen to be believed”)

𝑓𝑓(𝑥𝑥)

Page 11: Research Faculty Summit 2018 - microsoft.com · [1] M. Besta, TH: Slim Fly: A Cost Effective Low-Diameter Network Topology, IEEE/ACM SC14, best student paper. Climate Sim. Deep Learning

Quadratic speedup? Grover on a real machine

1. Query complexity model – how algorithms are developed• 𝑇𝑇 = 𝜋𝜋

42𝑛𝑛 queries ( 𝐷𝐷 = 2𝑛𝑛 - represented by 𝑛𝑛 bits)

2. Express (oracle and diffusion operator) as n-bit unitary• Assuming 𝑂𝑂 n-bit operations for oracle!• 𝑇𝑇 = 𝑂𝑂 𝜋𝜋

42𝑛𝑛 n-bit operations - 𝑇𝑇𝑡𝑡 = 𝜋𝜋

42𝑛𝑛

3. Decompose unitary into two-bit (+arbitrary rotation) gates• 𝑇𝑇 = 𝑂𝑂2

𝜋𝜋4

2𝑛𝑛 ⋅ 2(𝑛𝑛 − 1) elementary operations - 𝑇𝑇𝑡𝑡 = 𝜋𝜋4

2𝑛𝑛 ⋅ 4(𝑛𝑛 − 1)4. Design approximate implementations in discrete gate set (using HTHT...)

• 𝑇𝑇 = 𝑂𝑂�2𝜋𝜋4

2𝑛𝑛 ⋅ 2(𝑛𝑛 − 1) discrete T gate operations - 𝑇𝑇𝑡𝑡 = 𝜋𝜋4

2𝑛𝑛 ⋅ 48(𝑛𝑛 − 1)5. Mapping to real hardware (swaps and teleport)

• Not to simple to model, depends on oracle – probably Θ 2𝑛𝑛 slowdown6. Quantum error correction

• Not so simple, depends on quality of physical bits and circuit depth, huge constant slowdown

Performance estimates must be understood to be believed (inspired by Donald Knuth’s “An algorithm must be seen to be believed”)

𝑓𝑓(𝑥𝑥)Quantum computer with logical error rates ≤ 10−24and gate times of 10−6s vs. classical at 1 teraop/s.

from Grassl et al.: “Applying Grover's algorithm to AES: quantum resource estimates”, arXiv:1512.04965

38 billion years seco

nds

search bits

1 year

100 years

age of the universe (13.772 billion years)

Page 12: Research Faculty Summit 2018 - microsoft.com · [1] M. Besta, TH: Slim Fly: A Cost Effective Low-Diameter Network Topology, IEEE/ACM SC14, best student paper. Climate Sim. Deep Learning

Real applications?

Original idea by Feynman – use quantum effects to evaluate quantum effects Design catalysts, exotic materials, …

Quantum Chemistry/Physics

Big hype – destructive impact – single-shot (but big) business case Not trivial (requires arithmetic) but possible

Breaking encryption & bitcoin

Your connection is not private

Quadratic speedup can be very powerful! Requires much more detailed resource analysis systems problem

Accelerating heuristical solvers

Feynman may argue: “quantum advantage” assumes that circuits cannot be simulated classically they represent very complex functions that could be of use in ML?

Quantum machine learning

Quantum

Page 13: Research Faculty Summit 2018 - microsoft.com · [1] M. Besta, TH: Slim Fly: A Cost Effective Low-Diameter Network Topology, IEEE/ACM SC14, best student paper. Climate Sim. Deep Learning

Thanks!

• Special thanks to Matthias Troyer and Doug Carmean

• Thanks to: Thomas Haener, Damian Steiger, Martin Roetteler, Nathan Wiebe, Mike Upton, Bettina Heim, Vadym Kliuchnikov, Jeongwan Haah, Dave Wecker, Krysta Svore

• And the whole MSFT Quantum / QuArC team!

All used images belong to the respective owners – source list in appendix

Quantum me on the rocky path to develop my

intuition for quantum computation

Page 14: Research Faculty Summit 2018 - microsoft.com · [1] M. Besta, TH: Slim Fly: A Cost Effective Low-Diameter Network Topology, IEEE/ACM SC14, best student paper. Climate Sim. Deep Learning

How does a quantum computer work?

Operations (“gates”) are applied to qubits in place!

As opposed to bits flowing through traditional computers!

Qubits are arranged on a (commonly 2D) substrate

Reuse big parts of process technology in microelectronics

Commonly limited to neighbor interactions between qubits

Limited range, may require swapping across chip

Operations (“gates”) have highly varying complexity

Some are literally free (classical tracking), some are very expensive

Qubits are error prone, need to be highly isolated (major challenge)

Quantum error correction enabled the dream of quantum computers

Quantum circuits use predication (no control flow)

Circuit view simplifies reasoning but requires classical envelope

Quantum systems are most naturally seen as accelerators

Work in close cooperation with a traditional control circuit

Page 15: Research Faculty Summit 2018 - microsoft.com · [1] M. Besta, TH: Slim Fly: A Cost Effective Low-Diameter Network Topology, IEEE/ACM SC14, best student paper. Climate Sim. Deep Learning

For the unexpected discussions

Backup

Page 16: Research Faculty Summit 2018 - microsoft.com · [1] M. Besta, TH: Slim Fly: A Cost Effective Low-Diameter Network Topology, IEEE/ACM SC14, best student paper. Climate Sim. Deep Learning

Physical qubit implementations• Photons

• Polarization encoding, number of photons (Fock state), arrival timing • Electrons

• Spin, charge (number), • Nuclei

• Spin through NMR• Josephson junctions/superconducting

• Charge (incl. transmon), flux, phase• Quantum dots

• Spin, electron localization• Majorana

• MZMs

Page 17: Research Faculty Summit 2018 - microsoft.com · [1] M. Besta, TH: Slim Fly: A Cost Effective Low-Diameter Network Topology, IEEE/ACM SC14, best student paper. Climate Sim. Deep Learning

Elements of a quantum computation

• Classical control flow• Execute outer loop and classical parts of programs (W=?, D=?)

• Gate application, Measurement, Initialization• Clifford gates (W=1, D=1), T gates (W=?, D=?), Measurement (W=?, D=?), Initialization (W=?, D=1)• Qubit control (depends on technology, may be complex as well)

• Error correction• Surface code (W=?, D=?, distribution?), Steane code (W=?, D=?, distribution)

• Input-specific recompilation• (inputs may change during execution due to measurement results, maybe precompile, check

algorithms)

Page 18: Research Faculty Summit 2018 - microsoft.com · [1] M. Besta, TH: Slim Fly: A Cost Effective Low-Diameter Network Topology, IEEE/ACM SC14, best student paper. Climate Sim. Deep Learning

Basic components for a universal quantum computer

• State preparation (usually |0>)• Gate application (a universal set is H, CNOT, pi/8 phase rotation)• Measurement (sufficient in standard basis |0>, |1>)• Wait (apply identity, to sync with operations on other qubits)

• All needs to be performed fault-tolerant!

Page 19: Research Faculty Summit 2018 - microsoft.com · [1] M. Besta, TH: Slim Fly: A Cost Effective Low-Diameter Network Topology, IEEE/ACM SC14, best student paper. Climate Sim. Deep Learning

Thank you!

Page 20: Research Faculty Summit 2018 - microsoft.com · [1] M. Besta, TH: Slim Fly: A Cost Effective Low-Diameter Network Topology, IEEE/ACM SC14, best student paper. Climate Sim. Deep Learning