This note is part of [[quantum/Practical Quantum Information System]].
> [!info] Course code
> Use the companion repository for this lecture's runnable lab, helper functions, and regression checks:
> - [notebooks/by_concept/probability_theory_and_quantum_mechanics.ipynb](https://github.com/montekkundan/quantum-code/blob/main/notebooks/by_concept/probability_theory_and_quantum_mechanics.ipynb)
> - [qcourse/states.py](https://github.com/montekkundan/quantum-code/blob/main/qcourse/states.py)
> - [tests/test_states_density_entanglement.py](https://github.com/montekkundan/quantum-code/blob/main/tests/test_states_density_entanglement.py)
[TODO: add video - Probability Theory and Quantum Mechanics]
## What This Concept Is
Suppose a photon can reach a screen by two different paths. In ordinary probability theory, you would expect the total probability of arrival to be the sum of the probabilities of the two alternatives. Quantum mechanics does something stranger: it adds complex [[quantum/Glossary#Amplitude|amplitudes]] first and only then turns them into probabilities through the [[quantum/Glossary#Born rule|Born rule]].
That is the first conceptual shock of the subject. Quantum theory is not just classical probability with extra mystery layered on top. It changes the basic way alternatives combine. Some paths reinforce each other. Some paths cancel. That cancellation is [[quantum/Glossary#Interference|interference]], and it is one of the reasons quantum algorithms can behave so differently from classical ones.
## Foundation Terms You Need First
An [[quantum/Glossary#Amplitude|amplitude]] is a complex number attached to a possible outcome before measurement. The [[quantum/Glossary#Born rule|Born rule]] says the probability of an outcome is the squared magnitude of its amplitude. [[quantum/Glossary#Interference|Interference]] means different amplitude contributions can combine constructively or destructively.
The important ordering is: amplitudes combine first, probabilities come later. If you reverse that mental order, most beginner confusion in this lecture appears immediately.
## How The Idea Actually Works
The double-slit experiment is the cleanest first picture. With one slit open, you get one distribution on the screen. With the other slit open, you get another. With both slits open, you do not in general get the pointwise sum of those two distributions. Some regions become brighter than either single-slit contribution would suggest, and some become darker or even vanish.
Mathematically, if two paths contribute amplitudes $\alpha_1$ and $\alpha_2$, then the combined amplitude is $\alpha_1 + \alpha_2$. The probability is:
$
P = |\alpha_1 + \alpha_2|^2
$
When you expand that quantity, you get cross terms. Those cross terms are the interference terms. They are the algebraic reason the quantum story departs from classical addition of alternatives.
This also explains why which-path measurement matters. If information leaks out about which path the system took, then the coherent alternatives stop behaving like one quantum process and start behaving more like a classical mixture of alternatives. The interference pattern fades because the phase relationship between the alternatives is no longer preserved in a useful way.
That is why the early quantum story is not "everything is random." Classical probability already had randomness. The real shift is that quantum theory uses a richer mathematical object than probability itself, and the observable probabilities are extracted only at the end.
## Why It Matters
- It is the first concrete reason quantum systems do not behave like classical random systems.
- It prepares you for phase, unitary evolution, and gate-based interference.
- It gives the intuition behind many later algorithms where amplitudes are steered rather than outcomes directly chosen.
## Related Questions
> [!question] Why is quantum probability not just ordinary probability with complex notation?
>> [!answer] Ordinary alternatives combine by adding probabilities. Quantum alternatives combine by adding amplitudes first and then squaring magnitudes. That extra order of operations creates interference terms, which can raise or cancel outcome probabilities in ways classical probability cannot.
> [!question] What does which-path information do to an interference experiment?
>> [!answer] It turns coherent alternatives into distinguishable alternatives. Once the environment or a measuring device carries information about which path occurred, the alternatives no longer interfere in the same way, so the interference pattern fades into mixture-like behavior.
> [!question] What is the practical one-qubit version of the double-slit lesson?
>> [!answer] A Hadamard can split amplitude between two computational paths, and a later Hadamard can recombine those paths. Changing the relative phase before recombination changes the final measurement probabilities, just like changing path phases in an interference experiment.
## Study Checks
Use these after the explanation, not before it.
### Quick Checks
- MCQ: Which statement best describes a quantum amplitude? A. A nonnegative frequency B. A complex number whose squared magnitude gives probability C. A hidden classical bit D. A hardware noise parameter **Answer:** B. A complex number whose squared magnitude gives probability.
- MCQ: Which feature lets quantum algorithms make bad computational paths cancel? A. Decoherence B. Interference C. Thermalization D. Classical majority vote **Answer:** B. Interference.
- What is the Born rule for a normalized state $|\psi\rangle=\sum_x \alpha_x|x\rangle$? **Answer:** The probability of measuring outcome x is |alpha_x|^2.
### Oral Exam Anchors
> [!question] Oral exam anchor
> Explain the difference between amplitudes and probabilities, and why interference makes quantum algorithms possible.
A good answer should say:
Probabilities are nonnegative real numbers that add directly when alternatives are mutually exclusive. Quantum amplitudes are complex numbers that combine before measurement. The Born rule converts amplitudes into probabilities only after the relevant amplitudes have been added and squared in magnitude.
This is why interference is central to quantum computation. A quantum algorithm does not merely explore many branches and then read all of them. It arranges phases and unitary transformations so that amplitudes for wrong answers cancel while amplitudes for useful answers reinforce. The computational advantage comes from controlling this interference pattern, not from extracting an exponential amount of classical information from the statevector.
## Practical Lab
Run the double-slit intuition as a one-qubit interference experiment so the amplitude story becomes something you can observe.
- Build a small circuit where two computational paths interfere before the final measurement.
- Insert an intermediate measurement or dephasing step and compare the output counts against the coherent version.
- Record one case of constructive interference and one case of destructive interference, and explain which gate choices created them.
## Homework
Use the homework to connect the lab output back to the mathematics.
- Derive a simple probability from amplitudes using the Born rule.
- Explain why quantum probabilities are not obtained by adding path probabilities directly.
- Describe what physically changes when which-path information leaks into the environment.
## References
- Scott Aaronson, [Introduction to Quantum Information Science](https://www.scottaaronson.com/qclec.pdf), Lecture 2.
- Thomas Wong, [Introduction to Quantum and Classical Computing](http://www.thomaswong.net/).
- IBM Quantum Learning, [quantum circuits basics](https://quantum.cloud.ibm.com/learning/en).
- Qiskit documentation, [main docs](https://qiskit.org/documentation/).