0 Nomenclature and notation

0.0 Linear algebra and quantum mechanics

positive $A$ $\expval{A}{\psi} \geq 0$ $\ket\psi$ .
positive definite $A$ $\expval{A}{\psi} > 0$ $\ket\psi \neq 0$ .
The support of an operator is defined to be the vector space orthogonal to its kernel.
Hermitian operator: the vector space spanned by its eigenvectors has non-zero eigenvalues.
$U \rightarrow$ unitary operator or matrix.
$\ket0\rightarrow(1,0)$ $\ket1\rightarrow(0,1)$ .
$I, X, Y, Z$ $\sigma_0, \sigma_1, \sigma_2, \sigma_3$ respectively.

0.1 Information theory and probability

Probability distributionfinite set of real numbers $p_x$ $p_x \geq 0$ $\Sigma_xp_x = 1$ .
The relative entropypositive operator $A$ positive operator $B$ is defined by:

\begin{matrix} (1) & S (A ∥ B) \equiv tr (A \log A) - tr (A \log B) \end{matrix}

0.2 Frequently used gates & symbols

1 Introduction and overview

Each remaining section in the chapter gives a brief introduction to one or more fundamental concepts from the field: quantum bits, quantum computers, quantum gates and quantum circuits, quantum algorithms, experimental quantum information processing, quantum information and quantum communication.

1.1 Global perspectives

$\leftrightarrow$ whether it is possible to clone an unknown quantum state: no-cloning theorem (early 1980s), is one of the earliest results of quantum computation and quantum information.
Church–Turing thesis: if an algorithm can be performed on any piece of hardware (say, a modern personal computer), then there is an equivalent algorithm for a Universal Turing Machine which performs exactly the same task as the algorithm running on the personal computer.
Strong Church–Turing thesis: Any algorithmic process can be simulated efficiently using a Turing machine.
The first major challenge to the strong Church–Turing thesis arose in the mid 1970s, which is that it's possible to test whether an integer is prime or composite using a randomized algorithm. That is, the Solovay–Strassen test for primality used randomness as an essential part of the algorithm.
deterministic $\rightarrow$ computers with access to a random number generator would be able to efficiently perform computational tasks with no efficient solution on a conventional deterministic Turing machine.
Hence we have a modified version of Church-Turing Thesis: Any algorithmic process can be simulated efficiently using a probabilistic Turing machine.
It is comparatively difficult to come up with quantum algorithm, since ^①要先抑制自己已經根深蒂固的古典思維, ^②發明出新的演算法必須比當前已知所有解同樣問題的古典演算法還要快.
At the same time computer science was exploding in the 1940s, another revolution was taking place in our understanding of communication. The key step taken by Shannon was to mathematically define the concept of information.
Shannon’s noiseless channel coding theorem, quantifies the physical resources required to store the output from an information source. Shannon’s second fundamental theorem, the noisy channel coding theorem, quantifies how much information it is possible to reliably transmit through a noisy communications channel.
Shannon 的 noisy channel coding theorem 只給出在噪音通道下, 有效資訊傳輸量的上界, 但沒明確給出有哪個 error correcting code 可以達到這個上界.
$\xrightarrow{\text{subsumed by}}$ the stabilizer codes. The theory of quantum error-correcting codes was developed to protect quantum states against noise.
As for transmitting ordinary classical information using a quantum channel, we have superdense coding (transmit two classical bits of information, while only transmitting one quantum bit from sender to receiver).
No unifying theory of networked information theory exists for quantum channels. But it is of much consequence! One example may suffice its value: 兩個 zero-capacity 的古典通道併聯, 還是 zero-capacity; 但如果是兩個反向的量子通道併聯, 就竟然可以通訊!
Another field is quantum cryptography: for private key cryptosystem, there is QKD (quantum key distribution), which makes use of the property that eavesdropping changes the quantum state of the qubit; for public key cryptosystem, Shor's algorithm can break RSA, and Shor's quantum algorithm for solving the discrete logarithm problem can break other public key systems.

1.2 Quantum bits

In this section we introduce the properties of single and multiple qubits, comparing and contrasting their properties to those of classical bits.

$\ket0$ $\ket1$ ) orbiting a single atom.
$\ket0\rightarrow\ket1$ $\ket+$ state.
Geometric representation of a qubit: a qubit in superposition can be written as
$\begin{matrix} (2) & | ψ ⟩ = α | 0 ⟩ + β | 1 ⟩, whereas {| α |}^{2} + {| β |}^{2} = 1 \end{matrix}$
$\abs{\alpha}^2 + \abs{\beta}^2 = 1$ , we can rewrite the equation as
$\begin{matrix} (3) & | ψ ⟩ = e^{i γ} (\cos \frac{θ}{2} | 0 ⟩ + e^{i φ} \sin \frac{θ}{2} | 1 ⟩) \end{matrix}$
$\theta$ $\varphi$ $\gamma$ $e^{i\gamma}$ have no observable effects, we can essentially write the equation
$\begin{matrix} (4) & | ψ ⟩ = \cos \frac{θ}{2} | 0 ⟩ + e^{i φ} \sin \frac{θ}{2} | 1 ⟩ \end{matrix}$
$\theta$ $\varphi$ define a point on the unit three-dimensional sphere, which is called Bloch-sphere:
However, it must be kept in mind that this intuition is limited because there is no simple generalization of the Bloch sphere known for multiple qubits.
Multiple qubits: say, if we express a pair of qubits in their computational basis state, we get
$\begin{matrix} (5) & | ψ ⟩ = α_{00} | 00 ⟩ + α_{01} | 01 ⟩ + α_{10} | 10 ⟩ + α_{11} | 11 ⟩ \end{matrix}$
normalization $\Sigma_{x\in\{0,1\}^2} \abs{\alpha_x}^2 = 1$ . Now if we only take measurement on the first qubit:
$\begin{matrix} (6) & {\begin{cases} will measure 0 with probability {| α_{00} |}^{2} + {| α_{01} |}^{2}, leaving the post-measurement state | ψ^{'} ⟩ = \frac{α_{00} | 00 ⟩ + α_{01} | 01 ⟩}{\sqrt{{| α_{00} |}^{2} + {| α_{01} |}^{2}}} \\ will measure 1 with probability {| α_{10} |}^{2} + {| α_{11} |}^{2}, leaving the post-measurement state | ψ^{'} ⟩ = \frac{α_{10} | 10 ⟩ + α_{11} | 11 ⟩}{\sqrt{{| α_{10} |}^{2} + {| α_{11} |}^{2}}} \end{cases} \end{matrix}$
A particularly important two qubit state is the Bell state or EPR pair,
$\begin{matrix} (7) & \frac{| 00 ⟩ + | 11 ⟩}{\sqrt{2}} \end{matrix}$
is the key ingredient in quantum teleportation and superdense coding. We can know from the measurement on the first qubit which state the second qubit would have if we then measures it. The two qubits are strongly correlated.

1.3 Quantum computation

Changes occurring to a quantum state can be described using the language of quantum computation. Analogous to the way a classical computer is built from an electrical circuit containing wires and logic gates, a quantum computer is built from a quantum circuit containing wires and elementary quantum gates to carry around and manipulate the quantum information.

1.3.1 Single qubit gates

$\alpha\ket0 + \beta\ket1 \longrightarrow\begin{bmatrix}\alpha \\ \beta\end{bmatrix}$ .
NOT $X = \begin{bmatrix} 0 & 1\\ 1 & 0\end{bmatrix}$ ${X}\begin{bmatrix}\alpha \\ \beta\end{bmatrix} = \begin{bmatrix}\beta \\ \alpha\end{bmatrix}$ .
$U$ constraints $\abs{\alpha}^2 + \abs{\beta}^2 = 1$ yields
$\begin{matrix} (8) & U^{†} U = I \end{matrix}$
Amazingly, this unitarity constraint is the only constraint on quantum gates!
Another two important gates:
$\begin{matrix} (9) & \begin{matrix} Z \equiv [\begin{matrix} 1 & 0 \\ 0 & - 1 \end{matrix}] and H \equiv \frac{1}{\sqrt{2}} [\begin{matrix} 1 & 1 \\ 1 & - 1 \end{matrix}] \end{matrix} \end{matrix}$
Hadamard $\hat y$ $\hat x$ $H$ $\ket +$ state).

✏️ Decomposing single qubit operations
$2 \times 2$ unnitary matrix may be decomposed as
$\begin{matrix} (10) & \begin{matrix} U = e^{i α} [\begin{matrix} e^{- i β / 2} & 0 \\ 0 & e^{i β / 2} \end{matrix}] [\begin{matrix} \cos \frac{γ}{2} & - \sin \frac{γ}{2} \\ \sin \frac{γ}{2} & \cos \frac{γ}{2} \end{matrix}] [\begin{matrix} e^{- i δ / 2} & 0 \\ 0 & e^{i δ / 2} \end{matrix}] \end{matrix} \end{matrix}$
$\alpha, \;\beta, \;\gamma$ $\delta$ are real-valued. Notice that the second matrix is just an ordinary rotation. It turns out that the first and last matrices can also be understood as rotations in a different plane. This decomposition can be used to give an exact prescription for performing an arbitrary single qubit quantum logic gate.

1.3.2 Multiple qubit gates

常見的 classical multiple gates 有 AND, OR, XOR, NAND, 和NOR. An important theoretical result is that any function on bits can be computed from the composition of NAND gates alone, which is thus known as a universal gate.
CNOT can be realized as the control qubit and the target qubit are XORed and stored in the target qubit:
$\oplus$ is addition modulo two. Yet another way of describing the action of the CNOT is through its matrix representation:
$\begin{matrix} (11) & \begin{matrix} U_{CNOT} = [\begin{matrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \end{matrix}] \end{matrix} \end{matrix}$
$U_{\text{CNOT}}$ unitary $U_{\text{CNOT}}^\dagger U_{\text{CNOT}} = I$ .
Any multiple qubit logic gate may be composed from CNOT and single qubit gates. (proof in later content)

1.3.3 Measurements & Quantum circiuts

$\ket{+}$ $\ket -$ states as though they were the computaitonal basis states, for instance,
$\begin{matrix} (12) & | ψ ⟩ = α | 0 ⟩ + β | 1 ⟩ = \frac{α + β}{\sqrt{2}} | + ⟩ + \frac{α - β}{\sqrt{2}} | - ⟩ \end{matrix}$
$\ket a$ $\ket b$ $\alpha\ket a + \beta\ket b$ of these states. Furthermore, provided the states are orthonormal, it is possible to perform a measurement with respect to the basis.
Swap gate: Below circuit accomplishes the swap operation.
$\begin{aligned} | a, b ⟩ & ⟶ | a, a \oplus b ⟩ \\ ⟶ | a \oplus (a \oplus b), a \oplus b ⟩ = | b, a \oplus b ⟩ \\ (13) & ⟶ | b, (a \oplus b) \oplus b ⟩ = | b, a ⟩ \end{aligned}$
There are a few features allowed in classical circuits that are not usually present in quantum circuits:
- Acyclic: there are no loops in quantum circuits (feedback from one part of the quantum circuit to another).
- No FANIN: wires can't be joined together, since this is a bitwise-OR operation and irreversible, xhence not unitary.
- No FANOUT: quantum mechanics forbids the copying of a qubit, which is the no-cloning theorem.

✏️ The no-cloning theorem
$\ket\psi$ $A$ $B$ $\ket s$ ) in a quantum machine. Thus the initial state of the copying machine is:
$\begin{matrix} (14) & | ψ ⟩ \otimes | s ⟩ \end{matrix}$
$U$ now effects the copying procedure, ideally,
$\begin{matrix} | ψ ⟩ \otimes | s ⟩ \overset{U}{\to} U (| ψ ⟩ \otimes | s ⟩) = | ψ ⟩ \otimes | ψ ⟩ \end{matrix}$
$\ket\varphi$ , also:
$\begin{array}{r} (15) & U (| ψ ⟩ \otimes | s ⟩) = | ψ ⟩ \otimes | ψ ⟩ \\ (16) & U (| φ ⟩ \otimes | s ⟩) = | φ ⟩ \otimes | φ ⟩ \end{array}$
Taking the inner product of these two equations gives:
$\begin{matrix} (17) & ⟨ ψ | φ ⟩ = (⟨ ψ | φ ⟩)^{2} \end{matrix}$
$\ket\psi = \ket\varphi$ $\ket\psi$ $\ket\varphi$ . Thus a cloning device can only clone states which are orthogonal to one another, and therefore a general quantum cloning device is impossible.
Even if one allows non-unitary cloning devices, the cloning of non-orthogonal pure states remains impossible unless one is willing to tolerate a finite loss of fidelity in the copied states.

1.3.4 Bell states & Quantum teleportation

Above is a demonstration of quantum teleportation:

$\ket\psi = \alpha\ket0 + \beta\ket1$ , then the state of the three qubits are:
$\begin{aligned} original state : & (α | 0 ⟩ + β | 1 ⟩) | 00 ⟩ \\ \overset{after Hadamard gate}{\to} \frac{1}{\sqrt{2}} (α | 0 ⟩ + β | 1 ⟩) (| 00 ⟩ + | 10 ⟩) \\ \overset{after CNOT}{\to} \frac{1}{\sqrt{2}} (α | 0 ⟩ + β | 1 ⟩) (| 00 ⟩ + | 11 ⟩) \end{aligned}$
We then entangle the state which we want to teleport with qubit #2:
$\begin{aligned} from previous section : & \frac{1}{\sqrt{2}} (α | 000 ⟩ + α | 011 ⟩ + β | 100 ⟩ + β | 111 ⟩) \\ \overset{after CNOT}{\to} \frac{1}{\sqrt{2}} (α | 000 ⟩ + α | 011 ⟩ + β | 110 ⟩ + β | 101 ⟩) \\ \overset{after Hadamard}{\to} \frac{1}{\sqrt{2}} (α (\frac{| 0 ⟩ + | 1 ⟩}{\sqrt{2}}) (| 00 ⟩ + | 11 ⟩) + β (\frac{| 0 ⟩ - | 1 ⟩}{\sqrt{2}}) (| 10 ⟩ + | 01 ⟩)) \end{aligned}$
which equals to:
$\begin{matrix} (18) & \frac{1}{2} (| 00 ⟩ (α | 0 ⟩ + β | 1 ⟩) + | 01 ⟩ (α | 1 ⟩ + β | 0 ⟩) + | 10 ⟩ (α | 0 ⟩ - β | 1 ⟩) + | 11 ⟩ (α | 1 ⟩ - β | 0 ⟩)) \end{matrix}$
$X$ $Z$ depend on the measurement outcome of qubit #1 and #2 $\ket\psi$ on qubit #3.

Quantum teleportation emphasizes the interchangeability of different resources in quantum mechanics, showing that one shared EPR pair together with two classical bits of communication is a resource at least the equal of one qubit of communication. In particular, in Chapter 10 待補 we explain how teleportation can be used to build quantum gates which are resistant to the effects of noise, and in Chapter 12 待補 we show that teleportation is intimately connected with the properties of quantum error-correcting codes.

1.4 Quantum algorithms

1.4.1 Simulating Classical Computer

Though quantum circuits can't be used to directly simulate classical circuits (because unitary quantum logic gates are inherently reversible, whereas many classical logic gates such as the NAND gate are inherently irreversible), but we can make use of the Toffoli gate:
non-deterministic $H$ $\ket0$ state, then measure it and results in a 50/50 chance of 0 or 1.

1.4.2 Quantum parallelism

$f(x)$ different $x$ $f(x):\{0,1\} \rightarrow\{0,1\}$ $U_f: \ket{x,y} \rightarrow \ket{x, y\oplus f(x)}$ $\oplus$ 是為了要讓這個transformation是unitary的, 只要apply兩次就回到原來的state).
$\ket y = 0$ $f(x)$ in the second register. But if we first Hadamard transformWalsh– Hadamard transform ${\ket{0, f(0)} + \ket{1, f(1)}}/\sqrt2$ .
$n$ $\ket0$ state is
$\begin{matrix} (19) & \frac{1}{\sqrt{2^{n}}} \sum_{x} | x ⟩ \end{matrix}$
$x$ $H^{\otimes n}$ $\otimes$ $2^n$ $n$ qubits.
Only quantum parallelism is not enough, we also require the ability to extract informationmore than merely one $f(x)$ from superposition states like
$\begin{matrix} (20) & \frac{1}{\sqrt{2^{n}}} \sum_{x} | x ⟩ | f (x) ⟩ \end{matrix}$
$U_f$ $n+1$ $\ket{0}^{\otimes n}\ket{0}$ .

1.4.3 Deutsch's Algorithm

interference $\ket{\psi_0} = \ket{01}$ $U_f$ above:

$H$ $\ket{\psi_1} = \Big[\frac{\ket0 + \ket1}{\sqrt2}\Big]\Big[\frac{\ket0 - \ket1}{\sqrt2}\Big]$ $U_f$ $\ket{\psi_2} = (-1)^{f(x)}\ket{x}\frac{\ket0-\ket1}{\sqrt2}$ $f(x) = 0$ $U_f$ $f(x) = 1$ $y\oplus f(x)$ $y$ $\ket{\psi_2}$ :

\begin{matrix} (21) & \begin{matrix} | ψ_{2} ⟩ = {\begin{cases} \pm [\frac{| 0 ⟩ + | 1 ⟩}{\sqrt{2}}] [\frac{| 0 ⟩ - | 1 ⟩}{\sqrt{2}}], if f (0) = f (1) \\ \pm [\frac{| 0 ⟩ - | 1 ⟩}{\sqrt{2}}] [\frac{| 0 ⟩ - | 1 ⟩}{\sqrt{2}}], if f (0) \neq f (1) \end{cases} \end{matrix} \end{matrix}

, therefore we have:

\begin{matrix} (22) & \begin{matrix} | ψ_{3} ⟩ = {\begin{cases} \pm | 0 ⟩ [\frac{| 0 ⟩ - | 1 ⟩}{\sqrt{2}}], if f (0) = f (1) \\ \pm | 1 ⟩ [\frac{| 0 ⟩ - | 1 ⟩}{\sqrt{2}}], if f (0) \neq f (1) \end{cases} \end{matrix} \end{matrix}

, notice that if

\begin{matrix} (23) & {\begin{cases} f (0) = f (1) \Rightarrow f (0) \oplus f (1) = 0 \\ f (0) \neq f (1) \Rightarrow f (0) \oplus f (1) = 1 \end{cases} \end{matrix}

$\ket{\psi_3}$ as:

\begin{matrix} (24) & | ψ_{3} ⟩ = \pm | f (0) \oplus f (1) ⟩ [\frac{| 0 ⟩ - | 1 ⟩}{\sqrt{2}}] \end{matrix}

global property $f(x)$ $f(0) \oplus f(1)$ one $f(x)$ .

The essence of the design of many quantum algorithms is that a clever choice of function and final transformation allows efficient determination of useful global information about the function (information which cannot be attained quickly on a classical computer).

1.4.4 The Deutsch–Jozsa algorithm

First we need to deal with the Deutsch’s problem:

$f(x)$ is balanced or constant?

$x$ $0$ $2^n-1$ .
$x$ $f$ $f(x)$ $0$ $1$ .

balanced $0$ $1$ $x$ constant $x$ .

$2^{n-1}+1$ queries (for the worst case)
Quantum case: scheme $n$ $1$ , we result in
$\begin{matrix} (25) & | ψ_{0} ⟩ = {| 0 ⟩}^{\otimes n} | 1 ⟩ \end{matrix}$
$H$ gate on the answer register we get
$\begin{matrix} (26) & | ψ_{1} ⟩ = \sum_{x \in {0, 1}^{n}} \frac{| x ⟩}{\sqrt{2^{n}}} [\frac{| 0 ⟩ - | 1 ⟩}{\sqrt{2}}] \end{matrix}$
$f$ $U_f: \ket{x, y} \rightarrow\ket{x, y\oplus f(x)}$ , giving
$\begin{matrix} (27) & | ψ_{2} ⟩ = \sum_{x \in {0, 1}^{n}} \frac{(- 1)^{f (x)} | x ⟩}{\sqrt{2^{n}}} [\frac{| 0 ⟩ - | 1 ⟩}{\sqrt{2}}] \end{matrix}$
we now has a set of qubits in which the result of function evaluation is stored in the amplitude of the qubit superposition state.
$n$ $n=1$ case:
$\begin{matrix} (28) & H | x ⟩ = \sum_{z} \frac{(- 1)^{x z} | z ⟩}{\sqrt{2}} \end{matrix}$
thus,
$\begin{matrix} (29) & H^{\otimes n} | x_{1}, \dots, x_{n} ⟩ = \frac{1}{\sqrt{2^{n}}} \sum_{z_{1}, \dots, z_{n}} (- 1)^{x_{1} z_{1} + \dots x_{n} z_{n}} | z_{1}, \dots, z_{n} ⟩ \end{matrix}$
, and this can be summarized in a more terse term:
$\begin{matrix} (30) & H^{\otimes n} | x ⟩ = \frac{1}{\sqrt{2^{n}}} \sum_{z} (- 1)^{x \cdot z} | z ⟩ \end{matrix}$
$x\cdot z$ $x$ $z$ $2$ .
$\ket{\psi_3}$ :
$\begin{matrix} (31) & | ψ_{3} ⟩ = \sum_{z} \sum_{x} \frac{(- 1)^{x \cdot z + f (x)} | z ⟩}{2^{n}} [\frac{| 0 ⟩ - | 1 ⟩}{\sqrt{2}}] \end{matrix}$
$\ket z = \ket{0\cdots 0}$ $f(x)$ .
$\begin{matrix} (32) & {\begin{cases} f is constant ⟶ | 0 \dots 0 ⟩ ’s amplitude is \pm 1 ⟶ we will definitely measure all 0 for n qubits. \\ f is balanced ⟶ | 0 \dots 0 ⟩ ’s amplitude is 0 ⟶ we will definitely not measure all 0 for n qubits. \end{cases} \end{matrix}$
Caveats:
- Deutsch’s problem is not an especially important problem; it has no known applications.
- The method for evaluating the function is quite different in the two cases, render it uncomparable between classical and quantum scenario.
- If one adopts a probabilistic instead of deterministic classical computer, one can quickly decide the function type with high confidence.

Exercise 1.1

(Probabilistic classical algorithm) $\epsilon<1/2$ . What is the performance of the best classical algorithm for this problem?

solution

\begin{matrix} (33) & (\binom{2^{n} / 2}{2}) / (\binom{2^{n}}{2}) \times 2 = \frac{(2^{n - 1}) (2^{n - 1} - 1) / 2}{(2^{n}) (2^{n} - 1) / 2} \times 2 = \frac{2^{n - 1} - 1}{2^{n} - 1} < \frac{1}{2} \end{matrix}

$\epsilon<1/2$ .

1.4.5 Summarization

Generally there are 3 classes of quantum algorithms which provide an advantage over known classical algorithms:

algorithms based upon quantum versions of the Fourier transform (e.g. Deutsch–Jozsa algorithm, Shor's algorithm for factoring and discrete algorithm)
quantum search algorithms
quantum simulation

We will now breifly describe each of these three classes of algorithms.

Quantum algorithms based upon the Fourier transform discrete Fourier transform $x_0, \cdots, x_{N-1}$ $N$ $y_0, \cdots, y_{N-1}$ defined by
$\begin{matrix} (34) & y_{k} \equiv \frac{1}{\sqrt{N}} \sum_{j = 0}^{N - 1} e^{i \frac{2 π j k}{N}} x_{j} \end{matrix}$
, the Fourier transformed version of a problem is often easier than the original problem, enabling a solution.
The Fourier transform has proved so useful that a beautiful generalized theory of Fourier transforms has been developed which goes beyond the definition of equation (33). What is important is that the Hadamard transform used in the Deutsch–Jozsa algorithm is an example of this generalized class of Fourier transforms. Moreover, many of the other important quantum algorithms also involve some type of Fourier transform.
$U$ $n$ $\ket j$ :
$\begin{matrix} (35) & | j ⟩ ⟶ \frac{1}{\sqrt{2^{n}}} \sum_{k = 0}^{2^{n} - 1} e^{i \frac{2 π j k}{2^{n}}} | k ⟩, where 0 \leq j \leq (2^{n} - 1) . \end{matrix}$
, you can check the unitarity of this transformation, which makes it a valid quantum circuit; moreover if we perform same transformation on superposition states:
$\begin{matrix} (36) & \sum_{j = 0}^{2^{n} - 1} x_{j} | j ⟩ ⟶ \sum_{j = 0}^{2^{n} - 1} x_{j} (\frac{1}{\sqrt{2^{n}}} \sum_{k = 0}^{2^{n} - 1} e^{i \frac{2 π j k}{2^{n}}} | k ⟩) = \sum_{k = 0}^{2^{n} - 1} y_{k} | k ⟩ \end{matrix}$
, which is the vectorequation (33) $N = 2^n$ .
Time complexity:
- $N\log(N) \approx n2^n$ steps.
- $\Big(\log(N)\Big)^2 \approx n^2$ steps. (The quantum circuit to do this is explained in [Chapter 5][#5] 待補)
Though we can compute QFT faster on quantum computer, Fourier transform is being performed on the information hiddennot $\ket0$ $\ket1$ $y_k$ directly.
Problems like Deutsch’s problem, and Shor’s algorithms for discrete logarithm and factoring, are all cases for Kitaev's discovery of a method to solve the Abelian stabilizer problem, and the generalization to the hidden subgroup problem:
$f$ $G$ $X$ $f$ $K$ $U\ket{g}\ket{h} = \ket{g} \ket{h\oplus f(g)}$ $g\in G,\; h\in X$ $\oplus$ $X$ $K$ .
Quantum search algorithms With its basic principles were discovered by Grover, quantum search algorithm solves the following problem:
$N$ , and no prior knowledge about the structure of the information in it, we want to find an element of that search space satisfying a known property. How long does it take to find an element satisfying that property?
Time complexity:
- $N$ operations.
- $\sqrt N$ operations.
Though it offers only a quadratic speedup (whereas QFT offers an exponential speedup), it has richer application and can adapt to a wide range of problems, in [Chapter 6][#6] 待補 which will it be explained in detail.
Quantum simulation
- $n$ $c^n$ classical computer $c$ is a constant which depends upon details of the system being simulated, and the desired accuracy of the simulation.
- $cn$ $cn$ $cn$ $c^n$ bits information is not entirely accessible)
Thus, a crucial step in making quantum simulations useful is development of systematic means by which desired answers can be efficiently extracted.

The power of quantum computation

First we need to understand some of Computational complexity theory:
- $\textbf{P}$ $\textbf{NP}$ $\textbf{P}$ $\textbf{NP}$ is the class of problems which have solutions which can be quickly checked on a classical computer.
- not $\textbf{P}$ $\textbf{NP}$ .
- $\textbf{P}$ $\textbf{NP}$ , since the ability to solve a problem implies the ability to check potential solutions.
$\textbf{NP}$ $\textbf{P}$ , which is:
$\begin{matrix} (37) & P = NP or P \neq NP \end{matrix}$
- $\textbf{P} \neq \textbf{NP}$ $\textbf{NP-complete}$ problem which can be efficiently solved on a classical computer. (可以把NP-complete問題想成是NP問題裡面比較具代表性的問題)
$\textbf{P}$ $\textbf{NP}$ , there're also other complexity classes:
- $\textbf{PSPACE}$ : consists of those problems which can be solved using resources which are few in spatial size (that is, the computer is smalllong $\textbf{P}$ $\textbf{NP}$ although, again, this has never been proved.
- $\textbf{BPP}$ polynomial $\frac{1}{4}$ $\textbf{P}$ 更能在classical computer上是efficiently solvable的)

No matter what do we not know, what is already clear is that the theory of quantum computation poses interesting and significant challenges to the traditional notions of computation. What makes this an important challenge is that the theoretical model of quantum computation is believed to be experimentally realizable, because – to the best of our knowledge – this theory is consistent with the way Nature works. If this were not so then quantum computation would be just another mathematical curiosity.

1.5 Experimental quantum information processing

In the next two sections , we begin with a review of the famous Stern–Gerlach experiment, which provides evidence for the existence of qubits in Nature. We then widen our scope, addressing the broader problem of how to build practical quantum information processing systems.

1.5.1 The Stern–Gerlach experiment

Qubit is the term we used to describe the fundamental element of QCQI, but is there any quantum system that behaves like a qubit in real world? It turns out that we may now understand qubits in terms of two level quantum systems.
1927 Stern–Gerlach experiment: 用烤箱加熱氫原子(本來在1922時是用銀原子), then form a beam of atoms, which subsequently fly through a magnetic field (會偏折原子束的方向), and finally project on to a screen. Theoretically, there would only be one peak since the magnetic dipole moment is 0 for hydrogen atom, but surprisingly there appears two peaks, which then suggests there may be some hidden variables (即後來提出的電子自旋 electron spin).
$\text{beam}\rightarrow Z\rightarrow X$ $\pm X$ $\text{beam}\rightarrow Z\rightarrow X\rightarrow Z$ $\pm Z$ $Z$ $Z$ 磁場的時候被過濾掉了)
The qubit model provides a simple explanation of this experimentally observed behavior:
$\begin{aligned} | + Z ⟩ & \leftarrow | 0 ⟩ \\ | - Z ⟩ & \leftarrow | 1 ⟩ \\ | + X ⟩ & \leftarrow (| 0 ⟩ + | 1 ⟩) / \sqrt{2} \\ | - X ⟩ & \leftarrow (| 0 ⟩ - | 1 ⟩) / \sqrt{2} \end{aligned}$
$\ket{+Z} = \frac{\ket{+X}+\ket{-X}}{\sqrt2}$ $\ket{+X} = \frac{\ket{+Z}+\ket{-Z}}{\sqrt2}$ , and this explains why everytime there will be two beams split upon meeting new magnetic fields. This example demonstrates how qubits could be a believable way of modeling systems in Nature.

1.5.2 Prospects for practical quantum information processing

There are 2 possible points which will prohibit us from doing one or more forms of quantum information processing:
- Noise: there is a threshold theorem for quantum computation, which states that if the level of noise in a quantum computer can be reduced below a certain constant threshold value, quantum error-correcting codes can be used to push it down even further, essentially ad-infinitum! The theorem indicates that the effects of noise can be made essentially negligible for quantum information processing. In Chapters 8, 10 and 12 待補 we will discuss quantum noise, quantum error-correction and the threshold theorem in detail.
- QM is incorrect: If this occurs, it will be a momentous discovery in the history of science, and can be expected to have considerable consequences in other areas of science and technology, as did the discovery of quantum mechanics.
Quantum state tomography and quantum process tomography are two elementary processes whose perfection is of great importance to quantum computation and quantum information, as well as being of independent interest in their own right.
- Quantum state tomography is a method for determining the quantum state of a system: by performing repeated preparations of the same quantum state, which is then measured in different ways in order to build up a complete description of the quantum state.
- Quantum process tomography is a more ambitious (but closely related) procedure to completely characterize the dynamics of a quantum system. (e.g. can be used to characterize the performance of an alleged quantum gate or quantum communications channel, or to determine the types and magnitudes of different noise processes in a system.)
Quantum state tomography and quantum process tomography are described in more detail in Chapter 8 待補.
Various small-scale communications primitives like quantum cryptography and quantum teleportation are also of great interest. Chapter 12 待補 will show that teleportation may be an extremely useful primitive for transmitting quantum states between distant nodes in a network, in the presence of noise. 量子瞬移(~~我真會翻譯~~)其實更注重EPR pair的純度, since the EPR pairs may be corrupted during communication, but special entanglement distillation protocols can then be used to clean up the EPR pairs, enabling them to be used to teleport quantum states from one location to another.
For the medium-scale, a promising of quantum information processing is to the simulation of quantum systems. 因為60個qubit的系統就不足以讓超級電腦負荷過來了, 更不用說要跑模擬.
For large-scale applications, there are the factoring of large numbers, taking discrete logarithms, and quantum searching. 前兩者在長遠方向上不重要, 因為他們只威脅到現今我們所使用的加密技術的可靠度; 但quantum searching就應用範圍很廣了, we will discuss some possible applications in Chapter 6 待補.
Physical realization of QC:
- Optical system turns out to be suitable for small-scale quantum computer, since photons are highly stable carriers for quantum information, which in other words, hard to directly interact with another, so the quantum gates (a form of interaction) must be mediated by something else, like an atom (光子一號先跟一顆原子作用, 該原子再跟光子二號作用, 來達成兩顆光子的間接作用)
- Ion traps and neutral atom traps are alternative schemes for storing qubits. Photons are also used in this scheme, but instead of carring quantum information itself, they manipulate the quantum information stored inside the atom in trap. (e.g. Single qubit quantum gates can be performed by applying appropriate pulses of electromagnetic radiation to individual atoms; quantum measurement can be accomplished in these systems using the long established quantum jumps technique, which implements with superb accuracy the measurements in the computational basis used for quantum computation.)
- Nuclear Magnetic Resonance, or NMRnuclear spin $10^{15}$ $10^{15}$ 台電腦平行運算).

To conclude, note that it is important not to assess quantum information processing as though it were just another technology for information processing.

1.6 Quantum Information

We can identify a few fundamental goals uniting work on quantum information theory:

Identify elementary classes of static resources in quantum mechanics. (e.g. qubit, classical bit, or a Bell state shared between two distant parties 都可以作為存儲/傳遞訊息的單位)
Identify elementary classes of dynamical processes in quantum mechanics. (e.g. memory⇒the ability to store a quantum state over some period af time; quantum information transmission; or the process of protecting quantum information processing against the effects of noise)
Quantify resource tradeoffs incurred performing elementary dynamical processes. (e.g. what are the minimal resources required to reliably transfer quantum information between two parties using a noisy communications channel?)

The remainder of this section describes some examples of questions studied by quantum information theory, in each case emphasizing the fundamental static and dynamic elements under consideration, and the resource tradeoffs being considered.

Classical information through quantum channels
- The fundamental results of classical information theory are Shannon’s noiseless channel coding theorem (quantifies how many bits are required to store information being emitted by a source of information), and Shannon’s noisy channel coding theorem (quantifies how much information can be reliably transmitted through a noisy communications channel)
- information source $p_j, \;j = 1,2,\cdots,d$ $j$ $p_j$ $p_\text e > p_\text z$ , 那我們就可以改用比較少bit的資訊量來表示字母"e", 使整體文章的bit使用量減少, 而Shannon's noiseless channel coding theorem可以告訴我們how well we can compress such a source)
- noiseless channel coding theorem $p_j$ $H(p_j)$ bits of information, where
  $\begin{matrix} (38) & H (p_{j}) \equiv - \sum_{j} p_{j} \log p_{j} \end{matrix}$
  is a function of the source probability distribution known as the Shannon entropy. Moreover, the noiseless channel coding theorem tells us that to attempt to represent the source using fewer bits than this will result in a high probability of error when the information is decompressed. (See chapter 12 待補 for more detail)
- Verify that Shannon’s noiseless coding theorem satisfies the previous three goals:
  1. Static resources X2: the bit and the information source
  2. Two-stage dynamic process X1: compress the information source and then decompress it after transmission
  3. Finally a quantitative criterion for determining the resources consumed (goal 3) by an optimal data compression scheme is found.
- Shannon’s second major result, the noisy channel coding theorem, quantifies the amount of information that can be reliably transmitted through a noisy channel. 訊息傳輸可能是兩個空間點之間的傳輸, 或是兩個時間點之間的傳輸(即儲存資訊, 且是在有噪音的情況下). Both ways to overcome noises is to introduce error-correcting codes, the idea is to add an amount of redundancy into the information so that even some bits corrupt after the channel, one is still possible to recover the original message. (比如說一個channel的錯誤率是50%, 那它的 capacity 就是 half a bit) Shannon’s noisy channel coding theorem provides a general procedure for calculating the capacity of an arbitrary noisy channel.
- Verify that Shannon’s noisy channel coding theorem satisfies the previous three goals:
  1. Static resources X2: the information source, and the bits being sent through the channel.
  2. Dynamic process X3: noise process + encoding process + decoding process.
  3. For a fixed noise model, Shannon’s theorem tells us how much redundancy must be introduced by an optimal error-correction scheme if reliable information transmission is to be achieved (goal 3).
- One might wonder if the medium used to carry information changed from a bit to a qubit, are the Shannon's two theorems still valid? (e.g. if using qubits allows a better compression rate than is possible classically?) Unfortunately, we will eventually prove that qubits do not allow any significant saving in the amount of communication required to transmit information over a noiseless channel.
- For the problem of transmitting classical information through a noisy quantum channel, in Chapter 12 待補 we’ll prove the HSW (Holevo–Schumacher–Westmoreland) theorem, which provides a lower bound on the capacity of such a channel. Will the encoding scheme using entangled states raise the capacity beyond the lower bound provided by the HSW theorem? All evidence to date suggests that this doesn’t help raise the capacity, but it is still a fascinating open problem of quantum information theory.
Quantum information through quantum channels
- $\ket0$ $(\ket0+\ket1)/\sqrt2$ , we aren't able to apply Shannon entropy to quantum informations. The way we use to compress the quantum information is not error-free, instead we use a measure: fidelity, to quantify the correctness of the information after decompressing from the compressed form, in the limit of large block lengths, it should tend towards the no error limit of 1.
- Schumacher’s noiseless channel coding theorem quantifies the resources required to do quantum data compression, with the restriction that it be possible to recover the source with fidelity close to 1:
  - $\ket{\psi_j}$ $p_j$ $H(p_j)$ .
  - for a more general case (source that produces non-orthogonal states): how much a quantum source may be compressed is not Shannon entropy ever, but von Neumann entropystrictly smaller $\ket0$ $p$ $(\ket0+\ket1)/\sqrt2$ $1-p$ $H(p, 1-p)$ qubits per use of the source!)
  - compressing procedure for a certain scheme:
    1. $\ket0$ $p$ $(\ket0+\ket1)/\sqrt2$ $1-p$ $n$ $n$ qubits of information.
    2. $\lim_{n\to\infty}$ $np + \frac{n(1-p)}{2}$ $\ket0$ $\frac{n(1-p)}{2}$ $\ket1$ $n$ $\binom{n}{n(1-p)/2} \xrightarrow{\text{Stirling’s approximation}} 2^{nH(\frac{1+p}{2}, \frac{1-p}{2})} \equiv N$ combinations in total.
    3. $\ket{0\cdots00},\;\ket{0\cdots01}, \cdots, \;\ket{1\cdots10},\;\ket{1\cdots11}$ , equivalent to performing a unitary transformation:
      $\begin{matrix} (39) & {| 0 ⟩}^{\otimes \frac{n (1 + p)}{2}} {| 1 ⟩}^{\otimes \frac{n (1 - p)}{2}} \overset{unitary transform}{\to} | j ⟩ {| 0 ⟩}^{\otimes (n - n H (\frac{1 + p}{2}, \frac{1 - p}{2}))} \end{matrix}$
      $\ket0$ $nH(\frac{1+p}{2}, \frac{1-p}{2})$ qubits.
    4. $\ket0$ s and do the inverse unitary transform.
    $H(\frac{1+p}{2}, \frac{1-p}{2})$ $p\geq 1/3$ $H(p,\;1-p)$ $\ket0$ $(\ket0+\ket1)/\sqrt2$ $\ket0$ 方向上有分量的這個redundancy, 來進行壓縮的)
- Can we find an analogue of Shannon’s noisy channel coding theorem? Considerable progress on this important question has been made, using the theory of quantum error-correcting codes; however, a fully satisfactory analogue has not yet been found.
Quantum distinguishability
- $\ket0$ $(\ket0+\ket1)/2$ $\{\ket0, \ket1, \ket+, \ket-\}$ $\frac{\ket{00}+\ket{11}}{2}$ $\frac{\ket{++}+\ket{--}}{2}$ $Z$ $X$ $\{\ket0, \ket1, \ket+, \ket-\}$ and he can instantaneously know what measurement does Alice perform).
- $\{\ket0, \ket1, \ket+, \ket-\}$ on the note it publishes, it can attribute a unique classical serial number to each note, and then verify it by telling the merchant what will the result be after measuring with a sequence of basis the bank provides, after the merchants provide them the serial number on individual note. The counterfeiter cannot duplicate same note due to the quantum indistinguishability.)

Exercise 1.2

$\ket\psi$ $\ket\varphi$ $\ket\psi$ $\ket\varphi$ , in violation of the no-cloning theorem. Conversely, explain how a device for cloning could be used to distinguish non-orthogonal quantum states.

solution

If states are distinguishable, you can determine which state you send and then, employing an appropreate designer of Hamiltonian, build a second system in the same state.

Conversely, if you can prepare many identical copies of a qubit, then it is possible to measure the mean value of noncommuting observables.

Creation and transformation of entanglement
- Creating entanglement is a simple dynamical process of interest in quantum information theory. How many qubits must two parties exchange if they are to create a particular entangled state shared between them, given that they share no prior entanglement?
- Transforming entanglement from one form into another. (e.g. Alice and Bob share between them a Bell state, and wish to transform it into some other type of entangled state. What resources do they need to accomplish this task?)
Answering these and more complex questions about the creation and transformation of entanglement forms a fascinating area of study in its own right, and also promises to give insight into tasks such as quantum computation. (e.g. a distributed quantum computation may be viewed as simply a method for generating entanglement between two or more parties.)

Problem 1.1

(Feynman-Gates conversation) Construct a friendly imaginary discussion of about 2000 words between Bill Gates and Richard Feynman, set in the present, on the future of computation. (Comment: You might like to try waiting until you’ve read the rest of the book before attempting this question. See the 'History and further reading' below for pointers to one possible answer for this question.)

solution

From Bill Gates:

Thirty years ago I went on vacation and fell for Richard Feynman.
A friend and I were planning a trip together and wanted to mix a little learning in with our relaxation. We looked at a local university’s film collection, saw that they had one of his lectures on physics, and checked it out. We loved it so much that we ended up watching it twice. Feynman had this amazing knack for making physics clear and fun at the same time. I immediately went looking for more of his talks, and I’ve been a big fan ever since. Years later I bought the rights to those lectures and worked with Microsoft to get them posted online for free.

Problem 1.2

What is the most significant discovery yet made in quantum computation and quantum information? Write an essay of about 2000 words for an educated lay audience about the discovery. (Comment: As for the previous problem, you might like to try waiting until you’ve read the rest of the book before attempting this question.)

solution

Maybe try waiting until I've read the rest of the book before attempting this question.

2 Introduction to quantum mechanics

In this chapter we will acquire familiarity with elementary linear algebra while introducing the notation used by physicists to describe quantum mechanics (which is different to that used in most introductions to linear algebra). We then review the basic postulates of quantum mechanics. Later on we will introduce superdense coding, a surprising and illuminating example of quantum information processing which combines many of the postulates of quantum mechanics in a simple setting. Next there are powerful tools like density operator, purifications, and the Schmidt decomposition, which are especially useful in the study of quantum computation and quantum information.

2.1 Linear algebra

vector spaces $\mathbb{C}^n$ $n$ $(z_1,\cdots,z_n)$ .

2.1.1 Bases, operators and matrices

spanning set $\ket{v_1}, \ket{v_2}, \cdots, \ket{v_n}$ $\ket v$ $\sum_ia_i\ket{v_i}$ $\begin{bmatrix}0\\1\end{bmatrix}$ $\begin{bmatrix}1\\0\end{bmatrix}$ $\mathbb C^2$ . Generally, a vector space may have many different spanning sets.
$\ket{v_1}, \ket{v_2}, \cdots, \ket{v_n}$ dependent $a_1, \cdots, a_n$ $a_i\neq0$ $i$ $a_1\ket{v_1} + a_2\ket{v_2} + \cdots + a_n\ket{v_n} = 0$ .
$V$ $V$ .

Exercise 2.1

(Linear dependence: example) $(1, -1)$ $(1,2)$ $(2,1)$ are linearly dependent.

solution

\begin{matrix} \begin{matrix} (1) [\begin{matrix} 1 \\ - 1 \end{matrix}] + (1) [\begin{matrix} 1 \\ 2 \end{matrix}] + (- 1) [\begin{matrix} 2 \\ 1 \end{matrix}] = 0 \end{matrix} \end{matrix}

linear operator $A:V \rightarrow W$ is linear to its inputs:
$\begin{matrix} (40) & A (\sum_{i} a_{i} | v_{i} ⟩) = \sum_{i} a_{i} A | v_{i} ⟩ \end{matrix}$
$A$ $V$ $A$ $V$ $V$ .
matrix $A:V \rightarrow W$ $\ket{v_1},\cdots,\ket{v_m}$ $\ket{w_1},\cdots,\ket{w_n}$ $W$ $i$ $1,\cdots,m$ $A_{1i},\cdots,A_{ni}$ such that:
$\begin{matrix} (41) & | v_{i} ⟩ = \sum_{j = 1}^{n} A_{i j} | w_{j} ⟩ \end{matrix}$

Exercise 2.2

(Matrix representations: example) $V$ $\ket0$ $\ket1$ $A$ $V$ $V$ $A\ket0 = \ket1$ $A\ket1 = \ket0$ $A$ $\ket0, \ket1$ $\ket0, \ket1$ $A$ .

solution

\begin{matrix} (42) & \begin{matrix} A = [\begin{matrix} 0 & 1 \\ 1 & 0 \end{matrix}] \end{matrix} \end{matrix}

$\{\ket0, \ket1\}\to \{\ket1, \ket0\}$ $A$ $\begin{bmatrix}1 &0\\0 &1\end{bmatrix}$ .

Exercise 2.3

(Matrix representation for operator products) $A$ $V$ $W$ $B$ $W$ $X$ $\ket{v_i}, \ket{w_j}, \ket{x_k}$ $V$ $W$ $X$ $BA$ $B$ $A$ , with respect to the appropriate bases.

solution

$\ket{v_i} = \sum_j A_{ji}\ket{w_j}$ $\ket{w_j} = \sum_k B_{kj}\ket{x_k}$ . Combine the two equations we get

\begin{matrix} | v_{i} ⟩ = \sum_{j} \sum_{k} B_{k j} A_{j i} | x_{k} ⟩ = \sum_{k} (B A)_{k i} | x_{k} ⟩ \end{matrix}

$BA$ $B$ $A$ .

Exercise 2.4

(Matrix representation for identity) $V$ has a matrix representation which is one along the diagonal and zero everywhere else, if the matrix representation is taken with respect to the same input and output bases. This matrix is known as the identity matrix.

solution

\begin{matrix} (43) & | v_{j} ⟩ = \sum_{i} I_{i j} | v_{i} ⟩ = | v_{j} ⟩, \forall j \Rightarrow I_{i j} = δ_{i j} \end{matrix}

2.1.2 The Pauli matrices and inner products

Four extremely useful matrices (which we shall often have occasion to use) are the Pauli matrices:
$\begin{aligned} σ_{0} \equiv I \equiv [\begin{array}{c} 1 & 0 \\ 0 & 1 \end{array}] & σ_{1} \equiv X \equiv [\begin{array}{c} 0 & 1 \\ 1 & 0 \end{array}] \\ σ_{2} \equiv Y \equiv [\begin{array}{c} 0 & - i \\ i & 0 \end{array}] & σ_{3} \equiv Z \equiv [\begin{array}{c} 1 & 0 \\ 0 & - 1 \end{array}] \end{aligned}$
inner product $\ket v$ $\ket w$ $\braket{v}{w}$ $\braket{v}{w} = \braket{w}{v}^*$ .

Exercise 2.5

$(\cdot,\cdot)$ $\mathbb C^n$

solution

$\ket v = (v_1, v_2, \cdots, v_n)^\intercal$ $\ket w = (w_1, w_2, \cdots, w_n)^\intercal$ , then the three conditions require satisfaction are:

\begin{aligned} (| v ⟩, \sum_{i} λ_{i} | w_{i} ⟩) & = \sum_{j} v_{j}^{*} (\sum_{i} λ_{i} w_{i j}) = \sum_{i} λ_{i} (\sum_{i} v_{j}^{*} w_{i j}) = \sum_{i} λ_{i} (| v ⟩, | w_{i} ⟩) \\ (| v ⟩, | w ⟩) & = \sum_{i} v_{i}^{*} w_{i} = \sum_{i} (w_{i}^{*} v_{i})^{*} = (| w ⟩, | v ⟩)^{*} \\ (| v ⟩, | v ⟩) & = \sum_{i} v_{i}^{*} v_{i} = \sum_{i} {| v_{i} |}^{2} \geq 0 \end{aligned}

$(\cdot,\cdot)$ $\mathbb C^n$ .

Exercise 2.6

$(\cdot,\cdot)$ $\bigg(\sum_i\lambda_i\ket{w_i}, \ket{v}\bigg) = \sum_i\lambda_i^*\bigg(\ket{w_i}, \ket{v}\bigg)$ .

solution

\begin{matrix} (44) & (\sum_{i} λ_{i} | w_{i} ⟩, | v ⟩) = (| v ⟩, \sum_{i} λ_{i} | w_{i} ⟩)^{*} = (\sum_{i} λ_{i} (| v ⟩, | w_{i} ⟩))^{*} = \sum_{i} λ_{i}^{*} (| v ⟩, | w_{i} ⟩)^{*} . \end{matrix}

Discussions of quantum mechanics often refer to Hilbert space. In the finite dimensional complex vector spaces that come up in quantum computation and quantum information, a Hilbert space is exactly the same thing as an inner product space.

Exercise 2.7

$\ket w\equiv(1,1)$ $\ket v\equiv(1,-1)$ are orthogonal. What are the normalized forms of these vectors?

solution

$\braket{w}{v} = 1-1 = 0\Rightarrow \text{orthogonal}$ $\ket w = \frac{1}{\sqrt2}(1,1)$ $\ket v = \frac{1}{\sqrt2}(1,-1)$ .

$\ket{w_1},\cdots,\ket{w_d}$ $V$ Gram–Schmidt procedure $\ket{v_1},\cdots,\ket{v_d}$ $V$ :
1. $\ket{v_1} \equiv \frac{\ket{w_1}}{\Vert\ket{w_1}\Vert}$
2. $1\leq k\leq(d-1)$ $\ket{v_{k+1}}$ inductively by:
$\begin{matrix} (45) & | v_{k + 1} ⟩ \equiv \frac{| w_{k + 1} ⟩ - \sum_{i = 1}^{k} ⟨ v_{i} | w_{k + 1} ⟩ | v_{i} ⟩}{‖ | w_{k + 1} ⟩ - \sum_{i = 1}^{k} ⟨ v_{i} | w_{k + 1} ⟩ | v_{i} ⟩ ‖} \end{matrix}$
$\ket{v_1}, \cdots, \ket{v_d}$ $V$ .

Exercise 2.8

Prove that the Gram–Schmidt procedure produces an orthonormal basis for V .

solution

One can proof by mathematical induction.

$\sum_i\ket i\bra i = I$ .
- $A:V\to W$ $\ket{v_i}$ $V$ $\ket{w_j}$ $W$ . By applying the completeness relation twice we obtain:
  $\begin{aligned} A & = I_{W} A I_{V} \\ = \sum_{i, j} | w_{j} ⟩ ⟨ w_{j} | A | v_{i} ⟩ ⟨ v_{i} | \\ = \sum_{i, j} ⟨ w_{j} | A | v_{i} ⟩ | w_{j} ⟩ ⟨ v_{i} | \end{aligned}$
  $A$ $A$ $\bra{w_j}A\ket{v_i}$ $i^{\text{th}}$ $j^\text{th}$ $\ket{v_i}$ $\ket{w_j}$ .
- A second application illustrating the usefulness of the completeness relation is the Cauchy–Schwarz inequality:
  $\begin{matrix} (46) & ⟨ v | v ⟩ ⟨ w | w ⟩ \geq | ⟨ w | v ⟩ |^{2} \end{matrix}$
  ✏️ Proof of the Cauchy–Schwarz inequality
  $\Vert v\Vert^2 \equiv V$ $\braket{u}{v}\equiv c$ , then we have:
  $\begin{aligned} 0 \leq \frac{1}{‖ v ‖^{2}} ‖ ‖ v ‖^{2} u - ⟨ v | u ⟩ v ‖^{2} & = \frac{1}{V} ⟨ V u - \bar{c} v | V u - \bar{c} v ⟩ \\ = \frac{1}{V} (V^{2} ‖ u ‖^{2} - V \bar{c} ⟨ u | v ⟩ - c V ⟨ v | u ⟩ + c \bar{c} ⟨ v | v ⟩) \\ = V ‖ u ‖^{2} - \bar{c} c - c \bar{c} + c \bar{c} \\ = V ‖ u ‖^{2} - \bar{c} c \\ = ‖ v ‖^{2} ‖ u ‖^{2} - | ⟨ u | v ⟩ |^{2} \end{aligned}$
  Therefore we have:
  $\begin{matrix} (47) & ‖ v ‖^{2} ‖ u ‖^{2} \geq | ⟨ u | v ⟩ |^{2} \end{matrix}$

Exercise 2.9

(Pauli operators and the outer product) $\ket0, \ket1$ for a two-dimensional Hilbert space. Express each of the Pauli operators in the outer product notation.

solution

\begin{matrix} X = | 0 ⟩ ⟨ 1 | + | 1 ⟩ ⟨ 0 | Y = - i | 0 ⟩ ⟨ 1 | + i | 1 ⟩ ⟨ 0 | Z = | 0 ⟩ ⟨ 0 | - | 1 ⟩ ⟨ 1 | \end{matrix}

Exercise 2.10

$\ket{v_i}$ $V$ $\ket{v_j}\bra{v_k}$ $\ket{v_i}$ basis?

solution

$0$ $1$ $j^\text{th}$ $k^\text{th}$ column.

2.1.3 Eigenvectors and Hermitian operators

eigenvector $A$ non-zero $\ket v$ $A\ket{v} = v\ket{v}$ $v$ $A$ $\ket{v}$ .
characteristic function $c(\lambda)\equiv \det{\vert A-\lambda I \vert}$ $A$ not on the specific matrix representation $A$ $c(\lambda) =0$ are the eigenvalues of the operator.
eigenspace $v$ $v$ $A$ degenerate $A$ defined by:
$\begin{matrix} \begin{matrix} A \equiv [\begin{matrix} 2 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 0 \end{matrix}] \end{matrix} \end{matrix}$
$(1,0,0)$ $(0,1,0)$ degenerate $A$ with the same eigenvalue.
diagonal representation $A$ $V$ $A = \sum_i\lambda_i\ket i\bra i$ $\ket i$ $A$ $\lambda_i$ . An operator is said to be diagonalizable if it has a diagonal representation. e.g. the Pauli Z matrix can be written:
$\begin{matrix} (48) & \begin{matrix} Z = [\begin{matrix} 1 & 0 \\ 0 & - 1 \end{matrix}] = | 0 ⟩ ⟨ 0 | - | 1 ⟩ ⟨ 1 | \end{matrix} \end{matrix}$
$\ket0$ $\ket1$ , respectively. Diagonal representations are sometimes also known as orthonormal decompositions.

Exercise 2.11

(Eigen decomposition of the Pauli matrices) $X$ $Y$ $Z$ .

solution

$X$ $\lambda = \pm1$ $\ket{\lambda_{-1}} = \frac{1}{\sqrt2}\begin{bmatrix}1\\-1\end{bmatrix}$ $\ket{\lambda_{+1}} = \frac{1}{\sqrt2}\begin{bmatrix}1\\1\end{bmatrix}$ $\ket0\bra1+\ket1\bra0$ .
$Y$ $\lambda = \pm1$ $\ket{\lambda_{-1}} = \frac{1}{\sqrt2}\begin{bmatrix}1\\-i\end{bmatrix}$ $\ket{\lambda_{+1}} = \frac{1}{\sqrt2}\begin{bmatrix}1\\i\end{bmatrix}$ $-i\ket0\bra1+i\ket1\bra0$ .
$Z$ $\lambda = \pm1$ $\ket{\lambda_{-1}} = \begin{bmatrix}0\\1\end{bmatrix}$ $\ket{\lambda_{+1}} = \begin{bmatrix}1\\0\end{bmatrix}$ $\ket0\bra0-\ket1\bra1$ .

Exercise 2.12

Prove that the matrix

\begin{matrix} [\begin{matrix} 1 & 0 \\ 1 & 1 \end{matrix}] \end{matrix}

is not diagonalizable.

solution

$(1-\lambda)^2 = 0\Rightarrow\lambda=1$ $\ket{\lambda_1} = \begin{bmatrix}0\\1\end{bmatrix}$ $c\ket{\lambda_1}\bra{\lambda_1} = c\begin{bmatrix}0&0\\0&1\end{bmatrix} \neq \begin{bmatrix}1&0\\1&1\end{bmatrix}$ , it is not diagonalizable.

Exercise 2.13

$\ket w$ $\ket v$ $(\ket w\bra v)^\dagger = \ket v\bra w$ .

solution

$\ket w\bra v$ $M$ $M_{ij} = w_i^*v_j$ $(\ket v\bra w)_{ij} = v_i^*w_j$ $(v_i^*w_j)^\intercal = v_j^*w_i$ $(v_j^*w_i)^* = v_jw_i^*$ $(\ket w\bra v)^\dagger = \ket v\bra w$ .

Or, without the matrix representation, consider:

\begin{matrix} {\begin{cases} {⟨ ψ | (| w ⟩ ⟨ v |) ϕ ⟩}^{*} = {⟨ (| w ⟩ ⟨ v |)^{†} ψ | ϕ ⟩}^{*} = ⟨ ϕ | (| w ⟩ ⟨ v |)^{†} ψ ⟩ \\ {⟨ ψ | (| w ⟩ ⟨ v |) ϕ ⟩}^{*} = (⟨ ψ | w ⟩ ⟨ v | ϕ ⟩)^{*} = ⟨ ϕ | v ⟩ ⟨ w | ψ ⟩ \end{cases} \end{matrix}

$(\ket w\bra v)^\dagger = \ket v\bra w$ .

Exercise 2.14

(Anti-linearity of the adjoint) Show that the adjoint operation is anti-linear,

\begin{matrix} (\sum_{i} a_{i} A_{i})^{†} = \sum_{i} a_{i}^{*} A_{i}^{†} \end{matrix}

solution

\begin{aligned} ⟨ (a_{i} A_{i})^{†} ψ | ϕ ⟩ & = ⟨ ψ | (a_{i} A_{i}) ϕ ⟩ \\ = a_{i} ⟨ ψ | (A_{i}) ϕ ⟩ \\ = a_{i} ⟨ (A_{i})^{†} ψ | ϕ ⟩ \\ = ⟨ a_{i}^{*} (A_{i})^{†} ψ | ϕ ⟩ . \end{aligned}

Exercise 2.15

$(A^\dagger)^\dagger = A$ .

solution

$\braket{(A^\dagger)^\dagger\psi}{\phi} = \braket{\psi}{A^\dagger\phi} = \braket{A^\dagger\phi}{\psi}^* = \braket{\phi}{A\psi}^* = \braket{A\psi}{\phi}$ $(A^\dagger)^\dagger = A$ .

$A$ $A$ is known as a Hermitian or self-adjoint operator.
- projectors $W$ $k$ $d$ $V$ $\ket1,\cdots,\ket d$ $V$ $\ket1,\cdots,\ket k$ $W$ . By definition,
  $\begin{matrix} (49) & P \equiv \sum_{i = 1}^{k} | i ⟩ ⟨ i | \end{matrix}$
  $W$ .
- $\ket v\bra v$ $\ket v$ $P$ Hermitian $P^\dagger = P$ .
- orthogonal complement $P$ $Q\equiv I-P$ $\ket{k+1},\cdots,\ket d$

Exercise 2.16

$P$ $P^2 = P$ .

solution

$P^2 = \bigg(\sum_i\ket i\bra i\bigg)\bigg(\sum_j\ket j\bra j\bigg) = \sum_{i,j}\ket i\braket{i}{j}\bra j = \sum_{i,j}\ket i\bra j\delta_{ij} = \sum_i\ket i\bra i = P$

$A$ normal $AA^\dagger = A^\dagger A$ . Obviously, an Hermitian operator is also normal. There is a remarkable representation theorem for normal operators known as the spectral decomposition, which states that an operator is a normal operator if and only if it is diagonalizable.

Theorem 2.1

(Spectral decomposition)normal $M$ $V$ is diagonalorthonormal basis $V$ . Conversely, any diagonalizable operator is normal.

proof

$d$ $V$ .

$d=1$ case, it's trivial.
$d>1$ $\lambda$ $M$ $P$ $\lambda$ $Q$ $M = IMI = (P+Q)M(P+Q) = PMP+QMP+PMQ+QMQ$ $PMP=\lambda P$ $QMP = 0$ $PMQ= 0$ for the following reason:
- $\ket v$ $P$ $MM^\dagger\ket v = M^\dagger M\ket v = \lambda M^\dagger\ket v$ $M^\dagger \ket v$ $\lambda$ $M$ $P$ $M^\dagger P = \lambda P\Rightarrow QM^\dagger P=0$ $PMQ=0$ $P$ $Q$ are Hermitian)
$M = PMP+QMQ$ $QMQ$ is normal:
- $QM = QMI = QM(P+Q) = QMP + QMQ = 0+QMQ$ $QM^\dagger = QM^\dagger(P+Q) = 0+QM^\dagger Q$ $M$ $Q^2=Q$ (one can prove it trivially), we have:
  $\begin{aligned} (Q M Q) (Q M^{†} Q) & = Q M Q M^{†} Q \\ = Q M M^{†} Q \\ = Q M^{†} M Q \\ = Q M^{†} Q M Q \\ = (Q M^{†} Q) (Q M Q) \end{aligned}$
  $QMQ$ induction $QMQ$ $Q$ $d=2$ $P$ $1$ $Q$ $1$ $d=1$ $d$ $1$ $2$ $\cdots$ $PMP$ $P$ $M=PMP+QMQ$ $V$ .

$M$ $M = \sum_i\lambda_i\ket i\bra i$ $\lambda_i$ $M$ $\ket i$ $V$ $\ket i$ $M$ $\lambda_i$ .

$M$ $M = \sum_i\lambda_iP_i$ $P_i$ $\lambda_i$ $M$ . And:

$\sum_iP_i = I$ ,
$P_iP_j = \delta_{ij}P_i$ .

Exercise 2.17

Show that a normal matrix is Hermitian if and only if it has real eigenvalues.

solution

$A$ $A^\dagger = A$ $\ket\lambda$ $A$ $\lambda$ $\bra{\lambda}A\ket\lambda = \lambda\bra{\lambda}\ket{\lambda} = \lambda$ $\lambda^* = \bra{\lambda}A^\dagger\ket\lambda = \bra{\lambda}A\ket\lambda = \lambda\;\Rightarrow\;\lambda^* = \lambda$ , it has real eigenvalues.
$A$ $A = \sum_i\lambda_i\ket i\bra i$ $A^\dagger = \sum_i\lambda_i^*\ket i\bra i$ $\lambda^* = \lambda$ $A^\dagger = A$ .

Therefore a normal matrix is Hermitian if and only if it has real eigenvalues.

$U$ unitary $U^\dagger U=I$ $U$ unitary $U^\dagger U=I$ $UU^\dagger = I$ $U$ is normal and has a spectral decomposition.
$U\ket v$ $U\ket w$ $\ket v$ $\ket w$ .
$\begin{matrix} (50) & (U | v ⟩, U | w ⟩) = ⟨ v | U^{†} U | w ⟩ = ⟨ v | I | w ⟩ = ⟨ v | w ⟩ \end{matrix}$
$U$ $\ket{v_i}$ $\ket{w_i}\equiv U\ket{v_i}$ $\ket{w_i}$ $U=\sum_i\ket{w_i}\bra{v_i}$ .

Exercise 2.18

$1$ $e^{i\theta}$ $\theta$ .

solution

$\ket\lambda$ $U$ $U\ket\lambda = \lambda\ket\lambda$ $1 = \braket{\lambda} = \bra{\lambda}I\ket\lambda = \bra{\lambda}U^\dagger U\ket\lambda = \lambda^*\lambda\braket{\lambda} = \Vert\lambda\Vert^2 = 1$ $\lambda = e^{i\theta}$ .

Exercise 2.19

(Pauli matrices: Hermitian and unitary) Show that the Pauli matrices are Hermitian and unitary.

solution

Show by simple calculation.

Exercise 2.20

(Basis changes) $A'$ $A''$ $A$ $V$ $\ket{v_i}$ $\ket{w_i}$ $A'$ $A''$ $A'_{ij} = \bra{v_i}A\ket{v_j}$ $A''_{ij} = \bra{w_i}A\ket{w_j}$ $A'$ $A''$ .

solution

$U\equiv\sum_i\ket{w_i}\bra{v_i}$ , then we have:

\begin{aligned} A_{i j}^{'} & = ⟨ v_{i} | A | v_{j} ⟩ \\ = \sum_{k, l} ⟨ v_{i} | w_{k} ⟩ ⟨ w_{k} | A | w_{l} ⟩ ⟨ w_{l} | v_{j} ⟩ \\ = \sum_{k, l} ⟨ v_{i} | U | v_{k} ⟩ ⟨ w_{k} | A | w_{l} ⟩ ⟨ v_{l} | U^{†} | v_{j} ⟩ \\ = \sum_{k, l} U_{i k} A_{k l}^{″} U_{l j}^{†} \end{aligned}

A special subclass of Hermitian operators is the positive operators (extremely important!):
- $A$ $\ket v$ $(\ket v, A\ket v)$ is a real non-negative number.
- $A$ positive definite $(\ket v, A\ket v)$ $\ket v\neq0$ any positive operator is automatically Hermitian $\sum_i\lambda_i\ket i\bra i$ $\lambda_i$ )

Exercise 2.21

proof $M$ is Hermitian, simplifying the proof wherever possible.

solution

$M$ $M^\dagger = M$ $QMQ$ is normal:

\begin{aligned} (Q M Q) (Q M Q)^{†} & = Q M Q Q M^{†} Q \\ = Q M^{†} Q Q M Q \\ = (Q M Q)^{†} (Q M Q) \end{aligned}

$QMQ$ $QMQ$ is diagonal, and so on...

Exercise 2.22

Prove that two eigenvectors of a Hermitian operator with different eigenvalues are necessarily orthogonal.

solution

\begin{aligned} ⟨ λ_{i} | H | λ_{j} ⟩ = λ_{j} ⟨ λ_{i} | λ_{j} ⟩ \\ ⟨ λ_{j} | H | λ_{i} ⟩ = λ_{i} ⟨ λ_{j} | λ_{i} ⟩ \\ ⟨ λ_{j} | H^{†} | λ_{i} ⟩ = λ_{j}^{*} ⟨ λ_{j} | λ_{i} ⟩ = λ_{i} ⟨ λ_{j} | λ_{i} ⟩ (taking the adjoint of the first equation) \\ \Rightarrow (λ_{j}^{*} - λ_{i}) ⟨ λ_{j} | λ_{i} ⟩ = (λ_{j} - λ_{i}) ⟨ λ_{j} | λ_{i} ⟩ = 0 \end{aligned}

$\lambda_i\neq\lambda_j$ $\braket{\lambda_j}{\lambda_i}=0$ , which is the eigenvectors are necessarily orthogonal.

Exercise 2.23

$P$ $0$ $1$ .

solution

$P^2 = P$ , the following two equations are equivalent:

\begin{matrix} {\begin{cases} P | λ ⟩ = λ | λ ⟩ \\ P^{2} | λ ⟩ = λ P | λ ⟩ = λ^{2} | λ ⟩ \end{cases} \end{matrix}

$\lambda = \lambda^2 \Rightarrow \lambda = \{0,1\}$ .

Exercise 2.24

(Hermiticity of positive operators)Hint $A$ $A = B+iC$ $B$ $C$ are Hermitian.)

solution

$A$ is an arbitrary operator, it can be expressed as

\begin{matrix} A = (\frac{A + A^{†}}{2}) + i (\frac{A - A^{†}}{2 i}) \equiv B + i C \end{matrix}

$B$ $C$ $\bra vA\ket v \geq 0$ $\bra vA\ket v\in\mathbb{R}$ .

\begin{array}{r} ⟨ v | A | v ⟩ = ⟨ v | (B + i C) | v ⟩ = ⟨ v | B | v ⟩ + i ⟨ v | C | v ⟩ \in R \end{array}

$C=0$ $A$ is necessarily Hermitian.

Exercise 2.25

$A$ $A^\dagger A$ is positive.

solution

$\bra\psi A^\dagger A\ket\psi = c$ $\ket\psi$ $c^* = \bra\psi (A^\dagger A)^\dagger\ket\psi = \bra\psi A^\dagger A\ket\psi = c$ $c\in\mathbb R$ $\bra\psi A^\dagger A\ket\psi = \Big(A\ket\psi, A\ket\psi\Big) = \Big\Vert A\ket\psi\Big\Vert^2 \geq 0$ $A^\dagger A$ is positive.

2.1.4 Tensor products

The tensor product is a way of putting vector spaces together to form larger vector spaces.
$V$ $W$ $m$ $n$ $V\otimes W$ $mn$ dimensional vector space.
$V\otimes W$ $\ket v\otimes\ket w$ .
$\ket i$ $\ket j$ $V$ $W$ $\ket i\otimes\ket j$ $V\otimes W$ .
$\ket v\ket w$ $\ket{v,w}$ $\ket{vw}$ $\ket v\otimes\ket w$ .
By definition the tensor product satisfies the following properties:
1. scalar $z$ $\ket v$ $V$ $\ket w$ $W$ ,
  $\begin{matrix} (51) & z (| v ⟩ \otimes | w ⟩) = (z | v ⟩) \otimes | w ⟩ = | v ⟩ \otimes (z | w ⟩) \end{matrix}$
2. $\ket{v_1}$ $\ket{v_2}$ $V$ $\ket w$ $W$ ,
  $\begin{matrix} (52) & (| v_{1} ⟩ + | v_{2} ⟩) \otimes | w ⟩ = | v_{1} ⟩ \otimes | w ⟩ + | v_{2} ⟩ \otimes | w ⟩ \end{matrix}$
3. $\ket{w_1}$ $\ket{w_2}$ $W$ $\ket v$ $V$ ,
  $\begin{matrix} (53) & | v ⟩ \otimes (| w_{1} ⟩ + | w_{2} ⟩) = | v ⟩ \otimes | w_{1} ⟩ + | v ⟩ \otimes | w_{2} ⟩ \end{matrix}$
$V\otimes W$ $A$ $B$ $V$ $W$ $A\otimes B$ $V\otimes W$ by the equation:
$\begin{matrix} (54) & (A \otimes B) (| v ⟩ \otimes | w ⟩) \equiv A | v ⟩ \otimes B | w ⟩ \end{matrix}$
$V\otimes W$ $A\otimes B$ :
$\begin{matrix} (55) & (A \otimes B) (\sum_{i} a_{i} | v_{i} ⟩ \otimes | w_{i} ⟩) \equiv \sum_{i} a_{i} A | v_{i} ⟩ \otimes B | w_{i} ⟩ \end{matrix}$
$V\otimes W$ space is naturally defined as,
$\begin{matrix} (56) & (\sum_{i} a_{i} | v_{i} ⟩ \otimes | w_{i} ⟩, \sum_{j} b_{j} | v_{j}^{'} ⟩ \otimes | w_{j}^{'} ⟩) \equiv \sum_{i j} a_{i}^{*} b_{j} ⟨ v_{i} | v_{j}^{'} ⟩ ⟨ w_{i} | w_{j}^{'} ⟩ \end{matrix}$
$V\otimes W$ inherits the other structure we are familiar with, such as notions of an adjoint, unitarity, normality, and Hermiticity.
Kronecker product $A$ $m\times n$ $B$ $p\times q$ matrix, then we have the matrix representation:
$\begin{matrix} (57) & \begin{matrix} A \otimes B \equiv [\begin{matrix} A_{11} B & A_{12} B & \dots & A_{1 n} B \\ A_{21} B & A_{22} B & \dots & A_{2 n} B \\ ⋮ & ⋮ & ⋱ & ⋮ \\ A_{m 1} B & A_{m 2} B & \dots & A_{m n} B \end{matrix}] \end{matrix} \end{matrix}$
$(1,2)$ $(3,4)$ is the vector:
$\begin{matrix} [\begin{matrix} 1 \\ 2 \end{matrix}] \otimes [\begin{matrix} 3 \\ 4 \end{matrix}] = [\begin{matrix} 1 \times 3 \\ 1 \times 4 \\ 2 \times 3 \\ 2 \times 4 \end{matrix}] = [\begin{matrix} 3 \\ 4 \\ 6 \\ 8 \end{matrix}] \end{matrix}$
$X$ $Y$ is:
$\begin{matrix} \begin{matrix} X \otimes Y = [\begin{matrix} 0 \cdot Y & 1 \cdot Y \\ 1 \cdot Y & 0 \cdot Y \end{matrix}] = [\begin{matrix} 0 & 0 & 0 & - i \\ 0 & 0 & i & 0 \\ 0 & - i & 0 & 0 \\ i & 0 & 0 & 0 \end{matrix}] \end{matrix} \end{matrix}$
$\ket\psi^{\otimes k}$ $\ket{\psi}$ $k$ $\ket\psi^{\otimes 3} = \ket\psi\otimes \ket\psi\otimes \ket\psi$ . An analogous notation is also used for operators on tensor product spaces.

Exercise 2.26

$\ket\psi = (\ket0+\ket1)/\sqrt2$ $\ket\psi^{\otimes 2}$ $\ket\psi^{\otimes 3}$ $\ket0\ket1$ , and using the Kronecker product.

solution

\begin{aligned} {| ψ ⟩}^{\otimes 2} & = \frac{1}{2} (| 0 ⟩ | 0 ⟩ + | 0 ⟩ | 1 ⟩ + | 1 ⟩ | 0 ⟩ + | 1 ⟩ | 1 ⟩) = \frac{1}{2} [\begin{array}{c} 1 \\ 1 \\ 1 \\ 1 \end{array}] \\ {| ψ ⟩}^{\otimes 3} & = \frac{1}{2 \sqrt{2}} (| 0 ⟩ | 0 ⟩ | 0 ⟩ + | 0 ⟩ | 1 ⟩ | 0 ⟩ + | 1 ⟩ | 0 ⟩ | 0 ⟩ + | 1 ⟩ | 1 ⟩ | 0 ⟩ + | 0 ⟩ | 0 ⟩ | 1 ⟩ + | 0 ⟩ | 1 ⟩ | 1 ⟩ + | 1 ⟩ | 0 ⟩ 1 | + ⟩ | 1 ⟩ | 1 ⟩ | 1 ⟩) \\ = \frac{1}{2 \sqrt{2}} [\begin{array}{c} 1 \\ 1 \\ 1 \\ 1 \\ 1 \\ 1 \\ 1 \\ 1 \end{array}] \end{aligned}

Exercise 2.27

^(a) $X$ $Z$ ^(b) $I$ $X$ ^(c) $X$ $I$ . Is the tensor product commutative?

solution

\begin{aligned} X \otimes Z & = [\begin{array}{c} 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & - 1 \\ 1 & 0 & 0 & 0 \\ 0 & - 1 & 0 & 0 \end{array}] \\ I \otimes X & = [\begin{array}{c} 0 & 1 & 0 & 0 \\ 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \end{array}] \\ X \otimes I & = [\begin{array}{c} 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \end{array}] \end{aligned}

, the tensor product is not commutative.

Exercise 2.28

Show that the transpose, complex conjugation, and adjoint operations distribute over the tensor product.

\begin{matrix} (58) & (A \otimes B)^{*} = A^{*} \otimes B^{*}; (A \otimes B)^{⊺} = A^{⊺} \otimes B^{⊺}; (A \otimes B)^{†} = A^{†} \otimes B^{†} \end{matrix}

solution

\begin{aligned} (A \otimes B)^{*} & = [\begin{array}{c} A_{11}^{*} B^{*} & \dots & A_{1 n}^{*} B^{*} \\ ⋮ & ⋱ & ⋮ \\ A_{m 1}^{*} B^{*} & \dots & A_{m n}^{*} B^{*} \end{array}] = A^{*} \otimes B^{*} \\ (A \otimes B)^{⊺} & = [\begin{array}{c} A_{11} B^{⊺} & \dots & A_{m 1} B^{⊺} \\ ⋮ & ⋱ & ⋮ \\ A_{1 n} B^{⊺} & \dots & A_{m n} B^{⊺} \end{array}] = A^{⊺} \otimes B^{⊺} \\ (A \otimes B)^{†} & = [\begin{array}{c} A_{11}^{*} B^{†} & \dots & A_{m 1}^{*} B^{†} \\ ⋮ & ⋱ & ⋮ \\ A_{1 n}^{*} B^{†} & \dots & A_{m n}^{*} B^{†} \end{array}] = A^{†} \otimes B^{†} \end{aligned}

Exercise 2.29

Show that the tensor product of two unitary operators is unitary.

solution

$U_1$ $U_2$ $U_1U_1^\dagger = I,\;U_2U_2^\dagger=I$ ), then we have:

\begin{array}{r} (U_{1} \otimes U_{2}) (U_{1} \otimes U_{2})^{†} = U_{1} U_{1}^{†} \otimes U_{2} U_{2}^{†} = I \otimes I = I \end{array}

Exercise 2.30

Show that the tensor product of two Hermitian operators is Hermitian.

solution

$H_1$ $H_2$ be the two Hermitian operators, then we have:

\begin{array}{r} (H_{1} \otimes H_{2})^{†} = H_{1}^{†} \otimes H_{2}^{†} = H_{1} \otimes H_{2} = (H_{1} \otimes H_{2}) \end{array}

Exercise 2.31

Show that the tensor product of two positive operators is positive.

solution

$A$ $B$ be the two positive operators, then we have:

\begin{array}{r} ⟨ v \otimes w | A \otimes B | v \otimes w ⟩ = ⟨ v | A | v ⟩ \cdot ⟨ w | B | w ⟩ \geq 0 \end{array}

Exercise 2.32

Show that the tensor product of two projectors is a projector.

solution

$P_1$ $P_2$ be the two projectors, then we have:

\begin{array}{r} (P_{1} \otimes P_{2})^{2} = (P_{1} \otimes P_{2}) (P_{1} \otimes P_{2}) = P_{1}^{2} \otimes P_{2}^{2} = P_{1} \otimes P_{2} \end{array}

Exercise 2.33

The Hadamard operator on one qubit may be written as

\begin{matrix} (59) & H = \frac{1}{\sqrt{2}} [(| 0 ⟩ + | 1 ⟩) ⟨ 0 | + (| 0 ⟩ - | 1 ⟩) ⟨ 1 |] \end{matrix}

$n$ $H^{\otimes n}$ , may be written as

\begin{matrix} (60) & H^{\otimes n} = \frac{1}{\sqrt{2^{n}}} \sum_{\vec{x}, \vec{y}} (- 1)^{\vec{x} \cdot \vec{y}} | \vec{x} ⟩ ⟨ \vec{y} | \end{matrix}

$H^{\otimes 2}$ .

solution

\begin{aligned} H^{\otimes n} & = \frac{1}{\sqrt{2^{n}}} (\sum_{x_{1}, y_{1}} (- 1)^{x_{1} \cdot y_{1}} | x_{1} ⟩ ⟨ y_{1} |) \otimes (\sum_{x_{2}, y_{2}} (- 1)^{x_{2} \cdot y_{2}} | x_{2} ⟩ ⟨ y_{2} |) \otimes \dots \otimes (\sum_{x_{n}, y_{n}} (- 1)^{x_{n} \cdot y_{n}} | x_{n} ⟩ ⟨ y_{n} |) \\ = \frac{1}{\sqrt{2^{n}}} \sum_{\vec{x}, \vec{y}} (- 1)^{\vec{x} \cdot \vec{y}} | \vec{x} ⟩ ⟨ \vec{y} | \end{aligned}

$H^{\otimes 2}$ is:

\begin{matrix} [\begin{matrix} 1 & 1 & 1 & 1 \\ 1 & - 1 & 1 & - 1 \\ 1 & 1 & - 1 & - 1 \\ 1 & - 1 & - 1 & 1 \end{matrix}] \end{matrix}

2.1.5 Operator functions

$f:\{\mathbb C\to\mathbb C\}$ , it is possible to define a corresponding matrix function on normal matrices.
$A = \sum_aa\ket a\bra a$ $A$ , we then define:
$\begin{matrix} (61) & f (A) \equiv \sum_{a} f (a) | a ⟩ ⟨ a | \end{matrix}$
$f(A)$ is uniquely defined.
This procedure can be used to define the square root of a positive operator, the logarithm of a positive-definite operator, or the exponential of a normal operator. As an example,
$\begin{matrix} \begin{matrix} e^{θ Z} = [\begin{matrix} e^{θ} & 0 \\ 0 & e^{- θ} \end{matrix}] \end{matrix} \end{matrix}$

Exercise 2.34

Find the square root and logarithm of the matrix

\begin{matrix} [\begin{matrix} 4 & 3 \\ 3 & 4 \end{matrix}] \end{matrix}

solution

The spectral decomposition for the matrix is:

\begin{array}{r} [\begin{array}{c} 4 & 3 \\ 3 & 4 \end{array}] = 1 \cdot (\frac{1}{\sqrt{2}} [\begin{array}{c} 1 \\ - 1 \end{array}] \frac{1}{\sqrt{2}} [\begin{array}{c} 1 & - 1 \end{array}]) + 7 \cdot (\frac{1}{\sqrt{2}} [\begin{array}{c} 1 \\ 1 \end{array}] \frac{1}{\sqrt{2}} [\begin{array}{c} 1 & 1 \end{array}]) \end{array}

Therefore we have:

\begin{aligned} \sqrt{A} & = \frac{1}{2} [\begin{array}{c} \sqrt{7} + 1 & \sqrt{7} - 1 \\ \sqrt{7} - 1 & \sqrt{7} + 1 \end{array}] \\ \ln A & = \frac{1}{2} [\begin{array}{c} \ln 7 & \ln 7 \\ \ln 7 & \ln 7 \end{array}] \end{aligned}

Exercise 2.35

(Exponential of the Pauli matrices) $\vec v$ $\theta$ a real number. Prove that

\begin{matrix} (62) & \exp (i θ \vec{v} \cdot \vec{σ}) = (\cos θ) I + i \sin (θ) \vec{v} \cdot \vec{σ} \end{matrix}

$\vec v\cdot\vec\sigma\equiv\sum_{i=1}^3v_i\sigma_i$ .

solution

$\vec v\cdot\vec\sigma = \begin{bmatrix}v_3&v_1-iv_2\\v_1+iv_2&-v_3\end{bmatrix}$ :

\begin{aligned} \det [\begin{array}{c} v_{3} - λ & v_{1} - i v_{2} \\ v_{1} + i v_{2} & - v_{3} - λ \end{array}] & = λ^{2} - (v_{1}^{2} + v_{2}^{2} + v_{3}^{2}) = 0 \\ \Rightarrow λ = \pm \sqrt{v_{1}^{2} + v_{2}^{2} + v_{3}^{2}} \end{aligned}

$\vec v$ $\sqrt{v_1^2+v_2^2+v_3^2}=1$ $\lambda = \pm1$ $\ket{\lambda_1}$ $\ket{\lambda_{-1}}$ .

\begin{aligned} \exp (i θ \vec{v} \cdot \vec{σ}) & = \exp (1 \cdot i θ) | λ_{1} ⟩ ⟨ λ_{1} | + \exp (- 1 \cdot i θ) | λ_{- 1} ⟩ ⟨ λ_{- 1} | \\ = (\cos θ + i \sin θ) | λ_{1} ⟩ ⟨ λ_{1} | + (\cos θ - i \sin θ) | λ_{- 1} ⟩ ⟨ λ_{- 1} | \\ = (\cos θ) (| λ_{1} ⟩ ⟨ λ_{1} | + | λ_{- 1} ⟩ ⟨ λ_{- 1} |) + (i \sin θ) (| λ_{1} ⟩ ⟨ λ_{1} | - | λ_{- 1} ⟩ ⟨ λ_{- 1} |) \\ = (\cos θ) I + i \sin (θ) \vec{v} \cdot \vec{σ} \end{aligned}

$\vec v\cdot\vec\sigma$ $\ket{\lambda_1}\bra{\lambda_1}-\ket{\lambda_{-1}}\bra{\lambda_{-1}}$ .

trace $A$ (another matrix function) is defined to be the sum of its diagonal elements:
$\begin{matrix} (63) & tr (A) \equiv \sum_{i} A_{i i} \end{matrix}$
cyclic $\tr(AB) = \tr(BA)$ linear $\tr(A+B) = \tr(A)+\tr(B)$ $tr(zA) = z\tr(A)$ $A$ $B$ $z$ a complex number.
unitary similarity transformation $A\rightarrow UAU^\dagger$ Basis Change $\tr(UAU^\dagger) = \tr(U^\dagger UA) = \tr(A)$ $A$ any $A$ .
$\ket\psi$ unit vector $A$ $\tr(A\ket\psi\bra\psi)$ $\ket\psi$ being the first element. In that way we have:
$\begin{matrix} (64) & tr (A | ψ ⟩ ⟨ ψ |) = \sum_{i} ⟨ i | A | ψ ⟩ ⟨ ψ | i ⟩ = ⟨ ψ | A | ψ ⟩ \end{matrix}$
, which is extremely useful in evaluating the trace of an operator.

Exercise 2.36

$I$ have trace zero.

solution

$\tr(X) = \tr(Y) = \tr(Z) = 0$

Exercise 2.37

(Cyclic property of the trace) $A$ $B$ are two linear operators show that

\begin{matrix} (65) & tr (A B) = tr (B A) \end{matrix}

solution

$A_{m\times n}$ $B_{n\times m}$ , then we have:

\begin{array}{r} tr (A B) = \sum_{i = 1}^{m} (\sum_{j = 1}^{n} A_{i j} B_{j i}) = \sum_{j = 1}^{n} (\sum_{i = 1}^{m} B_{j i} A_{i j}) = tr (B A) \end{array}

Exercise 2.38

(Linearity of the trace) If A and B are two linear operators, show that

\begin{matrix} (66) & tr (A + B) = tr (A) + tr (B) \end{matrix}

$z$ is an arbitrary complex number show that

\begin{matrix} (67) & tr (z A) = z tr (A) \end{matrix}

solution

Expand the matrices in terms of element, then one can proof by simple calculation.

Exercise 2.39

(The Hilbert–Schmidt inner product on operators) $L_V$ $V$ $zA$ $A$ $z$ $0$ $L_V$ can be given a natural inner product structure, turning it into a Hilbert space.

(1) $(\cdot,\cdot)$ $L_V\times L_V$ defined by

\begin{matrix} (68) & (A, B) \equiv tr (A^{†} B) \end{matrix}

is an inner product function. This inner product is known as the Hilbert–Schmidt or trace (2) $V$ $d$ $L_V$ $d^2$ (3) $L_V$ .

solution

(1) $(\cdot,\cdot)$ $(a,b) = (b,a)^*$ $(a, a)\geq0$ $a=0$ .

\begin{matrix} \begin{matrix} (A, \sum_{i} λ_{i} B_{i}) = tr (A^{†} \sum_{i} λ_{i} B_{i}) = tr (\sum_{i} λ_{i} A^{†} B_{i}) = \sum_{i} λ_{i} tr (A^{†} B_{i}) = \sum_{i} λ_{i} (A, B_{i}) \\ (A, B)^{*} = (tr (A^{†} B))^{*} = tr ((A^{†} B)^{†}) = tr (B^{†} A) = (B, A) \\ (A, A) = tr (A^{†} A) = \sum_{i} | A_{i i} |^{2} \geq 0 \end{matrix} \end{matrix}

$(\cdot,\cdot)$ $L_V\times L_V$ $(A,B)\equiv \tr(A^\dagger B)$ is an inner product function.

(2) $V$ $d$ $L_V$ $d\times d = d^2$ $\dim(L_V) = d^2$ .

(3) 是不是可以只考慮上三角區域的element啊, 再用 Gram–Schmidt procedure 構造出正交歸一矩陣基底.

2.1.6 The commutator and anti-commutator

The commutator between two operators A and B is defined to be:
$\begin{matrix} (69) & [A, B] \equiv A B - B A \end{matrix}$
$AB=BA$ $A$ $B$ .
anti-commutator $A$ $B$ is defined by:
$\begin{matrix} (70) & {A, B} \equiv A B + B A \end{matrix}$
$A$ $B$ $\{A,B\}=0$ .

Theorem 2.2

(Simultaneous diagonalization theorem) $A$ $B$ $[A,B]=0$ $A$ $B$ $A$ $B$ are simultaneously diagonalizable in this case.

proof

$A$ $B$ $[A,B]=0$ ), so we start with the forward one.

$\ket{a, j}$ $V_a$ $A$ $a$ $j$ $[A,B]=0$ , we have:

\begin{matrix} (71) & A B | a, j ⟩ = B A | a, j ⟩ = a B | a, j ⟩ \end{matrix}

$B\ket{a,j}$ $V_a$ $P_a$ $V_a$ $B_a\equiv P_aBP_a$ $B_a$ $V_a$ $B_a$ $V_a$ $\ket{a,b,k}$ $a$ $b$ $A$ $B_a$ $k$ $B_a$ .

$B\ket{a,b,k}$ $V_a$ $B\ket{a,b,k} = P_aB\ket{a,b,k}$ $P_a\ket{a,b,k} = \ket{a,b,k}$ , so combine the two relation we have:

\begin{matrix} (72) & B | a, b, k ⟩ = P_{a} B P_{a} | a, b, k ⟩ = B_{a} | a, b, k ⟩ = b | a, b, k ⟩ \end{matrix}

$\ket{a,b,k}$ $B$ $b$ $\ket{a,b,k}$ $A$ $B$ $A$ $B$ $A$ $B$ are simultaneously diagonalizable.

Exercise 2.40

(Commutation relations for the Pauli matrices) Verify the commutation relations

\begin{matrix} (73) & [X, Y] = 2 i Z; [Y, Z] = 2 i X; [Z, X] = 2 i Y \end{matrix}

$\epsilon_{jkl}$ $\epsilon_{jkl}=0$ $\epsilon_{123} = \epsilon_{231} = \epsilon_{312}=1$ $\epsilon_{321} = \epsilon_{132} = \epsilon_{213}=-1$ :

\begin{matrix} (74) & [σ_{j}, σ_{k}] = 2 i \sum_{l = 1}^{3} ϵ_{j k l} σ_{l} \end{matrix}

solution

\begin{aligned} [X, Y] & = [\begin{array}{c} i & 0 \\ 0 & - i \end{array}] - [\begin{array}{c} - i & 0 \\ 0 & i \end{array}] = 2 i [\begin{array}{c} 1 & 0 \\ 0 & - 1 \end{array}] \\ [Y, Z] & = [\begin{array}{c} 0 & i \\ i & 0 \end{array}] - [\begin{array}{c} 0 & - i \\ - i & 0 \end{array}] = 2 i [\begin{array}{c} 0 & 1 \\ 1 & 0 \end{array}] \\ [Z, X] & = [\begin{array}{c} 0 & 1 \\ - 1 & 0 \end{array}] - [\begin{array}{c} 0 & - 1 \\ 1 & 0 \end{array}] = 2 i [\begin{array}{c} 0 & - i \\ i & 0 \end{array}] \end{aligned}

Exercise 2.41

(Anti-commutation relations for the Pauli matrices) Verify the anti-commutation relations

\begin{matrix} (75) & {σ_{i}, σ_{j}} = 0 \end{matrix}

$i\neq j$ $\{1,2,3\}$ $i\in\{0,1,2,3\}$ ,

\begin{matrix} (76) & σ_{i}^{2} = I \end{matrix}

solution

One can proof the above relation via the commutation relations from last exercise.

Exercise 2.42

Verify that

\begin{matrix} (77) & A B = \frac{[A, B] + {A, B}}{2} \end{matrix}

solution

\begin{array}{r} \frac{[A, B] + {A, B}}{2} = \frac{A B - B A + A B + B A}{2} = \frac{2 A B}{2} = A B \end{array}

Exercise 2.43

$j,k\in\{1,2,3\}$ ,

\begin{matrix} (78) & σ_{j} σ_{k} = δ_{j k} I + i \sum_{l = 1}^{3} ϵ_{j k l} σ_{l} \end{matrix}

solution

From the conclusion of previous exercise (2.41 and 2.42), we have:

\begin{array}{r} σ_{j} σ_{k} = \frac{[σ_{j}, σ_{k}] + {σ_{j}, σ_{k}}}{2} = \frac{[σ_{j}, σ_{k}] + 0}{2} = I δ_{j k} + i \sum_{l = 1}^{3} ϵ_{j k l} σ_{l} \end{array}

Exercise 2.44

$[A,B]=\{A,B\}=0$ $A$ $B$ $0$ .

solution

$AB=BA$ $AB=-BA$ $BA=-BA$ $A$ $BAA^{-1} = -BAA^{-1}\Rightarrow B=-B$ $B=0$ .

Exercise 2.45

$[A,B]^\dagger = [B^\dagger, A^\dagger]$ .

solution

\begin{array}{r} [A, B]^{†} = (A B)^{†} - (B A)^{†} = B^{†} A^{†} - A^{†} B^{†} = [B^{†}, A^{†}] \end{array}

Exercise 2.46

$[A,B] = -[B,A]$ .

solution

Proof by simply expanding the equation.

Exercise 2.47

$A$ $B$ $i[A,B]$ is also Hermitian.

solution

From the conclusions of the previous exercises (2.45 and 2.46), we have:

\begin{array}{r} (i [A, B])^{†} = - i [A, B]^{†} = - i [B^{†}, A^{†}] = - i [B, A] = i [A, B] \end{array}

$i[A,B]$ is also Hermitian.

2.1.7 The polar and singular value decompositions

The polar and singular value decompositions are useful ways of breaking linear operators up into simpler parts.
In particular, these decompositions allow us to break general linear operators up into products of unitary operators and positive operators.

Theorem 2.3

(Polar decomposition) $A$ $V$ $U$ $J$ $K$ such that

\begin{matrix} (79) & A = U J = K U \end{matrix}

unique positive $J$ $K$ $J\equiv \sqrt{A^\dagger A}$ $K = \sqrt{AA^\dagger}$ $A$ $U$ is unique.

$A=UJ$ left polar decomposition $A$ $A=KU$ right polar decomposition $A$ . Most often, we’ll omit the 'right' or 'left' nomenclature, and use the term 'polar decomposition' for both expressions, with context indicating which is meant.

proof

$A$ $A^\dagger A$ $J\equiv \sqrt{A^\dagger A}$ is also a positive operator, so it can be given a spectral decomposition:

\begin{matrix} (80) & J = \sum_{i} λ_{i} | i ⟩ ⟨ i |, (λ_{i} \geq 0) \end{matrix}

$\ket{\psi_i}\equiv A\ket i$ $\braket{\psi_i} = \lambda_i^2$ $i$ $\lambda_i\neq0$ $\ket{e_i}\equiv\ket{\psi_i}/\lambda_i$ $0 = \braket{e_i}{e_j} = \bra{i}A^\dagger A\ket j/(\lambda_i\lambda_j) = \bra{i}J^2\ket j/(\lambda_i\lambda_j)$ $i\neq j$ .

$\ket{e_i}$ $i$ $\lambda_i=0$ $U\equiv \sum_i\ket{e_i}\bra i$ so that for:

\begin{matrix} {\begin{cases} λ_{i} \neq 0, U J | i ⟩ = λ_{i} | e_{i} ⟩ = | ψ_{i} ⟩ = A | i ⟩ \\ λ_{i} = 0, U J | i ⟩ = 0 \end{cases} \end{matrix}

$UJ$ $A$ $\ket i$ $A = UJ$ .

$A$ $J$ $U$ $U = AJ^{-1}$ $A = UJ = UJU^\dagger U = KU$ $K\equiv UJU^\dagger$ $AA^\dagger=KUU^\dagger K^\dagger = KUU^\dagger K = K^2, K = \sqrt{AA^\dagger}$ .

The following singular value decomposition combines the polar decomposition + spectral theorem:

Corollary 2.4

(Singular value decomposition) $A$ square $U$ $V$ $D$ with non-negative entries such that

\begin{matrix} (81) & A = U D V \end{matrix}

$D$ singular values $A$ .

proof

$A = SJ$ $S$ positive $J$ left-polar decomposition $J = TDT^\dagger$ $T$ $D$ $D$ $J$ is positive). Combining the two relatinos we get:

\begin{matrix} (82) & A = S T D T^{†} = U D V \end{matrix}

$U\equiv ST$ $V\equiv T^\dagger$ .

Exercise 2.48

$P$ $U$ $H$ ?

solution

$P$ $P$ $P = \sum_i\lambda_i\ket i\bra i$ $\lambda_i \geq 0$ , and we have
$\begin{matrix} J = \sqrt{P^{†} P} = \sqrt{P P} = \sum_{i} \sqrt{λ_{i}^{2}} | i ⟩ ⟨ i | = P \end{matrix}$
$P = UP$ $P$ $U=I$ obviously.
$U$ $J = \sqrt{U^\dagger U} = I$ $U = UI$ $U$ .
$H$ $J = \sqrt{H^\dagger H} = \sqrt{HH} = \sqrt{H^2}$ $H = U\sqrt{H^2}$ $H$ $H^2\neq H$ $H = \sum_i\lambda_i\ket i\bra i$ $\sqrt{H^2} = \sum_i\big\vert\lambda_i\big\vert\ket i\bra i$ .

Exercise 2.49

Express the polar decomposition of a normal matrix in the outer product representation.

solution

$A$ $A=\sum_i\lambda_i\ket i\bra i$ $J$ :

\begin{matrix} (83) & J = \sqrt{A^{†} A} = \sqrt{\sum_{i} \sum_{j} λ_{j}^{*} λ_{i} | j ⟩ ⟨ j | i ⟩ ⟨ i |} = \sqrt{\sum_{i} | λ_{i} |^{2} | i ⟩ ⟨ i |} = \sum_{i} | λ_{i} | | i ⟩ ⟨ i | \end{matrix}

$A = U\sum_i\big\vert\lambda_i\big\vert\ket i\bra i$ .

Exercise 2.50

$A=\begin{bmatrix}1&0\\1&1\end{bmatrix}$ .

solution

$A^\dagger = \begin{bmatrix}1&1\\0&1\end{bmatrix},\;\;A^\dagger A = \begin{bmatrix}2&1\\1&1\end{bmatrix},\;\;AA^\dagger = \begin{bmatrix}1&1\\1&2\end{bmatrix}$

$A^\dagger A$ , its eigenvalues and associated eigenvectors are:
$\begin{matrix} {\begin{cases} λ_{+} = \frac{3 + \sqrt{5}}{2}, | λ_{+} ⟩ = \frac{1}{\sqrt{10 - 2 \sqrt{5}}} [\begin{array}{c} 2 \\ \sqrt{5} - 1 \end{array}] \\ λ_{-} = \frac{3 - \sqrt{5}}{2}, | λ_{-} ⟩ = \frac{1}{\sqrt{10 + 2 \sqrt{5}}} [\begin{array}{c} 2 \\ - \sqrt{5} - 1 \end{array}] \end{cases} \end{matrix}$
, then we can calculate:
$\begin{matrix} {\begin{cases} | λ_{+} ⟩ ⟨ λ_{+} | = \frac{1}{5 - \sqrt{5}} [\begin{array}{c} 2 & \sqrt{5} - 1 \\ \sqrt{5} - 1 & 3 - \sqrt{5} \end{array}] \\ | λ_{-} ⟩ ⟨ λ_{-} | = \frac{1}{5 + \sqrt{5}} [\begin{array}{c} 2 & - \sqrt{5} - 1 \\ - \sqrt{5} - 1 & 3 + \sqrt{5} \end{array}] \end{cases} \end{matrix}$
, and notice that:
$\begin{matrix} {\begin{cases} \sqrt{λ_{+}} = \frac{\sqrt{5} + 1}{2}, \frac{1}{\sqrt{λ_{+}}} = \frac{\sqrt{5} - 1}{2} \\ \sqrt{λ_{-}} = \frac{\sqrt{5} - 1}{2}, \frac{1}{\sqrt{λ_{-}}} = \frac{\sqrt{5} + 1}{2} \end{cases} \end{matrix}$
$J$ :
$\begin{matrix} \begin{matrix} J = \sqrt{A^{†} A} = \sqrt{λ_{+}} | λ_{+} ⟩ ⟨ λ_{+} | + \sqrt{λ_{-}} | λ_{-} ⟩ ⟨ λ_{-} | = \frac{1}{\sqrt{5}} [\begin{matrix} 3 & 1 \\ 1 & 2 \end{matrix}] \end{matrix} \end{matrix}$
$J^{-1}$ :
$\begin{matrix} \begin{matrix} J^{- 1} = \frac{1}{\sqrt{5}} [\begin{matrix} 2 & - 1 \\ - 1 & 3 \end{matrix}] \end{matrix} \end{matrix}$
$U=AJ^{-1}$
$\begin{matrix} \begin{matrix} U = A J^{- 1} = \frac{1}{\sqrt{5}} [\begin{matrix} 1 & 0 \\ 1 & 1 \end{matrix}] [\begin{matrix} 2 & - 1 \\ - 1 & 3 \end{matrix}] = \frac{1}{\sqrt{5}} [\begin{matrix} 2 & - 1 \\ 1 & 2 \end{matrix}] \end{matrix} \end{matrix}$
left polar decomposition $A$ is:
$\begin{matrix} \begin{matrix} A = [\begin{matrix} 1 & 0 \\ 1 & 1 \end{matrix}] = U J = (\frac{1}{\sqrt{5}} [\begin{matrix} 2 & - 1 \\ 1 & 2 \end{matrix}]) (\frac{1}{\sqrt{5}} [\begin{matrix} 3 & 1 \\ 1 & 2 \end{matrix}]) \end{matrix} \end{matrix}$
$AA^\dagger$ case, or the right polar decomposition, perform similar process we get:
$\begin{matrix} \begin{matrix} A = [\begin{matrix} 1 & 0 \\ 1 & 1 \end{matrix}] = K U = (\frac{1}{\sqrt{5}} [\begin{matrix} 2 & 1 \\ 1 & 3 \end{matrix}]) (\frac{1}{\sqrt{5}} [\begin{matrix} 2 & - 1 \\ 1 & 2 \end{matrix}]) \end{matrix} \end{matrix}$

Tedious work, but still worth a try!

2.2 The postulates of quantum mechanics

2.2.1 State space & Evolution

The first postulate of quantum mechanics sets up the arena in which quantum mechanics takes place.

Postulate 1

Associated to any isolated physical system is a complex vector space with inner product (Hilbert space) known as the state space of the system. The system is completely described by its state vector, which is a unit vector in the system’s state space.

$\ket0$ $\ket1$ form an orthonormal basis for that state space, then an arbitrary state vector in that space can be written
$\begin{matrix} (84) & | ψ ⟩ = a | 0 ⟩ + b | 1 ⟩ \end{matrix}$
$a$ $b$ are complex numbers.
$\ket\psi$ $\braket{\psi} = 1$ $\vert a\vert^2 + \vert b\vert^2 = 1$ , often called normalization condition for state vectors.

$\ket\psi$ of a quantum mechanical system change with time, we have the second postulate:

Postulate 2

The evolution of a closedunitary transformation $t_1\rightarrow\ket\psi$ $t_2\rightarrow \ket{\psi'}$ $U$ $t_1$ $t_2$ :

\begin{matrix} (85) & | ψ^{'} ⟩ = U | ψ ⟩ \end{matrix}

$U$ $U$ 該長哪種樣子.
Some common and important unitary operators on a single qubit:
- $X$ matrix: quantum NOT gate, bit flip matrix.
- $Z$ matrix: phase flip matrix.
- Hadamard gate.

Exercise 2.51

$H$ is unitary.

solution

\begin{matrix} \begin{matrix} H H^{†} = H^{†} H = \frac{1}{2} [\begin{matrix} 1 & 1 \\ 1 & - 1 \end{matrix}] [\begin{matrix} 1 & 1 \\ 1 & - 1 \end{matrix}] = \frac{1}{2} [\begin{matrix} 2 & 0 \\ 0 & 2 \end{matrix}] = I \end{matrix} \end{matrix}

Exercise 2.52

$H^2 = I$ .

solution

\begin{matrix} \begin{matrix} H^{2} = \frac{1}{2} [\begin{matrix} 1 & 1 \\ 1 & - 1 \end{matrix}] [\begin{matrix} 1 & 1 \\ 1 & - 1 \end{matrix}] = \frac{1}{2} [\begin{matrix} 2 & 0 \\ 0 & 2 \end{matrix}] = I \end{matrix} \end{matrix}

Exercise 2.53

$H$ ?

solution

$\lambda^2-1=0$ $\lambda=\pm1$ .

\begin{matrix} {\begin{cases} λ = - 1, \frac{1}{\sqrt{2}} [\begin{array}{c} 1 & 1 \\ 1 & - 1 \end{array}] [\begin{array}{c} a \\ b \end{array}] = - [\begin{array}{c} a \\ b \end{array}], | λ_{- 1} ⟩ = \frac{1}{\sqrt{4 + 2 \sqrt{2}}} [\begin{array}{c} 1 \\ - 1 - \sqrt{2} \end{array}] \\ λ = + 1, \frac{1}{\sqrt{2}} [\begin{array}{c} 1 & 1 \\ 1 & - 1 \end{array}] [\begin{array}{c} a \\ b \end{array}] = + [\begin{array}{c} a \\ b \end{array}], | λ_{+ 1} ⟩ = \frac{1}{\sqrt{4 - 2 \sqrt{2}}} [\begin{array}{c} 1 \\ - 1 + \sqrt{2} \end{array}] \end{cases} \end{matrix}

Actually we have a revised version of postulate 2 for continuous time case:

Postulate 2 (refined)

The time evolution of the state of a closed quantum system is described by the Schrödinger equation,

\begin{matrix} (86) & i ℏ \frac{d | ψ ⟩}{d t} = H | ψ ⟩ \end{matrix}

$\hbar$ $H$ is the Hamiltonian (not Hadamard matrix) of the closed system.

The above refined postulate implies that uf we know the Hamiltonian of a system, then we understand its dynamics completely. But the question is, figuring out the Hamiltonian of the system is difficult even for today's physicists.
The spectral decomposition of Hamiltonian (因為它是Hermitian所以是normal的)
$\begin{matrix} (87) & H = \sum_{E} E | E ⟩ ⟨ E | \end{matrix}$
$\ket E$ be the energy eigenstatesstationary states $E$ the energy of that state. The lowest energy is known as the ground state energy for the system, and the corresponding energy eigenstate (or eigenspace) is known as the ground state.
$H$ are called stationary states is that the effect of time evolution on them only acquires a numerical factor:
$\begin{matrix} (88) & | E ⟩ \to e^{- i E t / ℏ} | E ⟩ . \end{matrix}$
驗證 postulate 2&2' 是等價的: First write down the solution of the Schrödinger equation,
$\begin{matrix} (89) & | ψ (t_{2}) ⟩ = \exp [\frac{- i H (t_{2} - t_{1})}{ℏ}] | ψ (t_{1}) ⟩ = U (t_{1}, t_{2}) | ψ (t_{1}) ⟩, where U (t_{1}, t_{2}) \equiv \exp [\frac{- i H (t_{2} - t_{1})}{ℏ}] \end{matrix}$
$U$ $U = e^{iK}$ $K$ .
$\rightarrow$ $\rightarrow$ continuous time description.

Exercise 2.54

$A$ $B$ $e^Ae^B = e^{A+B}$ .

solution

Theorem 2.2 $[A,B]=0$ an $A$ $B$ $A = \sum_ia_i\ket i\bra i$ $B = \sum_jb_j\ket j\bra j$ $A+B = \sum_k(a_k+b_k)\ket k\bra k$ .

\begin{aligned} e^{A} e^{B} & = \sum_{i} \sum_{j} e^{a_{i}} e^{b_{j}} | i ⟩ ⟨ i | j ⟩ ⟨ j | \\ = \sum_{i} \sum_{j} e^{a_{i} + b_{j}} δ_{i j} | i ⟩ ⟨ j | \\ = \sum_{i} e^{a_{i} + b_{i}} | i ⟩ ⟨ i | \\ = \sum_{k} e^{a_{k} + b_{k}} | k ⟩ ⟨ k | = e^{A + B} \end{aligned}

Exercise 2.55

$U(t_1, t_2)$ defined here is unitary.

solution

$H = \sum_EE\ket E\bra E$ , therefore we have:

\begin{aligned} U (t_{1}, t_{2}) U^{†} (t_{1}, t_{2}) & = \exp [\frac{- i H (t_{2} - t_{1})}{ℏ}] \exp [\frac{i H (t_{2} - t_{1})}{ℏ}] \\ = (\sum_{E} \exp [\frac{- i E (t_{2} - t_{1})}{ℏ}] | E ⟩ ⟨ E |) (\sum_{E^{'}} \exp [\frac{i E^{'} (t_{2} - t_{1})}{ℏ}] | E^{'} ⟩ ⟨ E^{'} |) \\ = \sum_{E} \sum_{E^{'}} \exp [\frac{i (E^{'} - E) (t_{2} - t_{1})}{ℏ}] | E ⟩ δ_{E E^{'}} ⟨ E^{'} | \\ = \sum_{E} \exp (0) | E ⟩ ⟨ E | = I \end{aligned}

$U^\dagger(t_1,t_2)U(t_1,t_2)= I$ , hence the operator is unitary.

Exercise 2.56

$K\equiv -i\log(U)$ $U$ $U=\exp(iK)$ $K$ .

solution

$U$ $e^{i\theta}$ $\theta$ .

\begin{matrix} K \equiv - i \log (U) = - i \sum_{k} \log (k) | k ⟩ ⟨ k | = - i \sum_{k} i θ_{k} | k ⟩ ⟨ k | = \sum_{k} θ_{k} | k ⟩ ⟨ k | \end{matrix}

$\theta_k\in\mathbb R$ $k$ $\theta_k^* = \theta_k$ $K$ is Hermitian.

Sometimes we may describe a quantum system which is not closed using a time-varying Hamiltonian, and it indeed serve as a good approximation to a closed system, that way we can apply unitary operators on the system withoout feeling too guilty.

2.2.2 Quantum measurement

When we make a measurement to a closed system, the interaction with the system is sufficient to make it not-closed, hence cannot be strictly described by a unitary evolution, therefore how do we compensate to this?

Postulate 3

$\{M_m\}$ measurement operators $m$ $\ket\psi$ $m$ occurs is given by

\begin{matrix} (90) & p (m) = ⟨ ψ | M_{m}^{†} M_{m} | ψ ⟩ \end{matrix}

, and the state of the system after the measurement is

\begin{matrix} (91) & \frac{M_{m} | ψ ⟩}{\sqrt{⟨ ψ | M_{m}^{†} M_{m} | ψ ⟩}} \end{matrix}

. The measurement operators satisfy the completeness equation,

\begin{matrix} (92) & \sum_{m} M_{m}^{†} M_{m} = I \end{matrix}

$1=\sum_m p(m) = \sum_m\expval{M_m^\dagger M_m}{\psi}$ .

the measurement of a qubit in the computational basis $M_0 = \ket0\bra0$ $M_1 = \ket1\bra1$ . Notice that the completeness equation is obeyed.
- $\ket\psi = a\ket0+b\ket1$ $0$ $p(0) = \expval{M_0^\dagger M_0}{\psi} = \expval{M_0}{\psi} = \vert a\vert^2$ $1$ $\vert b\vert^2$ $\dfrac{M_0\ket\psi}{\vert a\vert} = \dfrac{a}{\vert a\vert}\ket0$ $\dfrac{M_1\ket\psi}{\vert b\vert} = \dfrac{b}{\vert b\vert}\ket1$ .
- $1$ $a/\vert a\vert$ $b/\vert b\vert$ $\ket0$ $\ket1$ .

Exercise 2.57

(Cascaded measurements are single measurements) $\{L_l\}$ $\{M_m\}$ $\{L_l\}$ $\{M_m\}$ $\{N_{lm}\}$ $N_{lm}\equiv M_mL_l$ .

solution

It is essential to prove the post-measurement states for two ways are equivalent.

$\{L_l\}$ $\{M_m\}$ $\ket{\psi_1}$ $\ket{\psi_2}$ are:
$\begin{aligned} | ψ_{1} ⟩ & = \frac{L_{l} | ψ ⟩}{\sqrt{⟨ ψ | L_{l}^{†} L_{l} | ψ ⟩}} \\ | ψ_{2} ⟩ & = \frac{M_{m} | ψ_{1} ⟩}{\sqrt{⟨ ψ_{1} | M_{m}^{†} M_{m} | ψ_{1} ⟩}} \\ = \frac{M_{m} L_{l} | ψ ⟩}{\sqrt{⟨ ψ | L_{l}^{†} L_{l} | ψ ⟩}} \frac{\sqrt{⟨ ψ | L_{l}^{†} L_{l} | ψ ⟩}}{\sqrt{⟨ ψ | L_{l}^{†} M_{m}^{†} M_{m} L_{l} | ψ ⟩}} \\ = \frac{M_{m} L_{l} | ψ ⟩}{\sqrt{⟨ ψ | L_{l}^{†} M_{m}^{†} M_{m} L_{l} | ψ ⟩}} \end{aligned}$
$N_{lm}$ measurement instead, we have:
$\begin{aligned} | ψ_{3} ⟩ & = \frac{N_{l m} | ψ ⟩}{\sqrt{⟨ ψ | N_{l m}^{†} N_{l m} | ψ ⟩}} \\ = \frac{M_{m} L_{l} | ψ ⟩}{\sqrt{⟨ ψ | L_{l}^{†} M_{m}^{†} M_{m} L_{l} | ψ ⟩}} \end{aligned}$
, which is identical to the first scenario.

2.2.3 Distinguishing quantum states

In chapter 1 we've discussed that non-orthogonal quantum states cannot be reliably distinguished. With postulate 3 as a firm foundation we can now give a much more convincing demonstration of this fact.
$\{\ket{\psi_i}, 1\leq i\leq n\}$ $i$ of the state?
- $\ket{\psi_i}$ $M_i\equiv \ket i\bra i$ $i$ $M_0$ $M_0 = \sqrt{I-\sum_{i\neq0}\ket{\psi_i}\bra{\psi_i}}$ $\ket{\psi_i}$ $p(i)=1$ .
- $\ket{\psi_i}$ $j$ $\ket{\psi_1}$ $\ket{\psi_2}$ $\ket{\psi_1}$ $j$ , which ultimately leads to wrong decision according to Bob's rule. Now let's see a more rigorous proof below.

✏️ Proof that non-orthogonal states can’t be reliably distinguished
$\ket{\psi_1}$ $\ket{\psi_2}$ $E_i\equiv \sum_{j:f(j)=i}M_j^\dagger M_j$ $j_1$ $j_2$ $i$ ), then these observations may be written as:
$\begin{matrix} (93) & ⟨ ψ_{1} | E_{1} | ψ_{1} ⟩ = 1; ⟨ ψ_{2} | E_{2} | ψ_{2} ⟩ = 1 \end{matrix}$
$\sum_iE_i = I$ $E$ $M$ $\sum_i\expval{E_i}{\psi_1} = 1$ $1$ $E_1$ $\expval{E_2}{\psi_1} = 0\;\Rightarrow \sqrt{E_2}\ket{\psi_1}=0$ .
$\ket{\psi_2}$ $\ket{\psi_1}$ $\ket{\psi_2} = \alpha\ket{\psi_1} + \beta\ket{\varphi}$ $\sqrt{E_2}\ket{\psi_2}=\beta\sqrt{E_2}\ket\varphi$ $\ket{\psi_1}$ $\ket{\psi_2}$ $\vert\beta\vert<1$ , thus implies:
$\begin{matrix} (94) & 1 = ⟨ ψ_{2} | E_{2} | ψ_{2} ⟩ = | β |^{2} ⟨ φ | E_{2} | φ ⟩ \leq | β |^{2} \leq 1 \end{matrix}$
$\leq$ $\expval{E_2}{\varphi}\leq \sum_i\expval{E_i}{\varphi}=\expval{I}{\varphi}=1$ ).

2.2.4 Projective measurements

observable $M$ , a Hermitian operator on the state space of the system being observed. The obersvable has a spectral decomposition,
$\begin{matrix} (95) & M = \sum_{m} m P_{m} \end{matrix}$
$P_m$ $M$ $m$ $M$ $m$ )
$\ket\psi$ $m$ is
$\begin{matrix} (96) & p (m) = ⟨ ψ | P_{m} | ψ ⟩ \end{matrix}$
$m$ was really measured, the state immediately after the measurement is
$\begin{matrix} (97) & \frac{P_{m} | ψ ⟩}{\sqrt{p (m)}} \end{matrix}$
$\sum_mM^\dagger_mM_m = I$ $M_m$ ) must be orthogonal projectors, that is,
- $M_m$ are Hermitian, and
- $M_mM_{m'} = \delta_{mm'}M_m$
the postulate 3 will reduce to equivalent with projective measurement.
Nice properties of projective measurements:
- Easy to calculate average values:
  $\begin{matrix} (98) & E (M) = \sum_{m} m \cdot p (m) = \sum_{m} m ⟨ ψ | P_{m} | ψ ⟩ = ⟨ ψ | (\sum_{m} m P_{m}) | ψ ⟩ = ⟨ ψ | M | ψ ⟩ \equiv ⟨ M ⟩ \end{matrix}$
- Able to calculate standard deviation:
  $\begin{matrix} (99) & Δ M = \sqrt{⟨ (M - ⟨ M ⟩)^{2} ⟩} = \sqrt{⟨ M^{2} ⟩ - ⟨ M ⟩^{2}} \end{matrix}$
- These formulation of measurement and standard deviations in terms of observables gives rise in an elegant way to results such as the Heisenberg uncertainty principle.

Exercise 2.58

$\ket\psi$ $M$ $m$ $M$ , and the standard deviation?

solution

\begin{aligned} (Δ M)^{2} & = ⟨ M^{2} ⟩ - ⟨ M ⟩^{2} \\ = ⟨ ψ | (\sum_{m, m^{'}} m m^{'} P_{m} P_{m^{'}}) | ψ ⟩ - ⟨ ψ | M {| ψ ⟩}^{2} \\ = m ⟨ ψ | (\sum_{m} m P_{m}) | ψ ⟩ - m^{2} \\ = m^{2} - m^{2} = 0 \end{aligned}

$\langle M \rangle=m$ .

✏️ The Heisenberg uncertainty principle
$A$ $B$ are two Hermitian operators, from simple derivation we have:
$\begin{matrix} (100) & | ⟨ ψ | [A, B] | ψ ⟩ |^{2} + | ⟨ ψ | {A, B} | ψ ⟩ |^{2} = 4 | ⟨ ψ | A B | ψ ⟩ |^{2} \Rightarrow | ⟨ ψ | [A, B] | ψ ⟩ |^{2} \leq 4 | ⟨ ψ | A B | ψ ⟩ |^{2} \end{matrix}$
$\expval{AB}{\psi} = x+iy$ $x, y\in\mathbb R$ $\expval{[A,B]}{\psi} = 2iy$ $\expval{\{A,B\}}{\psi} = 2x$ , therefore it follows the relation.)
$4\Big\vert \expval{AB}{\psi} \Big\vert^2$ in the above equation:
$\begin{matrix} (101) & | ⟨ ψ | A B | ψ ⟩ |^{2} \leq ⟨ ψ | A^{2} | ψ ⟩ ⟨ ψ | B^{2} | ψ ⟩ \end{matrix}$
Hence we have:
$\begin{matrix} (102) & | ⟨ ψ | [A, B] | ψ ⟩ |^{2} \leq 4 ⟨ ψ | A^{2} | ψ ⟩ ⟨ ψ | B^{2} | ψ ⟩ \end{matrix}$
$A$ $B$ $A = C-\langle C\rangle$ $B = D-\langle D\rangle$ $C$ $D$ are two observables, which makes:
$\begin{aligned} ⟨ ψ | [A, B] | ψ ⟩ & = ⟨ ψ | C D - C ⟨ D ⟩ - ⟨ C ⟩ D + ⟨ C ⟩ ⟨ D ⟩ - D C + D ⟨ C ⟩ + ⟨ D ⟩ C - ⟨ D ⟩ ⟨ C ⟩ | ψ ⟩ = ⟨ ψ | [C, D] | ψ ⟩ \\ ⟨ ψ | A^{2} | ψ ⟩ & = ⟨ ψ | C^{2} | ψ ⟩ - ⟨ ψ | C {| ψ ⟩}^{2} = (Δ C)^{2} \\ ⟨ ψ | B^{2} | ψ ⟩ & = ⟨ ψ | D^{2} | ψ ⟩ - ⟨ ψ | D {| ψ ⟩}^{2} = (Δ D)^{2} \end{aligned}$
, and will ultimate leads to the most used form of Heisenberg uncertainty principle:
$\begin{matrix} (103) & Δ C Δ D \geq \frac{| ⟨ ψ | [C, D] | ψ ⟩ |}{2} \end{matrix}$
$C$ $D$ $\ket\psi$ $C$ $D$ , 那麼兩者的標準差將符合上式規範.)
$X$ $Y$ $\ket0$ , the uncertainty principle tells us that,
$\begin{matrix} Δ X Δ Y \geq \frac{| ⟨ 0 | 2 i Z | 0 ⟩ |}{2} = 1 \end{matrix}$
$\Delta X$ $\Delta Y$ $0$ , as can be verified by direct calculation.

Two widely used nomenclatures for measurements deserve emphasis:
- $P_m$ $\sum_mP_m = I$ $P_mP_{m'} = \delta_{mm'}P_m$ $M = \sum_mmP_m$ .
- $\ket m$ $\ket m$ $P_m = \ket m\bra m$ .
$Z$ $\ket\psi = \frac{\ket0+\ket1}{\sqrt2}$ $+1$ $-1$ $Z$ $\lambda_\pm = \pm1$ $+1$ with probability
$\begin{matrix} ⟨ ψ | P_{1} | ψ ⟩ = ⟨ ψ | λ_{+ 1} ⟩ ⟨ λ_{+ 1} | ψ ⟩ = ⟨ ψ | P_{1} | ψ ⟩ = ⟨ ψ | 0 ⟩ ⟨ 0 | ψ ⟩ = \frac{1}{2} \end{matrix}$
$-1$ $50\%$ probability.
$\vec v$ is any real three-dimensional unit vector. Then we can define an observable:
$\begin{matrix} (104) & \vec{v} \cdot \vec{σ} \equiv v_{1} σ_{1} + v_{2} σ_{2} + v_{3} σ_{3} \end{matrix}$
. Measurement of this observable is sometimes referred to as $\vec v$ axis.

Exercise 2.59

$\ket0$ $X$ $X$ ?

solution

\begin{aligned} ⟨ X ⟩ & = ⟨ 0 | X | 0 ⟩ = [\begin{array}{c} 1 \\ 0 \end{array}] [\begin{array}{c} 0 & 1 \\ 1 & 0 \end{array}] [\begin{array}{c} 1 \\ 0 \end{array}] = 0 \\ Δ (X)^{2} & = ⟨ 0 | X^{2} | 0 ⟩ - 0^{2} = [\begin{array}{c} 1 \\ 0 \end{array}] [\begin{array}{c} 1 & 0 \\ 0 & 1 \end{array}] [\begin{array}{c} 1 \\ 0 \end{array}] = 1 \\ \Rightarrow Δ (X) = 1 \end{aligned}

Exercise 2.60

$\vec v\cdot \vec\sigma$ $\pm1$ $P_\pm = \dfrac{I\pm\vec v\cdot \vec\sigma}{2}$ .

solution

Exercise 2.35 $\pm1$ . Now we calculate their eigenvectors:

$\lambda = 1$ :
$\begin{aligned} [\begin{array}{c} v_{3} - 1 & v_{1} - i v_{2} \\ v_{1} + i v_{2} & - v_{3} - 1 \end{array}] [\begin{array}{c} a \\ b \end{array}] = [\begin{array}{c} 0 \\ 0 \end{array}] \Rightarrow | λ_{+} ⟩ = [\begin{array}{c} a \\ b \end{array}] = \frac{1}{\sqrt{2 (1 - v_{3})}} [\begin{array}{c} v_{1} - i v_{2} \\ 1 - v_{3} \end{array}] \end{aligned}$
, then we can calculate projector:
$\begin{aligned} P_{+} = | λ_{+} ⟩ ⟨ λ_{+} | & = \frac{1}{2 (1 - v_{3})} [\begin{array}{c} v_{1} - i v_{2} \\ 1 - v_{3} \end{array}] [\begin{array}{c} v_{1} + i v_{2} & 1 - v_{3} \end{array}] \\ = \frac{1}{2 (1 - v_{3})} [\begin{array}{c} 1 - v_{3}^{2} & (v_{1} - i v_{2}) (1 - v_{3}) \\ (v_{1} + i v_{2}) (1 - v_{3}) & (1 - v_{3})^{2} \end{array}] \\ = \frac{1}{2} [\begin{array}{c} 1 + v_{3} & v_{1} - i v_{2} \\ v_{1} + i v_{2} & 1 - v_{3} \end{array}] \\ = \frac{1}{2} ([\begin{array}{c} 1 & 0 \\ 0 & 1 \end{array}] + [\begin{array}{c} v_{3} & v_{1} - i v_{2} \\ v_{1} + i v_{2} & - v_{3} \end{array}]) = \frac{1}{2} (I + \vec{v} \cdot \vec{σ}) \end{aligned}$
$\lambda = -1$ :
$\begin{aligned} [\begin{array}{c} v_{3} + 1 & v_{1} - i v_{2} \\ v_{1} + i v_{2} & - v_{3} + 1 \end{array}] [\begin{array}{c} a \\ b \end{array}] = [\begin{array}{c} 0 \\ 0 \end{array}] \Rightarrow | λ_{-} ⟩ = [\begin{array}{c} a \\ b \end{array}] = \frac{1}{\sqrt{2 (1 + v_{3})}} [\begin{array}{c} v_{1} - i v_{2} \\ - 1 - v_{3} \end{array}] \end{aligned}$
, then we can calculate projector:
$\begin{aligned} P_{-} = | λ_{-} ⟩ ⟨ λ_{-} | & = \frac{1}{2 (1 + v_{3})} [\begin{array}{c} v_{1} - i v_{2} \\ - 1 - v_{3} \end{array}] [\begin{array}{c} v_{1} + i v_{2} & - 1 - v_{3} \end{array}] \\ = \frac{1}{2 (1 + v_{3})} [\begin{array}{c} 1 - v_{3}^{2} & - (v_{1} - i v_{2}) (1 + v_{3}) \\ - (v_{1} + i v_{2}) (1 + v_{3}) & (1 + v_{3})^{2} \end{array}] \\ = \frac{1}{2} [\begin{array}{c} 1 - v_{3} & - (v_{1} - i v_{2}) \\ - (v_{1} + i v_{2}) & 1 + v_{3} \end{array}] \\ = \frac{1}{2} ([\begin{array}{c} 1 & 0 \\ 0 & 1 \end{array}] - [\begin{array}{c} v_{3} & v_{1} - i v_{2} \\ v_{1} + i v_{2} & - v_{3} \end{array}]) = \frac{1}{2} (I - \vec{v} \cdot \vec{σ}) \end{aligned}$

$P_\pm = \dfrac{I\pm\vec v\cdot \vec\sigma}{2}$ .

Exercise 2.61

$+1$ $\vec v\cdot \vec\sigma$ $\ket0$ $+1$ is obtained?

solution

\begin{aligned} p (+ 1) = ⟨ 0 | P_{+} | 0 ⟩ & = ⟨ 0 | \frac{I + \vec{v} \cdot \vec{σ}}{2} | 0 ⟩ \\ = \frac{1}{2} + \frac{v_{3}}{2} = \frac{1 + v_{3}}{2} \end{aligned}

, and the post-measurement state is

\begin{aligned} \frac{P_{+} | 0 ⟩}{\sqrt{p (+ 1)}} = \frac{1}{\sqrt{\frac{1 + v_{3}}{2}}} \cdot \frac{1}{2} [\begin{array}{c} 1 + v_{3} \\ v_{1} + i v_{2} \end{array}] = \frac{1}{\sqrt{2 (1 + v_{3})}} [\begin{array}{c} 1 + v_{3} \\ v_{1} + i v_{2} \end{array}] & = \sqrt{\frac{1 + v_{3}}{2}} [\begin{array}{c} 1 \\ \frac{v_{1} + i v_{2}}{1 + v_{3}} \end{array}] \\ = \sqrt{\frac{1 + v_{3}}{2}} \frac{1}{v_{1} - i v_{2}} [\begin{array}{c} v_{1} - i v_{2} \\ 1 - v_{3} \end{array}] \\ = \sqrt{\frac{1 + v_{3}}{2}} \frac{1}{\sqrt{1 - v_{3}^{2}}} [\begin{array}{c} v_{1} - i v_{2} \\ 1 - v_{3} \end{array}] (這步驟其實不太數學) \\ = \frac{1}{\sqrt{2 (1 - v_{3})}} [\begin{array}{c} v_{1} - i v_{2} \\ 1 - v_{3} \end{array}] = | λ_{+} ⟩ \end{aligned}

2.2.5 POVM measurements

Postulate 3 gives us two core values:
- a rule describing the measurement statistics (即給出測量到各種 outcome 的機率是多少).
- a rule describing the post-measurement state of the system.
But sometimes, we just don't care about the post-measurement state of the system. All we want to know is what we've got for the measurement exactly. In such instances there is a mathematical tool known as the POVM formalism which is especially well adapted to the analysis of the measurements.
POVM stands for Positive Operator-Valued Measure. (先不管這名字是怎麼來的)
- $M_m$ $\ket\psi$ $m$ $p(m) = \expval{M_m^\dagger M_m}{\psi}$ .
- $E\equiv M_m^\dagger M_m$ .
- $E$ $\sum_mE_m = I$ $p(m) = \expval{E_m}{\psi}$ .
- $E_m$ POVM elements $\{E_m\}$ is POVM.
$P_m$ $P_m$ $P_mP_{m'}= \delta_{mm'}P_m$ $\sum_mP_m = I$ $E_m\equiv P_m^\dagger P_m = P_m$ .

✏️ General measurements, projective measurements, and POVMs
為什麼先學 general measurements, 再學 projective measurements 或 POVM? 一般物理學家都直接從 projective measurement 開始, 但 QCQI 講求對量子系統的精確控制, 因此相較於一般只能粗略測量的系統, 從 general measurement 開始介紹會比較合適.
General measurement 又有幾個優點:
$P_iP_j = \delta_{ij}P_i$ 這種規定.
There are important problems in QCQI (such as the optimal way to distinguish a set of quantum states) the answer to which involves a general measurement, rather than a projective measurement.
$P_m$ $P_m$ , 並不會改變系統的 state. 所以這時候勢必要引入 general measurement formalism 來描述測量過程.
Where do POVMs fit in this picture? POVMs are best viewed as a special case of the general measurement formalism, providing the simplest means by which one can study general measurement statistics, without the necessity for knowing the post-measurement state. They are a mathematical convenience that sometimes gives extra insight into quantum measurements.

Exercise 2.62

Show that any measurement where the measurement operators and the POVM elements coincide is a projective measurement.

solution

$M_m$ , writing down the statement mathematically we have

\begin{matrix} E_{m} = M_{m}^{†} M_{m} = M_{m} \end{matrix}

Exercise 2.25 $M_m$ $M_m^\dagger M_m = M_m^2 = M_m$ $M_m$ are projective operators.

$\{E_m\}$ $\sum_mE_m = I$ $M_m$ $\{E_m\}$ .
- $M_m\equiv\sqrt{E_m}$ $\sum_mM_m^\dagger M_m = I$ .
- $\{E_m\}$ such that:
  - $E_m$ is positive.
  - $\sum_mE_m = I$ is obeyed (probabilities sum to one).
- $\{E_m\}$ $m$ $p(m) = \expval{E_m}{\psi}$ .
到現在為止還看不出 POVM 能發揮什麼鳥用, 所以來舉個例子. Suppose Alice gives Bob a qubit prepared in one of two states:
$\begin{matrix} | ψ_{1} ⟩ = | 0 ⟩, | ψ_{2} ⟩ = \frac{| 0 ⟩ + | 1 ⟩}{\sqrt{2}} \end{matrix}$
$\ket\psi$ with reliability, it is possible for him to perform a measurement which distinguishes the states some of the time, but never makes an error of mis-identification:
- Consider a POVM containing three elements:
  $\begin{aligned} (105) & E_{1} & \equiv \frac{\sqrt{2}}{1 + \sqrt{2}} | 1 ⟩ ⟨ 1 | \\ (106) & E_{2} & \equiv \frac{\sqrt{2}}{1 + \sqrt{2}} \frac{(| 0 ⟩ - | 1 ⟩) (⟨ 0 | - ⟨ 1 |)}{2} \\ (107) & E_{3} & \equiv I - E_{1} - E_{2} \end{aligned}$
  , you can verify they satisfy positive operators and completeness relation, therefore form a legitimate POVM.
- $\ket{\psi_1} = \ket0$ $\{E_1, E_2, E_3\}$ :
  - $E_1$ $E_1$ $\ket{\psi_2}$ )
- $\ket{\psi_2} = \frac{\ket0+\ket1}{\sqrt2}$ $\{E_1, E_2, E_3\}$ :
  - $E_2$ $E_2$ 的設計很巧妙吧!!)
- Therefore he can perform the measurement with the rule in mind:
  $\begin{matrix} {\begin{cases} outcome: E_{1} \Rightarrow he received | ψ_{2} ⟩ \\ outcome: E_{2} \Rightarrow he received | ψ_{1} ⟩ \\ outcome: E_{3} \Rightarrow he have no idea which state he received \end{cases} \end{matrix}$
  , 這種永遠不會出錯的代價就是 Bob 有時候做的測量會給出0資訊.

Exercise 2.63

$M_m$ $U_m$ $M_m = U_m\sqrt{E_m}$ $E_m$ is the POVM associated to the measurement.

solution

Theorem 2.3 $A = UJ,\;J\equiv\sqrt{A^\dagger A}$ $M_m = U_m\sqrt{M_m^\dagger M_m} = U_m\sqrt{E_m}$ .

Exercise 2.64

$\ket{\psi_1},\cdots,\ket{\psi_m}$ $\{E_1, E_2, \cdots, E_{m+1}\}$ $E_i$ $1\leq i\leq m$ $\ket{\psi_i}$ $\expval{E_i}{\psi_i}>0$ $i$ .)

solution

$i$ $i$ $0$ $E_i$ $i$ 的分量, 即:

\begin{matrix} (108) & E_{i} = A | ψ_{i}^{'} ⟩ ⟨ ψ_{i}^{'} | \end{matrix}

, where

\begin{matrix} (109) & | ψ_{i}^{'} ⟩ = | ψ_{i} ⟩ - \sum_{j = 1, j \neq i}^{m} \frac{⟨ ψ_{i} | ψ_{j} ⟩ | ψ_{j} ⟩}{‖ | ψ_{j} ⟩ ‖^{2}} \end{matrix}

$A$ $E_{m+1} = I-\sum_{i=1}^mE_i$ .

2.2.6 Phase & Composite systems

$e^{i\theta}\ket\psi$ $\ket\psi$ global phase factor $e^{i\theta}$ .
The statistics of measurement $\expval{e^{-i\theta}M_m^\dagger M_me^{i\theta}}{\psi} = \expval{M_m^\dagger M_m}{\psi}$ , therefore from an observational point of view these two states are identical.
Relative phase $\frac{\ket0+\ket1}{\sqrt2}$ $\frac{\ket0-\ket1}{\sqrt2}$ $\ket0$ $\ket1$ $-1$ .
- the relative phase is a basis-dependent concept unlike global phase.
- this give rise to physically observable differences in measurement statistics.

Exercise 2.65

$\frac{\ket0+\ket1}{\sqrt2}$ $\frac{\ket0-\ket1}{\sqrt2}$ in a basis in which they are not the same up to a relative phase shift.

solution

$\ket+\equiv\frac{\ket0+\ket1}{\sqrt2}$ $\ket-\equiv\frac{\ket0-\ket1}{\sqrt2}$ to see the difference.

How should we describe states of the composite system? Therefore a postulate 4 is needed:

Postulate 4

$1$ $n$ $i$ $\ket{\psi_i}$ $\ket{\psi_1}\otimes\ket{\psi_2}\otimes\cdots\otimes\ket{\psi_n}$ .

$X_2$ $\sigma_x$ operator acting on the second qubit.

Exercise 2.66

$X_1Z_2$ $\frac{\ket{00}+\ket{11}}{\sqrt2}$ is zero.

solution

\begin{matrix} (110) & \frac{⟨ 00 | + ⟨ 11 |}{\sqrt{2}} (X_{1} Z_{2}) \frac{| 00 ⟩ + | 11 ⟩}{\sqrt{2}} = \frac{⟨ 00 | + ⟨ 11 |}{\sqrt{2}} \frac{| 10 ⟩ - | 01 ⟩}{\sqrt{2}} = 0 \end{matrix}

Projective measurements together with unitary dynamics are sufficient to implement a general measurement. The proof of this statement makes use of composite quantum systems, and is a nice illustration of Postulate 4 in action:
- $Q$ $M_m$ $Q$ .
- ancilla system $M$ $\ket m$ in one-to-one correspondence with the possible outcomes of the measurement we wish to implement. (這個擴充系統可被視為數學上虛構的東西或是真實的物理系統.)
- $\ket0$ $M$ $U$ $\ket\psi\ket0$ $\ket\psi$ $Q$ $\ket0$ by
  $\begin{matrix} (111) & U | ψ ⟩ | 0 ⟩ \equiv \sum_{m} M_{m} | ψ ⟩ | m ⟩ \end{matrix}$
- $\ket m$ $U$ $\ket\psi\ket0$ :
  $\begin{aligned} ⟨ 0 | ⟨ φ | U^{†} U | ψ ⟩ | 0 ⟩ & = \sum_{m, m^{'}} ⟨ φ | M_{m}^{†} M_{m^{'}} | ψ ⟩ ⟨ m | m^{'} ⟩ \\ = \sum_{m} ⟨ φ | M_{m}^{†} M_{m} | ψ ⟩ \\ = ⟨ φ | ψ ⟩ \end{aligned}$
- Exercise 2.67 $U$ $Q\otimes M$ $U$ .
- $U$ $\ket\psi\ket0$ $P_m\equiv I_Q\ket m\bra m$ $m$ occurs with probability
  $\begin{aligned} p (m) & = ⟨ ψ | ⟨ 0 | U^{†} P_{m} U | ψ ⟩ | 0 ⟩ \\ = \sum_{m^{'}, m^{″}} ⟨ m^{'} | ⟨ ψ | M_{m^{'}}^{†} (I_{Q} | m ⟩ ⟨ m |) M_{m^{″}} | ψ ⟩ | m^{″} ⟩ \\ = ⟨ ψ | M_{m^{'}}^{†} M_{m} | ψ ⟩ \end{aligned}$
  $QM$ $m$ occurring, is given by:
  $\begin{aligned} \frac{P_{m} U | ψ ⟩ | 0 ⟩}{\sqrt{⟨ 0 | ⟨ ψ | U^{†} P_{m} U | ψ ⟩ | 0 ⟩}} & = \frac{M_{m} | ψ ⟩ | m ⟩}{\sqrt{⟨ ψ | M_{m}^{†} M_{m} | ψ ⟩}} \\ = (\frac{M_{m} | ψ ⟩}{\sqrt{⟨ ψ | M_{m}^{†} M_{m} | ψ ⟩}}) \cdot | m ⟩ \\ = (state of system Q) \cdot (state of system M) \end{aligned}$
  $Q$ is just as exactly the same as given in Postulate 3. Thus unitary dynamics, projective measurements, and the ability to introduce ancillary systems, together allow any measurement of the form described in Postulate 3 to be realized.

Exercise 2.67

$V$ $W$ $U:W\rightarrow V$ $\ket{w_1}$ $\ket{w_2}$ $W$ ,

\begin{matrix} ⟨ w_{1} | U^{†} U | w_{2} ⟩ = ⟨ w_{1} | w_{2} ⟩ \end{matrix}

$U':V\rightarrow V$ extends $U$ $U'\ket w = U\ket w$ $\ket w$ $W$ $U'$ $V$ $'$ $U$ to denote the extension.

solution

$W$ $V$ $W^\perp$ $V = W\oplus W^\perp$ $\ket{w_i}$ $\ket{w_j'}$ $\ket{u_j'}$ $W$ $W^\perp$ $(U\text{ act on }W)^\perp$ , respectively.

$U':V\rightarrow V$ $U'\equiv \sum_i^{\dim W}\ket{u_i}\bra{w_i} + \sum_j^{\dim W^\perp}\ket{u_j'}\bra{w_j'}$ $\ket{u_i} \equiv U\ket{w_i}$ $U'$ by direct calculation:

\begin{matrix} (U^{'})^{†} U^{'} = (\sum_{i} | w_{i} ⟩ ⟨ u_{i} | + \sum_{j} | w_{j}^{'} ⟩ ⟨ u_{j}^{'} |) (\sum_{i} | u_{i} ⟩ ⟨ w_{i} | + \sum_{j} | u_{j}^{'} ⟩ ⟨ w_{j}^{'} |) = \sum_{i} | w_{i} ⟩ ⟨ w_{i} | + \sum_{j} | w_{j}^{'} ⟩ ⟨ w_{j}^{'} | = I \end{matrix}

, and similarly we have:

\begin{matrix} U^{'} (U^{'})^{†} = (\sum_{i} | u_{i} ⟩ ⟨ w_{i} | + \sum_{j} | u_{j}^{'} ⟩ ⟨ w_{j}^{'} |) (\sum_{i} | w_{i} ⟩ ⟨ u_{i} | + \sum_{j} | w_{j}^{'} ⟩ ⟨ u_{j}^{'} |) = \sum_{i} | u_{i} ⟩ ⟨ u_{i} | + \sum_{j} | u_{j}^{'} ⟩ ⟨ u_{j}^{'} | = I \end{matrix}

$U'$ $\ket{w}\in W$ ,

\begin{aligned} U^{'} | w ⟩ & = (\sum_{i} | u_{i} ⟩ ⟨ w_{i} | + \sum_{j} | u_{j}^{'} ⟩ ⟨ w_{j}^{'} |) | w ⟩ \\ = \sum_{i} | u_{i} ⟩ ⟨ w_{i} | w ⟩ + \sum_{j} | u_{j}^{'} ⟩ ⟨ w_{j}^{'} | w ⟩ \\ = \sum_{i} | u_{i} ⟩ ⟨ w_{i} | w ⟩ + 0 (since | w_{j}^{'} ⟩ ⊥ | w ⟩) \\ = \sum_{i} U | w_{i} ⟩ ⟨ w_{i} | w ⟩ = U | w ⟩ \end{aligned}

$U'$ $U$ .

$\ket\psi=\frac{\ket{00}+\ket{11}}{\sqrt2}$ $\ket\psi = \ket a\ket b$ . Hence we have the following exercise:

Exercise 2.68

$\ket\psi\neq\ket a\ket b$ $\ket a$ $\ket b$ .

solution

$\ket\psi=\frac{\ket{00}+\ket{11}}{\sqrt2} = \ket a\ket b = (a_0\ket0+a_1\ket1)(b_0\ket0+b_1\ket1)$ , therefore we have:

\begin{matrix} {\begin{cases} a_{0} b_{0} = 1 / \sqrt{2} \\ a_{0} b_{1} = 0 \\ a_{1} b_{0} = 0 \\ a_{1} b_{1} = 1 / \sqrt{2} \end{cases} \end{matrix}

, which is algebraically impossible.

We say that a state of a composite system having this property (that it can’t be written as a product of states of its component systems) is an entangled state. For reasons which nobody fully understands, entangled states play a crucial role in quantum computation and quantum information, and arise repeatedly through the remainder of this book.

A global view of quantum mechanics:

Postulate 1 sets the arena for quantum mechanics, by specifying how the state of an isolated quantum system is to be described.
Postulate 2 tells us that the dynamics of closed quantum systems are described by the Schrödinger equation, and thus by unitary evolution.
Postulate 3 tells us how to extract information from our quantum systems by giving a prescription for the description of measurement.
Postulate 4 tells us how the state spaces of different quantum systems may be combined to give a description of the composite system.

Might it be possible to reformulate quantum mechanics in a mathematically equivalent way so that it had a structure more like classical physics? It turns out by proving Bell's Inequality we show that quantum mechanics can's excape from its counter-intuitive nature.

2.3 Superdense coding

Alice can send 2 classical bits of information via transmission of only a single qubit to Bob.
$\ket\psi = \frac{\ket{00}+\ket{11}}{\sqrt2}$ , then if Alice wishes to:
$\begin{matrix} (112) & \begin{matrix} encode classical information: {\begin{cases} 00, | ψ ⟩ \overset{perform nothing}{\to} \frac{| 00 ⟩ + | 11 ⟩}{\sqrt{2}} \\ 01, | ψ ⟩ \overset{perform Z gate}{\to} \frac{| 00 ⟩ - | 11 ⟩}{\sqrt{2}} \\ 10, | ψ ⟩ \overset{perform X gate}{\to} \frac{| 01 ⟩ + | 10 ⟩}{\sqrt{2}} \\ 11, | ψ ⟩ \overset{perform i Y gate}{\to} \frac{\pm | 01 ⟩ \mp | 10 ⟩}{\sqrt{2}} \end{cases} \end{matrix} \end{matrix}$
These four states are known as the Bell basis, Bell states, or EPR pairs. Since they form an orthonormal basis, they can be distinguished by an appropriate quantum measurement.
Now after Alice done performing actions to her own qubit, she then send it to Bob.
Bob then do a measurement in the Bell basis, and he can determine which of the four possible bit strings Alice sent.

Exercise 2.69

Verify that the Bell basis forms an orthonormal basis for the two qubit state space.

solution

The bell states defined here are labeled as convention:

\begin{matrix} \begin{matrix} | Φ^{+} ⟩ = \frac{1}{\sqrt{2}} [\begin{matrix} 1 \\ 0 \\ 0 \\ 1 \end{matrix}] | Φ^{-} ⟩ = \frac{1}{\sqrt{2}} [\begin{matrix} 1 \\ 0 \\ 0 \\ - 1 \end{matrix}] | Ψ^{+} ⟩ = \frac{1}{\sqrt{2}} [\begin{matrix} 0 \\ 1 \\ 1 \\ 0 \end{matrix}] | Ψ^{-} ⟩ = \frac{1}{\sqrt{2}} [\begin{matrix} 0 \\ \pm 1 \\ \mp 1 \\ 0 \end{matrix}] \end{matrix} \end{matrix}

Check their linear independence:

\begin{aligned} a | Φ^{+} ⟩ + b | Φ^{-} ⟩ + c | Ψ^{+} ⟩ + d | Ψ^{-} ⟩ & = 0 \\ \Rightarrow a + d = a - d = b + c = b - c = 0 \\ \Rightarrow a = b = c = d = 0 \end{aligned}

$1$ ), and form a set of orthonormal basis (for two qubit space).

Exercise 2.70

$E$ $\expval{E\otimes I}{\psi}$ takes the same value $\ket\psi$ $00,01,10,11$ Alice is trying to send? If so, how, or if not, why not?

solution

$\expval{E\otimes I}{\psi_i} = E_{00} + E_{11} = \expval{E}{0}+\expval{E}{1}$ $\ket{\psi_i}\in\text{Bell states}$ $M_m$ $m$ $p(m) = \expval{M_m^\dagger M_m\otimes I}{\psi_i}$ $M_m^\dagger M_m$ $p(m)$ are the same.

2.4 The Density Operator

An alternative formulation of quantum mechanics other than using the language of state vectors is the tool of density operator or density matrix.
They are mathematically equivalent, but it serves as a more convenient language for thinking about some commonly encountered scenarios in quantum mechanics.

In the following 3 sections, we first intorduce the density operator using the concept of an ensemble of quantum states; next we derive some general properties of the density operator; last we describe an application, as the density operator being a tool for the description of individual subsystems of a composite quantum system.

2.4.1 Ensembles of Quantum States

The density operator language provides a convenient means for describing quantum systems whose state is not completely known.
$p_i$ $\ket{\psi_i}$ $\{p_i,\ket{\psi_i}\}$ an ensemble of pure states. The density operator for the system is defined:
$\begin{matrix} (113) & ρ \equiv \sum_{i} p_{i} | ψ_{i} ⟩ ⟨ ψ_{i} | \end{matrix}$
, which is often known as the density matrix. (the two terms can be used interchangeably)
All postulates regarding quantum mechanics from section 2.3 can be reformulated into the language for density operator. For example, the evolution of the density operator is described by the equation:
$\begin{matrix} (114) & ρ = \sum_{i} p_{i} | ψ_{i} ⟩ ⟨ ψ_{i} | \overset{U}{\to} \sum_{i} p_{i} U | ψ_{i} ⟩ ⟨ ψ_{i} | U^{†} = U ρ U^{†} \end{matrix}$
$\ket{\psi_i}$ $m$ is
$\begin{matrix} (115) & p (m | i) = ⟨ ψ_{i} | M_{m}^{†} M_{m} | ψ_{i} ⟩ = tr (M_{m}^{†} M_{m} | ψ_{i} ⟩ ⟨ ψ_{i} |) \end{matrix}$
here $m$ is:
$\begin{aligned} p (m) & = \sum_{i} p (m | i) p_{i} \\ = \sum_{i} p_{i} tr (M_{m}^{†} M_{m} | ψ_{i} ⟩ ⟨ ψ_{i} |) \\ = tr (M_{m}^{†} M_{m} ρ) \end{aligned}$
$m$ $\ket{\psi_i}$ $m$ is
$\begin{matrix} (116) & | ψ_{i}^{m} ⟩ = \frac{M_{m} | ψ_{i} ⟩}{\sqrt{⟨ ψ_{i} | M_{m}^{†} M_{m} | ψ_{i} ⟩}} \end{matrix}$
$m$ an ensemble of states $\ket{\psi_i^m}$ $p(i|m)$ $\rho_m$ is therefore:
$\begin{matrix} (117) & ρ_{m} = \sum_{i} p (i | m) | ψ_{i}^{m} ⟩ ⟨ ψ_{i}^{m} | = \sum_{i} p (i | m) \frac{M_{m} | ψ_{i} ⟩ ⟨ ψ_{i} | M_{m}^{†}}{⟨ ψ_{i} | M_{m}^{†} M_{m} | ψ_{i} ⟩} \end{matrix}$
, here we do some math manipulation for calculation convenience: (以下轉換你可以自己驗證)
$\begin{matrix} (118) & p (i | m) = \frac{p (i, m)}{p (m)} = \frac{p (m | i) \cdot p_{i}}{p (m)} \end{matrix}$
, therefore the equation becomes:
$\begin{aligned} ρ_{m} & = \sum_{i} p (i | m) \frac{M_{m} | ψ_{i} ⟩ ⟨ ψ_{i} | M_{m}^{†}}{⟨ ψ_{i} | M_{m}^{†} M_{m} | ψ_{i} ⟩} \\ = \sum_{i} \frac{p (m | i) \cdot p_{i}}{p (m)} \frac{M_{m} | ψ_{i} ⟩ ⟨ ψ_{i} | M_{m}^{†}}{⟨ ψ_{i} | M_{m}^{†} M_{m} | ψ_{i} ⟩} \\ = \sum_{i} \frac{⟨ ψ_{i} | M_{m}^{†} M_{m} | ψ_{i} ⟩ \cdot p_{i}}{tr (M_{m}^{†} M_{m} ρ)} \frac{M_{m} | ψ_{i} ⟩ ⟨ ψ_{i} | M_{m}^{†}}{⟨ ψ_{i} | M_{m}^{†} M_{m} | ψ_{i} ⟩} \\ = \sum_{i} \frac{p_{i}}{tr (M_{m}^{†} M_{m} ρ)} M_{m} | ψ_{i} ⟩ ⟨ ψ_{i} | M_{m}^{†} \\ (119) & = \frac{M_{m}^{†} ρ M_{m}}{tr (M_{m}^{†} M_{m} ρ)} \end{aligned}$
$\ket\psi$ $=1$ pure state $\rho = \ket\psi\bra\psi$ $\rho$ is in a mixed statemixture $\rho$ ). In the exercise later we will prove:
$\begin{matrix} (120) & {\begin{cases} a pure state & ⟶ & tr (ρ^{2}) = 1 \\ a mixed state & ⟶ & tr (ρ^{2}) < 1 \end{cases} \end{matrix}$
$\rho_i$ $p_i$ $\sum_ip_i\rho_i$ $\rho_i$ $\{p_{ij},\ket{\psi_{ij}}\}$ $\rho_i$ $p_ip_{ij}$ , 根據此關係我們可以求出此系統的 density matrix:
$\begin{matrix} (121) & ρ = \sum_{i j} p_{i} p_{i j} | ψ_{i j} ⟩ ⟨ ψ_{i j} | = \sum_{i} p_{i} ρ_{i} \end{matrix}$
$\rho_i = \sum_jp_{ij}\ket{\psi_{ij}}\bra{\psi_{ij}}$ $\rho$ mixture $\rho_i$ $p_i$ .
$\rho_m$ $p(m)$ $m$ , then the state of such a system can be described by density operator:
$\begin{matrix} (122) & ρ = \sum_{m} p (m) ρ_{m} = \sum_{m} tr (M_{m}^{†} M_{m} ρ) \frac{M_{m}^{†} ρ M_{m}}{tr (M_{m}^{†} M_{m} ρ)} = \sum_{m} M_{m}^{†} ρ M_{m} \end{matrix}$
, which is a nice compact formula which may be used as the starting point for analysis of further operations on the system.

2.4.2 General Properties of the Density Operator

In this section we move to an intrinsic characterization of density operators that does not rely on an ensemble interpretation.

Theorem 2.5

(Characterization of density operators) $\rho$ $\{p_i, \ket{\psi_i}\}$ if and only if it satisfies the conditions:

Trace condition $\rho$ has trace equal to one. Positivity condition $\rho$ is a positive operator.

proof

$\rho = \sum_ip_i\ket{\psi_i}\bra{\psi_i}$ is a density operator. Then

\begin{matrix} (123) & tr (ρ) = \sum_{i} p_{i} tr (| ψ_{i} ⟩ ⟨ ψ_{i} |) = \sum_{i} p_{i} = 1 \end{matrix}

$\ket\varphi$ is an arbitrary vector in state space. Then

\begin{matrix} (124) & ⟨ φ | ρ | φ ⟩ = \sum_{i} p_{i} ⟨ φ | ψ_{i} ⟩ ⟨ ψ_{i} | φ ⟩ = \sum_{i} p_{i} | ⟨ φ | ψ_{i} ⟩ |^{2} \geq 0 \end{matrix}

, hence the positivity condition is satisfied.

$\rho$ $\rho$ is positive, it must have a spectral decomposition

\begin{matrix} (125) & ρ = \sum_{j} λ_{j} | j ⟩ ⟨ j | \end{matrix}

$\ket j$ $\lambda_j$ real, non-negative eigenvalues $\rho$ $\sum_j\lambda_j=1$ $\ket j$ $\lambda_j$ $\rho$ $\{\lambda_j,\ket{j}\}$ $\rho$ .

define $\rho$ which has trace equal to one. We can now reformulate the postulates of quantum mechanics in the density operator picture:

Postulate 1: Associated to any isolated physical system is a complex vector space with inner product (that is, a Hilbert space) known as the state spacedensity operator $\rho$ $\rho_i$ $p_i$ $\sum_ip_i\rho_i$ .

Postulate 2: The evolution of a closedunitary $\rho$ $t_1$ $\rho'$ $t_2$ $U$ only on the times $t_1$ $t_2$ ,

\begin{matrix} (126) & ρ^{'} = U ρ U^{†} \end{matrix}

Postulate 3 $\{M_m\}$ of measurement operatorson the state space $m$ $\rho$ immediatelyprobability $m$ occurs is given by

\begin{matrix} (127) & p (m) = tr (M_{m}^{†} M_{m} ρ) \end{matrix}

, and the state of the system after the measurement is

\begin{matrix} (128) & \frac{M_{m} ρ M_{m}^{†}}{tr (M_{m}^{†} M_{m} ρ)} \end{matrix}

The measurement operators satify the completeness equation,

\begin{matrix} (129) & \sum_{m} M_{m}^{†} M_{m} = I \end{matrix}

Postulate 4tensor product $1$ $n$ $i$ $\rho_i$ $\rho_1\otimes\rho_2\otimes\cdots\otimes\rho_n$ .

以上的四條量子力學假設重建構是與 state vector-based 的詮釋方式 mathematically equivalent 的. 但換成這種思考方式的好處聽說是可以比較好的描述:

the quantum systems whhose state is not known
the subsystems of a composite quanutm system

Exercise 2.71

(Criterion to decide if a state is mixed or pure) $\rho$ $\tr(\rho^2)\leq 1$ $\rho$ is a pure state.

solution

\begin{aligned} tr (ρ^{2}) & = tr (\sum_{i, j} p_{i} p_{j} | i ⟩ ⟨ i | j ⟩ ⟨ j |) \\ = tr (\sum_{i, j} p_{i} p_{j} | i ⟩ δ_{i j} ⟨ j |) \\ = tr (\sum_{i} p_{i}^{2} | i ⟩ ⟨ i |) \\ = \sum_{i} p_{i}^{2} \end{aligned}

$\rho$ $0\leq p_i\leq 1$ $p_i^2\leq p_i$ $i$ $\sum_ip_i = 1$ $\sum_ip_i^2 \leq 1$ $\rho$ $\rho = \ket\psi\bra\psi$ ).

注意!! $\rho$ 的 eigenvalue & eigenvector 當成是構成它的 quantum states 們, 或是認為兩者之間有什麼神秘關係, 我現在就來讓你們美夢破碎! For example, one might suppose a quantum system with density matrix

\begin{matrix} ρ = \frac{3}{4} | 0 ⟩ ⟨ 0 | + \frac{1}{4} | 1 ⟩ ⟨ 1 | \end{matrix}

$\ket0$ $\frac{3}{4}$ $\ket1$ $\frac{1}{4}$ . 現在就來舉個反例, now if we define:

\begin{aligned} | a ⟩ & \equiv \sqrt{\frac{3}{4}} | 0 ⟩ + \sqrt{\frac{1}{4}} | 1 ⟩ \\ | b ⟩ & \equiv \sqrt{\frac{3}{4}} | 0 ⟩ - \sqrt{\frac{1}{4}} | 1 ⟩ \end{aligned}

$\rho = \frac12\ket{a}\bra{a}+\frac12\ket{b}\bra{b} = \frac{3}{4}\ket0\bra0+\frac{1}{4}\ket1\bra1$ . That is, these two different ensembles of quantum states give rise to the same density matrix!

${\ket{\tilde\psi_i}}$ ${\ket{\tilde\varphi_i}}$ $\rho$ ${\ket{\tilde\psi_i}} = \sqrt{p_i}\ket{\psi_i}$ $\rho = \sum_i{\ket{\tilde\psi_i}}{\bra{\tilde\psi_i}}$ , 就是把 probability 吃進去 statevector 裡面, may become not normalized in length) The answer to this question has many applications in QCQI:

Theorem 2.6

(Unitary freedom in the ensemble for density matrices) ${\ket{\tilde\psi_i}}$ ${\ket{\tilde\varphi_j}}$ generate the same density matrix if and only if

\begin{matrix} (130) & | {\tilde{ψ}}_{i} ⟩ = \sum_{j} u_{i j} | {\tilde{φ}}_{j} ⟩ \end{matrix}

$u_{ij}$ $i$ $j$ $\ket{\tilde\psi_i}$ $\ket{\tilde\varphi_j}$ $0$ $\ket{\psi_i}, \ket{\varphi_j}$ $\sqrt{p_i}\ket{\psi_i} = \sum_ju_{ij}\sqrt{q_j}\ket{\varphi_j}$ $u_{ij}$ .

proof

$\tilde{\ket{\psi_i}} = \sum_ju_{ij}\tilde{\ket{\varphi_j}}$ $u_{ij}$ . Then

\begin{aligned} \sum_{i} \tilde{| ψ_{i} ⟩} \tilde{⟨ ψ_{i} |} & = \sum_{i j k} u_{i j} \tilde{| φ_{j} ⟩} u_{i k}^{*} \tilde{⟨ φ_{k} |} \\ = \sum_{j k} (\sum_{i} u_{k i}^{†} u_{i j}) \tilde{| φ_{j} ⟩} \tilde{⟨ φ_{k} |} \\ = \sum_{j k} I_{k j} \tilde{| φ_{j} ⟩} \tilde{⟨ φ_{k} |} \\ = \sum_{j} \tilde{| φ_{j} ⟩} \tilde{⟨ φ_{j} |} \end{aligned}

$\tilde{\ket{\psi_i}}$ $\tilde{\ket{\varphi_j}}$ generate the same operator.

Conversely, suppose

\begin{matrix} (131) & A = \sum_{i} \tilde{| ψ_{i} ⟩} \tilde{⟨ ψ_{i} |} = \sum_{j} \tilde{| φ_{j} ⟩} \tilde{⟨ φ_{j} |} . \end{matrix}

$A = \sum_k\lambda_k\ket k\bra k$ $A$ $\ket k$ $\lambda_k$ $\tilde{\ket{\psi_i}}$ $\tilde{\ket k}\equiv \sqrt{\lambda_k}\ket k$ $\tilde{\ket{\varphi_j}}$ $\tilde{\ket k}$ , and finally combine the two relations, done.

$\ket\psi$ $\tilde{\ket k}$ $\braket{\psi}{\tilde k}\braket{\tilde k}{\psi}=0$ $k$ $0 = \expval{A}{\psi} = \sum_i\braket{\psi}{\tilde{\psi_i}}\braket{\tilde{\psi_i}}{\psi} = \bigg\Vert\braket{\psi}{\tilde{\psi_i}}\bigg\Vert^2$ .
$\braket{\psi}{\tilde{\psi_i}} = 0$ $i$ $\ket\psi$ $\tilde{\ket k}$ .
$\tilde{\ket{\psi_i}}$ $\tilde{\ket k}$ $\tilde{\ket{\psi_i}} = \sum_kc_{ik}\tilde{\ket k}$ .
$A = \sum_k\lambda_k\ket k\bra k = \sum_k\tilde{\ket k}\tilde{\bra k} = \sum_i\tilde{\ket{\psi_i}}\tilde{\bra{\psi_i}}$ , we see that
$\begin{matrix} (132) & \sum_{k} \tilde{| k ⟩} \tilde{⟨ k |} = \sum_{k l} (\sum_{i} c_{i k} c_{i l}^{*}) \tilde{| k ⟩} \tilde{⟨ l |} \end{matrix}$
$\tilde{\ket k}\tilde{\bra l}$ linearly independent $A$ $\ket k$ 們是 orthonormal 的)
$\sum_i c_{ik}c_{il}^* = \delta_{kl}$ columns $c$ $v$ $\tilde{\ket{\psi_i}} = \sum_kv_{ik}\tilde{\ket k}$ $w$ $\tilde{\ket{\varphi_j}} = \sum_kw_{jk}\tilde{\ket k}$ .
$\tilde{\ket{\psi_i}} = \sum_ju_{ij}\tilde{\ket{\varphi_j}}$ $u = vw^{-1} = vw^\dagger$ . (別忘了 unitary 的性質)

Exercise 2.72

(Bloch sphere for mixed states) The Bloch sphere picture for pure states of a single qubit was introduced in Section 1.2. This description has an important generalization to mixed states as follows.

(1) Show that an arbitrary density matrix for a mixed state qubit may be written as

\begin{matrix} (133) & ρ = \frac{I + \vec{r} \cdot \vec{σ}}{2} \end{matrix}

$\vec r$ $\Vert\vec r\Vert\leq 1$ Bloch vector $\rho$ (2) $\rho = I/2$ (3) $\rho$ $\Vert\vec r\Vert= 1$ . (4) Show that for pure states the description of the Bloch vector we have given coincides with that in Section 1.2

solution

(1)Exercise 2.35 $\rho$ $a,b,c$ .)

\begin{matrix} (134) & \begin{matrix} \frac{I + \vec{r} \cdot \vec{σ}}{2} = \frac{1}{2} [\begin{matrix} 1 + r_{3} & r_{1} - i r_{2} \\ r_{1} + i r_{2} & 1 - r_{3} \end{matrix}] \overset{must be}{\to} ρ = [\begin{matrix} a & b \\ b^{*} & c \end{matrix}] \end{matrix} \end{matrix}

Another property of density matrix is that it's positive, i.e. real non-negative eigenvalues:

\begin{aligned} \det (ρ - λ I) & = \frac{1}{4} ((1 - λ)^{2} - (r_{1}^{2} + r_{2}^{2} + r_{3}^{2})) = 0 \\ \Rightarrow λ = \frac{2 \pm \sqrt{4 - 4 (1 - ‖ \vec{r} ‖^{2})}}{2} = 1 \pm ‖ \vec{r} ‖ \geq 0 (∵ positive matrix) \end{aligned}

$\Vert\vec r\Vert$ $\leq 1$ (2) $\rho = I/2$ $\vec r=0$ , which corresponds to the origin of Bloch sphere. (3)here $\rho$ $\tr(\rho^2)=1$ , hence (可參考 Exercise 2.40)

\begin{matrix} (135) & tr (\frac{1}{4} (I + 2 \vec{r} \cdot \vec{σ} + (\vec{r} \cdot \vec{σ})^{2})) = tr (\frac{1}{4} (I + 2 \vec{r} \cdot \vec{σ} + ‖ \vec{r} ‖^{2} I)) = \frac{1}{4} (2 + 2 ‖ \vec{r} ‖^{2}) = 1 \end{matrix}

$\Vert\vec r\Vert=1$ (4) $\ket\psi=\alpha\ket0+\beta\ket1$ $\rho = \ket\psi\bra\psi$ $\tr(\rho)=0$ $\Vert\alpha\Vert^2+\Vert\beta\Vert^2=1$ (which cioncides with here from section 1.2), it follows that we can rewrite them in the form of:

\begin{matrix} (136) & | ψ ⟩ = e^{i γ} (\cos \frac{θ}{2} | 0 ⟩ + e^{i φ} \sin \frac{θ}{2} | 1 ⟩) \end{matrix}

Exercise 2.73

$\rho$ minimal ensemble $\rho$ $\{p_i, \ket{\psi_i}\}$ $\rho$ $\ket\psi$ $\rho$ support $A$ $A$ non-zero eigenvalues $\rho$ $\ket\psi$ $\ket\psi$ must appear with probability

\begin{matrix} (137) & p_{i} = \frac{1}{⟨ ψ_{i} | ρ^{- 1} | ψ_{i} ⟩} \end{matrix}

$\rho^{-1}$ $\rho$ $\rho$ $\rho$ . (This definition $\rho$ may not have an inverse.)

solution

Referencing from the last two pages in this paper and from the solution of this paper.

to be continued...

.

(end of document)

0 Nomenclature and notation

0.0 Linear algebra and quantum mechanics

0.1 Information theory and probability

0.2 Frequently used gates & symbols

1 Introduction and overview

1.1 Global perspectives

1.2 Quantum bits

1.3 Quantum computation

1.3.1 Single qubit gates

✏️ Decomposing single qubit operations

1.3.2 Multiple qubit gates

1.3.3 Measurements & Quantum circiuts

✏️ The no-cloning theorem

1.3.4 Bell states & Quantum teleportation

1.4 Quantum algorithms

1.4.1 Simulating Classical Computer

1.4.2 Quantum parallelism

1.4.3 Deutsch's Algorithm

1.4.4 The Deutsch–Jozsa algorithm

1.4.5 Summarization

1.5 Experimental quantum information processing

1.5.1 The Stern–Gerlach experiment

1.5.2 Prospects for practical quantum information processing

1.6 Quantum Information

2 Introduction to quantum mechanics

2.1 Linear algebra

2.1.1 Bases, operators and matrices

2.1.2 The Pauli matrices and inner products

✏️ Proof of the Cauchy–Schwarz inequality

2.1.3 Eigenvectors and Hermitian operators

2.1.4 Tensor products

2.1.5 Operator functions

2.1.6 The commutator and anti-commutator

2.1.7 The polar and singular value decompositions

2.2 The postulates of quantum mechanics

2.2.1 State space & Evolution

2.2.2 Quantum measurement

2.2.3 Distinguishing quantum states

✏️ Proof that non-orthogonal states can’t be reliably distinguished

2.2.4 Projective measurements

✏️ The Heisenberg uncertainty principle

2.2.5 POVM measurements

✏️ General measurements, projective measurements, and POVMs

2.2.6 Phase & Composite systems

2.3 Superdense coding

2.4 The Density Operator

2.4.1 Ensembles of Quantum States

2.4.2 General Properties of the Density Operator

.