Operators on Inner Product Space

Happy Thanksgiving! This note and the next one contain possibly the most relevant concepts and theorems in the context of quantum computing(QC) and quantum information(QI) theory. Not much argument of QC and QI is made here as a decent discussion of operators used in QC/QI research can not be justified in a short note. Therefore, current note still serves as a brief reference to important concepts and theorems. Because the difference between real and complex vector space does matter here, we will specify the type of vector space if a theorem is applicable only to complex (

C^{n}

) or real vector spaces (

R^{n}

1. Self-adjoint (Hermitian) and Normal Operators 2. The Spectral Theorem 3. Positive Operators and Isometries 4. Polar Decomposition and Singular Value Decomposition 5. How the notes are made?

1. Self-adjoint (Hermitian) and Normal OperatorsWe start by making an analogy between operators and complex numbers. For each complex number

z \in C

, we have its conjugate

\bar{z}

such that

z \bar{z} = | | z | |^{2}

. Also,

z / | | z | |

is said to be the normalized complex number living on the unit circle on the complex plane. If each complex number is analogous to an operator, then we shall see in the later discussions that we can define "conjugate" and "normalized" operators, corresponding to the concept of

\bar{z}

and

z / | | z | |

, perspectively. But why do we care this analogy? One of shallow reasons would be to make the study of operators complete. With analogous concepts defined we can manipulate operators in a way similar to vectors in the spaces of our interest."Adjoint" of a linear map is analogous to the concept of "conjugacy", and it is defined as the following:

Definition

Suppose

v \in V

w \in W

, and

T \in L (V, W)

, then the adjoint of

T

T^{†} \in L (W, V)

, is an operator satifying the following condition

⟨ T v, w ⟩ = ⟨ v, T^{†} w ⟩

for every

v

and

w

To make sense out of the definition, we first notice that

⟨ T v, w ⟩

is a linear functional for all

v \in V

. According to Riesz representation theorem, we can transform

⟨ T v, w ⟩

into a format of

⟨ v, u ⟩

with

u \in V

being an unique vector. Replacing

T^{†} w

with

u

, we recover the condition in Definition 1. A natural follow-up question would be: how do matrices of

T

and

T^{†}

are related? The next theorem states that the two matrices are conjugate transposes to each other. To get the conjugate transpose of a matrix we simply transpose it and replace each entry with corresponding complex conjugate.

Theorem

let

T \in L (V, W)

and let orthonormal bases of

V

and

W

(e_{1}, . . ., e_{n})

and

(f_{1}, . . ., f_{m})

, respectively. Then

M (T)

with respect to the two bases is the conjugate transpose of

M (T^{†})

Proof:

M (T)

maps

e_{i}

⟨ T e_{i}, f_{1} ⟩ f_{1} + \dots + ⟨ T e_{i}, f_{m} ⟩ f_{m}

due to the definition of orthonormal basis. According to our previous discussion of matrix representation, the entry at

j

th row and

i

th column is

⟨ T e_{i}, f_{j} ⟩

. On the other hand,

M (T^{†})

maps

f_{j}

⟨ T^{†} f_{j}, e_{1} ⟩ e_{1} + \dots + ⟨ T^{†} f_{j}, e_{n} ⟩ e_{n}

. So the entry at

i

th row and

j

th column is

⟨ T^{†} f_{j}, e_{i} ⟩

. Because

⟨ T e_{i}, f_{j} ⟩ = ⟨ e_{i}, T^{†} f_{j} ⟩ = \bar{⟨ T^{†} f_{j}, e_{i} ⟩}

for each entry of

M (T)

, we have

M (T^{†})

being conjugate transpose of

M (T)

.□In the next box we list properties of the adjoint, and note that inner product is defined on the vector spaces mentioned below. We will prove

(3)

and

(5)

as they are common exercises given in introductory quantum mechanics class.

Proposition

\begin{array}{c} (S + T)^{†} = S^{†} + T^{†}, \forall S, T \in L (V, W) \\ (𝜆 T)^{†} = \bar{𝜆} T^{†} \\ (T^{†})^{†} = T \\ I^{†} = I \\ (S T)^{†} = T^{†} S^{†}, \forall S \in L (W, U) and \forall T \in L (V, W) \end{array}

Proof: To show

(3)

, suppose

T \in L (V, W)

, we have

⟨ w, (T^{†})^{†} v ⟩ = ⟨ T^{†} w, v ⟩ = ⟨ w, T v ⟩

because of Definition 1 for any

v \in V

and

w \in W

. And the equality implies that

(T^{†})^{†} = T

as desired. To show

(5)

we have

⟨ v, (S T)^{†} u ⟩ = ⟨ (S T) v, u ⟩ = ⟨ T v, S^{†} u ⟩ = ⟨ v, T^{†} S^{†} u ⟩

which implies

(S T)^{†} = T^{†} S^{†}

for

u \in U

and

v \in V

as desired.□Theorem 1 and Proposition 1 apply to general linear maps, including operators. When

T \in L (V)

M (T)

and

M (T^{†})

are two square matrices, making it possible to have

M (T) = M (T^{†})

. Recall that

M

is an isomorphism between

L (V)

and

F^{n, n}

, so this equality also implies

T = T^{†}

, which gives the definition of self-adjoint operator.

Definition

Let

T \in L (V)

, and it is a self-adjoint operator if

T = T^{†}

. In the language of inner product, an operator is self-adjoint if and only if

⟨ T v, w ⟩ = ⟨ v, T w ⟩

for every

v, w \in V

From Definition 2, it should be easy to prove that self-adjoint operators have the following properties:

Proposition

Suppose operators

T

and

S

are self-adjoint, then the following equalities hold

\begin{array}{c} ⟨ (T + S) v, w ⟩ = ⟨ v, (T + S) w ⟩ \\ ⟨ 𝜆 T v, w ⟩ = ⟨ v, 𝜆 T w ⟩ for 𝜆 \in R \end{array}

Going back our analogy at the beginning, the self-adjoint operators are analogous to real number in

C .

We note here that self-adjoint operators in the literature of physics are referred as Hermitian operators. In quantum mechanics, it is postulated that the energy of a physical system can be calculated by applying corresponding energy operator to wavefunction(a vector in Hilbert space). Such an operator is called as Hamiltonian operator, and it must have real eigenvaluse because the energy is a measurable quantity. All the Hamiltonian operators are postulated to be Hermitian because of the following proposition:

Proposition

Every eigenvalue of a self-adjoint (Hermitian) operator is real.

Proof: Let

v \in V

be an eigenvector of self-joint

T \in M (V)

. Suppose

T v = 𝜆 v

, then

⟨ T v, v ⟩ = 𝜆 | | v | |^{2} = ⟨ v, T v ⟩ = \bar{𝜆} | | v | |^{2}

which implies

𝜆 = \bar{𝜆}

, i.e.,

𝜆

must be real number.To check if an operator is self-adjoint, the following theorem is of some use, which is only applicable to complex vector spaces.

Theorem

Suppose

V

is a complex inner product space and

T \in L (V)

. Then

T

is self-adjoint if and only if

⟨ T v, v ⟩ \in R

for every

v \in V

For a vector space

V

, the self-adjoint operators compose of subspace of

L (V)

, and it is encompassed by another subspace of

L (V)

made of normal operators whose definition is the following:

Definition

T \in L (V)

is normal operator if

T T^{†} = T^{†} T

Apparently, self-adjoint operators must be normal. The following result provides a way to characterize normal operators:

Proposition

T \in L (V)

is normal if and only if

| | T v | | = | | T^{†} v | |

for all

v \in V

Being normal also gives interesting results regarding eigenvectors as shown below:

Theorem

Suppose

T \in L (V)

is normal and

v \in V

is an eigenvector of

T

with eigenvalue

𝜆

. Then

v

is also an eigenvector of

T^{†}

with eigenvalue

\bar{𝜆}

Proof: To see this, notice that

T - 𝜆 I

is normal when

T

is normal. Because

T v = 𝜆 v

, we have

| | (T - 𝜆 I) v | | = 0 = | | (T - 𝜆 I)^{†} v | | = | | (T^{†} - \bar{𝜆} I) v | |

which implies that

v

is also an eigenvector of

T^{†}

with associated eigenvalue

\bar{𝜆}

. □Also, eigenvectors of normal operators are orthogonal as shown in the following result.

Theorem

Suppose

T \in L (V)

is normal. Then eigenvectors of

T

corresponding distinct eigenvalues are orthogonal.

Proof: Let

v, u \in V

, and

T v = 𝛼 v, T u = 𝛽 u

. We have

(𝛼 - 𝛽) ⟨ v, u ⟩ = ⟨ 𝛼 v, u ⟩ - ⟨ v, \bar{𝛽} u ⟩ = ⟨ T v, u ⟩ - ⟨ v, T^{†} u ⟩ = ⟨ T v, u ⟩ - ⟨ T v, u ⟩ = 0

where the second equality comes from Theorem 3. Because

𝛼 \neq 𝛽

, we must have

⟨ v, u ⟩ = 0

.□In previous notes, we already know that finite-dimensional complex spaces have eigenvalues, and we have not characterized eigenvalues of operators on real vector spaces. The following theorem provides a fact that as long as an operator is self-adjoint, no matter it is on real or complex vector space, it has eigenvalues.

Theorem

Suppose

V \neq {0}

and

T \in L (V)

is a self-adjoint operator. Then

T

has an eigenvalue.

2. The Spectral TheoremFor a given operator, the spectral theorem decides whether it is diagonalizable w.r.t orthonormal basis. On real vector spaces, such a condition is satisfied when the operator is self-adjoint as shown below.

Theorem

Suppose

F = R

and

T \in L (V)

. Then the following are equivalent:(a)

T

is self-adjoint.(b)

V

has an orthonormal basis consisting of eigenvectors of

T

.(c)

T

has a diagonal matrix with respect to some orthonormal basis of

V

We are not goint to prove the theorem but a proof might require the following lemma

Lemma

Suppose

T \in L (V)

is self-adjoint and

b, c \in R

are such that

b^{2} < 4 c

. Then

T^{2} + b T + c I

is invertible.

Proof: we want to show that

T^{2} + b T + c I

is injective as

V

is finite-dimensional. For nonzero

v \in V

, we can expand the following

⟨ (T^{2} + b T + c I) v, v ⟩ = c | | v | |^{2} + | | T v | |^{2} + b ⟨ T v, v ⟩ .

Because

b ⟨ T v, v ⟩ \leq | b | | ⟨ T v, v ⟩ | \leq | b | | | T v | | | | v | |

, we must have

⟨ (T^{2} + b T + c I) v, v ⟩ \geq c | | v | |^{2} + | | T v | |^{2} - | b | | | T v | | | | v | | = (| | v ‖ - \frac{| b | | | v ‖}{2})^{2} + (c - \frac{b^{2}}{4}) | | v ‖^{2} > 0.

Thus,

n u l l (T^{2} + b T + c I) = {0}

as desired.□When

F = C

and

V

is complex vector space we have the complex spectral theorem.

Theorem

Suppose

F = C

and

T \in L (V)

. Then the following are equivalent:(a)

T

is normal.(b)

V

has an orthonormal basis consisting of eigenvectors of

T

.(c)

T

has a diagonal matrix with respect to some orthonormal basis of

V

3. Positive Operators and IsometriesWe have known that self-adjoint operators are composed of a subset of normal operators, but is there a famous subset to self-adjoint operators? Yes! And one of famous subset comprises positive operators defined as the following:

Definition

An operator

T \in L (V)

is called positive if

T

is self-adjoint and

⟨ T v, v ⟩ \geq 0

for all

v \in V

In the definition above, we can drop the contraint of being self-adjoint for operators on complex spaces because of Theorem 2. Before we proceed to introduce the characterization of positive operators, we need to first define the concept of "square root" of a operator.

Definition

An operator

R

is called a square root of an operator

T

R^{2} = T

The set of propositions listed below are equivalent when

T \in L (V)

is positive operator, and we will prove (c) using the spectral theorem introduced above.

Theorem

Let

T \in L (V)

. Then the following are equivalent:(a)

T

is positive;(b)

T

is self-adjoint and all the eigenvalues of

T

are nonnegative;(c)

T

has a positive square root;(d)

T

has a self-adjoint square root;(e) there exists an operator

R \in L (V)

such that

T = R^{†} R

Proof for (c): Because (b) holds, we have an orthonormal basis made of eigenvectors of

T

due to Theorem 6 and Theorem 7. Let the basis be

e_{1}, . . . e_{n}

, with corresponding eigenvalues being

𝜆_{1}, . . . ., 𝜆_{n}

. Since

𝜆_{j}

is nonnegative. Let

R

be a linear map from

V

V

such that

R e_{j} = \sqrt{𝜆_{j}} e_{j}

, resulting

⟨ R v, v ⟩ = ⟨ R (\sum ⟨ v, e_{i} ⟩ e_{i}), (\sum ⟨ v, e_{i} ⟩ e_{i}) ⟩ = \sum | ⟨ v, e_{i} ⟩ |^{2} \sqrt{𝜆_{i}} > 0.

R

is positive, and it is obvious that

R^{2} e_{j} = 𝜆_{j} e_{j} = T e_{j}

.□A positive operator can have infinite number of square roots, but only one of them is positive. In other words,

Proposition

Every positive operator has a unique positive square root.

We end this section by introducing another important class of operators, i.e. isometry, as defined below:

Definition

An operator

S \in L (V)

is called an isometry if

| | S v | | = | | v | |

for all

v \in V

. In other words, an operator is an isometry if it preserves norms.

Like what we did for positive operator, the characterization of isometry is listed below. The equivalence of (a) and (c) [or (d)] shows that an operator is an isometry if and only if the list of columns of its matrix with respect to every [or some] basis is orthonormal.

Theorem

Suppose

S \in L (V)

. Then the following are equivalent:(a) S is an isometry;(b)

⟨ S u, S v ⟩ = ⟨ u, v ⟩

for all

u, v \in V

(c)

S e_{1}, \dots, S e_{n}

is orthonormal for every orthonormal list of vectors

e_{1}, \dots, e_{n}

V

(d) there exists an orthonormal basis

e_{1}, \dots, e_{n}

V

such that

S e_{1}, \dots, S e_{n}

is orthonormal;(e)

S^{†} S = I

(f)

S S^{†} = I

(g)

S^{†}

is an isometry;(h)

S

is invertible and

S^{- 1} = S^{†}

Isometry is closely related to unitary operators, the ones for describing behaviors of quantum gates. In Axler's book the author refers an isometry on a complex vector space as unitary operator, and orthogonal operator on a real vector sapce. Both isometry and unitary operators satisfying the condition of

S^{†} S = S S^{†} = I

. Accoding to literature of operators on Hilbert space, the major differences between the two are that a) unitary operators are defined on Hilbert space,

H

, and b) unitary operators are bounded linear operators. An operator

S \in L (H)

is bounded if and only if there exists some scalar

M > 0

such that for all

v \in H

| | S v | | \leq M | | v | |

.(e) and (f) of Theorem 9 indicate that isometries are normal operators. By Theorem 7, we can show that eigenvalues of isometry have their absolute values being unity.

Theorem

Suppose

V

is a complex inner product space and

S \in L (V)

. Then the following are equivalent:(a)

S

is an isometry.(b) There is an orthonormal basis of

V

consisting of eigenvectors of

S

whose corresponding eigenvalues all have absolute value 1.

Proof: Suppose (a) holds, by Theorem 7

V

has an orthonormal basis made of eigenvector of

S

as isometries are normal. Let

e_{1}, . . ., e_{n}

and

𝜆_{1}, . . ., 𝜆_{n}

be the basis and associated eigenvalues, respectively. Because

S e_{i} = 𝜆_{i} e_{i}

, we have

⟨ S e_{i}, S e_{i} ⟩ = ⟨ e_{i}, e_{i} ⟩ = | 𝜆_{i} |^{2} = 1

due to (b) in Theorem 9.□4. Polar Decomposition and Singular Value DecompositionLet us review the analogy between operators and complex number proposed in Section 1. Each complex number

z

can be written as

z = (\frac{z}{| z |}) \sqrt{z \bar{z}} .

We would expect that on a vector space there should be two classes of operators analogous to

z / | z |

and

\sqrt{z \bar{z}}

, respectively. Indeed, the theorem of polar decomposition tells us that for an arbitrary operator

T

there is an isometry serving as

z / | z |

and

\sqrt{T^{†} T}

\sqrt{z \bar{z}}

Theorem

Suppose

T \in L (V)

. Then there exists an isometry

S \in L (V)

such that

T = S \sqrt{T^{†} T}

It is worth noting that

\sqrt{}

here refers to the unique positive square root of

T^{†} T

as implied in Proposition 5.When

V

in Theorem 11 is a complex space, then

S

is a diagonal matrix with respect to an orthonormal basis. Because

\sqrt{T^{†} T}

is positive, it is also a diagonal matrix with respect to an orthonormal basis. We emphasize here that these two bases may NOT be the same.Before we introduce the singular value decomposition, let us define what singular values are:

Definition

Suppose

T \in L (V)

. The singular values of

T

are the eigenvalues of

\sqrt{T^{†} T}

, with each eigenvalue

𝜆

repeated

d i m E (𝜆, \sqrt{T^{†} T})

times.

The singular values must be nonnegative, because

T^{†} T

is positive for any operator, and

\sqrt{T^{†} T}

is the positive square root. With this, the singular value decomposition tells us that matrix of every operator is diagonal with respect to two distinct sets of bases, and diagonal values are singular values.

Theorem

Suppose

T \in L (V)

has singular values

s_{1}, \dots, s_{n}

. Then there exist orthonormal bases

e_{1}, \dots, e_{n}

and

f_{1}, \dots, f_{n}

V

such that

T v = s_{1} ⟨ v, e_{1} ⟩ f_{1} + \dots + s_{n} ⟨ v, e_{n} ⟩ f_{n}

for every

v \in V

Calculating singular values is not a trivial task in the study of computational linear algebra. To do so, people first calculate

M (T^{†} T)

and its approximate eigenvalues. The nonnegative square roots of these eigenvalues are approximate singular values because of Theorem 8.5. How the notes are made?A supervisor of undergrad classes once asked me (then a Ph.D. candidate) the reason for making class notes. I am not a professor, and perhaps will never be one (not a big fan of teaching). So writing notes is not getting prepared for my future students. Rather, I do enjoy the time I spent on ranting things out of my own logic. The facts in these notes are the same as the ones in standard texts but the flow of logic must be mine. To prepare each note, I start by reading a chapter or two from a (non-)standard text, after which the author would review the content again and write down an outline for developing a note. The outline might be modified and even abandoned during the writting process, as details of a story might sound more intuitive if it is presented in a particular sequence. While writing the author definitely goes back to the text to assure himself nothing was missed, but the challenge is to avoid checking textbooks as much as possible. In order to relate the notes to quantum computing and quantum information, so extra arguments of quantum mechanics are made whenever the author finds it fit the big picture. As the final disclaimer, the posted notes were not examined with sufficient care so typos might not be uncommon. The author apologizes for that in advance just in case someone find them.