Happy Thanksgiving! This note and the next one contain possibly the most relevant concepts and theorems in the context of quantum computing(QC) and quantum information(QI) theory. Not much argument of QC and QI is made here as a decent discussion of operators used in QC/QI research can not be justified in a short note. Therefore, current note still serves as a brief reference to important concepts and theorems. Because the difference between real and complex vector space does matter here, we will specify the type of vector space if a theorem is applicable only to complex (Cn) or real vector spaces (Rn).1. Self-adjoint (Hermitian) and Normal Operators 2. The Spectral Theorem 3. Positive Operators and Isometries 4. Polar Decomposition and Singular Value Decomposition 5. How the notes are made? 1. Self-adjoint (Hermitian) and Normal OperatorsWe start by making an analogy between operators and complex numbers. For each complex number zC, we have its conjugate z¯ such that zz¯=||z||2. Also, z/||z|| is said to be the normalized complex number living on the unit circle on the complex plane. If each complex number is analogous to an operator, then we shall see in the later discussions that we can define "conjugate" and "normalized" operators, corresponding to the concept of z¯ and z/||z||, perspectively. But why do we care this analogy? One of shallow reasons would be to make the study of operators complete. With analogous concepts defined we can manipulate operators in a way similar to vectors in the spaces of our interest."Adjoint" of a linear map is analogous to the concept of "conjugacy", and it is defined as the following:
Definition1Suppose vV, wW, and TL(V,W), then the adjoint of T, TL(W,V), is an operator satifying the following conditionTv,w=v,Twfor every v and w.
To make sense out of the definition, we first notice that Tv,w is a linear functional for all vV. According to Riesz representation theorem, we can transform Tv,w into a format of v,u with uV being an unique vector. Replacing Tw with u, we recover the condition in Definition 1. A natural follow-up question would be: how do matrices of T and T are related? The next theorem states that the two matrices are conjugate transposes to each other. To get the conjugate transpose of a matrix we simply transpose it and replace each entry with corresponding complex conjugate.
Theorem1let TL(V,W) and let orthonormal bases of V and W be (e1,...,en) and (f1,...,fm), respectively. Then M(T) with respect to the two bases is the conjugate transpose of M(T).
Proof: M(T) maps ei to Tei,f1f1++Tei,fmfm due to the definition of orthonormal basis. According to our previous discussion of matrix representation, the entry at jth row and ith column is Tei,fj. On the other hand, M(T) maps fj to Tfj,e1e1++Tfj,enen. So the entry at ith row and jth column is Tfj,ei. Because Tei,fj=ei,Tfj=Tfj,ei¯ for each entry of M(T), we have M(T) being conjugate transpose of M(T).In the next box we list properties of the adjoint, and note that inner product is defined on the vector spaces mentioned below. We will prove (3) and (5) as they are common exercises given in introductory quantum mechanics class.
Proposition1 (S+T)=S+T,S,TL(V,W)(𝜆T)=𝜆¯T(T)=TI=I(ST)=TS,SL(W,U) and TL(V,W)
Proof: To show (3), suppose TL(V,W), we have w,(T)v=Tw,v=w,Tv because of Definition 1 for any vV and wW. And the equality implies that (T)=T as desired. To show (5) we have v,(ST)u=(ST)v,u=Tv,Su=v,TSuwhich implies (ST)=TS for uU and vV as desired.Theorem 1 and Proposition 1 apply to general linear maps, including operators. When TL(V), M(T) and M(T) are two square matrices, making it possible to have M(T)=M(T). Recall that M is an isomorphism between L(V) and Fn,n, so this equality also implies T=T, which gives the definition of self-adjoint operator.
Definition2Let TL(V), and it is a self-adjoint operator if T=T. In the language of inner product, an operator is self-adjoint if and only ifTv,w=v,Twfor every v,wV.
From Definition 2, it should be easy to prove that self-adjoint operators have the following properties:
Proposition2Suppose operators T and S are self-adjoint, then the following equalities hold(T+S)v,w=v,(T+S)w𝜆Tv,w=v,𝜆Twfor𝜆R
Going back our analogy at the beginning, the self-adjoint operators are analogous to real number in C.We note here that self-adjoint operators in the literature of physics are referred as Hermitian operators. In quantum mechanics, it is postulated that the energy of a physical system can be calculated by applying corresponding energy operator to wavefunction(a vector in Hilbert space). Such an operator is called as Hamiltonian operator, and it must have real eigenvaluse because the energy is a measurable quantity. All the Hamiltonian operators are postulated to be Hermitian because of the following proposition:
Proposition3Every eigenvalue of a self-adjoint (Hermitian) operator is real.
Proof: Let vV be an eigenvector of self-joint TM(V). Suppose Tv=𝜆v, then Tv,v=𝜆||v||2=v,Tv=𝜆¯||v||2which implies 𝜆=𝜆¯, i.e., 𝜆 must be real number.To check if an operator is self-adjoint, the following theorem is of some use, which is only applicable to complex vector spaces.
Theorem2Suppose V is a complex inner product space and TL(V). Then T is self-adjoint if and only ifTv,vRfor every vV.
For a vector space V, the self-adjoint operators compose of subspace of L(V), and it is encompassed by another subspace of L(V) made of normal operators whose definition is the following:
Definition3TL(V) is normal operator if TT=TT.
Apparently, self-adjoint operators must be normal. The following result provides a way to characterize normal operators:
Proposition4TL(V) is normal if and only if ||Tv||=||Tv|| for all vV.
Being normal also gives interesting results regarding eigenvectors as shown below:
Theorem3Suppose TL(V) is normal and vV is an eigenvector of T with eigenvalue 𝜆. Then v is also an eigenvector of T with eigenvalue 𝜆¯.
Proof: To see this, notice that T-𝜆I is normal when T is normal. Because Tv=𝜆v, we have||(T-𝜆I)v||=0=||(T-𝜆I)v||=||(T-𝜆¯I)v||which implies that v is also an eigenvector of T with associated eigenvalue 𝜆¯. Also, eigenvectors of normal operators are orthogonal as shown in the following result.
Theorem4Suppose TL(V) is normal. Then eigenvectors of T corresponding distinct eigenvalues are orthogonal.
Proof: Let v,uV, and Tv=𝛼v,Tu=𝛽u. We have(𝛼-𝛽)v,u=𝛼v,u-v,𝛽¯u=Tv,u-v,Tu=Tv,u-Tv,u=0where the second equality comes from Theorem 3. Because 𝛼𝛽, we must have v,u=0.In previous notes, we already know that finite-dimensional complex spaces have eigenvalues, and we have not characterized eigenvalues of operators on real vector spaces. The following theorem provides a fact that as long as an operator is self-adjoint, no matter it is on real or complex vector space, it has eigenvalues.
Theorem5Suppose V{0} and TL(V) is a self-adjoint operator. Then T has an eigenvalue.
2. The Spectral TheoremFor a given operator, the spectral theorem decides whether it is diagonalizable w.r.t orthonormal basis. On real vector spaces, such a condition is satisfied when the operator is self-adjoint as shown below.
Theorem6Suppose F=R and TL(V). Then the following are equivalent:(a) T is self-adjoint.(b) V has an orthonormal basis consisting of eigenvectors of T.(c) T has a diagonal matrix with respect to some orthonormal basis of V.
We are not goint to prove the theorem but a proof might require the following lemma
Lemma1Suppose TL(V) is self-adjoint and b,cR are such that b2<4c. ThenT2+bT+cIis invertible.
Proof: we want to show that T2+bT+cI is injective as V is finite-dimensional. For nonzero vV, we can expand the following(T2+bT+cI)v,v=c||v||2+||Tv||2+bTv,v.Because bTv,v|b||Tv,v||b|||Tv||||v||, we must have(T2+bT+cI)v,vc||v||2+||Tv||2-|b|||Tv||||v||=(||v-|b|||v2)2+(c-b24)||v2>0.Thus, null(T2+bT+cI)={0} as desired.When F=C and V is complex vector space we have the complex spectral theorem.
Theorem7Suppose F=C and TL(V). Then the following are equivalent:(a) T is normal.(b) V has an orthonormal basis consisting of eigenvectors of T.(c) T has a diagonal matrix with respect to some orthonormal basis of V.
3. Positive Operators and IsometriesWe have known that self-adjoint operators are composed of a subset of normal operators, but is there a famous subset to self-adjoint operators? Yes! And one of famous subset comprises positive operators defined as the following:
Definition4An operator TL(V) is called positive if T is self-adjoint andTv,v0for all vV.
In the definition above, we can drop the contraint of being self-adjoint for operators on complex spaces because of Theorem 2. Before we proceed to introduce the characterization of positive operators, we need to first define the concept of "square root" of a operator.
Definition5An operator R is called a square root of an operator T if R2=T.
The set of propositions listed below are equivalent when TL(V) is positive operator, and we will prove (c) using the spectral theorem introduced above.
Theorem8Let TL(V). Then the following are equivalent:(a) T is positive;(b) T is self-adjoint and all the eigenvalues of T are nonnegative;(c) T has a positive square root;(d) T has a self-adjoint square root;(e) there exists an operator RL(V) such that T=RR.
Proof for (c): Because (b) holds, we have an orthonormal basis made of eigenvectors of T due to Theorem 6 and Theorem 7. Let the basis be e1,...en, with corresponding eigenvalues being 𝜆1,....,𝜆n. Since 𝜆j is nonnegative. Let R be a linear map from V to V such that Rej=𝜆jej, resulting
Rv,v=R(v,eiei),(v,eiei)=|v,ei|2𝜆i>0.So R is positive, and it is obvious that R2ej=𝜆jej=Tej.A positive operator can have infinite number of square roots, but only one of them is positive. In other words,
Proposition5Every positive operator has a unique positive square root.
We end this section by introducing another important class of operators, i.e. isometry, as defined below:
Definition6An operator SL(V) is called an isometry if||Sv||=||v||for all vV. In other words, an operator is an isometry if it preserves norms.
Like what we did for positive operator, the characterization of isometry is listed below. The equivalence of (a) and (c) [or (d)] shows that an operator is an isometry if and only if the list of columns of its matrix with respect to every [or some] basis is orthonormal.
Theorem9Suppose SL(V). Then the following are equivalent:(a) S is an isometry;(b) Su,Sv=u,v for all u,vV(c) Se1,,Sen is orthonormal for every orthonormal list of vectors e1,,en in V(d) there exists an orthonormal basis e1,,en of V such that Se1,,Sen is orthonormal;(e) SS=I(f) SS=I(g) S is an isometry;(h) S is invertible and S-1=S.
Isometry is closely related to unitary operators, the ones for describing behaviors of quantum gates. In Axler's book the author refers an isometry on a complex vector space as unitary operator, and orthogonal operator on a real vector sapce. Both isometry and unitary operators satisfying the condition of SS=SS=I. Accoding to literature of operators on Hilbert space, the major differences between the two are that a) unitary operators are defined on Hilbert space, H, and b) unitary operators are bounded linear operators. An operator SL(H) is bounded if and only if there exists some scalar M>0 such that for all vH, ||Sv||M||v||.(e) and (f) of Theorem 9 indicate that isometries are normal operators. By Theorem 7, we can show that eigenvalues of isometry have their absolute values being unity.
Theorem10Suppose V is a complex inner product space and SL(V). Then the following are equivalent:(a) S is an isometry.(b) There is an orthonormal basis of V consisting of eigenvectors of S whose corresponding eigenvalues all have absolute value 1.
Proof: Suppose (a) holds, by Theorem 7 V has an orthonormal basis made of eigenvector of S as isometries are normal. Let e1,...,en and 𝜆1,...,𝜆n be the basis and associated eigenvalues, respectively. Because Sei=𝜆iei, we have
Sei,Sei=ei,ei=|𝜆i|2=1due to (b) in Theorem 9.4. Polar Decomposition and Singular Value DecompositionLet us review the analogy between operators and complex number proposed in Section 1. Each complex number z can be written as z=(z|z|)zz¯.We would expect that on a vector space there should be two classes of operators analogous to z/|z| and zz¯, respectively. Indeed, the theorem of polar decomposition tells us that for an arbitrary operator T there is an isometry serving as z/|z| and TT as zz¯.
Theorem11Suppose TL(V). Then there exists an isometry SL(V) such thatT=STT
It is worth noting that here refers to the unique positive square root of TT as implied in Proposition 5.When V in Theorem 11 is a complex space, then S is a diagonal matrix with respect to an orthonormal basis. Because TT is positive, it is also a diagonal matrix with respect to an orthonormal basis. We emphasize here that these two bases may NOT be the same.Before we introduce the singular value decomposition, let us define what singular values are:
Definition7Suppose TL(V). The singular values of T are the eigenvalues of TT, with each eigenvalue 𝜆 repeated dimE(𝜆,TT) times.
The singular values must be nonnegative, because TT is positive for any operator, and TT is the positive square root. With this, the singular value decomposition tells us that matrix of every operator is diagonal with respect to two distinct sets of bases, and diagonal values are singular values.
Theorem12Suppose TL(V) has singular values s1,,sn. Then there exist orthonormal bases e1,,en and f1,,fn of V such thatTv=s1v,e1f1++snv,enfnfor every vV.
Calculating singular values is not a trivial task in the study of computational linear algebra. To do so, people first calculate M(TT) and its approximate eigenvalues. The nonnegative square roots of these eigenvalues are approximate singular values because of Theorem 8.5. How the notes are made?A supervisor of undergrad classes once asked me (then a Ph.D. candidate) the reason for making class notes. I am not a professor, and perhaps will never be one (not a big fan of teaching). So writing notes is not getting prepared for my future students. Rather, I do enjoy the time I spent on ranting things out of my own logic. The facts in these notes are the same as the ones in standard texts but the flow of logic must be mine. To prepare each note, I start by reading a chapter or two from a (non-)standard text, after which the author would review the content again and write down an outline for developing a note. The outline might be modified and even abandoned during the writting process, as details of a story might sound more intuitive if it is presented in a particular sequence. While writing the author definitely goes back to the text to assure himself nothing was missed, but the challenge is to avoid checking textbooks as much as possible. In order to relate the notes to quantum computing and quantum information, so extra arguments of quantum mechanics are made whenever the author finds it fit the big picture. As the final disclaimer, the posted notes were not examined with sufficient care so typos might not be uncommon. The author apologizes for that in advance just in case someone find them.