This note is based on Linear Done Right by Sheldon Axler, explaining some aspects that confuse me when reading Sec. 2.1 of the book. I personally don't own any credit for the proofs showing here. This is just a compilation of better ways(in my personal opinion) to present the content in the book. By following the same nomenclature in the book, we use F to represent either R or C. TOC1. Definitions: Span and Linear Independence 2. Useful lemmas of linear (in)dependency 3. Interesting Examples of Using Linearly (In)Dependency Lemmas 4. Finite- and Infinite-dimensional vector space 1. Definitions: Span and Linear Independence
Definition1Let a list of vectors, v1,v2,...,vmV. The list spans the vector space V, when every element vV can be represented by the vectors in the list, i.e.,V={v=imaivi:viV,aiF,fori=1,2,3...,m}In other words, V=span(v1,v2,...,vm).
The Definition 1 represents element in V using linear combination, i.e., sum of products of scalar and vector. The Definition 1 DOES NOT implies the uniqueness of the linear combination for vV. To see this, let v1=(1,0,0),v2=(0,1,0),v3=(1,1,0) and v4=(0,0,1), and it's easy to show that R3=span(v1,v2,v3,v4). Suppose we have another vector, v=(3,3,2), we havev=3v1+3v2+2v4=3v3+2v4So we have two different ways to present v in the vector space R3. With this example A natural follow-up question would be,
Question
1
Is there a collection of vector {vi} in V that has an unique way to express any vector in V if possible?
To answer this question, let's first introduce the concept of linear independence.
Definition2A list of vectors v1,v2,...,vm is called linearly independent if the only choice of {ai} that makes a1v1+a2v2++amvm=0 is a1=a2==am=0. The empty list is also considered to be linearly independent.
The Definition 2 indicates the following lemma that relates to the unique way of expressing vectors.
Lemma1If a vector can be uniquely expressed by a list {vi}of vectors, then {vi} are linearly independent.
Proof: () Suppose {vi} are linearly independent, and we have v=aivi=bivi,whereaibithen, 0=(ai-bi)viFrom Definition 2, we know ai-bi0, which contradicts with the condition aibi. Thus, the expression must be unique.() Suppose v=aivi and 0=civi, so we havev=(ai+ci)viBy the uniqueness of the expression, we must have ai+ci=ai, which gives ci=0. From Definition 2, {vi} are linearly independent. Lemma 1 tells partial answer to the Question 1, i.e., the collection {vi} must be made of linear independent vectors in the space. To make sure any vV can be expressed by {vi}, we must have V=span({vi}). The linearly independent vectors {vi} that span a vector space V is called the bases of V. A vector space that has finite number of bases is called finite-dimentional, and infinite-dimensional otherwise. In quantum computing, quantum states(vectors) of a quantum system live only in subspaces of a much bigger(often infinite-dimensional) vector space. Physically, this means that the quantum system of our interest usually has finite number of pure states(eigenvectors that are known to us). The superposition(linear combination) of these pure states gives arbitrary states of the quantum system. With this in mind, the following theorem lays out a way to find the subspace that contains at least the quantum states(vectors) of our interest.
Theorem1The span of a list of vectors in V is the smallest subspace of V containing all the vectors in the list of v1,v2,...,vm.
Proof:Suppose M is the smallest subspace of V containing all the vectors in the list. Because vj=0v1+0v2++1vj++0vmwe have vjspan(v1,...,vm), and Mspan(v1,...,vm). Also, M is a subspace and vjM, we have, a1v1+a2v2++amvmMfor all ajF. Thus, all the elements in span(v1,...,vm) are also contained in M, i.e.Mspan(v1,...,vm).It now follows that M=span(v1,...,vm) 2. Useful lemmas of linear (in)dependency There are two lemmas that are repeatedly used in proofs of interesting properties of vector (sub)spaces, as listed below:
Lemma2Linear Dependence LemmaSuppose v1,v2,...,vm is a linearly dependent list of V. Then there exists j{1,2,3,...,m} such that the following hold:a) vjspan(v1,...,vj-1);b) if the jth term is removed from {vj} list, the span of the remaining list equal original span.
Proof:The linear dependency of the list results in some of the coefficients ajF in the equation below are nonzero:0=a1v1++amvmchossing j to be the largest number that has aj0. Then,vj=-a1ajv1-a2ajv2--aj-1ajvj-1span(v1,...,vj-1)This proves a). To prove b), we notice that for arbitrary vector uspan(v1,v2,...,vm), we haveu=c1v1++cmvmWe can replace vj with an expression of other vectors using the second equation above. This shows that u is in the span of the list without vj.
Lemma3Length of linearly independent list length of spanning listIn a finite-dimensional vector space, the length of every linearly independent vector is less than or equal to the length of every spanning list of vectors
Proof (Axler's proof with slight modification):Suppose u1,...,um is linearly independent in V, and w1,...,wn spans V. To show mn, we replace wj with ui step by step. Step 1: Let B={wi}, and adding u1 in B gives a list of linearly dependent vectors, as ui can be represented by vectors in B. Thus, by Lemma 2, we can remove one of the vectors in B so that the remaining vectors still span V. Step j: The list B from step j-1 spans V. And the list now contains u1,...,uj-1 and some wi. Because {uj} are linearly independent. By Lemma 2, there must be a vector in {wi} that can be represented as a linear combination of {uj} and the rest of wj. At this point, we need to assure there is enough w's for us to proceed this replacement process. It turns out it must be! Assume uk+1,...,um are left out after the replacement at the step k. Then, u1,u2,...,uk span V now. Since uk+1V, we have uk+1=a1u1++akuk with some of {ai} being nonzero, causing a contradiction to the condition of {ui} being linearly independent. Therefore, we should always have vectors in B waiting for replacement before we finish the step m. Step m: We have added all the u's in to the list that spans V. Because at each step we move one w out of B, there are at least m w's in B. 3. Interesting Examples of Using Linearly (In)Dependency LemmasWe first introduce an example that uses the concept of linearly dependent to make the proof easier.
Lemma4Suppose v1,...,vm is linearly independent in V and wV. Then v1,...,vm,w is linearly independent if and only ifwspan(v1,...,vm)
Proof:We first notice that Lemma 4 can be rephrased as v1,...,vm,w is linearly dependent if and only if wspan(v1,...,vm). () v1,...,vm,w is linear dependent, a1v1+a2v2++bw=0 does not have all the cofficients being zero. If b=0, then a1v1+a2v2++amvm=0 and ai0, contradicting with the linear dependency. Thus b0, and w=-1bi=1maiviwhich follows wspan({vi}).() wspan(v1,...,vm),w=b1v1++bmvm. According to Lemma 4, we have v1,...,vm,w being linearly dependent. The following two proofs show how we can use Lemma 3 to identify whether a vector list spans vector spaces.
Question
2
: Explain why there does not exist a list of six polynomials that is linearly independent in P4(F)
Solution: Pm(F) denotes the set of all polynomials with coefficients in F and degree at most m. Also, Pm(F)=span(1,z,...,zm). Notice that P4(F)=span(1,z,z2,z3,z4), from Lemma 3 the length of linearly independent vector list cannot longer than the length of spanning vector list.
Question
3
: Explain why no list of four polynomials spans P4(F).
Solution: P4(F)=span(1,z,...,z4)length of spanning vector list is 5. If there is a four-polynomial list spanning P4(F), then the length of linearly independent list (e.g.,1,z,z3,z4) is larger than the length of spanning list (e.g., 1,z,z2,z3,z4), contradicting Lemma 3. So no list of four polynomials spans P4(F).
4. Finite- and Infinite-dimensional vector spaceWe don't care about infinite-dimensional vector space in quantum computing, because we cannot have infinite number of qubits after all. But we introduce its concept here anyway for completeness.
Definition3A vector space is called finite-dimensional if a list of finite number of vectors spans the space. Otherwise, it is infinite-dimensional.
Definition 3 is related to linear dependency through the following Lemma:
Lemma5V is infinite-dimensional if and only if there is a sequence of v1,v2,...V such that v1,...,vm is linearly independent for every positive integer m.
Proof: By induction if v1,v2,,vmV is linearly independent, because V is infinite-dimensional, there must exist some vm+1V such that vm+1span(v1,v2,v3,...vm). From Lemma 2, v1,v2,v3,...,vm,vm+1 are linearly independent. Because V is infinite-dimensional, we can keep adding new vector into the list that does not belong to the subspaces spanning by the old list. Thus, we have the list of v1,v2,v3,...,vm being linearly independent for any positive interger m.