Span and Linear Dependency

This note is based on Linear Algebra Done Right by Sheldon Axler and explains some points that confused me when reading Sec. 2.1 of the book. I claim no credit for the proofs shown here; this is just a compilation of what are, in my personal opinion, better ways to present the content of the book. Following the book's notation, we use $\mathbb{F}$ to denote either $\mathbb{R}$ or $\mathbb{C}$.

TOC

1. Definitions: Span and Linear Independence
2. Useful lemmas of linear (in)dependency
3. Interesting Examples of Using Linearly (In)Dependency Lemmas
4. Finite- and Infinite-dimensional vector space

1. Definitions: Span and Linear Independence

Definition 1

Let $v_1, v_2, \dots, v_m \in V$ be a list of vectors. The list spans the vector space $V$ when every element $v \in V$ can be written as a linear combination of the vectors in the list, i.e.,

$$V = \left\{ \sum_{i=1}^{m} a_i v_i : a_i \in \mathbb{F} \text{ for } i = 1, 2, \dots, m \right\}.$$

In other words,

$$V = \operatorname{span}(v_1, v_2, \dots, v_m).$$

Definition 1 represents each element of $V$ as a linear combination, i.e., a sum of products of a scalar and a vector. Definition 1 DOES NOT imply that the linear combination representing a given $v \in V$ is unique. To see this, let $v_1 = (1, 0, 0)$, $v_2 = (0, 1, 0)$, $v_3 = (1, 1, 0)$ and $v_4 = (0, 0, 1)$. It is easy to show that $\mathbb{R}^3 = \operatorname{span}(v_1, v_2, v_3, v_4)$. Now take the vector $v = (3, 3, 2)$. We have

$$v = 3 v_1 + 3 v_2 + 2 v_4 = 3 v_3 + 2 v_4.$$

So there are two different ways to express $v$ in terms of this spanning list of $\mathbb{R}^3$. This example raises a natural follow-up question, stated below.
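As a quick sanity check of the example, here is a minimal Python sketch that evaluates both linear combinations. The helper name `combine` is made up for this illustration and is not from the book.

```python
# Spanning list and target vector from the example above.
v1, v2, v3, v4 = (1, 0, 0), (0, 1, 0), (1, 1, 0), (0, 0, 1)
v = (3, 3, 2)


def combine(coeffs, vectors):
    """Return the linear combination sum_i coeffs[i] * vectors[i] (hypothetical helper)."""
    dim = len(vectors[0])
    return tuple(
        sum(a * vec[k] for a, vec in zip(coeffs, vectors)) for k in range(dim)
    )


spanning_list = [v1, v2, v3, v4]
first = combine([3, 3, 0, 2], spanning_list)   # 3 v1 + 3 v2 + 2 v4
second = combine([0, 0, 3, 2], spanning_list)  # 3 v3 + 2 v4

# Two different coefficient lists reproduce the same vector v.
print(first == v, second == v)  # prints: True True
```

The two coefficient lists differ by $(3, 3, -3, 0)$, which is a multiple of the relation $v_1 + v_2 - v_3 = 0$ among the spanning vectors.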

Question 1

Is there a collection of vectors $\{v_i\}$ in $V$ that gives a unique way to express every vector in $V$?

To answer this question, let's first introduce the concept of linear independence.

Definition 2

A list of vectors $v_1, v_2, \dots, v_m$ is called linearly independent if the only choice of $\{a_i\}$ that makes

$$a_1 v_1 + a_2 v_2 + \dots + a_m v_m = 0$$

is $a_1 = a_2 = \dots = a_m = 0$. The empty list is also considered to be linearly independent. A list that is not linearly independent is called linearly dependent.
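For vectors in $\mathbb{F}^n$ with rational entries, Definition 2 can be tested mechanically: the list is linearly independent exactly when its rank equals its length. The sketch below is one possible way to do this with exact arithmetic; the function names `rank` and `is_linearly_independent` are chosen for this note only.

```python
from fractions import Fraction


def rank(vectors):
    """Rank of a list of equal-length vectors, via exact Gaussian elimination."""
    rows = [[Fraction(x) for x in vec] for vec in vectors]
    n_cols = len(rows[0]) if rows else 0
    r = 0  # number of pivots found so far
    for col in range(n_cols):
        # Find a row at or below position r with a nonzero entry in this column.
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        # Eliminate this column from all rows below the pivot row.
        for i in range(r + 1, len(rows)):
            factor = rows[i][col] / rows[r][col]
            rows[i] = [x - factor * y for x, y in zip(rows[i], rows[r])]
        r += 1
    return r


def is_linearly_independent(vectors):
    """A list is independent exactly when its rank equals its length."""
    return rank(vectors) == len(vectors)


v1, v2, v3, v4 = (1, 0, 0), (0, 1, 0), (1, 1, 0), (0, 0, 1)
dependent = is_linearly_independent([v1, v2, v3, v4])  # False: v3 = v1 + v2
independent = is_linearly_independent([v1, v2, v4])    # True
```

With this convention the empty list has rank 0 and length 0, so it is reported as linearly independent, in agreement with Definition 2.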

Definition 2 leads to the following lemma, which relates linear independence to the uniqueness of representations.

Lemma 1

Every vector in $\operatorname{span}(v_1, \dots, v_m)$ can be expressed in a unique way as a linear combination of the list $\{v_i\}$ if and only if $\{v_i\}$ is linearly independent.

Proof: ($\Leftarrow$) Suppose $\{v_i\}$ is linearly independent, and suppose some vector has two different representations,

$$v = \sum_i a_i v_i = \sum_i b_i v_i, \quad \text{where } a_i \neq b_i \text{ for at least one } i.$$

Then

$$0 = \sum_i (a_i - b_i) v_i.$$

From Definition 2, we know $a_i - b_i = 0$ for every $i$, which contradicts the assumption that $a_i \neq b_i$ for some $i$. Thus, the representation must be unique.

($\Rightarrow$) Suppose $v = \sum_i a_i v_i$ and $0 = \sum_i c_i v_i$, so we have

$$v = \sum_i (a_i + c_i) v_i.$$

By the uniqueness of the representation, we must have $a_i + c_i = a_i$, which gives $c_i = 0$ for every $i$. From Definition 2, $\{v_i\}$ is linearly independent. □

Lemma 1 gives a partial answer to Question 1: the collection $\{v_i\}$ must consist of linearly independent vectors. To make sure that every $v \in V$ can be expressed by $\{v_i\}$, we must also have $V = \operatorname{span}(\{v_i\})$. A linearly independent list $\{v_i\}$ that spans a vector space $V$ is called a basis of $V$. A vector space that has a basis of finite length is called finite-dimensional, and infinite-dimensional otherwise.

In quantum computing, the quantum states (vectors) of a quantum system live only in subspaces of a much bigger (often infinite-dimensional) vector space. Physically, this means that the quantum system of interest usually has a finite number of pure states (eigenvectors that are known to us). The superposition (linear combination) of these pure states gives arbitrary states of the quantum system. With this in mind, the following theorem lays out a way to find the subspace that contains at least the quantum states (vectors) of interest.

Theorem 1

The span of a list of vectors $v_1, v_2, \dots, v_m$ in $V$ is the smallest subspace of $V$ containing all the vectors in the list.

Proof: Suppose $M$ is the smallest subspace of $V$ containing all the vectors in the list. Because

$$v_j = 0 v_1 + 0 v_2 + \dots + 1 v_j + \dots + 0 v_m,$$

we have $v_j \in \operatorname{span}(v_1, \dots, v_m)$ for every $j$. The span is itself a subspace of $V$, since it contains $0$ and is closed under addition and scalar multiplication. So the span is a subspace containing every $v_j$, and since $M$ is the smallest such subspace, $M \subseteq \operatorname{span}(v_1, \dots, v_m)$. Conversely, because $M$ is a subspace and $v_j \in M$ for every $j$, we have

$$a_1 v_1 + a_2 v_2 + \dots + a_m v_m \in M$$

for all $a_j \in \mathbb{F}$. Thus, all the elements of $\operatorname{span}(v_1, \dots, v_m)$ are also contained in $M$, i.e.,

$$M \supseteq \operatorname{span}(v_1, \dots, v_m).$$

It now follows that

$$M = \operatorname{span}(v_1, \dots, v_m).$$

□

2. Useful lemmas of linear (in)dependency

There are two lemmas that are repeatedly used in proofs of interesting properties of vector (sub)spaces, as listed below:

Lemma 2 (Linear Dependence Lemma)

Suppose $v_1, v_2, \dots, v_m$ is a linearly dependent list in $V$. Then there exists $j \in \{1, 2, \dots, m\}$ such that the following hold:

a) $v_j \in \operatorname{span}(v_1, \dots, v_{j-1})$;

b) if the $j^{\text{th}}$ term is removed from the list $v_1, \dots, v_m$, the span of the remaining list equals the span of the original list.

Proof: Because the list is linearly dependent, there exist coefficients $a_1, \dots, a_m \in \mathbb{F}$, not all zero, such that

$$0 = a_1 v_1 + \dots + a_m v_m.$$

Choose $j$ to be the largest index with $a_j \neq 0$. Then

$$v_j = -\frac{a_1}{a_j} v_1 - \frac{a_2}{a_j} v_2 - \dots - \frac{a_{j-1}}{a_j} v_{j-1} \in \operatorname{span}(v_1, \dots, v_{j-1}).$$

This proves a). (If $j = 1$, this says $v_1 = 0$, which lies in the span of the empty list, $\{0\}$.) To prove b), notice that an arbitrary vector $u \in \operatorname{span}(v_1, v_2, \dots, v_m)$ can be written as

$$u = c_1 v_1 + \dots + c_m v_m.$$

We can replace $v_j$ in this expression with the combination of $v_1, \dots, v_{j-1}$ given by the second equation above. This shows that $u$ is in the span of the list without $v_j$. □
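Part a) of the Linear Dependence Lemma can be explored numerically. The sketch below scans a list of rational vectors from left to right and returns the smallest index $j$ for which $v_j$ lies in the span of the vectors before it. Note that this selects $j$ differently from the proof, which takes the largest index with a nonzero coefficient in one particular relation; any $j$ satisfying a) also satisfies b). The function names are made up for this note.

```python
from fractions import Fraction


def reduce_against(vec, basis):
    """Subtract multiples of the stored rows so that vec vanishes in their pivot columns."""
    vec = [Fraction(x) for x in vec]
    for pivot_col, row in basis:
        if vec[pivot_col] != 0:
            factor = vec[pivot_col] / row[pivot_col]
            vec = [x - factor * y for x, y in zip(vec, row)]
    return vec


def first_redundant_index(vectors):
    """Return the smallest 1-based j with v_j in span(v_1, ..., v_{j-1}), or None."""
    basis = []  # (pivot column, reduced row) pairs built from the earlier vectors
    for j, vec in enumerate(vectors, start=1):
        residual = reduce_against(vec, basis)
        pivot_col = next((k for k, x in enumerate(residual) if x != 0), None)
        if pivot_col is None:
            return j  # v_j reduced to zero, so it lies in the span of earlier vectors
        basis.append((pivot_col, residual))
    return None  # no such j: the list is linearly independent


vectors = [(1, 0, 0), (0, 1, 0), (1, 1, 0), (0, 0, 1)]
j = first_redundant_index(vectors)                       # 3, since v3 = v1 + v2
remaining = vectors[: j - 1] + vectors[j:]               # the list with v_j removed
still_redundant = first_redundant_index(remaining)       # None: nothing left to remove
```

A list starting with the zero vector returns $j = 1$, matching the convention that the span of the empty list is $\{0\}$.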

Lemma 3 (Length of linearly independent list $\leq$ length of spanning list)

In a finite-dimensional vector space, the length of every linearly independent list of vectors is less than or equal to the length of every spanning list of vectors.

Proof (Axler's proof with slight modification): Suppose $u_1, \dots, u_m$ is linearly independent in $V$, and $w_1, \dots, w_n$ spans $V$. To show $m \leq n$, we replace the $w$'s with the $u$'s step by step.

Step 1: Let $B = \{w_i\}$. Adding $u_1$ to the front of $B$ gives a linearly dependent list, because $u_1$ can be represented by the vectors in $B$. The vector singled out by Lemma 2 cannot be $u_1$ itself, since $u_1 \neq 0$, so it is one of the $w$'s. Thus, by Lemma 2, we can remove one of the $w$'s from $B$ so that the remaining vectors still span $V$.

Step j: The list $B$ from step $j - 1$ spans $V$ and consists of $u_1, \dots, u_{j-1}$ followed by some of the $w$'s. Adding $u_j$ right after $u_{j-1}$ again gives a linearly dependent list. Because $u_1, \dots, u_j$ is linearly independent, the vector singled out by Lemma 2 cannot be one of the $u$'s. It must be one of the $w$'s, which is then a linear combination of $u_1, \dots, u_j$ and the $w$'s preceding it. Removing this $w$ leaves a list that still spans $V$.

At this point, we need to make sure that there are enough $w$'s left for the replacement process to continue. It turns out there always are! Assume, to the contrary, that no $w$ remains after step $k$ for some $k < m$, so that $u_{k+1}, \dots, u_m$ are left out. Then $u_1, u_2, \dots, u_k$ spans $V$. Since $u_{k+1} \in V$, we have

$$u_{k+1} = a_1 u_1 + \dots + a_k u_k,$$

which contradicts the linear independence of $\{u_i\}$. Therefore, there is always a $w$ in $B$ available for replacement until we finish step $m$.

Step m: We have added all the $u$'s to the list, which still spans $V$. Because each of the $m$ steps removes one $w$ from $B$, there were at least $m$ $w$'s in $B$ to begin with, i.e., $m \leq n$. □

3. Interesting Examples of Using Linearly (In)Dependency Lemmas

We first introduce an example that uses the concept of linear dependence to make the proof easier.

Lemma 4

Suppose $v_1, \dots, v_m$ is linearly independent in $V$ and $w \in V$. Then $v_1, \dots, v_m, w$ is linearly independent if and only if $w \notin \operatorname{span}(v_1, \dots, v_m)$.

Proof: We first notice that Lemma 4 can be rephrased as: $v_1, \dots, v_m, w$ is linearly dependent if and only if $w \in \operatorname{span}(v_1, \dots, v_m)$.

($\Rightarrow$) Because $v_1, \dots, v_m, w$ is linearly dependent, the equation

$$a_1 v_1 + a_2 v_2 + \dots + a_m v_m + b w = 0$$

has a solution in which not all the coefficients are zero. If $b = 0$, then $a_1 v_1 + a_2 v_2 + \dots + a_m v_m = 0$, and the linear independence of $v_1, \dots, v_m$ forces $a_i = 0$ for every $i$, contradicting the fact that not all coefficients are zero. Thus $b \neq 0$, and

$$w = -\frac{1}{b} \sum_{i=1}^{m} a_i v_i,$$

from which it follows that $w \in \operatorname{span}(v_1, \dots, v_m)$.

($\Leftarrow$) Because $w \in \operatorname{span}(v_1, \dots, v_m)$, we can write $w = b_1 v_1 + \dots + b_m v_m$. Then

$$b_1 v_1 + \dots + b_m v_m - w = 0$$

is a linear combination equal to zero in which the coefficient of $w$ is $-1 \neq 0$, so by Definition 2 the list $v_1, \dots, v_m, w$ is linearly dependent. □

The following two solutions show how we can use Lemma 3 to decide whether a list of vectors can be linearly independent in, or can span, a given vector space.

Question: Explain why there does not exist a list of six polynomials that is linearly independent in $\mathcal{P}_4(\mathbb{F})$.

Solution: $\mathcal{P}_m(\mathbb{F})$ denotes the set of all polynomials with coefficients in $\mathbb{F}$ and degree at most $m$, so $\mathcal{P}_m(\mathbb{F}) = \operatorname{span}(1, z, \dots, z^m)$. In particular, $\mathcal{P}_4(\mathbb{F}) = \operatorname{span}(1, z, z^2, z^3, z^4)$, which is a spanning list of length 5. By Lemma 3, a linearly independent list cannot be longer than a spanning list, so no linearly independent list in $\mathcal{P}_4(\mathbb{F})$ has length six.

Question: Explain why no list of four polynomials spans $\mathcal{P}_4(\mathbb{F})$.

Solution: The list $1, z, z^2, z^3, z^4$ is linearly independent in $\mathcal{P}_4(\mathbb{F})$ and has length 5. If a list of four polynomials spanned $\mathcal{P}_4(\mathbb{F})$, then this linearly independent list (length 5) would be longer than a spanning list (length 4), contradicting Lemma 3. So no list of four polynomials spans $\mathcal{P}_4(\mathbb{F})$.
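The six-polynomial question can also be illustrated concretely. In the sketch below, each polynomial in $\mathcal{P}_4$ is stored as its coefficient vector with respect to $1, z, z^2, z^3, z^4$, and an elimination pass tracks how each reduced vector is built from the inputs, so that it can return an explicit nontrivial linear relation. The six polynomials and the function name `find_dependency` are arbitrary choices made for this illustration.

```python
from fractions import Fraction


def find_dependency(vectors):
    """Return coefficients c (not all zero) with sum_i c[i] * vectors[i] == 0, or None."""
    n = len(vectors)
    basis = []  # entries: (pivot column, reduced vector, its coefficients in the inputs)
    for i, vec in enumerate(vectors):
        residual = [Fraction(x) for x in vec]
        coeffs = [Fraction(0)] * n
        coeffs[i] = Fraction(1)  # residual starts out as vectors[i] itself
        for pivot_col, row, row_coeffs in basis:
            if residual[pivot_col] != 0:
                factor = residual[pivot_col] / row[pivot_col]
                residual = [x - factor * y for x, y in zip(residual, row)]
                coeffs = [c - factor * d for c, d in zip(coeffs, row_coeffs)]
        pivot_col = next((k for k, x in enumerate(residual) if x != 0), None)
        if pivot_col is None:
            return coeffs  # coeffs[i] == 1, so the relation is nontrivial
        basis.append((pivot_col, residual, coeffs))
    return None  # the list is linearly independent


# Six polynomials in P_4, each stored as its coefficients of (1, z, z^2, z^3, z^4).
polys = [
    (1, 0, 0, 0, 0),  # 1
    (1, 1, 0, 0, 0),  # 1 + z
    (0, 1, 1, 0, 0),  # z + z^2
    (0, 0, 1, 1, 0),  # z^2 + z^3
    (0, 0, 0, 1, 1),  # z^3 + z^4
    (2, 0, 0, 0, 3),  # 2 + 3 z^4
]
relation = find_dependency(polys)

# Check that the returned coefficients really combine the six polynomials to zero.
combined = [sum(c * p[k] for c, p in zip(relation, polys)) for k in range(5)]
print(all(x == 0 for x in combined))  # prints: True
```

Coefficient vectors in $\mathcal{P}_4$ have only five entries, so the elimination can find at most five pivots and the sixth vector is forced to reduce to zero, which is the computational counterpart of Lemma 3.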

4. Finite- and Infinite-dimensional vector space

We don't care about infinite-dimensional vector spaces in quantum computing, because we cannot have an infinite number of qubits after all. But we introduce the concept here anyway for completeness.

Definition 3

A vector space is called finite-dimensional if some list of finitely many vectors spans the space. Otherwise, it is infinite-dimensional.

Definition 3 is related to linear dependency through the following Lemma:

Lemma 5

$V$ is infinite-dimensional if and only if there is a sequence $v_1, v_2, \dots \in V$ such that $v_1, \dots, v_m$ is linearly independent for every positive integer $m$.

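Proof: ($\Rightarrow$) We build the sequence by induction. Because $V$ is infinite-dimensional, $V \neq \{0\}$, so we can choose a nonzero $v_1 \in V$, and the list $v_1$ is linearly independent. Now suppose $v_1, v_2, \dots, v_m \in V$ is linearly independent. Because $V$ is infinite-dimensional, this finite list does not span $V$, so there must exist some $v_{m+1} \in V$ such that $v_{m+1} \notin \operatorname{span}(v_1, v_2, \dots, v_m)$. From Lemma 4, $v_1, v_2, \dots, v_m, v_{m+1}$ is linearly independent. Because $V$ is infinite-dimensional, we can keep adding to the list a new vector that does not belong to the subspace spanned by the old list. Thus, the list $v_1, v_2, \dots, v_m$ is linearly independent for every positive integer $m$.

($\Leftarrow$) Conversely, suppose such a sequence exists but $V$ is finite-dimensional, spanned by some list of length $n$. Then $v_1, \dots, v_{n+1}$ would be a linearly independent list longer than a spanning list, contradicting Lemma 3. Hence $V$ is infinite-dimensional. □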