Rank-Nullity Theorem
Recall from the previous section that two matrices and are equivalent if there exist invertible and such that
The objective of this chapter is to classify all matrices of a given size up to equivalence.
Fundamental Subspaces: Kernel and Range
In order to begin talking about this, we need to talk about two fundamental subspaces associated with a linear function , the range and the kernel. But first let us properly define the image and pre-image of a vector space.
Definition: The image of a vector subspace under a linear function , denoted , is the set of all elements of which can be written
for some . Note the image of a vector space is also a vector space.
Definition: The preimage of a vector subspace under a linear function , denoted , is the set of all elements of which map to , i.e.,
The preimage of a vector space is also a vector space.
With these definitions, the most important image and preimage of a function are the range and kernel,
Definition: The range of a linear function , denoted , is the image of under .
Definition: The kernel of a linear function , denoted , is the preimage of the trivial subspace under . That is, it is the set of all elements that map to zero (i.e., "annihilated" by ).
Both of these are fundamental subspaces that determine the structure of . In particular, their dimensions are important properties of ,
Definition: The rank of a linear function , denoted , is the dimension of its range.
Definition: The nullity of a linear function , denoted is the dimension of its kernel.
The key to rank-nullity theorem is to recognize that essentially acts as a "filter" on the algebraic structure of , it annihilates some parts of the structure --- the structure contained within the kernel space , but preserves the remaining structure and embeds it into the range space . The idea of the rank-nullity theorem is that if one "glues" the structure of the kernel and the range together, we can "recover" the structure of the original domain . More precisely, this means that the dimensions of and should sum to the dimension original space . This is the statement of rank-nullity theorem,
Theorem (Rank-Nullity): For any linear function ,
Moreover, any matrix representation of is equivalent to
Where above is the identity of size and is the zero matrix of size .
Proof of Theorem
The proof of this statement is not particularly difficult. It just requires finding the "right" basis for . To begin, let be a basis for . Note that we can expand any linearly independent set with dimension less than to a linearly independent set with one more element by simply selecting an element from to add to it (you can verify for yourself that the resulting set is still linearly independent). This allows us to expand the basis for to a basis for all of . The core part of the proof is now to show that are a basis for . If this is the case, then the first part of the theorem follows automatically since then , , and .
First, we note that must span the range of . This is because anything in the range of can be written as
where we have used the fact that by definition of the kernel. Now, in order to show that is a basis, we need linear independence. To show this, suppose for contradiction that there exist nontrivial such that
Then, by linearity,
But this means that must be a member of the kernel of , thus, there must exist such that
But this contradicts the assumption that the full basis is linearly independent since we have just found a nonzero set of coordinates that gives the zero vector,
Thus, must be a basis for the range of .
Now that we have this basis, we can easily prove the second part of the theorem as well. Let . Expand to a basis for all of . Then we have that
Let be an arbitrary array in . We multiply each equation by the corresponding entry of ,
Summing all these equalities and letting and , we rewrite the above in matrix form,
Hence,
From the discussion of matrix equivalence in the previous section, we therefore have the second part of the theorem.