Learn about a useful interpretation of matrices that helps to understand matrix multiplication and determinants.
In the last article we covered matrix fundamentals, but matrices are much more than tables of numbers. That's why in this article we'll discuss a way to think about matrices visually. This perspective makes a lot of what initially seems difficult about matrices become intuitive to understand. We'll only need this perspective for square matrices in multivariable calculus, so we will limit ourselves to those here.
Matrices as movement
What is the action of a matrix? What does a matrix look like? These questions may seem nonsensical, but we will answer them both by visualizing how
matrices move the 2D plane.
Here is a drawing of the plane, along with the unit vectors
and , which stand for and .
Let's consider matrix
This is how the matrix acts upon the grid:
- The columns of the matrix tell us where it moves the unit vectors
and , which again stand for and .
- The rest of the grid follows accordingly, always keeping grid lines parallel and evenly spaced. The origin stays frozen in place.
moves and . Here's what that looks like:
The unit vector
didn't move because it started at . The unit vector moved upward one unit, and this dragged the grid with it. Notice that there's a faint copy of the original lines in the background to help us stay oriented.
Let's see the same process for another matrix.
moves and . That looks like this:
Here's a question to practice.
Suppose we have a matrix
What does the grid look like after we apply
To sum up, the action of a matrix is to move the entire grid. We can understand it by thinking about how it moves the unit vectors. We can visualize what this looks like by drawing a modified 2D grid.
These ideas extend into three dimensions as well. The third row of the matrix contains
-coordinates for all the unit vectors, and the third column of the matrix tells us where lands.
If you'd like, play around with matrices as movement with this interactive demonstration. Drag the vectors to make the grid move, and see the matrix that corresponds to the movement in the top left corner.
How matrices move vectors
We already know how a given matrix moves the unit vectors
and (just look at the columns), but how can we find where a matrix moves any arbitrary vector? Let's consider a specific example using the first matrix from the previous section.
move the non-unit vector ? Before anything, let's get a feel for this visually. First, the vector with no matrix movement:
Now, the vector after the matrix moves the grid:
The vector just comes along for the ride as the matrix moves the grid, ultimately landing on
. This is the essence of how matrices move vectors, which is formally called matrix-vector multiplication.
Now let's go over how we could calculate this. We represent
as a combination of the unit vectors by saying .
This combination remains the same after we apply
, but instead of using and , we use the result of applying to and :
Here's what the whole process looks like in symbols.
The critical step is when we break
into . That is when we are able to represent where lands in terms of where and land.
Let's try a couple practice questions.
move the vector in terms of where it moves and ?
move the vector ?
Like in the section above, the idea behind matrices moving vectors extends into three dimensions. We just decompose our vector into a sum of
, , and , then we use where the matrix takes all these unit vectors to find where it takes our vector.
Here's another interactive demonstration to play around with matrices moving vectors.
Matrix multiplication intuition (optional)
With the perspective of matrices as movement, we have the tools to understand what it means to multiply two matrices. The core idea is composition.
just means apply , then apply . When we apply second, we treat the transformed and as regular vectors getting moved by the way we learned in the previous section.
To calculate the end result, we follow
and along the two movements. First, takes and . Second, we find where takes these vectors:
Putting these in a matrix, we have the product. Notice that our calculations are reflected in the visual above.
To conclude, we can think of matrix multiplication as composing the movements each matrix represents. When we follow unit vectors along these movements, we can calculate the product.
As a challenge problem, try to derive the general formula for
matrix multiplication. Hint: follow and as moves them, then follow where moves the already transformed vectors.
As a bonus challenge problem, try to find the formula for multiplying
matrices. Hint: follow , , and .
Now that we have a solid grasp of how matrices move space, we are in a prime position to understand the final concept that we'll cover in this review series: the determinant.
Want to join the conversation?
- does the explanation for a general formula of 2x2 matrix multiplication skip steps of exzplanation?
"first we follow i-hat."
---> [1,0], sure
"the rightmost matrix moves it to (e,g)"
---> i agree that [e,g] moves [1,0] to [e,g]
"when the left matrix moves (e,g), we can think of it as moving the combination of e[1,0] + g[0,1]
---> why can we think of it like that? where did [0,1] come from? why does it look like j-hat is now in the picture when we agreed to only follow i-hat?(4 votes)
- That's a good question, Joseph. The reason [0, 1] comes into play is that the left matrix is now moving an entire vector, (e, g). The way matrix vector multiplication works is detailed in the section "How matrices move vectors." I hope this helps!(2 votes)
- I am the only one who breezed through everything (I already did calc II with vectors last semester)...until I got to matrices, and then I was like oh drat I learned this at one point...or is it just me...
That matrix notation is weird, the [3 onTopOf 2] etc
For my question, what on earth is going on in the last example? The Matrix multiplication intution?
Pretty lonely over here...(2 votes)
- Hi This is very good explanation. My question for section "How matrices move vectors" :
I understand the matrix A also moves co-ordinate i and j to new values. Say i2 and j2.
When A moves v1 = (1, 2) to v2 = (1, 3), the 1 and 3 are with respect to original co-ordinate i and j. With respect new co-ordinate i2 and j2 the value of v2 is still (1,2). Right?(1 vote)
- There is a typo:
Here's what the whole process looks like in symbols.
The first blue vector