Main content
Algebra (all content)
Course: Algebra (all content) > Unit 20
Lesson 10: Matrices as transformations- Transforming vectors using matrices
- Use matrices to transform 3D and 4D vectors
- Transforming polygons using matrices
- Transform polygons using matrices
- Matrices as transformations
- Matrix from visual representation of transformation
- Visual representation of transformation from matrix
- Understand matrices as transformations of the plane
© 2023 Khan AcademyTerms of usePrivacy PolicyCookie Notice
Matrices as transformations
Learn how exactly 2x2 matrices act as transformations of the plane.
Introduction
If we think about a matrix as a transformation of space it can lead to a deeper understanding of matrix operations. This viewpoint helps motivate how we define matrix operations like multiplication, and, it gives us a nice excuse to
draw pretty pictures. This material touches on linear algebra (usually a college topic).
Multiplication as a transformation
The idea of a "transformation" can seem more complicated than it really is at first, so before diving into how matrices transform -dimensional space, or how matrices transform -dimensional space, let's go over how plain old numbers (a.k.a. matrices) can be considered transformations of -dimensional space.
" -dimensional space" is simply the number line.
What happens when you multiply every number on the line by a particular value, like ? One way to visualize this is as follows:
We keep a copy of the original line for reference, then slide each number on the line to times that number.
Similarly, multiplication by could be visualized like this:
And so that negative numbers don't feel neglected, here is multiplication by :
For those of you fond of fancy terminology, these animated actions could be described as "Linear transformations of -dimensional space". The word “transformation” means the same thing as “function”: something which takes in a number and outputs a number, like . However, while we typically visualize functions with their graphs, people tend to use the word “transformation” to indicate that you should instead visualize some object moving, stretching, squishing, etc. So the function visualized as a transformation gives us the "Multiplication by " video above. It moves the point on the number line to where starts off, moves to where starts off, etc.
Before we move on to -dimensional space, there's one simple but important fact we should keep in the back of our minds. Suppose you watch one of these transformations, knowing that it's multiplication by some number, but without knowing what that number is, like this one:
You can easily figure out which number is being multiplied into the line by . In this case, lands where started off, so you can tell that the animation represents multiplication by .
What do linear transformations in dimensions look like?
A -dimensional linear transformation is a special kind of function which takes in a -dimensional vector and outputs another -dimensional vector. As before, our use of the word “transformation” indicates we should think about smooshing something around, which in this case is -dimensional space. Here are some examples:
For our purposes, what makes a transformation linear is the following geometric rule: The origin must remain fixed, and all lines must remain lines. So all the transforms in the above animation are examples, but the following are not:
Following specific vectors during a transformation
Imagine you are watching one particular transformation, like this one
How could you describe this to a friend who is not watching the same animation? You can no longer describe it using a single number, the way we could just follow the number in the one dimensional case. To help keep track of everything, let's put a green arrow over the vector
,
put a red arrow over the vector
,
and fix a copy of the grid in the background.
Now it's a lot easier to see where things land. For example, watch the animation again, and focus on the vector , we can more easily follow it to see that it lands on the vector .
We can represent this fact with the following notation:
Notice, a vector like , which starts off as times the green arrow, continues to be times the green arrow after the transformation. Since the green arrow lands on , we can deduce that
And in general
Similarly, the destination of the entire -axis is determined by where the red arrow
lands, which for this transformation is .
In fact, once we know where
and
land, we can deduce where every point on the plane must go. For example, let's follow the point
in our animation:
It starts at times the green arrow plus times the red arrow, but it also ends at times the green arrow plus times the red arrow, which after the transformation means
This ability to break up a vector in terms of its components both before and after the transformation is what's so special about linear transformations.
Representing two dimensional linear transforms with matrices
In general, since each vector
can be broken down as
If the green arrow
lands on some vector
,
and the red arrow
lands on some vector
,
then the vector
must land on
A really nice way to describe all this is to represent a given linear transform with the matrix
where the first column tells us where
lands and the second column tells us where
lands. Now we can describe where any vector
lands very compactly as the matrix-vector product
In fact, this is where the definition of a matrix-vector product comes from.
So in the same way that -dimensional linear transforms could be described as multiplication by some number, namely whichever number lands on top of, -dimensional linear transforms can always be described by a matrix, namely the one whose first column indicates where lands, and whose second column indicates where lands.
Want to join the conversation?
- The specific 2d linear transformation with background goes by a little fast. Can you slow it down so I can follow what is going on?(24 votes)
- You can go frame by frame on youtube videos by pressing period (.) to go to the next frame and comma (,) to go to the previous frame. All this with video paused.(53 votes)
- "Practice Problem: Even though it has gone off screen, can you predict where the point 3,0 has landed?"
No. No, I can't. Even if it was all on-screen, it moves so fast. Yes, (1,1) goes to (4,-2), but I can only see this because there's a grid underneath the overlay of the transformation.
I see nowhere on Khan where it's explained how to just "know" what the transformation is or how to calculate it backwards without having coordinates. Okay, set the video to slow-motion and pause it, and decipher the coordinates.. Good. Now, how do we know what the transformation is?
I aced matrices up until the last problem set of this section, where you aren't given data for a transformation. Almost zero instruction on a topic hugely removed from the rest of mathematics.(13 votes)- You need to look at the now transformed grid lines. It's actually super easy if you look at the vectors <1,0> and <0,1> specifically.
<1,0> shifts to <1, -2> so now any x component in a vector gets transformed so it is however long it was multiplied by <1, -2> so for instance <1, 0> goes to <1, -2>, <2,0> would go to <2, -4> and so on.
This is only the x component though, so we want to look at y as well. y goes from <0, 1> to <3,0> so <0, 2> would go to <6,0> and so on.
Now, we can put them all together, let's use the one in particular it asks for.
<3,0> only has an x component so it's actually easy. I will showhow to do this with <1,1> after. but <3,0> goes to 3*<1, -2> = <3, -6>.
Since you can see <1,1> it should be easy to see. Anyway, if we pretended we couldn't see <1,1> we would just multiply each component by its transformation. so 1*<1, -2> + 1*<3, 0> = <1, -2> + <3, 0> = <4, -2>
<2, 2> would be similar
2*<1, -2> + 2<3,0>
<2, -4> + <6,0>
<8, -4>
Let me know if this didn't help, to summarize though look at how <1,0> and <0,1> initially transform then you can solve for any other vector/ point.(23 votes)
- Why does the two vectors that when transform, can deduce where others go have to be (1 0) and
(0 1)? Why can't there be any ones?(5 votes)- They don't. They just have to be two vectors that aren't parallel. But since we could use any pair of (non-parallel) vectors, it makes sense to use the two simplest vectors we can think of -- i.e., (1 0)) and (0 1).(21 votes)
- Where do
f(v+w)=f(v)+f(w)
andf(cx)=cf(x)
proprities come from ?(6 votes)- Algebra. They are the distributive property.(9 votes)
- If the reality is all about transformations, why do they teach us normal algebra like a set of rules?
At least for me, visualization helps to understand concepts quickly.I was always curious to know why different graphs have different curves.I guess 'transformation of points' is the answer to that.(8 votes)- Reality is we do math not because it gives us a lot of knowledge. But, it's actually the easiest language your brain can understand. It's like the code for a computer.
But, today world is creating new stuff which could be done using math stuff. Our brain is very curious. That's why we study math. You would never can experience the complex numbers in real life. But, why do we study them? Cause math sharpens brain. Think of it like a computer. If a computer can understand as many languages as possible, it's more efficient. Same with math, and math is big branch of subjects. And each subject can be referred as language. This is my theory and I love math. Probably, I'm the best lover of math!!(3 votes)
- For those of you who have a hard time with the idea of matrices as transformations, I suggest you try watching 3b1b's Essence of Linear Algebra series, it clears a lot of fundamental questions about matrix operations I often see in this site, such as why Identity matrices are such, why columns have to match rows for matrix multiplication and almost any question you can think of. (well, most of it, isomorphism is really difficult)(8 votes)
- Thank you so much! I was stuck with this article and started to watch 3b1b. It really helped.(0 votes)
- Perhaps linear algebra should not be a "wait-for-college" topic. Maybe it should be taught in conjunction with different coordinate systems much earlier. Do you know of anyone teaching along these lines?(5 votes)
- Why do we put practices before the explanation? That makes no sense.(5 votes)
- A complex number p = a + b∙i can be thought of as a vector in complex space p = [a b], and therefore a linear transformation by a 2x2 matrix T on the vector p would be
p * T = s
I show this sequence since originally I learned these complex vectors as row-oriented, and they are easier to conceptualize relative to rectangular and polar forms of complex numbers.
While I understand modifying the notation presented in this review involves trivial matrix transposition:
v = p'
A = T'
such that A * v = s'
I am curious if anyone else prefers to conceptualize linear transforms of 2-space vectors in the above way?(3 votes)- It's always wise to consider concepts of mathematics through the eyes of a different field of mathematics. One of the greatest achievements of mathematics is to combine geometry and algebra together, because it gave us the first possibilities to extend concepts from reality.
A simple example: Algebra allowed lines to go on forever by expressing them with symbols, which in turn made it possible to find possible points of intersections. With just Geometry it would require an infinitely long piece of paper to determine if two lines never intersect.(5 votes)
- It was so confusion and amazing. Thanks to this article, I finally understood.(3 votes)