Skip to main content
Logo image

Section 8.1 Linearization, critical points, and equilibria

Note: 1 lecture, §6.1–§6.2 in [EP], §9.2–§9.3 in [BD]
Except for a few brief detours in Chapter 1, we considered mostly linear equations. Linear equations suffice in many applications, but in reality most phenomena require nonlinear equations. Nonlinear equations, however, are notoriously more difficult to understand than linear ones, and many strange new phenomena appear when we allow our equations to be nonlinear. Not to worry, we did not waste all this time studying linear equations. Nonlinear equations can often be approximated by linear ones if we only need a solution “locally”—only for a short period of time or only for certain parameters. Understanding linear equations can also give us qualitative understanding about a more general nonlinear problem. The idea is similar to what you did in calculus in trying to approximate a function by a line with the right slope.
In Section 2.4 we looked at the pendulum of length \(L\text{.}\) The goal was to solve for the angle \(\theta(t)\) as a function of the time \(t\text{.}\) The equation for the setup is the nonlinear equation
\begin{equation} \theta'' + \frac{g}{L} \sin \theta = 0 . \end{equation}
Instead of solving this equation, we solved the rather easier linear equation
\begin{equation} \theta'' + \frac{g}{L} \theta = 0 . \end{equation}
While the solution to the linear equation is not exactly what we were looking for, it is rather close to the original, as long as the angle \(\theta\) is small and the time period involved is short.
You might ask: Why don’t we just solve the nonlinear problem? Well, it might be very difficult, impractical, or impossible to solve analytically, depending on the equation in question. We may not even be interested in the actual solution, we might only be interested in some qualitative idea of what the solution is doing. For example, what happens as time goes to infinity?

Subsection 8.1.1 Autonomous systems and phase plane analysis

We restrict our attention to a two-dimensional autonomous system
\begin{equation} x' = f(x,y) , \qquad y' = g(x,y) , \end{equation}
where \(f(x,y)\) and \(g(x,y)\) are functions of two variables, and the derivatives are taken with respect to time \(t\text{.}\) Solutions are functions \(x(t)\) and \(y(t)\) such that
\begin{equation} x'(t) = f\bigl(x(t),y(t)\bigr), \qquad y'(t) = g\bigl(x(t),y(t)\bigr) . \end{equation}
The way we will analyze the system is very similar to Section 1.6, where we studied a single autonomous equation. The ideas in two dimensions are the same, but the behavior can be far more complicated.
It may be best to think of the system of equations as the single vector equation
\begin{equation} \begin{bmatrix} x \\ y \end{bmatrix} ' = \begin{bmatrix} f(x,y) \\ g(x,y) \end{bmatrix} .\tag{8.1} \end{equation}
As in Section 3.1, we draw the phase portrait (or phase diagram), where each point \((x,y)\) corresponds to a specific state of the system. We draw the vector field given at each point \((x,y)\) by the vector \(\left[ \begin{smallmatrix} f(x,y) \\ g(x,y) \end{smallmatrix} \right]\text{.}\) And as before if we find solutions, we draw the trajectories by plotting all points \(\bigl(x(t),y(t)\bigr)\) for a certain range of \(t\text{.}\)

Example 8.1.1.

Consider the second-order equation \(x''=-x+x^2\text{.}\) Write this equation as a first-order nonlinear system
\begin{equation} x' = y , \qquad y' = -x+x^2 . \end{equation}
The phase portrait with some trajectories is drawn in Figure 8.1.

A direction field plot (a grid of arrows) in the xy-plane. In the center, the arrows go around the critical point at the origin in a clockwise direction. Then at the point (1,0) the direction field is a saddle point, that is, the arrows along a diagonally down sloped curve go towards the critical point and in a diagonally upwards sloped curve go away from the critical point. To the left of the critical point, the arrows go roughly down, above the critical point the arrows go roughly rightwards, on the right of the critical point the arrows point roughly upwards, and below the critical point the arrows point roughly leftwards. The arrows appear longer the further away from the critical point they are. Several trajectories are also drawn. Around the critical point at (0,0) there are several egg-shaped trajectories with the pointy end in the positive x-direction. Then there is one trajectory seeming to going ’through’ the critical point at (1,0) making a loop, although if one were to make a loop, one would have to go against the arrows once one goes through the critical point. Then there are several other trajectories, there are several trajectories that start on the bottom right go diagonally up but then turn leftwards until they get to the left half of the picture staying below the x-axis. Then they turn upwards until we are in the upper half of the picture and then turn rightwards until they are in the right half of the picture where they turn diagonally upwards leaving the picture. Then a couple of other trajectories start below the x-axis on the right side of the picture close to the x-axis, then simply turn rightwards until they leave the picture, never making it past x equals 1.
Figure 8.1. Phase portrait with some trajectories of \(x' = y\text{,}\) \(y' = -x+x^2\text{.}\)

From the phase portrait it should be clear that even this simple system has fairly complicated behavior. Some trajectories keep oscillating around the origin, and some go off towards infinity. We will return to this example often, and analyze it completely in this (and the next) section.
If we zoom into the diagram near a point where \(\left[ \begin{smallmatrix} f(x,y) \\ g(x,y) \end{smallmatrix} \right]\) is not zero, then nearby the arrows point generally in essentially that same direction and have essentially the same magnitude. In other words the behavior is not that interesting near such a point. We are of course assuming that \(f(x,y)\) and \(g(x,y)\) are continuous.
Let us concentrate on those points in the phase diagram above where the trajectories seem to start, end, or go around. We see two such points: \((0,0)\) and \((1,0)\text{.}\) The trajectories seem to go around the point \((0,0)\text{,}\) and they seem to either go in or out of the point \((1,0)\text{.}\) These points are precisely those points where the derivatives of both \(x\) and \(y\) are zero. Let us define the critical points as the points \((x,y)\) such that
\begin{equation} \begin{bmatrix} f(x,y) \\ g(x,y) \end{bmatrix} = \vec{0} . \end{equation}
In other words, these are the points where both \(f(x,y)=0\) and \(g(x,y)=0\text{.}\)
The critical points are where the behavior of the system is in some sense the most complicated. If \(\left[ \begin{smallmatrix} f(x,y) \\ g(x,y) \end{smallmatrix} \right]\) is zero, then nearby, the vector can point in any direction whatsoever. Also, the trajectories are either going towards, away from, or around these points, so if we are looking for long-term qualitative behavior of the system, we should look at what is happening near the critical points.
Critical points are also sometimes called equilibria, since we have so-called equilibrium solutions at critical points. If \((x_0,y_0)\) is a critical point, then we have the solutions
\begin{equation} x(t) = x_0, \quad y(t) = y_0 . \end{equation}
In Example 8.1.1, there are two equilibrium solutions:
\begin{equation} x(t) = 0, \quad y(t) = 0, \qquad \text{and} \qquad x(t) = 1, \quad y(t) = 0. \end{equation}
Compare this discussion on equilibria to the discussion in Section 1.6. The underlying concept is exactly the same.

Subsection 8.1.2 Linearization

In Section 3.5 we studied the behavior of a homogeneous linear system of two equations near a critical point. For a linear system of two variables given by an invertible matrix, the only critical point is the origin \((0,0)\text{.}\) Let us put the understanding we gained in that section to good use understanding what happens near critical points of nonlinear systems.
In calculus we learned to estimate a function by taking its derivative and linearizing. We work similarly with nonlinear systems of ODEs. Suppose \((x_0,y_0)\) is a critical point. First change variables to \((u,v)\text{,}\) so that \((u,v)=(0,0)\) corresponds to \((x_0,y_0)\text{.}\) That is,
\begin{equation} u=x-x_0, \qquad v=y-y_0 . \end{equation}
Next we need to find the derivative. In multivariable calculus you may have seen that the several variables version of the derivative is the Jacobian matrix
 1 
Named for the German mathematician Carl Gustav Jacob Jacobi (1804–1851).
. The Jacobian matrix of the vector-valued function \(\left[ \begin{smallmatrix} f(x,y) \\ g(x,y) \end{smallmatrix} \right]\) at \((x_0,y_0)\) is
\begin{equation} \begin{bmatrix} \frac{\partial f}{\partial x}(x_0,y_0) & \frac{\partial f}{\partial y}(x_0,y_0) \\ \frac{\partial g}{\partial x}(x_0,y_0) & \frac{\partial g}{\partial y}(x_0,y_0) \end{bmatrix} . \end{equation}
This matrix gives the best linear approximation as \(u\) and \(v\) (and therefore \(x\) and \(y\)) vary. We define the linearization of the equation (8.1) as the linear system
\begin{equation} \begin{bmatrix} u \\ v \end{bmatrix} ' = \begin{bmatrix} \frac{\partial f}{\partial x}(x_0,y_0) & \frac{\partial f}{\partial y}(x_0,y_0) \\ \frac{\partial g}{\partial x}(x_0,y_0) & \frac{\partial g}{\partial y}(x_0,y_0) \end{bmatrix} \begin{bmatrix} u \\ v \end{bmatrix} . \end{equation}

Example 8.1.2.

Let us keep with the same equations as Example 8.1.1: \(x' = y\text{,}\) \(y' = -x+x^2\text{.}\) There are two critical points, \((0,0)\) and \((1,0)\text{.}\) The Jacobian matrix at any point is
\begin{equation} \begin{bmatrix} \frac{\partial f}{\partial x}(x,y) & \frac{\partial f}{\partial y}(x,y) \\ \frac{\partial g}{\partial x}(x,y) & \frac{\partial g}{\partial y}(x,y) \end{bmatrix} = \begin{bmatrix} 0 & 1 \\ -1+2x & 0 \end{bmatrix}. \end{equation}
At \((0,0)\text{,}\) that is, when \(x=0\) and \(y=0\text{,}\) the Jacobian matrix is
\begin{equation} \begin{bmatrix} 0 & 1 \\ -1 & 0 \end{bmatrix} , \end{equation}
and our new variables are \(u=x\) and \(v=y\text{.}\) The linearization at \((0,0)\) is
\begin{equation} \begin{bmatrix} u \\ v \end{bmatrix} ' = \begin{bmatrix} 0 & 1 \\ -1 & 0 \end{bmatrix} \begin{bmatrix} u \\ v \end{bmatrix} . \end{equation}
At the point \((1,0)\text{,}\) the Jacobian matrix is \(\left[ \begin{smallmatrix} 0 & 1 \\ 1 & 0 \end{smallmatrix} \right]\text{,}\) and our new variables are \(u=x-1\) and \(v=y\text{.}\) The linearization at \((1,0)\) is
\begin{equation} \begin{bmatrix} u \\ v \end{bmatrix} ' = \begin{bmatrix} 0 & 1 \\ 1 & 0 \end{bmatrix} \begin{bmatrix} u \\ v \end{bmatrix} . \end{equation}
The phase diagrams of the two linearizations at the critical points \((0,0)\) and \((1,0)\) are given in Figure 8.2. Note that the variables are now \(u\) and \(v\text{,}\) which are different at each critical point. Compare Figure 8.2 with Figure 8.1, and look especially at the behavior near the critical points.

Two direction field plots (grid of arrows) in the uv-plane. On the left the arrows simply go clockwise around the origin, with several circular trajectories drawn. On the right is a picture of a saddle where along the line v equals u the arrows point away from the origin and along v equals minus u the arrows point towards the origin. The lines v equals u and v equals minus u are drawn as trajectories. In the picture divided by these lines, in the region on the left, the arrows point generally downward, towards the top going slightly to the right and towards the bottom going slightly to the left. There is a trajectory in this region starting out of the picture on the left side going down and slightly to the right before turning to the left and leaving the picture. The situation is analogous in the other 3 regions, where in the top region the arrows go roughly rightwards, the arrows in the right region go roughly upwards, and the arrows in the bottom region go roughly leftwards.
Figure 8.2. Phase diagram with some trajectories of linearizations at the critical points \((0,0)\) (left) and \((1,0)\) (right) of \(x' = y\text{,}\) \(y' = -x+x^2\text{.}\)

Exercises 8.1.3 Exercises

8.1.1.

Sketch the phase plane vector field for:
  1. \(x'=x^2\text{,}\)    \(y'=y^2\text{,}\)
  2. \(x'=(x-y)^2\text{,}\)    \(y'=-x\text{,}\)
  3. \(x'=e^y\text{,}\)    \(y'=e^x\text{.}\)

8.1.3.

Find the critical points and linearizations of the following systems.
  1. \(x'=x^2-y^2\text{,}\)    \(y'=x^2+y^2-1\text{,}\)
  2. \(x'=-y\text{,}\)    \(y'=3x+yx^2\text{,}\)
  3. \(x'=x^2+y\text{,}\)    \(y'=y^2+x\text{.}\)

8.1.4.

For the following systems, verify they have critical point at \((0,0)\text{,}\) and find the linearization at \((0,0)\text{.}\)
  1. \(x'=x+2y+x^2-y^2\text{,}\)    \(y'=2y-x^2\)
  2. \(x'=-y\text{,}\)    \(y'=x-y^3\)
  3. \(x'=ax+by+f(x,y)\text{,}\) \(y'=cx+dy+g(x,y)\text{,}\) where \(f(0,0) = 0\text{,}\) \(g(0,0) = 0\text{,}\) and all first partial derivatives of \(f\) and \(g\) are also zero at \((0,0)\text{,}\) that is, \(\frac{\partial f}{\partial x}(0,0) = \frac{\partial f}{\partial y}(0,0) = \frac{\partial g}{\partial x}(0,0) = \frac{\partial g}{\partial y}(0,0) = 0\text{.}\)

8.1.5.

Take \(x'=(x-y)^2\text{,}\)    \(y'=(x+y)^2\text{.}\)
  1. Find the set of critical points.
  2. Sketch a phase diagram and describe the behavior near the critical point(s).
  3. Find the linearization. Is it helpful in understanding the system?

8.1.6.

Take \(x'=x^2\text{,}\)    \(y'=x^3\text{.}\)
  1. Find the set of critical points.
  2. Sketch a phase diagram and describe the behavior near the critical point(s).
  3. Find the linearization. Is it helpful in understanding the system?

8.1.101.

Find the critical points and linearizations of the following systems.
  1. \(x'=\sin(\pi y)+(x-1)^2\text{,}\)    \(y'=y^2-y\text{,}\)
  2. \(x'=x+y+y^2\text{,}\)    \(y'=x\text{,}\)
  3. \(x'=(x-1)^2+y\text{,}\)    \(y'=x^2+y\text{.}\)
Answer.
a) Critical points \((1,0)\) and \((1,1)\text{.}\) At \((1,0)\) using \(u=x-1\text{,}\) \(v=y\) the linearization is \(u'=\pi v\text{,}\) \(v'=-v\text{.}\) At \((1,1)\) using \(u=x-1\text{,}\) \(v=y-1\) the linearization is \(u'=-\pi v\text{,}\) \(v'=v\text{.}\)
b) Critical points \((0,0)\) and \((0,-1)\text{.}\) Using \(u=x\text{,}\) \(v=y\) the linearization is \(u'=u+v\text{,}\) \(v'=u\text{.}\)
At \((0,0)\) using \(u=x\text{,}\) \(v=y\) the linearization is \(u'=u+v\text{,}\) \(v'=u\text{.}\) At \((0,-1)\) using \(u=x\text{,}\) \(v=y+1\) the linearization is \(u'=u-v\text{,}\) \(v'=u\text{.}\)
c) Critical point \((\nicefrac{1}{2},\nicefrac{-1}{4})\text{.}\) Using \(u=x-\nicefrac{1}{2}\text{,}\) \(v=y+\nicefrac{1}{4}\) the linearization is \(u'=-u+v\text{,}\) \(v'=u+v\text{.}\)

8.1.103.

The idea of critical points and linearization works in higher dimensions as well. You simply make the Jacobian matrix bigger by adding more functions and more variables. For the following system of 3 equations find the critical points and their linearizations:
\begin{equation} x' = x + z^2, \qquad y' = z^2-y, \qquad z' = z+x^2. \end{equation}
Answer.
Critical points are \((0,0,0)\text{,}\) and \((-1, 1, -1)\text{.}\) The linearization at the origin using variables \(u=x\text{,}\) \(v=y\text{,}\) \(w=z\) is \(u' = u\text{,}\) \(v'=-v\text{,}\) \(z' = w\text{.}\) The linearization at the point \((-1,1,-1)\) using variables \(u=x+1\text{,}\) \(v=y-1\text{,}\) \(w=z+1\) is \(u'=u-2w\text{,}\) \(v'=-v-2w\text{,}\) \(w'=w-2u\text{.}\)

8.1.104.

Any two-dimensional non-autonomous system \(x'=f(x,y,t)\text{,}\) \(y'=g(x,y,t)\) can be written as a three-dimensional autonomous system (three equations). Write down this autonomous system using the variables \(u\text{,}\) \(v\text{,}\) \(w\text{.}\)
Answer.
\(u' = f(u,v,w)\text{,}\) \(v'=g(u,v,w)\text{,}\) \(w' = 1\text{.}\)
For a higher quality printout use the PDF version: https://www.jirka.org/diffyqs/diffyqs.pdf or https://jirilebl.github.io/diffyqs/diffyqs.pdf