Null Space

5. Null Space#

In the previous chapter we saw examples where a system of linear equations has a single solutions. In this chapter, we’ll consider an example where there are many possible solution. One domain where systems like this are common is conservation of atoms (and sometimes charge) in chemical reactions – that is, stoichiometry. If you studied chemistry some time ago, or never, I’ll explain what you need to know.

Click here to run this notebook on Colab.

5.1. Stoichiometry#

In chemistry, a reaction equation specifies a set of chemicals that can react to form another set of chemicals. In this context, the chemicals are called reagents.

As an example, we’ll write the reaction equation for combustion of ethanol – that is, the reaction of ethanol with molecular oxygen to form carbon dioxide and water. We’ll define SymPy symbols to represent each of these reagents.

import sympy as sp

C2H5OH = sp.Symbol('C_{2} H_5 OH')
O2 = sp.Symbol('O2')
CO2 = sp.Symbol('CO2')
H2O = sp.Symbol('H_{2} O')

Now we can define the left hand side and the right hand side of the equation.

lhs = C2H5OH + O2
rhs = CO2 + H2O

And we can display it like this.

from IPython.display import display, Math

display(Math(f"{sp.latex(lhs)} \\rightarrow {sp.latex(rhs)}"))

\[\displaystyle C_{2} H_5 OH + O_{2} \rightarrow CO_{2} + H_{2} O\]

In this form the equation is not balanced, because it does not have the same number of atoms, of each kind, on both sides. For example, there are two carbon atoms on the left and only one on the right. In chemical reactions (as opposed to nuclear reactions) atoms are conserved.

To balance the equation, we have to find a set of proportions in which these species of chemicals can be combined while conserving atoms. To do that, we’ll introduce unknown coefficients to the equation.

x1, x2, x3, x4 = sp.symbols('x1 x2 x3 x4')

We’ll rewrite the equations with each species multiplied by a coefficient.

lhs = x1 * C2H5OH + x2 * O2
rhs = x3 * CO2 + x4 * H2O

And collect the coefficients and species into lists.

coefficients = [x1, x2, x3, x4]
species = [C2H5OH, O2, CO2, H2O]

Now we can use the following function to display the reaction equation.

def display_reaction(coefficients, species, i=2):
    terms = [c * s for c, s in zip(coefficients, species)]
    lhs, rhs = sum(terms[:i]), sum(terms[i:])
    display(Math(f"{sp.latex(lhs)} \\rightarrow {sp.latex(rhs)}"))

Here’s what the combustion of ethanol looks like with stoichiometric coefficients.

display_reaction(coefficients, species)

\[\displaystyle C_{2} H_5 OH x_{1} + O_{2} x_{2} \rightarrow CO_{2} x_{3} + H_{2} O x_{4}\]

Now let’s solve for the unknown coefficients. We can do that by writing an equation to represent conservation of each type of atom involved in the reaction.

For example, there are \(2 x_1\) carbon atoms on the left and \(x3\) on the right, so we can write:

eq_carbon = sp.Eq(2*x1, x3)         
eq_carbon

\[\displaystyle 2 x_{1} = x_{3}\]

Similarly, there are six hydrogen atoms in each molecule of ethanol and two in each molecule of \(H_2\), so we can write:

eq_hydrogen = sp.Eq(6*x1, 2*x4)     
eq_hydrogen

\[\displaystyle 6 x_{1} = 2 x_{4}\]

Finally, we can write this equation to represent conservation of oxygen atoms.

eq_oxygen = sp.Eq(x1 + 2*x2, 2*x3 + x4)  
eq_oxygen

\[\displaystyle x_{1} + 2 x_{2} = 2 x_{3} + x_{4}\]

Together, these form a system of linear equations, which we will solve simultaneously. As we did in the previous chapter, we can write the system as a matrix equation. We’ll use the SymPy function linear_eq_to_matrix to assemble the equations into a matrix.

A, b = sp.linear_eq_to_matrix([eq_carbon, eq_hydrogen, eq_oxygen], coefficients)
A

\[\begin{split}\displaystyle \left[\begin{matrix}2 & 0 & -1 & 0\\6 & 0 & 0 & -2\\1 & 2 & -2 & -1\end{matrix}\right]\end{split}\]

The results represent the matrix equation \(A x = b\), where \(x\) is the vector of coefficients.

x = sp.Matrix(coefficients)
x

\[\begin{split}\displaystyle \left[\begin{matrix}x_{1}\\x_{2}\\x_{3}\\x_{4}\end{matrix}\right]\end{split}\]

If we multiply the left hand side, we can confirm that we recover the system of equations.

eqn = sp.Eq(A * x, b)
eqn

\[\begin{split}\displaystyle \left[\begin{matrix}2 x_{1} - x_{3}\\6 x_{1} - 2 x_{4}\\x_{1} + 2 x_{2} - 2 x_{3} - x_{4}\end{matrix}\right] = \left[\begin{matrix}0\\0\\0\end{matrix}\right]\end{split}\]

Because there are no constants in the equations, \(b\) is all zeros – which means that this is a homogeneous equation. We can use solve to solve this system.

solution = sp.solve(eqn, x)
solution

{x1: x4/3, x2: x4, x3: 2*x4/3}

The result is a dictionary that represents a system of equations that express \(x_1\), \(x_2\), and \(x_3\) in terms of \(x_4\).

equations = [sp.Eq(var, expr) for var, expr in solution.items()]
for eq in equations:
    display(Math(sp.latex(eq)))

\[\displaystyle x_{1} = \frac{x_{4}}{3}\]

\[\displaystyle x_{2} = x_{4}\]

\[\displaystyle x_{3} = \frac{2 x_{4}}{3}\]

This is not the most convenient way to represent the solution. An alternative is to compute the null space of the system, which represents the solution as a list of vectors.

nullspace = A.nullspace()
nullspace

[Matrix([
 [1/3],
 [  1],
 [2/3],
 [  1]])]

This vector represents a solution to the system of equations.

basis_vector = nullspace[0]
basis_vector

\[\begin{split}\displaystyle \left[\begin{matrix}\frac{1}{3}\\1\\\frac{2}{3}\\1\end{matrix}\right]\end{split}\]

If we multiply the solution by \(A\), we can confirm that the result is \(b\).

A * basis_vector

\[\begin{split}\displaystyle \left[\begin{matrix}0\\0\\0\end{matrix}\right]\end{split}\]

And we can use it to write a balanced version of reaction equation.

display_reaction(basis_vector, species)

\[\displaystyle \frac{C_{2} H_5 OH}{3} + O_{2} \rightarrow \frac{2 CO_{2}}{3} + H_{2} O\]

But it is not standard to write reaction equations with fractional coefficients. So we’ll use the following function to find the least common multiplier of the fractional coefficients and multiply through.

def scale_vector(vector):
    denoms = [sp.denom(x) for x in vector]
    lcm = sp.lcm(denoms)
    return vector * lcm

Here’s the scaled version of the solution.

solution = scale_vector(basis_vector)
solution

\[\begin{split}\displaystyle \left[\begin{matrix}1\\3\\2\\3\end{matrix}\right]\end{split}\]

Any multiple of the solution vector is also a solution.

A * solution

\[\begin{split}\displaystyle \left[\begin{matrix}0\\0\\0\end{matrix}\right]\end{split}\]

Finally here’s the balanced reaction equation in standard form.

display_reaction(solution, species)

\[\displaystyle C_{2} H_5 OH + 3 O_{2} \rightarrow 2 CO_{2} + 3 H_{2} O\]

Because every molecule contains an integer number of each atom type, we know that the coefficients can be expressed as integers. So using SymPy to compute the solution symbolically is a good option. But we can also find null spaces numerically.

5.2. Finding a Null Space Numerically#

To find the null space of \(A\) numerically, we’ll convert it to a NumPy array.

A_numpy = np.array(A).astype(float)
A_numpy

array([[ 2.,  0., -1.,  0.],
       [ 6.,  0.,  0., -2.],
       [ 1.,  2., -2., -1.]])

And use the null_space function from SciPy.

from scipy.linalg import null_space

ns = null_space(A_numpy)
ns

array([[0.20851441],
       [0.62554324],
       [0.41702883],
       [0.62554324]])

The result is an array where each column is a vector in the null space. In this example there is only one, so we can select it like this.

vector = ns.T[0]
vector

array([0.20851441, 0.62554324, 0.41702883, 0.62554324])

If we multiply by \(A\), we can confirm that the result is all zeros, within floating-point error.

A_numpy @ vector

array([ 0.0000000e+00, -4.4408921e-16,  0.0000000e+00])

We might not recognize this vector as the solution we got from SymPy. That’s because it is normalized so its magnitude is 1, within floating-point error.

np.linalg.norm(vector)

np.float64(0.9999999999999999)

If we scale it so the smallest element is 1, its easier to recognize.

denom = np.min(np.abs(vector))
scaled = vector / denom
scaled

array([1., 3., 2., 3.])

And again, any multiple of this vector is also a solution.

A_numpy @ scaled

array([ 0.00000000e+00, -1.77635684e-15,  0.00000000e+00])

5.3. Null Spaces With More Dimensions#

Let’s consider a more complex example, the oxidation of hydrogen peroxide (\(H_2O_2\)) by potassium permanganate (\(K \mathit{Mn} O_4\)). This reaction is a classic demonstration in chemistry classes because the color of the solution changes dramatically. It is also used in oxidative treatment of wastewater.

Here are the species we’ll need to write the reaction equation.

KMnO4 = sp.Symbol('K \mathit{Mn} O_4')
H2O2 = sp.Symbol('H_{2}O_{2}')
H_plus = sp.Symbol('H^{+}')
Mn2_plus = sp.Symbol('\mathit{Mn}^{2+}')
K_plus = sp.Symbol('K^{+}')

species2 = [KMnO4, H2O2, H_plus, Mn2_plus, O2, H2O, K_plus]

Since we have seven species, we’ll need seven coefficients.

a, b, c, d, e, f, g = sp.symbols('a b c d e f g')
coefficients2 = [a, b, c, d, e, f, g]

Here is the version of the reaction equation with unknown coefficients.

display_reaction(coefficients2, species2, 3)

\[\displaystyle H^{+} c + H_{2}O_{2} b + K \mathit{Mn} O_4 a \rightarrow H_{2} O f + K^{+} g + O_{2} e + \mathit{Mn}^{2+} d\]

To find the coefficients, we can write four equations, one for each type of atom involved in the reaction.

eq_mn = sp.Eq(a, d)
eq_k = sp.Eq(a, g)
eq_h = sp.Eq(2*b + c, 2*f)
eq_o = sp.Eq(4*a + 2*b, 2*e + f)

Notice that three of the reagents are charged ions: \(H^{+}\) and \(K^{+}\) and \(\mathit{Mn}^{2+}\). So we can write an additional equation that requires charge to be conserved.

eq_charge = sp.Eq(c, 2*d + g)

eqns = [eq_mn, eq_k, eq_h, eq_o, eq_charge]
for eqn in eqns:
    display(Math(sp.latex(eqn)))

\[\displaystyle a = d\]

\[\displaystyle a = g\]

\[\displaystyle 2 b + c = 2 f\]

\[\displaystyle 4 a + 2 b = 2 e + f\]

\[\displaystyle c = 2 d + g\]

Now we can express this system as a matrix equation.

A, b = sp.linear_eq_to_matrix(eqns, coefficients2)
A

\[\begin{split}\displaystyle \left[\begin{matrix}1 & 0 & 0 & -1 & 0 & 0 & 0\\1 & 0 & 0 & 0 & 0 & 0 & -1\\0 & 2 & 1 & 0 & 0 & -2 & 0\\4 & 2 & 0 & 0 & -2 & -1 & 0\\0 & 0 & 1 & -2 & 0 & 0 & -1\end{matrix}\right]\end{split}\]

Because we have seven coefficients and only five equations, the system is underconstrained. In fact, it is even less constrained than the previous example: the null space contains two vectors.

nullspace = A.nullspace()
len(nullspace)

We’ll scale the vectors so the coefficients are integers.

solutions = [scale_vector(basis_vector) for basis_vector in nullspace]
solutions

[Matrix([
 [0],
 [2],
 [0],
 [0],
 [1],
 [2],
 [0]]),
 Matrix([
 [ 2],
 [-3],
 [ 6],
 [ 2],
 [ 1],
 [ 0],
 [ 2]])]

Here is what the first solution looks like as a balanced equation.

display_reaction(solutions[0], species2, 3)

\[\displaystyle 2 H_{2}O_{2} \rightarrow 2 H_{2} O + O_{2}\]

This equation represent the decomposition of hydrogen peroxide (H2O2) into water (H2O) and oxygen gas (O2). This reaction happens slowly at room temperature, and more quickly in the presence of catalase, an enzyme found in living cells, which is the cause of the foaming you might see if you apply hydrogen peroxide to a wound.

Here’s the second solution.

display_reaction(solutions[1], species2, 3)

\[\displaystyle 6 H^{+} - 3 H_{2}O_{2} + 2 K \mathit{Mn} O_4 \rightarrow 2 K^{+} + O_{2} + 2 \mathit{Mn}^{2+}\]

Notice that it contains a negative coefficient, which is not meaningful – there can’t be a negative amount of a reagent. So this solution is mathematically valid in the sense that it solves the equations that conserve atoms and charge, but by itself it does not represent a physically possible reaction.

However, any linear combination of these solutions is also a solution, and some of those combinations are physically possible. For example, if we combine the solutions in the ratio 4:1, we get the following reaction.

display_reaction(4 * solutions[0] + solutions[1], species2, 3)

\[\displaystyle 6 H^{+} + 5 H_{2}O_{2} + 2 K \mathit{Mn} O_4 \rightarrow 8 H_{2} O + 2 K^{+} + 5 O_{2} + 2 \mathit{Mn}^{2+}\]

This equation is balanced in atoms and charge, and it is theoretically possible because all of the coefficients are positive. In fact this is the reaction that occurs spontaneously when these reagents are combined.

However, if we combine the solutions in a ratio of 2:1, the result is also balanced and theoretically possible.

display_reaction(2 * solutions[0] + solutions[1], species2, 3)

\[\displaystyle 6 H^{+} + H_{2}O_{2} + 2 K \mathit{Mn} O_4 \rightarrow 4 H_{2} O + 2 K^{+} + 3 O_{2} + 2 \mathit{Mn}^{2+}\]

But under normal conditions, this reaction does not happen at non-negligible levels, I presume because it is not energetically favored. And there are many more combinations that are theoretically possible, but also unlikely.

So this is an example where balance of atoms and charge can tell us which reactions are theoretically possible, but stoichiometry alone is not enough to tell us which reactions are thermodynamically likely.

5.4. Discussion#

This chapter presents underdetermined systems, where the number of unknowns is greater than the number of equations. These systems often have many solutions.

The set of solutions forms a null space, which is defined by a set of basis vectors. Every linear combination of these vectors is a solution to the system.

In the ethanol example, the null space had one basis vector, so the solution space was one-dimensional. In the potassium permanganate example, there were two basis vectors, forming a two-dimensional space of solutions.

In general, there is a relationship between the rank of a matrix and the number of dimensions in its null space, given by the rank-nullity theorem:

\[ \text{rank}(A) + \text{nullity}(A) = n \]

where \(n\) is the number of columns in \(A\). The rank tells us how many independent constraints the equations impose. The nullity tells us how many degrees of freedom remain.

In the next chapter we consider overdetermined systems with no solutions – so we will try to find the best of the non-solutions.

5.4.1. Exercise#

Balance the equation for the combustion of methanol.

CH3OH = sp.Symbol('C H_{3} OH')

lhs = x1 * CH3OH + x2 * O2
rhs = x3 * CO2 + x4 * H2O

display_reaction([x1, x2, x3, x4], species)

\[\displaystyle C_{2} H_5 OH x_{1} + O_{2} x_{2} \rightarrow CO_{2} x_{3} + H_{2} O x_{4}\]

eq_carbon = sp.Eq(x1, x3)         
eq_carbon

\[\displaystyle x_{1} = x_{3}\]

Hydrogen: 6 from ethanol = 2 per H2O

eq_hydrogen = sp.Eq(4*x1, 2*x4)     
eq_hydrogen

\[\displaystyle 4 x_{1} = 2 x_{4}\]

Oxygen: 1 in ethanol + 2 per O2 = 2 per CO2 + 1 per H2O

eq_oxygen = sp.Eq(x1 + 2*x2, 2*x3 + x4)  
eq_oxygen

\[\displaystyle x_{1} + 2 x_{2} = 2 x_{3} + x_{4}\]

A, b = sp.linear_eq_to_matrix([eq_carbon, eq_hydrogen, eq_oxygen], (x1, x2, x3, x4))
A

\[\begin{split}\displaystyle \left[\begin{matrix}1 & 0 & -1 & 0\\4 & 0 & 0 & -2\\1 & 2 & -2 & -1\end{matrix}\right]\end{split}\]

\[\begin{split}\displaystyle \left[\begin{matrix}0\\0\\0\end{matrix}\right]\end{split}\]

nullspace = A.nullspace()
len(nullspace)

basis_vector = nullspace[0]
basis_vector

\[\begin{split}\displaystyle \left[\begin{matrix}\frac{1}{2}\\\frac{3}{4}\\\frac{1}{2}\\1\end{matrix}\right]\end{split}\]

solution = scale_vector(basis_vector)
solution

\[\begin{split}\displaystyle \left[\begin{matrix}2\\3\\2\\4\end{matrix}\right]\end{split}\]

display_reaction(solution, species)

\[\displaystyle 2 C_{2} H_5 OH + 3 O_{2} \rightarrow 2 CO_{2} + 4 H_{2} O\]

Think Linear Algebra

Code license: MIT License

Text license: Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International

Null Space

Contents

5. Null Space#

5.1. Stoichiometry#

5.2. Finding a Null Space Numerically#

5.3. Null Spaces With More Dimensions#

5.4. Discussion#

5.4.1. Exercise#