Mathematics

Detailed explanation of how mathematics is written in the NUbots codebase.

Updated Last updated 17 Apr 2024

The NUbots codebase uses specific mathematical concepts and notation that are relevant to the field of robotics. In the codebase there are many uses of vectors, matrices, spaces, and transforms between spaces. A specific notation is used for these transforms for better readability. This page details how these concepts and notations are implemented in the codebase.

Linear Algebra

We currently use the Eigen C++ template library for linear algebra in the NUbots codebase. Be aware that Eigen has some common pitfalls that are listed on the Eigen pitfalls page.

Bases

A basis is a maximal set of linearly independent vectors.

In this context, bases usually span $\mathbb{R}^3$ . That is, they cover three-dimensional space. Given a basis with three vectors $\mathbf{b}_1, \mathbf{b}_2, \mathbf{b}_3$ , any three dimensional vector $\mathbf{u}$ can be written as

\mathbf{u} = a_1 \cdot \mathbf{b}_1 + a_2 \cdot \mathbf{b}_2 + a_3 \cdot \mathbf{b}_3

where $a_1, a_2, a_3 \in \mathbb{R}$ . In other words, $a_1, a_2, a_3$ are scalars. These scalars are unique.

If the vectors in the basis are mutually perpendicular, then they are orthogonal.

An orthonormal basis (ONB) is a basis which is orthogonal and each vector has length 1.

An example of an ONB is the standard basis

e_1 = (1, 0, 0) \quad e_2 = (0, 1, 0) \quad e_3 = (0, 0, 1)

Vectors

As mentioned in the previous section, any vector $\mathbf{r}$ can be expressed as a linear combination of the elements of a basis

\mathbf{r}^a = r_1^a \mathbf{a}_1 + r_2^a \mathbf{a}_2 + r_3^a \mathbf{a}_3

The scalars $r_1^a, r_2^a, r_3^a$ are the coordinates for the vector $\mathbf{r}$ in the basis $a$ .

In the NUbots codebase, vectors are written as rABb which denotes a vector from point $B$ to point $A$ in the basis (or space) $b$ . These are column matrices of the coordinates of the vectors

\mathbf{r}^a = \begin{bmatrix} r_1^a \\ r_2^a \\ r_3^a \end{bmatrix}

To make computations such as inner product (also called dot product) or cross product with coordinates, the vectors must be relative to the same basis.

When drawing a vector $\mathbf{r}_{A/B}^b$ , that is a vector from $B$ to $A$ in the space $b$ , the tail is at $B$ and the head is at $A$ . Vector addition requires that the head of one vector is the same as the tail of the other. This is demonstrated here. We have that

\mathbf{r}^b_{A/C} = \mathbf{r}^b_{A/B} + \mathbf{r}^b_{B/C} = \mathbf{r}^b_{B/C} + \mathbf{r}^b_{A/B}

since vector addition is commutative. This is written in code as

Eigen::Vector3d rACb = rABb + rBCb;

Rotations

To describe the motion of the robot, we need a reference frame and a coordinate system.

The reference frame is the physical rigid body and the coordinate system is a mathematical concept, which is represented as a basis.

NUbots use a right-handed coordinate system to measure positions and rotations.

Right-Hand Rule. Image (modified) from Wikimedia Commons https://en.wikipedia.org/wiki/File:Right_hand_rule_cross_product.svg

Rotations are defined by a 3x3 matrix. If we have a vector $\mathbf{r}^a$ , that is the vector $\mathbf{r}$ in the coordinate system $a$ , the rotation

\begin{bmatrix} \cos(\psi) & -\sin(\psi) & 0 \\ \sin(\psi) & \cos(\psi) & 0 \\ 0 & 0 & 1 \end{bmatrix}

will rotate $\mathbf{r}^a$ by $\psi$ radians around the coordinate axis $\mathbf{a}_3$ .

More on rotation matrices can be found here.

The orientation of the basis $b$ relative to $a$ can be described by three consecutive rotations about the main axis. There are 12 different orders to do these rotations in, and each triplet of rotated angles is called a set of Euler angles. In robotics, the triplet is yaw, pitch and roll (Z-Y-X).

In the NUbots codebase, we denote a rotation matrix from space $b$ to space $a$ as Rab.

The rotation matrices used for changing between spaces belong to the special orthogonal group of order 3. These all have the following properties

\mathbf{R}^{-1} = \mathbf{R}^T

\text{det}(\mathbf{R}) = 1

The first property reduces the complexity of the computation to find the inverse matrix. The second property tells us this matrix can be inverted and it preserves scale between the two spaces.

If we have a vector $\mathbf{r}_{C/D}^a$ , that is it goes from point D to point C in a space $a$ , and we want to rotate it from space $a$ to space $b$ , we write

\mathbf{r}^b_{C/D} = \mathbf{R}^b_a \cdot \mathbf{r}^a_{C/D}

This is written in the NUbots codebase as rCDb = Rba * rCDa. Note that the notation is $\mathbf{R}_{from}^{to}$ .

Homogeneous Transformations

A homogeneous transformation matrix in three dimensions is a 4x4 matrix containing a rotation component and a translation component.

If we have some rotation $\mathbf{R}_a^b$ and a translation component $\mathbf{r}^b_{A/B}$ , then we have a transformation matrix that transforms from space $a$ to space $b$ :

\mathbf{H}_a^b = \begin{bmatrix} \mathbf{R}_a^b & \mathbf{r}_{A/B}^b \\ \mathbf{0} & 1 \end{bmatrix}

This is written in the NUbots codebase as Hba.

The vector $\mathbf{r}_{A/B}^b$ is the vector from B (the origin of space b) to A (the origin of space a) in b space, which would be written in the codebase as rABb.

The reason for this is that if we apply the transform $H^b_a$ to some point $\rho^a$ , then $\rho^a$ will first be rotated by $R_a^b$ , giving $\rho^b$ . $\rho^b$ will then be translated by some vector according to the transformation matrix. Since $\rho^b$ is now rotated in the $b$ coordinate system, the translation vector must also be in this $b$ coordinate system. The vector is from B to A because a point (0, 0, 0) in space $a$ should be moved to the origin of $a$ in $b$ space after transformation. Thus, the translation vector is $\mathbf{r}_{A/B}^b$ .

This is illustrated in the following diagram.

Two three dimensional coordinate spaces, one labelled as space 'a' and the other as space 'b'. There is a vector from space 'a' to some point 'c', and is labelled 'rCAb'. There is another vector from 'b' to 'a' and is labelled 'rABb'. There is a final vector from 'b' to 'c' which is labelled 'rCBb = rABb + rCAb'

This assumes the original vector, rCAa has already been rotated into space $b$ , i.e. we have rCAb.

Finding the inverse of $H^b_a$ is made simpler by the properties of the matrix.

(H^b_a)^{-1} = H^a_b = \begin{bmatrix} (\mathbf{R}_a^b)^T & (\mathbf{R}_a^b)^T \cdot -\mathbf{r}_{A/B}^b \\ \mathbf{0} & 1 \end{bmatrix}

Transformation matrices left multiply with vectors to get a vector in the new space. This vector is rotated and then translated.

\mathbf{r}^b_{CB} = \mathbf{H}^b_a \cdot \mathbf{r}^a_{C/A}

This is written in the codebase as rCBb = Hba * rCAa.

This is equivalent to

r_{CB}^b = r_{A/B}^b + (R^b_a * r_{C/A}^a)

or in code,

Eigen::Vector3d rCBb = rABb + (Rba * rCAa);

Vectors should only be multiplied with a transformation matrix if the last two letters of the vector match. That is, the position they are going from should match the origin of the space they are measured in. This means that Hab * rCBb is valid, but Hab * rCDb is invalid. Note that the first vector is rCBb and the second is rCDb.

If you would like the vector only rotated, extract the rotational component of the transform and apply it to the vector. The spaces that the vector and rotation matrix are in should match.

Eigen::Vector3d rCDb = Hba.rotation() * rCDa;

Homogeneous transformation matrices can be multiplied together to get a new homogeneous transformation matrix.

\mathbf{H}_a^c = \mathbf{H}_b^c \cdot \mathbf{H}^b_a

This is written in the codebase as

Eigen::Isometry3d Hca = Hcb * Hba;

Unit Vectors

A unit vector is a vector with a length of 1. They are used to represent directions in space.

They are written by describing their origin, the direction they are pointing and the space they are in.

$\mathbf{u}^b_{A/B}$ is a unit vector at B pointing towards A measured in space b.

The direction of a vector can be found by normalising it. This is done by dividing the vector by its length.

Eigen::Vector3d uABb = rABb.normalized();

Unit vectors can be rotated by rotation matrices but not translated. They should not be multiplied by a homogeneous transformation matrix.

Velocity and Acceleration

The velocity and acceleration of some object is represented relative to some space.

If we have the velocity of object A moving relative to space b $\mathbf{v}^b_{A}$ , then we write this in the codebase as vAb.

If we have the acceleration of object A moving relative to space b $\mathbf{a}^b_{A}$ , then we write this in the codebase as aAb.

Velocities and accelerations can only be rotated, not translated. They should not be multiplied by a homogeneous transformation matrix, but they can be multiplied by a rotation matrix.

If a velocity is multiplied with time, it becomes a displacement vector.

Eigen::Vector3d rA_tAb = vAb * t;

If an acceleration is multiplied with time, it becomes a velocity vector.

Eigen::Vector3d vA_tAb = aAb * t;

If you only have a partial velocity or acceleration, such as velocity in the direction of an object you can represent this by labelling the target the velocity is towards.

For example $\mathbf{v}^b_{A/B}$ would describe the velocity of B towards the point A.

Examples

Using Eigen, we define our vectors as Eigen::Vector3d, rotation matrices as Eigen::Matrix3d, and homogeneous transformation matrices as Eigen::Isometry3d. The d denotes double and instead can be f for float. Eigen::Isometry3d allows us to do computations easier than with Eigen::Matrix4d.

It allows multiplication with a Eigen::Vector3d even though the dimensions do not match. Using Eigen::Isometry3d makes it quicker to find the inverse, as it employs the techniques mentioned above. Eigen::Isometry3d has functions to individually return the rotational component of the matrix and the translation component of the matrix, by using Htw.rotation() and Htw.translation() respectively, where Htw is of type Eigen::Isometry3d. It also makes it easy to individually assign the rotational and the translational components of the matrix by using Htw.linear() = Rtw and Htw.translation() = rWTt respectively.

When we get a message that is a homogeneous matrix it will be of type mat4. This can then be cast to Eigen::Isometry3d. Let the matrix be called Htw (world to torso transform) and the message be called sensors (see: Sensors.proto). Then this is written as

Eigen::Isometry3d Htw(sensors.Htw);

If you have a transformation matrix Htw (world to torso transform) and you want the matrix Hwt (torso to world transform), this is can be achieved by calling Htw.inverse(). The inverse function swaps the letter order. The returned inverse matrix is not mutable.

If you have a vector rTWf (world to torso in foot space) and you want the opposite letter order (torso to world in foot space), this is done by negating the vector, rWTf = -rTWf.

Negating does not change the space the vector is measured in.

If you have two transformation matrices and you would like to multiply them to get a new transformation matrix, the inner two letters should be identical, i.e. the same space.

Eigen::Isometry3d Htf = Htw * Hwf;

This gives the foot to torso transform by multiplying the world to torso transform with the foot to world transform.

These two inner letters are cancelled, with the outer two remaining. This also works for rotation matrices.

If you have two vectors in the same space and their inner (or outer, since vector addition is commutative) letters are both the same then they can be cancelled in a similar way to the transformation matrices.

Eigen::Vector3d rXZa = rXYa + rYZa;

Reference Frames

The NUbots codebase uses a number of different reference frames (spaces) for different purposes. The following is a list of the commonly used spaces in the codebase.

Space	Letter	Description
`camera`	C	Located at camera (left/right) with x pointed "outwards", y to the "left" and z "up"
`world`	W	Located on the ground below the robot on startup. (Fixed inertial frame)
`torso`	T	Fixed midway between the robots hip yaw joints, with x "forward", y to the "left" and z "up"
`robot`	R	Located directly beneath torso projected onto the ground, with x "forward", y to the "left" and z "up"
`field`	F	Located at the centre circle of the field, with x pointing towards the "own" goal and z "up"

Summary

Item	Description
`Hab`	Denotes a 3D affine transformation matrix going from space $b$ to space $a$ .
`Rab`	Denotes a 3D rotation matrix from space $b$ to space $a$ .
`rABb`	Denotes a vector from point $B$ to point $A$ in space $b$ .
`uABb`	Denotes a unit vector from point $B$ towards point $A$ in space $b$ .
`vAb`	The velocity of $A$ in space $b$ .
`vABb`	The velocity of $B$ towards $A$ in space $b$ .
`aAb`	The acceleration of $A$ in space $b$ .
`aABb`	The acceleration of $B$ towards $A$ in space $b$ .
`rXZa = rXYa + rYZa`	Create a new vector from two other vectors.
`rACc = Hcb * rABb`	Changing a vector from one space to another using a homogeneous transform.
`rABd = Rdc * rABc`	Changing a vector from one space to another using a rotation transform
`Hba = Hab.inverse()`	Inverting a matrix swaps the letter order.
`Hac = Hab * Hbc`	Creating a new transform from two other transforms.
Euler Angles	Are of the form Z-Y-X.

Useful Links

Spherical Coordinates

Spherical coordinates are a system of representing points in 3D space. Rather than representing points as $\left(x, y, z\right)$ cartesian coordinates, points are represented as $\left(r, \varphi, \theta\right)$ , where r is radial distance, $\varphi$ is the polar angle, and $\theta$ is the azimuthal angle.

Spherical coordinates can be calculated from cartesian coordinates using the follow equations:

r = \sqrt{x^2 + y^2 + z^2} \\ \varphi = \textrm{acos}\left(\frac{z}{r}\right) = \textrm{atan}\left(\frac{\sqrt{x^2 + y^2}}{z}\right) \\ \theta = \textrm{atan}\left(\frac{y}{x}\right)

Cartesian coordinates can be calculated from spherical coordinates using the follow equations:

x = r\textrm{cos}\left(\theta\right)\textrm{sin}\left(\varphi\right) \\ y = r\textrm{sin}\left(\theta\right)\textrm{sin}\left(\varphi\right) \\ z = r\textrm{cos}\left(\varphi\right) \\

A vector representing a point in the spherical coordinates would be written as $s_{A/B}^b$ , which would be sABb in the codebase.

In order to ensure a unique representation of coordinates, the following restrictions are enforced on spherical coordinates:

r \geqslant 0 \\ 0 \leqslant \varphi \leqslant \pi \\ 0 \leqslant \theta \leqslant 2\pi \\

Reciprocal Distance

A common variation on spherical coordinates in our codebase is to use reciprocal distance. If a vector has spherical coordinates $\left(r, \varphi, \theta\right)$ , then a vector in spherical coordinates with reciprocal distance has coordinates $\left(\frac{1}{r}, \varphi, \theta\right)$ .

This is used in the vision detectors and localisation when representing the distance from the camera to an object. Using reciprocal distance also means our covariance (our uncertainty in our measurement) scales better with distance. The further away an object is the less certain we are about out measurement of it.

If $s_{A/B}^b$ is the vector in spherical coordinates, then $sr_{A/B}^b$ is the same vector in spherical coordinates, but with reciprocal radial distance.

Interpolation

Given a set of data points, often we want to find a function that contains all these points, which we call an interpolant. Interpolants are commonly polynomials, because they can approximate complex curves with a few points, with relatively low computational costs. Interpolant polynomials can be affected by Runge's phenomenon, where the interpolant is locally accurate for the given data points but has oscillations issues beyond these points. A further issue with interpolants is uniqueness - if there are too few constraints then there may be an infinite number of interpolants, and if there are too many constraints then there may not be a solution. Generally, for $n+1$ distinct points, there is a unique polynomial of degree $\leq n$ that interpolates the points. Polynomials of degree higher than $n$ may have aliasing issues where the interpolant is unnecessarily 'wiggly' between points.

Splines

Rather than have one global polynomial, it can be useful to use piecewise polynomials, connected at their edges. These so-called splines are all defined locally around the interpolation points, and can avoid Runge's phenomenon. Splines are used in the Quintic Walk Engine.

2D graph plot of a piecewise polynomial function that passes through specific highlighted points. — Spline. Interpolation points are in blue.

Every pair of consecutive points on the graph has an associated polynomial that is plotted in the range between those two points.

The Quintic Walk specifically uses quintic splines. This means that each polynomial is of the form

S_i = a_{5i} x^5 + a_{4i} x^4 + a_{3i} x^3 + a_{2i} x^2 + a_{1i} x + a_{0i}

For $i = 0, ..., n-1$ , where $n$ is the number of interpolation points.

These particular quintic splines have interpolation points $(x_0, y_0, y'_0, y''_0) \cdots (x_n, y_n, y'_n, y''_n)$ . They must satify the conditions

$S_j(x)$ is a polynomial in $[x_j, x_{j+1}]$ for each $j = 0, \cdots , n-1$
$S_j(x_j) = y_j$ and $S_j(x_{j+1}) = y_{j+1}$ for each $j = 0, \cdots , n-1$
$S'_{j+1}(x_{j+1}) = S'_j(x_{j+1}) = y'_{j+1}$ for each $j = 0, \cdots , n-2$
$S''_{j+1}(x_{j+1}) = S''_j(x_{j+1}) = y''_{j+1}$ for each $j = 0, \cdots , n-2$
$S'_0(x_0) = b_{0}$ and $S'_{n-1}(x_n) = y'_{n}$
$S''_0(x_0) = b_{0}$ and $S''_{n-1}(x_n) = y''_{n}$

These conditions result in smooth joints between each polynomial, meaning the position, velocity and acceleration at those joints is the same for both polynomials. Each data point has an x and y value, as well as first and second derivative value. The conditions above ensure the splines satisfy the given values for each data point.

Units

See the glossary.

On this page