Euler Parameters

Andrea Del Bravo February 19, 2021

1) Preface

In geometry, Euler's rotation theorem states that, in three-dimensional space, any displacement of a rigid body such that a point on the rigid body remains fixed, is equivalent to a single rotation about some axis that runs through the fixed point. It also means that the composition of two rotations is also a rotation. The axis of rotation is known as an Euler axis (source Wikipedia)

So, according to the above mentioned Euler's rotation theorem, any 3D rotation (or sequence of rotations) can be specified using two parameters: a unit vector that defines an axis of rotation; and an angle μ describing the magnitude of the rotation about that axis, rotation to be considered positive according to the right hand rule.

Let’s assume now to have a fixed reference system XYZ, and consider a versor V (our rotation axis) forming with the three axis, the three angles α β and γ.

Suppose to have an additional reference system (X’Y’Z’) in origin coincident with XYZ.

We want to apply a rotation of an angle μ to X’Y’Z’ around the vector V, and to calculate the relative transformation matrix. The scope is very simple: have an instrument to calculate the attitude (the angular position) of an object (an aircraft) once the rotation is applied. In fact we will learn that knowing this transformation matrix, allow us to know the asset of the aircraft.

The problem could seem very complex, but we can reduce the difficulties with some smart ideas.

If we assume to be able to rotate the X’Y’Z’ reference system (by means of the transformation matrix A) in such e way that the X’ axis is coincident with the vector V, we would be in a good position. From now on it would be sufficient to make a rotation around the X’ axis (by means of the transformation matrix R) to have the rotation around V: note that this transformation would be very simple. In order to obtain the our final result would be now sufficient to apply the inverse transformation by means of the matrix A^-1

2) The transformation matrix

Now let’s discover the matrix A but firstly let’s choose the way to superimpose X’ to V:

first rotate X’Y’Z’ reference system around the Z≡Z’ axis until the plane formed by V and X’ becomes perpendicular the XY≡X’Y’ plane
second rotate the X’Y’Z’ reference system around the Y’ axis until X’ is coincident with V.

It’s to be noted that in this way the Y’ axis is restricted on the XY plane, and being the X’ axis superimposed to V, the Z’ axis is perpendicular to V.

The generic orthogonal transformation matrix have the form:

$A = (\begin{matrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \\ a_{31} & a_{32} & a_{33} \end{matrix})$

but from matrix theory we know that:

the rows represent the unit vector of each axis of the new reference system respect to the old one
each row represents a unit vector perpendicular to the unit vectors represented by the other rows and their components are the director cosines of the unit vector
each column represents a unit vector perpendicular to the unit vector represented by the other columns

As consequence of the above properties we have:

$a_{11} = \cos (α)$ $a_{12} = \cos (β)$ $a_{13} = \cos (γ)$

The fact that Y’ lies on the XY plane implies that $a_{23} = 0$ (the third component of its unit vector is null)

Considering the last column we can write: $a_{13}^{2} + a_{23}^{2} + a_{33}^{2} = 1$ and

$a_{33}^{2} + \cos^{2} (γ) = 1$ which leads to $a_{33} = \pm \sin (γ)$

The resulting matrix becomes

$A = (\begin{matrix} \cos α & \cos β & \cos γ \\ a_{21} & a_{22} & 0 \\ a_{31} & a_{32} & \pm \sin γ \end{matrix})$

Let’s now multiply first the column by the last column

$\cos (α) * \cos (γ) \pm a_{31} * \sin (γ) = 0$ which leads to $a_{31} = \mp \cos α * \cot γ$

redoing the same operation between the second and last columns

$\cos β * \cos γ \pm a_{32} * \sin γ = 0$ which leads to $a_{32} = \mp \cos β * cotg γ$

The matrix A now becomes

$A = (\begin{matrix} \cos α & \cos β & \cos γ \\ a_{21} & a_{22} & 0 \\ \mp \cos α * cotg γ & \mp \cos β * cotg γ & \pm \sin γ \end{matrix})$

We need now to compute the last two members, $a_{21}$ and $a_{22}$ .

One simple relation comes considering that the second row is a vector unit, that is:

$a_{21}^{2} + a_{22}^{2} = 1$

considering the first and second rows they represent two perpendicular unity vector and thus

$a_{21} * \cos (α) + a_{22} * \cos (β) = 0$ extracting a21 from this relation and substituting it in the above

$a_{22}^{2} + \frac{\cos^{2} (β)}{\cos^{2} (α)} * a_{22}^{2} = 1$ $a_{22}^{2} (1 + \frac{\cos^{2} (β)}{\cos^{2} (α)}) = 1$

$a_{22}^{2} \frac{(\cos^{2} β + \cos^{2} α)}{\cos^{2} α} = 1$ consider that from row 1 of matrix A

$\cos^{2} β + \cos^{2} α = 1 - \cos^{2} γ = \sin^{2} γ$ and thus $a_{22}^{2} * \frac{\sin^{2} γ}{\cos^{2} α} = 1$

Finally $a_{22} = \pm \cos α * \csc γ$

similar considerations lead to

$a_{21} = \mp \cos β * \csc γ$

The sign ambiguities can be solved adding the requirements that for α = 0 the matrix reduces to an identity matrix

The matrix A takes now the final form

$A = (\begin{matrix} \cos α & \cos β & \cos γ \\ - \cos β * \csc γ & \cos α * \csc γ & 0 \\ - \cos α * cotg γ & - \cos β * cotg γ & \sin γ \end{matrix})$

Let’s now consider, as stated at the beginning, the rotation of an angle μ around the X’ axis:

the corresponding matrix R takes the very easy form:

R = $(\begin{matrix} 1 & 0 & 0 \\ 0 & \cos μ & \sin μ \\ 0 & - \sin μ & \cos μ \end{matrix})$

The matrix A^-1to remove the effect of matrix A, is very simple too.

Considering Matrix theory, being matrix A unitary and orthogonal, the inverse of A^-1is simply the transposed of A, that means that the rows of A^-1are the columns of A, i.e.:

A^-1= $(\begin{matrix} \cos α & - \cos β * \csc γ & - \cos α * cotg γ \\ \cos β & \cos α * \csc γ & - \cos β * cotg γ \\ \cos γ & 0 & \sin γ \end{matrix})$

Now we have all the components to find the matrix T = A^-1 R A describing the rotation of an angle μ around the vector V of the reference system X’Y’Z’ (we omit the tedious math transformations):

T = $(\begin{matrix} 1 - 2 \sin^{2} \frac{μ}{2} \sin^{2} α & 2 (\sin^{2} \frac{μ}{2} \cos α \cos β + \sin \frac{μ}{2} \cos \frac{μ}{2} \cos γ) & 2 (\sin^{2} \frac{μ}{2} \cos α \cos γ - \sin \frac{μ}{2} \cos \frac{μ}{2} \cos β) \\ 2 (\sin^{2} \frac{μ}{2} \cos α \cos β - \sin \frac{μ}{2} \cos \frac{μ}{2} \cos γ) & 1 - 2 \sin^{2} \frac{μ}{2} \sin^{2} β & 2 (\sin^{2} \frac{μ}{2} \cos β \cos γ + \sin \frac{μ}{2} \cos \frac{μ}{2} \cos α) \\ 2 (\sin^{2} \frac{μ}{2} \cos α \cos γ + \sin \frac{μ}{2} \cos \frac{μ}{2} \cos β) & 2 (\sin^{2} \frac{μ}{2} \cos β \cos γ - \sin \frac{μ}{2} \cos \frac{μ}{2} \cos α) & 1 - 2 \sin^{2} \frac{μ}{2} \sin^{2} γ \end{matrix})$ (1)

The matrix (1) is extremely important and even if could seem strange or too complicated, has its logic. Let’s try to discover the logic by making the following substitution:

$a_{1} = \cos \frac{μ}{2}$

$a_{2} = \cos α \sin \frac{μ}{2}$

$a_{3} = \cos β \sin \frac{μ}{2}$

$a_{4} = \cos γ \sin \frac{μ}{2}$

These four parameters are called the Euler parameters. It may be seen from their definition that they obey the relationship:

$a_{1}^{2} + a_{2}^{2} + a_{3}^{2} + a_{4}^{2} = 1$

We can say that Euler in the 18^th century anticipated what Hamilton formulated precisely in the 19^th century: the quaternions. We will introduce quaternions in another post, showing later how to find their components, and from the components how to derive the attitude angles.

Matrix T takes now the form:

T = $(\begin{matrix} a_{1}^{2} + a_{2}^{2} - a_{3}^{2} - a_{4}^{2} & 2 (a_{1} a_{4} + a_{2} a_{3}) & 2 (a_{2} a_{4} - a_{1} a_{3}) \\ 2 (a_{2} a_{3} - a_{1} a_{4}) & a_{1}^{2} - a_{2}^{2} + a_{3}^{2} - a_{4}^{2} & 2 (a_{3} a_{4} + a_{1} a_{2}) \\ 2 (a_{2} a_{4} + a_{1} a_{3}) & 2 (a_{3} a_{4} - a_{1} a_{2}) & a_{1}^{2} - a_{2}^{2} - a_{3}^{2} + a_{4}^{2} \end{matrix})$ (2)

This means that if I have a vector $\bar{A}$ whose coordinates on the reference system X, Y, Z are $(x_{a}, y_{a}, z_{a})$ , the same vector in the reference system X’, Y’, Z’ will have the coordinates

$(\begin{matrix} x'_{a} \\ y'_{a} \\ z'_{a} \end{matrix}) = T (\begin{matrix} x_{a} \\ y_{a} \\ z_{a} \end{matrix})$ 3)

This result is very important and will be used in the following findings

Comments