Euler Parameters
1) Preface
In geometry, Euler's rotation theorem states that, in three-dimensional space, any displacement of a rigid body such that a point on the rigid body remains fixed, is equivalent to a single rotation about some axis that runs through the fixed point. It also means that the composition of two rotations is also a rotation. The axis of rotation is known as an Euler axis (source Wikipedia)
So, according to the above mentioned Euler's rotation theorem, any 3D rotation (or sequence of rotations) can be specified using two parameters: a unit vector that defines an axis of rotation; and an angle μ describing the magnitude of the rotation about that axis, rotation to be considered positive according to the right hand rule.
Let’s assume now to have a fixed reference system XYZ, and consider a versor V (our rotation axis) forming with the three axis, the three angles α β and γ.
Suppose to have an additional reference system (X’Y’Z’) in origin coincident with XYZ.
We want to apply a rotation of an angle μ to X’Y’Z’ around the vector V, and to calculate the relative transformation matrix. The scope is very simple: have an instrument to calculate the attitude (the angular position) of an object (an aircraft) once the rotation is applied. In fact we will learn that knowing this transformation matrix, allow us to know the asset of the aircraft.
The problem could seem very complex, but we can reduce the difficulties with some smart ideas.
If we assume to be able to rotate the X’Y’Z’ reference system (by means of the transformation matrix A) in such e way that the X’ axis is coincident with the vector V, we would be in a good position. From now on it would be sufficient to make a rotation around the X’ axis (by means of the transformation matrix R) to have the rotation around V: note that this transformation would be very simple. In order to obtain the our final result would be now sufficient to apply the inverse transformation by means of the matrix A-1
2) The transformation matrix
Now let’s discover the matrix A but firstly let’s choose the way to superimpose X’ to V:
- first rotate X’Y’Z’ reference system around the Z≡Z’ axis until the plane formed by V and X’ becomes perpendicular the XY≡X’Y’ plane
- second rotate the X’Y’Z’ reference system around the Y’ axis until X’ is coincident with V.
It’s to be noted that in this way the Y’ axis is restricted on the XY plane, and being the X’ axis superimposed to V, the Z’ axis is perpendicular to V.
The generic orthogonal transformation matrix have the form:
but from matrix theory we know that:
- the rows represent the unit vector of each axis of the new reference system respect to the old one
- each row represents a unit vector perpendicular to the unit vectors represented by the other rows and their components are the director cosines of the unit vector
- each column represents a unit vector perpendicular to the unit vector represented by the other columns
As consequence of the above properties we have:
The fact that Y’ lies on the XY plane implies that
Considering the last column we can write:
The resulting matrix becomes
Let’s now multiply first the column by the last column
redoing the same operation between the second and last columns
The matrix A now becomes
We need now to compute the last two members,
One simple relation comes considering that the second row is a vector unit, that is:
considering the first and second rows they represent two perpendicular unity vector and thus
Finally
similar considerations lead to
The sign ambiguities can be solved adding the requirements that for α = 0 the matrix reduces to an identity matrix
The matrix A takes now the final form
Let’s now consider, as stated at the beginning, the rotation of an angle μ around the X’ axis:
the corresponding matrix R takes the very easy form:
R =
The matrix A-1 to remove the effect of matrix A, is very simple too.
Considering Matrix theory, being matrix A unitary and orthogonal, the inverse of A-1 is simply the transposed of A, that means that the rows of A-1 are the columns of A, i.e.:
A-1 =
Now we have all the components to find the matrix T = A-1 R A describing the rotation of an angle μ around the vector V of the reference system X’Y’Z’ (we omit the tedious math transformations):
T =
The matrix (1) is extremely important and even if could seem strange or too complicated, has its logic. Let’s try to discover the logic by making the following substitution:
These four parameters are called the Euler parameters. It may be seen from their definition that they obey the relationship:
We can say that Euler in the 18th century anticipated what Hamilton formulated precisely in the 19th century: the quaternions. We will introduce quaternions in another post, showing later how to find their components, and from the components how to derive the attitude angles.
Matrix T takes now the form:
T =
This means that if I have a vector
This result is very important and will be used in the following findings
Comments