This post was prompted by a user question raised in this issue on Manifolds.jl. Ronny Bergmann implemented these results in Manifolds.jl in this PR.
Suppose you held up an object and began rotating it at a fixed speed. After some amount of time, you stop rotating it. If I know the starting pose of the object and the amount of time that has passed, under what conditions can I also tell you the rotational velocity (both direction and speed) of the spinning object?
There's really just one condition: that the rotational distance (i.e. angle) between the initial and final positions does not exceed some maximum value. For example, if you rotate 180° in any direction, it's impossible for me to know whether you rotated the object clockwise or counterclockwise. Worse, if you rotate a full 360°, I couldn't know whether you didn't move the object at all or performed a whole rotation or a million rotations. This maximum allowed distance that allows one to still infer the initial and final orientation is called the injectivity radius[1].
The question motivating this post is, what is the injectivity radius for the rotations in not just 2 dimensions and 3 dimensions but any dimension? And more generally, what is it for the unitary group and its most common subgroups? Since these groups are featured in many (all?) of the introductory texts on Lie groups and manifolds, and since the injectivity radius is a basic property introduced in differential geometry textbooks, I was surprised that I could not find a single reference giving these radii for these groups.
In this post I'll work out these radii. Marvelously, we don't need any differential geometry or group theory to do this, just linear algebra! Nevertheless, this post assumes familiarity with these topics and for the sake of space will try not define all common terms or notation.[2]
Consider a point on some manifold with tangent vectors . Assume a Riemannian metric defining an inner product , which induces a norm .
Let's denote the exponential map for and the logarithmic map . The injectivity radius at is defined as the norm of the smallest for which , or in other words, the smallest for which the logarithmic map no longer is the inverse of the exponential map. We further define the global injectivity radius of as the infimum of the injectivity radii at all points on the manifold.
Notationally, we define this global injectivity radius as
We'll also consider the related supremum
These two quantities form the lower and upper bound radii, respectively, of two geodesic balls within which the exponential map is invertible.
The Unitary group over some number system is the group of all matrices for which , where denotes the matrix adjoint. could be the real numbers , complex numbers , or quaternions .
We will also deal with the following subgroups:
: the special unitary group, which consists of complex unitary matrices whose determinant is +1
: the orthogonal group, i.e. the group of rotations and reflections
: the special orthogonal group, i.e. the group of real rotations
We will focus on the real and complex fields, but the unitary quaternionic case immediately follows from the complex one.
The unitary group is a compact group and when equipped with the Frobenius inner product becomes a Riemannian manifold.[3] The Riemannian exponential and logarithm are related to the Lie group exponential and logarithm , which for these matrix groups are just the matrix exponential and logarithm.
In the following then, is always the identity matrix and will not be mentioned, while is always an element of the Lie algebra, that is, the tangent space at the identity matrix.
The orthogonal group is comprised of two submanifolds, , whose elements have determinant +1, and another subgroup whose elements have determinant -1. These submanifolds are disconnected, so that the geodesic cannot join two points from the different submanifolds.
All unitary matrices have a unit determinant . The inverse of any unitary matrix is just its adjoint
The logarithm of any unitary matrix is a skew-hermitian matrix .[4] Unitary and skew-hermitian matrices are normal matrices, which means they are always diagonalizable with unitary eigenvectors. Let be the eigendecomposition of and be the eigendecomposition of .
The unitary/skew-Hermitian condition then implies
The norm of under the Frobenius metric is
The special unitary group has the additional constraint that the determinant is +1. This constraint implies that . Given values of , the th value is thus fixed.
Special orthogonal matrices are real, and real matrices have the extra property that if they have a complex eigenvalue then they also have a complex eigenvalue , i.e. the complex eigenvalues come in conjugate pairs. The sum of the elements in each pair is thus 0, and they don't contribute to the the sum . When is odd, at least one eigenvalue is 0.
Using the eigendecomposition, the matrix exponential and logarithm can be computed by applying the corresponding function to the eigenvalues. Then
We are finally ready to work out the injectivity radii. For , the eigenvalues of are for .
The matrix exponential is invertible when is invertible. But this is just rotation in the complex plane by the angle , which is invertible for . The injectivity radii are then computed by the constraint for at least one .
For and , the largest value of achievable subject to the constraints occurs when for all . Then . The smallest value of occurs when for any . So
For , we have the additional constraint that . For even , is maximized subject to these constraints when entries in are and when entries are . On the other hand, for odd , it is maximized when values each are and and the th value is . is minimized subject to these constraints when there is a single nonzero pair for . As a result,
The constraints on required for and are also satisfied by the lower and upper bounds of the norms considered for , so
It's always a good idea to numerically check that the results make sense. For the 2D and 3D rotations and , respectively, we then have
These injectivity radii correspond to a rotation of 180°. [3] Coming back to our motivating example, if we start from any pose and rotate an object more than 180° in any direction, we can no longer uniquely determine the initial rotational velocity.
, the complex unit circle, is isomorphic to , so it may seem like a contradiction that its injectivity radius is . But this difference is again caused by the choice of metric, which causes the inner product on to be scaled by compared to .[3]
Analogously, represents the unit quaternions (also the compact symplectic group), which are an alternative way to represent 3D rotations, and is equivalent to . The injectivity radii again only differ by the factor of due to the scaling convention of the metric.
This has been one of the rare moments where we got to dabble with group theory and manifolds without needing too much geometry. I hope it was enjoyable!
[1] | The injectivity radius is so called because it is the radius of a geodesic ball within which the exponential map is injective (one-to-one). |
[2] | It's really hard to write anything about manifolds or groups without writing a whole introductory text to differential geometry or group theory. |
[3] | Because of the skew-hermitian nature of the elements of the Lie algebra, some texts use the scaled Frobenius metric . In some cases, this allows the norm of a tangent vector to be interpreted as the angle of the rotation. To get the injectivity radii for this metric, one would just divide ours by . |
[4] | This comes from differentiating the constraint . Then we have . Letting , then . When is the identity matrix, . |
[5] | For the complex unitary group, is the usual imaginary number, while for quaternions, it would be a pure unit quaternion. |