1 Introduction
In a recent paper [1] the group of proper orthochronous Lorentz transformations that leave one of the spatial coordinates fixed was represented by 2 × 2 matrices whose entries are double numbers (also known as hyperbolic numbers). This representation is useful if one is interested in Lorentz transformations involving one or two spatial directions only.
As is well known, the composition of two pure boosts in two different directions is equivalent to the composition of a boost and a spatial rotation through an angle known as the Wigner angle (see, e.g., Ref. [2] and the references cited therein). In Ref. [1] the Wigner angle was calculated making use of the representation mentioned above. Actually, the representation obtained in Ref. [1] is one of the group SO(2,1), which can be regarded as a subgroup of the Lorentz group, SO(3,1). In the same manner as the SO(3) transformations can be viewed as the Lorentz transformations maintaining the condition t = 0, the SO(2,1) transformations correspond to the Lorentz transformations maintaining, e.g., z = 0. Besides the representation given in Ref. [1], the group SO(2,1) can also be represented by complex 2 × 2 matrices (belonging to the group SU(1,1)) and by real 2 × 2 matrices (belonging to SL(2,R)) (see, e.g., Refs. [3,
4]).
The existence of the Wigner angle is the basis of the Thomas precession, which is relevant in atomic and nuclear physics, explaining the observed fine-structure intervals and the spin-orbit interaction in atomic nuclei, respectively.
The aim of this paper is to show explicitly that the orthochronous proper Lorentz transformations maintaining z = 0 can be represented by (two-component) SU(1,1) spinors or, equivalently, that the inertial frames sharing the origin and the z-axis can be represented by SU(1,1) spinors. We also show that this correspondence can be employed to find the Wigner angle making use of one scalar product between spinors.
The spinor formalism is a very useful tool in the standard general relativity, which deals with four-dimensional curved space-times (see, e.g., Refs. [5,
6]). Similarly, the SU(1,1) spinors employed here are useful in the study of 2 + 1 space-times (see, e.g., Ref. [4]), which, even though are not the main objective in the theory of gravitation, are of interest as guides for the more realistic case (see, e.g., Ref. [7]). The group SU(1,1) is also of interest in connection with squeezed light and polarization optics, and the Wigner angle has analogs in those fields (see, e.g., Refs. [8,
9]).
In Sec. 2 we summarize some basic facts about the representation of SO(2,1) by SU(1,1) and the SU(1,1) spinors, and we show that there exist two scalar products between S(1,1) spinors. In Sec. 3 we show that each spinor represents an inertial frame of the restricted class mentioned above and we relate the Wigner angle with one of the scalar products between spinors. As a second application we derive the standard formulas for the relativistic Doppler effect and the aberration of light making use of the fact that a null vector with one of its spatial components equal to zero can be represented by a SU(1,1) spinor. In Sec. 4 the connection with the hyperbolic geometry is briefly discussed.
2 Representation of SO(2,1)
The homogeneous Lorentz group can be defined as the set of linear transformations leaving
x2+y2+z2-(ct)2
invariant, where (x,y,z,ct) are the coordinates of an event with respect to some inertial frame. Hence, the homogeneous Lorentz transformations that leave, e.g., z = 0 invariant, also leave
x2+y2-(ct)2
(1)
invariant. The linear transformations leaving (1) invariant form the group O(2,1).
Apart from the natural representation of O(2,1) by real 3 × 3 matrices there exist representations of this group, or its subgroups, by 2 × 2 matrices (see, e.g., Refs. [3,
4,
1]). In this section we show, in an elementary manner, that some O(2,1) transformations (namely, those with positive determinant belonging to the connected component of the identity) can be represented by complex 2 × 2 matrices (a detailed discussion can be found in Refs. [3,
4]).
As in Refs. [1,
10], we start by establishing a one-to-one correspondence between points and certain 2 × 2 matrices. In the present case we associate the point with (real) coordinates (x,y,ct) with the 2 × 2 complex matrix
P=ct-x-iyx-iy-ct.
(2)
Then, one can readily verify that
tr P=0, P†=ηPη,
(3)
where
η≡100-1
and that any complex 2 × 2 matrix satisfying Eqs. (3) must be of the form (2). Thus, there is a one-to-one correspondence between points (x,y,ct) and matrices (2), which is employed below.
Since detP=x2+y2-(ct)2, and the determinant is invariant under similarity transformations, the transformation P↦P' given by
P'=QPQ-1,
(4)
where P’ is the matrix (2) corresponding to (x',y',ct'), represents an O(2,1) transformation provided that P'†=ηP'η [see Eqs. (3)], that is, (Q-1)†P†Q†=η (QPQ-1) η, or (Q†)-1ηPηQ†=ηQPQ-1η, which is satisfied if ηQ†=Q-1η or, equivalently,
QηQ†=η.
(5)
(Note that the condition tr P=0 is preserved by any similarity transformation (4).)
Taking the determinant on both sides of Eq. (5), using the fact that detQ† is the conjugate of detQ, one finds that detQ=eiα, for some α∈R, then the matrix Q~=e-iα/2Q also satisfies Eq. (5) and detQ~=1. Furthermore, Q~PQ~-1=QPQ-1, which means that Q and Q~ produce the same O(2,1) transformation. Hence, we can restrict ourselves to matrices with determinant equal to 1 that satisfy Eq. (5). These matrices form the group SU(1,1). (The group SU(p,q) is formed by que (p+q)×(p+q) complex matrices, Q, with determinant equal to 1 such that QηQ†=η, where η is the diagonal matrix with p entries equal to +1 and q entries equal to -1.) It may be noticed that if Q belongs to SU(1,1) then -Q also belongs to this group and from Eq. (4) it follows that Q and -Q produce the same O(2,1) transformation.
It can be shown that the transformations (4) with Q∈SU(1,1) correspond to SO(2,1) transformations (that is, transformations belonging to O(2,1) with determinant equal to 1) [3, 4]. For our purposes it suffices to verify that the transformations (4) with Q∈SU(1,1) reproduce the pure boosts in arbitrary directions (in the xy-plane) and the rotations in the xy-plane. In fact, a straightforward computation shows that a rotation through an angle ϕ on the xy-plane is given by (4) with Q given by
e-iϕ/200eiϕ/2
(6)
or its negative. Similarly, for a pure boost in an arbitrary direction in the xy-plane which makes an angle θ with the x-axis, Q is given by
coshw/2-eiθsinhw/2-e-iθsinhw/2coshw/2
(7)
or its negative, with the relative velocity, v, given by
v=ctanhw.
(8)
The parameter w is called rapidity. Then, from Eq. (4) it follows that the matrix corresponding to the composition of a pure boost, in the direction making an angle θ with the x-axis, followed by a rotation through the angle ϕ is given by the product of (6) (as the factor on the left) by (7), namely
e-iϕ/2coshw/2-e-iϕ/2eiθsinhw/2-eiϕ/2e-iθsinhw/2eiϕ/2coshw/2.
(9)
2.1 Two-component spinors. Scalar products
The 2 × 2 matrices Q∈SU(1,1) act on two-component vectors
ψ=uv, (u,v∈C),
called spinors (or SU(1,1) spinors), in the simple manner: ψ↦Qψ (see also Eq. (25) below). As a consequence of (5), the scalar product of the spinors χ and ψ defined by χ†ηψ is invariant under the action of SU(1,1). In fact, χ'†ηψ'=(Qχ)†η(Qψ)=χ†(Q†ηQ)ψ=χ†ηψ, since (5) is equivalent to Q†ηQ=η. (This can be proved in the following way: since η2 is the unit matrix, Eq. (5) implies that Q-1=ηQ†η, hence (ηQ†η)Q is the unit matrix and multiplying on the left by η one gets the desired relation.)
There is a second scalar product between spinors that can be constructed by defining first the mate of a spinor. A straightforward computation shows that any matrix belonging to SU(1,1) is of the form
αββ¯α¯,
(10)
where α¯ is the complex conjugate of α, and α,β are complex numbers such that |α|2-|β|2=1 (cf. Ref. [10]). (The complex numbers α and β are analogous to the Cayley-Klein parameters employed in the case of the SO(3) transformations, see, e.g., Ref. [10] and the references cited therein.) Hence, if ψ'=Qψ, with
ψ'=u'v',
(11)
we have, in explicit form,
u'=αu+βv,v'=β¯u+α¯v,
(12)
which is equivalent to
v'¯=αv¯+βu¯u'¯=β¯v¯+α¯ u¯.
(13)
Comparing (13) with (12) one concludes that the mate of ψ, defined by
ψ^≡v¯u¯,
(14)
transforms in the same way as ψ
[3], hence, for any pair of spinors χ and ψ we can form another scalar product: χ^†ηψ (also invariant under the action of SU(1,1)). (The definition (14) differs from that given in Ref. [4] by a constant factor.) One can readily verify that χ†ηψ=ψ†ηχ¯ and ψ^†ηχ=-χ^†ηψ.
3 Representation of the Lorentz transformations by spinors
Given a two-component spinor
ψ=uv,
with |u|2-|v|2=1 (that is, ψ†ηψ=1), which we shall call a unit spinor, there exists a unique Q∈SU(1,1) such that
ψ=Q-110
or, equivalently,
Qψ=10.
(15)
In fact, Q is explicitly given by
u¯-v¯-vu
[see Eq. (10)] and since any Q∈SU(1,1) gives rise to a SO(2,1) transformation, which represents a Lorentz transformation of a restricted class, it follows that any unit spinor defines a proper orthochronous Lorentz transformation that leaves invariant the z-axis.
Hence, starting with an inertial frame, S0, any other inertial frame, S~, sharing the z-axis with S0, is represented by a unit spinor (defined up to sign). S0 itself is represented by
χ=10
(16)
or its negative (since the Lorentz transformation leading from S0 to S0 is the identity) and, according to Eqs. (9) and (15), the unit spinor representing S~ is
ξ=eiϕ~/2coshw~/2eiϕ~/2e-iθ~sinhw~/2
(17)
or its negative, if S~ is moving with respect to S0 with velocity ctanhw~ in the direction making an angle θ~ with the x-axis, and the axes x~ and y~ are rotated an angle ϕ~ with respect to the axes x and y. The scalar products χ†ηξ and χ^†ηξ are
χ†ηξ=eiϕ~/2coshw~/2
(18)
and
χ^†ηξ=-eiϕ~/2e-iθ~sinhw~/2.
(19)
Even though the spinors χ and ξ depend on the choice of the inertial frame S0, the scalar products (18) and (19), being invariant under the SU(1,1) transformations, only depend on the inertial frames represented by χ and ξ (in the same manner as the position vectors, r1 and r2, of two points of the Euclidean space depend on the choice of the origin, but the distance |r2-r1| is independent of that choice).
Thus, given two inertial frames S0 and S~ sharing the z-axis, they can be represented by unit spinors χ and ξ, respectively, and, according to Eq. (18), the modulus of the scalar product χ†ηξ determines the velocity of S~ with respect to S0, while the argument of χ†ηξ determines the angle between the axes x~,y~ and the axes x, y. The modulus of χ^†ηξ also gives the relative velocity of the two frames, but its argument does not give the angles ϕ~ and θ~ separately [see Eq. (19)].
3.1 The Wigner angle
Now, let S and S’ be two inertial frames moving with respect to S0 with velocities v and v' in the xy-plane which make angles θ and θ' with respect to the x-axis (of S0), respectively, with the spatial axes of S parallel to those of S0 and, similarly, the spatial axes of S’ parallel to those of S 0; in other words, S and S’ are related to S0 by means of pure boosts and these boosts are represented (with respect to S0) by the unit spinors
ψ=coshw/2e-iθsinhw/2 andψ'=coshw'/2e-iθ'sinhw'/2,
respectively, with v=ctanhw and v'=ctanhw' [see Eq. (17)].
Then, the scalar products ψ†ηψ' and ψ^†ηψ' are
ψ†ηψ'=coshw/2coshw'/2-ei(θ-θ')sinhw/2sinhw'/2
(20)
and
ψ^†ηψ'=e-iθsinhw/2coshw'/2-e-iθ'coshw/2sinhw'/2.
(21)
By equating Eqs. (18) and (20), we conclude that, with respect to S, the inertial frame S’ is moving with rapidity w~, and with its axes rotated by an angle ϕ~ (the Wigner angle), given by
eiϕ~/2coshw~/2=coshw/2coshw'/2-ei(θ-θ')sinhw/2sinhw'/2.
(22)
One can readily verify that the real and imaginary parts of this equation are precisely Eqs. (20) and (21) of Ref. [1]. Similarly, the real and imaginary parts of the equation
-eiϕ~/2e-iθ~sinhw~/2=e-iθsinhw/2coshw'/2-e-iθ'coshw/2sinhw'/2,
(23)
obtained by equating Eqs. (19) and (21), are precisely Eqs. (22) and (23) of Ref. [1].
3.2 Aberration of light and the relativistic Doppler effect
The aberration of light and the relativistic Doppler effect can be conveniently studied with the aid of the formalism introduced above. An electromagnetic plane wave is characterized by a wave four-vector kα, which is null. Assuming that k3=0, we are left with three possibly nonzero components, k0,k1 and k2, such that (k1)2+(k2)2-(k0)2=0, hence k1=k0cosα, k2=k0sinα, where α is the angle between the propagation direction and the x-axis, and in this case the analog of the matrix (2) can be expressed in terms of a SU(1,1) spinor, κ,
k0-k1-ik2 k1-ik2-k0 = k0-k0eiα k0e-iα-k0 =κκ†η
with
κ≡k0eiα/2e-iα/2.
(24)
The two-component spinor κ transforms according to
α'=Qα.
(25)
That is, in the case of the wave four-vector, the transformation rule (4) is a consequence of (25). Hence, for a boost along the x-axis [see Eq. (7)] the transformation (25) takes the form
k'0eiα'/2e-iα'/2=coshw/2-sinhw/2-sinhw/2coshw/2k0eiα/2e-iα/2
which is equivalent to the single equation
k'0eiα'/2=k0(eiα/2coshw/2-e-iα/2sinhw/2)
(26)
which, separating the modulus and the argument of the complex number on the left-hand side, amounts to
k'0=k0(coshw-sinhwcosα)=k0(γ-γ(v/c)cosα)
and
tan12α'=ewtan12α=1+v/c1-v/c tan12α.
(27)
It may be remarked that even though Eq. (27) is a well-known result, in the standard procedure, making use of the four-vector transformations, it is somewhat laborious to get it; and even with the two-component SL(2,C) spinors its derivation is not so straightforward.
4 Geometric interpretation
The spinor
ψ=uv
defines two vectors, Rψ and Mψ, with components R1,R2,R3 and M1,M2,M3 given by [4]
Rj=iψ†ητjψ, Mj=iψ^†ητjψ,
(28)
j=1,2,3, with
τ1≡0-ii0, τ2≡0110,τ3≡-i00i,
(29)
that is
Rψ=(uv¯+vu¯,-iuv¯+ivu¯,|u|2+|v|2),Mψ=(u2+v2,-iu2+iv2,2uv).
(30)
(The matrices τj are a basis for the Lie algebra (over R) of the traceless 2 × 2 complex matrices, A, such that A†η=-ηA, which is the Lie algebra of SU(1,1). One can readily verify that the matrices (6) and (7) can be expressed in the form exp(12ϕ τ3) and exp12w(sinθ τ1-cosθ τ2), respectively.) The components Rj are real and the Mj are complex. The vectors Rj and Mj are orthogonal, in the sense that gijRiMj=0, where
(gij)=diag(1,1,-1),
Mj is null, that is gijMiMj=0, and if ψ is a unit spinor then Ri is normalized, gijRiRj=-1.
In the case of the unit spinor ξ given by (17), the real vectors Rξ and Re Mξ are unit vectors along the axes t~ and x~ of S~ (with respect to the x,y,ct-axes of S0). Since gijRiRj=-1 and R3>0, the point (R1,R2,R3) belongs to a sheet of a hyperboloid. With ξ parametrized as in (17), the point (R1,R2,R3) is given by (sinhw~cosθ~,sinhw~sinθ~,coshw~) and therefore w and θ can be used as local coordinates on the hypersurface M≡{(x,y,ct)∈R3 | x2+y2-(ct)2=-1,ct>0}. The (indefinite) metric tensor gijdxidxj=(dx)2+(dy)2-(cdt)2 induces a (positive definite) metric tensor on M given by
(dw)2+sinh2w (dθ)2.
(31)
(This straightforward construction of the metric (31) can be compared with the one presented in Ref. [11].)
The meridians θ=const. of M are geodesics of the metric (31). Hence, the distance (on M) between the point with w=0 (corresponding to S0) and the point (sinhw~cosθ~,sinhw~sinθ~,coshw~) of M (corresponding to S~), along the geodesic joining these points, is w~.
Thus, each point of M correspond to the time axis of some inertial frame and the geodesic distance between two points of M is the rapidity of one the frames with respect to the other. (It should be stressed that M with the metric (31) is a maximally symmetric space, with constant curvature, analogous to the standard sphere of R3; among other things, M is a homogeneous space, which is in agreement with the fact that all the inertial frames are equivalent.)
As in the case of the SU(2) and the SL(2,C) spinors, the SU(1,1) spinors can be represented by flags (see, e.g., Refs. [4,
5]) and in all cases one spinor and its negative define the same flag. For a unit spinor ψ the flagpole is given by the vector Rψ and the flag points along Re Mψ. A straightforward computation shows that, for a unit spinor ξ parametrized in the form (17),
Re Mξ=cos(θ~-ϕ~) ew-sin(θ~-ϕ~) eθ,
(32)
where {ew,eθ} is the orthonormal basis of the tangent space of M (at Rξ) defined by the coordinates (w,θ) of M.
The flag corresponding to the unit spinor χ points along (1,0,0) [see Eqs. (30)], which forms an angle θ~ with the tangent vector to the geodesic curve θ=θ~, at the point (0,0,1) of M, while the flag corresponding to ξ forms an angle θ~-ϕ~ with the tangent vector to this curve, at the point Rξ of M [see Eq. (32)]. As is well known, when a vector is translated parallelly to itself along a geodesic, the angle between this vector and the tangent vector to the geodesic does not vary along the geodesic, hence, the angle, ϕ~, between the spatial axes of the inertial frames S0 and S~, is the angle between the flag corresponding to S~ and the flag corresponding to S0 parallelly translated to itself along the geodesic joining the points of M that represent these frames. (An entirely similar result holds in the case of the SU(2) spinors [12].)
In the case of the frames S0, S and S’ considered in Sec. 3.1, we will have three points of M determined by the intersections of the time axes of the frames with M. These three points define a geodesic triangle (constructed joining each pair of points by a geodesic), with the distance between vertices [determined with the metric (31)] being the relative rapidity between the corresponding frames.
When the flag corresponding to S0 is translated parallelly to itself along the geodesics going from S0 to S and from S0 to S’ it coincides with the flags corresponding to S and S’, respectively (by construction, since S and S’ are obtained by pure boosts from S0), but when this flag is translated parallelly to itself from S to S’, the resulting flag forms an angle equal to the Wigner angle with the flag corresponding to S’. In differential geometry, this angle is called the holonomy angle of the closed curve (in this case the triangle) and, for a constant curvature hypersurface, such as M, this angle is equal to the curvature of the hypersurface multiplied by the area enclosed by the curve (see also Refs. [2,
11] and the references cited therein).
5 Concluding remarks
As is well known, the entire proper orthochronous Lorentz group can be represented by SL(2,C) matrices (see, e.g., Refs. [5,
6]), which amounts to say that, excluding spatial reflections and time inversions, each inertial frame can be identified with a pair of two-component spinors (frequently denoted by o and ι in the Newman-Penrose notation). The alternatives presented in Ref. [1] and this paper constitute a considerable simplification in the restricted case discussed here, where a single two-component spinor is enough.
Acknowledgement
The author wishes to thank the referee for useful suggestions.
References
[1] G.F. Torres del Castillo, Application of the double numbers in the representation of the Lorentz transformations, Rev. Mex. Fís. E 20 (2023) 010204. https://doi.org/10.31349/RevMexFisE.20.010204.
[ Links ]
[2] P.K. Aravind, The Wigner angle as an anholonomy in rapidity space, Am. J. Phys. 65 (1997) 634. https://doi.org/10.1119/1.18620.
[ Links ]
[3] G.F. Torres del Castillo, Spinors in three dimensions. II, Rev. Mex. Fís. 40 (1994) 195.
[ Links ]
[4] G.F. Torres del Castillo, 3-D Spinors, Spin-Weighted Functions and their Applications (Springer, New York, 2003), Secs. 1.4, 5.2.2, 5.4. https://doi.org/10.1007/978-0-8176-8146-3.
[ Links ]
[5] R. Penrose and W. Rindler, Spinors and Space-Time, Vol. 1 (Cambridge University Press, Cambridge, 1984). https://doi.org/10.1017/CBO9780511564048.
[ Links ]
[6] G.F. Torres del Castillo, Spinors in Four-Dimensional Spaces (Springer, New York, 2010). https://doi.org/10.1007/978-0-8176-4984-5.
[ Links ]
[7] S. Carlip, Quantum Gravity in 2 + 1 Dimensions (Cambridge University Press, Cambridge, 1998).
[ Links ]
[8] D. Han, E.E. Hardekopf and Y.S. Kim, Thomas precession and squeezed states of light, Phys. Rev. A 39 (1989) 1269. https://doi.org/10.1103/PhysRevA.39.1269.
[ Links ]
[9] G.S. Agarwal, Quantum theory of partially polarizing devices and SU(1,1) Berry phases in polarization optics, Opt. Commun. 82 (1991) 213. https://doi.org/10.1016/0030-4018(91)90447-L.
[ Links ]
[10] H. Goldstein, Classical Mechanics, 2nd ed. (Addison-Wesley, Reading, Mass., 1980). Sec. 4-5.
[ Links ]
[11] J.A. Rhodes, M.D. Semon, Relativistic velocity space, Wigner rotation and Thomas precession, Am. J. Phys. 72 (2004) 943. https://doi.org/10.1119/1.1652040.
[ Links ]
[12] G.F. Torres del Castillo, Spinor representation of an electromagnetic plane wave J. Phys. A: Math. Theor. 41 (2008) 115302. https://doi.org/10.1088/1751-8113/41/11/115302.
[ Links ]