3

SR Dynamic Implications

 

Return to Special Relativity

 

In many texts mass has been defined in a circular manner. Some such texts have asserted a four-vector momentum in the form of pm = mUm as a premise which doesn't work for massless particles and then defined mass as the contraction of that vector or visa-versa. In order to avoid circularity and to include massless particles and also in order to facilitate a smoother transition to relativistic quantum mechanics this text will take a newer though not unique approach. For example, here there will be two different momentum four-vectors distinguished by capitalization pm, and Pm. The lower case will be the momentum four-vector of the first kind and the upper case will be the momentum four-vector of the second kind. This is done in part because a particle's mass will be defined as the contraction of the momentum four-vector of the first kind which is the momentum four-vector referred to in classical relativistic (non-quantum relativistic) texts. The momentum four vector of the second kind is here defined mainly because its elements are what will correspond to quantum operators in relativistic quantum mechanics. (Some authors choose the capitals the other way around)

From experiments or due to quantum mechanics we know that the magnitude of the three component momentum of a particle can be related to a wavelength (whether or not the particle has mass).

(3.1.1a)

and the relation between the three element momentum and the three element k (whether or not the particle has mass) is

(3.1.1b)

Also, a fourth element corresponding to a time coordinate can be related to a frequency (whether or not the particle has mass) and that element times c we will term relativistic energy ER.

(3.1.2a)

(3.1.2b)

where w can be related to the wavelength by

(3.1.2c)

and from bc = dw/dk:

w2 = c2k2 + constant

(3.1.2d)

The integration constant will turn out to be proportional to the square of the mass.

21

We will start with the premise that this four component definition of momentum constitutes a four-vector that will be called the momentum four-vector of the first kind pm.

Next consider the introduction of a four-vector potential fm to which the test particle responds with a charge q. It does not matter at this point if this charge is electric, only that the vector potential to which it responds is a four-vector. We will define the momentum four-vector of the second kind by

Pm = pm + (q/c)fm

(3.1.3a)

and we will call its time element P0 total energy E

As an artifact from physics texts that do not make this distinction, when the potential is not zero one might think of the total energy E as p0/c even though it is really P0/c which is ER + qf in special relativity. At the same time, one would think of the relativistic momentum as pi. As a result one may think of energy E due to containing a potential as something that can have an arbitrary constant added to it, but would think of momentum as something that can not. This superficially seems to draw a distinction between time and space, as energy corresponds to a time element and momentum corresponds to spatial elements. However here where the distinction between momentum four-vectors is made, one finds that there is no such distinction between time and space. This is because it is in Pm that such an arbitrary constant can be added to the potential, and it is also in that four-vector that such arbitrary constants can be added to the spatial components of the vector potential. One can do this so long as one demands that they transform as the coordinates do, as a four-vector.

The special relativistic definition for the mass of a particle given those relations is

m2c2 = |hmn[Pm - (q/c)fm][Pn - (q/c)fn]|

(3.1.4a)

or

m = [(ER/c2)2 - (p/c)2]1/2 = E0/c2

(3.1.4b)

This could just as well be expressed as

m2c2 = |hmnpmpn|

(3.1.4c)

In the second we define E0 as the relativistic energy evaluated at zero velocity, E0 = ER|v=0. Magnitude bars are included above merely so that the choice of sign convention for the metric's signature is arbitrary.

Though all 3.1.4 are equivalent given the above relations, the definition in terms of the momentum four-vector of the second kind equation 3.1.4a is preferable in quantum mechanics discussions because that is what yields relativistic quantum mechanics. For example, when one replaces the elements of the momentum four-vector of the second kind with the energy and momentum operators of quantum mechanics, and operates that on the wave function it yields the Klein-Gordon equation with the inclusion of a nonzero vector potential

hmn[Popm - (q/c)fm][Popn - (q/c)fn]Y = m2c2Y

(3.1.5a)

which can also be written

[(H - f)2 - (Popc - qf)2]Y = m2c4Y

(3.1.5b)

where and

 

The above definition of mass 3.1.4b, that a mass is rest energy m = E0/c2 , E0 = ER|v=0. or that its is center of momentum frame relativistic energy m = Ecm/c2, Ecm = p0cmc|vcm=0, in the case of a system of particles, is the definition that we will use throughout the rest of the special relativity site where ever the letter m or the word mass is used unqualified. This is the m that goes into the relativistic version of Newton's second law in the form

Fl = mAl

(four-force = mass times four-acceleration see - Eqn 3.2.3)

This mass is an invariant. It does not change with speed! Equations 3.1.4 is called the mass-shell condition, because they are of isomorphic form to the equation of a spherical shell. Under the above definition of mass, a photon does not have mass. Due to quantum mechanical issues, virtual particles do not tend to have the expected value of energy for a given momentum. So sometimes it is said a particle lays off shell.

The coordinate velocity of a particle is simply given by

um = dxm/dt

(3.1.6)

We write Four-Vector Velocity or Proper Velocity

Um = dxm/dt

(3.1.7)

where t is called proper time, which is just time according to the frame of the particle at its location.

Through time dilation we can relate the two

Um = gum

(3.1.8)

Consider the following expression

hmnmUmmUn

 

3.1 Mass, Energy and the Universe's Speed Limit 23

hmnmUmmUn = [(mU0)2 - (mU1)2 - (mU2)2 - (mU3)2]

hmnmUmmUn = m2[(dct/dt)2 - (dx/dt)2 - (dy/dt)2 - (dz/dt)2]

hmnmUmmUn = m2(dt/dt)2[c2 - (dx/dt)2 - (dy/dt)2 - (dz/dt)2]

hmnmUmmUn = m2g2c2(1 - v2/c2)

hmnmUmmUn = m2c2g2g -2

hmnmUmmUn = m2c2

Next we refer to the definition of mass 3.1.4c to arrive at

hmnmUmmUn = hmnpmpn

Final examination of this reveals the relation between four-vector velocity and the four-vector momentum of the first kind.

pm = mUm

(3.1.9)

We can then from equation 3.1.8 discover the relation between four-momentum and coordinate velocity for massive particles

pm = gmum

(3.1.10a)

or write the relation for particles that may or may not have mass

pm = (ER/c)(um /c)

(3.1.10b)

The g term is physically associated to the velocity term through time dilation. In the past a few physicists starting with Planck, Lewis, and Tolman, not Einstein, have miss-associated the g term with the mass defining a new kind of mass

M = gm ¬ Bad

(3.1.11)

This M is then inappropriately called "relativistic mass". In the absence of a potential, the zeroth element of the momentum four-vector is defined as the energy divided by c, resulting in

p0 = Mu0

E/c = Mc

M = E/c2 ¬ Bad

(3.1.12)

Though much more complicated in the long run, the math is consistent and leads to consistent predictions concerning observation and so one might argue that the physics is therefor correct. But, in keeping with Occam's razor this definition and method must be done away. The m in this method is then inappropriately qualified and called the "rest mass". It is wrong to do this for the following reason. Calling m the "rest mass" infers to the listener that m is not the mass according

 

24 Chapter 3 SR Dynamic Implications

to other frames for which it is not at rest. We have already noted that m is an invariant as it is the same value as calculated according to any frame. It is not just the value for the rest frame. The relativistic mass method also leads to many erroneous conclusions. By that method light has zero "rest mass". For one of many examples, it has been argued that since light is not at rest in any frame, that the question of whether it has mass at rest or "rest mass" is unanswerable. No. m = 0 is observed as the contraction of a photon's four-momentum according to any frame, not just the "rest frame".

 

In short the terms "relativistic mass" and "rest mass" need to be done away and the real mass m which is actually observed is an invariant. It does not change with speed. Also, by this, the physically correct definition a photon, or anything that travels at the Lorentz invariant speed c, has zero mass.

We have

p0 = ER/c.

We have also demonstrated the relation between Four-Momentum and Four-Velocity Eqn 3.1.9 resulting in

p0 = mU0.

Putting these together we have

ER/c = mU0

ER/c = m(dct/dt)

ER = (dt/dt)mc2

ER = gmc2 ¬ Good

(3.1.13)

This is the mass - relativistic energy relationship for a massive particle. Now this energy does not go to zero as v goes to zero so we see that a massive particle still has energy even when it is at rest. This tells us that mass is equivalent to rest energy meaning relativistic energy at zero velocity

E0 = ER|v=0 = mc2 ¬ Good

(3.1.14)

 

3.1 Mass, Energy and the Universe's Speed Limit 25

The kinetic energy of a particle is the amount of energy that is associated with its motion only. Therefor

EK = ER - E0

(3.1.15a)

This results in

EK = (g - 1)mc2

(3.1.15b)

The stress energy tensor is a tensor that contains information about the density of energy, momentum, stresses, etc.. contained in the space. The energy tensor mass alone is Eqn. 5.1.4

Tmn = r0UmUn

The T00 component of this is

T00 = (dt/dt)2r0c2

r0 is the mass density according to a frame moving with that bit of mass, but because of special relativistic Lorenz length contraction on the local mass the coordinate frame mass density is then

r = (dt/dt)r0

So this becomes

T00 = (dt/dt)rc2

But this is just the coordinate frame energy density.

The simplest consistent general relativistic definition of coordinate frame energy density is then just

coordinate frame energy density º T00

(6.3.6)

For more general stress-energy tensors it is common to define r0 as

r0 = TmnUmUn /c4

(3.1.16)

If r0 is to be positive then TmnUmUn must be greater than 0. For this not to be the case is called a violation of the weak energy condition. More generally speaking a the weak energy condition is

TmnVmVn ³ 0

(3.1.17)

for any timelike vector Vm . Matter may only violate this condition within limits set by the Pfenning inequality. See-

http://xxx.lanl.gov/abs/gr-qc/9805037

Other elements have other interpretations. For instance Tii is a flow of momentum per area in the xi direction or the pressure on a plane whose normal is in the xi direction. Tij is the xi component of momentum per area in the xj direction or describes a shearing from stresses. T0i is the volume density of the ith component of momentum flow.

Next we will discuss the concept of system mass. We have seen that for a single particle mass is equivalent to rest energy. Eqn 3.1.14

E0 = mc 2.

For a system of particles the best concept for system mass m is defined as center of momentum frame energy Ecm.

Ecm = mc2.

(3.1.18)

The system mass does not turn out to be equal to the total or sum of masses mtot of the constituent parts. Instead it is the total energy summed for all of the constituent parts according to the center of momentum frame.

 

Consider for a moment a Lorenz invariant for the system consistent with the mass shell condition.

 

First define

psys' = [Ecm/c , 0, 0, 0]

(3.1.19)

as a four-element vector for the inertial center of momentum frame. Then define Psys as the Lorentz transform of this for any frame of interest.

psys = Lpsys'

(3.1.20)

Due to relative simultaneity psys as defined here is not always equal to the "simultaneous" sum of the four-momentum of the constituent parts when there are external forces acting at various locations on the system. The system mass is defined as the following invariant.

m2c2 = hmnpsysm psysn

(3.1.21)

or for the scenario described above,

m = [(ERsys/c2)2 - (psys/c)2]1/2 = Ecm/c2

(3.1.22a)

Considering the time element of equation 3.1.21 restores the relation

ERsys = gcmmc2

(3.1.22b)

Proof that this definition of system four momentum is the same as the sum of the four-momenta of the systems components goes as follows. Start with the sum of four-momentum for an arbitrary frame.

psys = Si pi

Lorentz boost to another frame

Lpsys = LSi pi

Interchange sum and transformation symbols

Lpsys = Si Lpi

The Lorentz transform of each four-momentum is the four momentum according to the new frame

Lpsys = Si pi'

But the right side is the net four momentum according to the new frame

Lpsys = psys'

This proves that the system net four-momentum is indeed a four-vector itself and yields

ERsys = gcmmc2

where m is the system's mass and is its center of momentum frame energy as well as

p = gcmmucm

(3.1.22c)

and

m2c2 = hmnpsysmpsysn = ERsys2/c2 - psys2 = Ecm2/c2

The reason that the mass of 3.1.21/3.1.22a is not the same as the "total" of constituent masses, mtot , is that the sum of masses of the constituent parts does not always equal the center of momentum frame energy. For example, a system of massless particles have a zero mass shell condition when they all move the same direction while the system has a nonzero mass shell condition when they move in different directions. One advantage the definition of center of momentum frame energy for mass has over "total mass" is that by this definition, not only is mass an invariant, but this mass of a system is also conserved. Note that the concept of captive mass is equivalent to center of momentum frame energy m and not the total of masses mtot. In order to increase the system mass m one must increase the total center of momentum frame energy Ecm equivalently. This demonstrates that mass defined by m is conserved in the same way that energy is. Transferring energy from some external matter to change the center of momentum frame energy of an object will increase its individual system mass, but when you extend the system to include the matter from which the energy was transferred, it will always be found that center of momentum frame energy or system mass m is ultimately conserved.

A sum of invariants is also an invariant and so one could just as well write the total of masses mtot as a sum of the constituent parts. For a system of n particles this could be written

mtot = (hmnp1m p1n)1/2 + (hmnp2m p2n)1/2 + ... + (hmnpnm pnn)1/2

(3.1.23)

or

mtot = m1 + m2 + ... + mn

(3.1.24)

The subscript indicates the particle number. Again, the major problem with thinking of a system mass as this is that this total of masses is not conserved. However, part of the reason this is brought up is that people do tend to think that mass is that kind of sum. This leads to another missunderstanding as to what is meant by "mass to kinetic energy conversion". Consider for example a massive particle that decays into two massless photons. Because system energy is conserved and the system energy for the center of momentum frame is the system mass, the system mass did not change in the decay. What did change was that the energy initially was associated with rest, the rest energy of the particle, but finally was associated with motion, the kinetic energies of the photons. In that light one should really not say there is mass-energy conversion. The energy and system mass for the system is conserved. One should instead say that energy associated with the resting particles of the initial state becomes associated with the motion of the particles in the final state. Since this is cumbersome the term mass-energy conversion is used, but be wary that what it refers to is that a change in the sum of masses can be the cause of the change in kinetic energies of the remaining masses. Just remember that the system mass of a closed system doesn't change or "convert" into anything.

Sometimes it is more useful to define an invariant mass density instead. Just as there are two ways to describe the masses of a system above, m and mtot, there are two important ways of describing an invariant mass density. The first is the r0 in the following relation equation 5.1.4

Tmn = r0UmUn

This definition most closely corresponds to the mass m description of system mass above. It relates the stress energy tensor for matter composed of non-interacting constituents to the four-velocity of the unpressurized "fluid" at any given location.

 

The total energy for the system is conserved and could instead be defined by the following volume integral

Esys = ò ò ò T00dxdydz

(3.1.25a)

For the example stress energy tensor 5.1.4 this would become

Esys = ò ò ò gfluid2r0c2 dxdydz

(3.1.25b)

One can also instead define the system momentum from the next integral

psys = ò ò ò (T 0i ei/c) dxdydz

(3.1.26a)

For the example stress energy tensor 5.1.4 this would become

psys = ò ò ò (gfluid2r0uiei)dxdydz

(3.1.26b)

ei is a unit vector in the direction of the ith momentum component.

One then still can define the system's mass as the center of momentum frame energy. It is the energy for the frame according to which

psys = 0. So we still have

Ecm = mc2.

 

The other invariant mass density concept corresponding to the total of masses mtot would be

rTot = hmnTmn/c2

(3.1.27)

This kind of mass density is an invariant, but its volume integral is not conserved. This kind of mass is what is meant when one refers to a field such as the electromagnetic field as a massless field, or when one refers to any system as massless. This is zero for any system of massless particles.

Of the two descriptions of system mass, the mass m concept is far more useful.

Refer to Eqn 3.1.11

ER = gmc2

where g was given by

g = (1 - v2/c2)-1/2.

Notice that the energy becomes divergent at v = c for nonzero mass. Thus no matter how hard or how long you push on a mass, you can never impart enough energy to it so that it reaches the speed c. The only way such a thing can travel at the speed c and still have finite energy is if it had zero mass. In that case, instead of a mass energy relation, there is a energy momentum relation from Eqn 3.1.5 resulting in,

E = ER = pc,

(3.1.28)

where E and p are related to frequency and wavelength.

One might consider the case of a particle that instead of being pushed beyond the speed c, moves faster than c upon its creation. Such a hypothetical particle is called a tachyon. Notice that if v is greater than c then g is imaginary. Since imaginary energy makes no physical sense, we would expect that the mass would also have to be imaginary so that the energy(and momentum) would be real.

The primary problem with the existence of such a particle is that it could be use to violate the principle of causality. The principle of causality is simply the statement that effect never precedes cause. Imagine setting up a tachyon emitter and a tachyon receiver at different points along an S frame x axis. Lets say that the signal travels arbitrarily fast so that the event of transmission and the event of reception are virtually simultaneous. Next recall that events simultaneous in one frame are not all simultaneous in other frames. We could then easily pick a frame to look at the situation in which the event of reception precedes the event the event of transmission. This is a violation of causality.

Worse yet, it then leads to grandfather paradox's. The grandfather paradox is the idea that a time traveler goes back in time and kills his grandfather before his father was conceived. To set up a grandfather paradox with tachyons we simply set the receiver in motion away from the

 

26 Chapter 3 SR Dynamic Implications

transmitter and give it a relay transmitter. We call the frame in which it is stationary the S' frame. We also connect a receiver to the S frame transmitter. We then program the S frame transmitter so that in say 1hr it will send a signal unless it's receiver receives a signal. To begin lets say that it receives no signal and so it sends one. The signal arrives at the relay, which is moving away and sets off the relay transmitter. Now the relay transmitter sends a return signal, but the return signal travels back to the S frame receiver/transmitter setup virtually instantaneously according to the relay's S' frame. Due to the Lack of simultaneity between the frames the return signal will be received back at the S frame transmitter at a time prior to the original transmission. But because we programmed it not to send a signal if it receives one it will now not send a signal. But then there is no signal to receive and so it sends one......

It is sometimes said that special relativity says that nothing can travel faster than the speed of light. As discussed, what it really implies is that long as we restrict our physics to special relativity, and we wish to preserve causality, information can not travel faster than c. Likewise, as long as we restrict our physics to special relativity, nothing with mass can travel at the speed c.

There have been experiments done in which the physicists involved say that they have indeed been able to get electromagnetic waves to propagate information faster than c through a dispersive medium.

In particular the controversy is over gain assisted faster than c group velocity transmission demonstrated in anomalous dispersive media.

http://www.iitk.ac.in/infocell/Archive/dirjuly3/science_light.html

If their claims that it was the "information" that has indeed been transferred at faster than c speeds are correct, then SR implies that we can find a frame according to which the reception of a signal at one end of the apparatus precedes the transmission of the signal at the other end. This would indeed be a causality violation and brings to question the validity of the principle of causality and causes us to reevaluate the (im)possibility of a physical grandfather paradox. However, it is conceivable that the universe may be structured in such a way that such a causality violation is attainable, but that grandfather paradoxes will still not be allowed. For example, if one of their dispersive media faster than c experiments were devised in attempt to simulate the relay-transmitter paradox discussed above, one could hypothetically set such a receiver in motion such a medium, but that medium is what determines the speed of the electromagnetic waves according to its rest frame. A relay transmitter sending the signal through the same medium would not end with the signal arriving at a time prior to transmission.

The following is an explanation why the experiments are not completely convincing of faster than c "information" transfer.

As an example, in a 6.0cm medium a laser pulse has been transmitted that transversed the distance at a speed of 310c. This is a group wave speed, not a phase wave speed. The below figures are a recreation representative of a receiver's data for two pulses. The blue dotted curve represents the intensity Vs time curve for the reception of the 310c speed pulse and the red curve represents the intensity Vs time curve for the arrival of a c speed pulse sent at the same time.

Figures 3.1.1a,b

One can see on the close up second graph from the horizontal shift that the 310c curve arrived 62ns earlier than the c speed pulse. The question then arises whether this experiment is an example of a causality violation. In order for causality to be violated one must have faster than c "information" transfer. Under typical "long" transmissions one can consider the information transfer speed to be the group wave speed or the speed of the energy carried by the pulse. It has long been known that phase wave speeds often exceed c which is why it is often pointed out that the energy transmission in ordinary wave-guides occurs at the group speed which is less than c. As such, the information transfer speed is less than c in ordinary wave guides. The reason that the group speed exceeding c for this experiment is not convincing of faster than c information transfer and the reason why the information transfer for this experiment can not be taken to be the group speed is because of the following. Notice that the 62ns time shift between the two pulses is much less than the time it takes to receive the entirety of a pulse itself. Thus the time it took from the time the pulse began to enter the medium until the time the receiver read the entirety of the information was the sum of group transfer and read times. The full width at half-max FWHM of a pulse here is approximately 4.0ms. Take that to be read time. The group transfer time was

6.0cm/310c = 0.65ps. Taking the information transfer time to be the sum of these, approximately just 4.0ms, one finds that the information speed was

6.0cm/4.0ms = 5.0x10-5c, a mere measly fraction of the vacuum speed of light. There are two ways one might modify this experiment so that if successful it would clearly demonstrate faster than c information transfer. First, one might made the medium of transfer much longer so that in the information transfer time it is the read time that is insignificant instead. The reason that this may be an impossible task is that due to the dispersive nature of the media itself, even with the gain assistance, there will be a trade off between the signal degradation and length. There may be a limiting trade off so that in a long enough transmission line so that the information transfer time yields a faster than c speed the signal would have been lost. Second, one might try to significantly narrow the pulse so that the information transfer time is approximately just the group transfer time. The reason that this may be an impossible task two fold. There is a narrow frequency range at which the light must be sent through the medium in order for it to transfer with a faster than c group speed. This in itself puts a limit on how narrow the pulse may be. Also, the narrower the pulse is made the more rapidly it will tend to widen itself as it travels across the medium. By the time it gets to the other end the read time will always be longer than the send time so no matter how short the send time is made one will have to contend with a longer read time.

In conclusion, though such experiments do successfully demonstrate faster than c group transfer, they do not conclusively demonstrate the faster than c information transfer, which they would have to in order to show a causality violation.

Exercises

Problem 3.1.1

Consider monochromatic plane waves of light with a wavelength of 500nm. What is the frequency of the light?

Problem 3.1.2

a. How much kinetic energy does a 75 kg man have if he is traveling to another star at (4/5)c ?

b. Though "system mass" i.e. center of momentum frame energy is conserved in an isolated reaction, the sum of masses is not. The sum of kinetic energies of a system's parts after a reaction is the sum of kinetic energies of the system's parts before the reaction plus the difference in the sum of masses that the system decreased by through the reaction. Changes in binding energies of interacting particles in the system are typically accounted for in the sum of masses. For example a hydrogen atom in a system is least massive in its ground state. So, such a reaction is sometimes called a "mass to kinetic" energy conversion. How much of the sum of masses of a system would have to be annihilated in order to produce the same amount of kinetic energy as the man had in part a? (An atomic bomb "converts" about the mass of a coin to kinetic energy in its explosion)

Problem 3.1.3

Show that if the speed of an object is v = c(1 - e) where e is an extremely small number for a high speed, that e can be approximated by

Problem 3.1.4

(Modified from Misner, Thorne, and Wheeler's Gravitation problem 5.4)

Consider a system with v << c.

a. Use the Lorentz transformation property of a tensor to show that

T 0i = Sk i i kr0 ukc

where i i k is defined by

i i k r0c2 = T ' 00 d i k + T ' i k

Note- r 0 is defined in equation 3.1.16 for general stress-energy tensors.

b. Derive the equations of part a. from Newtonian considerations plus the conservation of relativistic energy. (Hint: The total energy carried past the observer by a volume of the medium V includes both the rest energy T ' 00 V and the work done by forces acting across the volume's face as they push the volume of the medium through a distance.)

c. Refer to equation 3.1.26 a. Considering that force is a change in momentum with respect to time, this results in a force per unit volume f i of

f i = dT 0i/dct

Which for constant i i k results in

f i = Sk i i k r0duk/dt

This equation suggests that one call r0Sk i i k an inertia density matrix. What does this become for the special case of a perfect fluid?

d. Consider an isolated, stressed body at rest and in equilibrium (T mn,0 = 0) according to the laboratory frame. Show that its total inertia matrix I i k (not moment of intertia) defined by

I i k = ò ò ò i i k r0 dxdydz

is isotropic and results from the mass or center of momentum frame energy of the body

:

I ' i k c2 = d i kò ò ò T ' 00 dx'dy'dz'

 

Problem 3.1.5

Show that if v << c

EK » (1/2)mv2

Problem 3.1.6

What is the speed, frequency, and wavelength of a 2Mev proton?

Problem 3.1.7

According to problem 3.1.4 combining the results of parts c and d results in an isolated stressed body in equilibrium obeying Newton's second law at low speeds

f i = ò ò ò f i dxdydz = ò ò ò (T ' 00/c2) dx'dy'dz'dui/dt

which can be expressed by the four-vector equation F = mA. This indicates that isolated stressed bodies in equilibrium will obey all the same dynamics as relativistic individual particle dynamics and must therefor obey

E = gcmEcm. Consider a box containing a photon gass. The interior wall lengths will be L,W, and H. And the wall thicknesses will all be da.

The proper frame stress-energy tensor of the photon gass is

The proper frame stress energy tensor for two of the walls whose surface normal is (anti)parallel the direction of motion has stresses in the wall both from being pushed on by the gass pressure normally and from being stretched from the edge attachments to the other walls that are also being pushed on by the gass. The average x-x component of the stress energy tensor for these two walls would be proportional to the gass pressure on them. i.e.

T ' xx = e T ' xx/3

Assuming a material for which e~1 the proper frame stress-energy tensor consistent with the entire system being an isolated stressed body in equilibrium for those two walls will be

 

The proper frame stress energy tensor for two of the four of the walls whose surface normal is perpendicular the direction of motion is

The proper frame stress energy tensor for the other two of the four of the walls whose surface normal is perpendicular the direction of motion is

Lorentz transform the stress energy tensors and integrate T 00 throughout the volume in order to show

E = gcmEcm

Don't forget length contraction.

Answer

Problem 3.1.8

For a particle in a four vector potential fm, the mass will be defined as the following invariant m in:

m2c2 = gmn[Pm - (q/c)fm][Pn - (q/c)fn]

where q is the charge reacting to that field.

In special relativity this results in

(E - qf)2 = (Pc - qf)2 + m2c4

If one were to replace the energy and momentum with their operators and operate this on a wave function the result would be second order differential equations. In order to get a first order differential equation one could take the square root of the equation if the right hand side could be expressed as a perfect square. As it stands one can not do this, but consider the expression multiplied by the 2x2 identity matrix I of 2x2 identity matrices s0.

(E - qf)2I = (Pc - qf)2I + m2c4I

Now one can hope to find a perfect square for the right hand side in the form

(E - qf)I = F×(Pc - qf) + Gmc2

where the three elements of F are Fi, and where Fi and G are 2x2 martices of 2x2 matrices.

Try inserting

and

where si is any one of the three conventional Pauli matrices and verify that this yields

(E - qf)2 = (Pc - qf)2 + m2c4

This results in a hamiltonian of

HDirac = F×(Popc - qf) + Gmc2 + qfI

(often multiplied identity matrices are suppressed and identity by itself written as just 1)

and the Dirac equation for a nonzero potential:

_______________________________________________________________________________________ 

3.2 The SR Dynamical Equations 27

In special relativity we define a four-vector force as

Fl = dpl/dt

(3.2.1)

For a particle with mass we have

pl = mUl

The Acceleration Four-Vector for special relativity is given by

Al = dUl/dt

(3.2.2)

and so we can write the relativistic version of Newton's second law as

Fl = mAl

(3.2.3)

28 Chapter 3 SR Dynamic Implications

Considering an inertial frame according to which the test mass is instantaneously at rest it is easy to show that

hmnAmUn = 0

(3.2.4)

which yields

hmnFmUn = 0

(3.2.5)

This serves as the work energy theorem for modern special relativity.

Consider where hmnFmUn = 0 leads:

hmnFmUn = 0

hmn(dpm/dt)Un = 0

g2hmn(dpm/dt)un = 0

hmn(dpm/dt)un = 0

(dp0/dt)u0 - (dp/dt)× u = 0

u0 = c and p0c = ER so

(dER/dt) - (dp/dt)× u = 0

dER = (dp/dt)× dx

dER = dEk so

òdEK = ò(dp/dt)× dx

W = ò(dp/dt)× dx

If we define another kind of force that is not a four-vector which we will call ordinary force as

f = dp/dt

(3.2.6)

then we arrive at

W = òf×dx

(3.2.7)

and there we see the work energy theorem of pre-modern relativistic physics.

So Newton's second law for special relativity in terms of ordinary force is 3.2.6

f i = dp i/dt

Using this force definition has its purposes, but in a lot of ways thinking of relativistic physics in terms of nontensor quantities very much complicates things. For example let us work out the relation between ordinary force and coordinate acceleration.

we can write 3.2.6 as

f i = m(d/dt)(dx i/dt)

We then define al by

al = (d/dt)(dxl/dt)

(3.2.8a)

Which for acceleration in the direction of motion results in

a = g3a

(3.2.8b)

and for that case of motion a will be equal to the proper acceleration A' which is the acceleration as observed from a frame according to which the particle is instantaneously at rest. The magnitude of this can be calculated from any frame as it is an invariant, |A'| = (-hmnAmAn)1/2. This is the amount of acceleration "felt" by the accelerated observer and according to the inertial frame in which the accelerated or "proper frame" observer is instantaneously at rest a = A'. We define a according to 3.2.8 as well because it is useful. a as calculated from any inertial frame according to which the force is in the direction of the motion turns out to be equal to the proper acceleration.)

This also restores a Newtonian form

f i = ma i

 (3.2.9)

( Note also - When the force is in the same direction as the motion, then the force felt by the object being pushed is equal to the ordinary force. In that case we have F' l felt = f l = mal )

3.2 The SR Dynamical Equations 29

we can eliminate dt, in terms of dt from time dilation

f i = m(d/dt)(gdx i/dt)

Use of the chain rule and simplification results in

f = gm[a + g2u(u×a)/c2]

(3.2.10)

where al is the coordinate acceleration.

 

The four-vector force for special relativity is sometimes called the Minkowski force and is related to the electromagnetic field tensor Fmn

(3.2.11)

by

Fl = qhmn(Um/c)Fnl

(3.2.12)

From this we can work out the relation between the components of the electromagnetic field, the coordinate velocity and the ordinary force, which yields

f = q(E + u´B)

(3.2.13)

In the case that the force is in the direction of the motion 3.2.10 yields,

f i = g3mai

(3.2.11)

Note that no matter what finite value the ordinary force is, as u approaches c, g diverges and so the acceleration must vanish.

We expect this as nothing with mass can be pushed all the way up to the speed of light.

We have seen how c is a speed limit for the universe. Because of this, we must answer a question concerning velocity addition. Lets say an S' frame observer observes an object at speed u'. An S

 

30 Chapter 3 SR Dynamic Implications

frame observer observes the S frame observer to be moving at speed v. u' can be any value less than c and v can be any value less than c. People tend to come to the wrong conclusion that the S frame observer observes that the object moves at a speed u = u' + v and that this speed should therefor be any speed less than 2c. They are using the wrong velocity addition formula. Consider the following Lorentz coordinate transformation equations in differential form.

dx = g(dx' + bdct')

and

dct= g(dct' + bdx')

To obtain the correct velocity addition equation divide equations and simplify.

dx/dct = [g(dx' + bdct')]/[g(dct' + bdx')]

simplified

dx/dt = [dx'/dt' + v]/[1 + (dx'/dt')v/c2]

Now making the replacements u = dx/dt and u' = dx'dt' we arrive at

u = (u' + v)/(1 + u'v/c2)

(3.2.12)

This is the correct equation to use for that velocity addition. u' and v can be any values less than c but the result will always be that the speed of the object according to the S frame, u , will always be less than c. One can also use the same method to find the velocity addition equations for the case that the object moves in the y or z directions.

Exercises

Problem 3.2.1

Find hmnUmUn, hmnUmAn, and hmnUmFn, hmnpmpn using the fact that the contraction of any tensor is an invariant. How is the last related to the definition of mass? How is the result for hmnUmFn related to the time derivative of the work energy relation, (dp/dt)×v = dKE/dt?

Problem 3.2.2

Show that the momentum for a charged particle in cyclotron motion or orbital radius R in a magnetic field is given by p = qBR.

Problem 3.2.3

Use Eqn 3.2.10 to show that the motion of a charged particle in an electromagnetic field is given by

a = (q/m)g -1[E + u´B - (u/c)(u×E/c)]

Problem 3.2.4

Refer to Eqn 3.2.2 and show that Am = Un(Um,n)

Problem 3.2.5

Consider an S' frame moving in the x direction with speed v with respect to an S frame.

Show that the ordinary force on a particle moving with velocity u with respect to S transforms between the two frames according to

Problem 3.2.6

Consider the ordinary force as given by the column vector

and the coordinate acceleration given by the column vector

.

Show that according to equation 3.2.10, these can be related by

if we define *matrix mass* by

________________________________________________________________________________________

3.3 Rotations, Rockets, and Frequency Shifts 31

We have shown that velocities do not add linearly in special relativity. For motion along one direction velocities were adding nonlinearly according to Eqn 3.2.12

u = (u' + v)/(1 + u'v/c2)

Rapidity q as a function of v is given by

tanhq = v/c

(3.3.1)

This definition is useful as it simplifies much of dynamics equations. It does this because, unlike velocity, rapidity does add linearly.

qu = qu' + qv

Recall the Lorentz transformation matrix

Rapidity also has the following relations to g and gb

g = coshq

(3.3.2a)

gb = sinhq

From these, the Lorentz transformation matrix becomes

(3.3.2b)

Comparing this to an ordinary rotation matrix makes it clear why Lorentz transformations can be thought of as a rotation in space-time. At this point the relation between Lorentz transformation and rotation may still seem to be a superficial one, but once one becomes familiar with spinor calculus a much more intimate relation is revealed.

 

32 Chapter 3 SR Dynamic Implications

Here we will derive and discuss the implications of single stage relativistic rocket equations. The non-relativistic rocket equation is

Dv = vexln(mi/m)

(3.3.3)

This gives the change in velocity Dv a rocket undergoes accelerating in one direction given a measure of exhaust speed vex which is a constant and the initial mass of the rocket mi and the final mass of the rocket m after some of the ships mass in fuel has been burnt off.

 

The relativistic version of this equation in terms of rapidity q is similar

Dq = (vex/c)ln(mi/m)

 (3.3.4)

The speed of the rocket is then calculated from the rapidity Eqn 3.3.1.

v = ctanhq

Notice that since tanhq < 1 for any q , v is always less than c no matter how much of the ships mass is burnt off as fuel and no matter how fast the exhaust speed is. We can even consider tachyon exhaust where vex > c and yet the rocket still never reaches the speed of light.

 To derive 3.3.4 Start with conservation of momentum and energy relating the initial and final states of the rocket and exhaust for a small element mfex burned off.

gmv = (m + dm)[gv + d(gv)] + mfexgfexvfex

gmc2 = (m + dm)(g + dg)c2 + mfexgfexc2

Simplified

0 = gvdm + md(gv) + mfexgfexvfex

0 = gdm + mdg + mfexgfex

Eliminate mfexgfex

0 = gvdm + md(gv) - (gdm + mdg)vfex

Insert relativistic velocity addition

0 = gvdm + md(gv) - (gdm + mdg)[(v - vex)/(1 - vvex/c2)]

Simplify

0 = [gvdm + md(gv)] (1 - vvex/c2) - (gdm + mdg)(v - vex)

Switch variables to rapidity

0 = [sinhqdm + md(sinhq)] [1 - tanhq(vex/c)]c - [coshq(dm) + md(coshq)](tanhq - vex/c)c

Simplify

0 = [sinhqdm + mcoshqdq] [1 - tanhq(vex/c)] - [coshqdm + msinhqdq](tanhq - vex/c)

0 = sinhqdm + mcoshqdq - (sinhqdm + mcoshqdq)tanhq(vex/c) - (coshqdm + msinhqdq)tanhq + (coshqdm + msinhqdq)(vex/c)

 

0 = sinhqdm + mcoshqdq - (vex/c) sinhqtanhqdm - (vex/c)msinhqdq - sinhqdm - msinhqtanhqdq + (vex/c)coshqdm + (vex/c)msinhqdq

0 = mcoshqdq - (vex/c) sinhqtanhqdm - msinhqtanhqdq + (vex/c)coshqdm

0 = (mcoshq - msinhqtanhq)dq + (vex/c)(coshq - sinhqtanhq)dm

0 = m(cosh2q - sinh2q)dq + (vex/c)(cosh2q - sinh2q)dm

0 = mdq + (vex/c)dm

dq = -(vex/c)dm/m

After integration equation 3.3.4 is obtained

Dq = (vex/c)ln(mi/m)

Now consider the ships proper acceleration for motion in one direction refer to equation 3.2.8b

a = g3a = cosh3qdv/dt = cosh3q(dv/dq)(dq/dm)(dm/dt')(dt'/dt)

a = cosh3q(csech2q)(-(vex/mc))(dm/dt')sechq

a = (vex/m)(dm/dt')

If the proper acceleration is kept constant then integration results in

aDt'/c = (vex/c)ln(mi/m) = Dq

Consider initial conditions of v = 0 at t = t' = 0.

at'/c = q

If the rocket starts at rest and is run at a constant proper acceleration a , then the equation can be written

v = ctanh(at'/c)

(3.3.5)

These initial conditions also result in

v = ctanh[(vex/c)ln(mi/m)]

(3.3.6a)

equivalently

(3.3.6b)

Inverting these results in

mi/m = exp[(c/vex)tanh-1(v/c)]

(3.3.6c)

equivalently

mi/m = [g(1 + b)]c/vex

(3.3.6d)

Running it at a constant proper acceleration also results in

b = tanh(at'/c)

(3.3.7a)

g = cosh(act'/c2)

(3.3.7b)

gb = sinh(act'/c2)

(3.3.7c)

 

3.3 Rotations, Rockets, and Frequency Shifts 33

Using the Lorentz like transformation equations

ct = òct'gdct' + gbx'
x = gx' + òct'gbdct'
y = y'
z = z'

(3.3.8)

Results in a good global coordinate transformation from the accelerated ship frame to an inertial frame. Take the inertial frame to be the one in which it starts instantaneously at rest at t' = 0 and these become

ct = (c2/a + x')sinh(act'/c2)
x = (c2/a + x')cosh(act'/c2) - c2/a
y = y'
z = z'

(3.3.9)  

 There is a difference between what frequency one observes as being emitted from a source and what frequency an observer actually sees as coming from the source. This is true even in nonrelativistic physics. For instance, as a car drives past you will hear a shift in the tone of the engine as it goes from coming toward you to going away from you. This is the frequency you hear. You may use the ordinary Doppler shift formula with the speed it was traveling to then extrapolate what frequency it really emits according to your coordinate frame. This is the frequency you observe. The relativistic Doppler shift formula is really the same thing as the ordinary Doppler shift formula except that it is usually written in terms of the source frame's emitted frequency instead of the observed emitted frequency. Its just that in the nonrelativistic case these are the same. In relativistic Doppler shift, you accounts for the fact that due to time dilation the frequency you observe to be emitted is different then the frequency according to the frame of the object.

 

If you are at rest with respect to the medium of propagation for a wave, then the ordinary Doppler shift formula is

f = f0/[1 + (v/c)cosq ]

(3.3.10)

In terms of sound we make the following relations.

f is the frequency you hear (for instance if this was sound).

 

34 Chapter 3 SR Dynamic Implications

f0 is the frequency you observe have been emitted according to your frame at the time of the emission. It is the transverse or

q = p /2 frequency for f.

v is the speed the emitter travels with respect to the medium of the waves at the time the wave was actually emitted.

c is the speed of the waves of the medium with respect to the medium.

q is the angle it was traveling off of strait away from you at the time the heard frequency was actually emitted.

 

Relativistic Doppler shift IS ordinary Doppler shift. This formula happens to stand correct for the relativistic Doppler shift of light with the following adjustments to the relations.

 

f is the frequency you see.

f0 is the frequency you observe to have been emitted according to your frame at the time of the emission. It is the transverse or

q = p /2 frequency for f.

v is the speed the emitter travels with respect your frame at the time the light was actually emitted.

c is the Lorentz invariant vacuum speed of light.

q is the angle it was traveling off of strait away from you at the time the heard frequency was actually emitted according to your frame. <--- The angle is different according to the other frame and so use of the other frames angle changes the form of the equation.

Now to write it in terms of the frequency emitted according to the frame of the object we start by writing the periods according to the two frames in terms of time dilation.

T0 = T0'(1 - v2/c2)-1/2

We then relate frequency to period.

f0 = 1/T

f0' = 1/T0'

 

 3.3 Rotations, Rockets, and Frequency Shifts 35

Putting these together results in

f0 = f0' (1 - v2/c2)1/2

Inserting this into the Doppler shift formula results in

f = f0' (1 - v2/c2)1/2/[1 + (v/c)cosq ]

(3.3.11)

The wavelength of the light will be l = c/f , or...

l = l 0' [1 + (v/c)cosq ]/(1 - v2/c2)1/2

(3.3.12)

Next consider the case that the object travels strait toward the observer. q = p . Then after algebraic simplification these becomes

f = f0 ' [(c + v)/(c - v)]1/2

(3.3.13)

l = l 0' [(c - v)/(c + v)]1/2

If the object traveled strait away from the observer q = 0, it would have become

f = f0 ' [(c - v)/(c + v)]1/2

(3.3.14)

l = l 0' [(c + v)/(c - v)]1/2

Exercises

Problem 3.3.1

Consider an ideal matter/antimatter rocket with vex = c. What is the mass ratio mi/m in order to reach (4/5)c? Even if it were technologically possible to make that much, think about how much anti-matter becomes a serious safety concern.

Problem 3.3.2

Consider an ideal matter/antimatter rocket with vex = c. What is the mass ratio in order to accomplish the round trip of problem 1.2.4 ?

Problem 3.3.3

What wavelength do you see from a 557nm proper frame source approaching at (5/13)c?

Problem 3.3.4

a. Show that 3.3.6a yields 3.3.6b

b. Show that inverting 3.3.6b yields 3.3.6d

Problem 3.3.5

Show that the relativistic Doppler formula for a source traveling toward an observer equation 3.3.13 can be written as

v = ctanh[ln(f/f0')]

or

f/f0' = exp[tanh-1(v/c)]

and the case of the source traveling away from the observer 3.3.14 can be written

v = ctanh[ln(f0'/f)]

or

f0'/f = exp[tanh-1(v/c)]

With these forms one can punch numbers easier on a scientific calculator.

Problem 3.3.6

Show that the position of the ship undergoing constant proper acceleration considered in equations 3.3.9 is given as a function of proper time by

x = (c2/a)[cosh(at/c) - 1]

a. Use 3.3.9 to show that the wrist watch time or proper time for the ship frame observer t describing events at the location of the ship is related to coordinate time t by

at/c = sinh(at/c)

b. Use the results of "a" and 3.3.5 to show that

v = at/[(1 + a2t2/c2)1/2]

and look at the limiting behavior of v for both small and large times for both the velocity equation here and 3.3.5.

c. Show that the position of the ship undergoing constant proper acceleration considered in equations 3.3.9 is given as a function of proper time by

x = (c2/a)[cosh(at/c) - 1]

and use the results of "a" to show that

(1 + ax/c2)2 - (at/c)2 = 1.

Note that this is the equation of a hyperbola.

Problem 3.3.7

A space ship of constant mass is somehow propelled by an external force. The ordinary force on the ship will be constant and will be in the direction of motion so one may use 3.2.11. Use this and 3.2.9 to show that

v = at/[(1 + a2t2/c2)1/2]

and use this result to show that

(1 + ax/c2)2 - (at/c)2 = 1.

Then compare the results to problem 3.3.6.