Einstein’s equation describing the equivalence of mass and energy is arguably the most famous equation in physics. But his beautifully elegant *derivation* of this formula (here is the English translation) from previously understood laws of physics is considerably less famous. (There is an amusing Far Side cartoon in this regard, with the punchline “squared away”, which you can find on-line by searching hard enough, though I will not link to it directly.)

This topic had come up in recent discussion on this blog, so I thought I would present Einstein’s derivation here. Actually, to be precise, in the paper mentioned above, Einstein uses the postulates of special relativity and other known laws of physics to show the following:

Proposition. (Mass-energy equivalence) If a body at rest emits a total energy of E while remaining at rest, then the mass of that body decreases by .

Assuming that bodies at rest with zero mass necessarily have zero energy, this implies the famous formula – but only for bodies which are at rest. For moving bodies, there is a similar formula, but one has to first decide what the correct definition of mass is for moving bodies; I will not discuss this issue here, but see for instance the Wikipedia entry on this topic.

Broadly speaking, the derivation of the above proposition proceeds via the following five steps:

- Using the postulates of special relativity, determine how space and time coordinates transform under changes of reference frame (i.e. derive the Lorentz transformations).
- Using 1., determine how the temporal frequency (and wave number k) of photons transform under changes of reference frame (i.e. derive the formulae for relativistic Doppler shift).
- Using Planck’s law (and
*de Broglie’s law*) and 2., determine how the energy E (and momentum p) of photons transform under changes of reference frame. - Using the law of conservation of energy (and momentum) and 3., determine how the energy (and momentum) of bodies transform under changes of reference frame.
- Comparing the results of 4. with the classical Newtonian approximations (and ), deduce the relativistic relationship between mass and energy for bodies at rest (and more generally between mass, velocity, energy, and momentum for moving bodies).

Actually, as it turns out, Einstein’s analysis for bodies at rest only needs to understand changes of reference frame at infinitesimally low velocity, . However, in order to see enough relativistic effects to deduce the mass-energy equivalence, one needs to obtain formulae which are accurate to second order in v (or more precisely, ), as opposed to those in Newtonian physics which are accurate to first order in v. Also, to understand the relationship between mass, velocity, energy, and momentum for moving bodies rather than bodies at rest, one needs to consider non-infinitesimal changes of reference frame.

*Important note*: Einstein’s argument is, of course, a physical argument rather than a mathematical one. While I will use the language and formalism of pure mathematics here, it should be emphasised that I am not exactly giving a formal proof of the above Proposition in the sense of modern mathematics; these arguments are instead more like the classical proofs of Euclid, in that numerous “self evident” assumptions about space, time, velocity, etc. will be made along the way. (Indeed, there is a very strong analogy between Euclidean geometry and the Minkowskian geometry of special relativity.) One can of course make these assumptions more explicit, and this has been done in many other places, but I will avoid doing so here in order not to overly obscure Einstein’s original argument.

— Lorentz transforms to first order —

To simplify the notation, we shall assume that the ambient spacetime S has only one spatial dimension rather than three, although the analysis here works perfectly well in three spatial dimensions (as was done in Einstein’s original paper). Thus, in any inertial reference frame F, the spacetime S is parameterised by two real numbers (t,x). Mathematically, we can describe each frame F as a bijection between S and . To normalise these coordinates, let us suppose that all reference frames agree to use a single event O in S as their origin (0,0); thus

(1)

for all frames F.

Given an inertial reference frame , one can generate new inertial reference frames in two different ways. One is by *reflection*: one takes the same frame, with the same time coordinate, but reverses the space coordinates to obtain a new frame , thus reversing the orientation of the frame. In equations, we have

if , then (2)

for any spacetime event E. Another way is by replacing the observer which is stationary in F with an observer which is moving at a constant velocity v in F, to create a new inertial reference frame with the same orientation as F. In our analysis, we will only need to understand infinitesimally small velocities v; there will be no need to consider observers traveling at speeds close to the speed of light.

The new frame and the original frame must be related by some transformation law

(3)

for some bijection . A priori, this bijection could depend on the original frame F as well as on the velocity v, but the principle of relativity implies that is in fact the same in all reference frames F, and so only depends on v.

It is thus of interest to determine what the bijections are. From our normalisation (1) we have

(4)

but this is of course not enough information to fully specify . To proceed further, we recall Newton’s first law, which states that an object with no external forces applied to it moves at constant velocity, and thus traverses a straight line in spacetime as measured in any inertial reference frame. (We are assuming here that the property of “having no external forces applied to it” is not affected by changes of inertial reference frame. For non-inertial reference frames, the situation is more complicated due to the appearance of fictitious forces.) This implies that transforms straight lines to straight lines. (To be pedantic, we have only shown this for straight lines corresponding to velocities that are physically attainable, but let us ignore this minor technicality here.) Combining this with (4), we conclude that is a linear transformation. (It is a cute exercise to verify this claim formally, under reasonable assumptions such as smoothness of . ) Thus we can view now as a matrix.

When v=0, it is clear that should be the identity matrix I. Making the plausible assumption that varies smoothly with v, we thus have the Taylor expansion

(5)

for some matrix and for infinitesimally small velocities v. (Mathematically, what we are doing here is analysing the Lie group of transformations via its Lie algebra.) Expanding everything out in coordinates, we obtain

(6)

for some absolute constants (not depending on t, x, or v).

The next step, of course, is to pin down what these four constants are. We can use the reflection symmetry (2) to eliminate two of these constants. Indeed, if an observer is moving at velocity v in frame F, it is moving in velocity -v in frame , and hence . Combining this with (2), (3), (6) one eventually obtains

and . (7)

Next, if a particle moves at velocity v in frame F, and more specifically moves along the worldline , then it will be at rest in frame , and (since it passes through the universally agreed upon origin O) must then lie on the worldline . From (3), we conclude

for all t. (8)

Inserting this into (6) (and using (7)) we conclude that . We have thus pinned down to first order almost completely:

(9)

Thus, rather remarkably, using nothing more than the principle of relativity and Newton’s first law, we have almost entirely determined the reference frame transformation laws, save for the question of determining the real number . [In mathematical terms, what we have done is classify the one-dimensional Lie subalgebras of which are invariant under spatial reflection, and coordinatised using (8).] If this number vanished, we would eventually recover classical Galilean relativity. If this number was positive, we would eventually end up with the (rather unphysical) situation of *Euclidean relativity*, in which spacetime had a geometry isomorphic to that of the Euclidean plane. As it turns out, though, in special relativity this number is negative. This follows from the second postulate of special relativity, which asserts that the speed of light c is the same in all inertial reference frames. In equations (and because has the same orientation as F), this is asserting that

for all t (10+)

and

for all t. (10-)

Inserting either of (10+) or (10-) into (9) we conclude that , and thus we have obtained a full description of to first order:

(11)

— Lorentz transforms to second order —

It turns out that to get the mass-energy equivalence, first-order expansion of the Lorentz transformations is not sufficient; we need to expand to second order. From Taylor expansion we know that

(12)

for some matrix . To compute this matrix, let us make the plausible assumption that if the frame is moving at velocity v with respect to F, then F is moving at velocity -v with respect to . (One can justify this by considering two frames receding at equal and opposite directions from a single reference frame, and using reflection symmetry to consider how these two frames move with respect to each other.) Applying (3) we conclude that . Inserting this into (12) and comparing coefficients we conclude that . Since is determined from (11), we can compute everything explicitly, eventually ending up at the second order expansion

(13)

One can continue in this fashion (exploiting the fact that the must form a Lie group (with the Lie algebra already determined), and using (8) to fix the parameterisation of that group) to eventually get the full expansion of , namely

,

but we will not need to do so here.

— Doppler shift —

The formula (13) is already enough to recover the relativistic Doppler shift formula (to second order in v) for radiation moving at speed c with some wave number k. Mathematically, such radiation moving to the right in an inertial reference frame F can be modeled by the function

for some amplitude A and phase shift . If we move to the coordinates provided by an inertial reference frame F’, a computation then shows that the function becomes

where . (actually, if the radiation is tensor-valued, the amplitude A might also transform in some manner, but this transformation will not be of relevance to us.) Similarly, radiation moving at speed c to the left will transform from

to

where . This describes how the wave number k transforms under changes of reference frame by small velocities v. The temporal frequency is linearly related to the wave number k by the formula

, (14)

(15+)

for right-ward moving radiation and by the (blue-shift) formula

(15-)

for left-ward moving radiation. (As before, one can give an exact formula here, but the above asymptotic will suffice for us.)

— Energy and momentum of photons —

From the work of Planck, and of Einstein himself on the photoelectric effect, it was known that light could be viewed both as a form of radiation (moving at speed c), and also made up of particles (photons). From Planck’s law, each photon has an energy of and (from de Broglie’s law) a momentum of , where h is Planck’s constant, and the sign depends on whether one is moving rightward or leftward. In particular, from (14) we have the pleasant relationship

(16)

for photons. [More generally, it turns out that for arbitrary bodies, momentum, velocity, and energy are related by the formula , though we will not derive this fact here.] Applying (15+), (15-), we see that if we view a photon in a new reference frame , then the observed energy E and momentum p now become

; (17+)

for right-ward moving photons, and

; (17-)

for left-ward moving photons.

These two formulae (17+), (17-) can be unified using (16) into a single formula

(18)

for any photon (moving either leftward or rightward) with energy E and momentum p as measured in frame F, and energy E’ and momentum p’ as measured in frame . Actually, the error term can be deleted entirely by working a little harder. From the linearity of and the conservation of energy and momentum, it is then natural to conclude that (18) should also be valid not only for photons, but for any object that can exchange energy and momentum with photons. This can be used to derive the formula fairly quickly, but let us instead give the original argument of Einstein, which is only slightly different.

— Einstein’s argument —

We are now ready to give Einstein’s argument. Consider a body at rest in a reference frame F with some mass and some rest energy . (We do not yet know that is equal to .) Now let us view this same mass in some new reference frame , where v is a small velocity. From Newtonian mechanics, we know that a body of mass moving at velocity v acquires a kinetic energy of . Thus, assuming that Newtonian physics is valid at low velocities to top order, the net energy E’ of this body as viewed in this frame should be

(19)

If assumes that the transformation law (18) applies for this body, one can already deduce the formula for this body at rest from (19) (and the assumption that bodies at rest have zero momentum), but let us instead give Einstein’s original argument.

We return to frame F, and assume that our body emits two photons of equal energy , one moving left-ward and one moving right-ward. By (16) and conservation of momentum, we see that the body remains at rest after this emission. By conservation of energy, the remaining energy in the body is . Let’s say that the new mass in the body is . Our task is to show that .

To do this, we return to frame . By (16+), the rightward moving photon has energy

; (20+)

in this frame; similarly, the leftward moving photon has energy

. (20-)

What about the body? By repeating the derivation of (18), it must have energy

(20)

By the principle of relativity, the law of conservation of energy has to hold in the frame as well as in the frame F. Thus, the energy (20-)+(20+)+(20) in frame after the emission must equal the energy E’=(19) in frame before emission. Adding everything together and comparing coefficients we obtain the desired relationship .

[One might quibble that Einstein’s argument only applies to emissions of energy that consist of equal and opposite pairs of photons. But one can easily generalise the argument to handle arbitrary photon emissions, especially if one takes advantage of (18); for instance, another well-known (and somewhat simpler) variant of the argument works by considering a photon emitted from one side of a box and absorbed on the other. More generally, any other energy emission which could potentially in the future decompose entirely into photons would also be handled by this argument, thanks to conservation of energy. Now, it is possible that other conservation laws prevent decomposition into photons; for instance, the law of conservation of charge prevents an electron (say) from decomposing entirely into photons, thus leaving open the possibility of having to add a linearly charge-dependent correction term to the formula . But then one can renormalise away this term by redefining the energy to subtract such a term; note that this does not affect conservation of energy, thanks to conservation of charge.]

## 88 comments

Comments feed for this article

28 December, 2007 at 6:49 pm

RobertaThank you for giving students around the world a chance to learn from this blog.

29 December, 2007 at 2:51 am

janeTerry, thank you for your latest articles. Being an undergraduate, I could understand both of these. (I once complained that your articles were too mathematically tense for undergrads). Thanks!

29 December, 2007 at 6:43 am

anonymousJust find a typo.

” Indeed, if an observer is moving at velocity v in frame F, it is moving in velocity -v in frame \overline{F}, and hence \overline{F_v} = \overline{F}_{-v}. ”

I think it should be

F_v = \overline{F}_{-v}

29 December, 2007 at 7:19 pm

KeaCool post. BTW, you can use \hspace (for instance) to set the equation numbers further to the right.

30 December, 2007 at 3:18 pm

Grad StudentThis post isn’t (necessarily) directed at Dr. Tao or directly related to the topic, so accept my apologies if it is out of line. I am currently about 1/2 way through a graduate program in mathematics, and happily working on problems I enjoy. Looking back on my undergraduate days I wish I had taken more physics, because (1) physical motivation/concepts are showing up more and more in the areas I am working in and (2) despite having avoided physics past the introductory level in college I have always found it interesting. I am curious if anyone has had experience diving back into physics in late math graduate school or later and any advice or thoughts they had on this matter.

30 December, 2007 at 4:30 pm

links for 2007-12-31 « Simply… A User[…] Einstein’s derivation of E=mc^2 « What’s new (tags: einstein physics mathematics relativity toread **) […]

31 December, 2007 at 4:51 am

DougHi Terence,

Thank you for your demonstration of the derivation of E=mc^2.

As I am more medically [bio-physics] oriented than physics oriented perhaps you may be able to mathematically explain two questions:

1 – Why do physicists equate voltage energy [eV] to mass when engineers equate inductance [henry or weber per ampere] to mass?

This question arises from reading Raghbir Singh Khandpur, “Biomedical Instrumentation: Technology and Applications’, specifically from a table of electrical / mechanical equivalents, “The inductance symbol represents the effective vibrating mass of the element.” and figure 3.4 on pages 76-77 in chapter 3 Physiological transducers, 3.4 Displacement, Position and Motion transducers.

Voltage is listed as corresponding to Force.

2 – Could you mathematically explain the Lorentz length contraction which from my perspective appears to be a paradox to the theory of inflation and the reaction of ordinary matter to non-relativistic energy changes?

For example one kilogram of water ice remains one kilogram when phase changes to either liquid or vapor, but the volume increases as the energy increases.

Naively, I would expect a length expansion.

31 December, 2007 at 9:33 am

GHi There, I am afraid the good old Albert would prefer the physics over the math, after all, that is why he is regarded as the father of the subject, even though we talk about Poincare relativity and Lorentz transformations. There is yet another derivation of E/c^2 = m by him, although he used the *murky* notion of center of mass…

31 December, 2007 at 9:37 am

John ArmstrongDoug, they don’t equate inductance with mass in general. This comes from the fact that a series RLC circuit obeys almost the same differential equation as a pendulum, as shown in the table here.

2 January, 2008 at 9:59 am

DougHi John Armstrong,

Thank you for the wiki reference listing the table in the section ‘equivalent systems’ on the Harmonic oscillator page with the explanation that “… analogous quantities in four harmonic oscillator systems in mechanics and electronics. If analogous parameters on the same line in the table are given numerically equal values, the behavior of the oscillators will be the same.”

I have seen a similar table from Erik Cheever [PhD bioengineering Swarthmore] Analogous Electrical and Mechanical Systems.

Cheever has only one electrical column and his two mechanical columns have different labels. The primary difference is that Cheever lists the capacitor energy as equivalent to mass energy and spring energy while inductor energy is equivalent to spring energy and mass energy from Force-current and Force-voltage columns, respectively.

1 – Where I am confused, is why physicists measure mechanical mass in eV which seems to be an electrical energy measure?

2 – I was surprised by the next section of your wiki reference ‘Applications’ stating “The problem of the simple harmonic oscillator occurs frequently in physics because of the form of its potential energy function: V(x)=(1/2)kx^2.”

This form is usually associated with kinetic energy.

2 January, 2008 at 10:52 am

John ArmstrongDoug: as the post discusses, mass and energy are equivalent, and can be changed into each other. Imagine a society where we measure furniture in inches, but measure room dimensions in fathoms. When someone realizes that “inches” and “fathoms” are both measuring the same quantity — length — we can use a conversion factor and just use one set of units, making interior decoration much easier.

Similarly, eV and grams are just two different ways to measure mass-energy. We want to just pick one and go with it, using a conversion factor to change the other measurements. Physicists have settled on eV as the unit because it’s appropriate to the scale of the experiments (you wouldn’t measure furniture in microns or miles) and because it’s appropriate to the sorts of experiments we’re doing. We actually

arepushing charged particles through voltage differences, which is how an eV is defined. But the whole thing could be worked out in grams or kilocalories if you wanted to.As for your other question, be careful. The form associated with kinetic energy is . That is, it’s the same form, but with the mass as the constant and the

derivativeof position as the variable. It’s sort of fascinating that the potential energy that gives Hooke’s law looks so much like the usual formula for kinetic energy, though.4 January, 2008 at 6:39 am

Sp3w[…] Einstein’s derivation of E=mc^2 […]

4 January, 2008 at 2:21 pm

Laurens GunnarsenOne rather mysterious aspect of this argument of Einstein’s is that it invokes the quantum theory of light to establish a relationship between variables that are already completely defined within a purely classical theory of mechanics (i.e., by special relativity.)

It is natural to ask whether this is really necessary — and in fact it is not. Indeed, it turns out that the purely classical explanation of the relation E = mc^2, though essentially different from Einstein’s original one, is also profoundly illuminating.

We must begin by spelling out the essential assumptions underpinning the classical theory, on which the explanation will rest. And, thanks to Minkowski, we can do this very elegantly by saying that special relativity ascribes to the set of all events the structure of a 4-dimensional affine space whose underlying vector space has an inner product of Lorentz signature.

The (connected component of the identity of the Lie) group preserving this special relativistic spacetime structure is called the Poincare group, and the building blocks of special-relativistic mechanics are just those physical systems whose phase spaces admit a transitive action of this group by symplectomorphisms.

A basic theorem (due essentially to Wigner) allows us to classify these physical systems. It turns out that their phase spaces are exactly the coadjoint orbits of the Poincare group, which may in turn be characterized by exactly two independent parameters: mass and spin.

So, within special-relativistic mechanics, this is how the “m” that appears in the formula E = mc^2 is defined; it’s a parameter that (partially) characterizes an elementary system — what might reasonably be called a “special-relativistic point particle.” But what about the “E” variable? How is that defined, anyway?

Well, as always in classical mechanics, energy is that function of the state of a system which determines, via Hamilton’s equations, the evolution of that state — its change with time. So this “E” that appears in the formula E = mc^2 cannot be defined until we have chosen a “time axis,” along which the system can evolve. And since, in special-relativistic mechanics, there are (infinitely) many such “time axes,” all on an a priori equal footing, we need to explain exactly what choice we’re making, and why.

The required explanation is easiest to give and to understand when the spin parameter is zero (although the general explanation isn’t all that much more complicated, really.) For then it happens that the isotropy subgroup of Poincare group determined by the state leaves a unique “time axis” invariant. The choice of “time axis” is, in effect, made for us by the state itself, in a geometrically natural way.

And it is precisely this particular “time axis” — the one aligned with the (instantaneous) motion of the particle as encoded in its state variables — that defines the “E” in the formula E = mc^2. Time translation (via the Poincare group action) of the state along this particular time axis corresponds precisely to Hamiltonian evolution of the state, with this particular energy function as the Hamiltonian.

Once this framework is in place, and all these definitions are clearly understood, the proof that E = mc^2 consists of a single line. And this is how things usually go in a fully developed, fully understood theory: all the work goes in to the formulation of the correct conceptual context, which then trivializes the key theorems. As Spivak points out in his CALCULUS ON MANIFOLDS, the proof of Stokes’s Theorem, in its most modern form, is also a single line — once you’ve introduced and understood the whole panoply of definitions needed to state it.

Is it worth the effort, though, to construct such a framework? I think we can now say rather confidently, after more than a century of working with special relativity theory, that the answer is “yes.” The rewards, both technical and conceptual, are vast, and the difficulties that remain for the learner to overcome are precisely those that really deserve deep, serious thought.

[By the way, a rather more detailed and explicit formulation of all these ideas is given in Shlomo Sternberg’s superb GROUP THEORY AND PHYSICS, and I would strongly recommend Sternberg’s textbooks, monographs, and review articles (as well as those of George Mackey) to any mathematician who wants better to understand the fundamental ideas of physics.]

4 January, 2008 at 2:44 pm

DougHi John Armstrong,

1 – I think the source of my confusion about the units of mass lies within inconsistencies in the literature such as these wiki references:

a – Dr Tao’s reference Mass in special relativity in the paragraph before the section ‘Mass and the momentum 4-vector’ states “Energy is typically in units of electron volts (eV), momentum in units of eV/c, and mass in units of eV/c^2.”

b – section ‘Types of neutrinos’ has a table ‘Neutrinos in the Standard Model of elementary particles’ listing mass variants as expected values in eV.

This should be mass in units of eV/c^2 not eV.

2 – Your comment on treating mass as a constant in kinetic energy may also have meaning in treating mass as a constant and c^2 as a maximum extrema in E=mc^2 when mass fissions into energy.

I will discuss this implication in my next comment for Dr Tao to which I hope he responds.

4 January, 2008 at 2:50 pm

DougHi Terence,

Note: 1b for 4 January, 2008 at 2:44 pm comment to John Armstrong should read Neutrino section ‘Types …

In your demonstration of the derivation of E=mc^2, you have the equation:

E’ = E + (1/2)mv^2 + O(v^3).

Consider:

a – Let O(v^3) = epsilon; as limit epsilon -> 0; then O(v^3) -> 0. This may represent the difference between the ideal yield and actual yield of mass fission into energy?

b – There should be a bonding energy of the mass that must be overcome in the situation of mass fission into energy that may be necessary to subtract from the potential energy to determine the resultant kinetic energy. Thus we may have the situation for energy:

E_kinetic = E_potential + E_bonding where E_bonding = minus E_kinetic?

If the E = mc^2 is this potential energy, then we may have:

(1/2)mv^2 = mc^2 – (1/2)mv^2 for

E_kinetic = E_potential – E_bonding or

E’ = E + (1/2)mv^2 when O(v^3) = 0?

19 January, 2008 at 9:20 am

A theoretical blog « My Name Is Legion[…] I’ll leave with one quick link. Terry Tao wrote up a very interesting blog post showing Einstein’s derivation of the famous equation E=mc^2. Check it […]

12 March, 2008 at 10:43 pm

santoshbeing a 9th class student, I could not understand ur description.But,I thank to ur explanation.

8 April, 2008 at 6:10 pm

AMRYT DHAKALThanx for giving chance for students to read this but no one can understand it because of complexion derivation

18 April, 2008 at 9:44 am

DougHi Terence,

I may have stumbled onto a means of graphically illustrating the derivation of E=mc^2 from an energy economic perspective, using the Gini coefficient, a measure of statistical dispersion from economics?

If the upper left of the graphical representation in the wiki article is completed as a square, then these representations may exist:

– the entire square as total potential energy,

– the diagonal as the maximum relative velocity which would occur in orthogonal movement,

– the upper left triangle as kinetic energy,

– the segment labeled gini index as O(v^3) and

– the remaining portion of the lower right triangle as E’ or perhaps bonding energy?

24 October, 2008 at 8:56 am

my coffee bar » simple-sederhana, dan ya seperti itu…[…] (menghasilkan) persamaan itu, bukannya sesuatu yang mudah. Kalau kalian tidak percaya pergilah ke sini. :), dan rasakan betapa tidak nyamannya upaya memahami sesuatu yang (sesungguhnya) tidaklah […]

16 November, 2008 at 12:59 am

jianThank you for taking time to have written this post. I found it more clear than those in other sources. I helps me to understand Einstein’s original paper too.

28 December, 2008 at 11:39 am

IanDClarkMScPhD(1969 1973)Extraordinarily simple even for those of us whose major fields are pure mathematics, statistics, symbolic logic and economics.

10 January, 2009 at 1:56 pm

AnonymousA minor typo: “it is moving in velocity -v in frame $latex[\overline{F}]$, and hence $latex[F_v=\overline{F_{-v}}]$” (one of the overlines shouldn’t be there).

[Corrected, thanks – T.]19 March, 2009 at 11:59 am

anonSo is E = mc^2 not exactly true, but only up to order O(v^3)?

20 March, 2009 at 3:20 am

Terence TaoYes. In fact, for moving objects the exact formula for the energy is , thus one can view the term as a relativistic correction to the Newtonian formula for the kinetic energy.

20 March, 2009 at 10:11 am

Anonymousshoudn’t it be v^2/c^2 in the denominator?

[Corrected, thanks – T.]15 April, 2009 at 3:11 am

Anirudh Kumar SatsangiHow did Einstein derive conversion factor c^2 in his very famous formula E=mc^2. Whether he did some empirical work for this or he put this constant or conversion factor intuitively? I also feel that this formula does not hold good at black holes where the speed of gravitational wave is much higher than the speed of light.

9 May, 2009 at 2:26 am

A.K.SatsangiBased on E=mc^2, can it be said that mass is the ‘potential state’ of matter and energy is the ‘kinetic state’ of matter and just multiply mass with c^2 you will get huge amount of energy and divide energy by c^2 you get very small amount of mass OR some other factors/ mechanisms are essential for these conversions ?

Anirudh

9 May, 2009 at 5:16 pm

Terence TaoDear Anirudh,

You can read about Einsteins’ motivation in his original paper, which is linked to at the top of the post.

All matter (and radiation, e.g. a “box of photons”) has both mass and energy simultaneously, and does not alternate between a “mass-state” or an “energy-state”, just as a yardstick is simultaneously three feet long and one yard long, rather than somehow having a “feet-state” where it has three feet but no yards, and a “yard-state” where it has one yard but no feet. The formula describes an

equivalenceof mass and energy, rather than aconvertibilitybetween mass and energy.Note though that the formula is only valid for stationary objects. For moving objects, which have non-zero momentum , the correct formula is , thus a moving object would have more energy than just . One could call the excess energy here the “kinetic” energy of the moving object, and distinguish it from the “rest” energy , but this is a somewhat artificial distinction; the total energy is a more physically relevant quantity than either of these two somewhat arbitrary components.

Einstein’s formula is still valid in the presence of strong gravitational fields, but it should be noted that the concepts of energy and momentum then depend on location as well as on the frame of the observer; in particular, the law of conservation of energy or momentum is now a local conservation law only, rather than a global one. So one cannot directly equate energy or momentum at, say, a location near a black hole, to a nominally equivalent amount of energy or momentum on Earth.

10 May, 2009 at 10:14 pm

Anirudh Kumar SatsangiHow equivalence can be established without one getting converted into other ? How equivalence can be verified? If E=mc^2 is valid only for stationary objects then this formula has no application and can predict nothing since nothing is stationary in the universe.

18 May, 2009 at 9:07 am

conformalHi Terry,

The correct formula with momentum is puzzling. E.g., in http://en.wikipedia.org/wiki/Klein-Gordon it leads to a nonlocal (cumbersome) expression. However, it seems that after an expansion we get just a Schrodinger equation with convolution modulo identity instead of the laplacian. Would physicists agree?

19 May, 2009 at 4:22 pm

Terence TaoDear conformal,

It is true that if one splits up solutions to the Klein-Gordon equation into positive time-oscillating () and negative time-oscillating components (), then the (local) second-order PDE becomes factorised into two decoupled first-order non-local PDE. But this non-locality is an artefact of the decomposition into positive and negative time-oscillating components, which is itself a non-local decopmosition; any physically measurable quantity of the equation (e.g. the stress-energy tensor) still evolves in a purely local fashion.

20 May, 2009 at 5:09 am

conformalHi Terry,

If we can split up like this, then after applying absolute value we get a pseudofirst order equation.

20 May, 2009 at 8:53 am

Terence TaoWell, absolute values don’t quite work, but one can certainly convert the scalar local second-order Klein-Gordon equation to a complex non-local first-order equation by introducing the complex field

which solves the Schrodinger-like equation

.

One can view this reduction as a factorisation of the second-order Klein-Gordon operator as the product of two first-order pseudodifferential operators.

This type of reduction is occasionally useful in the analysis of such PDE, since Fourier-analytic techniques don’t distinguish between differential and pseudo-differential operators very much. But this perspective does certainly obscure some of the physical space features of the equation, for instance locality and finite speed of propagation are not obvious at all from the first-order formulation.

21 May, 2009 at 5:06 am

conformalHi Terry,

Is it because absolute value is a noncausal pseudodifferential operator that is doesn’t work? I mean http://en.wikipedia.org/wiki/Anticausal , e.g., Anticausal behavior in the imaging of a moving object: The implications for high-speed photography, Phys. Rev. A 19, 2377 (1979).

21 May, 2009 at 9:20 am

Terence TaoAh, I begin to understand what you meant by “absolute value” – I presume you are talking about transforming the temporal frequency variable (which I would call , but which I suppose physicists would call or ) to its absolute value . Mathematically, this is essentially the same (modulo some minor operations, such as conjugation) as taking a Hilbert transform in time and is, as you say, acausal, and also non-local. Applying it to a solution to the Klein-Gordon equation would indeed recover a solution to a first-order equation quite similar to the one in my previous comment, but it is not so surprising that a non-local transformation applied to a solution to a local PDE gives a solution to a non-local PDE.

22 May, 2009 at 5:20 am

conformalHi Terry,

Slightly offtopic: can we approximate the absolute value symbol with adding a dark dimension? I mean )(g=ng, where ng is the Neumann derivative of the solution of the system Lu=0, u=g in the upper half-plane, with L the laplacian.

12 January, 2013 at 11:44 am

Anonymoushey conformal,, are u listening to the master???

27 May, 2009 at 1:52 pm

gravityandlevityI like this thought experiment a lot. Thanks for writing it up.

Have you seen the “Einstein’s box” derivation of E = mc^2? It uses a center of mass argument. I gave an explanation of it here: http://gravityandlevity.wordpress.com/2009/05/16/the-equivalence-of-mass-and-energy-the-center-of-energy/

3 October, 2009 at 4:37 am

s.chetan kumarwhat do you understand by e=mc2 and how it is relatedt to theory of relativity?

3 October, 2009 at 2:51 pm

Keith DowHere is a simple approach which not 100% correct, but gives good insight.

Assume a black box in outer space with a long axis of length L. At one end of the

box is a flashlight attached to the box. The flashlight turns on for an instant and sends a burst of light to the other end of the box, where it is absorbed. The light

has energy E.

The momentum of the light is:

1. P = E/c

The light took time to travel to the other side:

2. T = L/c also c= L/T

When the light was emitted, because of conservation of momentum the box

gained momentum:

3. P = Mv

Since the box has no external forces on it, its center of mass can’t move. So the

movement of the box has to be equivalent to a “mass” of the light moving a

distance L. Now the box has mass M and moved a distance of v*T. So the center

of mass not moving means:

4. MvT = mL

From equations 1 and 3 we get:

5. Mv = E/c

Plug this into equation 4 and we get:

6. ET/c = mL

Or:

7. E = mcL/T

Plug in formula 2 we get:

8. E = mc*c

The history of E=Mc*c is complicated. Einstein wasn’t the first to publish a form of it. If you want more information read:

http://arxiv.org/pdf/0805.1400

13 February, 2010 at 10:34 am

sunkhiroesa=dv/dt , dx=vdt , , dF=d(ma)dt , E=2 intergral[Fdx]from 0 to c where X1=v1=0 , x2=v2=c when one does integral the first part is E the second part one must define variable m which a type of energy !

13 February, 2010 at 10:56 am

sunkhiroesNote from your page. Actually, as it turns out, Einstein’s analysis for bodies at rest only needs to understand changes of reference frame at infinitesimally low velocity, . But the mathematical go about is two part integral where m and a both can be constant and varible.

9 April, 2010 at 5:21 am

James SmithDear Prof Tao,

this might appear to be a really cheeky request but I’ll try my luck anyway. You have mentioned somewhere in the past that you use a LaTeX to WordPress converter. If this is the case here, could I possibly have the LaTeX source for this post? It would make an excellent basis for a project for my students. I can only offer to add exercises, footnotes and so on to the original source by way of thanks (if this doesn’t seem too presumptous – I thought some of your readers here might appreciate a document they could download. And I can’t think of another way of repaying you, to be honest).

Apologies if this is asking too much.

13 April, 2010 at 1:55 pm

Conrad J CountessHi all, this is very interesting and right up my alley

I have a Geometrical Interpretation of (E=mc^2), that not only shoes that energy and matter are equal, and related through conversion factor c^2, but actually shows how energy equals and turns to matter at a frequency wavelength of c^2. In short it shows that, analogous to “a line of 1 inch in horizontal direction, x a line of 1 in in vertical direction, to create a square inch”, c^2 is “c in the linear direction, x c in 90 degree angular direction, creating a balance of centripetal and centrifugal forces which create circular motion. Energy equals and turns to matter at c^2, because it acquires circular and or spherical motion, at the frequency wavelength of c^2, which is where “E=hf=mc^2”

here is a more complete description

http://wbabin.net/science/countess.pdf

What do you all think

Conrad J Countess

1 August, 2010 at 8:16 am

rashmiits wonderful to have a website like tis to learn about the work of einstein and others physicians

1 August, 2010 at 4:26 pm

agmProfessor Tao, I’m working through this, and I have a question about your approach to equation 7, delta = 0. I’ve established it by applying the requirement that the interval is constant: (x^2-c^2 t^2) = (x’2-c^2 t’^2) in the notation of Einstein’s 1920 version. You get terms that of the form delta*v*x that require delta = 0, which then forces the terms of the form delta*gamma*v^2*x*t to zero without constraining gamma. It works, but it could be more elegant and doesn’t seem to follow the spirit of your exposition.

So how did you arrive at delta = 0?

4 August, 2010 at 12:38 am

Terence TaoThis comes from the symmetry , or equivalently that , where is the reflection map .

19 August, 2010 at 1:59 am

anjansince every body have initial velocity 0

e=mc^2

here

e=total energy of substance

-E=K.E.+P.E.

orE=1/2mv^2+mgh

orE=m[1/2v^2+gh]but[2gh=v^2-u^2] or 2gh=v^2

orE=m[1/2{2gh}+gh]

orE=m[gh+gh}

orE=m2gh

from above

E=mv^2

where v^2=c^2

cant it be derived from mechanical phyics by above method.

27 September, 2013 at 7:51 am

indiaYou have assumed that the object of mass m attains a speed of light ‘c’ when dropped from earth and that too on Earth!!! If you succeed in doing so, you will win a Nobel price!!! Also Einsteins law violates law of energy conservation.

And you are deriving d same law using conservation principle!!!

7 October, 2010 at 12:26 am

Wang XiuliEinstein’s derivation of the formula is concise,simple and subtle.Yes Subtle is the lord.And many work’s derivations from Lorentz Tranformation and other physical law thereafter are very boring and lack physical intuitions.Physical law can be deducedt or induced by intuition into physics which could almont not be understanded approach of formal proof or mathematical proof.

7 October, 2010 at 12:29 am

Wang XiuliI think maybe the insight or intuition which are more concise ,obvious are more inportant in finding law or presenting law

20 November, 2010 at 3:07 am

eklavyathank’s for the equation.

2 April, 2011 at 1:56 am

vishnu vijayDerivation of Einstein’s Energy –Mass equation

Actually Einstein gave energy mass equation but without any derivation ……nobody I think have exact derivation of this equation.

Here I tried to give the derivation ……. i don’t know it is correct or not ….. but scientist can apply their mind on this concept also.

As we all know that ,

Energy =work done

And work done can be represented as, W=f*s {f=force,s=distance covered } …………………………..(1)

Now,

F=m*a {m=mass,a=acceleration}}

Now,

F=m*(v\t) {as a=velocity \time} ……………………………………………(2)

From (1)and(2)

W=m*(v\t)*s

W= m*v*v {here the both the velocities are same because the object is moving with some velocity and the time taken to cover that particular distance give us back the same velocity }

W=m*v2,

But we know that Einstein gave E=m*c2, {c=speed of light}

So here we will consider v=c as in the universe only light has the maximum speed and also if we want to gain maximum energy we have to keep the speed of light. So here,we will get …………

E=m*c2

By ,

Vishnu vijay,

24 October, 2011 at 2:24 am

Anonymousu r absolutely wrong as per einstien formula.if u want full inquiry about it call at +91226265856

3 May, 2011 at 9:58 pm

GuiriAs light has now been slowed down to a speed of zero, and the velocity of light is also slower when passing through glass or water for example, does this mean that e sometimes equals mc2 only under certain conditions?

4 May, 2011 at 8:12 am

Terence Taoc is only equal to the speed of light in vacuum. In other media it is certainly possible for the speed of light to be significantly less than c.

Actually, even if we lived in a universe in which there was no electromagnetism (and hence, no light), it would still be possible to define c; the “speed of light in vacuum” is a convenient definition, but hardly a necessary one. For instance, c is also the speed of gravitational propagation, and also shows up (albeit in a more complicated way) in the laws of physics for the other two fundamental forces, namely the strong and weak nuclear forces; alternatively, it is the unique constant which makes the laws of physics (locally) invariant with respect to Lorentz transformations.

1 September, 2011 at 7:36 am

ateixeiraAlso note this very important and beautiful article by Feigenbaum: http://arxiv.org/abs/0806.1234.

In it he shows that it is possible to derive with full generality Einstein-Poincaré-Lorentz relativity with just one postulate and making no mention of the speed of light.

3 June, 2011 at 8:09 pm

JonDear Terrence Tao,

Could you, pls, tell me what do I need to know to understand Einstein general relativity proof ?

I know that is necessary to understand Differential Geometry, right ?

What more ?

I want to set this as one of my lifetime Goals. :o)

Thanks

17 March, 2013 at 11:11 am

AnonymousStudy Tensors. Start with the metric tensor .

31 August, 2011 at 9:37 pm

Anonymouse=mc 2 is wrong …there must be a factor v that is subtracted from the velocity of light inorder to get the correct equation and that subtracted quantity is the limiting value for the reference frame …if we got that reference frame we can understand the success of reaction ….

8 December, 2014 at 6:49 pm

hoyhivE= mc q<- einstein forgot theory of relativity

24 October, 2011 at 5:17 am

amateurHow can someone explain changes in 4-d space-time (t,x(t),y(t),z(t)) mathematically since 1 is constant:

(1,dx/dt,dy/dt,dz/dt)

8 June, 2012 at 3:45 pm

ScottparkSo useful post.Thank you!!

25 July, 2012 at 1:39 am

Kuhanhttp://www.ams.org/bookstore/pspdf/mbk-59-prev.pdf

has similar article

2 October, 2012 at 7:51 pm

Einstein’s derivation of E=mc^2 revisited « What’s new[…] back in 2007, I wrote a blog post giving Einstein’s derivation of his famous equation . This derivation used a number of […]

3 October, 2012 at 6:37 pm

Wow Terrence Tao « The logic of images[…] back in 2007, I wrote a blog post giving Einstein’s derivation of his famous equation for the rest energy of a body with mass . […]

29 November, 2012 at 10:18 pm

MiladReblogged this on Milad Ebrahimpour 's Blog and commented:

You see how the universe is ruled by math?

22 December, 2012 at 2:40 pm

An introduction to special relativity for a high school math circle « What’s new[…] equivalence relation E=mc^2, largely following Einstein’s original derivation as discussed in this previous blog post); instead we will probably spend a fair chunk of time on related topics which do not actually […]

31 December, 2012 at 9:00 pm

Einstein’s derivation of E=mc^2 | What about being a physicist[…] Einstein’s derivation of E=mc^2. […]

8 January, 2013 at 3:13 am

bert latifgood justification about e=mcc,,,,,,

23 January, 2013 at 2:10 am

Everything is Relative. (Trippy) Consequences of Special Relativity. Prove E = mc2 | A Revolving Wheel[…] Terrytao: Deriving E=mc2 Ask a Mathematician: Q: Why does E=MC2? Wiki: Theory of Relativity Wiki: Special Relativity Wiki: Introduction to Special Relativity Wiki: Relativity of Simultaneity […]

17 March, 2013 at 11:07 am

jussilindgrenI’ve recently developed a possible model for gravitation that reduces to the Newtonian limit with low velocities/masses and yields also the mass-energy equivalence. The model is based on the idea to consider tensor products in flat Minkowski space. When one generalises then the concept of work-energy principle, one obtains the following nonlinear PDE system:

The respective energy equation is the following general wave equation

This is work in progress, I have a blog on the issue at jussilindgren.wordpress.com

2 April, 2013 at 3:46 am

OsumaWhat is the meaning of c in einstein equation

2 April, 2013 at 10:40 am

ReeceThe speed of light

23 April, 2013 at 5:54 am

johngokulI can’t believe it what a scientist was einstein. this website give chance to know about the universal equation.it will useful the student.

5 July, 2013 at 3:38 am

Aayush DhitalSir, I searched for the answer of the quesion “why mass causes curvature in space-time?” and I think I’ve found it. However, I’am not 100% sure on it. In my view every atoms contain a unit charge so any body with mass can be considered as a massive charge itself since it is the resultant of all the charges. In my view, this charge actually pushes the dark energy in the space and so does the dark energy. Hence, space-time is curved along the massive body….

16 August, 2013 at 6:15 pm

Anonymousthis cannot happen bcuz atom is electrically neutral

23 November, 2013 at 2:18 pm

socratusThe strange and magic E= Mc^2

1

In 1905 Einstein asked:

“ Does the inertia of a body depend upon its energy content?”

As he realized the answer was:

“ Yes, inertia depends on its energy content E= Mc^2 ”

Newton’s inertia doesn’t have force/energy,

Einstein’s inertia has energy. How to understand this difference?

2.

In 1928 Dirac said that E= Mc^2 can be as positive

as negative too. What is interaction between them ?

3 –

Sometimes E= Mc^2 can be ‘rest’ particle and sometimes

can be ‘active’ particle and can destroy cities like

Hiroshima and Nagasaki

Why E= Mc^2 is so strange ? Nobody gives answer

===.

All the best.

Israel Sadovnik Socratus.

==============..

25 January, 2014 at 7:51 pm

Ask a physicist anything. (8) - Page 61 - Christian Forums[…] Equation The original formula dealt with the rest mass of a body, forgotten in modern textbooks. https://terrytao.wordpress.com/2007/1…ation-of-emc2/ __________________ "If one closes their eyes they can imagine a universe of infinite […]

16 February, 2014 at 11:32 pm

James AllenWhere’s Leibniz? Tsk-tsk.

9 June, 2014 at 11:49 am

AnonymousStillni havevthe easiest way to prove it.

28 October, 2014 at 11:39 am

dominic nentawe.Please can someone deduce the einstein famous energy formula from the de broglie equation

17 December, 2014 at 12:27 am

yusrahow can we find the dimension of E=MC2

17 December, 2014 at 12:31 am

yusratell me quick.

10 January, 2015 at 8:13 am

On the Beauty Of Physical Theories | ENIGMA[…] https://terrytao.wordpress.com/2007/12/28/einsteins-derivation-of-emc2/ […]

15 January, 2015 at 7:41 pm

mjg0I urge everyone to read the paper by physicist Eugene Hecht entitled

“How Einstein confirmed E0=mc^2”, which you download from

http://www.loreto.unican.es/Carpeta2012/AJP(Hecht)Einstein2011.pdf.

Here is Hecht’s abstract:

The equivalence of mass m and rest-energy E0 is one of the great discoveries of all time. Despite the

current wisdom, Einstein did not derive this relation from first principles. Having conceived the idea

in the summer of 1905 he spent more than 40 years trying to prove it. We briefly examine all of

Einstein’s conceptual demonstrations of E0=mc2, focusing on their limitations and his awareness of

their shortcomings. Although he repeatedly confirmed the efficacy of E0=mc2, he never constructed

a general proof. Leaving aside that it continues to be affirmed experimentally, a rigorous proof of the

mass-energy equivalence is probably beyond the purview of the special theory [of relativity].

That last sentence is an important conjecture about the logic of SR.

A crucial point – which Terence emphasized he was avoiding – is whether mass changes with motion, i.e., whether mass is relativistic.

15 January, 2015 at 8:19 pm

mjg0Hecht’s article is critical of Einstein’s Sept. 1905 argument, which Terence presented above. Some of Hecht’s criticisms:

This derivation came very early in the development of

relativity, and the formal concept of “rest-energy” had not

yet evolved, nor had E0 been introduced to symbolize it.

Unless Einstein was willing to assume that whenever E0=0,

m=0—that all of the mass of a material entity was equivalent

to energy—he could not take the analysis any farther

than he had in September 1905. Unfortunately, he did not

overtly share his concerns about this issue with his readers.

it would not be until 1912 that he was writing statements

like: “According to this conception, we would have to view a

body with inertial mass m as an energy store of magnitude

mc^2 rest-energy of the body.” To attribute E0=mc^2 or

even worse, E=mc^2 to Einstein via the September

1905 paper is to do injustice to the gradual nature of his

development of the concepts.

Einstein made

two intuitive and seemingly innocent leaps that went beyond

what he could prove. “Since obviously here it is inessential

that the energy withdrawn from the body happens to turn into

energy of radiation rather than some other kind of energy, we

are led to the more general conclusion: The mass of a body is

a measure of its energy content [Energieinhalt]….”

The concept

of rest-energy was still unformulated, and he was not

even using the term. Without proof, Einstein guessed that the

loss of other forms of energy besides electromagnetic would

result in an equivalent loss of mass. That is no small point,

especially because many physicists at the time believed mass

was entirely electromagnetic.

An unspoken assumption was made early in the September

1905 paper that each emitted light pulse was not material,

in that it carried energy but had no intrinsic mass.

Throughout his work on the special theory, Einstein maintained

that light was other than matter. As he put it in 1911,

“the comparison of light with other ‘stuff’ is not

permissible.” Thus, Einstein avoided a more complicated

calculation by conjecturing, without discussion or proof,

that light was massless. But of course he could not prove

such a thing, one way or the other.

Finally, it should be noted that plane waves are a mathematical

contrivance. No real extended body can actually

radiate electromagnetic plane waves as required by this

thought experiment. Naturally enough Einstein said nothing

about the physical structure of the hypothetical emitting

body that could perform such a feat. This omission likely

troubled him because, as we will see, he returned to the issue

seven years later with a more elaborate emitter.

What Einstein proved in September 1905 was that if an

imaginary material body could somehow emit radiant energy

E0 which is itself massless in the form of plane waves,

that extraordinary object would diminish in mass by E0 /c^2.

He did not prove that every form of energy was equivalent to

mass; that, he simply proclaimed. Yet, he was quite aware

that his analysis had limitations, and in 1907 he discussed the

need for a more general approach.