Einstein’s derivation of E=mc^2

28 December, 2007 in expository, math.MP | Tags: Albert Einstein, E=mc^2, mass-energy equivalence, special relativity | by Terence Tao

Einstein’s equation $E=mc^2$ describing the equivalence of mass and energy is arguably the most famous equation in physics. But his beautifully elegant derivation of this formula (here is the English translation) from previously understood laws of physics is considerably less famous. (There is an amusing Far Side cartoon in this regard, with the punchline “squared away”, which you can find on-line by searching hard enough, though I will not link to it directly.)

This topic had come up in recent discussion on this blog, so I thought I would present Einstein’s derivation here. Actually, to be precise, in the paper mentioned above, Einstein uses the postulates of special relativity and other known laws of physics to show the following:

Proposition. (Mass-energy equivalence) If a body at rest emits a total energy of E while remaining at rest, then the mass of that body decreases by $E/c^2$ .

Assuming that bodies at rest with zero mass necessarily have zero energy, this implies the famous formula $E = mc^2$ – but only for bodies which are at rest. For moving bodies, there is a similar formula, but one has to first decide what the correct definition of mass is for moving bodies; I will not discuss this issue here, but see for instance the Wikipedia entry on this topic.

Broadly speaking, the derivation of the above proposition proceeds via the following five steps:

Using the postulates of special relativity, determine how space and time coordinates transform under changes of reference frame (i.e. derive the Lorentz transformations).
Using 1., determine how the temporal frequency $\nu$ (and wave number k) of photons transform under changes of reference frame (i.e. derive the formulae for relativistic Doppler shift).
Using Planck’s relation $E = h\nu$ (and de Broglie’s law $p = \hbar k$ ) and 2., determine how the energy E (and momentum p) of photons transform under changes of reference frame.
Using the law of conservation of energy (and momentum) and 3., determine how the energy (and momentum) of bodies transform under changes of reference frame.
Comparing the results of 4. with the classical Newtonian approximations $KE \approx \frac{1}{2} m|v|^2$ (and $p \approx mv$ ), deduce the relativistic relationship between mass and energy for bodies at rest (and more generally between mass, velocity, energy, and momentum for moving bodies).

Actually, as it turns out, Einstein’s analysis for bodies at rest only needs to understand changes of reference frame at infinitesimally low velocity, $|v| \ll c$ . However, in order to see enough relativistic effects to deduce the mass-energy equivalence, one needs to obtain formulae which are accurate to second order in v (or more precisely, $v/c$ ), as opposed to those in Newtonian physics which are accurate to first order in v. Also, to understand the relationship between mass, velocity, energy, and momentum for moving bodies rather than bodies at rest, one needs to consider non-infinitesimal changes of reference frame.

Important note: Einstein’s argument is, of course, a physical argument rather than a mathematical one. While I will use the language and formalism of pure mathematics here, it should be emphasised that I am not exactly giving a formal proof of the above Proposition in the sense of modern mathematics; these arguments are instead more like the classical proofs of Euclid, in that numerous “self evident” assumptions about space, time, velocity, etc. will be made along the way. (Indeed, there is a very strong analogy between Euclidean geometry and the Minkowskian geometry of special relativity.) One can of course make these assumptions more explicit, and this has been done in many other places, but I will avoid doing so here in order not to overly obscure Einstein’s original argument.

— Lorentz transforms to first order —

To simplify the notation, we shall assume that the ambient spacetime S has only one spatial dimension rather than three, although the analysis here works perfectly well in three spatial dimensions (as was done in Einstein’s original paper). Thus, in any inertial reference frame F, the spacetime S is parameterised by two real numbers (t,x). Mathematically, we can describe each frame F as a bijection between S and ${\Bbb R} \times {\Bbb R}$ . To normalise these coordinates, let us suppose that all reference frames agree to use a single event O in S as their origin (0,0); thus

$F(O)=(0,0)$ (1)

for all frames F.

Given an inertial reference frame $F: S \to {\Bbb R} \times {\Bbb R}$ , one can generate new inertial reference frames in two different ways. One is by reflection: one takes the same frame, with the same time coordinate, but reverses the space coordinates to obtain a new frame $\overline{F}: S \to {\Bbb R} \times {\Bbb R}$ , thus reversing the orientation of the frame. In equations, we have

if $F(E) = (t,x)$ , then $\overline{F}(E) = (t,-x)$ (2)

for any spacetime event E. Another way is by replacing the observer which is stationary in F with an observer which is moving at a constant velocity v in F, to create a new inertial reference frame $F_v: S \to {\Bbb R} \times {\Bbb R}$ with the same orientation as F. In our analysis, we will only need to understand infinitesimally small velocities v; there will be no need to consider observers traveling at speeds close to the speed of light.

The new frame $F_v: S \to {\Bbb R} \times {\Bbb R}$ and the original frame $F: S \to {\Bbb R} \times {\Bbb R}$ must be related by some transformation law

$F_v = L_v \circ F$ (3)

for some bijection $L_v: {\Bbb R} \times {\Bbb R} \to {\Bbb R} \times {\Bbb R}$ . A priori, this bijection $L_v$ could depend on the original frame F as well as on the velocity v, but the principle of relativity implies that $L_v$ is in fact the same in all reference frames F, and so only depends on v.

It is thus of interest to determine what the bijections $L_v: {\Bbb R} \times {\Bbb R} \to {\Bbb R} \times {\Bbb R}$ are. From our normalisation (1) we have

$L_v (0,0) = (0,0)$ (4)

but this is of course not enough information to fully specify $L_v$ . To proceed further, we recall Newton’s first law, which states that an object with no external forces applied to it moves at constant velocity, and thus traverses a straight line in spacetime as measured in any inertial reference frame. (We are assuming here that the property of “having no external forces applied to it” is not affected by changes of inertial reference frame. For non-inertial reference frames, the situation is more complicated due to the appearance of fictitious forces.) This implies that $L_v$ transforms straight lines to straight lines. (To be pedantic, we have only shown this for straight lines corresponding to velocities that are physically attainable, but let us ignore this minor technicality here.) Combining this with (4), we conclude that $L_v$ is a linear transformation. (It is a cute exercise to verify this claim formally, under reasonable assumptions such as smoothness of $L_v$ . ) Thus we can view $L_v$ now as a $2 \times 2$ matrix.

When v=0, it is clear that $L_v$ should be the identity matrix I. Making the plausible assumption that $L_v$ varies smoothly with v, we thus have the Taylor expansion

$L_v = I + L'_0 v + O(v^2)$ (5)

for some matrix $L'_0$ and for infinitesimally small velocities v. (Mathematically, what we are doing here is analysing the Lie group of transformations $L_v$ via its Lie algebra.) Expanding everything out in coordinates, we obtain

$L_v(t, x) = ( (1 + \alpha v + O(v^2)) t + (\beta v + O(v^2)) x,$

$(\gamma v + O(v^2)) t + (1 + \delta v + O(v^2)) x )$ (6)

for some absolute constants $\alpha, \beta, \gamma, \delta \in {\Bbb R}$ (not depending on t, x, or v).

The next step, of course, is to pin down what these four constants are. We can use the reflection symmetry (2) to eliminate two of these constants. Indeed, if an observer is moving at velocity v in frame F, it is moving in velocity -v in frame $\overline{F}$ , and hence $\overline{F_v} = \overline{F}_{-v}$ . Combining this with (2), (3), (6) one eventually obtains

$\alpha = 0$ and $\delta = 0$ . (7)

Next, if a particle moves at velocity v in frame F, and more specifically moves along the worldline $\{ (t, vt): t \in {\Bbb R} \}$ , then it will be at rest in frame $F_v$ , and (since it passes through the universally agreed upon origin O) must then lie on the worldline $\{ (t', 0): t' \in {\Bbb R} \}$ . From (3), we conclude

$L_v(t,vt) \in \{ (t',0): t' \in {\Bbb R} \}$ for all t. (8)

Inserting this into (6) (and using (7)) we conclude that $\gamma = -1$ . We have thus pinned down $L_v$ to first order almost completely:

$L_v (t, x) = ( t + \beta v x, x - vt ) + O( v^2 (|t|+|x|) ).$ (9)

Thus, rather remarkably, using nothing more than the principle of relativity and Newton’s first law, we have almost entirely determined the reference frame transformation laws, save for the question of determining the real number $\beta$ . [In mathematical terms, what we have done is classify the one-dimensional Lie subalgebras of $\mathfrak{gl}_2({\Bbb R})$ which are invariant under spatial reflection, and coordinatised using (8).] If this number vanished, we would eventually recover classical Galilean relativity. If this number was positive, we would eventually end up with the (rather unphysical) situation of Euclidean relativity, in which spacetime had a geometry isomorphic to that of the Euclidean plane. As it turns out, though, in special relativity this number is negative. This follows from the second postulate of special relativity, which asserts that the speed of light c is the same in all inertial reference frames. In equations (and because $F_v$ has the same orientation as F), this is asserting that

$L_v(t,ct) \in \{ (t',ct'): t' \in {\Bbb R} \}$ for all t (10+)

and

$L_v(t,-ct) \in \{ (t',-ct'): t' \in {\Bbb R} \}$ for all t. (10-)

Inserting either of (10+) or (10-) into (9) we conclude that $\beta = -1/c^2$ , and thus we have obtained a full description of $L_v$ to first order:

$L_v (t, x) = ( t - \frac{v x}{c^2}, x - vt ) + O( v^2 (|t|+|x|) ).$ (11)

— Lorentz transforms to second order —

It turns out that to get the mass-energy equivalence, first-order expansion of the Lorentz transformations $L_v$ is not sufficient; we need to expand to second order. From Taylor expansion we know that

$L_v = I + L'_0 v + \frac{1}{2} L''_0 v^2 + O( v^3 )$ (12)

for some matrix $L''_0$ . To compute this matrix, let us make the plausible assumption that if the frame $F_v$ is moving at velocity v with respect to F, then F is moving at velocity -v with respect to $F_v$ . (One can justify this by considering two frames receding at equal and opposite directions from a single reference frame, and using reflection symmetry to consider how these two frames move with respect to each other.) Applying (3) we conclude that $L_{-v} \circ L_v = I$ . Inserting this into (12) and comparing coefficients we conclude that $L''_0 = (L'_0)^2$ . Since $L'_0$ is determined from (11), we can compute everything explicitly, eventually ending up at the second order expansion

$L_v (t, x) = ( t - \frac{v x}{c^2} + \frac{t v^2}{2c^2}, x - vt + \frac{xv^2}{2c^2}) + O( v^3 (|t|+|x|) ).$ (13)

One can continue in this fashion (exploiting the fact that the $L_v$ must form a Lie group (with the Lie algebra already determined), and using (8) to fix the parameterisation $v \mapsto L_v$ of that group) to eventually get the full expansion of $L_v$ , namely

$L_v (t, x) = \left( \frac{t - vx/c^2}{\sqrt{1-v^2/c^2}}, \frac{x-vt}{\sqrt{1-v^2/c^2}} \right)$ ,

but we will not need to do so here.

— Doppler shift —

The formula (13) is already enough to recover the relativistic Doppler shift formula (to second order in v) for radiation moving at speed c with some wave number k. Mathematically, such radiation moving to the right in an inertial reference frame F can be modeled by the function

$A \cos( k (x-ct) + \theta )$

for some amplitude A and phase shift $\theta$ . If we move to the coordinates $(t',x') = L_v(t,x)$ provided by an inertial reference frame F’, a computation then shows that the function becomes

$A \cos( k_+ (x'-ct') + \theta)$

where $k_+ = (1-v/c+v^2/2c^2 + O(v^3)) k$ . (actually, if the radiation is tensor-valued, the amplitude A might also transform in some manner, but this transformation will not be of relevance to us.) Similarly, radiation moving at speed c to the left will transform from

$A \cos( k (x+ct) + \theta )$

$A \cos( k_- (x+ct) + \theta )$

where ${}k_- = (1+v/c+v^2/2c^2 + O(v^3)) k$ . This describes how the wave number k transforms under changes of reference frame by small velocities v. The temporal frequency $\nu$ is linearly related to the wave number k by the formula

$\nu = \frac{c}{2\pi} k$ , (14)

and so this frequency transforms by the (red-shift) formula

$\nu_+ = (1-v/c+v^2/2c^2 + O(v^3))\nu$ (15+)

for right-ward moving radiation and by the (blue-shift) formula

$\nu_- = (1+v/c+v^2/2c^2 + O(v^3))\nu$ (15-)

for left-ward moving radiation. (As before, one can give an exact formula here, but the above asymptotic will suffice for us.)

— Energy and momentum of photons —

From the work of Planck, and of Einstein himself on the photoelectric effect, it was known that light could be viewed both as a form of radiation (moving at speed c), and also made up of particles (photons). From Planck’s law, each photon has an energy of $E = h \nu$ and (from de Broglie’s law) a momentum of $p = \pm \hbar k = \pm \frac{h}{2\pi} k$ , where h is Planck’s constant, and the sign depends on whether one is moving rightward or leftward. In particular, from (14) we have the pleasant relationship

$E = |p| c$ (16)

for photons. [More generally, it turns out that for arbitrary bodies, momentum, velocity, and energy are related by the formula $p = \frac{1}{c^2} E v$ , though we will not derive this fact here.] Applying (15+), (15-), we see that if we view a photon in a new reference frame $F_v$ , then the observed energy E and momentum p now become

$E_+ = (1-v/c+v^2/2c^2 + O(v^3)) E$ ; $p_+ = (1-v/c+v^2/2c^2 + O(v^3)) p$ (17+)

for right-ward moving photons, and

$E_- = (1+v/c+v^2/2c^2 + O(v^3)) E$ ; $p_- = (1+v/c+v^2/2c^2 + O(v^3)) p$ (17-)

for left-ward moving photons.

These two formulae (17+), (17-) can be unified using (16) into a single formula

$(E'/c^2, p') = L_v (E/c^2,p) + O(v^3)$ (18)

for any photon (moving either leftward or rightward) with energy E and momentum p as measured in frame F, and energy E’ and momentum p’ as measured in frame $F_v$ . Actually, the error term $O(v^3)$ can be deleted entirely by working a little harder. From the linearity of $L_v$ and the conservation of energy and momentum, it is then natural to conclude that (18) should also be valid not only for photons, but for any object that can exchange energy and momentum with photons. This can be used to derive the formula $E=mc^2$ fairly quickly, but let us instead give the original argument of Einstein, which is only slightly different.

— Einstein’s argument —

We are now ready to give Einstein’s argument. Consider a body at rest in a reference frame F with some mass $m$ and some rest energy $E$ . (We do not yet know that $E$ is equal to $m c^2$ .) Now let us view this same mass in some new reference frame $F_v$ , where v is a small velocity. From Newtonian mechanics, we know that a body of mass $m$ moving at velocity v acquires a kinetic energy of $\frac{1}{2} m v^2$ . Thus, assuming that Newtonian physics is valid at low velocities to top order, the net energy E’ of this body as viewed in this frame $F_v$ should be

$E' = E + \frac{1}{2} m v^2 + O(v^3).$ (19)

If assumes that the transformation law (18) applies for this body, one can already deduce the formula $E=mc^2$ for this body at rest from (19) (and the assumption that bodies at rest have zero momentum), but let us instead give Einstein’s original argument.

We return to frame F, and assume that our body emits two photons of equal energy $\Delta E/2$ , one moving left-ward and one moving right-ward. By (16) and conservation of momentum, we see that the body remains at rest after this emission. By conservation of energy, the remaining energy in the body is $E - \Delta E$ . Let’s say that the new mass in the body is $m - \Delta m$ . Our task is to show that $\Delta E = \Delta m c^2$ .

To do this, we return to frame $F_v$ . By (16+), the rightward moving photon has energy

$(1-v/c+v^2/2c^2 + O(v^3)) \frac{\Delta E}{2}$ ; (20+)

in this frame; similarly, the leftward moving photon has energy

$(1+v/c+v^2/2c^2 + O(v^3)) \frac{\Delta E}{2}$ . (20-)

What about the body? By repeating the derivation of (18), it must have energy

$(E - \Delta E) + \frac{1}{2} (m-\Delta m)v^2 + O(v^3).$ (20)

By the principle of relativity, the law of conservation of energy has to hold in the frame $F_v$ as well as in the frame F. Thus, the energy (20-)+(20+)+(20) in frame $F_v$ after the emission must equal the energy E’=(19) in frame $F_v$ before emission. Adding everything together and comparing coefficients we obtain the desired relationship $\Delta E = \Delta m c^2$ .

[One might quibble that Einstein’s argument only applies to emissions of energy that consist of equal and opposite pairs of photons. But one can easily generalise the argument to handle arbitrary photon emissions, especially if one takes advantage of (18); for instance, another well-known (and somewhat simpler) variant of the argument works by considering a photon emitted from one side of a box and absorbed on the other. More generally, any other energy emission which could potentially in the future decompose entirely into photons would also be handled by this argument, thanks to conservation of energy. Now, it is possible that other conservation laws prevent decomposition into photons; for instance, the law of conservation of charge prevents an electron (say) from decomposing entirely into photons, thus leaving open the possibility of having to add a linearly charge-dependent correction term to the formula $E = mc^2$ . But then one can renormalise away this term by redefining the energy to subtract such a term; note that this does not affect conservation of energy, thanks to conservation of charge.]

109 comments

Comments feed for this article

3 June, 2011 at 8:09 pm

Jon

Dear Terrence Tao,

Could you, pls, tell me what do I need to know to understand Einstein general relativity proof ?

I know that is necessary to understand Differential Geometry, right ?
What more ?
I want to set this as one of my lifetime Goals. :o)

Thanks

17 March, 2013 at 11:11 am

Anonymous

Study Tensors. Start with the metric tensor .

31 August, 2011 at 9:37 pm

Anonymous

e=mc 2 is wrong …there must be a factor v that is subtracted from the velocity of light inorder to get the correct equation and that subtracted quantity is the limiting value for the reference frame …if we got that reference frame we can understand the success of reaction ….

8 December, 2014 at 6:49 pm

hoyhiv

E= mc q<- einstein forgot theory of relativity

24 October, 2011 at 5:17 am

amateur

How can someone explain changes in 4-d space-time (t,x(t),y(t),z(t)) mathematically since 1 is constant:

(1,dx/dt,dy/dt,dz/dt)

8 June, 2012 at 3:45 pm

Scottpark

So useful post.Thank you!!

25 July, 2012 at 1:39 am

Kuhan

Click to access mbk-59-prev.pdf

has similar article

2 October, 2012 at 7:51 pm

Einstein’s derivation of E=mc^2 revisited « What’s new

[…] back in 2007, I wrote a blog post giving Einstein’s derivation of his famous equation . This derivation used a number of […]

3 October, 2012 at 6:37 pm

Wow Terrence Tao « The logic of images

[…] back in 2007, I wrote a blog post giving Einstein’s derivation of his famous equation for the rest energy of a body with mass . […]

29 November, 2012 at 10:18 pm

Milad

Reblogged this on Milad Ebrahimpour 's Blog and commented:
You see how the universe is ruled by math?

22 December, 2012 at 2:40 pm

An introduction to special relativity for a high school math circle « What’s new

[…] equivalence relation E=mc^2, largely following Einstein’s original derivation as discussed in this previous blog post); instead we will probably spend a fair chunk of time on related topics which do not actually […]

31 December, 2012 at 9:00 pm

Einstein’s derivation of E=mc^2 | What about being a physicist

[…] Einstein’s derivation of E=mc^2. […]

8 January, 2013 at 3:13 am

bert latif

good justification about e=mcc,,,,,,

23 January, 2013 at 2:10 am

Everything is Relative. (Trippy) Consequences of Special Relativity. Prove E = mc2 | A Revolving Wheel

[…] Terrytao: Deriving E=mc2 Ask a Mathematician: Q: Why does E=MC2? Wiki: Theory of Relativity Wiki: Special Relativity Wiki: Introduction to Special Relativity Wiki: Relativity of Simultaneity […]

17 March, 2013 at 11:07 am

jussilindgren

I’ve recently developed a possible model for gravitation that reduces to the Newtonian limit with low velocities/masses and yields also the mass-energy equivalence. The model is based on the idea to consider tensor products in flat Minkowski space. When one generalises then the concept of work-energy principle, one obtains the following nonlinear PDE system:

$\frac{1}{c}\frac{\partial \vec{u}}{\partial t}+\frac{1}{c}\vec{u}\cdot \nabla \vec{u}+ \frac{1}{c^2}\frac{\partial \psi}{\partial t}\vec{u}=\nabla \psi$

The respective energy equation is the following general wave equation

$\frac{1}{c^2}\frac{\partial \psi}{\partial t^2}-\Delta \psi=-\nabla \cdot (\frac{1}{c}\vec{u}\cdot \nabla \vec{u})- \nabla \cdot (\frac{1}{c^2}\frac{\partial \psi}{\partial t}\vec{u})$

This is work in progress, I have a blog on the issue at jussilindgren.wordpress.com

2 April, 2013 at 3:46 am

Osuma

What is the meaning of c in einstein equation

2 April, 2013 at 10:40 am

Reece

The speed of light

23 April, 2013 at 5:54 am

johngokul

I can’t believe it what a scientist was einstein. this website give chance to know about the universal equation.it will useful the student.

5 July, 2013 at 3:38 am

Aayush Dhital

Sir, I searched for the answer of the quesion “why mass causes curvature in space-time?” and I think I’ve found it. However, I’am not 100% sure on it. In my view every atoms contain a unit charge so any body with mass can be considered as a massive charge itself since it is the resultant of all the charges. In my view, this charge actually pushes the dark energy in the space and so does the dark energy. Hence, space-time is curved along the massive body….

16 August, 2013 at 6:15 pm

Anonymous

this cannot happen bcuz atom is electrically neutral

23 November, 2013 at 2:18 pm

socratus

The strange and magic E= Mc^2
1
In 1905 Einstein asked:
“ Does the inertia of a body depend upon its energy content?”
As he realized the answer was:
“ Yes, inertia depends on its energy content E= Mc^2 ”
Newton’s inertia doesn’t have force/energy,
Einstein’s inertia has energy. How to understand this difference?
2.
In 1928 Dirac said that E= Mc^2 can be as positive
as negative too. What is interaction between them ?
3 –
Sometimes E= Mc^2 can be ‘rest’ particle and sometimes
can be ‘active’ particle and can destroy cities like
Hiroshima and Nagasaki
Why E= Mc^2 is so strange ? Nobody gives answer
===.
All the best.
Israel Sadovnik Socratus.
==============..

25 January, 2014 at 7:51 pm

Ask a physicist anything. (8) - Page 61 - Christian Forums

[…] Equation The original formula dealt with the rest mass of a body, forgotten in modern textbooks. https://terrytao.wordpress.com/2007/1…ation-of-emc2/ __________________ "If one closes their eyes they can imagine a universe of infinite […]

16 February, 2014 at 11:32 pm

James Allen

Where’s Leibniz? Tsk-tsk.

9 June, 2014 at 11:49 am

Anonymous

Stillni havevthe easiest way to prove it.

28 October, 2014 at 11:39 am

dominic nentawe.

Please can someone deduce the einstein famous energy formula from the de broglie equation

17 December, 2014 at 12:27 am

yusra

how can we find the dimension of E=MC2

17 December, 2014 at 12:31 am

yusra

tell me quick.

10 January, 2015 at 8:13 am

On the Beauty Of Physical Theories | ENIGMA

[…] https://terrytao.wordpress.com/2007/12/28/einsteins-derivation-of-emc2/ […]

15 January, 2015 at 7:41 pm

mjg0

I urge everyone to read the paper by physicist Eugene Hecht entitled
“How Einstein confirmed E0=mc^2”, which you download from

http://www.loreto.unican.es/Carpeta2012/AJP(Hecht)Einstein2011.pdf.

Here is Hecht’s abstract:

The equivalence of mass m and rest-energy E0 is one of the great discoveries of all time. Despite the
current wisdom, Einstein did not derive this relation from first principles. Having conceived the idea
in the summer of 1905 he spent more than 40 years trying to prove it. We briefly examine all of
Einstein’s conceptual demonstrations of E0=mc2, focusing on their limitations and his awareness of
their shortcomings. Although he repeatedly confirmed the efficacy of E0=mc2, he never constructed
a general proof. Leaving aside that it continues to be affirmed experimentally, a rigorous proof of the
mass-energy equivalence is probably beyond the purview of the special theory [of relativity].

That last sentence is an important conjecture about the logic of SR.

A crucial point – which Terence emphasized he was avoiding – is whether mass changes with motion, i.e., whether mass is relativistic.

15 January, 2015 at 8:19 pm

mjg0

Hecht’s article is critical of Einstein’s Sept. 1905 argument, which Terence presented above. Some of Hecht’s criticisms:

This derivation came very early in the development of
relativity, and the formal concept of “rest-energy” had not
yet evolved, nor had E0 been introduced to symbolize it.
Unless Einstein was willing to assume that whenever E0=0,
m=0—that all of the mass of a material entity was equivalent
to energy—he could not take the analysis any farther
than he had in September 1905. Unfortunately, he did not
overtly share his concerns about this issue with his readers.

it would not be until 1912 that he was writing statements
like: “According to this conception, we would have to view a
body with inertial mass m as an energy store of magnitude
mc^2 rest-energy of the body.” To attribute E0=mc^2 or
even worse, E=mc^2 to Einstein via the September
1905 paper is to do injustice to the gradual nature of his
development of the concepts.

Einstein made
two intuitive and seemingly innocent leaps that went beyond
what he could prove. “Since obviously here it is inessential
that the energy withdrawn from the body happens to turn into
energy of radiation rather than some other kind of energy, we
are led to the more general conclusion: The mass of a body is
a measure of its energy content [Energieinhalt]….”

The concept
of rest-energy was still unformulated, and he was not
even using the term. Without proof, Einstein guessed that the
loss of other forms of energy besides electromagnetic would
result in an equivalent loss of mass. That is no small point,
especially because many physicists at the time believed mass
was entirely electromagnetic.

An unspoken assumption was made early in the September
1905 paper that each emitted light pulse was not material,
in that it carried energy but had no intrinsic mass.
Throughout his work on the special theory, Einstein maintained
that light was other than matter. As he put it in 1911,
“the comparison of light with other ‘stuff’ is not
permissible.” Thus, Einstein avoided a more complicated
calculation by conjecturing, without discussion or proof,
that light was massless. But of course he could not prove
such a thing, one way or the other.

Finally, it should be noted that plane waves are a mathematical
contrivance. No real extended body can actually
radiate electromagnetic plane waves as required by this
thought experiment. Naturally enough Einstein said nothing
about the physical structure of the hypothetical emitting
body that could perform such a feat. This omission likely
troubled him because, as we will see, he returned to the issue
seven years later with a more elaborate emitter.

What Einstein proved in September 1905 was that if an
imaginary material body could somehow emit radiant energy
E0 which is itself massless in the form of plane waves,
that extraordinary object would diminish in mass by E0 /c^2.
He did not prove that every form of energy was equivalent to
mass; that, he simply proclaimed. Yet, he was quite aware
that his analysis had limitations, and in 1907 he discussed the
need for a more general approach.

23 November, 2015 at 9:07 am

milkias tesfalem

it was amazing but you should write it in other African languages such as Amharic

16 May, 2016 at 1:31 am

Anonymous

Best

3 June, 2016 at 6:24 am

Anonymous

I think the link to Wikipedia article about Planck’s law should be replaced by a link to the page about Planck–Einstein relation.

[Corrected, thanks – T.]

3 June, 2016 at 9:10 am

Anonymous

There is also a similar error in the section “Energy and momentum of photons” and this post: https://terrytao.wordpress.com/2012/10/02/einsteins-derivation-of-emc2-revisited/

[Corrected, thanks – T.]

31 July, 2016 at 9:04 am

E=mc^2 – Making Sense of Complications

[…] https://terrytao.wordpress.com/2007/12/28/einsteins-derivation-of-emc2/ […]

6 August, 2016 at 2:46 pm

Michael Banashak

Dear Terence,

There is no such thing as a mass- energy relationship. What there is, is a mass-energy-momentum relationship. Einstein rejected the idea of relativistic mass in his later years, strongly warning against it. An objects energy can change with speed (velocity) but its mass NEVER changes.

Einstein’s great discovery was the “rest-energy.” The only energy being put equal to mass is the ” rest -energy.” Mass is invariant.

Eistein real equation for an object at rest was Eo=moc2 , also written Eo=mc2. “E=mc2” was a result of his careless writing that the medi latched on to. For further information, look inot the great Russian physicists Lev Okun’s articles.

Best wishes,
Mike Banashak

7 August, 2016 at 1:39 am

Anonymous

A zero rest mast particle (e.g. photon) demonstrates very clearly your remark! (in this case, $m_0 c^2 = 0$ – implying that the photon energy is contributed only by its momentum and not by its zero rest mass.)

18 November, 2016 at 4:50 pm

cybersharque

Which college-level course teaches this, and what are the pre-reqs? Does it (shudder) need complex analysis?

21 November, 2016 at 6:57 am

Lukasz T. Stepien

Dear Debaters

Michael Banashak has written that the mass-energy relationship does not exist. I think that saying about “equivalence of mass energy” is even more misleading.
Energy, as a conserved quantity, is connected to the symmetry of a system, with respect to time translations (it follows from Noether Theorem).
The square of mass is equal to (with the sign “minus”) the eigenvalue of the Casimir operator, which here is square of four-momentum
P^{\mu}P_{\mu} (the representation of the Poincare algebra – the massive case).

Best wishes,
Lukasz T. Stepien

4 December, 2016 at 2:02 am

Maxis Jaisi

Dear Terry, in Einstein’s original paper, there’s an additive constant C ( H_1 – E_1 = K_1 + C). But your derivation doesn’t. Are you setting C = 0? Is there any physical significance to the constant C?

9 November, 2017 at 1:02 pm

dendisuhubdy

Reblogged this on Artificial Intelligence Research Blog.

6 July, 2020 at 6:30 am

Zcp

like here, Thanks

28 July, 2020 at 11:12 pm

Rabi

Apparently not meant for ordinary folks.
A lot of mumbo-jumbo. Plain english is missing.
A well understood topic should be easy to communicate.

19 August, 2020 at 2:29 pm

Hollis Williams

Einstein also wrote a simpler ‘proof’ later in life, I’m not sure if Tao has read it. Anyway it’s in a book called The Essential Einstein and its only a few pages.

1 September, 2020 at 2:53 pm

Ahmed kamel

Great ♥️♥️

28 October, 2020 at 4:44 pm

fpmarin

It’s curious that $\vec{F}\cdot\mathrm{d}\vec{r} = c^{2}\,\mathrm{d}m$ .

29 October, 2020 at 9:10 am

Anonymous

It is important to note that the classical physical concepts such as mass, electrical charge and vector angular momentum, although “measurable” are not(!) rigorously well defined.
A rigorous mathematical definition of such concepts (for a given 3D “spatial domain slice” – with a fixed time-coordinate in 4D spacetime) should be based on the 4D spacetime metric.

A well-known example is the Kerr-Newman blackhole metric which is completely described by its intrinsic parameters (mass, electric charge, and vector angular momentum).

29 April, 2021 at 1:49 am

isengbelajar

Nice proof!

27 March, 2022 at 9:07 am

Механика Пуанкаре-Эйнштейна — Блог друга Вигнера

[…] потребуется дополнительное рассуждение (см. например, эйнштейновский вывод формулы ; но что более важно, если , то энергия и импульс не […]

	Anonymous on Erratum for “An inverse…
	Anonymous on Infinite partial sumsets in th…
	Anonymous on Infinite partial sumsets in th…
	Anonymous on A Banach algebra proof of the…
	Anonymous on A Banach algebra proof of the…
	Aleksandar on 245C, Notes 4: Sobolev sp…
	Anonymous on Erratum for “An inverse…
	Anonymous on Erratum for “An inverse…
	Anonymous on Erratum for “An inverse…
	Anonymous on Erratum for “An inverse…
	Terence Tao on 245C, Notes 4: Sobolev sp…
	Terence Tao on 275A, Notes 3: The weak and st…
	Terence Tao on What is a gauge?
	Terence Tao on Erratum for “An inverse…
	Terence Tao on 275A, Notes 3: The weak and st…

Einstein’s derivation of E=mc^2

Recent Comments

Articles by others

Diversions

Mathematics

Selected articles

Software

The sciences

Top Posts

Archives

Categories

The Polymath Blog

109 comments

Leave a comment Cancel reply

For commenters

Einstein’s derivation of E=mc^2

Share this:

Recent Comments

Articles by others

Diversions

Mathematics

Selected articles

Software

The sciences

Top Posts

Archives

Categories

The Polymath Blog

109 comments

Leave a comment Cancel reply

For commenters