
The simplest derivation of the Pythagorean theorem

May 8, 2013

Sometimes I am amazed by the permanence of mathematical discovery.  Math, it seems to me, is quite unique among the creative intellectual pursuits (science, art, engineering) for the seemingly unlimited lifetime of its innovations.

For example, Aristotle was a brilliant natural philosopher, as much a genius as just about any modern scientist, and he advanced (what would become) physics tremendously during the 4th century BC.  But by now his theory of the five elements is completely unnecessary for anyone to learn.  While it produced an important advancement in our thinking, it has been replaced by more correct physical theories.  Thus, Aristotle suffered the same fate that seemingly every scientist or inventor meets eventually: further discoveries made him obsolete.

Pythagoras, on the other hand, who lived roughly 200 years before Aristotle, is someone whose major contribution to mathematics is still used every day.  I literally could not do my job without the Pythagorean theorem, and neither could just about any scientist or engineer.  Unlike nearly all other kinds of innovations, it has very much not been replaced.

After 2,500 years, a^2 + b^2 is still equal to c^2.

What’s important to notice is not just that Pythagoras’s result is still important, but that the type of reasoning that leads to his result is still important.  Put simply, a good scientist or engineer needs to be capable of understanding and reproducing a derivation of the 2,500-year-old Pythagorean theorem, not just because the theorem is important, but because that level of logical thinking is necessary for their job.

So in this post I think it’s worth sharing my own favorite derivation of the Pythagorean theorem.  This derivation is the simplest one I know of, and it doesn’t require any tremendous geometric cleverness (like a tangram puzzle) or complicated diagrams.  Instead, it relies only on a very basic use of scaling arguments.

Scaling arguments are among the simplest and most powerful tools in theoretical physics.  They allow you to reach remarkably concrete conclusions about a problem even when you know essentially no details about the system in question.  The key idea is to imagine scaling the system up or down in size, and then to say something about how it should change as you do so.

For example, suppose you don’t know anything about triangles except that they have an area.  Since area is measured in units of length squared, you can immediately say that if you take some triangle and make every length in it X times bigger, then its area must get X^2 times larger.

In other words, if the following triangle has area A

[Figure: a small triangle]

then the triangle below, which is the same as the previous one only magnified two times, must have an area 2^2 A.

[Figure: the same triangle, magnified two times]

Meanwhile, all the side lengths of the bigger triangle are exactly twice as long as those of the smaller one.
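If you would rather see this with numbers, here is a quick sanity check in Python.  It is only a sketch: the example triangle, the function names, and the use of the shoelace formula for the area are my own choices for illustration, not part of the argument.

```python
def triangle_area(p, q, r):
    """Area of a triangle from its vertex coordinates (shoelace formula)."""
    return abs((q[0] - p[0]) * (r[1] - p[1]) - (r[0] - p[0]) * (q[1] - p[1])) / 2.0


def scale(points, factor):
    """Magnify every coordinate, and hence every length, by the same factor."""
    return [(factor * x, factor * y) for x, y in points]


# An arbitrary example triangle (hypothetical numbers, chosen for convenience).
small = [(0.0, 0.0), (4.0, 0.0), (1.0, 3.0)]
area_small = triangle_area(*small)

for factor in (2, 3, 0.5):
    area_scaled = triangle_area(*scale(small, factor))
    # The ratio of areas should equal factor squared.
    print(factor, area_scaled / area_small, factor ** 2)
```

Each printed line shows the magnification, the measured ratio of areas, and the magnification squared; the last two should agree.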

What all this means is that, for a given triangle, the area is proportional to the square of any one of its side lengths.  I know this because as I make the triangle X times bigger, the side lengths all get X times longer, and the area gets X^2 times bigger.  So if I want I can write

A = (\text{something}) \times (\text{hypotenuse length})^2.

The “something” in that equation depends on the angles in the triangle, but for now let’s assume that I am more or less completely ignorant about triangles and I can’t tell you what it is.  Luckily enough for ignorant me, it turns out I don’t need to know what the “something” is in order to prove the Pythagorean theorem.

The key trick is to divide the large triangle into two smaller triangles, each with exactly the same shape as the original.  That is, take this triangle:

[Figure: a right triangle with its angles marked]

and draw one line (an altitude through the right angle) so that it gets divided into two smaller triangles, like this:

[Figure: the right triangle divided in two by the altitude]

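You can tell that the two newly-created triangles are just scaled-down versions of the original one, because they have all the same angles: each one contains a right angle (where the altitude meets the hypotenuse) and shares one of its other angles with the original triangle, so the third angle must match as well.  This means that the original triangle can be written as the sum of two smaller triangles of exactly the same shape.  Like this: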

[Figure: the original triangle equals the sum of the two smaller triangles]
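The claim that the altitude produces two triangles with the same angles as the original can also be checked numerically.  The sketch below assumes a right triangle with legs 3 and 4 placed on the coordinate axes; the vertex labels and helper names are mine, purely for illustration.

```python
import math


def angles(p, q, r):
    """Interior angles of triangle pqr in degrees, sorted in ascending order."""
    def angle_at(v, u, w):
        ax, ay = u[0] - v[0], u[1] - v[1]
        bx, by = w[0] - v[0], w[1] - v[1]
        # atan2(|cross product|, dot product) gives the angle between the two edges.
        return math.degrees(math.atan2(abs(ax * by - ay * bx), ax * bx + ay * by))
    return sorted([angle_at(p, q, r), angle_at(q, p, r), angle_at(r, p, q)])


a, b = 3.0, 4.0   # leg lengths (example values)
C = (0.0, 0.0)    # vertex with the right angle
B = (a, 0.0)      # end of the leg of length a
A = (0.0, b)      # end of the leg of length b

# Foot of the altitude: project C onto the line through A and B.
dx, dy = B[0] - A[0], B[1] - A[1]
t = ((C[0] - A[0]) * dx + (C[1] - A[1]) * dy) / (dx * dx + dy * dy)
D = (A[0] + t * dx, A[1] + t * dy)

print(angles(A, B, C))  # the original triangle
print(angles(A, C, D))  # smaller triangle whose hypotenuse is the leg of length b
print(angles(C, B, D))  # smaller triangle whose hypotenuse is the leg of length a
```

All three lists of angles should agree up to floating-point rounding, which is just the statement that the three triangles differ only in size.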

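Finally, to prove the Pythagorean theorem, we just have to invoke the one equation in this post, A = (\text{something}) \times (\text{hypotenuse length})^2, for each triangle.  The original triangle has hypotenuse c, while the two smaller triangles have hypotenuses a and b (the two legs of the original), and their areas add up to the area of the original.  This gives: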

(\text{something}) \times c^2 = (\text{something}) \times a^2 + (\text{something}) \times b^2.

Since all three triangles have the same angles, all the “something”s are also the same, so they cancel from both sides, which means

a^2 + b^2 = c^2.
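For the skeptical, here is a small Python sketch that measures the “something” for each of the three triangles.  It assumes Python 3.8 or later (for math.dist), and the function names are hypothetical.  Note that it is a sanity check and not a proof: computing distances from coordinates already takes the Pythagorean theorem for granted.

```python
import math


def shoelace_area(p, q, r):
    """Area of a triangle from its vertex coordinates."""
    return abs((q[0] - p[0]) * (r[1] - p[1]) - (r[0] - p[0]) * (q[1] - p[1])) / 2.0


def something(p, q, r):
    """Area divided by the square of the longest side (the hypotenuse here)."""
    longest = max(math.dist(p, q), math.dist(q, r), math.dist(r, p))
    return shoelace_area(p, q, r) / longest ** 2


def split_right_triangle(a, b):
    """Return the right triangle with legs a and b, plus the two triangles
    obtained by dropping the altitude from the right angle."""
    C, B, A = (0.0, 0.0), (a, 0.0), (0.0, b)
    t = b * b / (a * a + b * b)      # position of the altitude's foot along AB
    D = (t * a, b - t * b)
    return (A, B, C), (A, C, D), (C, B, D)


whole, left, right = split_right_triangle(3.0, 4.0)
for tri in (whole, left, right):
    print(something(*tri))

# The two smaller areas add up to the original area.
print(shoelace_area(*left) + shoelace_area(*right), shoelace_area(*whole))
```

The three printed values of the “something” should coincide up to rounding, and the last line should show the two smaller areas adding up to the original one.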

Not bad, eh?

I don’t know whether you found the above proof “aesthetic,” but I certainly did.  And it’s a pretty nice feeling to think that an insight had by someone more than 2,500 years ago can still feel beautiful to someone like me.  Even more remarkable is that my life (and professional career) continues to profit from it.

Footnote

I learned the proof above from Leonid Levitov.  As it happens, he presented it during a talk about atomic collapse!

UPDATE: A number of readers have pointed out that they learned this argument from Migdal’s wonderful book Qualitative Methods in Quantum Theory (which is probably where Levitov learned it also).

9 Comments
  1. Lagrangian Mechanic
    May 8, 2013 7:46 pm

    I love it! That’s my new favorite proof of the theorem.

  2. sabre51
    May 9, 2013 9:37 am

    Loving the new blogging, keep it coming!

  3. jacobus
    May 9, 2013 4:14 pm

    I don’t know whether you found the above proof “aesthetic”

    With a variable name like “something,” how could it not be?

  4. May 19, 2013 10:34 am

    This is basically a kind of scaling derivation of Pythagoras’ theorem

  5. jaskho
    December 1, 2015 3:04 pm

    It just so happens that 2 days ago I was teaching my son the pythagorean theorem and wishing that I could further encourage his filial delusions about my brilliance (yeah, that’s the sort of person I am) with an extemporaneous proof.
    It also just so happens that 1 day ago I fell down the Wikipedia rabbit hole and before long found myself at G&L, reading quite compulsively and to the detriment of my health, finances and marriage–don’t blame yourself–trying once again to scratch the perennial itch that is my desire to feel like I (am smart enough to) have more than a “Popular Science” grasp on the central concepts of quantum theory.
    Anyway, the convergence, in this post, between those two things that just so happened was the “vacuum fluctuation” that nudged me out of the energy well that normally keeps me from posting online, to say thanks.
    I mean that quite sincerely, and with more force than might be indicated by the word as written. Beyond what I’ve received in terms of learning and entertainment, spending time in the field of your erudition, clear thinking, elegant prose, passion for teaching, humor, humility, personableness, and, not least, immense generosity, have delighted, engaged and uplifted me. So, again, thanks.

    • Brian
      December 5, 2015 8:54 am

      You are entirely too kind, but I’m very glad that you’ve enjoyed the blog.

  6. Anonymous
    October 30, 2016 5:01 pm

    Thanks Brian. But something is not scientific.

    • Brian
      October 31, 2016 10:33 am

      Maybe you just have an inflated opinion of the dignity of scientific arguments. 🙂
      I can assure you that plenty of science proceeds through arguments that are no more stately than my (area) = (something) * (side length)^2.

Trackbacks

  1. The Fibonacci sequence, under duress | Gravity and Levity
