The History of Algebra, part I: Negative Numbers

This is the post I promised over a month ago on two landmark books in the history of algebra:

Kitab al-Jabr wa-l-Muqabala, aka The Compendium on Calculating by Completion and Reduction

and

Ars Magna, aka The Great Art, or The Rules of Algebra
by Girolamo Cardano

A lot can be and has been said about these books. I’m going to zero in on one particular story they tell:

Take-home lesson #1: the mathematical world’s understanding of negative numbers came incredibly slowly, in very gradual stages. We tend to treat learning about negatives like there’s just one big idea to understand. Really, there are like twenty.

Reading these books has given me more respect than ever for the depth of the process we ask kids to go through between sixth and ninth grade as they get comfortable working with negatives.

Take-home lesson #2: In the process of understanding a new and difficult idea, the ability to understand and use the idea to answer a question comes way before the ability to pose a question about the idea. So, it makes sense to get very comfortable with -2 as the answer to 5-7 before ever asking yourself to add -2 to something.

Take-home lesson #3: The development of algebra is an important motivator, historically anyway, for the development of negatives.

Pedagogical idea: How can we use this historical motivation to develop negatives with students?
a) Al-Khwarizmi’s book contains a very limited idea of negativeness: that which has been subtracted. But since he is thinking about how to multiply, for example, an unknown with 2 subtracted, from the same unknown with 3 subtracted, he needs to see that, once everything has been distributed, the product of the subtracted 2 and the subtracted 3 contribute an added 6 to the total. It is not immediately obvious how this becomes a classroom activity but I think it definitely can. 20*20 is 400; how does taking away 2 from one of the factors and 3 from the other affect the product? We get kids thinking hard about this and it would support the most contrivance-free explanation for why (neg)(neg)=(pos) that I have ever seen.
b) Allowing the coefficients of equations to be negative significantly cleaned up the theory of equations. Our students know more about negatives than the inventors of algebra did. It might be really exciting and powerful, increasing their appreciation for both negatives and quadratics, to show or let them develop the original (negative-impaired) theory of quadratics, and then have them use negatives to clean it up.

* * * * *

The first of these books was written in Arabic, and published around 820 in Baghdad. 820. Just to make sure you didn’t miss that. The translation I read is 180 years old. The full text of it is available online.

What I am even more anxious to make sure you didn’t miss are certain bits of the author’s name and the book title. The author is Muhammad Ibn Musa Al-Khwarizmi. (Muhammad, son of Musa, from Khwarizm.) He is often referred to just as Al-Khwarizmi. This is the origin of the English word “algorithm.”

And as if that weren’t awesome enough, the “completion” in the title is the Arabic word “al-jabr.” This is the origin of the English word “algebra.”

The second book was written in Latin and published in 1545 in Renaissance Italy. I read a 1968 translation by T. Richard Witmer. I can’t find it online, but in case you read Latin, here is a pdf of the original. Its distinction historically is that it was the first publication of a general method for solving what we would now call cubic and quartic equations. (Cardano attributes the solution for one class of cubics to Niccolo Tartaglia and Scipione del Ferro, the generalization of this solution to other classes of cubics to himself, and the solution of quartics to his student Lodovico Ferrari.)

Both these books have been widely written about. I was reading them in the hopes of learning how these mathematical breakthroughs were understood in their own day. My original intent was to use this information to help me design the group theory course I am teaching. We are getting into the Galois theory of equations. The modern treatment of this subject, which is what I learned, doesn’t feel to me like it could serve as the basis of a natural and meaningful development for people who don’t already know it. (The way I learned it, which is how almost everybody learns it in this day and age, was the opposite of “natural” or “meaningful.” Very cool, but only in an after-the-fact sort of a way. Like you came to the show at the very end and only saw the climactic scene, and everybody in the audience gasped and shrieked, except you because you didn’t care about any of the people in the show because you just walked in one second ago. And then after it was over your friend explained to you what had been going on and you understood why the other audience members cared, and were kind of mad you hadn’t gotten to watch the rest of the play first. If that analogy made any sense.) My idea was, let me study how the theory of polynomial equations developed over time; then I’ll be able to put the class in the place of the developers of the theory, and so the insights will come about naturally, and make sense, and be compelling, against the backdrop of the questions they were designed to answer. Learning the historical context would be pedagogically fertile.

As it happened, I overshot the historical mark a bit – for the purposes of the class, the mindframes of these two books are unnecessarily archaic. There is real pedagogical fertility here, but it’s around ideas that the participants in my class (who are teachers and mathematicians) already understand.

On the other hand, I’ve taught plenty of students who don’t understand them. In particular, I found myself surprised and intrigued by what each book did and did not say about negative numbers. I felt like I was watching this idea (the negative) coalesce and congeal, roughly and haltingly, over time. Like a churning mixture of hard crystal clarity and murky goo. If I may.

Though separated by 700 years, both books find it necessary to give three different quadratic formulas. Because, you see, you need a different method to solve

$x^2 + 10x = 20$

than to solve

$x^2 = 10x + 20$

or

$x^2 + 20 = 10x$.
(Actually, this notation is anachronistic. Neither author uses anything resembling modern notation. Muhammad Ibn Musa writes everything in prose. For the first of these equations, for instance, he would write, “A square and ten roots equal 20 dirhems.” Dirhems are an Arabic unit of currency.)

We think of there being only one quadratic formula because we are comfortable moving everything to the left; the only difference a modern reader can see between these equations is a difference in the signs of the coefficients:

$x^2 + 10x - 20 = 0$

$x^2 - 10x + 20 = 0$

etc. And all the equations can be solved exactly the same way. But for neither of these authors had the idea of negativeness grown adequately supple to make this possible.

Since Cardano presents the full algebraic solution to cubic equations, the situation is even more extreme in Ars Magna. Each of the following gets its own chapter:
“On the cube and first power equal to the number”
“On the cube equal to the first power and number”

“On the cube, first power, and number equal to the square”
“On the cube, square, and number equal to the first power”
These are the first two and last two in a sequence of 13 chapters. This is over 20% of the book. Not only does each equation type get its own method of solution, each method gets its own (geometric) proof.

Histories of mathematics often mention the situation I’ve described here. For example, Mactutor’s history of quadratic, cubic and quartic equations says something like “the different types arise because Al-Khwarizmi had no zero or negatives.” This is the story I’d gotten before I picked up the originals, and what I found out is that it’s not true.

Both books calculate comfortably with something translated as “negative numbers”. Ars Magna goes so far as to contain a calculation with imaginaries. But the scope of the idea of negativeness is limited, in a different way, in each book. And I think I learned something important about how people come to understand negative numbers by taking note of these limitations.

In Muhammad Ibn Musa’s work, a “negative” is a number that’s been subtracted from another number. That’s it; that’s all it is. But this is enough to justify all the rules of arithmetic with negatives that we teach middle schoolers, because Muhammad makes use of all of them:

If there are greater numbers combined with units to be added to or subtracted from them, then four multiplications are necessary; namely, the greater numbers by the greater numbers, the greater numbers by the units, the units by the greater numbers, and the units by the units.

He is talking about FOIL in case that wasn’t clear.

If the units, combined with the greater numbers, are positive, then the last multiplication is positive; if they are both negative, then the fourth multiplication is likewise positive. But if one of them is positive, and one negative, then the fourth multiplication is negative.

This is on pp. 21-22. Elsewhere, he fluently adds and subtracts these “negative” (i.e. subtracted) quantities. For example, on p. 27,

The root of two hundred, minus ten, subtracted from twenty minus the root of two hundred, is thirty minus twice the root of two hundred; twice the root of two hundred is the root of eight hundred.

In other words,

$20 - \sqrt{200} - \left(\sqrt{200}-10\right) = 30 - 2\sqrt{200}$

My point is that Ibn Musa’s use of the idea of negativeness is so limited in scope that the word “negative” might even be sort of a mistranslation to a modern reader; however, this limited-scope idea fully supports all the rules of arithmetic we teach.

Cardano’s understanding of negativeness is much broader. For example, in the first chapter of the book, he explicitly discusses the possibility that a negative number might satisfy an equation. But throughout, his dealings with negatives are marked by a kind of choppiness, an inconsistency. Firstly, he refers to negative solutions to equations as “false” or “fictitious” (as opposed to “true”). Then, once he gets into the nitty gritty of solving equations, he pretty much stops mentioning them entirely. For example in chapter 8 he says “it is evident that when the middle power is equal to the highest power and the constant, there are necessarily two solutions…” We would say there are three (1 negative), and Cardano would have acknowledged this third solution in chapter 1.

What Cardano virtually never does with negatives (the one exception is below) is treat them like they can be coefficients. Solutions, but not coefficients: i.e. a negative number can be the answer to a question I asked but they can’t be the language in which the question is posed. Most of the time, the idea of working with negative coefficients appears simply to not occur to him. On one occasion, the spectre is invoked only to be dismissed (for reasons that are opaque to me). Cardano is discussing positive and negative solutions to equations in which a power equals a certain number. (I.e. solvable by the simple extraction of one root.)

It is always presumed in this case, of course, that the number to which the power is equated is true and not fictitious. To doubt this would be as silly as to doubt the fundamental rule itself for, though opposite reasoning must be observed in opposite cases, the reasoning is still the same. p. 11

What??

The point that I am making is that if Cardano is any example, negatives are much easier to get your head around as an answer than as part of the question. Allowing coefficients to be negative would have caused a massive increase in the efficiency of the theory: as noted above, Cardano gave separate solutions for thirteen forms of cubic equations. With negative coefficients, these thirteen cases are reduced to 2: quadratic term is zero vs. nonzero. I don’t know when this cleaning-up of the theory actually historically took place. Avital Oliver, whom I mentioned in my last post, told me that noticing how much negative coefficients would simplify the theory of equations was a major reason, historically, that negative numbers gained acceptance as numbers. That makes sense to me.

The one moment in the book where the idea of a negative number is entertained as part of the statement of a problem is in the absolutely fascinating chapter 37, On the Rule for Postulating a Negative:

This rule is threefold, for one either assumes a negative, or seeks a negative square root, or seeks what is not. p. 217

Cardano is being highly speculative here. He seems to think maybe the entire chapter he’s writing is crazy talk. He begins by considering equations with negative solutions. Even though he already spent chapter 1 talking about negative solutions, he feels the need to justify them here. He notes that

$x^2 = 4x + 32$

and

$x^2 = x + 20$

don’t appear to have a common solution, since 8 solves the first while 5 solves the second. However, the “turned-around” equations

$x^2 + 4x =32$

and

$x^2 + x = 20$

do have a common solution, namely 4. In chapter 1, Cardano asserted that a quadratic and its “turnaround” have opposite solutions: a “true” (positive) solution for one is a “fictitious” (negative) solution, equal in magnitude, for the other. So here, the original pair of equations have a common solution after all: -4. Cardano seems to feel (and I kind of relate) that the presence of the common positive solution between the turned-around equations and the formal relationship between the turned-around pair and the original pair means there ought to be a common solution for the original pair; the fact that this common solution turns out to exist if you allow negative solutions is then a reason to believe in negative solutions.

Anyway, he follows with two problems about the property of a man named Francis. The problems are totally contrived but they lead to negative solutions for Francis’ property, which he interprets as meaning that Francis has debt. Tellingly, though, he sets up the equations letting -x be Francis’ property, so that the equations he actually solves have positive solutions.

Then, he poses a problem that has no positive or negative solution: divide 10 into two parts whose product is 40. He follows the procedure he uses on comparable problems with real solutions (e.g. divide 10 into two parts whose product is 21): “… it is clear that this case is impossible. Nevertheless, we will work thus:…” (p. 219). The procedure forces him to subtract 40 from 25 and then take the square root of this. He already seems dubious about the subtraction 25-40:

The square root of the remainder, then - if anything remains - added to or subtracted from [five] shows the parts. But since such a remainder is negative, you will have to imagine $\sqrt{-15}$. p. 219

Note the “if anything remains.” So this “square root of a negative” business is a bunch of new hooey built on something that might be hooey to begin with. In that context it almost feels like what we’d now call imaginaries (and what Cardano calls “the sophistic negative”) are only a comparatively small speculative step beyond the craziness of negative numbers in the first place. The whole chapter has this I-know-this-is-complete-madness-but-I’m-just-gonna-do-it tone. A famous passage:

... you will have that which you seek, namely $5 + \sqrt{25-40}$ and $5 - \sqrt{25-40}$, or $5 + \sqrt{-15}$ and $5 - \sqrt{-15}$. Putting aside the mental tortures involved, multiply $5 + \sqrt{-15}$ and $5 - \sqrt{-15}$, making $25 - (-15)$ which is $+15$. Hence this product is 40... So progresses arithmetic subtlety the end of which, as is said, is as refined as it is useless. p. 219-220

(As above, the notation here is anachronistic; but the translation I read modernized all Cardano’s notation for ease of reading.)

It is in this wildly speculative chapter that Cardano – for the only time in the book – suggests a problem posed in terms of negatives:

... If it be said, Divide 6 into two parts the product of which is 40, the problem is one of the sophistic negative... But if it is said, Divide 6 into two parts the product of which is -40, or divide -6 into two parts producing -40, in either case the problem will be one of the pure negative... and the parts will be those that have been given [10 and -4, or -10 and 4]. If it be said, Divide -6 into two parts the product of which is +24, the problem will be one of the sophistic negative. pp. 220-221

What am I getting at with all this? Well I can’t tell you what to think but I am left with a completely new sense of the natural contours of learning about negatives.

I taught Algebra I for a long time. My students entered the class having trouble both conceptually and computationally with negative numbers. I did my duty and explained their meaning and operation, along with lots of practice for the kiddies, early in the year. Having always been concerned with understanding, I looked for models of negatives that would support all the operations I wanted kids doing. I wanted the model to instantiate as much of the mathematical structure as possible. The school I taught at had a woodshop program, and I got them to build me a board with a flat surface with holes cut in it and wooden pucks to fill the holes, so that I could physically model 1 + -1 = 0 and people would physically see how a hole combined with a wooden puck to make a flat surface. Subtraction of negatives would become removing holes, and this clearly required adding pucks to the surface; thus subtraction of negatives is adding positives. The model required another layer of contrivance to support multiplication: I had to ask students to imagine standing upside down, on the other side of the surface, so the holes became pucks and the pucks could be imagined as holes; then 3*-4 could be 3 people with the normal point of view, each standing by 4 holes, while -3*4 was 3 upside down people each standing by what appeared to them as 4 pucks.

It didn’t work as the centerpiece of teaching about positives and negatives. The multiplication problems make the contrivance really obvious, but actually there’s a certain amount of contrivance even in how it models addition. If I combine some pucks and some holes, who says that the pucks need to fall into the holes? I made kids draw tons of pictures of the whole thing, which completely wore them out, and I don’t know how much it added to their understanding. Meanwhile, the model, as all models do, made problems bigger, clunkier. Subtracting -5 from -7 was no thing: just fill 5 holes. But subtracting (-5) from 1 was like a whole production. The kids either needed to create 5 holes by removing pucks from them (and retaining the pucks – why would you do either thing?) before adding 5 new ones to fill the holes, or they needed to make the intensely abstract and not-adequately-justified leap that because subtracting -5 amounted to adding +5 when you were subtracting from a negative, the same thing should be true when subtracting from a positive. Retrospectively the fact that I asked my kids to make this leap of faith and told myself that I was actually helping them understand how math makes sense is kind of embarrassing.

But the thing is, as models go, I’ll stand behind this one as one of the better ones. I’ve seen cuter models for multiplication, e.g. on the wall of the classroom of my first former student to become a math teacher (yes I am now old enough for that to happen):
Do you LOVE to LOVE? You’re a LOVER.
Do you LOVE to HATE? You’re a HATER.
Do you HATE to LOVE? You’re a HATER.
Do you HATE to HATE? You’re a LOVER.
But none of these cuter models supports addition or subtraction as well, and sometimes it’s hard to see that they are even related to multiplication. Meanwhile, the only model I’ve ever seen, besides mine, that supports all four operations is the IMP curriculum‘s “hot and cold cubes.” And if you see the contrivance and unnaturalness in what I described above, “hot and cold cubes” is another level. Again, I think it’s kind of a brilliant model. But if you’ve ever tried to use it with low-skilled kids, you know how much production is involved in even getting them to imagine and buy into the scenario in the first place, let alone use all that machinery to solve problems.

It’s been a few years now that it’s seemed clear to me that the whole idea of teaching negatives through a particular model is not the way to go. People who use negatives effectively have gotten them down to a very slim abstract notion that supports all their operations and all their uses as representations of real things. (I would describe my own understanding with words like “opposite directionyness” – don’t laugh.) Teaching has to be aimed at this slim, efficient understanding as an end product. Forcing kids to engage with a whole clunky megilla of story and visual image every time they want to do a computation with negatives can’t possibly be the right path.

In more recent years I’ve found much more effective ways to teach negatives. I’ve been beginning by brainstorming with my students what negative numbers are actually, in the real world, used to represent. Not just debt, temperature and elevation. These aren’t enough. They capture the “below zeroiness” but not the “opposite directionyness,” since the positive direction is so fixed in each case. Also needed are examples of net change: gain or loss of money by a business; football yardage; etc. Furthermore, examples where negatives are used to specify direction in space or time: say uptown is positive; what would negative mean? What if east were positive? What if downtown were positive? If positive 3 means the space shuttle took off 3 seconds ago, what would -3 mean?

Using this conversation as groundwork has brought me much more success than the wooden board did, but there’s still something missing. It’s hard to find convincing examples familiar to kids that support multiplication, for one thing, except for the private tutoring student whose father was a stockbroker, because then short-selling a stock that goes down in price is (neg)(neg) = pos. But it’s more fundamental than that. I’ve still been starting from the question “what is a negative?” when the student’s only legitimate reason to believe negatives even exist is that school says so and her only legitimate reason to care is that she’ll be accountable for an answer.

This question puts the cart before the horse. A corollary of that amazing conversation with Avital Oliver I described last time is that when I teach a new idea I want to cause it to be needed, or at least cause its presence to be felt, cause students to become aware of it in the room with them, before it is ever named. So “what is a negative?” is not ultimately my desired opener for teaching about negatives.

What I’m left with after reading Cardano and Muhammad Ibn Musa is the beginning of an idea, modeled on the history of the concept itself, for what could take its place. So, here’s a curriculum brainstorm. It spans a lot of years and doesn’t fit in with anybody’s state frameworks, so I hope you’ll forgive the impracticality. I’m just fantasizing.

First, laying the groundwork (inspired by Ibn Musa): When you do arithmetic, how does subtracting something from the numbers affect the answer? How does 20 + 10 change if I subtract 3 from the 10? (To focus attention on the key point, what does the subtracted 3 do to the answer?) How does 20 – 10 change? How does 20*10 change? How does 20*10 change if I subtract 4 from the 20? How about if I subtract 4 from the 20 and 3 from the 10? What if I add 4 to the 20 and subtract 3 from the 10? The point is to engage the students in sorting out all these questions. (Why would they care about these questions? That’s a whole other thing but I don’t think a very hard one, and it will depend on the group of students – but I’m sure given any set of folks we can find a context to make these questions compelling.) Note that there is no “new kind of number” here. Some 3’s are subtracted, some added, that’s all. We very gently call their attention to the “subtracted 3” as an object worth talking about, but they already know what we mean; there’s nothing new to learn. I think this sorting-out is going to attune students’ antennae to the frequency in the universe on which negative numbers live.

Much later, once negatives come into play, stay respectful of the fact that they make sense as answers more easily than they make sense as questions. What number could you add to 7 and get 4? (No number! Even if you add nothing, it’s still 7.) If you could add something, what would that thing be like? In other words, bring forth the idea of negativeness as the answer to questions. (Perhaps your earlier “subtracted 3” will be what they come up with; perhaps not.) Do a lot and a lot and a lot of this, before ever asking anybody anything about negatives.

Later still, it will be time to develop equation solving intently. The way we do this in Algebra now, we build in the necessity for the methods to generalize to negative coefficients. Instead, start it earlier and use Muhammad Ibn Musa-typed problems. Let them develop techniques that feel most natural to them. (From lots of classroom experience, I can tell you that these will not be methods that generalize to negative coefficients.) Allow problems with negative solutions to creep in, but not negative coefficients. Negative numbers and their operations are becoming familiar, but still let the students do what’s comfortable in the realm of equation solving. Increase the sophistication of the equations; develop the solution of one of the three forms of the quadratic (what number can I multiply by itself, and then add 6 of itself, to get 40?). Pose problems in the other forms as well though. Finally, as a last act, lead them to the fact that allowing coefficients to be negative unifies all three cases of the quadratic into one and they can use a single method on all problems. How useful! Negatives are now official.

I would really love to do this with an out-of-school math circle of youngish kids or mathphobic adults. I need to get on that.

* * * * *

Two tidbits from these books that didn’t fit in with the main lines of thought above. There’s lots more where these came from but as usual I’ve already OD-ed so I have to draw the line somewhere.

a) Muhammad Ibn Musa gives a beautiful, though not rigorous, justification for the circle area formula that I’ve never seen before. He expresses the circle’s area as half its circumference times half its diameter. He explains that this is true because any regular polygon has an area equal to half its circumference times half the diameter of the inscribed circle. (Draw lines from the center to every vertex, and think about the areas of the triangles you get, to see that this is true.)

b) Cardano says something really darling about the solution of the cubic, that I just found delightful and have to share:

In our own days Scipione del Ferro of Bologna has solved the case of the cube and first power equal to a constant, a very elegant and admirable accomplishment. Since this art surpasses all human subtlety and the perspicuity of mortal talent and is a truly celestial gift and a very clear test of the capacity of men's minds, whoever applies himself to it will believe that there is nothing that he cannot understand. p. 8

20 thoughts on “The History of Algebra, part I: Negative Numbers”

1. Thank you once again. This blog post is ‘a very elegant and admirable accomplishment’, and gave me so much to think about.

I really liked thinking about 17*18 as the product of numbers just a bit less than 20. (20*20 is 400. If you take of a 20 by 3 strip and a 20 by 2 strip, you will have removed the 2 by 3 bit twice, so now you have to put it back in. Ahh, that reminds me of one of the probability rules.)

I’ve always used money, and talked about multiplication of negatives as taking away debts. (I owe everyone $5. 3 people say, “Never mind, you don’t have to pay me back.” That’s 3 people taking away the$5 debts, or -3*-5, and it makes me $15 richer.) 2. That’s fascinating stuff. Thank you. Normally, a person, a teacher, they wouldn’t learn this. But I think maybe they should. Where? When? 3. vlorbik says: terrific work here. and congratulations on having grandstudents. (whenever i’ve learned of any student of mine working as a math teacher i’ve always felt like it’s some kind of a win for me.) 4. Ben, thanks for this great post!! I printed it out and underlined all over it. Re: the goal of having the most contrivance-free explanation for dealing with negative numbers — back in the day, my 6th grade pre-algebra teacher tried to reassure me that negative numbers were not unnatural by telling me that congnitively/neurologically/or something, we don’t subtract — we add negatives. (I think he meant this to be reassuring, but I did not find it helpful at the time.) On the other hand, right now I’m tutoring a 2nd grader who taught himself how to add negative numbers with no outside involvement whatsoever. I appreciated your discussion of trying to create manipulatives to deal with negative numbers (removing holes). I’m curious to see how the Math U See curriculum would deal with that — since it’s manipulative-based and goes all the way up to calculus. The parts of the curriculum I’ve worked with so far have been quite elegant. I’ll let you know if how they deal with negative numbers via manipulatives is slim and supple, as is the goal! When you quoted that passage where al-Khwarizmi was talking about FOIL — do you happen to know if that was the first example of FOILing? Or does FOILing go back earlier in the history of algebra? And the part when you quote Cardano talking about “you will have to imagine the square root of -15” … I was intrigued by the fact that he said “you will have to IMAGINE” while describing IMAGINARY numbers. Do you think he meant it in terms of, “something we would have to imagine” or “something that’s the opposite of real”? You probably already know this, but while imaginary numbers were first observed by the ancient Greek mathematician Heron of Alexandria, they didn’t gain widespread acceptance until way later — even Descartes used the term “imaginary” in a derogatory way! 1. Hi Rebecca, thanks for the thoughtful engagement! I’ve never heard of the Math U See curriculum but it sounds intriguing. Definitely keep me posted. Re: FOIL. I don’t rightly know. Places to look for earlier instances might be Diophantus (Alexandria, 3rd century) and Brahmagupta (India, 6th century), neither of whose work I’ve studied in detail. Also, for the record, Al-Khwarizmi presents all the contents of his book as an exposition rather than an invention, i.e. he makes it sound like everything he’s explaining is well-known among the experts of the time. Regarding Cardano’s use of the word “imaginary,” this I can definitely answer. He was just saying that thinking about the square root of -15 requires a flight of fancy. He was certainly not saying anything like “the opposite of real.” To Cardano, negatives were barely real. The idea that the positive and negative numbers constitute a self-contained system (which we now call “the reals”) was centuries away. My understanding is that the term “real numbers” to refer to both positives and negatives actually arose after and because of the term “imaginary” being used to refer to the square roots of negatives. (I’ve read this several places but I can’t remember where right now.) Actually I didn’t know that about Heron. Do you know more about this? In what sense did he “observe” them? One of the big lessons for me in doing the research I did for this post was finding out that when secondary sources tell me “so-and-so talked about negatives/imaginaries/whatever other modern concept” there is usually a lot more to learn about what exactly the historical source meant and it’s probably pretty different from what we mean these days. 5. shana donohue says: Because my students have such a hard time with negative numbers (ie: solve for y in y + 25x = 3x + 7), I started thinking about what the problem was. I would get answers like “y = -28x + 7” or “y = 22x + 7” so it was obvious there was a lack of understanding of negatives. For my thesis, I began looking into when negative numbers are taught- 7th grade! What?? That’s too late in my opinion. Then I began to look into HOW they are taught- with a number line. But at the very beginning of the first lesson in 7th grade, there is a picture of a boy with a caption above his head reading “I owe my dad$4. I have -$4” So this idea of owing is tied directly into negatives. So I thought about owing someone some money, paying some back, and figuring out how much more I owed. If I borrowed$12 and paid you back \$7, the problem would look like “-12 + 7” but I would solve the problem, in my head, by counting from 7 to 12. This is not the way we are taught in school. The way we are taught in school is to “find -12 on the number line, count 7 to the right, see what number you land on.” But this isn’t what we do in real life!

Absolute value is the answer. Although “take the difference between the absolute values of the two numbers” is a bit of a mouthful, it is the way to go. This way both numbers, -12 and 7, are treated as real numbers instead of -12 being treated as a number and 7 being treated as a movement. I really think that if we teach kids this way they will begin to see the relationship between positives and negatives and no longer make mistakes when they get to me!

6. Largo says:

Teach the bare minimum f complex arithmetic, with a model that makes it easy to visualize. WRT multiplication, complex numbers are “twirls” == combinations of rotations and zooms of a camera looking down on the plane. (-1)(-1)==(1) becomes obvious.

I speak as one with more ideas about teaching than practical experience! :-p

1. This is a beautiful context for revisiting (-1)(-1) down the line. I can’t see it as a way into teaching (-1)(-1) though, at least to the students I had in mind. (People first learning about negative numbers.) My experience is that approaching an abstraction (negative multiplication) with a more elaborate abstraction (complex multiplication) is never the way to go. The approach to abstraction has to be from the concrete side. Also, a new abstract idea plays much better to a crowd that can already sense its usefulness for answering a question about something they already understand. (Introducing complex numbers because you know they are going to explain something about negative numbers, once they are fully developed, to a class or student who barely understands negatives, doesn’t have this virtue because the class/student can’t perceive where it’s headed till it’s already there. They are going to have to bear with you through some heavy stuff, without the context of the problem it’s solving to orient their learning.) But if you love the idea enough, find some students and give it a whirl! And let me know how it goes.

7. shana donohue says:

I feel like a dummie being a math teacher and being in grad school for math education and all, but can someone please explain to me, in layman’s terms: “Teach the bare minimum f complex arithmetic, with a model that makes it easy to visualize. WRT multiplication, complex numbers are “twirls” == combinations of rotations and zooms of a camera looking down on the plane. (-1)(-1)==(1) becomes obvious.”?

Are you talking about i? What’s WRT multiplication? I don’t get the whirls and zooms of a camera reference…

1. I’m assuming Largo meant “bare minimum of complex arithmetic” and that WRT = “with respect to.” Yes, i is what’s being talked about.

As for the twirl/zoom thing, this is one of the totally most awesome things about complex numbers. I looked briefly for a good reference online but can’t find. I’ll explain very very briefly and give you a book reference.

A complex number is a real number added with a multiple of i. (So, a+bi with a and b real.) Analogous with real numbers on a number line, you can see complex numbers as points on a plane, with the real part as the x-coordinate and the i part as the y-coordinate – so, 3+5i would get plotted at (3,5). If you draw them that way, then when you muliply them by each other a very beautiful thing happens.

First consider the simplest case: multiplication by a real number, say 2. Every a+bi becomes 2a+2bi – its x and y coordinates both double. So every point zooms out from the origin by a factor of 2. You can see it as the whole plane uniformly stretching out away from the origin. This is what Largo was talking about with the “zoom.”

Next simplest case: multiplication by i. a+bi times i becomes ai+bi^2 = -b+ai. The x coordinate becomes the y coordinate and the y becomes the -x. If you look at what’s happening visually, every point in the plane is rotating 90 degrees counterclockwise about the origin. Thus the whole plane is being “twirled” through 90 degrees.

Here I considered multiplication by 2 and by i, but it is provably true that multiplication by _any_ complex number a+bi has an effect on the whole plane that is one or both of these effects: it stretches the whole plane away from the origin (or shrinks it toward it), and/or rotates it about the origin. How rad!

This is covered in great detail in my new favorite complex analysis textbook, Visual Complex Analysis by Tristan Needham.

Btw, I would be wary of the idea that “absolute value is the answer” as you put it. First of all, “take the difference between the absolute values of the two numbers” is a maxim that it is very easy to apply without actually thinking of the meanings of the numbers. Secondly, students ultimately need to be able to think of numbers flexibly either as “real numbers” in your sense or as movements. I believe (after having spent years doing it) that it’s always a doomed enterprise to try to pick which of several mental models for a math concept kids should be using. This is what I was getting at in the post about “teaching negatives through a particular model is never the way to go.” They need to get in touch with the abstraction, and the abstraction is connected to all possible ways negatives can be thought of. In particular, if you can’t see positive and negative as forward and backward motion (or even, motion in one arbitrary direction and motion in the opposite direction), you’re really missing part of the story. Debt isn’t enough. Kids need to deal with problems that invoke every possible idea of negativeness, and to see connections between all of them.

8. shana donohue says:

Thank you benbluesmith for that great explanation. In addition to Algebra 2, if I stay at the same school next year I’ll also be teaching an elective on number theory and the hitory of numbers. I took a siper class at Harvard Extension with Professor Oliver Knill on this topic, but my whole class won’t even touch one lecture of his. Still, Ima try!

But I do still think that my math tool will help kids with combining positives and negatives, so I’ll have to disagree with you there! I’ll be testing it out with a group of students as part of my thesis in the fall, and I’ve sold a bunch already through my WordPress blog. No complaints yet! “Fill the hole” says Teacher Escalante in Stand and Deliver, “What’s -1 + 1?”

9. Great and helpful article, but your curriculum brainstorm seems too slow. Why don’t you actually tell them the history of negative numbers that you have disclosed in this article, instead of having them reinvent that history themselves? Of course it is important for them to figure out things themselves, but your time-span of “a lot of years” seems too long, and I don’t think that degree of figuring-out-things-themselves will enhance their understanding of negative numbers in a way that it won’t get enhanced later anyway, when they go further into math that uses and subsumes negative numbers.

10. I saved this and just reread it! Such a nice piece of work!

I was actually hunting for the history of the cubic, and the problem solving duels and deathbed declarations – and I will find them – but this was a great detour.

1. Hi jd2718! Great to see you here in 2020/2021! And thank you for the kind words! I’m still proud of it!