₪ in the twenty- first century

Martin Gardnerin the Twenty-First Century

Michael Henle and Brian Hopkins, Editors

D

D

EE

0

E

E

F

FF

F

D

D

G

G G

G

4

1 or 3

2

2

A

B 128.57o

1

1

1

102.86o

1.80

2

2.247

C

D

E

F

G

Martin Gardner enormously expanded the field of

recreational mathematics with the Mathematical Games

columns he wrote for Scientific American for over 25 years

and the more than 70 books he published. He also had

a long relationship with the Mathematical Association

of America, publishing articles in the MAA journals

right up to his death in 2010. This book collects articles

Gardner wrote for the MAA in the twenty-first century,

together with other articles the MAA published from

1999 to 2012 that spring from and comment on his work.

Martin Gardnerin the Twenty-First Century

Michael Henle and Brian Hopkins, Editors

9 780883 859131

ISBN: 978-0-88386-913-1

v1 vh vi vj vn

Martin G

ardner in the Twenty-First Century

Michael Henle and

Brian Hopkins, Editors

MAA-Gardner cover v7POD_Layout 1 11/19/12 4:10 PM

i

i

“master” — 2012/11/15 — 11:23 — page i — #1i

i

i

i

i

i

Martin Gardner

in the

Twenty-First Century

i

i

“master” — 2012/11/15 — 13:32 — page ii — #2i

i

i

i

i

i

c 2012 by the Mathematical Association of America, Inc.

Library of Congress Catalog Card Number 2012954077

Print edition ISBN: 978-0-88385-913-1

Electronic edition ISBN: 978-1-61444-801-3

Printed in the United States of America

Current Printing (last digit):

10 9 8 7 6 5 4 3 2 1

i

i

“master” — 2012/11/15 — 11:23 — page iii — #3i

i

i

i

i

i

Martin Gardner

in the

Twenty-First Century

edited by

Michael Henle

Oberlin College

and

Brian Hopkins

Saint Peter’s University

Published and Distributed by

The Mathematical Association of America

i

i

“master” — 2012/11/15 — 11:23 — page iv — #4i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page v — #5i

i

i

i

i

i

Preface

Martin Gardner and the MAA share a long history. In 1958, around the time he started his

famous “Mathematical Games” column for Scientific American, he submitted the first of

many problems to The American Mathematical Monthly. In 1982, as his column wound

down, Gardner’s first MAA article was published in The Two Year College Mathematics

Journal. He wrote for MAA journals the rest of his life, particularly The College Mathe-

matics Journal and Math Horizons. Gardner contributed to the latter almost annually from

its founding in 1993 until 2005.

Gardner’s prodigious writing activity continued right until his death in 2010. Articles,

stories, problems, solutions, Quickies, and other kinds of contributions continued to flow.

His last mathematical article to appear in an MAA journal, “L-tromino Tiling of Mutilated

Chessboards,” was the centerpiece of a special puzzle issue of The College Mathematics

Journal in 2009, and it is included here.

Early in 2010, Math Horizons editors Steve Abbott and Bruce Torrence were surprised

to receive a typescript manuscript. Gardner used a typewriter his whole life, never email.

The submission was accompanied by a note, “Is this short story something you can use? I

wrote the math column in Scientific American for 25 years. If my piece is not right for Math

Horizons, there is no need to send it back. All best, Martin.” There was not enough time

for the editors to thank Martin for his submission [6]. Fittingly, this story, “Superstrings

and Thelma,” is the last piece in this collection.

Apart from his own work, Martin Gardner, by enormously expanding the field of recre-

ational mathematics, opened up vast mathematical tracts for exploration by others. This

was quite deliberate. In an interview in The Two-Year College Mathematics Journal [1],

Gardner said, “I’m defining [recreational mathematics] in the very broad sense to include

anything that has a spirit of play about it.” Gardner, of course, had a refined and very

well-developed sense of play, one quality that made his pieces so enjoyable to read. In

almost everything he wrote, Gardner posed problems to challenge his readers, and they

responded. He maintained an extensive correspondence with mathematicians, both profes-

sional and amateur. Their work fueled his own pieces, but then his correspondents turned

around and wrote their own articles.

One consequence was the expansion of recreational mathematics into a major research

area (also helped by the development of the computer and the corresponding expanded in-

terest in discrete mathematics) that is such a feature of the current mathematical landscape.

The CMJ devoted the January 2012 issue to papers on topics that Gardner introduced to the

mathematical public. There were so many articles to include that half of the March 2012

issue continued the theme.

v

i

i

“master” — 2012/11/15 — 11:23 — page vi — #6i

i

i

i

i

i

vi Preface

Another consequence of the flowering of recreational mathematics is this volume. We

have collected MAA journal articles starting from 1999 on topics that Gardner developed.

Some are written by Gardner, but most are by others. The tribute articles from the January

and March 2012 CMJ issues are all here, but they constitute less than half of this collection.

All the MAA journals are represented, Mathematics Magazine and the Monthly, as well as

CMJ and Math Horizons. The limitation to pieces published roughly in the twenty-first

century is a practical one. Even so, some puzzle collections, longer articles, and pieces less

directly linked to Martin Gardner have been omitted.

The 41 pieces collected here are grouped around common themes, such as geometry,

number theory and graph theory, and cards and probability. Flexagons, the topic of Gard-

ner’s first Scientific American column, are seen to be associated with Catalan numbers and

together merit their own section. Geometric tiling and various “magic” number puzzles

are all about “Making Things Fit,” and there are enough other puzzles and games to fill

another section.

Gardner’s interests ranged far beyond mathematics. A fan of magicians and magic

tricks from childhood (“I waste a lot of time on it” [2]), he wrote several books on magic.

He annotated Lewis Carroll’s Alice books and other classics, and produced two novels of

his own. Other topics included philosophy, religion, literature and pseudo-science, leading

to some 70 books.

The last section of this volume highlights some of these other facets of Martin Gard-

ner’s wide-ranging interests. It includes two short stories by Gardner and several other

pieces that demonstrate his support for amateur mathematicians, his love of play (about an

April Fool’s joke he played on his Scientific American readers), and his interest in debunk-

ing false science. Also included is Gardner’s review of a 2004 novel in which an important

character seems to be based on him.

Our hope is that this volume will play a role in perpetuating the memory of Martin

Gardner, modest celebrity, larger-than-life character, self-confessed amateur as a mathe-

matician, popularizer of recreational mathematics in the broadest sense, prolific and bril-

liant writer. Given the lasting impression he made on several generations of mathematics

enthusiasts of all backgrounds, we are confident that the MAA and others will be publish-

ing articles inspired by Gardner’s work for a long time.

Gardner, like the readers of this book, loved mathematics. We close this preface with

Gardner’s own words on his background, the community, and why he enjoyed the field so

much (from [5], [4], and [3], respectively).

I took no math in college. I’m like a person who loves music but can’t hum

a tune or play an instrument. My understanding of math does not go beyond

a minimal understanding of calculus. I hasten to add that I consider this one

reason for the success of my Scientific American column. I had to work hard

to understand whatever I wrote about, and this made it much easier for me to

write in a way that readers could understand.

When I first started the column, I was not in touch with any mathemati-

cians, and gradually mathematicians who were creative in the field found out

about the column and began corresponding with me. So my most interesting

columns were columns based on material that I got from them, so I owe them

a big debt of gratitude.

i

i

“master” — 2012/11/15 — 11:23 — page vii — #7i

i

i

i

i

i

Preface vii

[I enjoy mathematics] because it has a strange kind of unearthly beauty.

There is a strong feeling of pleasure, hard to describe, in thinking through an

elegant proof, and even greater pleasure in discovering a proof not previously

known.

Aclnowledgments

The editors are grateful for the encouragement and hard work of the MAA publications

staff: Director of Publications Ivars Peterson, who conceived the book, Production Man-

ager Carol Baxter, who designed it, and especially Electronic Publication Manager Beverly

Joy Ruedi, who set the book in type.

Bibliography

[1] Anthony Barcellos and Martin Gardner, A Conversation with Martin Gardner, Two-Year College

Math. J. 10 (1979) 233–244.

[2] Don Albers and Martin Gardner, On the Way to “Mathematical Games”: Part I of an Interview

with Martin Gardner, College Math. J. 36, (2005) 178–190.

[3] Don Albers and Martin Gardner, “Mathematical Games” and Beyond: Part II of an Interview

with Martin Gardner, College Math. J. 36 (2005) 301–314.

[4] Colm Mulcahy and Martin Gardner, An Interview with Martin Gardner, Card Colm, October

2006, available at www.maa.org/columns/colm/cardcolm200610.html

[5] Michael Henle and Martin Gardner, Interview with Martin Gardner, College Math. J. 40 (2009)

158–161.

[6] Bruce Torrence and Stephen Abbott, To Our Readers, Math Horizons 18 (2010) 2–4.

www.maa.org/columns/colm/cardcolm200610.html

http://www.maa.org/columns/colm/cardcolm200610.html

i

i

“master” — 2012/11/15 — 11:23 — page viii — #8i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page ix — #9i

i

i

i

i

i

Contents

Preface v

I Geometry 1

1 The Asymmetric Propeller 3

Martin GardnerGardner, paying tribute to dentist and geometer Leon Bankoff, discusses some of his unpub-

lished results and concludes with an open question.

2 The Asymmetric Propeller Revisited 7

Gillian Saenz, Christopher Jackson, and Ryan CrumleyThree University of Texas students use dynamic geometry software to confirm Bankoff’s re-

sults and resolve Gardner’s question.

3 Bracing Regular Polygons As We Race into the Future 11

Greg W. FredericksonA problem Gardner published in 1963 continues to spur generalizations and improved solu-

tions around the world.

4 A Platonic Sextet for Strings 19

Karl SchafferThe professor and dance company co-director details string polyhedra constructions for ten

participants.

5 Prince Rupert’s Rectangles 25

Richard P. Jerrard and John E. WetzelA 17th century puzzle that Gardner posed in higher dimensions is here solved in the case of

three-dimensional boxes.

II Number Theory and Graph Theory 35

6 Transcendentals and Early Birds 37

Martin GardnerGardner moves from Liouville to an “innocent but totally useless amusement” that nonetheless

captured the attention of Solomon Golomb.

7 Squaring, Cubing, and Cube Rooting 39

Arthur T. BenjaminThe professor and “mathemagician,” inspired as a high school student by Gardner’s tricks for

mental calculations, extends some of them here.

ix

i

i

“master” — 2012/11/15 — 11:23 — page x — #10i

i

i

i

i

i

x Contents

8 Carryless Arithmetic Mod 10 45

David Applegate, Marc LeBrun, and N. J. A. SloaneInspired by the carryless arithmetic of the game Nim, this trio of authors explores the number

theory of a South Pacific island.

9 Mad Tea Party Cyclic Partitions 53

Robert Bekes, Jean Pedersen, and Bin ShaAnother playful trio analyzes cyclic arrangements that build from integer partitions in a Lewis

Carroll setting.

10 The Continuing Saga of Snarks 65

sarah-marie belcastroA type of graph, given a fanciful name by Gardner from Lewis Carroll, was the subject of a

Branko Grunbaum conjecture for 39 years.

11 The Map-Coloring Game 73

Tomasz Bartnicki, Jaroslaw Grytczuk, H. A. Kierstead, and Xuding ZhuDaltonism and half-dollar coins are used in this exploration of a Steven Brams game theory

approach to the Four Color Theorem.

III Flexagons and Catalan Numbers 85

12 It’s Okay to Be Square If You’re a Flexagon 87

Ethan J. Berkove and Jeffrey P. DumontThis article details the 1939 origin of flexagons at Princeton University and focuses on the

neglected tetraflexagons.

13 The V-flex, Triangle Orientation, and Catalan Numbers in Hexaflexagons 103

Ionut E. Iacob, T. Bruce McLean, and Hua WangThis trio of Georgia Southern University authors examines a once-illegal variety of flex and

makes a connection between “pat classes” and Catalan numbers.

14 From Hexaflexagons to Edge Flexagons to Point Flexagons 109

Les PookAn engineer and author of two books on flexagons considers the more general edge flexagons

and recently discovered point flexagons.

15 Flexagons Lead to a Catalan Number Identity 113

David CallanExamining the descent permutation statistic on flexagon pats leads the author to full binary

trees and a combinatorial proof.

16 Convergence of a Catalan Series 119

Thomas Koshy and Z. GaoCalculus is brought to bear on the infinite sum of Catalan number reciprocals and related

series; � and the golden ratio make appearances.

IV Making Things Fit 125

17 L-Tromino Tiling of Mutilated Chessboards 127

Martin GardnerIn his last MAA mathematics article, Gardner moves from classic chessboard domino tiling

problems to new results.

i

i

“master” — 2012/11/15 — 11:23 — page xi — #11i

i

i

i

i

i

Contents xi

18 Polyomino Dissections 135

Tiina Hohn and Andy LiuThe authors introduce a new technique for solving dissection problems, often presented in the

context of quilts, leaving several puzzles for the reader.

19 Squaring the Plane 143

Frederick V. Henle and James M. HenleA father and son team resolve Golomb’s “heterogenous tiling conjecture” and discuss another

dozen open questions.

20 Magic Knight’s Tours 153

John BeasleyThe author surveys results combining a knight’s tour on the chessboard with magic squares,

including a computer-aided solution to a Gardner question.

21 Some New Results on Magic Hexagrams 159

Martin GardnerHere Gardner focuses on three types of puzzles about placing numbers on six-pointed stars,

mentioning a “rare mistake” of the British puzzlist Henry Dudeney.

22 Finding All Solutions to the Magic Hexagram 167

Alexander Karabegov and Jason HollandThe authors relate magic hexagrams to magic edge labelings of cubes, using card shuffling to

enumerate distinct solutions.

23 Triangular Numbers, Gaussian Integers, and KenKen 173

John J. WatkinsMiyamoto’s contemporary puzzle is expanded to complex numbers where a different unique

factorization adds to the challenge.

V Further Puzzles and Games 179

24 Cups and Downs 181

Ian StewartOne of Gardner’s mathematical successors at Scientific American uses graph theory and linear

algebra on two related puzzles.

25 30 Years of Bulgarian Solitaire 187

Brian HopkinsSome recent math history explains this oddly-named puzzle on integer partitions, visualized

with state diagrams and generalized to a new two-player game.

26 Congo Bongo 195

Hsin-Po WangA high school student uses state diagrams and Dennis Shasha’s detectives to open a tricky

treasure chest.

27 Sam Loyd’s Courier Problem with Diophantus, Pythagoras,

and Martin Gardner 201

Owen O’SheaA Classroom Capsule extends Gardner’s solution of related Sam Loyd puzzles to other army

formations.

i

i

“master” — 2012/11/15 — 11:23 — page xii — #12i

i

i

i

i

i

xii Contents

28 Retrolife and The Pawns Neighbors 207

Yossi ElranAn inverse version of Conway’s game Life, famously popularized by Gardner, is examined

using chessboards.

29 RATWYT 213

Aviezri FraenkelThe combinatorial game theorist uses the Calkin Wilf tree to devise a rational number version

of Wythoff’s Nim.

VI Cards and Probability 219

30 Modeling Mathematics with Playing Cards 221

Martin GardnerIn addition to probability applications, Gardner uses a deck of cards for a discrete version of a

fluid mixing puzzle and mentions a correction to W. W. Rouse Ball.

31 The Probability an Amazing Card Trick Is Dull 227

Christopher N. SwansonRook polynomials and the principle of inclusion-exclusion help determine the likelihood that

the author’s students were underwhelmed.

32 The Monty Hall Problem, Reconsidered 231

Stephen Lucas, Jason Rosenhouse, and Andrew ScheplerThese authors remind us of Gardner’s early role in this infamous problem that still “arouses

the passions” and examine variations.

33 The Secretary Problem from the Applicant’s Point of View 243

Darren GlassChanging perspective, the author reconsiders a classic strategy in order to help job seekers

choose the best interview slot.

34 Lake Wobegon Dice 249

Jorge Moraleda and David G. StorkLake Wobegon Dice, named after Garrison Keillor’s Minnesota town, have the property that

each is “better than the set average.”

35 Martin Gardner’s Mistake 257

Tanya KhovanovaAnother controversial problem about probability and information is carefully discussed,putting

Gardner in the company of Dudeney and Ball.

VII Other Aspects of Martin Gardner 263

36 Against the Odds 265

Martin GardnerIn this short story, a principal recognizes the potential in a student whose unconventional

thinking irritates his teacher.

37 A Modular Miracle 271

John StillwellGardner used an obscure result of Hermite and the limitations of 1970’s calculators for an

April Fool’s Day prank.

i

i

“master” — 2012/11/15 — 11:23 — page xiii — #13i

i

i

i

i

i

Contents xiii

38 The Golden Ratio—A Contrary Viewpoint 273

Clement FalboBuilding on a Gardner article in The Skeptical Inquirer, the author argues that � “is not entirely

astonishing.”

39 Review of The Mysterious Mr. Ammann by Marjorie Senechal 285

Philip StraffinThis Media Highlight discusses an example of Gardner’s support of an amateur mathematician

who independently discovered Penrose tiles.

40 Review of PopCo by Scarlett Thomas 287

Martin GardnerThis popular 2004 novel includes a character based on Gardner, so he was a natural choice to

review the book.

41 Superstrings and Thelma 289

Martin GardnerGardner’s last MAA submission, a short story about a physics graduate student and a waitress

who quips, “How are strings?”

Index 293

About the Editors 297

i

i

“master” — 2012/11/15 — 11:23 — page xiv — #14i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page 1 — #15i

i

i

i

i

i

IGeometry

i

i

“master” — 2012/11/15 — 11:23 — page 2 — #16i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page 3 — #17i

i

i

i

i

i

1The Asymmetric Propeller

Martin Gardner

The late Leon Bankoff (he died in 1997) was a Beverly Hills, California, dentist who also

was a world expert on plane geometry. (For G. L. Alexanderson’s interview with Bankoff,

see [1].) We became good friends. In 1979 he told me about a series of fascinating discov-

eries he had made about what he called the asymmetric propellor theorem. He intended to

discuss them in an article, but never got around to it. This is a summary of what he told me.

The original propeller theorem goes back to at least the early 1930’s and is of unknown

origin. It concerns three congruent equilateral triangles with corners meeting at a point as

shown shaded in Figure 1.1. The triangles, which resemble the blades of a propeller, need

not form a symmetrical pattern, but may be in any position. They may touch one another or

even overlap. Lines BC , DE , and FA are drawn to form a hexagon inscribed in a circle.

The midpoints of the three lines mark the vertices of an equilateral triangle.

A

B

C

DE

F

Figure 1.1.

A proof of the theorem, using complex numbers, appeared in [2] as the answer to

Problem B-1 in the annual William Lowell Putnam Competition. H. S. M. Coxeter sent the

proof to Bankoff on a Christmas card, asking him if he could provide a Euclidean proof of

the theorem.

Reprinted from The College Mathematics Journal, Vol. 30, No. 1 (Jan. 1999), pp. 18–22. This article receivedthe George Polya Award in 2000.

3

i

i

“master” — 2012/11/15 — 11:23 — page 4 — #18i

i

i

i

i

i

4 Part I. Geometry

Bankoff had no difficulty finding such a proof. In a paper titled “The Asymmetric

Propeller” [3], Bankoff, Paul Erdos, and Murray Klamkin made the first generalization of

the theorem. They showed that the three equilateral triangles need not be congruent. They

can be of any size, as shown in Figure 1.2, and the theorem still holds. Two proofs are

given, one a simple Euclidean proof, the other with complex numbers. As before, and in

all subsequent extensions, the triangles may touch one another or even overlap.

Figure 1.2. Figure 1.3.

Later, Bankoff made three further generalizations. As far as I know they have not been

published.

Second generalization: The propeller triangles need not meet in a point. They may meet

at the corners of any equilateral triangle, as shown in Figure 1.3.

Third generalization: The propeller triangles need not be equilateral! They need only

be similar triangles of any sizes that meet at a point. The midpoints of the three added lines

will then form a triangle similar to each of the propellers, as shown in Figure 1.4.

Figure 1.4.

Fourth generalization: The similar triangles need not meet at a point! If the propellers

meet at the corners of a fourth triangle of any size, provided it is similar to each propeller,

the midpoints of the added lines will form a triangle similar to each propeller. Vertices of

the interior triangle must touch corresponding corners of the propellers.

Here is how Bankoff proved his final generalization on a sheet that he typed in 1973. It

makes use of Figure 1.5.

i

i

“master” — 2012/11/15 — 11:23 — page 5 — #19i

i

i

i

i

i

1. The Asymmetric Propeller 5

A

B C

D

E

F

G

H

J

X

Y

Z

P

QR

S

Figure 1.5.

The propellers shown are right triangles, although they can be any type of triangle.

Perhaps the proof that follows can be simplified.

If ABC , AHJ , DBE , and F CG are similar triangles, all labelled in the same sense

and situated so that corresponding angles meet at the vertices of triangle ABC , then

X; Y; Z, the midpoints of DF , GH and JE , are vertices of a triangle similar to the other

four.

Proof. Let †BCA D †GCF D ˛; †DBE D †ABC D ˇ; †JAH D †CAB D ; and

let P; Q; R; S denote the midpoints of the segments DC , AC , AE and CH respectively.

We proceed stepwise to show that triangles PQR, PSZ and finally XYZ are similar to

triangle ABC .

If triangle ABD is pivoted about B so that AB falls along BC and DB along EB , it is

seen by the relation AB=BC D DB=BE and by the equality of angles ABD and CBE that

triangles ABD and CBE are similar and that EC=AD D BC=AB , with †EC; AD D†BC; BA D ˇ.

Since RQ is parallel to and equal to half EC while QP is parallel to and equal to

half AD, we extend the previous relation to read RQ=QP D EC=AD D BC=BA, with

†RQ; QP D †EC; AD D †BC; BA D ˇ. It follows that triangles PQR and ABC are

similar.

In like manner, because of the relationship of AJ and AH to RZ and QS as well as

to RP and QP in both relative length and in direction, we find triangles ZRP and SQP

similar. Then ZP=SP D ZR=QS D AJ=AH D AC=AB , with the angles between the

segments in the numerator and in the denominator all equal to . As a result, triangles

PSZ and ABC are similar.

i

i

“master” — 2012/11/15 — 11:23 — page 6 — #20i

i

i

i

i

i

6 Part I. Geometry

Continuing as before, we find triangles ZPX and ZSY similar since PX=SY DCF=CB D CA=CG and †PX; SY D †CF; CG D †CA; CB D ˛.

Noting that in the similar triangles ZPX and ZSY we have ZX=ZY D ZP=ZS DCA=CB and †ZX; ZY D †CA; CB D ˛, we conclude that triangles XYZ and ABC

are similar.

And now a question for interested readers to explore. Do the propellers have to be

triangles? It occurred to me that if squares are substituted for triangles, as in Figure 1.6,

that equilateral triangle still shows up.

Figure 1.6.

I have written this piece as a tribute to one of the most remarkable mathematicians I

have been privileged to know.

Bibliography

[1] G. L. Alexanderson, A conversation with Leon Bankoff, College Math. J. 23 (1992) 98–117.

[2] J. H. McKay, The William Lowell Putnam Mathematical Competition, Amer. Math. Monthly 75

(1968) 732–739.

[3] Leon Bankoff, Paul Erdos, and Murray Klamkin, The asymmetric propeller, Math. Mag. 46

(1973) 270–272.

i

i

“master” — 2012/11/15 — 11:23 — page 7 — #21i

i

i

i

i

i

2The Asymmetric Propeller Revisited

Gillian Saenz, Christopher Jackson, and Ryan Crumley

In [1] Martin Gardner proved an asymmetric propeller theorem that was originally pro-

posed by Leon Bankoff. He showed that by connecting one vertex from each of three

equilateral triangles inscribed in a circle to the center of the circle, one can form a fourth

equilateral triangle. This fourth triangle is formed by connecting the remaining vertices

with segments as seen in Figure 2.1. Then the midpoints of these segments are connected

to form a triangle. This works regardless of the arrangement of the original triangles. We

confirmed Gardner’s findings and proceeded to consider his final question: when working

with squares, does the same phenomenon occur? We show that the answer is no. Here we

give counterexamples.

H

J

I

Figure 2.1.

Using Geometer’s Sketchpad, we attempted to show that the propeller theorem also

worked for squares and possibly other polygons. In [1, Fig. 6] a midpoint of a side of each

of three squares occurred at the vertices of an equilateral triangle. We had the squares meet

at a single point by shrinking the equilateral triangle to a point. If the conjecture is false for

this case, then it will not hold true if the point is expanded to a triangle. By experimenting

with Geometer’s Sketchpad we found that even when using squares inscribed in a circle

Reprinted from The College Mathematics Journal, Vol. 31, No. 5 (Nov. 2000), pp. 347–349.

7

i

i

“master” — 2012/11/15 — 11:23 — page 8 — #22i

i

i

i

i

i

8 Part I. Geometry

I

J

K

Figure 2.2.

the propeller theorem was not true, see Figure 2.2. The triangle shown in this figure was

not equilateral not did its angles remain constant when the squares were moved.

Seeing that the square conjecture would not work, we set up the following proof. We

start with three congruent squares inscribed in a circle. The squares meet at the midpoints

of one of their edges. The vertices of the squares are then connected as in Figure 2.3. The

midpoints of these new segments are then connected as before to form a triangle. Using

the distance formula, it is evident that the edges of the triangle are not congruent, because

3 ¤p

34=2.

(–1, 2) (1, 2)

(2, 1)

(2, –1)

(–1, –2) (1, –2)

Figure 2.3.

We also investigated whether the midpoint connection was essential when working with

equilateral triangles. We used the simplest case possible, three congruent equilateral trian-

gles inscribed in a circle meeting at a point. Rather than constructing the midpoints of the

connecting segments, we constructed a point one quarter along each segment. These points

were connected to form a triangle. Figure 2.4 shows that the triangle formed is not an equi-

lateral triangle. The theorem seems to be true only when the triangle formed is constructed

using the midpoints of the connected vertices. Attempts using other polygons were unsuc-

i

i

“master” — 2012/11/15 — 11:23 — page 9 — #23i

i

i

i

i

i

2. The Asymmetric Propeller Revisited 9

K

L

M

Figure 2.4.

cessful. Based on our results, a propeller theorem for squares and other polygons seems

unlikely.

Bibliography

[1] Martin Gardner, The asymmetric propeller, College Math. J., 30 (Jan. 1999) 18–22; Chapter 1 in

this volume.

i

i

“master” — 2012/11/15 — 11:23 — page 10 — #24i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page 11 — #25i

i

i

i

i

i

3Bracing Regular Polygons

As We Race into the Future

Greg N. Frederickson

If today’s math-and-science-oriented kids could see the resources available to like-minded

youths of the 1960s and 70s, they might pity their counterparts back then: no laptop com-

puters, no high-speed internet, no online bookstores. Yet such pity would be wasted on that

earlier generation, who could well respond, “Ah, but we had Martin Gardner.”

For a quarter century, Gardner wrote Mathematical Games, a wonderful column in

Scientific American [3]. Each month he treated his readers to a tour of some mathematical

recreation that he made tantalizing for people who, like himself, had no advanced training

in mathematics. He was at the center of an amazing human network, referred to as “Martin

Gardner’s mathematical grapevine” by Doris Schattschneider [7].

Long before our current electronic networks, Gardner gathered juicy mathematical tid-

bits from his far-flung contacts, turned them into mouth-watering morsels, and encouraged

readers to send back their own variations. How many thousands leaped into mathemati-

cal, scientific, and engineering fields because of the enthusiasm nurtured by this singular

man? Let’s reach back in time to sample a problem that Gardner introduced and explore the

magic that he created, magic which would expand as we raced towards our technological

future.

Bracing a square

In his November 1963 column, Gardner dished up a tasty problem devised by Raphael

Robinson, a mathematician at the University of California, Berkeley, who is best known

for determining the minimum number of pieces for the Banach-Tarski paradox. Gardner

wrote:

Imagine that you have before you an unlimited supply of rods all of the same

length. They can be connected only at the ends. A triangle formed by joining

three rods will be rigid but a four-rod square will not: it is easily distorted into

other shapes without bending or breaking a rod or detaching the ends. The

simplest way to brace the square so that it cannot be deformed is to attach

eight more rods to form the rigid octahedron.

Reprinted from The College Mathematics Journal, Vol. 43, No. 1 (Jan. 2012), pp. 51–57.

11

i

i

“master” — 2012/11/15 — 11:23 — page 12 — #26i

i

i

i

i

i

12 Part I. Geometry

Suppose, however, you are confined to the plane. Is there a way to add

rods to the square, joining them only at the ends, so that the square is made

absolutely rigid? All rods must, of course, lie perfectly flat in the plane. They

may not go over or under one another or be bent or broken in any way. The

answer is: Yes, the square can be made rigid. But what is the smallest number

of rods required?

Readers are asked to try to solve the problem and, if they succeed, to save

their solution or send it to this department. Next month we shall publish the

minimum answer Robinson has so far been able to achieve.

The next month, Gardner displayed two solutions by Robinson, each with 31 rods in

addition to the four rods of the square. Robinson used equilateral triangles to form lengths

of 2, 3, andp

13, producing a right triangle whose right angle braces the square. Clever?

Yes! And yet, instead of administering the coup de grace to this intriguing little problem,

the high-powered Berkeley mathematician had merely set in motion a long sequence of

improvements as the problem began to bounce around in Gardner’s network.

Superior solutions from readers

Two months later, in the February 1964 column, Gardner reported that 44 readers had

surprised him with a 25-rod solution, bettering each of Robinson’s solutions by 6 rods!

Moreover, Gardner was stunned that seven readers had designed an even niftier 23-rod

solution!

AB C

D

E

AB C

D

E

Figure 3.1. (a) 23-rod solution for bracing a square; (b) Underlying geometry.

The 23-rod solution appears in Figure 3.1a. Two pairs of equilateral triangles guarantee

that the distances BE and CE are eachp

3. Two trusses, each consisting of three equilateral

triangles, guarantee that the rods form straight lines between B and D, and between C and

D. These straight lines force points A, B , and C to be collinear. That the distances BE and

CE are equal then forces reflection symmetry along the axis containing A and D. Thus

†BAE and †CAE are both right angles, forcing distances AB and AC to both bep

2. We

can then conclude that the distance AD is alsop

2, forcing the rhombus containing A and

D to actually be a square. Figure 3.1b depicts this underlying structure.

Typically, Gardner collected his columns from Scientific American into books that he

published every several years. When doing this, he added new material that had come to

light once the columns had appeared. Including this column in his Sixth Book of Mathe-

matical Games [4], Gardner noted on page 55 about the square bracing problem:

i

i

“master” — 2012/11/15 — 11:23 — page 13 — #27i

i

i

i

i

i

3. Bracing Regular Polygons As We Race into the Future 13

The problem can be extended to other regular polygons. The hexagon is simply

solved with internal braces (how many?), but the pentagon is a tough one. T. H.

O’Beirne managed to rigidify a regular pentagon with 64 additional rods, but

it is not known if this number is minimal.

O’Beirne’s solutions for various polygons

In 1999 I was selecting material to include in Hinged Dissections [1]. The 23-rod solution

for bracing a square, along with O’Beirne’s 64-rod solution for bracing a regular pentagon,

seemed perfect for a short segment in the book. There was just one hitch—I couldn’t find

O’Beirne’s braced pentagon anywhere. In desperation I contacted Gardner.

As it turned out, neither he nor O’Beirne had ever published the pentagon bracing.

Gardner generously sent me all of O’Beirne’s correspondence on this problem. One aero-

gramme from O’Beirne showed three progressively better solutions for the pentagon, each

added as he excitedly discovered a further improvement. During one remarkable week in

December 1963, O’Beirne had found solutions not only for the pentagon but also for three

other regular polygons as well! Almost certainly prompted by Robinson’s square bracings

in Gardner’s December column, O’Beirne had found ways to brace the pentagon (64 rods),

the octagon (105 rods), the decagon (183 rods), and the dodecagon (45 rods). I included

the pentagon in [1].

Armed with an M.A. in math and physics from the University of Glasgow, Thomas H.

O’Beirne (1915–1982) had worked in turn as a government scientist, a chief mathematician

for an engineering firm, and a lecturer in the university’s computing department. In 1961–

62, he wrote a weekly series for the New Scientist, which was collected into a book, Puzzles

and Paradoxes.

Lying dormant for more than thirty years, O’Beirne’s nifty constructions might well

have been forgotten. But that would have belied the residual strength of Gardner’s network

and the growing strength of modern networks.

An improved octagon bracing

After publishing my book with its discussion of the braced square and braced pentagon,

I continued to think about bracing problems. In 2003, I discovered an improvement to

O’Beirne’s solution for the octagon, using only 51 additional rods, as shown in Figure 3.2a.

The solution incorporates several ideas. First, the rods in the interior of the octagon

form rhombuses, forcing opposite pairs of sides of the octagon to be parallel. For every

pair of opposite sides, not only are those two rods parallel, but also a pair of bracing rods

in the interior.

Second, some rods outside of the octagon force other rods to be parallel to rods on the

boundary of the octagon. Thus a truss consisting of five equilateral triangles forces three

rods to be on the line LM. Also, a truss of three equilateral triangles forces two rods to be

on the line CN.

Third, other rods outside the octagon enforce certain distances between points on the

inside of the octagon. The group of four equilateral triangles between M and C force a

i

i

“master” — 2012/11/15 — 11:23 — page 14 — #28i

i

i

i

i

i

14 Part I. Geometry

M

E

CI

N

B

H

J

L

K

F

G D

A CH

DG

L

BA

I

EF

JK

Figure 3.2. (a) 51-rod solution for an octagon; (b) Underlying geometry.

distance ofp

3 between these vertices. Since rods ME and CI are parallel, distance EI

must also bep

3.

Similarly, a group of four equilateral triangles between H and N force a distance ofp3. Rods HJ and BN transmit this restriction to points J and B . A final group of four

equilateral triangles forces a distance ofp

3 between H and L. Two sets of parallel rods

force distance BK to also bep

3. Yet it is not so clear that the octagon is then rigid.

We can verify this claim by modeling the braced structure (Figure 3.2b) with equations

that we solve using Mathematica. First, fix the location of an edge, say AB, by xA D 0,

yA D 0, and xB D 1, and yB D 0.

Next, for each of the three rods along the boundary from B to E , such as CD, force its

length to be 1 by .xC � xD/2 C .yC � yD/2 D 1. Then, for each of the six rhombuses

inside the octagon, such as ABCI, force an opposite pair of rods to be parallel by xB �xC DxA � xI and yB � yC D yA � yI . Do the same for the (degenerate) “flat rhombus” LFEF.

And to enforce a distance ofp

3 between each of the three pairs of two vertices, such as

H and L, introduce .xH � xL/2 C .yH � yL/2 D 3.

This gives us 24 equations determining the 24 total coordinates of the twelve points.

However there may be more than one solution to this system of equations, because we

have not specified, for example, whether rod IC is above or below rod AB. So introduce

more constraints to force a specific topology as in Figure 3.2b. These constraints would be

yC > yB , yD > yB , xC > xB , and xD > xB . The resulting set of restrictions leaves us

with a unique solution that then guarantees rigidity.

Theobald’s solution for bracing a pentagon

In 2004, after studying O’Beirne’s braced pentagon (in [1]), Gavin Theobald improved

it, reducing the number of additional rods from 64 to 56, as in Figure 3.3a. Theobald

adapted key ideas of O’Beirne and also introduced new ones. Let’s verify carefully that the

pentagon is braced.

O’Beirne had first set out to produce an angle of 108ı, the interior angle of the regular

pentagon. He had known that a diagonal in a regular pentagon divides the pentagon into

i

i

“master” — 2012/11/15 — 11:23 — page 15 — #29i

i

i

i

i

i


a trapezoid and an isosceles triangle with this angle. Furthermore, he had recognized that

if the two equal sides of the triangle are of lengthp

3, then the length of the third side isp3.1 C

p5/=2. Trusses consisting of three equilateral triangles will create 4ABC with

sides 1, 2, and 2, as in Figure 3.3a. By the Pythagorean Theorem, such a triangle has height

.p

15/=2. Adding the height of an equilateral triangle (top of Figure 3.3b) gives the desired

distance ofp

3.1 Cp

5/=2.

A

B

C

D

E

G

I

H

J

K

L

M

N

F

A

B

C

B

C

D

I

H

J

22

1

2

2

108 ◦

Figure 3.3. (a) Theobald’s braced pentagon; (b) Underlying geometry.

While O’Beirne had then gone on to generate the angle of 108ı and transfer it elsewhere

in the structure, Theobald chose instead to transfer the distance itself. Theobald noted

that he already had a suitable angle, namely †BCD, so that the desired length would be

between points B and D (See the middle of Figure 3.3b).

Theobald then transferred the desired length by constructing parallelogram BDHI as

follows. He made a truss of four equilateral triangles to force distance BI to bep

7, and

then made a “zigzag truss” with five equilateral triangles to force distance DH to bep

7.

He forced BE to be parallel to DG by creating a sequence of three parallelograms between

them. Then BI must be parallel to DH . Since BDHI is a parallelogram, its opposite sides

must be the same length, and thus distance HI is alsop

3.1 Cp

5/=2.

Theobald positioned the vertices of an isosceles triangle HIJ with HI as the base.

Then two equilateral triangles force distance IJ to bep

3 and four equilateral triangles

force distance HJ to bep

3. This gives the isosceles triangle with †IJH D 108ı. The

two equilateral triangles touching at J supply two of the sides of the regular pentagon.

Once O’Beirne had positioned one angle of 108ı in the regular pentagon, he had po-

sitioned the opposite side by using a sequence of an equilateral triangle, two rhombuses,

and another equilateral triangle. Theobald used a similar idea to position side LM of his

pentagon with just one rhombus.

Theobald’s construction forces the side LM of the pentagon to be at the correct angle

to make the pentagon be regular: First, rods EL and FM form a rhombus, so that rod LM

i

i

“master” — 2012/11/15 — 11:23 — page 16 — #30i

i

i

i

i

i

16 Part I. Geometry

is parallel to rod BC . Rod BC is at an angle of 30ı with the line BD. Then rod LM is at an

angle of 30ı with line HI . Pentagon edges JK and JN are rotated 30ı counterclockwise

from the lines IJ and HJ , which means that diagonal KN is parallel to LM . Thus LM is

at the correct angle. Even knowing O’Beirne’s approach, Theobald still needed creativity

and persistence to reduce the number of rods by eight.

Gavin Theobald was born in London, England, in 1961. Graduating with a B.Sc. in

Computer Studies at Loughborough University in 1983, he worked as a computer program-

mer on low-level high-performance graphics rendering engines. Theobald is an expert in

geometric dissections [8], an area into which he was drawn after reading one of Gardner’s

books.

Khodulev’s solutions when rods cross

Even more progress was made by Andrei Khodulev, who had learned of the problem from

the Russian translation of Gardner’s book [5]. Noting that one can use fewer rods if they

are allowed to cross, Khodulev discarded the restriction on crossing. Then, for example, a

regular octagon can be rigidified with only 23 additional rods: eight rods to create rhom-

buses within the octagon, and three sets of five rods each to produce three distances ofp

3.

Published in the bulletin of the Russian puzzle club Diogen (Diogenes) [6], Khodulev’s

results are reproduced on Erich Friedman’s webpage [2].

Andrei Khodulev (1953–1999)earned a Ph.D. in computer science in 1984 from Keldysh

Institute of Applied Mathematics, in Moscow, where he was a senior research scientist. His

research areas were in graphics, imaging, approximation theory, and stochastic simulation.

Gardner’s enduring challenge

Thanks to Martin Gardner’s gift for connecting people together, a problem that originated

with Raphael Robinson (1911–1995) spread to Gardner’s wide readership, resulting in im-

provements from dozens of his correspondents. Thomas H. O’Beirne then generalized the

problem and produced solutions for four other polygons, one of which Gardner mentioned

in his subsequent book. That led the current author to include O’Beirne’s solution in a

book, which in turn attracted the attention of Gavin Theobald, who found an improvement

for the pentagon, as I found an improvement for the octagon. Separately, Andrei Khodulev

made progress by allowing rods to cross. So don’t be surprised when further delectable

developments are announced over our networks—just brace yourself!

Acknowledgment I thank Gavin Theobald for permission to include his pentagon brac-

ing, Serhiy Grabarchuk for copies of Sharada, Svitlana Mayboroda for translating Russian,

and the referees for good suggestions.

Bibliography

[1] G. N. Frederickson, Hinged Dissections: Swinging and Twisting, Cambridge Univ. Press, New

York, 2002.

i

i

“master” — 2012/11/15 — 11:23 — page 17 — #31i

i

i

i

i

i


[2] E. Friedman, Problem of the month (January 2000); available at

www.stetson.edu/˜efriedma/mathmagic/0100.html, 2000.

[3] M. Gardner, Mathematical games, Scientific American, Nov., 1963, p. 144; Dec., 1963,

pp. 144, 146; Feb., 1964, p. 128.

[4] ——, Martin Gardner’s Sixth Book of Mathematical Games from Scientific American, Charles

Scribner’s Sons, New York, 1971.

[5] ——, Mathematical Leisure (Matematiqeskie Dosugi), Mir (Mir), Moscow, 1972.

[6] A. Khodulev, A task about a rigid polygon (zadaqa o �estkom mnogougolbnike),

Sharada (Xarada) 7, no. 61 (1998) 2–8.

[7] D. Schattschneider, In praise of amateurs, in The Mathematical Gardner, D. A. Klarner, ed.,

Prindle, Weber & Schmidt, Boston, 1981, 140–166.

[8] G. Theobald, Geometric dissections; available at

home.btconnect.com/GavinTheobald/Index.html.

www.stetson.edu/~efriedma/mathmagic/0100.html

http://www.stetson.edu/~efriedma/mathmagic/0100.html

home.btconnect.com/GavinTheobald/Index.html

http://home.btconnect.com/GavinTheobald/Index.html

i

i

“master” — 2012/11/15 — 11:23 — page 18 — #32i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page 19 — #33i

i

i

i

i

i

4A Platonic Sextet for Strings

Karl Schaffer

Our dance company, the Dr. Schaffer and Mr. Stern Dance Ensemble, began performing

mathematical dance shows in 1990. Scott Kim joined us in 1993, and in 1995 Scott, another

Bay area dancer Barbara Susco, and I developed a school-age show entitled “Through the

Loop, In Search of the Perfect Square,” which included the performance of many string

figures.

Martin Gardner wrote several columns on polyhedra and one on traditional string fig-

ures, in addition to his many references to magic tricks and puzzles with string [5, Ch. 1],

[6, Ch. 17], [7, Ch. 19], [8, Ch. 10], [6, pp. 74–75]. Traditional string figures, made on

the hands with a simple loop of string, are found worldwide, particularly associated with

ancient cultures [7]. In 1994 our dance company began experimenting with oversize loops

carried on stage by dancers, and manipulated to form a variety of string figures. String

figure constructions and the associated mathematics make a pleasurable classroom activ-

ity that introduces graph theory and polyhedra to geometry, liberal arts, and discrete math

students, or as a hands-on activity for workshops and math festivals.

In the process of creating one of our shows, “Through the Loop,” which includes over

20 string figures, we began altering well-known figures, using large loops held by several

dancers, or using a sequence of finger manipulations different from the customary methods.

Traditional string figures often display shapes or actions reminiscent of the natural world,

as indicated by such titles as: Bear Climbing a Tree, the Yam Thief, and Mosquito. Yet

these figures also involve complex mathematical patterns in both their construction and

final designs. We began experimenting with polyhedral designs which we thought might

be more recognizable by contemporary audiences as being mathematical.

Scott Kim and the author came up with several string constructions of the tetrahedron,

octahedron, and cube [8, 9], though Scott’s are particularly beautiful for their symmetry

and simplicity. All these constructions keep the string loops taut, and are simplified by

involving symmetrical transfers by participants. Several of our mathematical designs, in-

cluding that of the octahedron and its collapsed form as a six pointed star, were used in

one of the dances in the Loop show; other string figures in that show were traditional ones

as well as a series of star polygons [7]. Subsequently the author found several ways to

build all five Platonic solids sequentially, including a method employing identical loops

of three colors which creates interesting symmetric color patterns. This six loop version

Reprinted from The College Mathematics Journal, Vol. 43, No. 1 (Jan. 2012), pp. 64-69.

19

i

i

“master” — 2012/11/15 — 11:23 — page 20 — #34i

i

i

i

i

i

20 Part I. Geometry

was demonstrated for the first time at a Math and the Arts conference at UC Berkeley in

1998 and at the Gathering for Gardner 3 in Atlanta in 1998. It has since has been handed

around at conferences and workshops, in the manner of folk mathematics familiar to fans

of Martin Gardner. It is published here for the first time.

These multiple loop constructions are inspired by contemporary modular origami which

uses identical copies of modular units to construct complex polyhedral forms. Though

some observers insist on calling this string sequence a dance, these more complex string

polyhedra are too stationary in space and require too many careful manipulations by the

hands of participants crowding around the figures to be usable as a part of actual staged

dances.

Euler circuits

An Euler circuit is a sequence of adjacent edges in a graph that includes each edge once.

The name refers to Leonhard Euler’s 1736 solution of the Seven Bridges of Konigsberg

Problem, though such figures also appear in sand drawings found in West Africa and else-

where [1, pp. 30–43]. Traditional string figures, which invariably use simple unknotted

loops of string, accomplish the same thing.

Think of the points of support, where hands or fingers keep the string in tension, as the

vertices, and the strands of string as the edges of a polyhedral graph. For our polyhedra we

use the thumb and first finger of one hand for each vertex, so that a polyhedron with 20

vertices, such as the dodecahedron, requires ten people. As Euler found, a connected graph

can be decomposed into a circuit (or collection of circuits) only if all vertex degrees are

even. Therefore some edges of our polyhedra need to be doubled and made with at least

two strands to even out the vertex degrees. For example, the four vertices of the tetrahedron

are each of degree three, and so we need to double at least two opposite edges in order to

Eulerize its graph and so allow it to be made with a circuits (i.e., loops of string). Four of

the Platonic solids have odd degree vertices, and require edge doublings; the octahedron

has degree four vertices, and so can be made from loops without edge-doubling.

By utilizing the symmetries of the polyhedra, we allow all the loops to be of the same

size, and this simplifies the constructions and facilitates the directions, as multiple partic-

ipants do the same string manipulations. After choosing edges to double, we essentially

decompose the graph of each polyhedron into cycles, but the struggle to insure that the

strings do not end in tangles nor the participants interfere with each other makes the search

for these decompositions more challenging. The most pleasing figures seem to allow the

least slack in the strings and make use of simple and surprising transformations.

Transformations

The overall manner in which we handle the string can be modeled by several standard

graph theory operations. An edge subdivision replaces an edge uv having vertices u and

v with a new vertex w and two new edges uw and vw. This is accomplished with string

when one grasps and pulls on the middle of a string edge. In order to manipulate the string

smoothly, each hand involved holds it by making a loop with the thumb and index finger,

i

i

“master” — 2012/11/15 — 11:23 — page 21 — #35i

i

i

i

i

i

4. A Platonic Sextet for Strings 21

so that the string can slide easily “through” the vertex. This is why we require a whole

hand for a single vertex.

We also use the reverse operation, smoothing a vertex, releasing a vertex w while si-

multaneously pulling outwards at all other vertices to take up the slack, and replacing two

edges uw and wv by single edge uv. Pulling two or more vertices together and holding

them in one hand is vertex amalgamation. We retain the separate identification of the amal-

gamated vertices, since we sometimes need to reverse the operation and release some of

the vertices from the amalgamation. Vertex separation is the release of a string from an

amalgamated vertex. We may also perform an edge contraction by sliding two vertices

joined by an edge together to amalgamate their endpoint vertices, while removing the edge

between them by pulling outward on the other vertices, as with vertex smoothing. In all

cases the participants need to adjust their hands slightly to make the polyhedra more regu-

lar; letting the strings slide through the loop made by the thumb and forefinger facilitates

this, while grabbing the strings makes it more difficult.

Six-loop platonic solids

Figure 4.1 shows the six-string transformations of the Platonic solids. All six strings should

be same length, and it is helpful to use two loops of each of three colors. Each vertex is

labeled Ri or Li , indicating that it is held by person i with the Right or Left hand. Figure

4.1a shows the opening pattern, seen from above, made with the two loops of the same color

held by opposite participants. It is important to keep the doubled loops from crisscrossing,

as otherwise the later figures will contain extra edges. One of the pairs of loops must sit

inside the other two pairs (that held by R1 and R3), and one pair must be external to both

the others (that held by R5 and R6), so this construction is not as symmetrical as other of

the polyhedral string figures; a more symmetric alternative is to link the strings like the

Borromean rings, but this leads to tangles!

Figure 4.1a shows that first R5 and R6 subdivide and hand their loops off, forming

the square R1R2R3R4 in Figure 4.1b. R5 and R6 then grasp new vertices as shown in

Figure 4.1b, forming squares R4R5R2R6, and R1R5R3R6, and thus the octahedron shown

in Figure 4.1c. The six participants still use only one hand apiece, though actually both

hands are needed to perform the transition from Figure 4.1a to Figure 4.1b. The octahedron

is usually held with one vertex high and the opposite near the floor to make the instructions

easier to follow, though one person must reach high and another low in order to grasp

the vertices. The octahedron thus lines up an axis of four-fold rotational symmetry with

the vertical; we might also sit the octahedron on one of its triangles to accentuate the

3-fold symmetries.

In order to transform the octahedron to the icosahedron, we must double the number of

vertices, and transform the 12 edges of the octahedron to the 30 edges of the icosahedron.

This is accomplished by adding one vertex and 3 edges for each of the octahedron’s 8

vertices as follows (see the arrows in Figure 4.1c for the R1, R2, and R6 vertices.) Note that

all edges in the octahedron are doubled, but only six non-adjacent edges in the icosahedron

are doubled. The two vertices from the horizontal R1R2R3R4 loop held at the R1 vertex

are separated, one sliding upwards, the other downwards along the R1R5R3R6 loop (in the

i

i

“master” — 2012/11/15 — 11:23 — page 22 — #36i

i

i

i

i

i

22 Part I. Geometry

(e)

L1L7 L3L4

R2

R4

R3R1

L5

L6

R5

R6

L2

L8L9

L10

R7

R8R9

R10

L10

R7R9

(g)

(d)

L1

L2

L3L4

R2

R4

R3R1

L5

L6

R5

R6

(a)

R1 R3

R2

R4 R5

R6 (b)

R1

R3

R2

R4

R5

R6

L8(f)

L7

L8L9

L10

R7

R8R9

R10

R1

R3

R2

R4

R5

R6

(c)

R10

Figure 4.1. Six loop Platonic solids.

plane of the drawing), to form the new R1 and L1 vertices. Each participant will execute a

similar maneuver. The exact same operation as at R1 in Figure 4.1c is performed at the R3

vertex. At R2 and R4 the vertical R4R5R2R6 loop’s amalgamated vertices are separated

horizontally to form new R2; L2; R4, and L4 vertices. At R5 and R6 the R1R6R3R5

loop’s vertices are separated horizontally to form new R5; L5; R6, and L6 vertices. All

this produces the icosahedron in Figure 4.1d. When performing these operations make

sure that the pairs of same colored loops do not cross. Note that the remaining 6 doubled

edges are parallel in pairs, forming a perfect matching of the vertices of the icosahedron,

and hence a minimal Eulerization.

At this point notice that there are 8 triangles, 4 high and 4 low, which have edges of

3 colors, none doubled. To transform the icosahedron, with 12 vertices and 30 edges, to

the dodecahedron, with 20 vertices and 30 edges, we need to add 8 vertices, and some-

how change 20 triangles to 12 pentagons. To accomplish this, each edge of those eight,

3-colored triangles will be subdivided, and the three new vertices on each such face amal-

i

i

“master” — 2012/11/15 — 11:23 — page 23 — #37i

i

i

i

i

i

4. A Platonic Sextet for Strings 23

gamated as one vertex. We call this operation on all the edges of a single face squeezing

the face. This is illustrated in Figure 4.1d by arrows (just for the R2R3; L5 and R2L3; R6

triangles), resulting in the dodecahedron with all edges doubled (some doubled edges not

shown), in Figure 4.1e. Removing these 8 triangles results in a polyhedron with 20�8 D 12

pentagons. This operation requires 4 new participants, who each will most conveniently

squeeze two triangles above and below each other, with say, right hands high, left hands

low. Two of the same colored loops are shown as part of sets of doubled edges in Figure

4.1e. These sets of edges form what are known as the Petrie polygons of the dodecahedron,

skew polygons no three of whose successive edges are part of the same face.

The step from Figure 4.1e to 4.1f is simple: participants 1 through 6 release their ver-

tices, while the new participants (7 through 10) hang on to theirs. Their 8 vertices (white

circles in Figure 4.1e) are the vertices of a cube in Figure 4.1f, which magically appears

with all its edges doubled. Strings of like colors appear on opposite faces.

The transition to the tetrahedron (Figure 4.1g) is equally simple: two of the remaining

participants opposite each other on the upper face of the cube, say 7 and 9, hold on to

their high vertices with their right hands and release their left hand vertices, while 8 and

10 hold on to their left hand vertices and release their right. In the tetrahedron formed

each pair of string loops of the same color are non-incident and skew perpendicular. To

complete the cycle back to the octahedron, each of these four participants uses their free

hand to squeeze the edges of the three colors of one triangle back to a new vertex, re-

forming the cube; this operation is indicated by the arrows on one of the faces in Figure

4.1g. Participants 1 through 6 now return to form the rhombic dodecahedron by squeezing

one face per participant to a new vertex, taking care to use edges of two colors only per

vertex, and such that the same two colors are used for the vertices on opposite faces of the

cube. Also, one of the pairs of same color loops must be external to the others, and one

pair internal, or the octahedron will become tangled. Participants 1 through 6 now hang on

to their vertices, and 7 through 10 release theirs, forming the octahedron again.

Conclusion

Many, many other polyhedral string figures similar to these are possible. For example,

the author has found fairly direct transformations of the six loop polyhedra shown to the

Archimedean polyhedra, as well as a series of constructions of the Platonic solids using

either one or four loops. See also [8, pp. 85–97], [9], [3, pp. 280–287], and [7]. These con-

structions make excellent classroom investigations of Euler circuits and polyhedra. Start

simply. For example, a five-pointed star is easy to draw but making one with a loop of

string requires 3 people cooperating carefully. Have students find their own constructions

of a tetrahedron and octahedron; there are many. If the students find a nice construction

have them practice several times so the actions are quick and fluid. Directions and lore on

traditional string figures can be found at [7]. Many variations on the figures mentioned in

this article are possible.

Note on making loops for use in these constructions

We use loops which can be stretched to “arm span,” length or about 5 to 6 feet. We either

use cotton string, taping the ends together with cloth or duct tape, or nylon string with the

i

i

“master” — 2012/11/15 — 11:23 — page 24 — #38i

i

i

i

i

i

24 Part I. Geometry

ends melted together over a candle flame. Use string that is at least 1=8th to 1=4 inch in

diameter. For the Platonic solids, we use 1=2 inch diameter flexible cotton rope, purchased

in 3 colors. One can also dye cotton rope a variety of colors with fabric dye. Make sure the

rope or string is manufactured with a soft weave, rather than using brittle or inflexible string

or rope. The string polyhedra constructed in this paper use multiple loops of the same size.

To store the loops without getting them all tangled, double each one over several times and

tie it in a knot.

Videos

George Csicsery has produced several videos of sequences of string polyhedra, including a

three-polyhedra sequence similar to, but shorter than, the one described here, and involving

only four participants. This video and other videos relevant to string polyhedra and to Mar-

tin Gardner are available on YouTube at http://www.youtube.com/user/g4gman.

Bibliography

[1] M. Ascher, Ethnomathematics, a Multicultural View of Mathematical Ideas, Brooks/Cole Pub-

lishing Company, Pacific Grove CA, 1991.

[2] M. Gardner, The Second Book of Mathematical Diversions, Simon and Schuster, New York,

1961.

[3] ——, The Unexpected Hanging and Other Mathematical Diversions, Simon and Schuster, New

York, 1968.

[4] ——, Martin Gardner’s Sixth Book of Mathematical Diversions from “Scientific American,”

Simon and Schuster, New York, 1971.

[5] ——, Wheels, Life, and other Mathematical Amusement, W.H. Freeman and Company, New

York, 1983.

[6] ——, The Last Recreations Hydras, Eggs, and Other Mathematical Mystifications, Springer-

Verlag, New York, 1997.

[7] K. Schaffer, Star String Polygons, Gathering for Gardner 9, 2010 (to appear).

[8] K. Schaffer, E. Stern, and S. Kim, Math Dance with Dr. Schaffer and Mr. Stern, MoveSpeak-

Spin; available at www.movespeakspin.org, 2001.

[9] ——, math dance activities; available at www.mathdance.org.

[10] M. Sherman, ed., International String Figure Association, string figure references and instruc-

tions; available at www.isfa.org.

[11] I. Stewart, Cows in the Maze and Other Mathematical Explorations, Oxford Univ. Press,

Oxford, 2010.

http://www.youtube.com/user/g4gman

http://www.youtube.com/user/g4gman

www.movespeakspin.org

http://www.movespeakspin.org

www.mathdance.org

http://www.mathdance.org

www.isfa.org

http://www.isfa.org

i

i

“master” — 2012/11/15 — 11:23 — page 25 — #39i

i

i

i

i

i

5Prince Rupert’s Rectangles

Richard P. Jerrard and John E. Wetzel

Introduction

More than three hundred years ago, according to the contemporaneous John Wallis [11,

pp. 470–471], Prince Rupert� (1619–1682) won a wager that a hole can be cut in one of

two equal cubes large enough to permit the second cube to pass through.

Nearly a century later the Dutch scientist Pieter Nieuwland (1764–1794) showed that

the largest cube that can be so passed through a cube of side one has side 3p

2=4 � 1:061.

An accessible discussion with enlightening anaglyphs appears in Ehrenfeucht [8]. In 1950,

D. J. E. Schreck [10] gave an interesting, historically based survey of Prince Rupert’s

problem and Nieuwland’s extension. Schreck includes a photograph of a model showing

the cube in transit.

Nieuwland’s “passage” problem of finding the largest cube that can pass through a unit

cube is equivalent to finding the largest square that fits in the unit cube, because once the

largest square is located, the hole through the cube having that largest square as its cross

section clearly provides the desired passage. In higher dimensions one might seek the side

of the largest m-dimensional cube that fits in an n-dimensional cube of side one. Not much

is known about this question, which apparently was first raised, at least in the case of a cube

in a hypercube, by Martin Gardner (see Gardner [2, pp. 172–173] or Guy and Nowakowski

[7, pp. 967–68]).

The literature on Prince Rupert’s and Nieuwland’s problems is extensive and we cannot

claim to have examined it all, but nowhere have we found any mention of the analogous

questions for rectangles and “boxes” (i.e., rectangular parallelepipeds):

Question 1. Find the largest rectangle R with given aspect ratio � that fits in the unit cube,

i.e., for given � with 0 � � � 1, find the largest L so that an L � �L rectangle fits in the

unit cube.�

This question is an instance of a general unsolved fitting problem for boxes in higher

dimensions: in d -dimensional Euclidean space Ed , find necessary and sufficient conditions

Reprinted from The American Mathematical Monthly, Vol. 111, No. 1 (Jan. 2004), pp. 22–31.�Count Palatine of the Rhine and Duke of Bavaria, son of Frederick V, the Winter King, Elector Palatine, and

king of Bohemia, and Elizabeth, daughter of James I of England.�It is convenient to regard a line segment as a rectangle with one side 0, and a rectangle as a box with one

edge 0.

25

i

i

“master” — 2012/11/15 — 11:23 — page 26 — #40i

i

i

i

i

i

26 Part I. Geometry

on the k C d edges for a k-dimensional box .k � d/ with edges p1, p2; : : : ; pk to fit

in a d -dimensional box with edges s1, s2; : : : ; sd . For k D 1 the matter is trivial for

each d : a line segment of length p fits in the d -dimensional box precisely when p �q

s21 C s2

2 C � � � C s2d

. For k D d D 2 the question of when one rectangle fits in another

was asked in 1956 by L. R. Ford [4], and a necessary and sufficient condition was soon

given by W. B. Carver [2] (see also Wetzel [12]). The problem was posed for k D d D 3

by F. M. Garnett in 1923, and an incomplete answer was supplied in 1925 by Carver [1].

Question 2. Find the largest box of given shape that can be passed through a suitably

perforated unit cube, i.e., for given � and � with 0 � � � � � 1, find the largest D so that

a D � �D � �D box can pass through a suitable hole in the unit cube.

This “passage” question bears the same relation to the “fitting” question 1 as Rupert’s

does to Nieuwland’s: once the largest rectangle with aspect ratio � D �=� in the unit cube

is known, then the hole through the cube having that rectangular cross section provides the

desired passage; and no smaller hole with similar cross section suffices. As an example,

suppose that � D 0:7 and � D 0:2. The largest D � �D � �D box that can pass through

the unit cube has D D 2.p

143 �p

8/=9 � 2:02885 (see Theorem 1 and its corollary),

so that �D � 1:42020 and �D � 0:40577. Figure 5.1a shows the largest rectangle with

aspect ratio � D 2=7 that fits in the unit cube and the cube perforated to accommodate the

box. Figure 5.1b shows the box in transit through the perforated cube.

(a) The perforated unit cube. (b) The box in transit.

Figure 5.1. A box through the unit cube.

The Fitting Lemma

Let C be the unit cube, and let @C denote its boundary, viz., the union of its six closed

faces. Once the aspect ratio � is fixed in Œ0; 1�, there is a largest real number Lmax so that

i

i

“master” — 2012/11/15 — 11:23 — page 27 — #41i

i

i

i

i

i

4. Prince Rupert’s Rectangles 27

the Lmax � �Lmax rectangle Rmax fits in C. The first thing we need to ask is precisely how

this largest rectangle Rmax is situated in C. The answer is given by the “fitting lemma,”

Lemma 4, whose proof is the objective of this section.

Note first that, because of the central symmetry, any rectangle R that fits in C must also

fit in C with its center at the center of symmetry of the cube.

Lemma 1. If a rectangle R D ABPQ (Figure 5.2) fits in the unit cube C in any way

whatsoever, then it also fits in C with its center at the center of C.

A

B

Q

P

Figure 5.2. A rectangle with aspect ratio 0.4.

Proof. Suppose that the rectangle R D ABPQ fits in C and let A0, B 0, P 0, Q0 be the

points symmetric to A, B , P , Q in the center O of C (Figure 5.3). Then the rectangle

R0 D A0B 0P 0Q0 is congruent to R and forms with R a parallelepiped that fits in C with

center at the center O of C. The medial section of this parallelepiped midway between R

and R0 is a rectangle that is congruent to R and has its center at O .

R

R'

A A'

O

Figure 5.3. Centering.

As a consequence of this lemma, we may always assume that a rectangle under consid-

eration is centrally placed in C. We call such a rectangle centered. A corner of a centered

rectangle lies on an edge or face of C precisely when the opposite corner of the rectangle

lies on the opposite edge or face of C.

Corners at vertices If one corner of a centered rectangle R in C lies at a vertex of C, then

the opposite corner of R lies at the opposite vertex of C, and the two remaining corners of

R are at opposite vertices of C. There are only two possibilities: � D 0 and R is a diagonal

line segment of C regarded as ap

3�0 rectangle, or � D 1=p

2 and R is ap

2�1 diagonal

cross section of C. In the following analysis, we usually suppose that the corners of R are

not at the vertices of C.

i

i

“master” — 2012/11/15 — 11:23 — page 28 — #42i

i

i

i

i

i

28 Part I. Geometry

Centered maximal rectangles How does a centered maximal rectangle R fit in C? We

show next that its corners must lie on the boundary @C of C.

Lemma 2. The corners of a centered maximal rectangle R lie on the boundary @C of C.

Proof. Our principal tool is the observation that a rectangle that fits in C with all four of

its corners inside C is not maximal. In particular, if a centered rectangle has two adjacent

corners in the interior of C, then it is not maximal. It follows that we need examine only

those centered rectangles having at least one corner on a face (but not at a vertex) of C.

Suppose that one corner A of a centered rectangle R D ABPQ lies inside C and an

adjacent corner B lies on a closed face f of C but not at a vertex (Figure 5.4). If B lies in

the interior of a face f , then an appropriate small rotation of R in its circumcircle (Figure

5.4a) moves B inside C as well; and consequently R is not maximal. If B lies on an edge e

of f (Figure 5.4b), then a suitable small rotation of R about the line ` normal to f through

O leaves A inside C and moves B off e into f , the case just considered; so again R is not

maximal. It follows as claimed that all four corners of a maximal centered rectangle must

lie on faces of C.

Of

A

B`

e

f

O

AB

`

(a) Corner B on an open face. (b) Corner B on an edge.

Figure 5.4. Corner A inside the cube.

But more is true: the corners of a maximal centered rectangle must be on the edges of

C.

Lemma 3. The corners of a centered maximal rectangle R lie on edges of C.

Proof. We show first that at least one of the corners of a centered maximal rectangle R

must lie on an edge of C. If adjacent corners A and B of R both lie on the same (open) face

of C (Figure 5.5a), then the other two corners P and Q lie on the opposite face of C, and

a suitable small rotation about the axis ` through O parallel to AB moves R completely

inside C, contrary to Lemma 2. If adjacent corners A and B of R lie on opposite (open)

faces of C, then the adjacent corners A and Q lie on the same (open) face of C, which as we

have just seen is a contradiction. If adjacent corners A and B of R lie on adjacent (open)

faces of C (Figure 5.5b), a suitable small rotation of R about the diagonal AP moves B

inside C, again contrary to Lemma 2. Consequently, at least one corner of R must lie on an

edge of C; of course, the opposite corner of R then lies on the opposite edge of C.

i

i

“master” — 2012/11/15 — 11:23 — page 29 — #43i

i

i

i

i

i


A

B

O`

A

B

P

`

(a) Corners A and B on the same face. (b) Corners A and B on adjacent faces.

Figure 5.5. Corners A and B on open faces.

Suppose next that a corner A of the centered maximal rectangle R lies on an edge e of

C. If a corner B adjacent to A lies on an (open) face f of C adjacent to e (Figure 5.6a),

then a suitable small rotation of R about the line ` through O perpendicular to f moves

both A and B into the (open) face f , contrary to the first part of this proof. If a corner B

adjacent to A lies on an (open) face f that does not meet e, then the opposite corner Q is

adjacent to A and lies on the face f 0 opposite the face f , which is adjacent to e. Again it

follows that R is not maximal. Finally, if a corner A of R lies on an edge e and an adjacent

corner B lies on a face f that meets e only at a vertex of C (Figure 5.6b), the corner P

opposite A lies on the edge opposite e, and a suitable small rotation about the diagonal AP

of R moves both B and Q into the interior of C, once more contrary to Lemma 2.

A

B

e

f O

`

A

P

e

f

B

(a) Corner B on an open face adjacent to e. (b) Corner B on an open face normal to e.

Figure 5.6. Corner A on an edge.

It follows, as claimed, that all four corners of a centered maximal rectangle lie on edges

of C.

Lemma 4 (Fitting Lemma). A centered maximal rectangle R whose corners are not at the

vertices of C fits in C in exactly one of the following two ways:

a. two adjacent corners of R lie on the same edge of C (Figure 5.8); or

b. two adjacent corners of R lie on adjacent edges of C (Figure 5.9).

i

i

“master” — 2012/11/15 — 11:23 — page 30 — #44i

i

i

i

i

i

30 Part I. Geometry

Figure 5.7. Rectangle is not maximal.

Proof. The four corners of R lie on a sphere centered at the center O of C, and each lies

at the same distance from the nearest vertex of C. If (a) and (b) are both false, then the

corners must also lie on four parallel edges of C, one on each. It follows that there are two

adjacent corners of R (labeled A and B in Figure 5.7) whose edge is parallel to an edge of

C. But then a small rotation about an axis through O parallel to AB would move all four

vertices off the edges of C into the (open) faces, contrary to Lemma 3.

The Maximal Rectangles

According to Lemma 4, there are only two possibilities to examine.

Situation (a). Suppose that two adjacent corners of the maximal centered rectangle R lie

on the edge e of C (Figure 5.8). Write a for the distance from A to the nearest vertex V of

C. Since AB � 1 � AQ, we see that L D AQ Dp

2 and �L D AB D 1 � 2a. Since

0 � a � 1=2, we find that when 0 � � � 1=p

2

L Dp

2: (5.1)

V

A

B

Q

P

O

Figure 5.8. Situation (a). Figure 5.9. Situation (b).

i

i

“master” — 2012/11/15 — 11:23 — page 31 — #45i

i

i

i

i

i


Situation (b). Suppose that two adjacent corners A and B of a centered maximal rect-

angle R of aspect ratio � are on the adjacent open edges eA and eB that share a vertex V

(Figure 5.9). Write a for the distance VA and b for the distance VB . Then

AB Dp

a2 C b2; BP Dp

1 C .1 � a/2 C .1 � b/2: (5.2)

Since OA D OB , it is easy to see that b must be either a or 1�a. Further, AB could be the

smaller dimension �L or the larger dimension L of R. Thus situation (b) splits into four

apparent subcases, one of which proves to be impossible.

Case b1:

b D a and AB D �L. Setting a D b and � D 1 in (5.2), we find that a D 3=4 so that

L D 3p

2=4, which is Nieuwland’s result. More generally, setting a D b, eliminating a,

and writing f1.�/ D L, we learn that when 0 � � � 1

L D 3p3 � �2 C

p2�

D f1.�/ (5.3)

and

a D 3p2

�p3 � �2 C

p2�

: (5.4)

Case b2:

b D 1 � a and AB D �L. Substituting in (5.2), eliminating a, requiring that 0 � a � 1,

and writing f2.�/ D L, we see that for � satisfying 1=p

3 � � � 1=p

2

L D 1p

1 � �2D f2.�/: (5.5)

In this case,

a D 1

2˙ 1

2

r

3�2 � 1

1 � �2:

Case b3:

b D a and AB D L. Eliminating a in (5.2) leads to the equation

.1 � �2/L2 � 2p

2L C 3 D 0:

Writing L D f3.�/, we discover that when 1=p

3 � � � 1

L D 3p

2 Cp

3�2 � 1D f3.�/: (5.6)

(The second root is too large when � < 1.) Then

a D 2 �p

2p

3�2 � 1

1 � �2; (5.7)

and to ensure that a � 1 we require additionally that � � 1=p

2. In other words, the

appropriate domain for f3.�/ in (5.6) is described by 1=p

2 � � � 1:

The apparent possibility with b D 1�a and AB D L leads to an equation for L having

no real solutions, so this possibility can not be realized.

i

i

“master” — 2012/11/15 — 11:23 — page 32 — #46i

i

i

i

i

i

32 Part I. Geometry

L

ff

f1

23

1

2

max

3

2

10 l1 l2

l

3 4

Figure 5.10. The longer side Lmax in terms of the aspect ration �.

Conclusions

All that remains is to collect our results. We find the following answer to Question 1 of the

introduction (see Figure 5.10).

Theorem 1. Let �1 D 1 �p

2=2 � 0:29289 and �2 D 1 � �1 Dp

2=2 � 0:70711. For

each � with 0 � � � 1, the longer side Lmax.�/ of the largest rectangle Rmax with aspect

ratio � that fits in the cube of side 1 is given as follows:

Lmax.�/ D

8

ˆˆˆ<

ˆˆˆˆ:

3p3 � �2 C

p2�

if 0 � � � �1;

p2 if �1 � � � �2;

3p

2 Cp

3�2 � 1if �2 � � � 1:

(5.8)

Proof. When 0 � � � �1 the longer side Lmax.�/ is the larger of the number f1.�/,

where f1 is the function of case b1, andp

2 (from (5.1)). If �1 � � � �2, Lmax.�/ Dp

2,

because f1.�/ �p

2 on the entire interval Œ�1; �2� and f2.�/ �p

2 whenp

3=3 � � ��2. Similarly Lmax.�/ D f3.�/ for � satisfying �2 � � � 1.

Figure 5.11 shows the positions of the maximal rectangles. When 0 � � � �1, there are

twelve maximal rectangles, all centered and located as shown in Figure 5.11a. Their longer

dimension L and the distance a D VA are given by formulas (5.3) and (5.4), respectively.

If �1 � � � �2, then L Dp

2, and there are six centered maximal rectangles and infinitely

many that are not centered, all located on the diagonal planes of C as shown in Figure 5.8.

If �2 � � � 1 there are again twelve maximal rectangles, all centered and located as

shown in Figure 5.11b, and their longer dimension L and the distance a D VA are given

by the formulas (5.6) and (5.7).

Theorem 1 gives a necessary and sufficient condition for a rectangle to fit in a unit

cube: an a � b rectangle with a � b fits into the unit cube C if and only if a � Lmax.b=a/,

where Lmax.�/ is given by (5.8).

i

i

“master” — 2012/11/15 — 11:23 — page 33 — #47i

i

i

i

i

i


A

(a) The case 0 � � � �1. (b) The case �2 � � � 1.

Figure 5.11. Maximal rectangles.

The answer to Question 2 of the introduction follows immediately from this theorem.

Corollary. Given � and � with 0 � � � � � 1, a D � �D � �D box can pass through a

suitable hole in a unit cube C if and only if

D � 1

�Lmax

� �

�

�

:

Proof. The largest L � .�=�/L rectangle that fits in C has L D Lmax.�=�/, so the box can

pass through the cube precisely when �D � L.

Final Remarks

Greg Huber, a physicist at the University of Massachusetts, and Terry J. Ligocki of Lawrence

Berkeley National Laboratory have generated data for the diagonal dmax.�/ of Rmax.�/ by

modeling the problem with an iterative numerical procedure known as Boltzmann simu-

lated annealing. Professor Huber emailed us as follows, “The basic idea comes from sta-

tistical mechanics. The size of the embedded rectangle is taken to be an “energy,” and con-

figurations (orientations of the rectangle) are perturbed and accepted stochastically with

respect to a continuously changing “temperature,” according to an annealing schedule.

Asymptotic convergence to a global extremum is guaranteed under a few strong conditions

on the acceptance function and scheduling, but in practice [we] chose heuristic functions.”

In March 2002 he forwarded to us two hundred computer-generated values of dmax.�/ for

� in Œ0; 1�. The graph of the corresponding values of Lmax.�/ D dmax=p

1 C �2 matches

closely the function described in Theorem 1 and sketched in bold in Figure 5.10.

In response to a problem proposed by Mauldon [9], Chapman [3] provided an algebraic

answer for the question of finding the largest square that fits in the unit cube, thereby

supplying a solution to this well-known problem of intuitive geometry that is “not overly

dependent on geometric intuition,” as the Monthly’s 1995 Problems Editors remarked. An

algebraic argument independent of the fitting lemma can probably be given for Theorem 1

along the same lines.

Finally, it is a pleasure to acknowledge the assistance of Jonathan P. Green of the UIUC

Department of Germanic Languages and Literatures with John Wallis’s Latin.

i

i

“master” — 2012/11/15 — 11:23 — page 34 — #48i

i

i

i

i

i

34 Part I. Geometry

Bibliography

[1] W. B. Carver, Solution to Problem 3036, Amer. Math. Monthly 32 (1925) 47–49.

[2] ——, Solution to Problem E1225, Amer. Math. Monthly 64 (1957) 114–116.

[3] R. J. Chapman, Solution to Problem 10251, Amer. Math. Monthly 102 (1995) 465–467.

[4] L. R. Ford, Problem E1225, Amer. Math. Monthly 63 (1956) 421.

[5] M. Gardner, The Colossal Book of Mathematics, W. W. Norton, New York, 2001.

[6] F. M. Garnett, Problem 3036, Amer. Math. Monthly 30 (1923) 337.

[7] R. K. Guy and R. J. Nowakowski, Monthly unsolved problems, 1969–1997, Amer. Math.

Monthly 104 (1997) 967–973.

[8] A. Ehrenfeucht, The Cube Made Interesting, Pergamon Press, Oxford, 1964.

[9] J. G. Mauldon, Problem 10251, Amer. Math. Monthly 99 (1992) 782.

[10] D. J. E. Schreck, Prince Rupert’s problem and its extension by Pieter Nieuwland, Scripta Math.

16 (1950) 73–80, 261–267.

[11] J. Wallis, De Algebra Tractatus, 1685, in Opera Mathematica, vol. 2, Oxoniae: E Theatro Shel-

doniano, 1693, pp. 470–471.

[12] J. E. Wetzel, Rectangles in rectangles, Math. Mag. 73 (2000) 204–211.

i

i

“master” — 2012/11/15 — 11:23 — page 35 — #49i

i

i

i

i

i

IINumber Theory

and

Graph Theory

i

i

“master” — 2012/11/15 — 11:23 — page 36 — #50i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page 37 — #51i

i

i

i

i

i

6Transcendentals and Early Birds

Martin Gardner

It’s hard to believe that it was not until 1844 that transcendental numbers were known to

exist! But first, a few definitions.

A rational number is one that can written as a=b, where a and b are integers. In decimal

form, rational numbers either terminate (1=4 D :25) or they have a pattern of consecutive

digits that repeat endlessly (1=7 D :142857 142857 142857 : : :).

An irrational number is one that cannot be expressed as a=b where a and b are inte-

gers. In decimal form, it never ends, and it has no pattern of consecutive digits that keep

repeating.

An algebraic number is the root of a polynomial equation with rational coefficients. It

can be rational or irrational. Thus 2 is algebraic because it is the root of such equations as

10x D 20. The square root of 2 is algebraic because it is the root of x2 D 2.

Transcendental numbers, such as � and e, are irrational numbers that are not the roots

of algebraic equations with rational coefficients.

In 1844, the French mathematical Joseph Liouville (1809-1882) first proved the ex-

istence of transcendentals by actually constructing an infinite number of them. Liouville

numbers, as they are known, can be called artificial because Liouville did not find them

anywhere in mathematics. He simply made them up out of whole cloth.

The simplest Liouville number is the binary transcendental shown below:

:110001000000000000000001 : : :

The ones are at positions given by consecutive factorials. The first 1 is at position

1Š D 1, the second one is at position 2Š D 2, the third is at position 3Š D 6, the fourth is at

position 4Š D 24, and so on.

It’s easy to construct artificial numbers that are obviously irrational but not transcen-

dental (like the square root of two). It is not so easy to construct such numbers that are

transcendental. Consider

:101001000100001 : : :

Liouville showed that any number of this form, where the ones can be replaced by any

constant digit from 2 through 9, is transcendental.

It was not long until � , e, and scores of other famous irrationals were shown to be

transcendental. In spite of the great recent progress in proving numbers transcendental,

Reprinted from Math Horizons, Vol. 13, No. 2 (Nov. 2005), pp. 5, 34.

37

i

i

“master” — 2012/11/15 — 11:23 — page 38 — #52i

i

i

i

i

i

38 Part II. Number Theory and Graph Theory

many deep unsolved questions remain. For example, e� has been proved transcendental,

but no one yet knows if �e is transcendental or even if � C e or �e or ee or �� are

irrational! All are believed to be transcendental, but until some genius proves otherwise, it

is possible that the sum of � and e is a rational number!

Other than Liouville numbers, the most famous of all artificial transcendentals is made

simply by putting a decimal point in front of the counting numbers, like so:

:123456789101112131415 : : :

This is called Mahler’s number, after Kurt Mahler, a mathematician who first proved

that this counting number is transcendental is all bases. You will find a neat proof for

base 10 in Ivan Niven’sNiven, Ivan marvelous little book Irrational Numbers, recently

republished by the Mathematical Association of America.

A number is normal if every digit, every doublet, every triplet, or any specified pattern

of digits appear the number of times you would expect if the number were generated by a

randomizer. So far, � , e, and the irrational roots of all algebraic numbers have passed all

statistical tests for normalcy, but none has yet been proved normal.

In 1934, it was shown ab is transcendental if a is an algebraic number not 0 or 1, and

b is any irrational number. Thus 2� and 10p

2 are transcendental.

Now for a startling coincidence. Note the last five digits of Mahler’s number. They

are � to four decimals! (I’m indebted to Jaime Poniachik, an Argentine puzzle maker, for

telling me about this.)

Every positive integer obviously appears somewhere in Mahler’s number, but many

numbers show up far ahead of where they normally would occur as a counting number. Pi

turns up so early because 3 is the last digit of 13, followed by 14 and 15. Poniachik points

out that � to 5 decimals appears as early as 141593-141594. Consider 666. Of course it

shows up in the counting sequence, but you can spot it early in the sequence 65-66-67.

Much innocent but totally useless amusement can be had by searching for the earliest

appearance of such familiar numbers as integral powers of roots, the last four digits of

your phone number, your house number, and so on. Better still, see if you can formulate

a procedure for determining the earliest appearance of any given integer in the counting

number.

If a number appears ahead of its position as a counting number, let’s call it an early bird.

The famous year 1492, for another example, is an early bird (491-492). Poniachik points

out that 8192, the thirteenth power of 2, is an early bird (18-19-20). Like � , the transcen-

dental number e, to four decimals (2.7182) is also an early bird (2718-2719), though not

as early as � .

What is the smallest early bird prime? When will the next coming year be an early

bird? Mathematician Solomon W. Golomb, in a personal letter, mentioned that his home

zip code 91011 (9-10-11) is a very early bird. The early bird numbers, Golomb wrote, are

a subset of the counting numbers, and therefore countably infinite. He is convinced that

“almost all” integers are early birds. There are no one-digit early birds, but half (45 out of

90) two-digit numbers are early birds, with rapidly increasing members of the species as

the number of their digits increases.

i

i

“master” — 2012/11/15 — 11:23 — page 39 — #53i

i

i

i

i

i

7Squaring, Cubing, and Cube Rooting

Arthur T. Benjamin

I still recall the thrill and simultaneous disappointment I felt when I first read Mathematical

Carnival [4] by Martin Gardner. I was thrilled because, as my high school teacher had told

me, mathematics was presented there in a playful way that I had never seen before. I was

disappointed because Gardner quoted a formula that I thought I had “invented” a few years

earlier. I have always had a passion for mental calculation, and the formula (7.1) below

appears in Gardner’s chapter on “Lightning Calculators.” It was used by the mathematician

A. C. Aitken to square large numbers mentally.

Squaring

Aitken took advantage of the following algebraic identity.

A2 D .A � d/.A C d/ C d 2: (7.1)

Naturally, this formula works for any value of d , but we should choose d to be the distance

to a number close to A that is easy to multiply.

Examples To square the number 23, we let d D 3 to get

232 D 20 � 26 C 32 D 520 C 9 D 529:

To square 48, let d D 2 to get

482 D 50 � 46 C 22 D 2300 C 4 D 2304:

With just a little practice, it’s possible to square any two-digit number in a matter of

seconds. Once you have mastered those, you can quickly square three-digit numbers by

rounding up and down to the nearest hundred.

Examples

2232 D 200 � 246 C 232 D 49;200 C 529 D 49;729

9522 D 1000 � 904 C 482 D 904;000 C 2;304 D 906;304:


39

i

i

“master” — 2012/11/15 — 11:23 — page 40 — #54i

i

i

i

i

i


To do mental calculations of this size, one needs to be quick at multiplying 2-digit and

3-digit numbers by 1-digit numbers, generating the answer from left to right. More details

and examples are given in Secrets of Mental Math [1], which Martin encouraged me to

write. As I prepared a DVD version of this book [2] recently, I learned some new methods

for cubing and cube-rooting. Although much of that material did not end up in the DVD, I

thought it would be of interest to readers of the College Mathematics Journal.

Cubing

To cube a two-digit number in your head quickly, it is worth learning how to multiply two

two-digit numbers that are near each other. I call it the “close together method” and it is

based on the formula

.z C a/.z C b/ D z.z C a C b/ C ab; (7.2)

where z is typically a number that ends in zero.

Example For the problem 23 � 26, z D 20, a D 3, b D 6, leads to

23 � 26 D 20 � 29 C 3 � 6 D 580 C 18 D 598:

Notice that on both sides of (7.2), the two-digit numbers being multiplied have the

same sum (23 C 26 D 49 D 20 C 29).

Example Thus, to do a problem like 88 � 86, since the numbers sum to 174 D 90 C 84,

we can begin with the product 90 � 84:

88 � 86 D 90 � 84 C .�2/ � .�4/ D 7560 C 8 D 7568:

To cube a two-digit number, we can exploit the algebra

A3 D .A � d/A.A C d/ C d 2A:

Example Thus to cube 23, we can do

233 D 20 � 23 � 26 C 32 � 23 D 20 � 598 C 9 � 23 D 11;960 C 207 D 12;167;

where we used the close-together method to do 23 � 26.

But there is a faster way. Since we know that the problem will begin by doing 20 � 29,

let’s build that into the algebra. Writing the problem a different way, to cube z C d , we do

.z C d/3 D zŒz.z C 3d/ C 3d 2� C d 3: (7.3)

Example To cube 23, z D 20 and d D 3 leads to the easier calculation

233 D 20 � Œ20 � 29 C 27� C 33 D 20 � 607 C 33 D 12;140 C 27 D 12;167:

Equation (7.3) is an instance of Horner’s method for polynomial evaluation. Notice that

when cubing a two-digit number, d will always be one of the numbers ˙1; ˙2; ˙3; ˙4; ˙5

and so 3d 2 will always be 3; 12; 27; 48 or 75, which reduces the mental effort. Also notice

that in the first two-digit multiplication, the number z C 3d will always end with a digit

that is three times the original last digit.

i

i

“master” — 2012/11/15 — 11:23 — page 41 — #55i

i

i

i

i

i

7. Squaring, Cubing, and Cube Rooting 41

Example For 883, by tripling the last digit, we know that we will multiply our rounded

number (90) by the number closest to 88 that ends in 4, namely 84. Thus,

883 D 90 � Œ90 � 84 C 12� C .�2/3 D 90 � 7572 � 23 D 681;480 � 8 D 681;472:

As a side note, the last part of the calculation is not as hard as it looks. By splitting 7572 D7500 C 72,

9 � 7572 D .9 � 7500/ C .9 � 72/ D 67;500 C 648 D 68;148:

To complete the calculation, simply multiply this number by 10, and subtract 8 at the same

time.

The calculation of two-digit cubes this way is surprisingly fast. With practice, the

squares and cubes of two-digit numbers below 50 will be so quick that you will be able to

cube three-digit numbers as well.

Example Exploiting the previous calculations:

3233 D 300 � Œ300 � 369 C 3.232/� C 233

D 300 � Œ110;700 C 1587� C 233

D 3 � 112;287 � 100 C 233

D 33;686;100 C 12;167

D 33;698;267:

Another calculating tip. After the second multiplication, you can almost always say

the millions part of the answer (here, 33 million) and hold the hundreds digit on your

fingers. Here, just raise 1 finger to “hold onto” the hundreds digit so you only have to

remember 686 while you calculate the cube of 23. By the way, if you just want a good

cubing approximation, simply compute z.z.z C 3d//.

For example,

3233 � 369 � 300 � 300 D 33;210;000:

This will come in handy in the calculation of cube roots.

Cube rooting

One of the easiest feats of lightning calculation is determining the cube root of a perfect

cube when the cube root is two digits. The following description comes from Martin Gard-

ner’s classic book, Mathematics, Magic, and Mystery [5].

The cube root demonstration begins by asking members of the audience to

select any number from 1 through 100, cube it, then call out the result. The

performer instantly gives the cube root of each number called. To do the trick

it is necessary first to memorize the cubes of the numbers from 1 through 10.

An inspection of this table reveals that each cube ends in a different digit.

The digit corresponds to the cube root in all cases except 2 and 3 and 7 and

i

i

“master” — 2012/11/15 — 11:23 — page 42 — #56i

i

i

i

i

i


Table 7.1. Table of cubes.

1 2 3 4 5 6 7 8 9 10

1 8 27 64 125 216 343 512 729 1000

8. In these four cases the final digit of the cube is the difference between the

cube root and 10.

To see how this information is used by the lightning calculator, let us sup-

pose that a spectator calls out the cube 250047. The last number is a 7 which

tells the performer immediately that the last number of the cube root must be

3. The first number of the cube root is determined as follows. Discard the last

three figures of the cube (regardless of the size of the number) and consider

the remaining figures—in this example they are 250. In the above table, 250

lies between the cubes of 6 and 7. The lower of the two figures—in this case 6

—will be the first figure of the cube root. The correct answer, therefore, is 63.

One more example will make this clear. If the number called out is 19,683,

the last digit, 3, indicates that the last digit of the cube root is 7. Discarding

the final three digits leaves 19, which falls between the cubes of 2 and 3. Two

is the lower number, therefore we arrive at a final cube root of 27.

I learned this trick in a book on mental magic [3] as a high school student, and won-

dered how it might be extended to larger problems, where the cube root was a three-digit

number. I didn’t pursue that question seriously at the time, since the cube of a three-digit

number can be as large as nine digits, and most calculators back then only went up to

eight. Now that calculators with greater capacity are quite common, I have learned two

quick ways to do this problem, using casting out nines and casting out elevens.

Casting out nines

Ask someone to cube a three-digit number, and ask how many digits are in the answer.

(It should be seven, eight, or nine digits.) Ask the volunteer to recite (or write down) the

answer, and you can pretty easily determine the cube root in your head. The first digit and

last digit of the cube root are determined exactly as before. The last digit of the cube root

is uniquely determined by the last digit of the cube. (In fact, the last digit of the cube root

is the last digit of the cube of the last digit of the cube!) For the first digit, we look at the

magnitude of the millions.

Example Let’s find the cube root of the perfect cube 377,933,067. The first digit of the

cube root must be 7 (since 377 lies between 73 D 343 and 83 D 512) and the last digit of

the cube root must be 3 (since the cube ends in 7). Hence the answer must be of the form

7‹3. How do we determine the middle digit? The method I use is simply to add the digits of

the cube, mod 9. Here the digits sum to 45, which is a multiple of 9. This can only happen

if the cube root is a multiple of three. In other words, the cube root must be 723 or 753 or

783. Since 377 is much closer to 343 than 512, we would go with a cube root of 723. For

extra verification, we could estimate 7203 � 700 � 700 � 760 D 372;400;000.

i

i

“master” — 2012/11/15 — 11:23 — page 43 — #57i

i

i

i

i

i

7. Squaring, Cubing, and Cube Rooting 43

This same approach can be used when the cube is not a multiple of nine, since there

are only two other possible outcomes, as seen in Table 7.2. If the cube reduces to 1 (mod

9), then the cube root reduces to 1 (mod 3). If the cube reduces to 8 (mod 9), then the cube

root reduces to 2 (mod 3).

Table 7.2. Table of cubes mod 9.

n 0 1 2 3 4 5 6 7 8

n3 0 1 8 0 1 8 0 1 8

Example If the 3-digit cube results in 19,248,832, then we know that the cube root is

of the form 2‹8. To determine the middle digit, notice that the cube has digit sum 37,

which reduces to 1 mod 9. Hence the original number is congruent to 1 mod 3, so the

cube root must be either 208 or 238 or 268 or 298. Judging from the proximity of 19 to its

surrounding cubes, 8 and 27, the answer 268 seems most likely. Since 2703 � 300�300�210 D 18;900;000, we have more confidence in our answer of 268.

Casting out elevens

With a little extra mental effort and memory, we can determine the middle digit without

any guesswork, because the cubes of the numbers 0 through 10 are all distinct mod 11, as

shown in Table 7.3.

Table 7.3. Table of cubes mod 11.

n 0 1 2 3 4 5 6 7 8 9 10

n3 0 1 8 5 9 4 7 2 6 3 10

Example revisited For the cube root of 19,248,832, we know that the answer will begin

with 2 and end in 8 as in the last example. By alternately adding and subtracting its digits

from right to left, we see that it reduces to 2 � 3 C 8 � 8 C 4 � 2 C 9 � 1 D 9 .mod 11/:

Hence the cube root must reduce to 4 (mod 11). Since the answer is of the form 2?8, then

2 C 8�‹ D 4 (mod 11) results in an answer of 268, as well. Note that if the cube has 7 or 9

digits, then you can alternately add and subtract the digits in their natural order. If the cube

has 8 digits, you can start with 11 then alternately subtract and add the subsequent digits.

Example To find the number with cube 111,980,168, we see immediately that the answer

will be of the form 4‹2. Casting out 11s from left to right, we compute 1 � 1 C 1 � 9 C8 � 0 C 1 � 6 C 8 D 3. Here, the cube root must reduce to 9 (mod 11). Thus 4 C 2�‹ D 9

.mod 11/ tells us that the missing digit is 8. Hence the cube root is 482.

Between the two approaches, I prefer casting out nines over casting out elevens, since

it is easier to add the digits of the cube as they are called out. I recommend that the reader

play with a few examples using both methods, choose one to master, and cast out the other.

i

i

“master” — 2012/11/15 — 11:23 — page 44 — #58i

i

i

i

i

i


Acknowledgment The author wishes to thank the referee for many helpful suggestions,

and especially Martin Gardner for being such an important factor in his life.

Bibliography

[1] A. Benjamin and M. Shermer, Secrets of Mental Math: The Mathemagician’s Guide to Lightning

Calculation and Amazing Math Tricks, Random House, New York, 2006.

[2] A. T. Benjamin, The Secrets of Mental Math (DVD Course), The Teaching Company, Chantilly,

VA, 2011.

[3] T. Corinda, 13 Steps to Mentalism, 4th ed., Tannen Magic Publishing, New York, 1968.

[4] M. Gardner, Mathematical Carnival: A New Round-up of Tantalizers and Puzzles from Scientific

American, Random House, New York, 1965.

[5] ——, Mathematics, Magic, and Mystery, Dover, New York, 1956.

i

i

“master” — 2012/11/15 — 11:23 — page 45 — #59i

i

i

i

i

i

8Carryless Arithmetic Mod 10

David Applegate, Marc LeBrun, and N. J. A. Sloane

Nim

Forms of Nim have been played since antiquity and a complete theory was published as

early as 1902 (see [3]). Martin Gardner described the game in one of his earliest columns

[7] and returned to it many times over the years ([8]–[16]).

Central to the analysis of Nim is Nim-addition. The Nim-sum is calculated by writing

the terms in base 2 and adding the columns mod 2, with no carries. A Nim position is a

winning position if and only if the Nim-sum of the sizes of the heaps is zero [2], [7].

Is there is a generalization of Nim in which the analysis uses the base-b representa-

tions of the sizes of the heaps, for b > 2, in which a position is a win if and only if the

mod-b sums of the columns is identically zero? One such game, Rimb (an abbreviation of

Restricted-Nim) exists, although it is complicated and not well known. It was introduced in

an unpublished paper [6] in 1980 and is hinted at in [5]. Despite his interest in Nim, Martin

Gardner never mentions Rimb , nor does it appear in Winning Ways [2], which extensively

analyzes Nim variants.

In the present paper we focus on b D 10, and consider, not Rim10 itself, but the arith-

metic that arises if calculations, addition and multiplication, are performed mod 10, with

no carries. Along the way we encounter several new and interesting number sequences,

which would have appealed to Martin Gardner, always a fan of integer sequences.

The Carryless Islands

The fabled, carefree residents of the Carryless Islands in the remote South Pacific have

very few possessions, which is just as well, since their arithmetic is ill-suited to accurate

bookkeeping. When they add or multiply numbers, they follow rules similar to ours, except

that there are no carries into other digit positions. Sociologists explain this by noting that

the Carryless Islands were originally penal colonies, and, as penal institutions are generally

known to have excellent dental care, the islanders were, happily, generally free of carries.

We will use C and � for their operations�, and C and � for the standard operations used

Reprinted from The College Mathematics Journal, Vol. 43, No. 1 (Jan. 2012), pp. 43–50.�We prefer not to use outlandish symbols such as | and �, since C and � are perfectly reasonable operations,

although to our eyes they have rather strange properties. As Marcia Ascher remarks, writing about mathematics

in indigenous cultures, “in many cases these cultures and their ideas were unknown beyond their own boundaries,or misunderstood when first encountered by outsiders” [1].

45

i

i

“master” — 2012/11/15 — 11:23 — page 46 — #60i

i

i

i

i

i


by the rest of the world. Addition and multiplication of single-digit numbers are performed

by “reduction mod 10.” Carry digits are simply ignored, so 9C4 D 3, 5C5 D 0, 9C4 D 6,

5�4 D 0, and so on. Adding or multiplying larger numbers also follows the familiar

procedures, but again with the proviso that there are no carries. For example, adding 785

and 376 produces 51, and the product of 643 and 59 is 417 (see Figure 8.1).

(a) 7 8 5

C 3 7 6

0 5 1

(b) 6 4 3

� 5 9

4 6 7

0 0 5

0 4 1 7

Figure 8.1. (a) Carryless addition. (b) Carryless multiplication.

What does elementary number theory look like on these islands? Let’s start with the

carryless squares n�n. For n D 0; 1; 2; 3 we get 0, 1, 4, 9, Then for n > 3 we have

4�4 D 6, 5�5 D 5, 6�6 D 6, 7�7 D 9, 8�8 D 4, 9�9 D 1, 10�10 D 100; : : : ; giving

the sequence

0; 1; 4; 9; 6; 5; 6; 9; 4; 1; 100; 121; 144; 169; 186; 105; 126; 149; 164; : : : :

It turns out that this is entry A059729 in the OEIS [17], contributed by Henry Bottomley

on February 20, 2001, although without any reference to earlier work on these numbers.

Bottomley also contributed sequence A059692, giving the carryless multiplication table,

and several other sequences related to carryless products. Likewise the sequence of values

of nCn,

0; 2; 4; 6; 8; 0; 2; 4; 6; 8; 20; 22; 24; 26; 28; 20; 22; 24; 26; 28; 40; 42; : : : ;

is entry A004520, submitted to the OEIS by one of the present authors around 1996, again

without references. (If these numbers are sorted and duplicates removed, we get the carry-

less “evenish” numbers, that is, numbers all of whose digits are even, A014263.) Carryless

arithmetic must surely have been studied before now, but the absence of references in [17]

suggests that it is not mentioned in any of the standard texts on number theory.

The carryless primes

If we require that a prime � is a number whose only factorization is 1 times itself, we are

out of luck, since every carryless number is divisible by 9, and there would be no primes

at all. (For 9�1 D 9, 9�2 D 8, 9�3 D 7; : : : ; 9�9 D 1. So if we construct a number � by

replacing all the 1’s in � by 9’s, all the 2’s by 8’s, : : : then � D 9��, and � would not be

a prime.)

There are primes, when defined in the right way. Since 1�1 D 1, 3�7 D 1 and 9�9 D1, all of 1; 3; 7 and 9 divide 1 and so divide any number. We call 1; 3; 7 and 9 units, the usual

name for integers that divide 1. Units should not be counted as factors when considering if

i

i

“master” — 2012/11/15 — 11:23 — page 47 — #61i

i

i

i

i

i

8. Carryless Arithmetic Mod 10 47

a number is prime (just as factors of �1 are ignored in ordinary arithmetic: 7 D .�1/�.�7/

doesn’t count as a factorization when considering if 7 is a prime).

So we define a carryless prime to be a non-unit � whose only factorizations are of the

form � D u�� where u is a unit. Computer experiments suggest that the first few primes

are

21; 23; 25; 27; 29; 41; 43; 45; 47; 49; 51; 52; 53; 54; 56; 57; 58; 59; 61; 63; : : : ; (8.1)

but there are surprising omissions in this list, resulting from some strange factorizations:

2 D 2�51, 10 D 56�65, 11 D 51�61. It is hard to be sure at this stage that the above list

is correct, since there exist factorizations where one of the numbers is much larger than the

number being factored, such as 2 D 4�5005505553. One property that makes carryless

arithmetic interesting is the presence of zero-divisors: the product of two numbers can be

zero without either of them being zero: 2�5 D 0, 628�55 D 0. Perhaps 21 is the product of

two really huge numbers? Nonetheless, the list is correct, as we will see (it is now sequence

A169887 in [17]).

Algebra to the rescue

The secret to understanding carryless arithmetic is to introduce a little algebra. Let R10 de-

note the ring of integers mod 10, and R10ŒX� the ring of polynomials in X with coefficients

in R10. Then we can represent carryless numbers by elements of R10ŒX�: 21 corresponds

to 2X C 1, 109 to X2 C 9, and so on. Carryless addition and multiplication are simply

addition and multiplication in R10ŒX�: our first example,

785C376 D 51

corresponds to

.7X2 C 8X C 5/ C .3X2 C 7X C 6/ D 5X C 1;

where the polynomials are added or multiplied in the usual way, and the coefficients then

reduced mod 10. Conversely, any element of R10ŒX� represents a unique carryless number

(just set X D 10 in the polynomial). In fact arithmetic in R10ŒX� is clearly exactly the

same as the arithmetic of carryless numbers. This could be used as a formal definition of

carryless arithmetic mod 10. It also shows that this arithmetic is commutative, associative

and distributive.

Since R10ŒX� is a ring, we can not only add and multiply, we can also subtract, some-

thing the Carryless Islanders never considered. The negatives of the elements of R10 are

�1 D 9, �2 D 8; : : : ; �9 D 1, and similarly for the elements of R10ŒX�. So the nega-

tive of a carryless number is its “10’s complement,” obtained by replacing each nonzero

digit d by 10 � d , for example �702 D 308. To subtract A from B , we add �A to B:

650�702 D 650C308 D 958. This is equivalent to doing elementary school subtraction

where we can “borrow” but don’t have to pay back!

The units in R10ŒX�, that is, the elements that divide 1, are the constants 1; 3; 7; 9, and

the carryless primes that we defined are the irreducible elements in R10ŒX�, that is, non-

units f10.X/ 2 R10ŒX� whose only factorizations are of the form f10.X/ D ug10.X/,

where u is a unit and g10.X/ 2 R10ŒX�. The units can also be written as 1; �1; 3 and �3,

i

i

“master” — 2012/11/15 — 11:23 — page 48 — #62i

i

i

i

i

i


which more closely relates them to the units 1 and �1 in ordinary arithmetic (3 and �3 act

in some ways like the imaginary units i and �i , squaring to �1, for example).

The key to further progress is to notice that R10 is the direct sum of the ring R2 of

integers mod 2 and the ring R5 of integers mod 5. Given r10 2 R10, we read it mod 2 and

mod 5 to obtain a pair Œr2; r5� with r2 2 R2, r5 2 R5. The elements 0; 1; : : : ; 9 2 R10

(or equivalently the carryless digits 0; 1; : : : ; 9) and their corresponding pairs Œr2; r5� are

given by the following table. The Chinese Remainder Theorem guarantees that this is a

one-to-one correspondence.

0 1 2 3 4 5 6 7 8 9

[0,0] [1,1] [0,2] [1,3] [0,4] [1,0] [0,1] [1,2] [0,3] [1,4](8.2)

As a check, we note that f1g is the (singleton) set of units in R2, while f1; 2; 3; 4g is the set

of units in R5, so the pairs Œ1; 1�, Œ1; 2�, Œ1; 3� and Œ1; 4� correspondingly produce the units

1, 7, 3 and 9 of R10.

Similarly, polynomials f10.X/ 2 R10ŒX� correspond to pairs of polynomials Œf2.X/;

f5.X/�, obtained by reading f10.X/ respectively mod 2 and mod 5. Conversely, given any

such pair of polynomials Œf2.X/; f5.X/�, there is a unique f10.X/ 2 R10ŒX� that corre-

sponds to them, which can be found using (8.2). We indicate this by writing f10.X/ $Œf2.X/; f5.X/�. If also g10.X/ $ Œg2.X/; g5.X/�, then

f10.X/ C g10.X/ $ Œf2.X/ C g2.X/; f5.X/ C g5.X/�

and

f10.X/g10.X/ $ Œf2.X/g2.X/; f5.X/g5.X/�:

We are now in a position to answer many questions about carryless arithmetic.

The carryless primes, again

What are the irreducible elements f10.X/ 2 R10ŒX�? If f10.X/ $ Œf2.X/; f5.X/� is

irreducible then certainly f2 and f5 must be either units or irreducible, for if f2 D g2h2

then we have the factorization Œf2; f5� D Œg2; f5�Œh2; 1�. Also Œf2; f5� D Œf2; 1�Œ1; f5�, so

one of f2, f5 must be irreducible and the other must be a unit. So the irreducible elements

in R10ŒX� are of the form Œf2.X/; u�, where f2.X/ is an irreducible polynomial mod 2

of degree � 1 and u 2 f1; 2; 3; 4g, together with elements of the form Œ1; f5.X/�, where

f5.X/ is an irreducible polynomial mod 5 of degree � 1.

The irreducible polynomials mod 2 are X , X C 1, X2 C X C 1; : : : ; and the irre-

ducible polynomials mod 5 are uX , uX C v; : : : ; where u; v 2 f1; 2; 3; 4g (see entries

A058943, A058945 in [17]). The first few irreducible elements in R10ŒX� are therefore

ŒX; 1�, ŒX; 2�, ŒX; 3�, ŒX; 4�, ŒX C1; 1�, ŒX C1; 2�; : : : ; and Œ1; X�, Œ1; 2X�, Œ1; 3X�, Œ1; 4X�,

Œ1; X C 1�, Œ1; 2X C 1�; : : : : The corresponding carryless primes, according to (8.2), are

56; 52; 58; 54; 51; 57; : : : ; and 65; 25; 85; 45; 61; 21; : : : : And so we can verify that the list

in (8.1) is correct.

We will call a number with at least two digits in which all digits except the rightmost

are even but the rightmost is odd an e-type number (A143712), and a number with at

least two digits in which all digits except the rightmost are 0 or 5 and the rightmost is

i

i

“master” — 2012/11/15 — 11:23 — page 49 — #63i

i

i

i

i

i


neither 0 nor 5 an f-type number (A144162). Similarly, we call the primes corresponding

to the irreducible elements Œ1; f5.X/� e-type primes, and the primes corresponding to the

irreducible elements Œf2.X/; u� f-type primes.

We also see that our earlier concern about the primality of 21 was groundless. It is

impossible for the length (in decimal digits) of a nonzero carryless product to be less than

the length of both of the factors. This follows from the fact that if `.n/ is the number of

decimal digits in the number n > 0 corresponding to a pair Œf2.X/; f5.X/�, then `.n/ D1 C maxfdeg f2; deg f5g. So if mn > 0, `.mn/ � minf`.m/; `.n/g.

Also, since we know how many irreducible polynomials mod 2 and mod 5 there are of

given degree (see A001037, A001692 in [17]), we can write down a formula for the number

of k-digit carryless primes, something that we cannot do for ordinary primes, namely

4

k � 1

X

d dividesk�1

�

�k � 1

d

�

.2d C 5d /;

for k � 2, where � is the Mobius function (A008683). There are 28 primes with two digits

(the twenty listed in (8.1), together with 65; 67; 69; 81; 83; 85; 87; 89), 44 with three digits,

: : : (A169962). For large k the number is about 4 � 5k�1=.k � 1/, whereas the number of

ordinary primes with exactly k digits is much larger, about 9�10k�1=.k log 10/, so carryless

primes are much rarer than ordinary primes.

Incidentally, the prime ideals in R10ŒX�, as distinct from the irreducible elements, all

have a single generator, which is one of Œ0; 1�; Œ1; 0�; Œ1; 1�; Œf2.X/; 1�; Œ1; f5.X/�, where

f2.X/, f5.X/ are irreducible (cf. [18, Chap. III, Thm. 30]).

The carryless squares, again

Squaring a mod 2 polynomial is easy: f2.X/2 D f2.X2/. So if n corresponds to the

pair Œf2.X/; f5.X/�, n2 corresponds to Œf2.X2/; f5.X/2� D Œf2.X2/; 0� C Œ0; f5.X/2�.

This gives a two-step recipe for producing all carryless squares. First find, using (8.2), the

carryless number m corresponding to Œ0; f5.X/2�, where f5.X/ is any polynomial mod 5.

The effect of adding a nonzero Œf2.X2/; 0� changes some subset of the digits in positions

0; 2; 4; : : : of m by the addition of 5 mod 10.

For example, if f5.X/ D X C 2, f5.X/2 D X2 C 4X C 4, and by (8.2) Œ0; f5.X/2 �

corresponds to the carryless square m D 644. We now add 5 mod 10 to any subset of

the digits in positions 0; 2; 4; 6; : : : of m (considering m extended by prefixing it with

any number of zeros), obtaining infinitely many squares 644; 649; 144; 149; : : : ;

50644; 5050649; : : : :

This also leads to a formula for the number of k-digit carryless squares. For even k the

number is 0, and for odd k it is

1

29 � 10.k�1/=2 C 2.k�3/=2

(zero is excluded from the count). There are five squares of length 1 (namely 1; 4; 5; 6 and

9), 46 of length 3, : : : (see A059729, A169889, A169963). For large odd k there are about

twice as many k-digit carryless squares as ordinary squares.

i

i

“master” — 2012/11/15 — 11:23 — page 50 — #64i

i

i

i

i

i


Divisors and factorizations

What about the factorization of numbers into the product of carryless primes? Unfortu-

nately, the existence of zero-divisors complicates matters, and it turns out that there is no

natural way to define, for example, an analog of the usual sum-of-divisors function �.n/.

In our analysis we define several classes of carryless numbers:

U WD f1; 3; 7; 9g, the units,

E WD f0; 2; 4; 6; 8; 20; 22; : : :g, the “evenish” numbers, in which all digits are even

(A014263),

F WD f0; 5; 50; 55; : : :g, the “fiveish” numbers, in which all digits are 0 or 5 (A169964),

Z WD E [ F D f0; 2; 4; 5; 6; 8; 20; 22; : : :g, the zero-divisors (A169884),

N WD f1; 3; 7; 9; 10; 11; 12; 13; : : :g, the positive numbers not in Z (A169968).

Suppose d is a carryless divisor of n, that is, there is a number q such that d�q D n.

What can be said about the possible choices for q? One can show—we omit the straight-

forward proofs—that

� if d 2 N then there is a unique q,

� if d 2 E then d�q0 D n if and only if q0 D q C v for some v 2 F ,

� if d 2 F then d�q0 D n if and only if q0 D q C e for some e 2 E .

The same distinctions are needed to describe factorizations into primes.

� If n 2 N then n has a unique factorization as a carryless product of primes, up to

multiplication by units. For example, we already saw 10 D 56�65. But we also

have 10 D .3�56/�.7�65/ D 58�25 D .9�56/�.9�65/ D 54�45 D 9�52�25,

etc., illustrating the nonuniqueness. Also 11 D 51�61; 101 D 21�29�51, 1234 D23�23�23�51�51�52. It follows that any non-unit in N can be written both as

e�f and e0Cf 0, where e and e0 are e-type numbers and f and f 0 are f-type num-

bers. For example, 12 D 81�52 D 61C51.

� If n 2 E then n has a unique factorization as 2 times a product of e-type primes, up

to multiplication by units (in this case, every f-type prime divides n). For example,

20 D 2�65, 22 D 2�61, 2468 D 2�69�69�69.

� If n 2 F then n has a unique factorization as 5 times a product of f-type primes, up

to multiplication by units (in this case, every e-type prime divides n). For example,

50 D 5�52, 505 D 5�51�51.

Here are the analogous statements about divisors:

� if n 2 N , n has only finitely many divisors. If d divides n and u 2 U , then d�u

divides n. The divisors may be grouped into equivalence classes d�U . Since the sum

of the elements of U is zero, so is the sum of the divisors of n.

� if n 2 E , d divides n, u 2 U and v 2 F , then d�u C v divides n. So n has infinitely

many divisors, belonging to equivalence classes d�U C F .

� if n 2 F , d divides n, u 2 U and e 2 E , then d�u C e divides n. So n has infinitely

many divisors, belonging to equivalence classes d�U C E .

Any attempt to define a sum-of-divisors function must specify how to choose repre-

sentatives from the equivalence classes. There seems to be no natural way to do this. One

possibility would be to choose the smallest decimal number in each class, but this seems

unsatisfactory (since it depends on the ordering of decimal numbers, another concept the

islanders seem not to be familiar with).

i

i

“master” — 2012/11/15 — 11:23 — page 51 — #65i

i

i

i

i

i


Further number theory

In summary, we can help the Carryless Islanders by defining subtraction, prime numbers,

and factorization into primes. But further concepts such as the number of divisors, the sum

of divisors and perfect numbers seem to lie beyond these Islands.

However, many other carryless analogs are well-defined, including including triangu-

lar numbers (A169890), cubes (A169885), partitions (A169973), greatest common divi-

sors and least common multiples, and so on. Some seem exotic, while other familiar se-

quences simply become periodic. For example, the analog of the Fibonacci numbers co-

incides with the sequence of Fibonacci numbers read mod 10, A003893, which becomes

periodic with period 60 (the periodicity of the Fibonacci numbers to any modulus being

a well-studied subject, see sequence A001175). Similarly, the analogue of the powers of

2 (A000689) becomes periodic with period 4. We might also generalize beyond simple

squares, cubes, etc., and investigate the properties of polynomials or power series based on

carryless operations—How do these factor? What are their fixed points?—and so on.

Taking a different tack, carryless mod 10 partitions are enumerated in A169973, which

may be derived as the coefficients of zn in the formal expansion of the analog of the classic

partition generating functionQ1

kD1.1 C zk/, wherein powers of z are multiplied together

by combining their exponents with carryless mod 10 addition instead of the ordinary sum.

Afterword

There’s a great deal yet to be explored in these Carryless Islands! Watch for our next

paper on another carryless arithmetic, in which operations on single digits are defined by

a ˚ b D maxfa; bg, a ˝ b D minfa; bg. We call this “dismal arithmetic.”

When the Handbook of Integer Sequences was published 39 years ago, Martin Gardner

was kind enough to write in his Mathematical Games column of July 1974 that “every

recreational mathematician should buy a copy forthwith.” That book contained 2372 se-

quences: today its successor, the On-Line Encyclopedia of Integer Sequences (or OEIS)

[17], contains nearly 200,000 sequences. We were about to write to Martin about carryless

arithmetic when we heard the sad news of his death. This article, the first of a series on

various kinds of carryless arithmetic, is offered in his honor.

Bibliography

[1] Marcia Ascher, Mathematics Elsewhere: An Exploration of Ideas Across Cultures, Princeton

Univ. Press, Princeton, NJ, 2002.

[2] E. R. Berlekamp, J. H. Conway, and R. K. Guy, Winning Ways for Your Mathematical Plays,

A K Peters, Wellesley MA, 2nd ed., 4 vols, 2004.

[3] C. L. Bouton, Nim, a game with a complete mathematical theory, Ann. Math. 3 (1902) 35–39;

available at dx.doi.org/10.2307/1967631

[4] J. H. Conway, On Numbers and Games, Academic Press, NY, 1976.

[5] T. S. Ferguson, Some chip transfer games, Theoret. Comput. Sci. 191 (1998) 157–171; available

at dx.doi.org/10.1016/S0304-3975(97)00135-7

[6] J. A. Flanigan, NIM, TRIM and RIM, unpublished document, Mathematics Department, Uni-

versity of California at Los Angeles, 1980; available at

citeseer.ist.psu.edu/viewdoc/summary?doi=10.1.1.74.955.

dx.doi.org/10.2307/1967631

http://dx.doi.org/10.2307/1967631

dx.doi.org/10.1016/S0304-3975(97)00135-7

http://dx.doi.org/10.1016/S0304-3975(97)00135-7

citeseer.ist.psu.edu/viewdoc/summary?doi=10.1.1.74.955

http://citeseer.ist.psu.edu/viewdoc/summary?doi=10.1.1.74.955

i

i

“master” — 2012/11/15 — 11:23 — page 52 — #66i

i

i

i

i

i


[7] Martin Gardner, Nim and Tac Tix, The Scientific American Book of Mathematical Puzzles and

Diversions, Simon & Schuster NY, 1959.

[8] ——, Jam, Hot and Other Games, Mathematical Carnival, Vintage Books NY, 1977.

[9] ——, Nim and Hackenbush, Wheels, Life and Other Mathematical Amusements, W. H. Free-

man NY, 1983.

[10] ——, Sim, Chomp and Race Track, Knotted Doughnuts and Other Mathematical Entertain-

ments, W. H. Freeman NY, 1986.

[11] ——, Dodgem and Other Simple Games, Time Travel and Other Mathematical Bewilderments,

W. H. Freeman NY, 1988.

[12] ——, Wythoff’s Nim, Penrose Tiles to Trapdoor Ciphers : : : and the Return of Dr. Matrix,

W. H. Freeman NY, 1989.

[13] ——, Matches, Mathematical Circus, Mathematical Association of America, Washington DC,

revised ed., 1992.

[14] ——, The Rotating Table and Other Problems, Fractal Music, Hypercards and More : : : ; W. H.

Freeman NY, 1992.

[15] ——, Lavinia Seeks a Rule and Other Problems, The Last Recreations:Hydras, Eggs and Other

Mathematical Mystifications, Springer NY, 1997.

[16] ——, Surreal Numbers, The Colossal Book of Mathematics, W. W. Norton NY, 2001.

[17] The OEIS Foundation Inc., The On-Line Encyclopedia of Integer Sequences (2011); available

at oeis.org.

[18] O. Zariski and P. Samuel, Commutative Algebra, Van Nostrand NY, vol. I, 1958.

The Mad Tea Party Ride

Alice and friends begin a mathematical adventure (see next page).

oeis.org

http://oeis.org

i

i

“master” — 2012/11/15 — 11:23 — page 53 — #67i

i

i

i

i

i

9Mad Tea Party Cyclic Partitions

Robert Bekes, Jean Pedersen, and Bin Shao

Alice, the March Hare, the Hatter, and the Dormouse, were standing by the Mad Tea Party

(MTP) ride (see opposite page). “We’ll start with the same number of teacups as people,”

said the Hatter, bossy as usual. “The teacups are arranged in a circle, and each person sits

in his or her own teacup.”

“Won’t that be lonely?” objected Alice.

“Don’t worry,” replied the Hatter, “At the end of this ride, everyone stands up, one of

the teacups is removed, then everyone finds a new place to sit. Every teacup has to be

occupied by at least one person, so on the second ride there will be one teacup with two

people in it. After that ride, another cup will disappear and eventually, when there is just

one teacup, no one will be lonely.”

The rest said in unison, “You must be mad. Everyone will be squished and it will be

quite unbearable.”

“Not my problem,” the Hatter said, and walked away.

“Wait,” said Alice, who was a keen amateur mathematician, “If there are n people, how

many ways are there to fit them into k cups arranged in a circle, assuming, naturally, all

people are alike?”

She quickly wrote out the possibilities for n D 5 and k � 5 in Table 9.1, where

the actual partitions are in the second row and the number of partitions are in the third

row (consistent with the results in [7]). Alice explained to the group that the sum of the

numbers in the third row (which is 7) is the answer to her question. Then she noticed that

product of the numbers in the third row (which is 4) tells us the number of different paths

through the various partitions available at each stage. (This paper obtains both results, but

concentrates on the nature and number of the actual partitions.)

Table 9.1.

k D Number of cups 5 4 3 2 1

The listing of partitions for k cups 1 1 1 1 1 1 1 1 2 1 1 3 or 1 2 2 1 4 or 2 3 5

The number of paritions for k cups 1 1 2 2 1

But finding them became much tougher with more teacups. “This is hard,” the March

Hare said, “Martin Gardner wrote several times about partitions [5]–[7]. The scariest part,


53

i

i

“master” — 2012/11/15 — 11:23 — page 54 — #68i

i

i

i

i

i


for us, is in [7] where Gardner wrote:

Although there are many recursive procedures for counting unordered parti-

tions, using at each step the number of known partitions for all smaller num-

bers, an exact asymptotic formula was not obtained until recent times. The

big breakthrough was made by the British mathematician G. H. Hardy, work-

ing with his Indian friend Srinivasa Ramanujan. Their not quite exact for-

mula was perfected by Hans A. Rademacher in 1937. The Hardy-Ramanujan-

Rademacher formula is a horribly shaggy infinite series that involves (among

other things) pi, square roots, complex roots, and derivatives of hyperbolic

functions!

So, how can we find the answer to an even more difficult problem?”

Now Alice had read the paper-folding results of Hilton and Pedersen ([4, pp. 129–

136], [5, pp. 107–109], and [6]) and thought to apply their ideas. A recursive algorithm

would do. Though it wouldn’t give a closed form, it would be, conceptually, far sim-

pler than the Hardy-Ramanujan-Rademacher formula. “Sometimes it’s easier to solve a

more difficult problem and use that solution to answer the easier problem,” Alice re-

minded the others, “That’s what we should do here. The first thing we need to do is

give our cyclic arrangements a name and decide on an appropriate notation. Let’s call

them MTP partitions. Our problem is, how many MTP partitions are there for any positive

integer n?”

For notation we will write, as an example, .2; 1; 3/ to mean there are 2, 1, and 3 people

in three cups arranged clockwise in a circle beginning with a teacup containing 2 people.

Sometimes, when no confusion occurs, instead of .2; 1; 3/ we will simply write 2 1 3.

The March Hare commented that the partition .1; 3; 2/ must be the same as .1; 2; 3/,

because if you turned the MTP ride upside down they would be the same. “No way.” the

Hatter responded contemptuously, “Disney won’t turn the ride upside down. And if you

can’t do that, then these two partitions are different. People in teacups on the left and right

side of the teacup with 2 people in it are clearly different in these 2 cases.”

Notice that in Table 9.1 the partition 1 1 1 2 would be the same as 1 1 2 1, 1 2 1 1, or

2 1 1 1, because they are cyclic permutations of each other.

The modest Dormouse said, “All I know how to do is subtract odd numbers from odd

numbers, divide even numbers by 2, and add numbers that sum to less than 100.”

“That’s great!” said Alice, ”you have just the mathematical skills we need to tackle this

problem.” And she began to explain:

First, I’ll teach you, with some examples, how to construct coaches, and how they

constitute symbols. We always start with an odd number b and use all the odd numbers

ai , where ai < b2

. You’re going to wonder what on earth these symbols are about—

but I can’t take time to discuss that; you simply must consult [4]–[6] if you want to

know more.

Second, I’ll describe how to construct the coaches in symbols in purely mathematical

language. After more examples I’ll state two theorems about the coaches and symbols, give

a reconstruction algorithm, and observe some persistent patterns among coaches in two

special sequences. All this information gives us a non-recursive algorithm that answers our

main question: How many MTP partitions are there for any positive integer n?

i

i

“master” — 2012/11/15 — 11:23 — page 55 — #69i

i

i

i

i

i

9. Mad Tea Party Cyclic Partitions 55

Constructing coaches and symbols

We are going to show how to write a coach that involves an initial odd number b and odd

ai , such that ai < b2

. As an example, I’ll begin with b D 35. The possible values for ai

are 1; 3; 5; 7; 9; 11; 13; 15 and 17. We begin with the smallest value, which we call a1 and

write the following

b D 35

ˇˇˇˇ

a1 D 1:

Now subtract 1 from 35 and divide the answer (34) as many times as possible by 2 (in

this case, once), obtaining remainder 17. The number of factors of 2 that were divided out

(1) is written at the bottom of the first column. Then 17 is written at the top of next column.

This gives

b D 35

ˇˇˇˇ

a1 D 1 a2 D 17

k1 D 1:

We repeat this process (subtracting ai from b and dividing by factors of 2 until we get

an odd number) with a2; a3; : : : until the next value of ai is equal to the original a1 D 1 (in

this case), as must happen (see [4] or [5]). In the last step we subtract 3 from 35 and divide

the answer (32) as many times as possible by 2 (5 times), obtaining 1. We write 5 under

the 3 and stop, since 1 is our original number a1 for this coach. This completed verbose

coach is:

b D 35

ˇˇˇˇ

a1 D 1 a2 D 17 a3 D 9 a4 D 13 a5 D 11 a6 D 3

k1 D 1 k2 D 1 k3 D 1 k4 D 1 k5 D 3 k5 D 5

ˇˇˇˇ:

Eliminating the labels ai ; ki ; we have

b D 35

ˇˇˇˇ

1 17 9 13 11 3

1 1 1 1 3 5

ˇˇˇˇ:

In sum, the arithmetic for this coach is

35 � 1 D 34 D 21 � 17; k1 D 1; a2 D 17I35 � 17 D 18 D 21 � 9; k2 D 1; a3 D 9I35 � 9 D 26 D 21 � 13; k3 D 1; a4 D 13I35 � 13 D 22 D 21 � 11; k4 D 1; a5 D 11I35 � 11 D 24 D 23 � 3; k5 D 3; a6 D 3I35 � 3 D 32 D 25 � 1; k6 D 5; a7 D 1 D a1 .StopŠ/:

Now not all the odd numbers ai < 352

have appeared in the top row of this coach. So

we begin again, using the smallest unused ai (so we begin with a1 D 5), and construct the

next coach of the symbol obtaining:

b D 35

ˇˇˇˇ

5 15

1 2

ˇˇˇˇ:

The arithmetic for this coach is:

35 � 5 D 30 D 21 � 15; k1 D 1; a2 D 15I35 � 15 D 20 D 22 � 5; k2 D 2; a3 D 5 D a1 .StopŠ/:

i

i

“master” — 2012/11/15 — 11:23 — page 56 — #70i

i

i

i

i

i


But 7 still hasn’t appeared in the top row of a coach, so we begin again, with a1 D 7, and

construct the next coach of the symbol obtaining:

b D 35

ˇˇˇˇ

7

2

ˇˇˇˇ:

The arithmetic for this coach is:

35 � 7 D 28 D 22 � 7; k1 D 2; a2 D 7 D a1 .StopŠ/:

Now that all possible ai s have appeared in the top row of some coach we have all the

possible coaches. We combine them to obtain the complete symbol for b D 35:

b D 35

ˇˇˇˇ

1 17 9 13 11 3

1 1 1 1 3 5

ˇˇˇˇ

5 15

1 2

ˇˇˇˇ

7

2

ˇˇˇˇ:

In order to make our symbols unique we will always require that each coach have the

smallest available ai in the position of a1. In general, each coach of a symbol is written:

b

ˇˇˇˇ

a1 a2 � � � ar

k1 k2 � � � kr

ˇˇˇˇ; (?)

where b and ai are odd, with each ai < b2

, and

b � ai D 2ki aiC1; i D 1; 2; : : : ; r; arC1 D a1: (??)

Note that ki is maximal (that is, all powers of 2 have been factored from b � ai /, since

aiC1 must be odd.

In [4]–[6] it is shown that, given any two odd numbers a and b, with a < b2

, there is

always a completely determined, unique symbol consisting of coaches as determined by

(?) and .??); sometimes a symbol may consist of only one coach.

If for any i , gcd.b; ai / D 1, we say that the coach (?) is reduced, and if there are no

repeats among all the ai ’s we say that the coach (?) is contracted. At this stage, we do not

assume that our coaches are reduced, but we do assume they are contracted.

The Dormouse said, “Aren’t all coaches contracted because of the way we constructed

them?” “Yes,” said Alice, “but later we’ll have good reason to construct non-contracted

coaches.”

Alice produces examples and theorems

If a coach in a symbol is contracted, but not reduced (allowing gcd.b; ai / ¤ 1) we call that

coach and the symbol that contains it hybrid. If all the coaches in a symbol are contracted

and reduced we call it pure. Now let’s look at some examples (including b D 35) of pure

and hybrid symbols. “Look for patterns,” Alice instructed the Dormouse and the March

Hare. “See if you can anticipate the theorems. There is a special reason for singly- or

doubly-underlining some coaches and boxing some entries in Figure 9.1.” Can you see

why?

i

i

“master” — 2012/11/15 — 11:23 — page 57 — #71i

i

i

i

i

i


b D 23

ˇˇˇˇ

1 11 3 5 9 7

1 2 2 1 1 4

ˇˇˇˇ

b D 35

ˇˇˇˇ

1 17 9 13 11 3

1 1 1 1 3 5

ˇˇˇˇ

5 15

1 2

ˇˇˇˇ

7

2

ˇˇˇˇ

b D 41

ˇˇˇˇ

1 5 9

3 2 5

ˇˇˇˇ

3 19 11 15 13 7 17

1 1 1 1 2 1 3

ˇˇˇˇ

b D 57

ˇˇˇˇ

1 7 25

3 1 5

ˇˇˇˇ

3 27 15 21 9

1 1 1 2 4

ˇˇˇˇ

5 13 11 23 17

2 2 1 1 3

ˇˇˇˇ

19

1

ˇˇˇˇ

b D 63

ˇˇˇˇ

1 31

1 5

ˇˇˇˇ

3 15

2 4

ˇˇˇˇ

5 29 17 23

1 1 1 3

ˇˇˇˇ

7

3

ˇˇˇˇ

9 27

1 2

ˇˇˇˇ

11 13 25 19

2 1 1 2

ˇˇˇˇ

21

1

ˇˇˇˇ

b D 65

ˇˇˇˇ

1

6

ˇˇˇˇ

3 31 17

1 1 4

ˇˇˇˇ

5 15 25

2 1 3

ˇˇˇˇ

7 29 9

1 2 3

ˇˇˇˇ

11 27 19 23 21

1 1 1 1 2

ˇˇˇˇ

13

2

ˇˇˇˇ

b D 91

ˇˇˇˇ

1 45 23 17 37 27

1 1 2 1 1 6

ˇˇˇˇ

3 11 5 43

3 4 1 4

ˇˇˇˇ

7 21 35

2 1 3

ˇˇˇˇ

ˇˇˇˇ

9 41 25 33 29 31 15 19

1 1 1 1 1 2 2 3

ˇˇˇˇ

13 39

1 2

ˇˇˇˇ

Figure 9.1. Examples of symbols.

Did you notice that:

� Where the symbol has no underlined coaches, b is prime?

� The underlined coaches are not reduced, furthermore there is a common divisor

(other than 1) between all of the numbers in the top row (the ai ) and b?

� Each of the boxed numbers divides into b and all the ai for that coach?

� In the doubly-underlined hybrid coaches the sum of the bottom row is a proper divi-

sor of the sum of the bottom row for the pure coaches?

If we eliminate the underlined coaches in Figure 9.1, those remaining constitute pure

symbols (gcd.b; ai / D 1). For a given b what do you notice about the sum of the ki s in

the coaches of the pure symbols? For a given b what is r (the number of columns) in each

pure coach? These are crucial questions to attempt to answer before going on.

Your observations may help you to understand the following (proved in [4]–[6]):

The Quasi-Order Theorem. Suppose that a coach of a pure symbol has the form (?),

and formula (??) holds. LetPr

iD1 ki D k. Then k is the smallest positive integer such that

2k � ˙1 .mod b/. In fact, 2k � .�1/r .mod b/.

In other words the sum of the numbers, ki , in the bottom row of any coach of a pure

symbol is the smallest number, k, such that b exactly divides 2k � .�1/r .

For the pure symbols in Figure 9.1 the Quasi-Order Theorem results are in Table 9.2.

Notice that the conclusion of the Quasi-Order Theorem can be obtained from any coach

of the pure symbol since k is the same and r is always either even or odd.

i

i

“master” — 2012/11/15 — 11:23 — page 58 — #72i

i

i

i

i

i


Table 9.2.

b 23 35 41 57 63 65 91

k 11 12 10 9 6 6 12

r 6 6 3 or 7 3 or 5 2 or 4 1; 3; or 5 4; 6; or 8

2k � .1/r 4047 4096 1025 513 63 65 4095

Here is another pertinent Hilton-Pedersen theorem [4] connected with the Quasi-Order

Theorem.

The Coach Theorem. Let � be the Euler phi function. In a complete, pure symbol, withPr

iD1 ki D k, in .?/, and c the number of coaches, we have �(b/ D 2kc.

Recall that if p is a prime then �.pn/ D .p � 1/pn�1, and if p1, p2 are distinct prime

numbers then �(pn1

1 � pn2

2 / D �.pn1

1 /�.pn2

2 ). The data confirming that �.b/ D 2kc for

the pure symbols in Figure 9.1 is in Table 9.3.

Table 9.3.

b 23 35 41 57 63 65 91

k 11 12 10 9 6 6 12

c 1 1 2 2 3 4 3

�.b/ 22 24 40 36 36 48 72

The Dormouse complained bitterly that Alice had said they knew enough arithmetic to

work this problem and now she was describing unbelievable things—things about divisi-

bility, primes, and other stuff. Alice said that she had once been told by the White Queen

that, “Why, sometimes I’ve believed as many as six impossible things before breakfast.”

Nevertheless Alice told them that if they didn’t want to apply themselves by thinking about

extraordinary things they should take a walk and return later. The Dormouse left and Alice

began to explain to the March Hare the

Reconstruction Algorithm. Given .k1, k2, : : : ; kr/, one can recover b, and a1, a2, : : : ;

ar , to obtain the coach .?/, which may be pure or hybrid.

The proof for pure coaches is in [4]. Here we present an example of a particular, but not

special, case that should enable you to reproduce the process for yourself in other instances.

Suppose you are given k1 D 2; k2 D 1; k3 D 3; and you are asked to produce a coach

of the form

b

ˇˇˇˇ

a1 a2 a3

2 1 3

ˇˇˇˇ:

By .??/ we have the three simultaneous linear equations in a1; a2; a3:

b � a1 D 22 � a2; b � a2 D 21 � a3; b � a3 D 23 � a1:

Alice solved this system of linear equations using Cramer’s rule, evaluating the de-

terminants by a condensation method invented by Charles Dodgson (Lewis Carroll) ([8];

i

i

“master” — 2012/11/15 — 11:23 — page 59 — #73i

i

i

i

i

i


[9, pp. 118–119]). She told the March Hare to pay particular attention to the form of the

solution compared with the numbers in the original problem:

a1

bD 26�3 � 26�3�1 C 26�3�1�2

26 � .�1/3D 23 � 22 C 20

26 C 1D 5

65; so a1 D 5; b D 65I

a2

bD 26�2 � 26�2�3 C 26�2�3�1

26 � .�1/3D 24 � 21 C 20

26 C 1D 15

65; so a2 D 15; b D 65I

a3

bD 26�1 � 26�1�2 C 26�1�2�3

26 � .�1/3D 25 � 23 C 20

26 C 1D 25

65; so a3 D 25; b D 65:

The March Hare checked this result by constructing the coach of the hybrid symbol

for b D 65, starting with a1 D 5, getting the following coach, which satisfies the original

data:

b D 65

ˇˇˇˇ

5 15 25

2 1 3

ˇˇˇˇ:

But, as Alice noticed immediately, all the b’s are the same and you really didn’t need to

solve for a2, or a3, because once you had the solution for a1 and b you could then construct

the symbol to get the remaining values for ai .

“To obtain a coach for a pure symbol with the given values of ki ;” Alice explained, “all

you need to do is reduce each of the fractions 565

D 113

; 1565

D 313

, and 2565

D 513

. Then,

from the first reduced fraction you see that you can begin with b D 13 and a1 D 1 and

construct the following coach, which also satisfies the original data:

b D 13

ˇˇˇˇ

1 3 5

2 1 3

ˇˇˇˇ:”

The Dormouse and the Hatter returned at this point. Alice convinced them to construct

systematically some (possibly) hybrid symbols involving b D 2n � 1, and b D 2n C 1.

Persistent coach patterns

The Dormouse set to work, constructing the symbols in Figure 9.2, with only doubly-

underlined markings, while Alice, the Hatter, and the March Hare looked for patterns.

As soon as the last symbol was written down the March Hare said, “For all n � 3 it

looks as if when b D 2n � 1 and a1 D 1, we have a persistent coach pattern

2n � 1

ˇˇˇˇ

1 2n�1 � 1

1 n � 1

ˇˇˇˇ:”

Alice proved it using .??/.

Not to be outdone, the Hatter said, “For n � 3, except for n D 4, every coach with

b D 2n � 1 that begins with a1 D 3 always has the form

2n � 1

ˇˇˇˇ

3 2n�2 � 1

2 n � 2

ˇˇˇˇ:” (~n)

i

i

“master” — 2012/11/15 — 11:23 — page 60 — #74i

i

i

i

i

i


22 � 1 D 3

ˇˇˇˇ

1

1

ˇˇˇˇ

23 � 1 D 7

ˇˇˇˇ

1 3

1 2

ˇˇˇˇ

24 � 1 D 15

ˇˇˇˇ

1 7

1 3

ˇˇˇˇ

3

2

ˇˇˇˇ

5

1

ˇˇˇˇ

25 � 1 D 31

ˇˇˇˇ

1 15

1 4

ˇˇˇˇ

3 7

2 3

ˇˇˇˇ

5 13 9 11

1 1 1 2

ˇˇˇˇ

26 � 1 D 63

ˇˇˇˇ

1 31

1 5

ˇˇˇˇ

3 15

2 4

ˇˇˇˇ

5 29 17 23

1 1 1 3

ˇˇˇˇ

7

3

ˇˇˇˇ

9 27

1 2

ˇˇˇˇ

11 13 25 19

2 1 1 2

ˇˇˇˇ

21

1

ˇˇˇˇ

Figure 9.2. The Dormouse’s symbols.

The cautious Dormouse began to check the Hatter’s work. When n D 3, in (~n) he got the

coach

23 � 1 D 7

ˇˇˇˇ

3 1

2 1

ˇˇˇˇ;

which is the same as

7

ˇˇˇˇ

1 3

1 2

ˇˇˇˇ

because it is cyclic. But when he substituted n D 4 in (~n) he got a coach no one had ever

seen before (since they had always stopped when a repeat occurred). It looked like this:

24 � 1 D 15

ˇˇˇˇ

3 3

2 2

ˇˇˇˇ: (~4)

At the same time the Hatter came up with another coach, valid for all n � 5, except for

n D 6 using b D 2n � 1 and a1 D 9 in (??),

2n � 1

ˇˇˇˇ

9 2n�1 � 22 � 1 2n�2 � 2n�3 C 1 2n�1 � 2n�3 C 2n�4 � 1

1 2 1 n � 4

ˇˇˇˇ: (�n)

By substituting n D 6 into (�n), the group obtained:

26 � 1

ˇˇˇˇ

9 27 9 27

1 2 1 2

ˇˇˇˇ: (�6)

Both (~4) and (�6) caused great puzzlement. They could see that these coaches were

not contracted, but they really didn’t want to reject coaches that fit the persistent coach

patterns (~n/ and (�n/.

“Maybe these curious coaches give us a clue that we shouldn’t always stop at the first

repeat in a contracted symbol,” Alice suggested, “Maybe we should go on until the bottom

row adds up to n.”

The group studied (~n), (~4), (�n), and (�6) (as the reader should also), along with

other coaches with a fixed a1 where b is always 2n � 1, or 2n C 1. They discovered that

i

i

“master” — 2012/11/15 — 11:23 — page 61 — #75i

i

i

i

i

i


in each sequence of coaches, every time a new number ai enters the top row of a coach

it represents the beginning of yet another persistent coach pattern, where if ai is the first

element in the top row, the ki values all remain the same, except for kr (the last entry in the

bottom row) which will always increase by 1 every time n increases by 1. To see another

example of this look at the form of the coaches where b D 2n C 1, and a1 D 3, for n D 4,

5, and 6. It turns out that if a coach, with fixed a1 where b is always 2n �1 or 2nC1, is such

that the k value for that coach is less than n then the process of duplicating the columns in

that coach until the resultant value of k in that coach equals n, will always make that coach

appear as it would in some persistent coach sequence without being contracted.

Now the Dormouse asked Alice when they were going to get to the main problem.

Alice said “That’s next! We have everything we need to answer the question now.”

The big payoff—A non-recursive algorithm for finding the

number of MTP partitions

Let’s begin with n D 5. Because we know the MTP partitions in this case (Table 9.1) it will

give us a plausibility check. Alice instructed them to look at the following two, (possibly)

hybrid, symbols:

b D 25 � 1 D 31ˇˇˇ1 15

1 4

ˇˇˇ3 7

2 3

ˇˇˇ5 13 9 11

1 1 1 2

ˇˇˇ;

b D 25 C 1 D 33ˇˇˇ1

5

ˇˇˇ

3 15 9

1 1 3

ˇˇˇ5 7 13

2 1 2

ˇˇˇ

11

1

ˇˇˇ:

Because of the form of b in these two cases, we know in advance (by the Quasi-Order

Theorem) that if the symbol is reduced and contracted the value of k for each coach is 5.

In one coach the value of k is less than 5. However, in that deficient case it is possible to

repeat columns until the sum of entries in the bottom row sums to 5. For our purposes, we

only need to know the duplicate entries in the bottom row.

Thus, in bottom of the fourth coach for b D 33 we repeat (1) to get 1 1 1 1 1. Having

done this, you can simply record the numbers that appear in the bottom row of the various

coaches to obtain all the MTP partitions of 5. The partitions contributed by b D 31 are

all the even MTP partitions of 5. The values contributed by b D 33 are all the odd MTP

partitions of 5. In Table 9.4 we record the MTP partitions by reading across the bottom row

of the coaches, repeating the doubly-underlined entries where appropriate.

Table 9.4. MTP partition for n D 5.

Even MTP partitions from the coaches of b D 31 1 4, 2 3, 1 1 1 2

Odd MTP partitions from the coaches of b D 33 5, 1 1 3, 2 1 2, 1 1 1 1 1

There were, in fact, exactly the same partitions that they had found earlier, even though

some were in a different cyclic order. In this case if each partition is written in ascending

order there are no repeats so these yield the 7 unordered partitions for n D 5.

Then the Dormouse asked, “How can we be sure we have all the MTP partitions?”

i

i

“master” — 2012/11/15 — 11:23 — page 62 — #76i

i

i

i

i

i


Alice explained, “This is easy! If you think one is missing, just tell me which one it is.

By the reconstruction algorithm, I can construct the coach having that partition. But, we

know such a coach must already exist because in [4]–[6] it was shown that (??) defines a

unique permutation of the ai s and our symbol contains all of the ai s. Finally, it turns out

that, if the coach connected with a set of the ai s involves a repeat of the columns then this

coach must fit one of the persistent coach patterns. This means we must have all the MTP

partitions!” She then summarized their algorithm for finding MTP partitions, for a given n:

1. Construct the (possibly) hybrid symbol for b D 2n � 1 and for b D 2n C 1.

2. In any hybrid coaches for either symbol repeat the bottom row, if necessary, so that

the k value for that coach is n.

3. From the bottom row of the coaches record the MTP partitions.

Example with n D 8

That’s it! The bottom row of the coaches in the symbols for b D 2n � 1, or b D 2n C 1, are

the even or odd, MTP partitions, respectively. And, you can always write each of the MTP

partitions in descending order, strike out the duplicates, and obtain the unordered partitions

of n. This algorithm will work for any n—and a computer can be programmed to produce

the symbols.

The March Hare, the Hatter, and the Dormouse were really into the process now. They

decided to try it for n D 8. Figures 9.3 and 9.4 give the (possibly hybrid) symbols for

b D 28 � 1 D 255 and b D 28 C 1 D 257, as calculated by the Dormouse.

255

ˇˇˇˇ

1 127

1 7

ˇˇˇˇ

3 63

2 6

ˇˇˇˇ

5 125 65 95

1 1 1 5

ˇˇˇˇ

7 31

3 5

ˇˇˇˇ

9 123 33 111

1 2 1 4

ˇˇˇˇ

ˇˇˇˇ

11 61 97 79

2 1 1 4

ˇˇˇˇ

13 121 67 47

1 1 2 4

ˇˇˇˇ

15

4

ˇˇˇˇ

17 119

1 3

ˇˇˇˇ

19 59 49 103

2 2 1 3

ˇˇˇˇ

ˇˇˇˇ

21 117 69 93 81 87

1 1 1 1 1 3

ˇˇˇˇ

23 29 113 71

3 1 1 3

ˇˇˇˇ

25 115 35 55

1 2 2 3

ˇˇˇˇ

ˇˇˇˇ

27 57 99 39

2 1 2 3

ˇˇˇˇ

37 109 73 91 41 107

1 1 1 2 1 2

ˇˇˇˇ

43 53 101 77 89 83

2 1 1 1 1 2

ˇˇˇˇ

ˇˇˇˇ

45 105 75

1 1 2

ˇˇˇˇ

51

2

ˇˇˇˇ

85

1

ˇˇˇˇ

Figure 9.3.

In Table 9.5 they recorded the MTP partitions from the bottom row of the coaches,

repeating the doubly-underlined coaches as appropriate. Ordering the numbers in each of

these 35 MTP partitions and striking out the duplicates they verified that the number of

ordinary, unordered partitions for n D 8 is 22.

i

i

“master” — 2012/11/15 — 11:23 — page 63 — #77i

i

i

i

i

i


257

ˇˇˇˇ

1

8

ˇˇˇˇ

3 127 65

1 1 6

ˇˇˇˇ

5 63 97

2 1 5

ˇˇˇˇ

7 125 33

1 2 5

ˇˇˇˇ

9 31 113

3 1 4

ˇˇˇˇ

11 123 67 95 81

1 1 1 1 4

ˇˇˇˇ

ˇˇˇˇ

13 61 49

2 2 4

ˇˇˇˇ

15 121 17

1 3 4

ˇˇˇˇ

19 119 69 47 105

1 1 2 1 3

ˇˇˇˇ

21 59 99 79 89

2 1 1 1 3

ˇˇˇˇ

ˇˇˇˇ

23 117 35 111 73

1 2 1 1 3

ˇˇˇˇ

25 29 57

3 2 3

ˇˇˇˇ

27 115 71 93 41

1 1 1 2 3

ˇˇˇˇ

ˇˇˇˇ

37 55 101 39 109

2 1 2 1 2

ˇˇˇˇ

43 107 75 91 83 87 85

1 1 1 1 1 1 2

ˇˇˇˇ

ˇˇˇˇ

45 53 51 103 77

2 2 1 1 2

ˇˇˇˇ

Figure 9.4.

As we leave our friends the Dormouse went off to buy a small Mickey Mouse hand

calculator so that he could more easily do the MTP calculations for n D 9.

Table 9.5. MTP partitions for n D 8.

Even MTP partitions

from the

coaches of Figure 3

1 7, 2 6, 1 1 1 5, 3 5, 1 2 1 4, 2 2 1 4, 1 1 2 4, 4 4, 1 3 1 3, 2 2 1 3

1 1 1 1 1 3, 3 3 1 3, 1 2 2 3, 2 1 2 3, 1 1 1 2 1 2, 2 1 1 1 1 2,

1 1 2 1 1 2, 2 2 2 2, 1 1 1 1 1 1 1 1

Odd MTP partitions

from the

coaches of Figure 4

8, 1 1 6, 2 1 5, 1 2 5, 3 1 4, 1 1 1 1 4, 2 2 4, 1 3 4, 1 1 2 1 3

2 1 1 1 3, 1 2 1 1 3, 3 2 3, 1 1 1 2 3, 2 1 2 1 2, 1 1 1 1 1 1 2, 2 2 1 1 2

Acknowledgment The authors are grateful to Don Albers, Gerald L. Alexanderson, Vic-

tor Garcia, Jennifer Hooper, Geoffrey Shephard, and Robin Wilson for reading earlier ver-

sions of this manuscript and offering valuable suggestions for improving the text; and we

also thank Veronica Garcia for producing the cartoon.

The authors accept full responsibility for any typos that may have escaped our scrutiny—

but we blame any arithmetical errors on the Dormouse who tended to nod off doing the

longer calculations.

Bibliography

[1] M. Gardner, Fractal Music, Hypercards, and More Mathematical Recreations from Scientific

American Magazine, W. H. Freeman, 1992, 24–38.

[2] ——, Mathematical Carnival, Mathematical Association of America, 1989, 137.

[3] ——, The Last Recreations: Hydras, Eggs, and Other Mathematical Mystifications, Copernicus

Series, Springer, NY, 1997, 37–42.

[4] P. Hilton, D. Holton, and J. Pedersen, Mathematical Reflections—In Room with Many Mirrors,

Second printing, Undergraduate Texts in Mathematics, Springer, NY, 1998.

[5] P. Hilton and J. Pedersen, A Mathematical Tapestry: Demonstrating the Beautiful Unity of Math-

ematics, Cambridge Univ. Press, 2010.

i

i

“master” — 2012/11/15 — 11:23 — page 64 — #78i

i

i

i

i

i


[6] P. Hilton, J. Pedersen, and B. Walden, A property of complete symbols: An ongoing saga con-

necting geometry and number theory, Homage to a Pied Puzzler, A K Peters, 2009, 47–50.

[7] A. Knopfmacher and N. Robbins, Some properties of cyclic compositions, Fibonacci Quart. 48

(2010) 249–255.

[8] A. Rice and E. Torrence, “Shutting up like a Telescope,” Lewis Carroll’s “curious” condensation

method for evaluating determinants, College Math. J. 38 (2007) 85–95.

[9] R. Wilson, Lewis Carroll in Numberland: His Fantastical Mathematical Logical Life, W. W.

Norton, NY, 2010.

i

i

“master” — 2012/11/15 — 11:23 — page 65 — #79i

i

i

i

i

i

10The Continuing Saga of Snarks

sarah-marie belcastro

Way back in 1852, Francis Guthrie conjectured that every map drawn on the plane can be

colored so that regions sharing a border have different colors—and only four colors are

necessary. This became known as the Four Color Conjecture. In 1880, Tait [16] proved

that the Four Color Conjecture is equivalent to a problem of edge coloring graphs. This is

where our story begins, because the study of snarks grew from exactly this edge-coloring

problem. To avoid confusion, we note that there is no relationship between the English

word ‘snark’ (or its derivatives snarky, snarkiness, etc.) and the mathematical term; in fact,

the origin of the latter is a pivotal point in our story.

Recall that a graph is a collection of nodes, called vertices, and a collection of connec-

tions between these nodes, called edges. You may also remember that a graph is planar if

it can be drawn in the plane with no edge-crossings, at which point the regions of the plane

demarcated by the graph are called faces. You are less likely to know that a bridgeless

graph has no edge whose deletion disconnects the graph, that a 3-regular graph has three

edges meeting at every vertex, and that a proper edge coloring assigns a color to every

edge of a graph such that no two edges meeting at a vertex have the same color. A graph to

which all three of these terms apply is shown in Figure 10.1.

Figure 10.1. A bridgeless, 3-regular, properly 3-edge colored planar graph, but not a snark.

In this language, Tait showed that the Four Color Conjecture holds if and only if every

bridgeless, 3-regular, planar graph can be properly edge colored using only three colors.

That is, instead of coloring regions of maps, one can approach the Four Color Conjecture

by coloring edges of bridgeless, 3-regular, planar graphs. As Martin Gardner pointed out

in his 1976 introduction to this subject [9], it follows that the search for a counterexample


65

i

i

“master” — 2012/11/15 — 11:23 — page 66 — #80i

i

i

i

i

i


to the Four Color Conjecture is really a search for a bridgeless, 3-regular, planar graph that

is not 3-edge colorable.

In Gardner’s language, this was a search for a nontrivial (no bridges or cycles of lengths

4 or less), uncolorable (using only three colors), trivalent (another word for 3-regular)

planar graph. Because “Nontrivial Uncolorable Trivalent” is a mouthful, Gardner sought a

shorter name. He considered ‘NUT’, decided it wasn’t serious enough, and then proposed

‘Snark’ from the Lewis Carroll poem The Hunting of the Snark—which does not seem, at

least to this author, to be a more serious name. (Carroll invented the name Snark, but his

poetic use was not related to the already existing English noun, or, it seems, to the verb

and adjective that came later [15].) The name stuck, and has invigorated all subsequent

discourse on the topic.

Classical snarkiness

We now reveal the full technical definition.

Definition. A snark is a bridgeless, 3-regular graph that

� is not edge colorable using only three colors,

� has smallest cycle length (girth) at least 5, and

� is cyclically 4-edge connected, meaning that at least 4 edges must be removed in

order to separate the graph into two components that each contain a cycle.

Figure 10.2. The Petersen graph, BlanuLsa-1, and BlanuLsa-2.

The three smallest snarks are shown in Figure 10.2. In 1976, Gardner gave several

examples both of individual snarks and of infinite families of snarks [9]. Much of this

column was a translation into lay language of an excellent paper by Isaacs [11], published

the year before and still cited in modern research. As with many of his columns, Gardner’s

writing on snarks was exciting partly because he gave puzzle-like exercises, some of which

led toward proofs!

Martin Gardner’s showcasing (and naming) of snarks is only one part of their story. Ear-

lier, in 1968, Branko Grunbaum [10] posed a generalization of the Four Color Conjecture

that can be phrased in terms of snarks. To understand it, we need a few more definitions.

A planar graph should really be drawn on a sphere to avoid the unbounded face present

in a planar drawing. Focusing on the faces of a graph takes us beyond ordinary graph theory

i

i

“master” — 2012/11/15 — 11:23 — page 67 — #81i

i

i

i

i

i

10. The Continuing Saga of Snarks 67

into topological graph theory. The topology referred to here is the topology of surfaces such

as the sphere, torus, and Mobius band. (The lovely classification theorem for surfaces is

presented superbly in [8].)

(a) (b)

Figure 10.3. The complete graph K5 is nonplanar (a) but does embed on the torus (b).

In Figure 10.3b we show a small nonplanar graph embedded (drawn without edges

crossing) on the torus, as well as a way of representing the torus that allows us to avoid the

use of dotted lines for unseen parts of edges. The arrows indicate gluing; when an edge of a

graph goes off the boundary of the rectangle, it returns on the matching arrowed boundary

segment. It is quite an enjoyable exercise to embed the Petersen graph on a torus.

We only consider cellular embeddings, in which every face is a topological disk. (In

other words, faces cannot have punctures or doughnut holes.) A graph is polyhedrally

embeddable on a surface if it can be drawn so that no face uses an edge twice and no two

faces share more than one edge. Figure 10.3b depicts a cellular embedding, but because

some edges do not separate distinct faces, the embedding is not polyhedral.

Grunbaum conjectured that every 3-regular graph polyhedrally embeddable on an ori-

entable surface (a sphere, torus, or n-holed torus) is three-edge colorable. Phrased in terms

of snarks, this says that no snark has a polyhedral embedding on any orientable surface.

When Martin Gardner published [9], there had been no progress on Grunbaum’s Conjec-

ture, but a few months later, Appel and Haken turned the Four Color Conjecture into the

Four Color Theorem [2], [3]. They did not use edge coloring in their proof (for an excel-

lent exposition of the saga, see [20]), and so by Tait’s theorem we obtain, as a corollary of

the Four Color Theorem, that every bridgeless 3-regular planar graph is 3-edge colorable;

equivalently, no snark embeds on a sphere.

It’s not just the connection to the Four Color Theorem that makes snarks interesting.

Vizing’s theorem (proved only a few years before Grunbaum made his conjecture) implies

that every 3-regular graph needs at most 4 colors to properly color its edges. Of course, at

least 3 colors are needed, so there is not much wiggle room in the land of edge coloring.

Snarks are simultaneously barely not 3-edge colorable, and as difficult to edge-color as

possible.

Modern snarkiness

For a long time, the main research goal was to find snarks, as very few were known. In 1950

there were only four known snarks, but, during the 1970s, five infinite families of snarks

and about five individual snarks (not in any families) were found [7]; some are pictured

i

i

“master” — 2012/11/15 — 11:23 — page 68 — #82i

i

i

i

i

i


Figure 10.4. Six sample snarks from the seventies (drawn by Mathematica).

in Figure 10.4. Twenty years later, the computer age made it possible to generate snarks,

so in the 1990s, most work on snarks took a structural approach. These results are highly

technical; thus, we give only some flavor of the ideas.

First, we consider ways to break down snarks. If one can remove some set of k edges

from a snark so that the remaining graph is a disconnected collection of subgraphs of

snarks, then the original snark is considered reducible, and if this is not possible, the snark

is irreducible. Many snarks are reducible in this sense, and irreducible snarks have the

property that removal of two vertices leaves a 3-edge colorable graph. By definition, a

snark cannot have a Hamilton circuit (a circuit containing all vertices), as otherwise it

would be 3-edge colorable (2-edge color the circuit and give the remaining edges a third

color). However, many snarks almost have Hamilton circuits in the sense that they have

circuits including all but one vertex. And, some snarks have the stronger property that

leaving out any one vertex, the remainder of the graph forms a circuit. All such snarks are

irreducible [6].

Tutte conjectured in 1966 that every snark contains a subgraph that is a Petersen graph

but subdivided with extra vertices [18]. This can be rephrased by saying that every snark

has a Petersen minor. Robertson, Sanders, Seymour, and Thomas announced a proof in

1999, much of which is yet to be published [17, p.14]. Because the Petersen graph occupies

such a revered position in graph theory, the fact that every snark has a Petersen minor is in

some sense the ultimate structural result.

In contrast to the structural approach is the topological approach. From this perspec-

tive, we are interested in knowing on which surface(s) a given snark embeds, and with

what type of embedding. This essentially amounts to investigating Grunbaum’s Conjec-

ture. Aside from the resolution of the Four Color Conjecture in 1976—proved using a

structural approach!—no serious progress was made on Grunbaum’s Conjecture for more

than 30 years after it was posed. The graph theory community was divided on whether the

conjecture was true, with some leading figures taking opposite sides of the issue at a 2003

conference.

i

i

“master” — 2012/11/15 — 11:23 — page 69 — #83i

i

i

i

i

i


Very little was known about embedding properties of snarks until the 2000s, though

thousands of snarks were tested by computer and proved not to be polyhedrally embed-

dable on any orientable surface [14], [4] (and therefore none provided a counterexample

to Grunbaum’s Conjecture). It was only known that the Petersen graph and one of the two

Blanusa snarks embed on the torus. In 2004, several infinite families of snarks were ex-

hibited that embed on the torus [5], [19] and on the 2-holed torus [5] (none polyhedrally).

Because they are pretty, two seed snarks for these families are shown in Figure 10.5.

1 5

3 4 6

82

7 11 15

1 14 16

1812

179

10

19 23

21 22 24

2620

25

1 5

3 4 6

82

7 11 15

13 14 16

1812

179

10

19

23

21

22

2426

20

25

Figure 10.5. The smallest snarks in two distinct infinite families of toroidal snarks.

That same year, it was proved that for each number of Mobius-band-like twists g �1, there exists a snark with a polyhedral embedding on the corresponding nonorientable

surface [14]. (Whereas orientable surfaces are classified by the number of torus holes they

have, nonorientable surfaces are classified by the number of Mobius-band-like twists they

have.) Both of these results provided evidence in favor of Grunbaum’s Conjecture.

At a conference late in 2006, Grunbaum himself was asked whether he still thought

his eponymous conjecture was true. He demurred, but noted that saying it should be true

caused more people to work on it! Then, in 2007 everything blew up. Counterexamples to

Grunbaum’s Conjecture were found on the 9-holed torus, and then shortly thereafter, on the

5-holed torus [13] using snarks exhibited in [5]. (In fact, infinitely many counterexamples

were found! The powerful superposition technique of [12] did the trick.) At the same time,

Grunbaum’s Conjecture was affirmed for most three-regular graphs embedded on the torus

[1]. While this deflates an inspiring balloon, a smaller (and perhaps more manageable?)

balloon still exists: What is true about Grunbaum’s Conjecture restricted to n-holed tori for

i

i

“master” — 2012/11/15 — 11:23 — page 70 — #84i

i

i

i

i

i


1 � n � 4? While perhaps all of mathematical research is about refining the boundaries

between different categories of objects, it somehow seems that this is especially true for

snarks. It often feels that doing research on snarks is like honing the edge of a knife used to

separate two crumbly loaves of bread. Much, much more can (and should!) be said about

snarks. The interested reader is encouraged to consult the references and contact the author

for further information!

Bibliography

[1] M. O. Albertson, H. Alpert, s-m. belcastro, and R. Haas, Grunbaum colorings of toroidal

triangulations, J. Graph Theory 63 (2010), 68–81; available at dx.doi.org/10.1002/

jgt.20406

[2] K. Appel, W. Haken, and J. Koch, Every planar map is four colorable, I: Discharging, Illinois

J. Math. 21 (1977) 429–490.

[3] K. Appel and W. Haken, Every planar map is four-colorable, II: Reducibility, Illinois J. Math.

21 (1977) 491–567.

[4] D. Archdeacon, Problems in Topological Graph Theory: Three-Edge-Coloring Orientable

Triangulations (1995); available at www.emba.uvm.edu/˜archdeac/problems/

grunbaum.htm.

[5] s.-m. belcastro and J. Kaminski, Families of dot-product snarks on orientable surfaces of

low genus, Graphs Combin. 23 (2007), 229–240; available at dx.doi.org/10.1007/

s00373-007-0729-9.

[6] A. Cavicchioli, T. E. Murgolo, B. Ruini, and F. Spaggiari, Special classes of snarks, Acta Appl.

Math. 76 (2003), 57–88; available at dx.doi.org/10.1023/A:1022864000162.

[7] A. G. Chetwynd and R. J. Wilson, Snarks and supersnarks, in The Theory and Applications of

Graphs (Kalamazoo, Mich., 1980), Wiley, New York, 1981, 215–241.

[8] G. K. Francis and J. R. Weeks, Conway’s ZIP proof. Amer. Math. Monthly 106

(1999), 393–399; available at dx.doi.org/10.2307/2589143; also available at

new.math.uiuc.edu/zipproof/.

[9] M. Gardner, Mathematical games, Scientific American, April 1976, 126–130.

[10] B. Grunbaum, Conjecture 6, in Recent Progress in Combinatorics, W.T. Tutte, ed., Academic

Press, New York, 1969, 343.

[11] R. Isaacs, Infinite families of nontrivial trivalent graphs which are not Tait colorable, Amer.

Math. Monthly 82 (1975), 221–239; available at dx.doi.org/10.2307/2319844.

[12] M. Kochol, Snarks without small cycles, J. Combin. Theory Ser. B 67 (1996), 34–47; available

at dx.doi.org/10.1006/jctb.1996.0032.

[13] ——, Polyhedral embeddings of snarks in orientable surfaces, Proc. of the Amer. Math.

Soc., 137 (2009), 1613–1619; available at dx.doi.org/10.1090/S0002-9939-

08-09698-6.

[14] B. Mohar and A. Vodopivec, On polyhedral embeddings of cubic graphs, Combin. Prob. Com-

put. 15 (2006), 877–893; available at dx.doi.org/10.1017/S0963548306007607.

[15] snark, v. and n. The Oxford English Dictionary, 2nd ed., Oxford Univ. Press, 1989.

[16] P. G. Tait, Note on a theorem in geometry of position. Trans. Roy. Soc. Edinburgh 29 (1880),

657–660.

[17] R. Thomas, Recent excluded minor theorems for graphs, Surveys in combinatorics, 1999 (Can-

terbury), London Math. Soc. Lecture Note Ser., 267, Cambridge Univ. Press, Cambridge, 1999,

201–222; available at people.math.gatech.edu/˜thomas/PAP/bcc.pdf.

dx.doi.org/10.1002/

http://dx.doi.org/10.1002/jgt.20406

jgt.20406

http://dx.doi.org/10.1002/jgt.20406

www.emba.uvm.edu/~archdeac/problems/

http://www.emba.uvm.edu/~archdeac/problems/grunbaum.htm

grunbaum.htm

http://www.emba.uvm.edu/~archdeac/problems/grunbaum.htm

dx.doi.org/10.1007/

http://dx.doi.org/10.1007/s00373-007-0729-9

s00373-007-0729-9

http://dx.doi.org/10.1007/s00373-007-0729-9

dx.doi.org/10.1023/A:1022864000162

http://dx.doi.org/10.1023/A:1022864000162

dx.doi.org/10.2307/2589143

http://dx.doi.org/10.2307/2589143

new.math.uiuc.edu/zipproof/

http://new.math.uiuc.edu/zipproof/

dx.doi.org/10.2307/2319844

http://dx.doi.org/10.2307/2319844

dx.doi.org/10.1006/jctb.1996.0032

http://dx.doi.org/10.1006/jctb.1996.0032

dx.doi.org/10.1090/S0002-9939-

http://dx.doi.org/10.1090/S0002-9939-08-09698-6

08-09698-6

http://dx.doi.org/10.1090/S0002-9939-08-09698-6

dx.doi.org/10.1017/S0963548306007607

http://dx.doi.org/10.1017/S0963548306007607

people.math.gatech.edu/~thomas/PAP/bcc.pdf

http://people.math.gatech.edu/~thomas/PAP/bcc.pdf

i

i

“master” — 2012/11/15 — 11:23 — page 71 — #85i

i

i

i

i

i


[18] W. T. Tutte, On the algebraic theory of graph colorings, J. Combin. Theory Ser. B 1 (1966),

15–50; available at dx.doi.org/10.1016/S0021-9800(66)80004-2.

[19] A. Vodopivec, On embeddings of snarks in the torus, Discrete Math. 308 (2008), 1847–1849;

available at dx.doi.org/10.1016/j.disc.2006.09.051.

[20] R. Wilson, Four Colors Suffice: How the Map Problem Was Solved, Princeton Univ. Press,

2003.

dx.doi.org/10.1016/S0021-9800(66)80004-2

http://dx.doi.org/10.1016/S0021-9800(66)80004-2

dx.doi.org/10.1016/j.disc.2006.09.051

http://dx.doi.org/10.1016/j.disc.2006.09.051

i

i

“master” — 2012/11/15 — 11:23 — page 72 — #86i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page 73 — #87i

i

i

i

i

i

11The Map-Coloring Game

Tomasz Bartnicki, Jarosław Grytczuk,

H. A. Kierstead, and Xuding Zhu

Introduction

Suppose that Alice wants to color a planar map using four colors in a proper way, that

is, so that any two adjacent regions get different colors. Despite the fact that she knows

for certain that it is eventually possible, she may fail in her first attempts. Indeed, there are

usually many proper partial colorings not extendable to proper colorings of the whole map.

Thus, if she is unlucky, she may accidentally create such a bad partial coloring.

Now suppose that Alice asks Bob to help her in this task. They color the regions of a

map alternately, with Alice going first. Bob agrees to cooperate by respecting the rule of a

proper coloring. However, for some reason he does not want the job to be completed—his

secret aim is to achieve a bad partial coloring. (For instance, he may wish to start the color-

ing procedure over and over again just to stay longer in Alice’s company.) Is it possible for

Alice to complete the coloring somehow, in spite of Bob’s insidious plan? If not, then how

many additional colors are needed to guarantee that the map can be successfully colored,

no matter how clever Bob is?

This map-coloring game was invented about twenty-five years ago by Steven J. Brams

with the hope of finding a game-theoretic proof of the Four Color Theorem, avoiding per-

haps the use of computers. Though this approach has not been successful, at least we are

left with a new, intriguing map-coloring problem: What is the fewest number of colors

allowing a guaranteed win for Alice in the map-coloring game in the plane?

Brams’s game was published by Martin Gardner in his Mathematical Games column

in Scientific American in 1981. Surprisingly, it remained unnoticed by the graph-theoretic

community until ten years later, when it was reinvented by Hans L. Bodlaender [1] in the

wider context of general graphs. In this version Alice and Bob play as before by coloring

properly the vertices of a graph G. The game chromatic number �g.G/ of G is the smallest

number of colors for which Alice has a winning strategy. As every map is representable

by a graph whose edges correspond to adjacent regions of the map, Brams’s question is

equivalent to determining the game chromatic number of planar graphs. Since then the

problem has been analyzed in serious combinatorial journals and gained the attention of

Reprinted from The American Mathematical Monthly, Vol. 114, No. 9 (Nov. 2007), pp. 793–803.

73

i

i

“master” — 2012/11/15 — 11:23 — page 74 — #88i

i

i

i

i

i


several experts. In this article we present what is known and what remains unknown about

this fascinating game.

Four colors do not suffice

One can see quickly that there are maps demanding more colors in the game than normally.

Perhaps the simplest example of this phenomenon is due to Robert High. Consider the map

with six regions corresponding to the faces of a cube. Clearly, the map can be 3-colored,

but Bob can win even if there are four colors available. Suppose that Alice uses color 1

in her first move. Then Bob answers with color 2 on the opposite face. Thus colors 1 and

2 cannot be used any further, and Alice has to play color 3 (see Figure 11.1). Then Bob

answers with color 4 on the opposite face and wins the game.

1

2

Alice

Bob Bob

1

1

Alice

Figure 11.1. First two moves on the cube and the dodecahedron.

Earlier, Lloyd Shapley gave another elegant example based on a regular dodecahedron,

showing that even five colors do not suffice. Here Bob’s strategy is similar, but this time he

replies with the same color on the face opposite to the face that Alice has just colored (see

Figure 11.1). In this way, Alice must use a new color in each of her moves, which leads to

a win for Bob after the fifth round.

When Steven Brams and Martin Gardner started to think that perhaps six colors would

be the maximum number that Bob could force, Robert High found a map for which Alice

needed seven colors to win. Indeed, there is a modification of the dodecahedron map for

which a win by Alice requires eight colors. Unfortunately, it is not possible to verify this

without a tedious case-by-case analysis. A natural suspicion arose that perhaps there was

no finite bound at all, a suspicion that persisted for thirteen years.

Possible strategies for Alice

Let us consider the problems posed for Alice in designing an effective strategy for the map-

coloring game. Whenever Bob colors a vertex v, he poses a threat to each of its uncolored

neighbors. Alice can counter this threat at one of these neighbors w by immediately color-

ing it, but this in turn creates new threats for the neighbors of w. Moreover, the remaining

uncolored neighbors of v are still open to attack, and Bob can strengthen his position on

subsequent moves by coloring their uncolored neighbors. Thus Alice should design a strat-

egy that prioritizes responses to threats to uncolored vertices so that she can keep the total

threat under control. She must also limit the damage done when she colors a vertex.

i

i

“master” — 2012/11/15 — 11:23 — page 75 — #89i

i

i

i

i

i

11. The Map-Coloring Game 75

The following idea allows Alice to avoid creating new points of vulnerability when she

colors a vertex. Let �.G/ denote the usual chromatic number of a graph G, that is, the

least number of colors in a proper coloring of the vertices of G. Suppose that f W V.G/ !fc1; : : : ; ctg is an optimal coloring, with t D �.G/, and that the game is played on a set of

kt colors of the form fcij W i 2 Œk�; j 2 Œt �g, where we use the standard abbreviation Œn� for

f1; 2; : : : ; ng. We refer to cij as a shade of cj . We say that a color is correct for a vertex v

if it is a shade of f .v/; otherwise it is incorrect. Alice should always try to color a vertex

v with a correct color. If she is successful, then her early moves will never conflict with

her later moves. This is possible as long as she can play in such a way that at any time any

uncolored vertex is adjacent to fewer than k vertices incorrectly colored by Bob.

This leaves Alice with the problem of limiting the number of neighbors colored in-

correctly by Bob that an uncolored vertex can have. In ordinary (noncompetitive) graph-

coloring there is a well-known strategy for managing such threats. The coloring number

col.G/ of a graph G is the smallest integer k such that every subgraph of G has a vertex

with degree less than k. For example, it is a simple consequence of Euler’s formula that the

coloring number of an arbitrary planar graph is at most six. The vertices of a graph with

coloring number at most k can be ordered as v1; : : : ; vn so that each vertex vi has fewer

than k neighbors that precede it in the ordering: we simply construct the ordering from

right to left, always choosing a vertex whose degree among the unchosen vertices is less

than k. It is also clear that the existence of such an ordering implies that col.G/ � k. Now,

if Alice colors the vertices from left to right, she will need at most k colors, since each un-

colored vertex will have fewer than k colored neighbors. In other words, �.G/ � col.G/

for all graphs G.

Suppose that Alice attempts to modify this strategy for the map-coloring game. She

might naturally try to color the left-most (in the ordering) neighbor of the last vertex col-

ored by Bob. This vertex will have fewer than k backward (left) neighbors colored by Bob,

but the number of its forward (right) neighbors colored by Bob is unbounded. This was

the situation with the problem in the early nineties when help came from an unexpected

source.

Ramsey numbers and arrangeability

A simple graph G consists of a vertex set V.G/ and an edge set E.G/ where each edge

is an unordered pair of distinct vertices. A complete graph or clique is a simple graph

in which every pair of vertices forms an edge. A clique on t vertices is denoted by Kt .

According to Ramsey’s famous theorem, any graph G has a Ramsey number r.G/ defined

as the smallest integer t such that for any red-blue coloring of the edges of Kt there is

a monochromatic copy of G in Kt . For a class C of graphs define rC.n/ by rC.n/ Dmaxfr.G/ W G 2 C; jV.G/j D ng. Burr and Erdos conjectured that rC.n/ is linear in n

for any class of graphs satisfying jE.G/j D O.jV.G/j/. This conjecture remains open

in general, although some special cases have been proved. Chvatal, Rodl, Szemeredi, and

Trotter [5] verified the conjecture for the special case of graphs with bounded degree by

using Szemeredi’s regularity lemma. The proof involved constructing an embedding of

G into a monochromatic part of Kn one vertex at a time in an order v1; : : : ; vn. It was

i

i

“master” — 2012/11/15 — 11:23 — page 76 — #90i

i

i

i

i

i


crucial to their argument that each vertex have a bounded number of backward and forward

neighbors. If the argument had required only that the backward degree be bounded, they

would have established the whole conjecture. Notice that planar graphs satisfy jE.G/j DO.jV.G/j/ but may have arbitrarily large degree.

Chen and Schelp made the brilliant observation that something less was required: the

number of vertices vh with h < i that are adjacent to vi or to a forward neighbor vj of vi

should be bounded (see Figure 11.2). This idea, in one form or another, has turned out to

be absolutely crucial for the theory of coloring games.

v1 vh vi vnvj

Figure 11.2. The number col2.G/ bounds the number of predecessors of vi that are adjacent to vi

(grey nodes) or to a forward neighbor of vi (black nodes).

Assume that the vertices of a graph G are ordered as v1; v2; : : : ; vn. A vertex vh is

called a loose backward neighbor of vi if h < i and either (a) vh is adjacent to vi or (b)

vh is adjacent to vj and vj is adjacent to vi for some j with j > i (see Figure 11.2). We

define the 2-coloring number col2.G/ of a graph G to be the least integer k such that there

is an ordering of V.G/ for which each vertex has fewer than k loose backward neighbors.

This is an inessential variant of the parameter that Chen and Schelp called arrangeability

in [3], where it is proved that the conjecture of Burr and Erdos is true for the family of

graphs with bounded 2-coloring numbers. They then proved the Burr-Erdos conjecture for

planar graphs by showing that the 2-coloring number of any planar graph is at most 761.

Kierstead and Trotter [14] realized that this was the missing tool for bounding the game

chromatic number �g.G/ of planar graphs G. They proved the following theorem:

Theorem 1. Every graph G satisfies �g.G/ � �.G/.1 C col2.G//.

Proof. Let �.G/ D t and col2.G/ D k. To prove the assertion we describe a strategy for

Alice using at most t.kC1/ colors. Alice should begin by fixing an optimal coloring f of G

using the colors c1; : : : ; ct and an ordering v1; : : : ; vn of G that satisfies col2.G/ D k. She

prioritizes responses to threats according to this ordering. Whenever Bob colors a vertex v,

Alice colors the least-indexed uncolored backward neighbor u of v with a correct color; if

v has no such backward neighbor then Alice correctly colors the least-indexed uncolored

vertex. To be certain that u has a correct color available (i.e., a color not used by any of

its colored neighbors), it suffices to show that u has at most k neighbors that are colored

by Bob. Assume that Bob has colored s loose backward neighbors of u and that Alice has

colored s0 loose backward neighbors of u. In accordance with the stated strategy, each time

Bob colors a forward neighbor z of u, Alice colors the least-indexed uncolored backward

neighbor u0 of z. Since u is an uncolored backward neighbor of z, u0 precedes u, so u0 is

a loose backward neighbor of u. As Alice has colored s0 loose backward neighbors of u,

we conclude that Bob has colored at most s0 forward neighbors of u. Therefore the number

of neighbors of u colored by Bob is at most s C s0 .< k/. Note that this counts only the

i

i

“master” — 2012/11/15 — 11:23 — page 77 — #91i

i

i

i

i

i


number of colored neighbors of u at the end of Alice’s previous move. After that move,

Bob may have colored another neighbor of u, meaning that u now has at most k neighbors

colored by Bob. Thus u has an available correct color, and Alice can win the game with

t.k C 1/ colors.

This reasoning implies that �g.G/ � 3044 for a planar graph G. Kierstead and Trot-

ter obtained much tighter bounds on the game chromatic number of planar graphs. They

demonstrated that col2.G/ � 10 holds for each planar graph G (which gives �g.G/ � 44)

and provided an example of a planar graph G with col2.G/ D 8. They further improved

their bound on the game chromatic number of planar graphs to �g.G/ � 33 by using a

more carefully designed parameter than 2-coloring number.

Acyclic coloring

A cycle on vertices v1; : : : ; vn is a graph whose edges are v1v2; v2v3; : : : ; vnv1. A forest

is a graph without cycles. A coloring of the vertices of a graph G is acyclic if it is a proper

coloring such that no cycle of G is 2-colored. In other words, a subgraph of G whose vertex

set is a union of any two color classes is a forest. The minimum number of colors needed

is the acyclic chromatic number of G, denoted by a.G/. This notion was introduced by

Grunbaum, who conjectured that a.G/ � 5 holds for any planar graph G. The conjecture

was turned into a theorem by Borodin in 1979 [2].

Just as the chromatic number of a graph G is bounded by its coloring number, its acyclic

chromatic number is bounded by its 2-coloring number. Let v1; : : : ; vn be an ordering of

the vertices of G that realizes its 2-coloring number. Color vertices recursively so that no

vertex is colored the same color as a loose backward neighbor. This yields a proper coloring

using at most col2.G/ colors. We still must check that each cycle C in G receives at least

three colors. The largest-indexed vertex vj in C has two backward neighbors vh and vi

with h < i < j . Since vh is a loose backward neighbor of vi , they have different colors.

Since vj is adjacent to both vi and vh, it has a third color.

Twenty years after Grunbaum introduced acyclic coloring, Dinski and Zhu [6] applied

the acyclic chromatic number to the Brams game for arbitrary graphs. They proved the

following theorem, which implies that thirty colors suffice for Alice to win on any planar

map:

Theorem 2. Every graph G satisfies �g.G/ � a.G/.a.G/ C 1/.

Proof. The argument is similar to the proof of Theorem 1. Suppose that G has an acyclic

coloring using the color set fc1; : : : ; ctg, and let V1; : : : ; Vt be the corresponding color

classes of vertices. When i ¤ j , the subgraph Fij of G consisting of all edges with ends

in Vi [ Vj is a forest. Fix an orientation of each Fij so that every vertex has outdegree at

most one. Since every edge of G appears in exactly one of the forests Fij , this provides

an orientation of G such that every vertex has at most one outneighbor in each color class.

Now a vertex v in Vi is endangered if it is as yet uncolored but there is an inneighbor u

of v colored with a shade of ci (see Figure 11.3). Since Alice always colors with a correct

color, an endangered vertex can be created only by Bob. Clearly, in a single move Bob can

create at most one endangered vertex v. Alice should color this vertex immediately after it

i

i

“master” — 2012/11/15 — 11:23 — page 78 — #92i

i

i

i

i

i


Viv

u

Figure 11.3. An endangered vertex v will be colored by Alice in her next move.

is created by Bob. This is always possible, for there are at most t neighbors of v colored

with shades of ci at this moment (at most t � 1 outneighbors and only one inneighbor u

that has just been colored by Bob).

Daltonism may help

Both strategies for Alice considered earlier were local. Alice responded to any threat posed

when Bob colored a vertex by coloring one of its neighbors. As any Go player knows,

global strategies can be more powerful. Our next aim is to develop a global strategy for

Alice.

Curiously, further progress in bounding the game chromatic number was achieved by

making the game more difficult for Alice. Suppose Alice suffers from full daltonism (col-

orblindness): she just cannot distinguish between colors. Because of her disease she pro-

poses to modify the rules of the coloring game so that she can still have some fun playing

it. Rather than coloring vertices, the players create an ordering <g of vertices by taking

turns choosing the next vertex in the ordering. For a fixed positive integer k Alice’s goal

is to have <g demonstrate that col.G/ � k. Bob, wishing to remain in her company, tries

to thwart her. For the purposes of actual play it is convenient for the players to indicate

that they have chosen a vertex by coloring it gray. Bob wins if at some time some uncol-

ored vertex has k gray neighbors, and Alice wins if this never occurs. We denote the least

number k allowing success for Alice by colg.G/ and call it the game coloring number of

G. Clearly, if Alice can win this ordering game for some integer k, she can also win the

coloring game with k colors (even as a completely colorblind person!). In other words,

�g.G/ � colg.G/ is true for every graph G.

Alice is now forced to use a strategy that does not rely on shades of correct colors to

control threats created by her moves. Suppose that she again selects a specific ordering

v1; : : : ; vn (not to be confused with <g ) of the vertex set for the purpose of prioritizing

responses to threats. Suppose also that no vertex has too many backward neighbors in this

order. When Bob chooses a vertex v (colors it gray), Alice faces conflicting requirements.

On the one hand, she should choose a backward neighbor x of v, addressing the threat

to x posed by v. But this in turn produces a threat from x to its backward neighbors.

Alternatively, she could put off choosing x and instead choose a backward neighbor y of

x so that y would already be protected when later she finally did choose x. Or should she,

perhaps, choose a backward neighbor of y? Here we propose a compromise plan: the first

time Alice contemplates choosing x she should move on to considering y, but the second

time she considers choosing x she should actually do so.

i

i

“master” — 2012/11/15 — 11:23 — page 79 — #93i

i

i

i

i

i


Here are the details of Alice’s activation strategy. Imagine that the vertices of G are

made of light bulbs that shine a warning when activated. Let N C.v/ and N �.v/ denote the

sets of backward and forward neighbors of v, respectively. On her first move Alice activates

vertex v1 and colors it. The basic step of the strategy is to activate a vertex (leaving a warn-

ing that it is in danger) and then jump to its least-indexed uncolored backward neighbor.

Suppose that Bob has just colored a vertex v. Then Alice starts by activating v (provided

it has not been activated hitherto) and jumps to the uncolored vertex x of smallest index in

N C.v/. (See Figure 11.4.) If x is already active, then it is endangered, so Alice stops and

colors it. Otherwise she repeats the activation step for x, that is, she activates x and jumps

to the least-indexed uncolored vertex y in N C.x/. This process goes on until she stops

at some vertex u, either because u is active or because there are no uncolored vertices in

N C.u/. In each case she activates and colors u. If it happens that N C.v/ (see Figure 11.4,

second line) already contains no uncolored vertices, then she picks the uncolored vertex

of smallest index, activates it (provided it has not yet been activated), and colors it. Notice

that after Alice’s move all gray vertices are active, whereas after Bob’s move there can be

only one gray inactive vertex.

u y x v

u y x v

Figure 11.4. Basic step of the activation procedure.

For example, let’s see how this strategy works for trees (see Figure 11.5). Any tree

T can be ordered so that each vertex has at most one backward neighbor. Suppose that

Bob has colored a vertex v, and let x be its unique backward neighbor. Then Alice should

activate all vertices on the unique backward path starting from v and ending at some active

vertex u. Notice that Alice can jump to the same vertex only twice (the first time she

activates it, the second time she colors it). It follows that if x stayed uncolored after her

move, it could have no more than two active neighbors (one backward and one forward).

Hence, after Bob’s move there are at most three colored vertices around x, which gives

u

x

v v v

xx

u u

Figure 11.5. Activation on a tree.

i

i

“master” — 2012/11/15 — 11:23 — page 80 — #94i

i

i

i

i

i


colg.G/ � 4. This result was proved by Faigle, Kern, Kierstead, and Trotter in [8]. The

following result reveals the information provided by the activation strategy for general

graphs:

Theorem 3. Every graph G satisfies colg.G/ � 3 col2.G/ � 1.

Proof. Fix an order of the vertices v1; : : : ; vn realizing col2.G/ D k, and assume that

Alice applies the activation strategy with respect to this order. For each vertex vj let S.vj /

be the set of loose backward neighbors of vj . By the definition of col2.G/, the size of

S.vj / is at most k � 1. We count the number of active forward neighbors that vj could

have before it gets colored. From each active vertex in N �.vj / Alice jumps to the least-

indexed uncolored backward neighbor in S.vj / or possibly to vj . If she jumps to vj , then

she immediately colors it or immediately jumps to another vertex in S.vj /. Since she can

jump only twice to a vertex, there can be at most 2.k � 1/ active vertices in N �.vj / after

Alice’s move. Hence, there are at most 3.k � 1/ active vertices around vj after Alice’s

move. Counting Bob’s last move, there are at most 3k � 2 colored vertices around vj when

it gets colored.

v

v

v

1

2

3

v

Figure 11.6. Construction of a 3-tree.

There is a nice class of graphs for which the bound in Theorem 3 is optimal (as long

as we assume that Alice cannot distinguish between colors). Fix a number k and start with

a clique on k vertices v1; : : : ; vk . Then in each subsequent step add a new vertex v and

join it to k vertices of any clique in the graph constructed so far. In this recursive manner

we produce graphs called k-trees (see Figure 11.6). The resulting ordering of the vertices

shows that col2.G/ D k C 1, which gives colg.G/ � 3k C 2 for any k-tree G. It can be

proved that this bound is optimal for each k (> 1). For planar graphs the theorem gives

�g.G/ � 29, which is not bad but is still not optimal.

Six dollar order

We now demonstrate that the estimate �g.G/ � 18 holds for every planar graph G, which

is almost the best upper bound discovered so far. This time we take more care in ordering

the vertices v1; : : : ; vn so as to make the activation strategy more efficient. We define this

i

i

“master” — 2012/11/15 — 11:23 — page 81 — #95i

i

i

i

i

i


order inductively (in reverse order), starting with any vertex vn of degree at most five.

Suppose that we have already picked vertices vn; vn�1; : : : ; viC1 and are looking for a

candidate for vi . Partition the vertex set V into two parts, C D fvn; vn�1; : : : ; viC1g and

U D V n C , and construct a new graph H as follows:

1. Delete each edge between pairs of vertices in C .

2. Delete each vertex v in C with at most three neighbors in U .

3. For each deleted vertex v in C add edges between its neighbors in U so that these

neighbors form a clique.

It can be verified that the new graph H is planar: the new edges in H can be drawn

near deleted edges incident to deleted vertices of C . By Euler’s formula H satisfies the

inequality jE.H/j � 3jV.H/j � 6.

Now suppose that each edge of H is assigned $2 (in half-dollar coins). These coins are

distributed to the vertices of H as follows: if an edge e links two vertices in U , then e gives

$1 to each of the two vertices; if e links a vertex x in C and a vertex y in U , then e gives

$1.50 to x and $0.50 to y (see Figure 11.7). As the total amount of dollars is equal to 2jEj(< 6jV.H/j), there is a vertex v that receives less than $6. This will be the i th vertex vi in

our order. Notice that vi cannot belong to C , for each remaining vertex in C is incident to

at least four edges of H , hence receives at least $6.

$1.5

$0.5

$1

$1

vi

CU

Z

D

N-(vi)

Figure 11.7. Ordering the set of vertices of G by finding a poor vertex in a graph H .

Now we are ready to prove the result stated at the beginning of this section:

Theorem 4. Every planar graph G satisfies colg.G/ � 18.

Proof. Let v1; : : : ; vn be the ordering of the vertices of G obtained in the manner just de-

scribed. It suffices to show that if Alice applies the activation strategy, then at the end of

each of her turns there will be at most sixteen active neighbors around any uncolored ver-

tex. Let v D vi be any uncolored vertex, let H , C , and U be as indicated in the discussion

preceding Theorem 4 (notice that all of them depend on vi ), let D be the set of those for-

ward neighbors of v (in G) that were deleted while constructing H , and let Z be the set of

those vertices of U that are adjacent to v in H . Since we didn’t delete any edges between

vertices of U , we see that N C.v/ is contained in Z. Also Z includes all vertices, other

than v, to which Alice can jump after activating any vertex in the set D (see Figure 11.7).

Hence, arguing as in the previous proof, there can be at most

3jZj C jN �.v/ n Dj

i

i

“master” — 2012/11/15 — 11:23 — page 82 — #96i

i

i

i

i

i


active vertices around v at the end of any move by Alice. Since v received strictly less than

$6 in the ordering procedure,

1 � jZj C 0:5 � jN �.v/ n Dj < 6:

This implies that 3jZj C jN �.v/ n Dj � 16, which completes the proof.

Variants of the game

Currently, the best upper bounds for a planar graph G are �g.G/ � colg.G/ � 17, as

proved in [20]. The proof is based on a modification of the activation strategy. At present,

the best lower bounds for planar graphs are 8 � �g.G/ and 11 � colg.G/. There is

evidence that the upper bound on the game coloring number colg.G/ may be sharp. How-

ever, we cannot even anticipate what the correct answer is for the game chromatic number

�g.G/ of planar graphs.

This situation probably exists because of the rather strange general behavior of the

parameter �g.G/. For instance, �g.G/ is already unbounded for graphs satisfying �.G/ D2 and is also not monotone with respect to subgraphs. Indeed, let U D fu1; : : : ; ung and

V D fv1; : : : ; vng be disjoint sets of vertices, and let Kn;n be the graph on these vertices

obtained by joining each vertex in U with all vertices in V . Let M be a perfect matching in

Kn;n (i.e., a set of n edges no two of which are incident with the same vertex; for instance,

M D fu1v1; : : : ; unvng). Clearly we have �g.Kn;n/ D 3, but �g.Kn;n � M/ D n. Bob’s

strategy is to copy Alice’s moves on the other ends of the missing matching (see Figure

11.8).

Alice

Bob

Figure 11.8. Copy cat strategy allows Bob to win on Kn;n �M if fewer than n colors are available.

More surprisingly, it is not clear what influence increasing the number of colors has

on the coloring game. Suppose that Alice wins with k colors on a graph G. Then it would

seem trivial that she should also win on G with k C 1 colors. But can you prove it? As yet,

nobody has been able to do so!

In view of such pathologies it is natural to look for more predictable variants of the

coloring game. For instance, in a modification considered in [3] Bob is not allowed to use

a new color until it is absolutely necessary, that is, unless all uncolored vertices are sur-

rounded by all previously used colors. Chen, Schelp, and Shreve prove there that ��g.T / �

3 holds for any tree T , where ��g.G/ is the related game chromatic number. However, no

better bounds for arbitrary planar graphs are provided. In this setting it is clear that increas-

ing the number of colors cannot help Bob. However, we are still left with the anomaly that

��g.Kn;n � M/ D n.

i

i

“master” — 2012/11/15 — 11:23 — page 83 — #97i

i

i

i

i

i


There are many natural variations of the coloring and ordering games. One variation is

to allow Alice more moves so as to limit Bob’s ability to interfere [12], [13], [17]. Other

variants consider oriented coloring [15], [16], [18] and d -relaxed coloring [4], [7]. It is

remarkable that the activation strategy is still effective in these diverse settings.

Acknowledgment We would like to thank Steven Brams for useful materials and com-

ments on the origin of the map-coloring game. The research of the fourth author is sup-

ported in part by the National Science Council under grant NSC95-2115-M-110-013-MY3.

Bibliography

[1] H. L. Bodlaender, On the complexity of some coloring games, Internat. J. Found. Comput. Sci.

2 (1991) 133–147.

[2] O. V. Borodin, On acyclic colorings of planar graphs, Discrete Math. 25 (1979) 211–236.

[3] G. Chen, R. H. Schelp, and W. E. Shreve, A new game chromatic number, European J. Combin.

18 (1997) 1–9.

[4] C. Chou, W. Wang, and X. Zhu, Relaxed game chromatic number of graphs, Discrete Math.

262 (2003) 89–98.

[5] V. Chvatal, V. Rodl, E. Szemeredi, and W. T. Trotter, The Ramsey number of a graph with a

bounded maximum degree, J. Combin. Theory Ser. B 34 (1983) 239–243.

[6] T. Dinski and X. Zhu, Game chromatic number of graphs, Discrete Math. 196 (1999) 109–115.

[7] C. Dunn and H. Kierstead, A simple competitive graph coloring algorithm III, J. Combin. The-

ory Ser. B 92 (2004) 137–150.

[8] U. Faigle, U. Kern, H. A. Kierstead, and W. T. Trotter, On the game chromatic number of some

classes of graphs, Ars Combin. 35 (1993) 143–150.

[9] M. Gardner, Mathematical games, Scientific American (April, 1981) 23.

[10] D. Guan and X. Zhu, The game chromatic number of outerplanar graphs, J. Graph Theory 30

(1999) 67–70.

[11] H. A. Kierstead, A simple competitive graph coloring algorithm, J. Combin. Theory Ser. B 78

(2000) 57–68.

[12] ——, Asymmetric graph coloring games, J. Graph Theory 48 (2005) 169–185.

[13] ——, Weak acyclic coloring and asymmetric graph coloring games, Discrete Math. 306 (2006)

673–677.

[14] H. A. Kierstead and W. T. Trotter, Planar graph coloring with an uncooperative partner, J. Graph

Theory 18 (1994) 569–584.

[15] ——, Competitive colorings of oriented graphs, Electron. J. Combin. 8 (2001) #12, 15 pp.

[16] H. A. Kierstead and Z. Tuza, Marking games and the oriented game chromatic number of partial

k-trees, Graphs Combin. 8 (2003) 121–129.

[17] H. A. Kierstead and D Yang, Very asymmetric marking games, Order 22 (2005) 93–107.

[18] J. Nesetril and E. Sopena, On the oriented game chromatic number, Electron. J. Combin. 8

(2001) #14, 13 pp.

[19] X. Zhu, The game coloring number of planar graphs, J. Combin. Theory Ser. B 75 (1999) 245–

258.

[20] ——, Refined activation strategy for the marking game, J. Combin. Theory Ser. B (2007, in

press).

i

i

“master” — 2012/11/15 — 11:23 — page 84 — #98i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page 85 — #99i

i

i

i

i

i

IIIFlexagons and

Catalan Numbers

i

i

“master” — 2012/11/15 — 11:23 — page 86 — #100i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page 87 — #101i

i

i

i

i

i

12It’s Okay to Be Square

If You’re a Flexagon

Ethan J. Berkove and Jeffrey P. Dumont

It has been said that a mathematician can be content with only paper and pencil. In fact,

there are times when one doesn’t even need the pencil. From a simple strip of paper it

is possible to make a surprisingly interesting geometric object, a flexagon. The flexagon

can credit its creation to the difference in size between English-ruled paper and American

binders. The father of the flexagon, Arthur Stone, was an English graduate student studying

at Princeton University in 1939. To accommodate his smaller binder, Stone removed strips

of paper from his notebook sheet. Not being wasteful, he creased these lengths of paper

into strips of equilateral triangles, folded them in a certain way, and taped their ends. Stone

noticed that it was possible to flex the resulting figure so that different faces were brought

into view—and the flexagon was born [1]. Stone and his colleagues, Richard Feynman,

Bryant Tucker, and John Tukey, spent considerable time cataloging flexagons but never

published their work.

Like many geometric objects, flexagons can be appreciated on many levels of mathe-

matical sophistication (the first author remembers folding flexagons in elementary school).

With so adaptable a form, it is not surprising that flexagons have been studied from points

of view that vary from art to algebra. Our interest in flexagons was sparked by a ques-

tion posed in a paper by Hilton, Pedersen, and Walser [9]. They studied one of the hex-

aflexagons, so-named because the finished model has the shape of a hexagon. They calcu-

lated the group of motions for a certain hexaflexagon, then inquired about other members

of the hexaflexagon family. We have determined that the trihexaflexagon is exceptional,

as it is the only member of the hexaflexagon family whose collection of motions forms a

group.

Working to generalize this result, we shifted our attention to tetraflexagons, which are

constructed from strips of squares folded into a 2 � 2 square final form. We discovered

that tetraflexagons are, if anything, more complicated and interesting than their hexagonal

cousins. These results convinced us that tetraflexagons, often only mentioned in passing

in the literature, deserve to be brought into the limelight. In this paper, we will summarize

the results of our investigations. Although most of the material on hexaflexagons is known,

the material on tetraflexagons includes new results and open questions. It is our intent to

Reprinted from Mathematics Magazine, Vol. 77, No. 5 (Dec. 2004), pp. 335-348.

87

i

i

“master” — 2012/11/15 — 11:23 — page 88 — #102i

i

i

i

i

i

88 Part III. Flexagons and Catalan Numbers

give interested readers enough material to start their own explorations of these fascinating

objects.

The hexaflexagon family

a

b

c

d

e

f

g

h

i

tab

Figure 12.1. Trihexaflexagon net.

It is easy to fold a flexagon, and we highly recommend making one of your own as this

experience will be helpful in following the results in this section (and it’s fun). Construct

a strip of nine equilateral triangles and a tab as in Figure 12.1; this strip is called the net

of the flexagon. Each triangle in the strip, and in general each polygon in a flexagon net, is

called a leaf of the flexagon. You may want to label both sides of each leaf and precrease

all edges in both directions. Hold the leaf marked a in your hand, fold leaf c over leaf b, f

over e, and i over h. Finish the flexagon by gluing or taping the tab onto leaf a. The final

model should look like the flexagon depicted in Figure 12.2. Clockwise from the top, one

can read off the leaves f; d; c; a; i; g. We call .f; d; c; a; i; g/ a face of the flexagon.

ci

a

f

dg

Figure 12.2. The .f; d; c; a; i; g/ face of the trihexaflexagon.

To flex your new creation, bring the three corners at the dashed lines down together

so they meet. The hexagon will form a Y, at which point it will be possible to open the

configuration at the middle. (This is the only possible way to perform a flex-down for

this flexagon. There is also an inverse operation, a flex-up.) The result is a different face

.f; e; c; b; i; h/ of the flexagon. The flex can be repeated to get a third face .g; e; d; b; a; h/,

and one more flex returns the flexagon to .g; f; d; c; a; i/, the original face rotated clock-

wise through an angle of �=3. Since it has three distinct faces, this flexagon is known as

the trihexaflexagon—it is the simplest member of the hexaflexagon family. The three faces

can be seen more easily if they are marked somehow: Wheeler [15] shows how to color the

net so each flex brings out a new color, and Hilton et al. [9] give a way of marking the net

so flexes bring out happy and sad pirate faces.

We would like a way to keep track of all the faces of a flexagon while we flex, which

we can do using a graph: vertices represent the faces of the flexagon, and an edge joins

two vertices if there is a flex that takes one face of the flexagon to the other. On occasion

i

i

“master” — 2012/11/15 — 11:23 — page 89 — #103i

i

i

i

i

i

12. It’s Okay to Be Square If You’re a Flexagon 89

fdcaig

fecbih gedbah

Figure 12.3. The structure diagram for the trihexaflexagon: vertices denote faces, edges denote flexes

between faces.

we will use a directed graph, where an arrow points towards the face that is the result of a

flex-down. We choose to ignore the orientation of a face in the graph as this has a tendency

to make the graph overly complicated. The completed graph is called a structure diagram.

The cycle in Figure 12.3 is the structure diagram for the trihexaflexagon. It shows the three

distinct faces, as well as their relationship via flexing.

The flex-down we described is a motion of the flexagon, a transformation that takes one

hexagonal face of the flexagon to another hexagonal face. We require that our flexagons

have no faces containing loose flaps that can be unfolded or moved so the hexagonal shape

is lost. This becomes a significant issue as the number of triangles in the net increases.

Indeed, in larger nets it is increasingly likely that a random folding of the net yields a face

containing a loose flap, which in turn causes the entire flexagon to fall apart into a Mobius

band with multiple twists. Therefore, we only consider flexagons that are folded in such a

way that every flex is a motion.

To determine the structure of the set of motions, let f denote a flex-down (hence f �1

denotes a flex-up), and denote by � a flip along the x-axis of the flexagon. Then f 3 D�2 D id and it is easy to confirm that �f � D f �1. The group with this presentation is

well known as S3, the symmetric group on 3 letters, which has 6 elements.

This analysis ignores the fact that f 3 is not strictly the identity flex, but is instead a ro-

tation through �=3 degrees. The complete set of motions of the trihexaflexagon, including

rotations, is analyzed in [9]; the only difference from the argument above is that f 18 D id ,

and the resulting group is D18, the dihedral group with 36 elements.

Motions of the hexahexaflexagon

We perform a similar analysis for the member of the hexaflexagon family with six faces.

This hexahexaflexagon can be constructed from the net of 18 triangular leaves with a tab

as in Figure 12.4. To create the flexagon, label all leaves front and back and precrease all

edges as before. Fold leaf a under the rest of the strip. Then fold the edge between leaves c

ab

cd

ef

gh

ij

kl

mn

op

qr

tab

Figure 12.4. The hexahexaflexagon net.

i

i

“master” — 2012/11/15 — 11:23 — page 90 — #104i

i

i

i

i

i


and d so that a and d are adjacent. Next fold the edge between e and f so that c and f are

adjacent. Continue rolling the strip in this manner until o and r are adjacent. The finished

roll should look like the initial strip for the trihexaflexagon with leaf b on the far left. Fold

this like the trihexaflexagon, then tape the tab to a to complete the model.

In contrast to the trihexaflexagon, from the initial face of the hexahexaflexagon either

alternating set of corners flexes down. We can distinguish the two flexes by looking at

what they do to the number of leaves in a triangular segment of the hexagon. Following

Oakley and Wisner [11], we call the entire triangular segment a pat. In our newly folded

hexahexaflexagon, pats alternately contain 2 and 4 leaves, or .2; 4/ for short. One of the

flexes, which we call f , preserves thicknesses so is pat-preserving. The other flex, g, is

pat-changing, from .2; 4/ to .1; 5/ and vice versa. The pat thicknesses are invariant under

� , a flip along the x-axis.

Starting with the initial face of the hexahexaflexagon, one finds that f 3 rotates the

hexagon clockwise through an angle of �=3, so f 18 D id . In addition, as �f � D f �1, f

and � generate a copy of D18 in the collection of hexahexaflexagon motions. On the other

hand, g leads to a face where the only possible flex is the pat-preserving f , which leads

to a face where the only possible flex is the pat-changing g. The three-flex combination

gfg rotates the hexagon clockwise through an angle of �=3, and g satisfies �g� D g�1.

Therefore, there is at least one other copy of D18 in the motions of the hexahexaflexagon.

We turn this information into the structure diagram in Figure 12.5. Solid arcs between

faces denote pat-preserving flexes, while dotted arcs denote pat-changing flexes. There is a

primary cycle that can be traversed using pat-preserving flexes, and three subsidiary cycles

that can be entered at faces where two flexes are possible. From the cycle structure we

learn an important fact: the collection of motions of the hexahexaflexagon must have at

least two generators, f and g. However, there are faces where only one of f and g can be

applied, so it is not always possible to apply f or g twice in a row. In fact, g3 does not even

make sense. We now have an answer to the question posed in [9]: the hexahexaflexagon’s

motions do not form a group!

This conclusion surprised us, as the collection of motions for every other geometric

object we know of has a group structure. Furthermore, all hexaflexagons except the trihex-

aflexagon share this characteristic, as they contain pat-changing flexes. Pat-changing flexes

f

g

f

Figure 12.5. Structure diagrams for the hexahexaflexagon: dotted edges are pat-changing flexes,

solid edges are pat-preserving flexes.

i

i

“master” — 2012/11/15 — 11:23 — page 91 — #105i

i

i

i

i

i


occur at faces with two possible flexes, and the only hexaflexagon structure diagram with-

out intersecting cycles is the trihexaflexagon’s. Structure diagrams are necessarily finite,

and results in Wheeler [15] imply that cycles cannot form a closed link. As a result, in any

hexaflexagon there are only so many times the pat-changing flex can be applied. Thus, for

every hexaflexagon with a pat-changing flex g there is a k such that gk is undefined, which

implies that the collection of motions for that hexaflexagon cannot form a group.

The astute reader might have noted that the structure diagram in Figure 12.5 seems to

have nine faces, not six, as the hexahexaflexagon’s name suggests. This discrepancy can

be explained by carefully studying the hexahexaflexagon’s faces. Upon closer inspection,

three identical sets of triangles occur in two separate faces, but in different orders. If a face

depends on both the triangles and their order, then indeed the hexahexaflexagon has nine

faces. If order is disregarded, however, there are only six faces. The latter is the standard

accounting, hence the name (although Oakley and Wisner [11] distinguish between the six

physical faces and the nine mathematical faces).

We can construct other members of the hexaflexagon family by increasing the number

of triangles in the initial net. For example, starting with a straight strip of 36 leaves, the

strip can be rolled, then rerolled to yield the net in Figure 12.1, then folded as in the

trihexaflexagon case to yield the dodecahexaflexagon. One can fold other members of the

family by starting with nets that are not straight. The article by Wheeler [15] and Pook’s

book [14] contain some nice directions for folding the tetrahexa- and pentahexa- cases.

There is also a HexaFind program [3], which generates all nets for hexaflexagons with any

given number of faces.

We have only discussed a bit of what is known about hexaflexagons. In [11], Oakley

and Wisner introduce the concept of the pat, then use it to count the total number of hex-

aflexagons with a particular number of faces. Madachy [10], O’Reilly [12], and Wheeler

[15] describe connections between hexaflexagons and their structure diagrams. McIntosh

[7], Madachy [10], and Pook [14] provide lengthy bibliographies to other work on hex-

aflexagons. Gilpin [8], Hilton et al. [9], and Pedersen [5] study the motions of the trihex-

aflexagon. Furthermore, the group of motions of the trihexaflexagon is identified in [8]

and [9].

The distant cousins: tetraflexagons

We were introduced to tetraflexagons in a Martin Gardner Mathematical Games column

in Scientific American [6]. We immediately noticed some differences; the direct analog

of a strip of triangles, a straight strip of squares, makes for a poor flexagon; when you

fold square over square you end up with a roll that does not flex at all (and has a trivial

structure diagram). Therefore, we allow the nets to have right-angled turns. These turns

occur at what we call corner squares, those attached to their neighbors on adjacent edges

rather than opposite ones. Because so much is known about hexaflexagons, we felt that the

tetraflexagon family would be readily analyzed. Our actual experience mirrored Stone’s,

as reported by Gardner [6]: “Stone and his friends spent considerable time folding and an-

alyzing these four-sided forms, but did not succeed in developing a comprehensive theory

that would cover all their discordant variations.” However, we have established some nota-

tion and proved some results; our investigations have convinced us that tetraflexagons are

as fascinating as, and more subtle than, their hexaflexagon relatives.

i

i

“master” — 2012/11/15 — 11:23 — page 92 — #106i

i

i

i

i

i


a b c

d

e

f g h i

j

k

l

Figure 12.6. A “4 � 4 ring” net.

Here is one way to fold a tetraflexagon using a 4 � 4 net with a cut adjacent to a corner

square. Label a net as in Figure 12.6 and lay it flat. Mark the cut edges of a and l so you

can tape them together after you have folded the tetraflexagon.

1. Fold leaf c over leaf d . Leaves a and b will remain to the left of c, but will flip over.

2. Fold c and d over e; the back of leaf c will touch the front of leaf e, and again, a

and b are left free.

3. Fold f over g; move the flap with a and b so it points to the right.

4. Fold f and g over h so f and h are touching. Then slip the flap with a and b under j .

5. Fold j over i (without moving a and b), and flip the partially folded tetraflexagon

upside down. Position the tetraflexagon so k and l are at the top of the model and a

and b are upside down and facing to the left.

6. Fold i and j over k so i and k are touching.

7. Slide the flap with a and b under l , then fold a over l . Finally, tape a and l together

on the right, along the edges that were originally cut.

The finished tetraflexagon will have 90-degree rotational symmetry, and should look like

Figure 12.8a.

Now that you have a tetraflexagon in front of you, let’s introduce some notation (see

also Figure 12.7). The tetraflexagon has four pats, which conveniently look like the four

quadrants in Cartesian coordinates. Therefore, we will refer to pats by the quadrant they

are in: pat I, pat IV, etc. If you look carefully at your tetraflexagon, you’ll notice that

adjacent pats are connected by a layer of paper, called a bridge. The two leaves, one in

pocket

looseflap

foldable edge

freesides

connectedsides

Figure 12.7. Tetraflexagon notation.

i

i

“master” — 2012/11/15 — 11:23 — page 93 — #107i

i

i

i

i

i


each quadrant, that make up this layer are bridge leaves. Leaves b and c are bridge leaves,

as are k and l . Bridges will figure prominently in our analysis.

Define a pocket as a strict subset of leaves within a pat attached to the rest of the

tetraflexagon on two adjacent sides. Without the strictness condition, an entire pat is a

pocket, connected to the rest of the flexagon by the two bridges; this is an extreme case we

wish to avoid. On the other hand, a pocket may consist of a single leaf, in which case the

pocket is a bridge leaf. As an example, squares f and g form a pocket in the tetraflexagon

you just folded and this is the only pocket in its pat. The attached sides of the pocket are

connected, whereas the other two sides are free. When a free side of a pocket lies along

the outside edge of the tetraflexagon (like the outside edge of h and i ), we call the entire

tetraflexagon edge a foldable edge. The foldable edge will be folded in half during the flex.

There are two conditions that a tetraflexagon must satisfy in order to flex: there must be

a pocket with a component bridge leaf, and the bridge in the pocket must be at right angles

to the foldable edge. (Since the bridge goes from a pocket to an adjacent pat it cannot

cross the pocket’s free side.) To perform a flex, orient the tetraflexagon so the foldable

edge is forward and the pocket lies on top of the tetraflexagon, as in Figure 12.8a. Fold the

tetraflexagon in half perpendicular to the foldable edge so that the pocket remains on the

outside. Put your thumbs into the pocket and its kitty-corner companion (Figure 12.8b),

then pull outwards in the direction of the arrows. The pocket layers will separate from the

rest of the pat, rotating outwards 180 degrees but staying in the same quadrant. The rest

of the pat layers will maintain their orientation but move to the top of the adjacent pat.

Simultaneously, the layers in the adjacent pats will rotate outwards 180 degrees. When you

flatten the tetraflexagon, you will see a new face as in Figures 12.8c and 12.8d.

a b c d

Figure 12.8. Flexing a tetraflexagon.

We call the flex with the pockets up and the foldable edge forward a book-down flex.

It is only one of four possibilities, depending on the position of the pocket and foldable

edge. We therefore distinguish between four types of flexes: flexes up or down along the

y-axis (book-up and book-down) and flexes up or down along the x-axis (laptop-up and

laptop-down). From Figure 12.8, we see that an up flex is the inverse of a down flex and

vice versa. As before, the top layer of leaf of the tetraflexagon, like .a; d; g; j /, is called a

face, and a flex is a motion if it takes one tetraflexagon face to another.

Before we go further into our exploration, we introduce some assumptions about the

tetraflexagons we consider to aid in our analysis.

1. Net assumption. All tetraflexagon nets have exactly four corner squares. The two

vertical strips are each composed of m leaves, and the two horizontal strips of n

leaves. All nets are folded into four-pat (2 � 2) tetraflexagons.

2. Winding assumption. There is only one bridge between adjacent pats of the tetraflex-

agon. In other words, as the net is folded, the strip of paper enters and exits each

i

i

“master” — 2012/11/15 — 11:23 — page 94 — #108i

i

i

i

i

i


quadrant exactly once. This assumption is easily justified; we will see in the Flexing

Lemma below that subsequent flexes maintain all four bridges as single layers.

3. Rigidity assumption. No tetraflexagon face contains a loose flap. A loose flap is a col-

lection of two or more leaves connected to the rest of the tetraflexagon on only one

side (see Figure 12.7). Since a loose flap determines one of two possible tetraflex-

agon faces depending on its position, movement of a loose flap can be considered

a motion. The rigidity assumption ensures that the only tetraflexagon motions are

flexes.

4. Symmetry assumption. Folded tetraflexagons have 180-degree rotational symmetry

(in contrast to the hexaflexagons’ 120-degree symmetry).

Figure 12.9. Rings, bolts, and snakes.

The net assumption implies that nets are one of three shapes, depending on parity. All

nets are shown in Figure 12.9: when m and n are both even, the only possible net is a ring;

when both m and n are odd, the only possible net is a lightning bolt; and when exactly one

of m and n are odd, the only possible net is a snake. We remark that analogs of the other

three assumptions hold for most hexaflexagons.

Basic analysis: folding 101 and structure diagrams with dead ends

Flexing for tetraflexagons is a bit more complicated than for hexaflexagons. We carefully

describe what happens during a flex in the following

Flexing Lemma. The following hold for tetraflexagons under our assumptions:

1. Corner squares remain in their pats during flexes.

2. A pocket that has a component bridge leaf is either a single layer (that is, it is the

bridge leaf), or consists of every leaf in the pat except for either the top or bottom

one. In the latter case, the leaf not belonging to the pocket is also a bridge leaf.

3. A flex maintains all four bridges as single layers.

Proof. To simplify the arguments, we will assume for this proof that the pocket is always

on top of pat IV with the foldable edge forward, as in Figure 12.8a. The other cases will

follow by rotation and mirror images. Moreover, by the symmetry assumption we only

need focus on pats III and IV.

We start with the first claim. When we perform the flex shown in Figure 12.8, all leaves

in pat III remain in pat III, which means that the only corner square that can change pats

i

i

“master” — 2012/11/15 — 11:23 — page 95 — #109i

i

i

i

i

i


lies in pat IV. However, if the book-down flex moves this corner square to pat III, then the

flex is also forced to move the strip of leaves connecting the corner squares in pats I and

IV. This is impossible as the corner square in pat I is fixed by the symmetry assumption.

For the second claim, assume that the pocket with a bridge leaf consists of more than

one layer. We claim that the corner square in pat IV must be a leaf of the pocket. By

way of contradiction, assume the corner square is below the pocket. Notice that the corner

squares in pats I and IV are connected by a vertical strip. As this strip is woven from the

corner square in pat IV to the corner square in pat I, it must at some point become the

unique bridge to pat I (by the winding assumption). One of the bridge leaves is a leaf of

the pocket, so the vertical strip of squares must also cross from the bottom of pat IV to the

pocket. Because the vertical strip is woven back and forth, it can only cross to the pocket

on the front or back edge of pat IV. The vertical strip cannot cross the gap at the front edge

of pat IV because that edge is free by assumption. The vertical strip cannot cross the gap

at the back edge of the pocket either, as that would lock the pocket to the rest of pat IV,

making a flex impossible. We conclude that the corner square must be a leaf of the pocket.

An analogous argument shows that the vertical strip cannot cross from the corner

square to the bottom layer either. By the rigidity assumption, this bottom layer must there-

fore consist of a single leaf. Now if the bridge leaf to pat III were any layer but the bottom,

it would force the left side of the pocket in pat IV to be connected, rather than free. This is

impossible as all pockets have two free sides.

For the final claim, the winding assumption implies that, when first folded, the tetra-

flexagon has single layer bridges between pats. Consider the effect of a book-down flex.

This flex flips the bridge leaves that lie in the “pages” of the book upside down, so those

bridges remain single layers. For the two bridges that cross the spine of the book, focus

on pats III and IV, and assume first that the pocket is a single layer; then the book-down

flex leaves a single layer in pat IV, which must be the bridge leaf. Otherwise, the pocket

has more than one leaf and the bottom leaf of pat IV is a single layer. In this case, the

completed flex moves the single layer from pat IV to pat III, again maintaining the bridge

as a single layer.

The Flexing Lemma gives us insight about what happens to a tetraflexagon during a

single flex. What about the larger picture? What can we say about the set of all the motions

of the tetraflexagon we folded? Again, we answer these questions with a structure diagram,

but now use the orientation of the graph’s edges to distinguish among the four possible

types of flexes. Denote a book flex by a horizontal line segment in the structure diagram,

a laptop flex by a vertical one, and let arrows point to the faces that are the result of a flex

down. With a little flexing and experimentation, we find that the structure diagram for the

tetraflexagon we folded is an L shape, with five vertices (corresponding to faces), and four

edges, two in each part of the L (corresponding to flexes). On all edges, the arrows point

away from the corner of the L. This structure diagram, which appears in the bottom left of

Figure 12.11, has a feature we have not seen before: there are vertices incident to a single

edge. We call the faces associated to these vertices dead ends, because once these faces are

reached via a flex, the only motion that can be applied is the flex’s inverse.

For the hexaflexagon cases, a flex of finite order appeared in the structure diagram as a

cycle. The structure diagram we just constructed has no cycles, hence no elements of finite

i

i

“master” — 2012/11/15 — 11:23 — page 96 — #110i

i

i

i

i

i


order. In addition, any flex can be applied at most twice, at which point some other flex

must be applied. In other words, there is really no group structure at all in the motions of

the tetraflexagon we constructed. An immediate question is whether there is a tetraflexagon

whose collection of motions forms a group.

There is, and it can be folded from the 3 � 3 lightning bolt net shown in Figure 12.10.

Start with a copy of this net—you may want to specially mark the shaded sides of a and

h as shown in the diagram, as those are the edges that get taped together at the end. Fold

leaf b over c, d (and a, b, and c) over e, and g over f . Make sure h lies on top of a, then

tape a and h together on the right, along their shaded sides. Your finished tetraflexagon

should have 180-degree rotational symmetry, as per the symmetry assumption. From this

initial face, the only possible down flex is a laptop-down. If this is followed with book-

down, laptop-down, and book-down flexes (again, the only possible down flexes) you get

back to the initial face. The resulting structure diagram is a square cycle, corresponding to

the symmetry group Z=4Z, the cyclic group with 4 elements. We believe that this is the

only tetraflexagon whose collection of motions forms a group (although a rigorous proof

eludes us).

a

b

c d e

f

g h

Figure 12.10. A 3 � 3 bolt net.

You may have noticed that although we call the net in Figure 12.10 a 3 � 3 lightning

bolt, there are only two leaves, g and h, in one row. Actually, that row does contain a third

leaf, a, which becomes part of the row after we finish taping. In general, when we refer to

a net as m by n, we include the corner squares in the count. For the cases of the bolt and

snake nets, this means that either one row or one column will be a leaf short when the net

is first constructed.

By reducing larger nets to the 3 � 3 case, one can construct many tetraflexagons whose

structure diagrams contain cycles. For example, starting with the net in Figure 12.6, folding

b over c and i over h turns the 4 � 4 ring net into a 4 � 3 snake net. Another pair of folds

results in a 3�3 lightning bolt net, which can then be folded as above. More generally, one

can turn an m�n net into either an m�1�n net or an m�n�1 net by making appropriate

folds. Here, appropriate means that the pair of folds results in a net that satisfies the sym-

metry assumption. Also, a given sequence of edge choices must result in a tetraflexagon

that satisfies our four assumptions. Once the net is a 3 � 3 lightning bolt, it is folded to

guarantee at least one cycle in the structure diagram. This process is reminiscent of how a

hexaflexagon is folded from a strip containing 9.2n/ triangles by rolling it n times to form

the trihexaflexagon net with 9 triangles. A difference, though, is that there are many ways

of choosing edges in pairs to reduce an m�n net to the 3�3 net, so a generic tetraflexagon

net can yield a large number of tetraflexagons with distinct structure diagrams.

i

i

“master” — 2012/11/15 — 11:23 — page 97 — #111i

i

i

i

i

i


Figure 12.11. Tetraflexagon structure diagrams folded from a 4 � 4 ring.

As an example, through exhaustive folding, both figurative and literal, we have identi-

fied in Figure 12.11 most (all?) of the possible structure diagrams that result from a 4 � 4

ring. Some of the diagrams are very basic, but notice that three of them contain cycles, all

but one contain dead ends, and two contain both. This variety in structure diagrams from

the same net demonstrates how difficult it is to classify even simple tetraflexagons.

Despite their “discordant variations,” the tetraflexagons we investigated have some fea-

tures in common. In a number of the examples we folded, we came across faces where all

four flexes were possible. We call such faces crossroads. For the reader who wants to see a

crossroad in action, start with the net in Figure 12.6, labeled front and back with each letter.

Fold b under c, c over d , e (and a–d ) under f , g (and the rest of the net) over f , i over

h, j over i , and l over k. Lift a over l , then tape a and l together on the right (outside)

edge. The final tetraflexagon is shown in Figure 12.12d. (What is the resulting structure

diagram?) Crossroads are among the most interesting features in a structure diagram, and

we wondered how many crossroads a structure diagram could contain. We noticed that

after we flexed a crossroad, we never saw another crossroad, which led us to prove the

Crossroad Theorem. Crossroad faces are never adjacent in tetraflexagons built from ring,

lightning bolt, or snake nets.

Proof. Let’s analyze the shape of a tetraflexagon at a crossroad face. In order to perform all

four types of flexes, every edge of the tetraflexagon must be a foldable edge and there must

be two pockets per edge to allow both types of flexes. It is easy to show that the pockets

cannot be on the same side of the foldable edge, as in Figure 12.12a. If the pockets were

on the same side, the remaining leaves of the two pats would be the bridge between the

pats, by the second claim of the Flexing Lemma. By the symmetry assumption, the same

would be true for the other two pats of the tetraflexagon, and the entire tetraflexagon would

simply fall apart.

We conclude that one pat along the foldable edge must have a pocket on the top of

the pat to allow a flex down, while the adjacent pat along the edge must have a pocket

on the bottom of the pat to allow a flex up. There are essentially two cases to consider,

shown in Figures 12.12b and 12.12c (Figure 12.12d is Figure 12.12c’s mirror image). We

can show that Figure 12.12b is impossible by a careful analysis of pat IV. Since one of the

two pockets in pat IV contains all the layers but one, assume, without loss of generality,

that it is the pocket associated to the foldable edge on the right. Then the bridge to pat III

crosses the edge marked by the arrow, and the component bridge leaf in pat IV is one of

i

i

“master” — 2012/11/15 — 11:23 — page 98 — #112i

i

i

i

i

i


a b c d

Figure 12.12. Two impossible configurations and two crossroads.

the leaves in the pocket by the second claim of the flexing lemma. However, this implies

that the edge marked by the arrow is a connected edge, which makes a book-down flex

impossible. Therefore, the only possible crossroad faces have configurations like Figures

12.12c and its mirror image 12.12d.

We next determine when a flex from a crossroad yields another crossroad. By symmetry

arguments, it is sufficient to consider Figure 12.12c. We claim that is impossible to perform

a book-down flex from Figure 12.12c and get the crossroad in Figure 12.12d. This time,

consider pat III. Note that by the second claim of the Flexing Lemma one of the two pockets

in pat III must consist of every layer in the pat except one; assume (by symmetry again)

that it is the bottom pocket. After a book-down flex is applied, the original layers in pat III

are flipped upside down, and are covered by a single layer from pat IV. This single layer

must be part of the new bridge to pat IV. Therefore, it is not possible for the right edge of

pat III to be free, as in Figure 12.12d.

Thus, if two crossroads are connected by a flex, they must be identical. We rule out

this possibility too. Consider the enlargement of pat IV of Figure 12.12c, shown in Figure

12.13. In order for a book-down flex to yield the same configuration, pat IV must contain

a pocket as part of the bridge. But the bridge must be a single layer, by the second claim of

the Flexing Lemma.

Figure 12.13. Closeup of pat IV.

The Crossroad Theorem tells us that cycles cannot be too dense in a structure diagram.

For example, we have a corollary to the theorem, named after the symbol that appears on

all Purina pet foods.

Purina Corollary. Under our assumptions, there is no “Purina Tetraflexagon” with the

structure diagram shown in Figure 12.14.

Surgery: adding new parts to an old flexagon

So far, we have a very general idea of what can and cannot be part of a structure diagram.

On the other hand, it would be desirable to add a given component to a structure diagram,

i

i

“master” — 2012/11/15 — 11:23 — page 99 — #113i

i

i

i

i

i


Figure 12.14. Purina structure diagram.

allowing us to tailor-make tetraflexagons with interesting properties. There is a technique

we learned from Harold McIntosh’s online notes [7] that allows us to do exactly that at

an appropriate spot in a structure diagram. We call this technique surgery. One performs

surgery on a tetraflexagon by symmetrically grafting two strips of squares to the model to

add the new feature.

An instance of surgery is shown in Figure 12.15. Start with a tetraflexagon at a face

where one of the pats is a single leaf, that is, a corner square. Place this tetraflexagon so

the single leaves are in pats II and IV with the foldable edge forward, as in the top left

picture in Figure 12.15. By considering mirror images, if necessary, we may assume that

the pocket in pat III is on the bottom of the pat. Tape a 2 � 1 strip to the right edge at pat

IV and make a cut in the tetraflexagon along the positive x-axis as in the top right picture.

Fold the strip on top of pat IV, then fold the strip up and in half as in the bottom left picture.

Finally, tape the top edge of the strip (in pat IV) to the cut edge in pat I as in the bottom

right picture. Perform the same procedure symmetrically on pat II; when you are done,

the front and back foldable edges will have pockets on both top and bottom. The resulting

tetraflexagon’s structure diagram will be the same as before, but with a cycle tacked on at

the appropriate face.

Following this technique, it is straightforward, at least in theory, to build a tetraflexagon

whose structure diagram consists of an arbitrarily long row of four-cycles connected one

to the other at opposite corners. (In light of the Crossroad Theorem, in some sense this is

Figure 12.15. How to add a cycle to a tetraflexagon.

i

i

“master” — 2012/11/15 — 11:23 — page 100 — #114i

i

i

i

i

i


Figure 12.16. A challenging structure diagram.

as crowded as collections of cycles can become.) In practice, the tetraflexagons quickly

become too thick to flex easily as the size of the net increases. We challenge the reader to

use surgery to construct a tetraflexagon with the structure diagram shown in Figure 12.16.

One can also use surgery to add dead ends to a structure diagram. Start with a tetraflex-

agon in the same position as before and make the same cut along the positive x-axis, but

this time add a 1 � n strip horizontally to the right edge of pat IV. Roll the strip counter-

clockwise until it lies on top of pat IV, then tape the top edge of the middle square of the

roll to the cut edge of pat I. Repeat this symmetrically on pat II. This procedure will add n

horizontal segments to the structure diagram at the appropriate place.

To finish this section we mention that besides appearing in Gardner articles [2] and [6],

“square” flexagons are the topic of a short note by Chapman [2] where he uses primary

and secondary colors to distinguish tetraflexagon faces. This note also contains directions

for constructing tetraflexagons whose structure diagrams are a cycle and two linked cycles.

For more information on surgery and tetraflexagons, the reader should consult McIntosh’s

notes [7] or Pook’s book [14].

Where do we go from here?

At this point, we know something about tetraflexagons—how they flex, how and where

cycles can occur, and why some structure diagrams are impossible—but there are many

questions we haven’t answered. What are the possible tetraflexagon structure diagrams?

How many tetraflexagons, up to rotation and mirror image, can be made from an m � n

net with four corners? We can get a rough upper bound by recalling that the corner squares

in a tetraflexagon stay fixed in their quadrant. Flex a tetraflexagon so that one pat consists

of a single leaf and its adjacent pats contain n C m � 1 leaves. With the exception of the

bottom (or top) leaf, these might be in any order, so there are at most .nC m� 2/Š possible

tetraflexagons that can be folded from a given m � n net. Many of these tetraflexagons will

not satisfy our four assumptions; is there a better upper bound? Finally, there is the question

that provided our initial motivation to study flexagons: Are there any other tetraflexagons

besides the one built from the net in Figure 12.10 whose collection of motions forms a

group?

We developed most of our tetraflexagon results before looking through McIntosh’s

material [7], and you can imagine our surprise when we saw directions for the Purina

Tetraflexagon in his notes! McIntosh introduced this creature in a discussion on surgery,

and we were no less surprised when we constructed it and confirmed that it worked. The

apparent discrepancy between beautiful theory and ugly counterexample was explained

i

i

“master” — 2012/11/15 — 11:23 — page 101 — #115i

i

i

i

i

i


once we took a scissors to the flexagon and opened it up; in the net, every one of the

24 leaves was a corner square! Clearly, in order to do surgery while maintaining the net

assumption, one must be careful that the transplanted leaves do not add any more corner

squares.

More importantly, this example shows that there are many interesting tetraflexagons

that do not satisfy the net assumption. In fact, perhaps our favorite flexagon violates both

the net and symmetry assumptions, as it is folded from regular pentagons. Of course, this

flexagon cannot be folded flat, but we were impressed that such an object could exist at all.

This flexagon, and many others, are described in McIntosh’s notes [7] as well as Pook’s

book [14]. We highly recommend these references, as well as an impressive report by

Conrad and Hartline [1] (also found in [7]) for the reader interested in deepening his or her

background about flexagons and their kin.

We have found flexagons to be interesting mathematical objects at many levels. Since

they are easily folded, they are a good addition to an introductory mathematics class, giving

students an opportunity to look for patterns and explore a topic at their own pace. In a

more advanced setting, such as a course in geometry or algebra, flexagons can be used

to introduce notions of symmetry and transformation. They are also an excellent topic of

study for a senior thesis, as most of the flexagon materials are written at an elementary level

and are appropriate for a student’s introduction to the reading of mathematical articles.

And, of course, there are many open questions that are easily asked but difficult to answer.

Despite their long history, flexagons still give an exciting twist to an otherwise boring strip

of paper, and are well worth a little study.

Acknowledgments We would like to thank Harold McIntosh for informative correspon-

dence, and Liz McMahon, Kyra Berkove, and the anonymous referees, whose comments

greatly improved the exposition of this paper.

Bibliography

[1] Anthony Conrad and Daniel Hartline, Flexagons, RIAS Technical Report 62-11, Baltimore,

MD, 1962.

[2] P. B. Chapman, Square flexagons, Math. Gaz. 45 (1961) 192–194.

[3] Antonio De Queioroz, The HexaFind Program,

www.coe.ufrj.br/˜acmq/hexaflexagons.

[4] Martin Gardner, Hexaflexagons and Other Mathematical Diversions, Univ. of Chicago Press,

Chicago, 1988.

[5] ——, The 2nd Scientific American Book of Mathematical Puzzles and Diversions, Simon and

Schuster, New York, 1961.

[6] ——, Mathematical games, tetraflexagons and tetraflexigation, Scientific American, May 1958,

122–126.

[7] Harold McIntosh, Flexagons,

delta.cs.cinvestav.mx/˜mcintosh/oldweb/pflexagon.html.

[8] Michael Gilpin, Symmetries of the trihexaflexagon, Math. Mag. 49 (1976) 189–192.

[9] Peter Hilton, Jean Pedersen, and Hans Walser, The faces of the trihexaflexagon, Math. Mag. 70

(1997) 243–251.

www.coe.ufrj.br/~acmq/hexaflexagons

http://www.coe.ufrj.br/~acmq/hexaflexagons

delta.cs.cinvestav.mx/~mcintosh/oldweb/pflexagon.html

http://delta.cs.cinvestav.mx/~mcintosh/oldweb/pflexagon.html

i

i

“master” — 2012/11/15 — 11:23 — page 102 — #116i

i

i

i

i

i


[10] Joseph Madachy, Mathematics on Vacation, Charles Scribner, New York, 1966.

[11] C. O. Oakley and R. J. Wisner, Flexagons, Amer. Math. Monthly 64 (1957) 143–154.

[12] Thomas O’Reilly, Classifying and counting hexaflexagrams, J. Recreat. Math. 8 (1975) 182–

187.

[13] Jean Pedersen, Sneaking up on a group, Two Year College Math. J. 3 (1972) 9–12.

[14] Les Pook, Flexagons Inside Out, Cambridge Univ. Press, Cambridge, 2003.

[15] Roger Wheeler, The flexagon family, Math. Gaz. 42 (1958) 1–6.

i

i

“master” — 2012/11/15 — 11:23 — page 103 — #117i

i

i

i

i

i

13The V-flex, Triangle Orientation, and

Catalan Numbers in Hexaflexagons

Ionut E. Iacob, T. Bruce McLean, and Hua Wang

Using only the pinch flex, the flex described by Martin Gardner in [4], the six triangles of

each hexagonal face of a hexaflexagon stay together. If the faces are colored, the face facing

up is always monochrome. To scramble the triangles and mix the colors, you need other

flexes. In this paper, we describe the V-flex. With the V-flex, faces become multi-colored

when flexed. It takes persistence to master, but the V-flex is worth it. A hexahexaflexagon

has only 9 mathematical faces under the pinch flex; with the V-flex it has 3,420.

We conjecture that many people performed a V-flex accidentally as they read Martin

Gardner’s 1956 article and had no expert help. In 1963, Bruce McLean discovered the

V-flex and, in 1978, provided a graph of the faces. He presented it to Arthur Stone at an

MAA meeting that year and was told “I never allowed my students to do that!” In that

paper, McLean approached the V-flex systematically as a new flex. The view that the V-

flex is somehow wrong, however, persists to the present. In 2011, a referee of this paper

wrote, “I’ve taken the majority view, that it was an illegal move and that my only purpose

in understanding it was to fix ‘broken’ flexagons.” The V-flex is now understood as a well-

defined operation (see [8] for an overview) that adds considerable complexity to an already

interesting object. After the meeting with McLean, Stone expressed pleasure that order still

prevailed under the V-flex. We hope the reader will be pleased too.

V-flexing

Begin by constructing a hexahexaflexagon (see Figure 12.4 in the previous chapter). For

clarity, we prefer to number all triangles in the strip of paper from left to right on the front

and back of the paper as in our Figure 13.1, and add arrows pointing to each triangle edge

not shared with another triangle. The tab is helpful for construction, but not required.

Fold the triangle UP 2 under the 1 and continue to spiral in the same direction to obtain

a shorter strip of 9 triangles as in Figure 13.1c. Next, fold triangle UP 5 under the 4, UP

11 under the 10, and UP 17 under the 16. Finish by taping the tab to UP 1. The result

is the hexahexaflexagon shown in Figure 13.2a. Note that each triangular region of the

hexahexaflexagon, called a pat, has a thickness, i.e., the number of triangles. This number


103

i

i

“master” — 2012/11/15 — 11:23 — page 104 — #118i

i

i

i

i

i


Face UP Face DOWN1 2 3 17 18 1 2 17 18

tab

(a) (b)

1

4

5 17

16

(c)

Figure 13.1. Constructing a hexahexaflexagon.

is the degree of the pat. The strip would continue straight and a hexagon could not be

constructed if the degree of any pat was divisible by three, hence pat degrees are always

congruent to 1 or 2 (mod 3).

Pat 2

Pat 1

Pat 6

Pat 5

Pat 4

4u

1u16u

13u10u

7

6

4

116

14

13

3

UP

(a) (b)

Figure 13.2. Starting the V-flex.

For the V-flex, separate Pat 2 with your left thumb into two halves (see Figure 13.2a).

It may help to fold the whole flexagon in half, so that 7, 10 and 13 are on one side. You

may also need your right thumb to assist in separating Pat 5. The flexagon should begin to

open (Figure 13.2b) and eventually look like Figure 13.3a (as seen from the top). Note that

Figure 13.3a is three-dimensional: an inverted pyramid with a square base and tetrahedra

Pat 2

Pat 1

Pat 6

Pat 5Pat 3

Pat 4

76

3

14

13

16

6

18

15

14

113

(a) (b)

Figure 13.3. Completing the V-flex.

i

i

“master” — 2012/11/15 — 11:23 — page 105 — #119i

i

i

i

i

i

13. The V-flex, Triangle Orientation, and Catalan Numbers in Hexaflexagons 105

on two opposite faces. Pinch 13 with your right thumb on top and pull it towards you while

pushing 6 away from you. Rotating 13 under 11 completes the flex (see Figure 13.3b). (An

online video may be helpful; see [9].)

Note that a V-flex is not possible if either Pat 2, 5, or 6 contains only 1 triangle. To

return to the original, flip the flexagon on the horizontal axis, rotate 60 degrees clockwise,

and V-flex again. Enjoy!

Triangle orientation

Many flexagon lovers wrote Martin Gardner [5] to point out that numbers alone are not

enough to describe a flexagon configuration completely. A triangle can come up face up

or face down and with different orientations, that is, the arrow mentioned in the previous

section may point in different directions within the face.

We define the state of a triangle as the following three pieces of information: (i) face

position: up (U) or down (D); (ii) triangle number: even (E) or odd (O); and (iii) triangle

orientation (the arrow’s direction): outside the hexagon (O), towards the next pat in the

face counterclockwise (N), or towards the previous pat in the face (P). There are thus 12

distinct states. For instance, the state UOO means that the triangle is facing up, is odd-

numbered, and the arrow points outwards in the hexaflexagon; the state DOP means a face

down triangle, odd-numbered, and with the arrow pointing clockwise.

Simple observations lead to the fact that from one triangle to the next, in increasing

numerical order, parity changes (evidently), but facing up or down changes only if both

triangles are in the same pat. We say that a transition from one triangle to the next one

is a folding transition (FT), if both triangles are in the same pat; or a pat transition (PT),

if they are in consecutive pats. Then, it is easy to observe that, for instance, from a DON

triangle we can only have a folding transition (FT) to the next triangle in the sequence,

which therefore must be UEO (U because it is in the same pat; E because it comes after

an odd numbered triangle; and O as one can determine by looking at the original strip

of paper).

We summarize these observations with the labeled, directed graph in Figure 13.4,

where each edge is either a FT (folding transition) or a PT (pat transition). The diagram

shows clearly which states allow a FT or PT, or both, and that the number of triangles

folded in consecutive pats forms a sequence consisting of numbers congruent alternately

to 1 and 2 (mod 3).

Given any snapshot of triangle numbers in a pat, we can easily determine the complete

state (with orientation) of every triangle. For instance, consider the pat consisting of trian-

gles 2, 1, 4, 3 (from top to bottom) in a hexahexaflexagon. Triangle 1 must be at a tip of

a PT edge (hence in either DOO or UON states) and must begin a path of three FT edges

leading to a state that allows the next PT (since the pat contains 4 triangles). This puts

triangle 1 in DOO state. Then, as we move to triangle 2, we must follow a FT edge (it is

the same pat), which places triangle 2 in UEP state. We continue with a FT to 3 in DON,

and another FT to 4 in UEO.

Catalan numbers in flexagons

The enumeration of pat classes (under a natural equivalence relation according to how the

degree splits, see [1]) enjoys a beautiful connection to the well-known Catalan numbers,

i

i

“master” — 2012/11/15 — 11:23 — page 106 — #120i

i

i

i

i

i


UEO

DEO

UOO

UOP

DEN

UON

DEP

DOP

UEN

DON

UEP

DOO

FT FT FT FT

FT FT

FT FT

FT FT

PT PT

FT FT

PT PT

Figure 13.4. Triangle state diagram.

which we proceed to describe. A pat of degree n; n > 1, splits naturally where a “thumb-

hole” is formed, the one place in the physical pat where the thumb can be inserted without

encountering a pocket [4]. This divides the pat into two parts whose degrees sum to n.

Thus each pat class can be represented as a partition of n into an ordered sum of two pos-

itive integers representing the degrees of its two principle parts. Like n, these integers are

equivalent either to 1 or to 2 (mod 3). Then each part in this partition is in turn a sum of

two such parts and so on, until every part equals 1. For example, a pat class of degree 10

may be represented as follows:

10 D 2 C 8

D .1 C 1/ C .1 C 7/

D .1 C 1/ C .1 C .5 C 2//

D .1 C 1/ C .1 C ..1 C 4/ C .1 C 1///

D .1 C 1/ C .1 C ..1 C .2 C 2// C .1 C 1///

D .1 C 1/ C .1 C ..1 C ..1 C 1/ C .1 C 1/// C .1 C 1///:

From a combinatorial point of view, such a representation assigns pairs of parenthesis

to the sum of n 1’s keeping the sum within each pair of parenthesis either 1 or 2 modulo 3.

The number of pat classes is the number of ways to complete this process. The well known

Catalan numbers (which Martin Gardner also popularized [3]) count, among other objects

(including planar trees, lattice paths, triangulations of a polygon), exactly this quantity

without the (mod 3) constraint. The number of pat classes can therefore be analyzed using

the same combinatorial tools used for the Catalan numbers (see [2]). In the recent paper of

Anderson et al. [1], a pair of recursive relations for the number of pat classes with a given

degree is given, namely

bn DnX

iD0

ai an�i and an Dn�1X

iD0

bibn�i�1;

i

i

“master” — 2012/11/15 — 11:23 — page 107 — #121i

i

i

i

i

i

13. The V-flex, Triangle Orientation, and Catalan Numbers in Hexaflexagons 107

where an is the number of pat classes of degree 3n C 1 and bn is the number of pat classes

of degree 3n C 2. The reader may have already noticed that when partitioning a number

of the form 3n C 1, the two parts must be of the form 3n C 2, and vice versa. This is the

reason for this bi-variable recursion.

The enumeration of various objects related to the flexagons that Martin Gardner intro-

duced to a wide audience in 1952 is now an interesting area of research. Both sequences

fang and fbng are in the On-Line Encyclopedia [11]. Among other interesting concepts

associated with an are the number of 4-ary (or quartic) trees with 3n C 1 leaves. It turns

out that

an D 1

3n C 1

4n

n

!

and bn D 2

3n C 2

4n C 1

n

!

:

(For more on the combinatorics of flexagons, see [1] and [4].)

Bibliography

[1] T. Anderson, T. B. McLean, H. Pajoohesh, C. Smith, The combinatorics of all regular

flexagons, European J. Combin. 31 (2010) 72–80; available at dx.doi.org/10.1016/

j.ejc.2009.01.005.

[2] R. Brualdi, Introductory Combinatorics, 5th ed., Pearson, 2008.

[3] M. Gardner, Catalan Numbers, Time Travel and Other Mathematical Bewilderments, W. H.

Freeman, 1987.

[4] ——, Hexaflexagons, College Math. J., 43 (2011) 2–5; available at dx.doi.org/

10.4169/college.math.j.43.1.002.

[5] ——, Hexaflexagons and Other Mathematical Diversions, Cambridge Univ. Press, 2008.

[6] I. E. Iacob, T. B. McLean, and H. Wang, About general order regular flexagons (submitted).

[7] T. B. McLean, V-Flexing the hexahexaflexagon, Amer. Math. Monthly 86 (1979) 457–466;

available at dx.doi.org/10.2307/2320415.

[8] ——, Flexagons; available at

math.georgiasouthern.edu/˜bmclean/flex/index.html.

[9] R. Moseley, Flexagon.net; available at

www.flexagon.net/index.php?option=com_content&task=view&id=22.

[10] C. O. Oakley and R. J. Wisner, Flexagons, Amer. Math. Monthly 64 (1957) 143–154; available

at dx.doi.org/10.2307/2310544.

[11] N. Sloane, The On-Line Encyclopedia of Integer Sequences; available at oeis.org/.

dx.doi.org/10.1016/

http://dx.doi.org/10.1016/j.ejc.2009.01.005

j.ejc.2009.01.005


dx.doi.org/

http://dx.doi.org/10.4169/college.math.j.43.1.002

10.4169/college.math.j.43.1.002


dx.doi.org/10.2307/2320415

http://dx.doi.org/10.2307/2320415

math.georgiasouthern.edu/~bmclean/flex/index.html

http://math.georgiasouthern.edu/~bmclean/flex/index.html

www.flexagon.net/index.php?option=com_content&task=view&id=22

http://www.flexagon.net/index.php?option=com_content&task=view&id=22

dx.doi.org/10.2307/2310544

http://dx.doi.org/10.2307/2310544

oeis.org/

http://oeis.org/

i

i

“master” — 2012/11/15 — 11:23 — page 108 — #122i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page 109 — #123i

i

i

i

i

i

14From Hexaflexagons to Edge

Flexagons to Point Flexagons

Les Pook

Edge flexagons

Hexaflexagons, the subject of Martin Gardner’s column [5], were the first family of flexagons

to be discovered. They are, however, only one example of an edge flexagon, a folded band

of hinged, usually identical, convex polygons (called leaves). The strip of polygons used to

construct a flexagon is called a net. In a main position an edge flexagon has the appearance

of a ring of hinged polygons (see Figure 14.1). The rings are not necessarily flat and what

look like single polygons are folded piles of one or more leaves, called pats. By defini-

tion, edge flexagons can be flexed to other main positions to display other pairs of faces,

usually in a cyclic order. When flexing, leaf-bending is sometimes allowed, but in main

positions the leaves are flat. The preceding chapter in this part concerns hexaflexagons

(Figure 14.1a). Here we describe some non-hexagonal forms and say something about

their classification.

(a) Trihexaflexagon. (b) Square flexagon. (c) Pentagonal flexagon.

Figure 14.1. Main positions of some edge flexagons.

Workable paper models of flexagons are easy to make; however, the mathematics is

complex and just how a flexagon works is not immediately obvious from casual examina-

tion of a paper model. Even square flexagons (Figure 14.1b), the second family of flexagons

to be discovered and the subject of Chapter 12, are far from simple. Martin Gardner wrote

[6], “Stone and his friends spent considerable time folding and analyzing these four-sided


109

i

i

“master” — 2012/11/15 — 11:23 — page 110 — #124i

i

i

i

i

i


forms, but did not succeed in developing a comprehensive theory that would cover all their

discordant variations,” a significant statement as Stone’s friends included John Tukey and

Richard Feynman.

Classification of edge flexagons

This is a difficult subject, as Martin Gardner’s remark indicates. Certain edge flexagons,

however, can be classified on the basis of their characteristic polygon, which describes

how the hinge edges are linked by the leaves and is related to the vertex figures used to

characterize polyhedra [1]. In particular, by describing how the leaves are linked, the char-

acteristic polygon contains most of the information needed to construct the flexagon. For

those edge flexagons that have a characteristic polygon, classification amounts to classifi-

cation of all possible types of characteristic polygons.

We give two examples (in Figures 14.2 and 14.3) of edge flexagons and their char-

acteristic polygons. To make paper models of any of the flexagons described here, use

heavy paper stock and enlarge the nets to about twice the printed size. Transfer the num-

bers in parentheses from the upper side to the lower side and delete the numbers from the

upper side.

In Figure 14.2a, we have the net for a square edge flexagon. To assemble this model,

fold together pairs of leaves numbered 3 and 4, and join the ends (dashed lines). As assem-

bled the flexagon is in main position (Figure 14.1b). This is flat and consists of alternate

pats of single leaves and fan folded piles of three leaves. The flexagon can be flexed around

a 4-cycle by folding it in two (to an intermediate position) and then unfolding into a new

main position, and so on. The operation of this flexagon is similar to the traditional Chinese

falling block toy sometimes called Jacob’s Ladder [8].

3(2)

1(2)

1(4)

3(4)

3(2)

1(2)

1(4)

3(4)

a a

aa

a ab b

b b

b b

c c

c

c c

c

cd

a

bc

d

d

d d

d d

(a) Net.

(b) Characteristic

polygon.

Figure 14.2. A square edge flexagon.

The characteristic polygon is obtained by joining the midpoints of the hinge edges on

each leaf in the order they are linked by the leaves. This results in a twice-traced square

(Figure 14.2b). Incidentally, joining end-to-end two copies of the net shown in Figure 14.2a

results in the net for another square edge flexagon, the octopus flexagon. Main positions are

not flat, but it can still be flexed around a 4-cycle using a pinch flex similar to the flex used

for the trihexaflexagon. Other flexes are possible, and its overall behaviour is bewilderingly

complex [6].

There are two regular pentagons: the usual convex one and the pentagram. This gives

rise to two series of pentagonal flexagons, one of whose net is in Figure 14.3a. To assemble,

i

i

“master” — 2012/11/15 — 11:23 — page 111 — #125i

i

i

i

i

i

14. From Hexaflexagons to Edge Flexagons to Point Flexagons 111

3(2)

1(2)

5(4)

3(4) 5(1)

aa

a

a

bb

b

b c

cc

c

dd

d

dd

ea

b

c

d

e

ee

(a) Half the net.

(b) Characteristic

polygon.

Figure 14.3. A pentagonal edge flexagon.

first make two copies of Figure 14.3a, turn one over and join them end-to-end. Next, fold

together pairs of leaves numbered 2, 3, 4 and 5, and join the ends. Assembled, the flexagon

is in an intermediate position which looks like a pair of pentagons with a common edge.

From here, it can be opened into a type of main position (Figure 14.1c), and then closed

into another main position in one continuous aesthetically satisfying movement, and so

on around a 5-cycle. It can also be opened into another type of main position, but flexing

round a 5-cycle is then complicated [5]. This is a difficult flexagon to handle.

Point flexagons

Point flexagons are the duals of edge flexagons. They are a recent discovery, first described

by Scott Sherman in 2007 [5]. Nets are derived by truncating the leaves of the net of an

edge flexagon at the vertices, so that vertices become edges and edges become vertices

(see Figure 14.4). Edge hinges, therefore, become point hinges, usually modelled by short

paper strips. By definition, a point hinge has only two degrees of freedom. The triangles in

Figure 14.4 may rotate relative to each other about the y-axis, and the z-axis, but not be

twisted relative to the x-axis.

Edge hinge Point hinge

x

y

Figure 14.4. Truncating the edge-hinged triangles (dashed) creates point-hinged triangles (solid).

Figure 14.5a gives a net for a point flexagon with three faces. To assemble, crease the

lines between the leaves to form hinges. Fold pairs of leaves numbered 2 and 3 together and

glue the two part paper strips together at A-A. Assembled, the visible leaves, above and

below, will both be numbered 1. The flexagon can be flexed around a 3-cycle by turning

over the top leaf so that it becomes the bottom leaf, and so on, rather like the way cards

are flipped in a Rolodex rotary file. This is analogous to the 3-cycle of the trihexaflexagon

described by Martin Gardner [5]. The flex is not completely smooth since the hinge folds

must be reversed and they are not parallel to the axis of rotation.

i

i

“master” — 2012/11/15 — 11:23 — page 112 — #126i

i

i

i

i

i


1(2)

3(2)

3(1)

A A

(a) Net. (b) Assembled.

Figure 14.5. A triangle point flexagon (dual to the trihexaflexagon).

Conclusion

Martin Gardner opened a Pandora’s box when he introduced flexagons to a wide audience.

Fifty years later, they are a mature topic with an extensive literature. They continue to fas-

cinate with new results and new flexagons still being discovered and still to be discovered.

Acknowledgment Thanks to the anonymous reviewers for helpful comments. The fig-

ures are copyrighted by the author.

Bibliography

[1] H. S. M. Coxeter, Regular Polytopes, 2nd ed., The Macmillan Company, New York, 1963.

[2] M. Gardner. Hexaflexagons, College Math. J. 43 (2012) 2–5; available at

dx.doi.org/10.4169/college.math.j.43.1.002.

[3] ——, Mathematical games: tetraflexagons and tetraflexigation, Scientific American, May 1958,

122–126; available at dx.doi.org/10.1038/scientificamerican0558-122.

[4] E. Iacob, T. B. McLean, and H. Wang, The V-flex, College Math. J. 43 (2012) 6–10; Chapter 13

in this volume and available at dx.doi.org/10.4169/college.math.j.43.1.006.

[5] L. P. Pook. Serious Fun with Flexagons. A Compendium and Guide. Springer, Dordrecht, 2009.

[6] ——, Flexagons; available at www.pook.org.uk; accessed 11 April 2011.

[7] S. Sherman. Point Flexagons; available at loki3.com/flex/point-flexagon.html,

Accessed 17 April 2007.

[8] Wikipedia, Jacob’s Ladder, Wikipedia: the free encyclopedia; available at

en.wikipedia.org/wiki/Jacob%27s_ladder_%28toy%29.

dx.doi.org/10.4169/college.math.j.43.1.002


dx.doi.org/10.1038/scientificamerican0558-122

http://dx.doi.org/10.1038/scientificamerican0558-122

dx.doi.org/10.4169/college.math.j.43.1.006


www.pook.org.uk

http://www.pook.org.uk

loki3.com/flex/point-flexagon.html

http://loki3.com/flex/point-flexagon.html

en.wikipedia.org/wiki/Jacob%27s_ladder_%28toy%29

http://en.wikipedia.org/wiki/Jacob%27s_ladder_%28toy%29

i

i

“master” — 2012/11/15 — 11:23 — page 113 — #127i

i

i

i

i

i

15Flexagons Lead to a

Catalan Number Identity

David Callan

Introduction

Flexagons are remarkable objects obtained by suitably folding and glueing a strip of paper.

Several websites give detailed instructions on how to construct various kinds. After con-

struction, the flexagon lies flat with a number of faces, delineated by creases in the paper,

visible on top and bottom. The name derives from the fact that they can be pinched, or

flexed, in various ways into a 3-dimensional shape and then flattened again to reveal differ-

ent faces. First discovered by Arthur Stone at Princeton in 1939, they were popularized by

Martin Gardner [2] in his 1956 debut column in Scientific American and continue to attract

mathematical interest [1].

At about the same time as Gardner’s column appeared, Oakley and Wisner [4] consid-

ered the variety now called hexahexaflexagons—hexagonal shape when flat, with 6 trian-

gular faces visible. They showed that these flexagons can be constructed recursively using

a critical fold that splits a flexagon into two smaller ones. In particular, they are equinu-

merous with Dyck paths, counted by the Catalan numbers Cn D 1nC1

�2nn

�

[5]. A Dyck

path of size n is a lattice path of n upsteps .1; 1/ and n downsteps .1; �1/ that starts at the

origin and never goes below ground level. The first downstep that returns the path to the

x-axis splits a Dyck path into two smaller ones, in complete analogy to the decomposition

of flexagons into smaller ones. Oakley and Wisner used this decomposition to show that

hexahexaflexagons can be encoded by permutations called pats. Pats themselves are de-

fined recursively, with permutations represented as lists. A singleton permutation is a pat,

and a permutation p of length n � 2 is a pat if and only if (i) there is a unique split point

that divides p into subpermutations p1 and p2 such that all entries in p1 are greater than

all entries in p2, and (ii) the reverse of each of p1 and p2 is a pat. The only pat on [[2]] is

21; the pats on [[3]] are 231 and 312, and on [[4]] are 2431, 3241, 3412, 4132, 4213. The

number of pats on ŒŒn C 1�� is Cn.

The purpose of this note is to find the distribution of the statistic “number of descents”

on pats (a descent in a permutation p is a pair of adjacent entries .pi ; piC1/ with pi >

piC1). To do so, we establish a recurrence that leads to a quartic equation for the generating

Reprinted from The American Mathematical Monthly, Vol. 119, No. 5 (May 2012), pp. 415–419.

113

i

i

“master” — 2012/11/15 — 11:23 — page 114 — #128i

i

i

i

i

i


function. Lagrange inversion then yields a simple closed form for the number of pats on

ŒŒn�� with k descents, thereby giving a combinatorial interpretation of the identity

nX

kD0

1

2n � 2k C 1

2n � 2k C 1

k

!

2k

n � k

!

D Cn: (15.1)

The last section exhibits a bijection from pats to full binary trees showing that descents on

pats are distributed as even-level vertices on binary trees and concludes with a generaliza-

tion of (15.1).

Lagrange inversion and the distribution of descents on pats

From its definition, a pat p of length n determines two pats p1; p2 of lengths i and n � i

respectively for some i 2 Œ1; n � 1�. The connecting entries from p1 to p2 contribute a

descent to p and the remaining descents of p correspond to the ascents in p1 and p2.

Hence we obtain the recurrence

u.n; k/ Dn�1X

iD1

i�1X

j D0

u.i; j /u.n � i; n � j � k � 1/ for n � 2 and k � 0

for the number u.n; k/ of pats on Œn� with k descents, with u.1; 0/ D 1. The recur-

rence leads directly to a functional equation for the generating function F.x; y/ WDP

n�1; k�0 u.n; k/xnyk ,

F.x; y/ D x C 1

yF

�

xy;1

y

�2

: (15.2)

Iterating (15.2) once yields the algebraic equation

F D x C y�

x C F 2�2

; (15.3)

where F.x; y/ is now abbreviated to F .

The Lagrange inversion formula is a powerful technique for solving functional equa-

tions such as (15.3). In its basic form, it says that for a function f .x/ analytic about the

origin and nonzero at the origin, the coefficient of xn in the (compositional) inverse f ı of

f is given by Œxn�f ı.x/ D 1nŒxn�1�g.x/n where g is defined by f .x/ D x=g.x/. The

standard analytic proof (see [6, Chapter 5.1], for example) is based on the special property

of the case n D �1 in the contour integral of zn around the origin; it is the only value

of n that yields a nonzero integral. This is a little mysterious, but in the context of for-

mal power series, there is a transparent combinatorial proof involving ordered (plane) trees

with weighted edges (see [5, Theorem 5.4.2], for example). To apply the Lagrange inver-

sion formula, one typically formulates an equation of the form y D x=g.x/ and wishes to

solve for x in terms of y. With f .x/ WD x=g.x/, we then have x.y/ D f ı.y/ and so the

formula applies (with a little care not to get x and y mixed up).

Direct application of the formula to solve (15.3) is cumbersome but a trick shown

to me by Ira Gessel (see [3]) rapidly solves it. Introduce a new variable z and consider

i

i

“master” — 2012/11/15 — 11:23 — page 115 — #129i

i

i

i

i

i

15. Flexagons Lead to a Catalan Number Identity 115

F D F.x; y; z/ defined by

F D z�

x C y�

x C F 2�2�

: (15.4)

Equation (15.4) has the form z D F=�.F /where

�.F / D x C y�

x C F 2�2

;

and so Lagrange inversion says that

h

ykz2j C1i

F.x; y; z/ D 1

2j C 1

h

ykF 2ji

�.F /2j C1

D 1

2j C 1

2j C 1

k

!

2k

j

!

xj CkC1: (15.5)

The coefficient of xnyk in F.x; y; 1/ is obtained by setting j D n � k � 1 in (15.5),

yielding

u.n; k/ D Œxnyk �F.x; y/

D Œxnyk �F.x; y; 1/

D 1

2n � 2k � 1

2n � 2k � 1

k

!

2k

n � k � 1

!

;

and hence identity (15.1) after replacing n by n C 1. Here are the first few values of

u.n; k/.Table 15.1. Values of u.n; k/, the distribution of descents on pats

nnk 0 1 2 3 4 5 6

1 1 0 0 0 0 0 0

2 0 1 0 0 0 0

3 0 2 0 0 0 0 0

4 0 1 4 0 0 0 0

5 0 0 12 2 0 0 0

6 0 0 12 30 0 0 0

7 0 0 4 100 28 0 0

8 0 0 0 140 280 9 0

9 0 0 0 90 980 360 0

10 0 0 0 22 1680 2940 220

Pats as trees

A pat on ŒŒn C 1�� can be represented, using its successive split points, by a vertex-labeled

full binary tree on 2n edges as illustrated in Figure 15.1.

i

i

“master” — 2012/11/15 — 11:23 — page 116 — #130i

i

i

i

i

i


4

7 5 6

2 1

3

479856213

479856

79856

798

98

56

213

21

Figure 15.1. Full binary tree for pat 4 7 9 8 5 6 2 1 3, interior vertices at even level are enlarged.

In fact, the labels are unnecessary; they can be uniquely recovered from the underlying

tree. So we have a bijection from pats on ŒŒn C 1�� to full binary trees on 2n edges. Under

this bijection, a descent in the pat shows up as an interior vertex at even level in the tree

(where rain water would collect between the two leaves corresponding to the descent).

Pruning the leaf edges in a full binary tree to obtain a binary tree, we obtain the follow-

ing.

Theorem. Descents on pats are distributed as even-level vertices in binary trees.

Similar considerations for ternary and higher order trees yield a generalization of (15.1),

nX

kD0

1

rn � rk C 1

rn � rk C 1

k

!

rk

n � k

!

D 1

rn C 1

rn C 1

n

!

for r � 2.

Acknowledgments I thank Ira Gessel for suggesting the Lagrange inversion technique

used in the second section.

Bibliography

[1] T. Anderson, T. B. McLean, H. Pajoohesh, C. Smith, The combinatorics of all regular

flexagons, European J. Combin. 31 (2010) 72–80; available at dx.doi.org/10.1016/

j.ejc.2009.01.005.

[2] M. Gardner, Hexaflexagons and Other Mathematical Diversions: The First Scientific American

Book of Puzzles and Games, Univ. of Chicago Press, 1988.

[3] I. Gessel, A combinatorial proof of the multivariable Lagrange inversion formula, J. Combin.

Theory Ser. A 45 (1987) 178–195; available at

dx.doi.org/10.1016/0097-3165(87)90013-6.

[4] C. O. Oakley, R. J. Wisner, Flexagons, Amer. Math. Monthly 64 (1957) 143–154; available at

dx.doi.org/10.2307/2310544.

dx.doi.org/10.1016/


j.ejc.2009.01.005


dx.doi.org/10.1016/0097-3165(87)90013-6

http://dx.doi.org/10.1016/0097-3165(87)90013-6

dx.doi.org/10.2307/2310544

http://dx.doi.org/10.2307/2310544

i

i

“master” — 2012/11/15 — 11:23 — page 117 — #131i

i

i

i

i

i

15. Flexagons Lead to a Catalan Number Identity 117

[5] R. P. Stanley, Enumerative Combinatorics, Cambridge Studies in Advanced Mathematics, Vol.

62, Cambridge Univ. Press, Cambridge, 1999.

[6] H. S. Wilf, Generating Functionology, Academic Press, New York, 1990; also available at

www.math.upenn.edu/˜wilf/DownldGF.html

www.math.upenn.edu/~wilf/DownldGF.html

http://www.math.upenn.edu/~wilf/DownldGF.html

i

i

“master” — 2012/11/15 — 11:23 — page 118 — #132i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page 119 — #133i

i

i

i

i

i

16Convergence of a Catalan Series

Thomas Koshy and Zhenguang Gao

The well known Catalan numbers Cn are named after Belgian mathematician Eugene

Charles Catalan (1814–1894), who found them in his investigation of well-formed se-

quences of left and right parentheses. As Martin Gardner (1914–2010) wrote in Scien-

tific American [2], they have the propensity to “pop up in numerous and quite unexpected

places.” They occur, for example, in the study of triangulations of convex polygons, planted

trivalent binary trees, and the moves of a rook on a chessboard [1, 2, 3, 4, 6].

The Catalan numbers Cn are often defined by the explicit formula Cn D 1nC1

�2nn

�

,

where n � 0 [1, 4, 6]. Since .nC1/ j�

2nn

�

, it follows that every Catalan number is a positive

integer. The first five Catalan numbers are 1, 1, 2, 5, and 14. Catalan numbers can also be

defined by the recurrence relation CnC1 D 4nC2nC2

Cn, where C0 D 1. So limn!1

CnC1

CnD 4.

Here we study the convergence of the seriesP1

nD01

Cnand evaluate the sum. Since

limn!1

CnC1

CnD 4, the ratio test implies that the series

P1nD0

xn

Cnconverges for jxj < 4. Con-

sequently, the seriesP1

nD01

Cnconverges. We evaluate this infinite sum using generating

functions, plus fundamental tools from the differential and integral calculus.

Sum of the series1P

nD0

1Cn

To this end, let f .x/ be the generating function of the reciprocals of Catalan numbers,

f .x/ DP1

nD0xn

Cn. We compute the sum in three steps. First, we find an ordinary differen-

tial equation satisfied by f .x/, then, after solving the differential equation in the interval

.0; 4/, we compute f .1/.

We first rewrite f .x/ as

f .x/ D 1 C1X

nD1

xn

Cn

:

So

f 0.x/ D1X

nD1

nxn�1

Cn

D1X

nD0

n C 1

CnC1

xn:

Reprinted from The College Mathematics Journal, Vol. 43, No. 2 (March 2012), pp. 141–146.

119

i

i

“master” — 2012/11/15 — 11:23 — page 120 — #134i

i

i

i

i

i


Since nC2Cn

D 4nC2CnC1

, by the recurrence relation, this yields

1X

nD0

n C 2

Cn

xn D1X

nD0

4n C 2

CnC1

xn;

1X

nD0

n

Cn

xn C 2

1X

nD0

xn

Cn

D1X

nD0

4.n C 1/

CnC1

xn � 2

1X

nD0

xn

CnC1

;

xf 0.x/ C 2f .x/ D 4f 0.x/ � 2

xŒf .x/ � 1�;

x.x � 4/f 0.x/ C .2x C 2/f .x/ D 2: (16.1)

This is a first-order differential equation for f .x/ with the initial conditions f .0/ D 1 Df 0.0/.

To facilitate solving (16.1) for x 6D 0, we introduce the function g.x/ Dˇˇ4�x

x

ˇˇ3=2

.

Then g0.x/g.x/

D �6x.4�x/

. This implies that

Œx.x � 4/g.x/�0 D .2x C 2/g.x/: (16.2)

Multiplying (16.1) by g.x/, we get

x.x � 4/f 0.x/g.x/ C .2x C 2/f .x/g.x/ D 2g.x/:

Using (16.2), this can be rewritten as

Œx.x � 4/f .x/g.x/�0 D 2g.x/:

But, again using (16.2),

fx.x � 4/Œf .x/ � 1�g.x/g0 D Œx.x � 4/f .x/g.x/�0 � Œx.x � 4/g.x/�0

D 2g.x/ � .2x C 2/g.x/

D �2xg.x/;

consequently,

x.x � 4/Œf .x/ � 1�g.x/ D �2

Z

xg.x/ dx C C1

f .x/ D 1 C 2R

xg.x/ dx � C1

x.4 � x/g.x/;

where C1 is a constant.

Suppose 0 < x < 4. Then

Z

xg.x/ dx DZ

x

�4 � x

x

�3=2

dx

DZ

.4 � x/3=2

x1=2dx:

i

i

“master” — 2012/11/15 — 11:23 — page 121 — #135i

i

i

i

i

i

16. Convergence of a Catalan Series 121

Letting x D u2, this implies that

Z

xg.x/ dx D 2

Z

.4 � u2/3=2 du

D1

2u.4 � u2/3=2 C 3u.4 � u2/1=2 C 12 arcsin

u

2C C2

D 1

2

px.4 � x/3=2 C 3

px.4 � x/1=2 C 12 arcsin

px

2C C2;

where C2 is another constant. Therefore, we have

f .x/ D 1 Cp

x.4 � x/3=2 C 6p

x.4 � x/1=2 C 24 arcsinp

x

2C 2C2 � C1

x.4 � x/�

4�xx

�3=2

D 1 Cx.4 � x/3=2 C 6x.4 � x/1=2 C 24

px arcsin

px

2C C

px

.4 � x/5=2;

where C D 2C2 � C1. Since f .0/ D 1 D f 0.0/, C D 0. Thus

f .x/ D 1 Cx.4 � x/3=2 C 6x.4 � x/1=2 C 24

px arcsin

px

2

.4 � x/5=2D

1X

nD0

xn

Cn

:

When x D 1, this yields

1X

nD0

1

Cn

D 1 C 33=2 C 6 � 31=2 C 24 arcsin 1=2

35=2

D 2 C 4p

3

27�: (16.3)

Thus, the seriesP1

nD01

Cnconverges to the limit 2 C 4

p3

27� , which is approximately

2.80613305077. Note that alreadyP22

nD01

Cn� 2:80613305077; so the series converges

to the limit remarkably fast.

Additional consequences of the differential equation

Letting x D 1 in (16.1), we get

3f 0.1/ D 4f .1/ � 2

D 4

2 C 4p

3

27�

!

� 2

f 0.1/ D 2 C 16p

3

81�:

Since f 0.x/ DP1

nD0nC1

CnC1xn, it follows that

1X

nD0

n C 1

CnC1

D 2 C 16p

3

81�:

i

i

“master” — 2012/11/15 — 11:23 — page 122 — #136i

i

i

i

i

i


Since the differential equation is infinitely differentiable, it follows from (16.1) that

x.x � 4/f 00.x/ C .4x � 2/f 0.x/ C 2f .x/ D 0: (16.4)

This yields 3f 00.1/ D 2f 0.1/ C 2f .1/, so f 00.1/ D 83

C 56p

3243

� . Since

f 00.x/ D1X

nD0

.n C 1/.n C 2/

CnC2

xn;

this implies that1X

nD0

.n C 1/.n C 2/

CnC2

D 8

3C 56

p3

243�:

Differentiating (16.4) with respect to x, it follows similarly that f 000.1/ D 2f 0.1/, so

1X

nD0

.n C 1/.n C 2/.n C 3/

CnC3

D 4 C 32p

3

81�:

Clearly, this technique can be employed to evaluate further sums of the form

1X

nD0

.n C 1/.n C 2/ � � � .n C k/

CnCk

;

where k � 1.

Sum of the series1P

nD0

.�1/n

Cn

We now turn to to solving (16.1) for �4 < x < 0. We have

Z

xg.x/ dx DZ

x

ˇˇˇˇ

4 � x

x

ˇˇˇˇ

3=2

dx

D �Z

.4 � x/3=2

px

dx:

Letting x D �u2, this gives

Z

xg.x/ dx D �Z

.u2 C 4/3=2

u.�2u/ du

D 2

Z

.u2 C 4/3=2 du

D 2

"

u.u2 C 4/3=2

4C 3

2up

u2 C 4 C 6 lnˇˇˇu C

p

u2 C 4ˇˇˇ

#

C C3

D 1

2u.u2 C 4/3=2 C 3u

p

u2 C 4 C 12 lnˇˇˇu C

p

u2 C 4ˇˇˇC C3

D 1

2

p

jxj.4 � x/3=2 C 3p

jxj.4 � x/ C 12 ln�p

jxj Cp

j4 � xj�

C C3;

i

i

“master” — 2012/11/15 — 11:23 — page 123 — #137i

i

i

i

i

i

16. Convergence of a Catalan Series 123

where C3 is a constant.

Consequently, as before, we have

f .x/ D 1 C

p

jxj.4 � x/3=2 C 6p

jxj.4 � x/ C 24 ln�p

jxjCp

j4�xjC4

�

x.4 � x/ˇˇ4�x

x

ˇˇ3=2

D 1 �jxj.4 � x/3=2 C 6

p

jxj.4 � x/ C 24p

jxj ln�p

�xCp

4�x

C4

�

.4 � x/5=2:

where C4 is a nonzero constant. Since f .0/ D 1 D f 0.0/, C4 D 2. Thus

f .x/ D 1 �jxj.4 � x/3=2 C 6

p

jxj.4 � x/ C 24p

jxj ln�p

�xCp

4�x

2

�

.4 � x/5=2:

Since f .�x/ generates the alternating seriesP1

nD0.�x/n

Cnfor 0 < x < 4, setting x D 1

we get

1X

nD0

.�1/n

Cn

D 1 � 53=2 C 6 � 51=2 C 24 ln�

55=2

D 14

25� 24

p5

125ln �; (16.5)

where � is the well-known golden ratio 1Cp

52

.

Formulas (16.3) and (16.5) can be employed to compute the sums of the subseriesP1

nD01

C2nand

P1nD0

1C2nC1

:

1X

nD0

1

C2n

D 32

25C 2

p3

27� � 12

p5

125ln �;

1X

nD0

1

C2nC1

D 18

25C 2

p3

27� C 12

p5

125ln �:

Acknowledgment The authors would like to thank the editor and reviewers for their

unwavering enthusiasm, and suggestions for the improvement of the exposition.

Bibliography

[1] D. I. A. Cohen, Basic Techniques of Combinatorial Theory, Wiley, New York, 1978.

[2] M. Gardner, Catalan Numbers: An Integer Sequence that Materializes in Unexpected Places,

Scientific American, 234 (1976), 120–125; available at

10.1038/scientificamerican0676-120.

[3] H. W. Gould, Bell and Catalan Numbers: Research Bibliography of Two Special Number Se-

quences, Revised Edition, Combinatorial Research Institute, Morgantown, West Virginia, 1978.

[4] T. Koshy, Catalan Numbers with Applications, Oxford Univ. Press, New York, 2009.

[5] ——, Fibonacci and Lucas Numbers with Applications, Wiley, New York, 2001.

[6] R. P. Stanley, Enumerative Combinatorics, Vol. 1, Wadsworth & Brooks/Cole, Monterey, Cali-

fornia, 1986.

10.1038/scientificamerican0676-120


i

i

“master” — 2012/11/15 — 11:23 — page 124 — #138i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page 125 — #139i

i

i

i

i

i

IVMaking Things Fit

i

i

“master” — 2012/11/15 — 11:23 — page 126 — #140i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page 127 — #141i

i

i

i

i

i

17L-Tromino Tiling of Mutilated

Chessboards

Martin Gardner

Suppose a standard chessboard is ‘mutilated’ by the removal of two diagonally opposite

corner cells. Can the remaining 62 squares be tiled with 31 dominos? The answer is ‘no’

because the removed squares are the same color. Say the color is white. The remaining

62 squares will have an excess of two black cells. Each domino covers one black and one

white cell. After 30 are placed, two black cells will remain uncovered. They cannot be

adjacent, therefore they can’t be covered by a domino. This famous puzzle, solved by a

simple parity check, is a simple example of a tiling problem on a mutilated chess board.

Less well known is the following related problem. Assume the chessboard is mutilated

by having two cells removed of opposite color from anywhere on the board. Can the re-

maining 62 squares always be tiled by dominoes? The answer is yes, and there is a lovely

proof by Ralph Gomory [2].

Figure 17.1. Gomory’s proof.

Imagine heavy lines drawn on the chessboard as shown in Figure 17.1. They outline

a closed path along which the squares are like beads of alternating color on a necklace.

If any two cells of opposite color are taken from the path, it will cut the path into two

open-ended segments, or one segment if the removed cells are adjacent. Each segment

will consist of an even number of cells of alternating colors, therefore it can be tiled with

Reprinted from The College Mathematics Journal, Vol. 40, No. 3 (May 2009), pp. 162–168.

127

i

i

“master” — 2012/11/15 — 11:23 — page 128 — #142i

i

i

i

i

i

128 Part IV. Making Things Fit

dominoes. Gomory’s clever proof is easily generalized to all square boards with an even

number of cells.

If, instead of dominos, we tile with L-trominos, also called bent, or V, or right trominos,

then all square boards with a number of cells divisible by 3 can be tiled except for the 3 � 3

board. We will not be concerned with such ‘whole’ boards, but only with mutilated boards

with a number of cells that is a multiple of 3 after a single cell has been removed from any

spot on the board. We will call such boards deficient. In other words, a board of side n is

deficient if n2 � 1 is a multiple of 3, i.e., n is not a multiple of 3. The sides of such boards

form the sequence

2; 4; 5; 7; 8; 10; 11; 13; 14; : : : .�/

We will call these numbers the orders of a board and, from now on, the word tromino will

mean an L-tromino exclusively.

Our basic question is this: What deficient boards with sides in the sequence .�/ can be

tiled without gaps or overlaps with L-trominos after a cell has been taken from anywhere

on the board? We will take up these boards roughly in numerical order, culminating with a

statement of the complete solution.

Powers of 2

Consider the order-2 board first. It obviously is tilable with any cell missing (see Figure

17.2, left). Figure 17.2, right, shows how the order-4 can be tiled. The 2 � 2 square takes

care of a missing cell in each of its four corners. The rest of the board is tiled by taking

advantage of what Solomon Golomb named a rep-tile—a tile that can form an enlarged

replica of itself. The top left 2 � 2 square rotates to put its missing cell in four places, and

the entire order-4 square rotates to carry the missing cell to any of its sixteen places.

Figure 17.2. Orders 2 and 4.

In 1953 Golomb, the “father” of polyominoes (he named them and was the first to study

them in depth) discovered a beautiful proof by induction that all boards with sides in the

doubling sequence 2; 4; 8; 16; : : : could be tiled with trominoes when any cell is missing.

The proof was first published in [3]. It is repeated on pages 27–28 of [4]. Numerous math-

ematicians have since included the proof in their books, often without credit to Golomb.

Roger Nelsen, in [6], gives Golomb’s proof with a wordless single diagram.

Golomb’s famous proof starts with the 2�2 case shown on the left of Figure 17.3. This

square is placed in the corner of the order-4 as shown at the center of Figure 17.3. The

4 � 4 then goes in the corner of an order-8 (shown on the right) and a tromino placed at the

corner of the shaded order-4. We know the dark square can be tiled with any cell missing,

and we know the three unshaded quadrants can be tiled with trominoes because each has a

i

i

“master” — 2012/11/15 — 11:23 — page 129 — #143i

i

i

i

i

i

17. L-Tromino Tiling of Mutilated Chessboards 129

Figure 17.3. Golomb’s induction proof.

missing corner cell. By rotating the board, a missing cell at any spot in the shaded quadrant

can be brought to any spot on the order-8 board.

Orders 5 and 7

The order-5 board is next, as 5 is the next unsolved number in the sequence .�/. It has a

neat symmetrical tiling when the center cell is gone, as shown in Figure 17.4, left. I have

tiled this board with four 2 � 3 tiles. Each is tilable with two trominoes in two different

ways. Using 2 � 3 tiles is a valuable device for solving tromino problems.

When the missing cell is the one shown black in Figure 17.4, center, the cell above it

must be covered by a tromino on either side. In each case, shown here with a tromino above

and on the right, this produces two cells (numbered 1 and 2) that cannot be covered with

a tile. Indeed, the order-5 square can be tiled only when the missing cell is one of the nine

shown in black in Figure 17.4 right. As a pleasant exercise, see if you can tile the board

when the missing cell is at a corner.

1

2

Figure 17.4. The order-5 square.

The order 7 board is more difficult to analyze. I was unable to find a single diagram

that would prove this board tilable, but Golomb sent me his unpublished way of proving

tilability with the aid of three diagrams.

His proof goes like this. Figure 17.5 shows three tilings of the order-7 board. In each

tiling, the 2 � 2 square obviously can be tiled with a tromino so that the missing cell is at

any of the four corners. By rotating the three patterns, the missing cell can be placed at any

spot on the board.

i

i

“master” — 2012/11/15 — 11:23 — page 130 — #144i

i

i

i

i

i


Figure 17.5. Golomb’s proof that order-7 is tilable.

Somewhat more difficult is to find tilings that maximize the number of 2 � 3 tiles. As a

challenge, can the reader find a tiling of the 7�7 board using six 2�3 tiles and 4 trominoes

(see Figure 17.6)? The solution is unique except for a single reflection (see page 134).

?

Figure 17.6. A challenge to the reader.

Note in Figure 17.5, that in each pattern the number of free trominoes—trominoes not

in any 2 � 3 tile—is always even. This is no coincidence. It led me to the following trivial

little law. When a board’s order is even, the number of free trominoes in a tiling pattern is

odd, and vice versa. When the board’s number is odd, the number of free trominoes must

be even.

The parity proof is simple. If a board’s order is even, after a cell is removed there will be

.n2 � 1/=3 trominoes in any tiling, an odd number. Each 2 � 3 tile contains two trominoes,

so the total number of trominoes in 2 � 3 tiles will be even. Subtracting this number from

the odd total of trominoes and you get an odd number of trominoes not in any 2 � 3 tiles.

Suppose the board’s order is odd. After a cell is removed there will remain an even

number of cells. Subtracting the even number of trominoes in the 2 � 3 tiles leaves an even

number of trominoes not in a 2 � 3 tile.

Beyond 7

Golomb’s induction proof can be applied to an infinity of other doubling sequences. In

particular, now that we have tiled the 7 � 7 board, we can tile boards of size n � n where n

is of the form 2k7. For example, consider the order-14 board. Divide it into quadrants with

a shaded order-7 board in the top left corner, and attach a tromino to its lower right corner

i

i

“master” — 2012/11/15 — 11:23 — page 131 — #145i

i

i

i

i

i


as before. Because the 7-board is tilable, the proof for order-14 follows, and of course leads

by induction to proofs for orders 28; 56; 112; : : : :

A similar proof for the order-10 board can’t be obtained by placing an order-5 in the

corner because order-5 is not tilable, but we can handle it in a slightly different way. Put

in the top left corner an order-8 which we know is tilable. The remaining area forms a

path of width 2 along the bottom and right sides of the large square (see Figure 17.7). By

rotations and reflections, each missing cell in the order-8 can be transferred to any cell on

the board. This leads to proofs for orders 20, 40, 80, and so on. A similar proof for order-11

has an order-7 square in the corner, and a path of width 4 along bottom and side. It leads by

induction to solutions for orders 22; 44; 88; : : : : Clearly this technique provides an infinity

of doubling sequences for tilable boards. Simply, put in the top left corner of any board a

tilable board with a side equal to or smaller than the larger board. If you can tile the path it

leaves at the bottom and side, then the board is tilable.

8x8

Figure 17.7. Proof that order-10 is tilable.

Boards with sides that are primes are usually the hardest to tile. Order 17 is solved by

a corner square of side 13 and a path of width 4. Order 19 is solved by a corner square of

order 14, in turn based on order-7, and a path of width 5. (See Figure 17.8.)

The complete result

By working with these patterns I came close, but not close enough, to finding an induction

proof that all deficient squares are tilable except for order-5. A proof was finally obtained

by I. Ping Chu and Richard Johnsonbaugh [1].

Chu and Johnsonbaugh not only took care of all deficient squares, but also all defi-

cient rectangles! Their induction proof is too technical to repeat here. To summarize, they

showed tilability for all m by n rectangles (including squares when m D n) which have a

number of cells that is a multiple of 3 after a cell is removed. Such boards are tilable if and

only if all of the following are true:

1. m is equal to or greater than 2.

2. n is equal to or greater than m.

i

i

“master” — 2012/11/15 — 11:23 — page 132 — #146i

i

i

i

i

i


14

Figure 17.8. Order-19 is tilable.

3. if m is 2, n must be 2,

4. m is not 5.

A 4 � 7 rectangle is the smallest deficient rectangle, not a square, that is tilable with

L-trominoes. As another exercise, see how long it takes you to tile it when the missing cell

is at a corner, and there are two 2 � 3 tiles.

Christopher Jensen, in an unpublished paper, showed that if two cells are taken from

a corner of any board, as shown in Figure 17.9, the board obviously cannot be tiled with

trominoes. However, if none of these five cases is allowed, a 3m � 1 by 3n C 1 board, with

any two cells missing can be tiled if an only if n D 1 or m and n are each equal to or

greater than 3.

A final word

Kate Jones, who founded and runs Kadon Enterprises, a firm that makes and sells hand-

some mechanical puzzles, games, and other recreational math items, has on the market a

game called Vee-21 [5]. The Vee is for V-trominoes, and 21 for the 21 tromino tiles in the

set. The trominoes are brightly colored, and there is an order-8 board on which to place

them. The basic task is to put a monomino (order-1 tile) at any spot on the board, then

cover the remaining 63 cells with the trominoes, thus solving an order-8 board. A 40-page

brochure comes with the set. It contains a short article on “The Deficient Checkerboard”

i

i

“master” — 2012/11/15 — 11:23 — page 133 — #147i

i

i

i

i

i


Figure 17.9. Impossible tiling patterns when two cells are missing at a corner.

by Norton Starr, and pictures of rectangular fields that offer other challenges.

Our final tiling (see Figure 17.10) is a beautiful, symmetric tiling of the standard chess-

board.

Figure 17.10. An order-8 tiling with no 2 � 3 tiles and 5 rep-tiles.

Bibliography

[1] I. P. Chu and R. Johnsonbaugh, Tiling deficient boards with trominoes, Math. Mag. 59 (1986)

34–40.

[2] M. Gardner, The Eight Queens and Other Chessboard Diversions, in The Unexpected Hanging

and Other Mathematical Diversions, Univ. of Chicago Press, 1991.

[3] S. W. Golomb, Checker boards and polyominoes, Amer. Math. Monthly 61 (1954) 675–682.

[4] ——, Polyominoes, Scribner, New York, 1965.

[5] K. Jones, Vee-21; available at www.gamepuzzles.com/polycub2.htm#V21.

[6] R. Nelsen, Proofs Without Words II: More Exercises in Visual Thinking, Mathematical Associa-

tion of America, Washington, 2000.

www.gamepuzzles.com/polycub2.htm#V21

http://www.gamepuzzles.com/polycub2.htm#V21

i

i

“master” — 2012/11/15 — 11:23 — page 134 — #148i

i

i

i

i

i


Solution to the puzzle on page 130

i

i

“master” — 2012/11/15 — 11:23 — page 135 — #149i

i

i

i

i

i

18Polyomino Dissections

Tiina Hohn and Andy Liu

The damaged patchwork quilt

Those of us fortunate to have been born before Martin Gardner’s retirement can relate

to the excitement of eagerly awaiting his monthly column Mathematical Games in the

magazine Scientific American. Particularly popular were those containing a selection of

short puzzles. Here is one from the very last column of this kind, for April, 1981.

The patchwork quilt in Figure 18.1 was originally made up of 108 unit squares. Part of

the quilt’s center became worn, making it necessary to remove 8 squares as indicated. Cut

the quilt along the lines into just two parts that can be sewn together to make a 10�10 quilt.

Figure 18.1.

This beautiful problem is rather old, and had appeared earlier in [1] and [10, p. 64–65].

Gardner’s version was later anthologized in [7, pp. 249–250] and [8, p. 168]. We give the

readers a chance to solve this problem on their own now, before we return to it later.

The father of the field of geometric dissections is generally acknowledged to be Harry

Lindgren who wrote the fundamental treatise [12]. The torch has now been passed on

to Greg Frederickson, with three outstanding books so far, [2]–[4]. He was motivated by

Lindgren’s book, which might not have been written without the influence of Gardner’s

November 1961 column, later anthologized in [5, pp. 43–51].

Martin Gardner devoted another column in July 1977 to geometric dissections, titled

“Cutting Shapes into N Congruent Parts.” This is later anthologized in [6, pages 165–

Reprinted from The College Mathematics Journal, Vol. 43, No. 1 (January 2012), pp. 88–94.

135

i

i

“master” — 2012/11/15 — 11:23 — page 136 — #150i

i

i

i

i

i


181]. About half of the problems share a common feature with the Damaged Patchwork

Quilt Problem, in that the figures to be dissected, as well as the pieces resulting from the

dissections, are polyominoes, plane figures consisting of unit squares joined edge to edge.

They have a long history. Solomon Golomb was the first to popularize them, and Gardner

wrote several columns about them.

One of Gardner’s passions was to introduce puzzles into the classroom. From a the-

oretical point of view, polyomino dissections are an excellent topic. They require little

background, and provide training in geometric visualization. The most important reason of

all is that they are fun.

However, there are practical difficulties. How we should we present such problems, say

the Damaged Patchwork Quilt Problem? Providing lots of scissors and pre-cut rectangle

with holes leaves a lot to be desired. Nobody would look forward to the preparation, and

mixing kids and scissors is never a good idea. Besides, there will eventually be a ton of

litter on the floor.

Practical objections aside, there is a more important shortcoming of this model. It does

not help anyone find a solution to the problem! Cutting at random will not work. In fact,

one has to have the solution already in mind before any meaningful cutting can be done.

Preliminary work

How do we solve the Damaged Patchwork Quilt problem? Since it appears quite difficult,

we “downsize” it. This also yields the benefit of providing subsidiary problems at lower

levels. We generalize the 9 � 12 rectangle with a 1 � 8 hole in the middle to a .2n � 1/ �.2n C 2/ rectangle with a 1 � .2n � 2/ hole, and the 10 � 10 square to an 2n � 2n square.

The original problem is the case n D 5.

For n D 1, we have a 1 � 4 rectangle with a 1 � 0 hole in the middle and a 2 � 2

square. This is going too far down. The case n D 2 turns out to be quite easy. Since the

top row of the rectangle has length 6 while the target square has side 4, we must have a

cut as indicated in Figure 18.2. Another cut in the symmetric position yields the desired

two parts.

-

Figure 18.2.

In the case n D 3, we also have to make a cut along the top edge of the 5 � 8 rectangle

at the point 6 units from a corner. However, if we cut through to the hole as we did in the

case n D 2, it will not work. After a bit of fiddling around, we may arrive at the solution

in Figure 18.3.

From these preliminary investigations, it is not unreasonable to assume that the two

parts in the solution are congruent to each other and rotationally symmetric. However, we

are certainly not confident at this point of solving the original problem.

i

i

“master” — 2012/11/15 — 11:23 — page 137 — #151i

i

i

i

i

i

18. Polyomino Dissections 137

-

Figure 18.3.

Eureka!

After much deep thought, we finally came up with a fantastic model. Not only does it solve

the practical problems mentioned before, it actually leads us step by step to a solution. We

illustrate with the case n D 4. (In [8], there is an erroneous statement that there are no

two-piece solutions to this case. That statement is removed in [7].)

We draw two playing boards, one the initial 7 � 10 rectangle with a 1 � 6 hole in the

middle, and the other the target 8 � 8 square, and provide ourselves with a large supply of

bingo chips of two different colors, say yellow and blue. The objective is to fill both boards

with chips in such a way that those of the same color form the same shape on both boards.

This solves the problem.

We begin with a yellow chip at the top left corner of the square board and a blue chip

at the bottom right corner. This signifies that these two squares do not belong to the same

part, a most reasonable assumption. We define these two corners as the principal corners

of the respective parts. We mark them in the same orientation since we anticipate that the

two parts are in rotational symmetry. See Figure 18.4a.

-?

� 6

e

u

-?

� 6e

e

e

e

e

e

b

e

e

e

e

e

e

e

u

u

u

u

u

u

u

r

u

u

u

u

u

u

(a) (b)

b

b

b

b

b

b

b

b

b

b

b

b

b

b e e e e e e

r

r

r

r

r

r

r

r

r

r

r

r

r

r

u u u u u ub

b

b

b

b

b

b

b

b

b

b

b

b

b b b b b b b

e e e e

r

r

r

r

r

r

r

r

r

r

r

r

r

r

r r r r r r

u u u u

(c) (d)

Figure 18.4.

i

i

“master” — 2012/11/15 — 11:23 — page 138 — #152i

i

i

i

i

i


We now move over to the punctured rectangular board and mark the principal corners.

Besides playing the chips at the corners, we observe that in the square board, the yellow

chips cannot extend beyond the eighth column from the left while the blue chips cannot

extend beyond the eight column from the right. This allows us to place the additional chips

on the rectangular board. See Figure 18.4b.

Moving back to the square board, we can fill the top row with yellow chips and the

bottom row with blue ones, because here, the yellow chips cannot extend beyond the sev-

enth row from the top and the blue chips cannot extend beyond the seventh row from the

bottom. See Figure 18.4c. Moving over to the rectangular board, we notice that the two

blue chips on the seventh row from the bottom cannot extend at all to the left. This means

that we must have six yellow chips on the second row from the top, going from left to

right. Similarly, there must be six blue chips on the second row from the bottom, going

from right to left. See Figure 18.4d. Continuing with the same strategy, we can complete

the solution of the problem as in Figure 18.5.

b b b b b b b b

b b b b b b

b b

b b

b b

b b

b b

e e

r r r r r r r r

r r r r r r

r r

r r

r r

r r

r r

u u

b b b b b b b b

b b b b b b

b b

b b

b b

b b b b

b b

e e

r r r r r r r r

r r r r r r

r r

r r

r r

r rr r

r r

u u

(a) (b)

e e e e

u u u u

(c) (d)

Figure 18.5.

Onward

This approach may be used to solve related problems. In the above problem, the two parts

we end up with are congruent to each other. Of course, they also have to fit together to form

a square. Since the punctured rectangle has rotational symmetry, there are many trivial di-

visions into two congruent parts. However, if a figure does not have any obvious symmetry,

the task may be quite challenging. Here is an example from [11, p. 37].

There are two natural choices for the pair of principal corners. Since the figure does

not have rotational symmetry, we choose opposite orientations for the corners. In the first

attempt as shown in Figure 18.7a, the boundary squares are easily filled out. We observe

i

i

“master” — 2012/11/15 — 11:23 — page 139 — #153i

i

i

i

i

i


Figure 18.6. Dissect into two congruent pieces.

that in the third row from the bottom, we can have at most two blue chips going from

right to left. This means that in the third column from the left, we must have at least four

blue chips going from bottom to top. This in turn means that in the second row from the

bottom, we must have at least four yellow chips going from left to right. However, this is

impossible as the blue chips just placed are in the way. The other choice leads easily to the

solution shown in Figure 18.7b.

-?

�6

b

b

b

b

b

b b b b b b

r r r r r r

r

r

r

r

r

u

u

u

�?

-6

(a) (b)

Figure 18.7.

Many polyomino dissection problems have appeared in recent years, no doubt influ-

enced by Gardner’s columns. (Serhiy Grabarchuk dedicated his book [9] to Martin Gard-

ner.) We offer the readers a choice selection here.

1. Solve the “Damaged Patchwork Quilt” problem.

2. Dissect each of the following figures into two congruent pieces. The first is taken

from [13] and the last two from [6].

3. The patchwork quilt in the diagram below was originally made up of 35 unit squares.

Part of the quilt’s became worn, making it necessary to remove 5 of the squares as

indicated, Cut the quilt along the lines into just two parts that can be sewn together

to make a 5 � 6 quilt. This is taken from [13].

i

i

“master” — 2012/11/15 — 11:23 — page 140 — #154i

i

i

i

i

i


4. Dissect each of the following figures into two congruent pieces. The first two are

taken from [6] and the last four from [11]. The last one is a composition of Solomon

Golomb.

5. Dissect each of the following figures into three congruent pieces. The first one is

taken from [13] and the second from [9].

6. Dissect each of the following figures into two parts that can be reassembled to form

an 8 � 8 square. The first one is taken from [13] and the second one from [9].

A final word

Any trick used twice becomes a technique. The technique discovered by us and described

here is good enough at solving certain polyomino dissection problems that it can be pre-

sented to school children. On October 21, 2010, which would have been Martin Gardner’s

96th birthday, The Damaged Patchwork Quilt was one of Gardner’s problems presented

at a Math Fair by students of Stratford Junior High School at the Telus World of Science

Edmonton. We hope this was something that would please Martin.

i

i

“master” — 2012/11/15 — 11:23 — page 141 — #155i

i

i

i

i

i


Bibliography

[1] H. Dudeney, Puzzles And Curious Problems, Thomas Nelson & Sons, 1931.

[2] G. Frederickson, Dissections: Plane & Fancy, Cambridge Univ. Press, Cambridge, 1997.

[3] ——, Hinged Dissections: Swinging & Twisting, Cambridge Univ. Press, Cambridge, 2002.

[4] ——, Piano-Hinged Dissections: Time To Fold!, A K Peters, Natick MA, 2006.

[5] M. Gardner, The Unexpected Hanging, Univ. of Chicago Press, Chicago, 1991.

[6] ——, Penrose Tiles To Trapdoor Ciphers, Mathematical Association of America, Washington,

1997.

[7] ——, The Last Recreations, Springer-Verlag, New York, 1997.

[8] ——, The Colossal Book Of Short Puzzles and Problems, ed. Dana Richards, W. W. Norton &

Co., New York, 2006.

[9] S. Grabarchuk, The New Puzzle Classics, Sterling Publishing Co., New York, 2005.

[10] B. Kordemsky, The Moscow Puzzles, Dover Publications Inc., New York, 1987.

[11] R. Kurchan, Mesmerizing Math Puzzles, Sterling Publishing Company, New York, 2001.

[12] H. Lindgren, Geometric Dissections. Dover Publications, New York, 1972.

[13] N. Yoshigahara, Geometric dissections, Puzzle World 1 (1992) 48–51.

i

i

“master” — 2012/11/15 — 11:23 — page 142 — #156i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page 143 — #157i

i

i

i

i

i

19Squaring the Plane

Frederick V. Henle and James M. Henle

Introduction

This research was inspired by two lovely pieces of mathematics. The first is the discovery

by William T. Tutte, A. H. Stone, R. L. Brooks, and C. A. B. Smith of squares with integral

sides that can be tiled by smaller squares with integral sides, no two alike. Tutte tells the

story in “Squaring the Square,” a beautifully written article that conveys vividly the excite-

ment of mathematical research [9]. It became widely-read in 1958 when it was reprinted

in Martin Gardner’s Mathematical Games column in Scientific American. It undoubtedly

played a role in inspiring many to become mathematicians.

The second piece is the well-known tiling of the plane by squares whose sides are

the Fibonacci numbers (Figure 19.2).� The tiling is elegant, but in light of Tutte’s work,

possesses a minor flaw—it contains two squares of the same side. The flaw is easily reme-

died. We can use the squared square in Figure 19.1, together with a square of side 175

(Figure 19.3). This works, but some of the elegance is lost.

64

33 3543

31 29

8

51

56

3830

81

18 20

55

16 149

5539

Figure 19.1. 175� 175 perfect square.

Reprinted from The American Mathematical Monthly, Vol. 115, No. 1 (Jan. 2008), pp. 3–12.�Coincidentally, this tiling appears in the same volume of Gardner’s columns as “Squaring the Square” [1].

143

i

i

“master” — 2012/11/15 — 11:23 — page 144 — #158i

i

i

i

i

i


1 1

23

5

8

13

21

34

55

Figure 19.2. Fibonacci tiling.

175

350525

875

1400

2275

3675

5950

9625

Figure 19.3. Fibonacci rectangle seeded with Tutte’s

perfect square.

In looking for a natural tiling that doesn’t repeat squares, one quickly discovers that

the first five squares fit together neatly (Figure 19.4) but it becomes progressively more

difficult to add consecutive squares without overlapping or leaving gaps. This suggests the

following question: Can the squares with whole-number sides, one of each size, be fitted

together to tile the plane? The answer is that they can. In succeeding sections we will prove

this, discuss the history of the problem, and pose a number of questions.

1

23

45

Figure 19.4. The first five squares.

Squaring the plane

We are going to show that the plane admits a tiling that uses exactly one square of side-

length n (n D 1; 2; : : :). Our approach focuses on rectangles and ells. By “ell,” we mean

any six-sided figure whose sides are parallel to the coordinate axes.

Figure 19.5. An ell.

The following extends Tutte’s use of “perfect” for rectangles and squares.

Definition 1. A figure is perfect if it is composed entirely of squares of different sides.

All the remaining figures in this paper will be perfect. The key to our result is Lemma 9,

which states that given any perfect ell, it is possible to add squares to it to form a perfect

rectangle.

i

i

“master” — 2012/11/15 — 11:23 — page 145 — #159i

i

i

i

i

i

19. Squaring the Plane 145

5

2

Figure 19.6. A perfect ell.

5

7 9

16

28

33

25

36

Figure 19.7. The ell of Figure 19.6 squared up.

When we add squares to a perfect figure, keeping it perfect, we’ll say we are “puffing

it up.” When we puff an ell up to form a perfect rectangle, we’ll say we are “squaring up

the ell.” We can then “square the plane” as follows:

1. Start with any perfect ell and square it up.

2. Create a new ell by attaching to the rectangle the smallest square not yet used.

3. Square this ell up, making sure that new squares are added in all four directions.

4. Repeat steps 2 and 3 ad infinitum.

Definition 2. An ell in standard position is an ell oriented so that the single reflex angle

is at the upper right. We refer to the sides by the uppercase letters in Figure 19.8 and their

lengths by the corresponding lower case letters.

a

b

c

d

e

f

A

B

C

DE

F„ ƒ‚ …

8

ˆˆˆ<

ˆˆˆ:

)

9

>=

>;

‚ …„ ƒ‚ …„ ƒ

Figure 19.8. An ell in standard position.

When we add squares to a figure, we always do so in such a way that an added square’s

side-length matches that of at least one side of the figure against which it is placed. That

means that the only squares we add to an ell in standard position are squares adjacent to

sides A, B , C , D, E , or F of the appropriate lengths. We call these operations A, B, C, D,

E, and F. So, for example, applying B then A to an ell produces a larger ell (Figure 19.9).

We denote sequences of operations by sequences of letters with the understanding that they

are applied from left to right (so the combination of operations in Figure 19.9 would be

written BA).

i

i

“master” — 2012/11/15 — 11:23 — page 146 — #160i

i

i

i

i

i


B

A

Figure 19.9. An application of BA to the ell of Figure 19.8.

Note that in any ell in standard position, the length of side A is the sum of the lengths

of sides C and E ,

a D c C e;

and the length of side F is the sum of the lengths of sides B and D,

f D b C d:

Thus, to describe the dimensions of such an ell, we need describe only the lengths of sides

B , C , D, and E .

Definition 3. A 4-tuple hb; c; d; ei describes the dimensions of an ell if the sides B , C , D,

and E of the ell are b, c, d , and e, respectively.

Definition 4. A side of an ell is composite if it is not the side of a single square in the ell.

In squaring up an ell, we make particular use of the operations B, F, and ED. For that

reason, the following definition is most useful:

Definition 5. A perfect ell is regular if each of the moves B, F, and ED either results in a

perfect ell in standard position or results in a perfect rectangle.

Figure 19.10 shows the smallest regular ell.

31

4

7

10

17

Figure 19.10. The smallest regular ell has dimensions h27; 12; 1; 5i.

If an ell with dimensions hb; c; d; ei is regular, then there can be no squares with sides

b, f , e, or d Ce in the ell. In particular, sides B , F , and E must be composite. In addition,

it must be that c � d C e (or else move ED would create an ell that is not in standard

position). These properties—no squares of sides b, f , e, or d C e, and c � d C e—are

both necessary and sufficient for regularity.

Lemma 5. Every perfect ell P in standard position can be puffed up to form a regular ell

without increasing the length of side D.

i

i

“master” — 2012/11/15 — 11:23 — page 147 — #161i

i

i

i

i

i


Proof. Let the dimensions of P be hb; c; d; ei. In every perfect ell in standard position,

either A or F is the longest side, and whichever side is longest is necessarily composite.

We may assume that a � f , since if not, then side A is the longest side and composite, so

A can be performed, and in the new ell the length of side A is less than the length of side

F (Figure 19.11).

A

Figure 19.11. A makes a � f .

Since a � f , side F is the longest side and is composite, so F can be performed.

This allows us to assume further that c > d . If not, we can perform FABA, and in the

new ell, the length of side C is greater than the length of side D (Figure 19.12). Note that

FABA does indeed puff up the ell—the first square, added by F, is larger than any in the

ell, and each new square is larger than the one before. Note also that the length of side D

is unaffected.

F

A

B

A

Figure 19.12. The operation FABA.

With our assumptions now (f � a, c > d ), we perform FABA (Figure 19.13). Let

hb0; c0; d; e0i be the dimensions of the ell formed after FABA (d , since FABA doesn’t

affect side D). It should be clear that performing either F or B to this new ell would result

in a perfect ell in standard position.

To check move ED, note first that move E adds a square of side e0 D b C d C e. This

is larger than b C d but smaller than b C c C d C e, hence a square not previously used.

Applying D after E adds a square of side d C e0 D b C 2d C e. This is larger than the

square just added, but again less than b C c C d C e, since c > d . The result then, is a

perfect ell. To show that the ell is either a rectangle or in standard position, we need only

the fact that c0 � d C e0. But c0 D 2b C 2c C d C e and e0 D b C d C e, and using c > d ,

we have c0 D b C 2c C e0 > d C e0.

Key to proving our main lemma (Lemma 9) is an analysis of c � e modulo d . For a

regular ell c � d C e. Consequently, we can express c as kd C e C i where k � 1 and

0 � i < d .

i

i

“master” — 2012/11/15 — 11:23 — page 148 — #162i

i

i

i

i

i


c + e

b

b + db + c + d + e

b 0 = 5b +3c +3d +3e

c 0 = 2b + 2c + d + e

d

e0 = b + d + e

8

ˆ<

ˆ:

n‚ …„ ƒ

„ ƒ‚ …

9

>=

>;

9

>>>>>>>>>>=

>>>>>>>>>>;

‚ …„ ƒ

Figure 19.13. New dimensions after FABA.

Lemma 6. If a regular ell has dimensions hb; c; d; ei such that c D kd CeC i , with k � 2

and 0 � i < d , then the sequence of moves BFA produces a regular ell with dimensions

hb0; c0; d; e0i, where c0 D .k � 1/d C e0 C i .

Proof. Performing BFA (Figure 19.14) produces an ell with dimensions

hb0; c0; d; e0i D h3b C c C d C e; b C c; d; b C d C ei:

Computing, we obtain

c0 D b C c D b C kd C e C i D .k � 1/d C b C d C e C i D .k � 1/d C e0 C i:

To see that the new ell is regular, first note that a move of B or F would result in a perfect

ell in standard position.

c + ec

e

B

b

d

F

b + d

A2b + c + d + e

(

8

ˆˆˆˆˆˆˆˆ

<

ˆˆˆˆˆˆˆˆ

:

‚ …„ ƒ

„ ƒ‚ …

o

Figure 19.14. The operation BFA.

To see that ED would add only new squares, observe that BFA added squares of sides

f D bCd , b, and a0 D 2bCcCd Ce. Move ED would add squares of sides e0 D bCd Ce

and d C e0 D b C 2d C e. On the other hand, we have

2b C c C d C e > b C 2d C e > b C d C e > b C d > b;

the first inequality following from the fact that c > d .

All that remains to show that the ell is regular is that c0 � d C e0. This follows from

c0 D .k � 1/d C e0 C i , where k � 2.

i

i

“master” — 2012/11/15 — 11:23 — page 149 — #163i

i

i

i

i

i


Lemma 7. If a regular ell with dimensions hb; c; d; ei is such that c D kd C e for some

positive integer k, then the ell can be squared up.

Proof. We can apply Lemma 6 until we have a regular ell whose dimensions

hb0; c0; d; e0i satisfy c0 D d C e0. This ell can then be squared up with ED.

Lemma 8. Every regular ell can be squared up.

Proof. Given a regular ell with dimensions hb; c; d; ei, we have c D kd C e C i with

k � 1 and 0 � i < d . We do induction on d to show that the ell can be squared up.

If d D 1, then i D 0 and we are done by Lemma 7. Now assume that the result

holds for any ell with dimensions hb�; c�; d �; e�i such that d � < d . We apply Lemma 6

iteratively until we have an ell with dimensions hb0; c0; d; e0i with c0 D d C e0 C i . Then

ED produces an ell with dimensions hb0; i; d C e0; d C 2e0i (Figure 19.15). By flipping

this over we obtain in standard position an ell with dimensions h2e0 C d; e0 C d; i; b0i(Figure 19.16).

E

D

i

Figure 19.15. The operation ED.

E

D

i

Figure 19.16. A flip after ED.

By Lemma 5 we can puff this up to a regular ell without increasing the length of side

D (which is i ). Since 0 � i < d , by the inductive hypothesis this flipped ell can be

squared up. Clearly, the unflipped ell can also be squared up, completing the proof of the

lemma.

Our main lemma follows immediately:

Lemma 9. Every perfect ell can be squared up.

Proof. Given an ell, we puff it up to a regular ell (Lemma 5), then square it up (Lemma 8).

In other words, every perfect ell is a part of a perfect rectangle. We have now completed

the spadework necessary to answer the tiling question raised in the introduction:

Theorem. It is possible to tile the plane with nonoverlapping squares using exactly one

square of each integral side.

i

i

“master” — 2012/11/15 — 11:23 — page 150 — #164i

i

i

i

i

i


Proof. It is easy to see how to carry out the plan described earlier. We start with any perfect

ell and grow it by adding squares as described in the algorithm whose steps we presented

in Section 2.

Note that we don’t move squares around; once added to the figure they are fixed. In par-

ticular, we never actually flip the figure as in the proof of Lemma 9—the flipping described

there is simply to demonstrate that we can square up a particular ell. Note also that we are

guaranteed to fill the plane since we are careful to grow the figure in all four directions in

each cycle of the algorithm.

Finally, for any n the square of side n will be added to the figure by or before the nth

time we perform step 2 of the algorithm. Since we perform step 2 infinitely many times,

we are guaranteed to incorporate in the tiling a square of each integral side.

Reflections

In preparing this paper, we learned some of the history of the problem. The question was

first posed by Solomon Golomb in a 1975 article in the Journal of Recreational Mathemat-

ics. He called it the “heterogeneous tiling conjecture” and challenged readers to prove or

disprove it.

Martin Gardner wrote about the question four years later in his column in Scientific

American, later anthologized in [2]. He described one approach to a solution, a fairly or-

derly tiling of roughly three-quarters of the plane with squares and reported that Verner

Hoggatt Jr., then editor of The Fibonacci Quarterly, had shown that no square in the tiling

appeared more than once.

Grunbaum and Shepard wrote about the problem in their 1987 book Tilings and Pat-

terns [4]. They described there a second way in which a squared square S can generate a

tiling of the plane (in addition to the method indicated in the present paper): Take a second

copy of S and expand it to a square S1 such that the smallest square in S1 is the size of the

original square S and fit S into that square. Take another copy of S and expand it to S2

so that its smallest square is the size of S1, and so on. Grunbaum and Shepard record the

observation of Carl Pomerance that in every tiling of the plane by unique squares known

at that time, the sides of the squares grow exponentially.

In 1997, Karl Scherer [6] succeeded in tiling the plane using multiple copies of squares

of all integral sides. The number t.n/ of squares of side n is finite but not bounded. He

describes his tiling as “size-alternating,” in that no two squares of the same side share any

portion of an edge (though they may share a corner).

Questions

Tiling is an enormous field. This theorem might be said to reside in the subfield that con-

cerns infinite tilings of the plane that use exactly one specimen each of a well-defined

collection of similar figures. Much work has been done here and many questions remain:

Efficiency The algorithm presented in this paper is extravagant in that the ratio of the

largest square used so far to the smallest square not yet used rapidly diverges. The proce-

i

i

“master” — 2012/11/15 — 11:23 — page 151 — #165i

i

i

i

i

i


dure for squaring up, for example, when applied to the smallest possible ell, a 2 � 2 square

next to a 1 � 1 square, ends in a rectangle with dimensions 1106481365205154721693 �2659648557852203795117. The smallest square not used at this point is 4 � 4.

The squaring-up procedure can certainly be improved. By way of illustration, take

the ell in Figure 19.6. This can be squared up to a 69 � 61 rectangle with the sequence

of operations FEFEABC (Figure 19.7). Our procedure, however, doesn’t square the ell up

until a rectangle is reached with dimensions approximately .5:0�1014272/�.5:8�1014272/.

Can our squaring-up procedure be improved in some well-defined way? Does an algo-

rithm exist for tiling the plane that methodically expands a connected island of squares in

such a way that the ratio of the largest square used to the smallest not yet used is bounded

by a polynomial?

Simple tilings A perfect figure is simple if it contains no perfect subrectangle. Our tiling

is far from simple. Is there a simple tiling of the plane using one specimen of each integral

side?

The half-plane and quarter plane Can the half-plane be squared? Can the quarter-plane

be squared? We are especially interested in this question because if it is possible to tile a

quarter plane four times using, altogether, every integral square just once, then it’s possible

to tile the plane using all the integral squares plus one square of any given side (say � ).

¼

Figure 19.17. � � � .

Rational squares The algorithm of our theorem can easily be used to tile the plane using

one square of every rational side. As with integral squares, we don’t know if a similar

procedure works for the half- or quarter-plane.

Positive and negative squares We can tile the plane with squares whose sides are natural

numbers and with squares whose sides are rationals. What about squares whose sides are

(positive and negative) integers? We interpret the effect of placing a small negative square

on top of a large positive one as removing a part of the large square. Once again, our

algorithm works easily for this. Just as placing a positive square next to a rectangle creates

an ell, so does placing a negative square on a corner of a rectangle. With care, no point of

the plane will be touching more than three squares (one negative, two positive).

Odd squares Can the plane be squared with all the odd squares? This seems unlikely to

us. Can the plane be squared with some of the odd squares? In general, what well-defined

subsets of the natural numbers can be used to tile the plane?

i

i

“master” — 2012/11/15 — 11:23 — page 152 — #166i

i

i

i

i

i


Coloring Neither our tiling nor the Fibonacci tiling can be 3-colored. Is there a 3-colorable

tiling of the plane using exactly one square of each integral side? Is there a simple algo-

rithm for 4-coloring the tiling described in this paper?

Space Can space be cubed?

Triangles Scherer proved that the plane cannot be tiled with equilateral triangles of dif-

ferent sizes if one triangle is smallest [7]. He has found a way of tiling the plane, however,

with different sizes of iscoceles right triangles and with enlargements of certain other tri-

angles [8]. Pomerance proved that it is possible to tile the plane with one rational triangle

of each congruence class such that any two neighboring triangles share either an entire

side or just a vertex [5]. Left open is the question: Can the plane be tiled with all rational

equilateral triangles so that no triangle has an infinite number of neighbors?

Acknowledgments We would like to thank Joe O’Rourke for his encouragement, sug-

gestions, and enthusiasm. We are also grateful for the improvements to the paper suggested

by the anonymous referees.

Bibliography

[1] M. Gardner, The Second Scientific American Book of Mathematical Puzzles & Diversions: A

New Selection, Simon and Schuster, New York, 1961.

[2] ——, Fractal Music, Hypercards and More . . . , W. H. Freeman, New York, 1992.

[3] S. W. Golomb, The heterogeneous tiling conjecture, J. Recreat. Math. 8 (1975) 138–139.

[4] B. Grunbaum and G. C. Shephard, Tilings and Patterns, W. H. Freeman, New York, 1987.

[5] C. Pomerance, On a tiling problem of R. R. Eggleton, Discrete Math. 18 (1977) 63–70.

[6] K. Scherer, New Mosaics, privately printed, 1997.

[7] ——, The impossibility of a tessellation of the plane into equilateral triangles whose sidelengths

are mutually different, one of them being minimal, Elem. Math. 38 (1984) 1–4.

[8] ——, Nutts and Other Crackers, privately printed, 1994.

[9] W. Tutte, Squaring the square, Canad. J. Math. 2 (1950) 197–209.

i

i

“master” — 2012/11/15 — 11:23 — page 153 — #167i

i

i

i

i

i

20Magic Knight’s Tours

John D. Beasley

In Mathematical Magic Show [4], Martin Gardner looked at the classic problem of the

knight’s tour on a chess board, paying particular attention to tours in which the numbers in

each row and column added to 260. At that time, the question of whether the numbers in

each long diagonal could also be made to add to 260 was still open. Thanks to advances

in computer power since Martin wrote, this question has now been decided, and perhaps

readers will be interested in the complete statement that it is now possible to make.

A brief history of magic knight’s tours

In knight’s tour literature, the term magic is used for all tours in which the rows and

columns add to the same number. Any diagonal or other properties are a bonus. Figure

20.1a is an example. This is a closed tour (the starting and finishing squares are a knight’s

move apart), and it has two-fold rotational symmetry (numbers which are diametrically

opposite differ by 32, so if we draw the lines joining each number to the next, and 64 to 1,

we get a pattern which repeats itself on rotation through 180 degrees). No knight’s tour on

an 8�8 board can be laterally or diagonally symmetric, nor can it have more than two-fold

rotational symmetry, so Figure 20.1a has as much symmetry as we can hope for. We may

notice that the numbers along a row or column are alternately odd and even.

2 11 58 51 30 39 54 15 55 30 17 42 27 36 15 38

59 50 3 12 53 14 31 38 18 43 54 29 16 39 26 35

10 1 52 57 40 29 16 55 31 56 41 20 33 28 37 14

49 60 9 4 13 56 37 32 44 19 32 53 40 13 34 25

64 5 24 45 36 41 28 17 57 2 45 8 21 64 51 12

23 48 61 8 25 20 33 42 46 5 60 1 52 9 24 63

6 63 46 21 44 35 18 27 3 58 7 48 61 22 11 50

47 22 7 62 19 26 43 34 6 47 4 59 10 49 62 23

(a) Wenzelides, 1849 (b) Jaenisch, 1859

Figure 20.1. Two magic knight’s tours, with starting and finishing squares in bold.

Reprinted from The College Mathematics Journal, Vol. 43, No. 1 (January 2012), pp. 72–75.

153

i

i

“master” — 2012/11/15 — 11:23 — page 154 — #168i

i

i

i

i

i


The first magic knight’s tour to appear in print was published by William Beverley in

1848, but this was an open tour (there was no knight’s move from the final square back

to the first square). Figure 20.1a, which was found by Karl Wenzelides and appeared in

Schachzeitung in 1849, was the first closed magic knight’s tour [12, 18]. Several more

closed magic tours appeared during the next few years, including Figure 20.1b, by C. F.

Jaenisch, also rotationally symmetric, which appeared in The Chess Monthly in 1859 [5]

and was described by its discoverer as “La solution la plus parfaite du probleme du cav-

alier” ([5] is presented bilingually, but the French was the author’s original). Although in

neither of these tours do the long diagonals add to 260, the two long diagonals together do

add to 520, and Figure 20.1b has a further property at which we look in a moment.

By 1940, 126 magic tours had been discovered, 59 of them closed. They were listed in

1951 in an unpublished monograph The Magic Knight’s Tours, a Mathematical Recreation

by H. J. R. Murray [9, 16], but there was no convenient list in print until George Jelliss gave

one in Chessics in 1986 [7, 8]. He accompanied it with a note that with modern computer

methods it should be possible to ascertain whether all such tours had been discovered, and

T. W. Marlow immediately showed that they hadn’t by discovering five more, which he

published in 1987 and 1988 [14, 15]. T. S. Roberts found two more in January 2003 [10].

Still, none had proved to be diagonally magic, and a distributed computing exercise was

set up by Hugues Mackay, Jean-Charles Meyrignac, and Guenter Stertenbrink later in 2003

to put the matter finally to rest. This exercise rediscovered all the magic tours previously

known and added seven new ones, three open and four closed, giving a grand total of 140

(63 closed), none diagonally magic [11, 17]. These conclusions were subsequently verified

by Yann Denef using a program written independently [17].

The complete catalogue is on at least two web sites [11, 17] and in Variant Chess [1, 2].

Each tour can be presented in any of eight orientations and can be numbered from either

end, so these 140 tours give rise to 2240 different arithmetical matrices.

Job done?

Not quite. Only the 8 � 8 case was settled. We can ignore boards of odd side, because the

row sums of a tour on such a board will be alternately odd and even, and so cannot all be

the same. We can also ignore boards of side 4n C 2, since George Jelliss has produced

an elegant and ingenious proof implying that no magic tour on such a board exists [11].

Proofs in this field tend to be either elementary or non-existent, but this is an exception.

Theorem (G. P. Jelliss). A magic knight’s tour on a board of side 4n C 2 is impossible.

Lemma. In a magic knight’s tour, the number of entries of the form 4k C 2 or 4k C 3 in

any row or column, counted together, must be even.

Proof of the lemma. Let h be half the length of the side. The row sum is now

4h3 C h. If we express the numbers in the row in binary, we see that h of them, being

odd, have a non-zero units digit, and adding these digits gives h. Remove these units digits

from the entries, and the numbers represented by the remaining digits must sum to 4h3.

This is a multiple of four and the units digits in the contributing numbers have been re-

moved, so the number of non-zero twos digits in them must be even. But each entry of the

i

i

“master” — 2012/11/15 — 11:23 — page 155 — #169i

i

i

i

i

i

20. Magic Knight’s Tours 155

form 4k C 2 or 4k C 3 contributes one such digit and no entry of the form 4k or 4k C 1

contributes any, so the result follows. A similar argument applies to the columns.

Proof of the theorem. Label the squares of the chessboard thus:

A B A B A B . . .

C D C D C D . . .

A B A B A B . . .

C D C D C D . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

We reflect the tour, if necessary, so that the number in the top left corner is odd; then all

the A and D squares will contain odd numbers, and all the B and C squares even numbers.

Now consider the distribution of the entries of the form 4k C 3. There are 4n2 C 4n C 1

of these, which is an odd number, and they must all be in A or D squares, so either the A

squares contain an odd number and the D squares an even number, or vice versa. Similarly,

either the B squares contain an odd number of entries of the form 4k C 2 and the C squares

an even number, or vice versa.

Now suppose first that the A squares contain an odd number of 4k C 3 entries and the

B squares an odd number of 4k C2 entries; then the C squares will contain an even number

of 4k C 2 entries, and the total number of 4k C 3 and 4k C 2 entries on A and C squares,

counted together, will be odd. But these squares appear only in alternate columns, and by

the lemma each individual column must contain an even number of such entries. This is a

contradiction, so the situation is impossible. A similar argument applies whichever of the

A or D squares contain the odd number of 4k C 3 entries and whichever of the B or C

squares the odd number of 4k C 2 entries.

Thus there can be no magic knight’s tour on a board of side 4n C 2.

So only boards of side 4n need be considered. T. H. Willcocks had discovered an open

magic tour on a 12 � 12 board in which one diagonal added to the magic constant, and in

April 2003 Awani Kumar found four such tours with both diagonals magic [10, 17]. As far

as I know, the existence of closed magic tours on this board with both diagonals also magic

remains an open question, but two such tours on a 16 � 16 board were discovered by H. E.

de Vasa and published in 1962 [3, 11], and others have been discovered since [13, 17].

Return to the 8 � 8 board

In the field of ordinary magic squares, where the numbers are not constrained to form a

knight’s tour, particular attention is paid to pan-diagonal squares, whose numbers add to

the magic constant not only along rows, columns, and principal diagonals, but also along

broken diagonals (in algebraic chess notation, diagonals such as a4-e8/f1-h3). Even while

the possibility of a magic knight’s tour with each long diagonal also adding to 260 was

open, it was realised that no pan-diagonal tour could exist, because the numbers on a diag-

onal are either all odd or all even, and the 32 odd numbers from 1 to 64 don’t add to 4�260;

they add to 4 � 256, and the 32 even numbers to 4 � 264. So while preparing the catalogue

of magic tours for Variant Chess, I looked to see if any had a property which I called

i

i

“master” — 2012/11/15 — 11:23 — page 156 — #170i

i

i

i

i

i


quasi-magic, where the odd diagonals, principal and broken, each add to 256, and the even

diagonals add to 264. Given the inherent disparity between odd and even diagonals, this

seemed to be as close to a truly pan-diagonal magic knight’s tour as we could hope for.

It turned out that none had this property in full, but Figure 20.1b is half-way there. The

principal odd diagonal adds to 256, as do each of the three parallel odd broken diagonals,

and the principal even diagonal and the three parallel even broken diagonals each add

to 264. The same is true of two further tours which Jaenisch published in 1862 [6], but

those tours lack the rotational symmetry of Figure 20.1b. Because of this combination of

properties, therefore, Figure 20.1b is indeed the “most perfect” solution to the problem of

the knight’s tour on the 8 � 8 board.

Now comes a curious question: Was Jaenisch aware of this property of the broken

diagonals in Figure 20.1b, which to my mind adds greatly to the elegance of this tour? I can

find no mention in any of the sources I have examined from the nineteenth to the twenty-

first century. The property will have become obvious as soon as it occurred to somebody

to look for it, and I wrote in 2008 that I could not believe it had remained unspotted until

then. Even so, nobody has yet drawn my attention to an earlier report of its existence.

Perhaps a reader will be able to enlighten me.

Acknowledgment I have been merely the reporter, and in addition to the workers named

above I am grateful to the Bodleian Library for access to The Magic Knight’s Tours, a

Mathematical Recreation and to other Murray papers, to Cambridge University Library

for access to Traite des Applications de l’Analyse Mathematique au Jeu des Echecs, and

to George Jelliss for photocopies of the relevant pages from Schachzeitung and The Chess

Monthly (made for him by Marian Stere and Ken Whyld). I am also grateful to George

Jelliss for constructive comments on my original draft.

Bibliography

[1] J. D. Beasley, Another look at 8 � 8 magic knight’s tours, Variant Chess 57 (2008) 50–53.

[2] ——, Credit where credit is due, Variant Chess 58 (2008) 72.

[3] G. D’Hooghe, Les Secrets du Cavalier, Brepols, Bruxelles, 1962.

[4] M. Gardner, Knights of the square table, in Mathematical Magic Show, George Allen and Un-

win, London, 1977, 188–202, 283.

[5] C. F. Jaenisch, De la solution la plus parfaite du probleme du cavalier, The Chess Monthly

(1859) 110–115, 146–151, 176–179.

[6] ——, Traite des Applications de l’Analyse Mathematique au Jeu des Echecs, vol. 2, St Peters-

burg, 1862.

[7] G. P. Jelliss, Catalogue of 8 � 8 magic knight’s tours, Chessics 26 (1986) 122–128.

[8] ——, Notes on Chessics 26, Chessics 29/30 (1987) 163.

[9] ——, H. J. R. Murray’s history of magic knight’s tours, The Games and Puzzles Journal 14

(1996) 238–244, 15 (1997) 266–267.

[10] ——, Recent advances in magic knight’s tours, Variant Chess 43 (2003) 40–41.

[11] ——, Knight’s tour notes (2001–2005); available at www.mayhematics.com.

[12] D. E. Knuth, letter to G. P. Jelliss, quoted in The Games and Puzzles Journal 14 (1996) 243.

www.mayhematics.com

http://www.mayhematics.com

i

i

“master” — 2012/11/15 — 11:23 — page 157 — #171i

i

i

i

i

i

20. Magic Knight’s Tours 157

[13] A. Kumar, Studies in magic tours of knight on 16�16 board, J. Recreat. Math. 34 (2005–2006)

275–285.

[14] T. W. Marlow, Magic knight tours, The Games and Puzzles Journal 1 (1987) 11.

[15] ——, Magic knight tours, The Problemist 12 (1988) 379.

[16] H. J. R. Murray, The Magic Knight’s Tours, a Mathematical Recreation, manuscript in the

Bodleian Library, shelfmark MS Eng d.2370.

[17] G. Stertenbrink, Computing magic knight’s tours; available at magictour.free.fr.

[18] K. Wenzelides (as “...l .........s in P.......”), Bemerkungen uber den Rosselsprung, Schachzeitung

4 (1849) 41–97.

magictour.free.fr

http://magictour.free.fr

i

i

“master” — 2012/11/15 — 11:23 — page 158 — #172i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page 159 — #173i

i

i

i

i

i

21Some New Results on

Magic Hexagrams

Martin Gardner

Combinatorial problems involving magic squares, stars, and other geometrical structures

often can be solved by brute force computer programs that simply explore all possible

permutations of numbers. When the number of permutations is too large for a feasible run-

ning time, an algorithm can frequently be reduced to manageable time by finding ingenious

shortcuts. Such planning makes computer solving less trivial and much more interesting.

A superb example of such planning was described in a little-known short article in

the Mathematical Gazette (vol. 75, June 1991, pp. 140–142). The authors, Brian Bolt,

Roger Eggleton, and Joe Gilks, posed for the first time a problem based on the pattern

shown in Figure 21.1. It is the traditional hexagram or Star of David with its inner hexagon

divided into six equilateral triangles. Can numbers 1 through 12 be placed inside the twelve

triangles so that each of the six rows, indicated by the arrow, have the same sum?

The authors point out that there are 12Š ways of arranging the numbers. When the

pattern’s six rotations, and six reflections, are excluded, the number reduces to 11Š or

39,916,800. Can this number be further reduced?

Figure 21.1.

Reprinted from The College Mathematics Journal, Vol. 31, No. 4 (September 2000), pp. 274–280.

159

i

i

“master” — 2012/11/15 — 11:23 — page 160 — #174i

i

i

i

i

i


It can. Each of the six rows must have the same sum, and this “magic constant” must

be either odd or even. It is not hard to discover that, by neglecting rotations, reflections,

and complements, there are just three ways that odd and even numbers from 1 to 12 can be

distributed on the hexagram (see Figure 21.2).

A B C

0

0 0

0

0

0 1 1 1

11

1

1 1 1 1

11

0

00

0

0

0

0

00 0

0

0

1

11 1

1

1

Figure 21.2.

A complement, I should explain, is obtained by subtracting each number of a magic

square or star from the pattern’s largest number plus 1. For example, consider the Chinese

lo shun or 3 � 3 magic square in Figure 21.3. If each number is replaced by the remainder

when it is taken from 10, we obtain the complement (Figure 21.4).

8 1 6

3 5 7

4 9 2

Figure 21.3.

2 9 4

7 5 3

6 1 8

Figure 21.4.

Note that the complement in this case is the same as before except for a rotation and

reflection. In the case of magic hexagrams, complements are always different, but are con-

sidered trivial variations. The complements of the three patterns shown in Figure 21.2 are

obtained by replacing each 1 with 0 and each 0 with 1. Each 1 stands for an odd number,

each 0 for an even number. By neglecting complements, as well as rotations and reflec-

tions, the three odd-even patterns provide a way of establishing upper and lower bounds

for the star’s magic constant.

In pattern A all the even numbers go in outside triangles. They add to 42, but because

each number appears in two rows, we double 42 to get 84 as the contribution the even

numbers make to the total of the sums of all six rows.

The odd numbers in pattern A go on interior triangles. They add to 36, but because

each number is in three rows we triple 36 to get 108 as the contribution the odd numbers

make to the total of all six rows.

The sum of all the rows sums is 84 C 108 D 192. There are six rows, so to find the

magic constant for this pattern we divide 192 by 6 to get 32. This is the magic constant

i

i

“master” — 2012/11/15 — 11:23 — page 161 — #175i

i

i

i

i

i

21. Some New Results on Magic Hexagrams 161

for pattern A on the assumption that the numbers can be arranged to make the hexagram

magic. As it turns out, pattern A has no solution.

We turn now to patterns B and C. Each has four odd numbers and two even numbers

on the interior triangles. To make the magic constant as low as possible, we place on these

inside triangles, where each number appears in three rows, the four lowest odd numbers

.1; 3; 5; 7/ and the two lowest even numbers .2; 4/. The sum of the inside numbers is 22.

Tripling it gives 66.

On the outside triangles go odd numbers 9; 11 and even numbers 6; 8; 10; 12. They add

to 56. Each number appears in two rows so we double 56 to get 112. Adding 66 to 112

gives 178.

We know that this total must be a multiple of 6 because six rows must have the same

sum. So we raise 178 by the smallest amount to arrive at a multiple of 6, namely 180.

Dividing 180 by 6 yields 30. This is the lower bound for the magic constants of patterns B

and C.

To obtain the upper bound we put the largest two even numbers .10; 12/ and the four

largest odd numbers .5; 7; 9; 11/ on the interior triangles. They add to 54 and three times

54 is 162. The outside numbers .1; 2; 3; 4; 6; 8/ add to 24 and twice 24 is 48. The sum of

162 and 48 is 210. Dividing 210 by 6 gives 35 as the upper bound for the magic constants

of patterns B and C.

We now have shown that the magic constant, if such a magic hexagram does indeed

exist, must be 30; 31; 32; 33; 34, or 35. Knowing this, and knowing how odd and even

numbers must be distributed to make the hexagrams magic, allowed the three authors to

reduce their computer program to a manageable length. To the authors’ vast surprise, their

computer search produced just one solution! It is shown on the left of Figure 21.5. Its com-

plement (each number taken from 13) is on the right. “When we discovered the beautiful

solution,” the authors write, “we rushed from our baths into the street shouting ‘Eureka!’”

The two patterns make excellent puzzles. Place numbers 1 through 12 on the hexagram so

each row adds to 33, or so each row adds to 32. In both cases the solution is unique. Note

that in the pattern on the left each pair of opposite corners add to 12, and on the right, they

total 14.

1

2 4

10

11

83

57

69

12

510

86

79

12

111

4

3

2

Figure 21.5.

A much older magic hexagram problem, now thoroughly explored, consists of placing

numbers 1 through 12 on the vertices of the traditional Star of David shown in Figure 21.6.

Because each number appears in two rows, the sum of all the row sums is twice the sum of

numbers 1 through 12, or 78�2 D 156. There are six rows, so 156=6 D 26, the hexagram’s

magic constant.

i

i

“master” — 2012/11/15 — 11:23 — page 162 — #176i

i

i

i

i

i


Figure 21.6.

The British puzzlist Henry Ernest Dudeney, in Modern Puzzles (1926) claimed there

are 37 fundamental solutions, or 74 if complements are included. This was one of Du-

deney’s rare mistakes. He missed three basic patterns. Von J. Christian Thiel, in the German

periodical Archimedes (vol. 5, September 1963, pp. 65–72) displays 15 basic solutions. By

applying well known transformations in which numbers are interchanged a certain way,

Thiel raises the total of patterns, not counting rotations and reflections, to 40. These, to-

gether with their complements, make 80 solutions.

Dudeney, in Modern Puzzles, and later in A Puzzle-Mine, identifies six solutions which

have the additional feature that the points of the star also add to 26. They are shown in

Figure 21.7. Each has a complement in which the interior numbers add to 26. Note that on

each hexagram each large triangle has a sum of 13, so together they add to 26, and the sum

of any small triangle is the same as the sum of the small triangle opposite.

8

10

9

5

5

10

6 9 8 3

5 7

2 11 12 1

4

7 10 4 5

6 11

2 12 9 3

1

3 11 10 2

49

1 6 12 7

84

3 12 10 1

8 5

2 6 11 7

4 7 9 6

58

1 11 12 2

3 8

2 7 11 6

10 3

1 9 12 4

Figure 21.7.

i

i

“master” — 2012/11/15 — 11:23 — page 163 — #177i

i

i

i

i

i


29 137

149 157 163 191

193 179

173 167 139 181

15137

73 43 41 47

71 67

53 31 61 59

Figure 21.8.

Akira Hirayama and Gakuho Abe, in Researches in Magic Squares (Osaka, 1983)—the

text is in Japanese—give a slightly different way of arriving at 80 fundamental solutions.

The book also includes Abe’s amazing constructions of two magic hexagons made with

consecutive primes! They are shown in Figure 21.8.

There is now a sizeable literature on stars with more than six points. The five-pointed

star, or pentagram, is easily shown to have no solution for numbers 1 through 10. You’ll

find one such proof in the chapter on magic stars in my Mathematical Carnival. In the

same chapter I explain how the numbers in a solution for the traditional hexagram can be

transferred to the twelve edges of the cube so that four numbers around each face total

26. Because the cube’s “dual” is the octahedron (faces and vertices exchanged), the same

numbers can be placed on the octahedron’s edges so that the four edges around each vertex

add to 26. Figure 21.9 shows how a hexagram solution is transferred to a magic cube and

to a magic octahedron.

10

3

1 11 12 2

8 5

4 7 9 6

7

64 9

5108

1 112

3

1211

121

23

8

105

9

47 6

Figure 21.9.

A third type of hexagram is formed by adding three diagonals to the traditional structure

as shown in Figure 21.10. The question arises: Can numbers 1 through 19 be placed on

the vertices of this pattern so eveiy line of five has the same sum? Harold Reiter, at the

University of North Carolina, Charlotte, posed this as an unsolved problem in his article

“A Magic Pentagram,” in Mathematics Teacher (March 1983, pp. 174–177).

By trial and error I discovered the solution shown; it has 1; 2; 3; 4; 5; 6 on the out-

side points, and a magic constant of 46. Its complement (numbers taken from 20) puts

14; 15; 16; 17; 18; 19 on the outside points, and raises the magic constant to 54.

i

i

“master” — 2012/11/15 — 11:23 — page 164 — #178i

i

i

i

i

i


19

1

24 12 9

10 15

18 13 8

11 16

75 17 14 3

6

Figure 21.10.

Finding all solutions to this problem by a brute force computer algorithm is out of

the question because it would require examining 19Š=12 possible permutations, Reiter and

his associate David Ritchie, in “A Complete Solution To the Magic Hexagram Problem,”

in The College Mathematics Journal (vol. 21, September 1989, pp. 307–316), describe the

clever shortcuts they used to reduce their algorithm to an examination of a mere 18,264,704

patterns. In less than five minutes their Pascal program found 2,190 basic solutions, not

counting rotations, reflections, and complements, or 4,380 solutions if complements are

included. The magic constants range from 46 through 54.

The hexagram pattern in Reiter’s problems is an interesting one. It solves an old tree-

plant puzzle of placing 19 trees to form six straight rows with four trees in each row. I used

this pattern for a checker-like game I named Solomon. It can be purchased from Kadon

Enterprises, 1227 Lorene Drive, Pasadena, Maryland 21122.

Frank Bernhart wrote to point out that there are many symmetric patterns with vertices

and lines that are combinatorially equivalent to Reiter’s hexagram, and therefore have the

same set of solutions. Three are shown in Figure 21.11.

Figure 21.11.

i

i

“master” — 2012/11/15 — 11:23 — page 165 — #179i

i

i

i

i

i


The traditional hexagram obviously contains eight different triangles. The modified

hexagram of the kind shown in Figure 21.1 has 20 triangles. (Many old puzzle books

give this as a problem.) How many different triangles can you find in Reiter’s hexagram?

Counting them all is not so easy. I leave this as a question to be answered in the next issue

(see next page).

i

i

“master” — 2012/11/15 — 11:23 — page 166 — #180i

i

i

i

i

i


In The College Mathematics Journal (Vol. 31, No. 5, page 146), the number of triangles

in Reiter’s hexagram was given as 50. However, the current editors have found 56 distinct

triangles.

19

1

24 12 9

10 15

18 13 8

11 16

75 17 14 3

6

The triangles are grouped by number of regions below with the count and a representative

for each of seven types of congruent triangles.

1 region: 12 triangles congruent to 1; 9; 12,

2 regions: 6 triangles congruent to 1; 12; 19,




6 regions: 12 triangles congruent to 1; 5; 17, and

12 regions: 2 triangles congruent to 1; 3; 5.

i

i

“master” — 2012/11/15 — 11:23 — page 167 — #181i

i

i

i

i

i

22Finding All Solutions

to the Magic Hexagram

Alexander Karabegov and Jason Holland

Problem 394 in Henry Dudeney’s 536 Puzzles and Curious problems [1] poses this puzzle:

Put the numbers from 1 to 12 in the circles on the left in Figure 22.1 so that the sum of the

four numbers on each line is 26. One solution is shown on the right.

6

2 5 9 10

G

I H F E

12 7DJ

1

3 11 48CB

A

LK

Figure 22.1. The hexagram along with one solution.

The solution pictured in Figure 22.1 is listed as solution number 16 in Table 22.1 at the

end. Table 22.1 contains 20 solutions to the puzzle. The letters in Figure 22.1 on the left

are used as positions for the solutions in Table 22.1.

There are many puzzles referred to as magic stars (see, for example, [2, Ch. 5]). While

such puzzles are usually fun, they often require the “brute force” method (also called the

method of exhaustion) for their solution. In this brief article, we present a systematic ap-

proach to finding all solutions of Dudeney’s puzzle. The key to this approach is something

known as an out-shuffle in cards.

It is known that if two solutions are considered to be the same when one can be obtained

from the other by rotation or reflection, then there are 80 different solutions to this puzzle.

(Curiously, Dudeney erred in his count, finding only 74, see [1, p. 350]). Martin Gardner

[2, chapter 21 in this collection] called this puzzle the Magic Hexagram and pointed out

Reprinted from The College Mathematics Journal, Vol. 39, No. 2 (Mar. 2008), pp. 102–106.

167

i

i

“master” — 2012/11/15 — 11:23 — page 168 — #182i

i

i

i

i

i


Table 22.1. The twenty non-equivalent solutions to the Magic Hexahedron.

A B C D E F G H I J K L

1 1 7 2 8 10 4 12 3 9 5 6 11

2 1 10 2 8 7 4 12 6 9 5 3 11

3 1 5 7 9 11 4 6 8 3 10 2 12

4 1 8 4 6 11 7 9 5 3 10 2 12

5 1 5 7 11 9 6 2 8 3 12 4 10

6 1 10 8 4 11 5 9 3 7 12 2 6

7 1 7 3 12 6 2 9 8 10 4 5 11

8 1 5 3 12 8 2 9 6 10 4 7 11

9 1 4 2 10 11 5 9 3 7 6 8 12

10 1 9 3 11 5 4 8 10 7 6 2 12

11 1 9 2 6 10 7 11 4 5 8 3 12

12 1 6 3 11 8 2 10 7 9 4 5 12

13 1 4 7 11 10 2 6 9 5 8 3 12

14 1 7 2 10 8 3 11 6 9 4 5 12

15 1 9 2 10 6 11 3 4 5 12 7 8

16 1 8 4 7 10 9 6 5 2 12 3 11

17 1 2 5 12 11 3 6 4 8 7 9 10

18 1 9 5 12 4 3 6 11 8 7 2 10

19 1 7 3 8 10 9 6 5 2 11 4 12

20 1 6 5 10 9 7 4 8 2 11 3 12

that there are indeed 80 different solutions. He also observed that it is equivalent to placing

the numbers from 1 to 12 on the edges of a cube so that the sum of the four labels on every

face is 26. This identification is illustrated in Figure 22.2.

There seems little choice but to call Gardner’s version the Magic Hexahedron (unless

it be called the Magic Cube). The two versions are not entirely equivalent however—the

6

2 5 9 10

12 7

1

3 11 48

9

6

5

4

7

10

8

11

121

2

3

Figure 22.2. The identification between the hexagram and the cube.

i

i

“master” — 2012/11/15 — 11:23 — page 169 — #183i

i

i

i

i

i

22. Finding All Solutions to the Magic Hexagram 169

6

25 910

12 7

1

311 4 8

9

6

5

4

7

10

8

11

121

2

3

Figure 22.3. Solution generated from Figure 22.2 by a reflection.

cube has more symmetries than the hexagram (or, equivalently, the hexagon). While the

symmetry group of a hexagon has order 12, that of the cube has order 48. For each solution

to the Magic Hexahedron, there are four Magic Hexagram solutions. We use Figure 22.2

to illustrate the latter point. If one reflects the cube across the vertical plane through the

midpoints of the edges labeled 7, 6, 12, and 1, one gets the solution shown on the left in

Figure 22.3.

The corresponding Magic Hexagram is shown on the right in Figure 22.3. Clearly, this

is different from the solution of the Magic Hexagram in Figure 22.2, that is, no symmetry

of one of these hexagrams can produce the other. (For an excellent discussion of the full

symmetry group of the cube, see [3, pp. 112–113].) We now turn to determining that there

are twenty different Magic Hexahedron solutions.

To facilitate our enumeration, we consider two ways of classifying a face in a solution.

First, a face is parity-balanced if it has two odd labels and two even ones. Note that the

only non-parity balanced faces have all labels odd or all even. A solution is then called

parity-balanced if all of its faces have this property. Observe that in a solution that is not

parity-balanced, one face must have all labels odd, one must have all labels even, and the

other four labels must alternate in parity around the cube. Since the only possibilities for a

face that is not parity-balanced are f1; 5; 9; 11g, f3; 5; 7; 11g, f2; 4; 8; 12g, and f2; 6; 8; 10g,

it is straightforward (and not difficult) to find the ten solutions that are not parity-balanced.

These are the first ten solutions in Table 22.1.

We now turn to our second way of classifying a solution. We call a face magnitude-

balanced if two of its labels are 6 or less and the other two are 7 or more, and a solution

is magnitude-balanced if all of its faces are. Note that of the solutions 1–10 in Table 22.1,

only the first two do not have this property.

Given a magnitude-balanced face, its four labels can be written as i , j , k C6, and l C6,

where each of i , j , k, and l is at most 6. Observe that since i C j C k C l D 14, the set

f2i � 1; 2j � 1; 2k; 2lg then is a parity-balanced set with sum 26. Hence, the mapping M

with

M.n/ D(

2n � 1 if 1 � n � 6,

2.n � 6/ if 7 � n � 12,

i

i

“master” — 2012/11/15 — 11:23 — page 170 — #184i

i

i

i

i

i


transforms a magnitude-balanced solution to one that is parity-balanced. If we apply the

mapping M to the eight solutions 3–10 in Table 1 (perhaps twice), we get the ten solutions

11–20 in Table 22.1. This process is illustrated in Table 22.2 where arrows to the right

indicate that the mapping M has been applied.

Table 22.2. Solution numbers from Table 22.1 along with what happens when M is applied.

3 ! 11 ! 12

4 ! 13 ! 14

5 ! 15

6 ! 16

7 ! 17

8 ! 18

9 ! 19

10 ! 20

All that remains is to show that there are no other solutions to the Magic Hexahedron

puzzle. For this, we use the inverse P of the mapping M :

P.n/ D

8

<

:

12.n C 1/ if n is odd,

12n C 6 if n is even.

This is a permutation of f1; 2; : : : ; 12g that takes a parity-balanced solution to one that is

magnitude-balanced. In a deck of 12 cards, this permutation is known as an out-shuffle [4].

We now show that every parity-balanced solution eventually leads through repeated

application of P to a non-parity solution. Suppose not, and let S be such a solution. Note

that our out-shuffle P leaves 1 and 12 fixed and permutes the other ten numbers in the

cycle .2; 7; 4; 8; 10; 11; 6; 9; 5; 3/. One of the faces in the solution S contains 12 and not

1. Sooner or later, if we apply P repeatedly, the labels on this face include 12, 11, and

two other numbers, neither of which can be 1. Since their sum is 26, this is impossible.

Hence, every parity-balanced solution eventually leads to a non-parity-balanced solution.

Therefore, every parity-balanced solution arises from a non-parity-balanced one by apply-

ing the mapping M , either once or twice. Thus, Table 22.1 gives all solutions to the Magic

Hexahedron puzzle.

Theorem. The Magic Hexahedron puzzle has precisely twenty non-equivalent solutions.

Corollary. The Magic Hexagram puzzle has precisely eighty non-equivalent solutions.

Acknowledgments The authors wish to thank John Barton who wrote a program that

verified all 960 solutions to the Magic Hexagram and thus gave us plenty of data to work

with. We also wish to thank Amy Niemeyer who pointed out the term out-shuffle, Stephanie

Wong who brought this wonderful puzzle to our attention, and Lowell Beineke for suggest-

ing the name Magic Hexahedron.

i

i

“master” — 2012/11/15 — 11:23 — page 171 — #185i

i

i

i

i

i

22. Finding All Solutions to the Magic Hexagram 171

Bibliography

[1] H. E. Dudeney, 536 Puzzles & Curious Problems, Charles Scribner’s Sons, 1967.

[2] M. Gardner, Mathematical Carnival, Mathematical Association of America, 1989.

[3] F. Goodman, Algebra Abstract and Concrete, Prentice Hall, 1998.

[4] E. Weisstein, Out-Shuffle Mathworld webpage,

mathworld.wolfram.com/Out-Shuffle.html.

mathworld.wolfram.com/Out-Shuffle.html

http://mathworld.wolfram.com/Out-Shuffle.html

i

i

“master” — 2012/11/15 — 11:23 — page 172 — #186i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page 173 — #187i

i

i

i

i

i

23Triangular Numbers, Gaussian

Integers, and KenKen

John J. Watkins

One of the first of Martin Gardner’s Mathematical Games columns I ever read was

“Euler’s Spoilers,” in November 1959 [1], and after all these years I think it is still my

favorite (maybe this is just because I love the way he rhymed ‘Euler’ and ‘spoiler’). It dealt

with what we now call Latin squares, n � n arrays using n symbols such that each symbol

appears exactly once in each row and column.

Latin squares are extremely useful in the design of statistical experiments, but they

are perhaps better known now for their appearance in recreational puzzles such as sudoku

where each row and each column of a 9 � 9 array contains each of the integers 1; 2; : : : ; 9

exactly once. In 2004, a Japanese mathematics teacher, Tetsuya Miyamoto, invented a

sudoku-like puzzle called KenKen (loosely translated, this means “wisdom squared”).

KenKen quickly became so popular that it is now a standard feature in newspapers around

the world, and there are two very good web sites ([2], [3]) that offer new puzzles daily

online.

As in sudoku, the goal in KenKen is to fill an n � n grid with the numbers 1 through

n so as not to repeat a number in any row or column. Despite the fact that it uses num-

bers, sudoku is not an arithmetical puzzle; any nine symbols could be used in place of the

integers 1–9. Solving a KenKen puzzle, on the other hand, depends heavily upon several

important ideas about numbers.

Triangular numbers

In Figure 23.1, we see a typical KenKen where each heavily outlined ‘cage’ indicates

what must happen inside that particular cage. For example, inside the cage labeled ‘24�’

the product of the three numbers must be 24; hence, the three numbers in this cage are

forced to be 4, 3, and 2. The cage labeled ‘5�’ must contain the three numbers 5, 1, and

1 (because 5 is prime); moreover, we can immediately place these three numbers in this

cage in their correct positions since the two 1s must appear in different rows and different

columns. Since the only ways to partition the integer 10 into a sum of three distinct integers


173

i

i

“master” — 2012/11/15 — 11:23 — page 174 — #188i

i

i

i

i

i


24× 5×

12×

10+

10+

2÷4 –

1–

1–

2

3

(a)

Figure 23.1. A typical 5 � 5 KenKen.

1; 2; 3; 4, and 5 are as 10 D 5 C 4 C 1 or as 10 D 5 C 3 C 2, we see that we have two

choices for the numbers to put in each of the cages labeled ‘10C’.

It almost goes without saying that prime factorization and partitions of integers are

important in solving KenKen puzzles. What is quite surprising is that another important

notion in number theory, the ancient Greek concept of triangular numbers, can also often

be used to solve a KenKen because in any solution to a KenKen the sum of the numbers

in any row or column of the grid is 1 C 2 C � � � C n; that is, the sum is the nth triangular

number. In Figure 23.1, the sum of the numbers in any row or column must be 15 (that is,

the triangular number 15 D 1 C 2 C 3 C 4 C 5). For example, since in the bottom row the

sum in the first cage is 10, the sum in the second cage must be 5. Hence, that cage contains

2 and 3. But, there is already a 3 in the fourth column, so we can immediately place the 2

and 3 in their correct positions in the bottom row.

At this point we can use a similar argument for the left-hand column to conclude that

the sum of the two numbers in the upper and lower left-hand corners must also be 5. Thus,

these numbers are either 2 and 3, or 1 and 4. But, they can’t be 2 and 3 because we have

already placed both 2 and 3 in the bottom row. Therefore, we know these two numbers

must be 1 and 4, and we even know that the 4 goes in the upper corner since the cage

labeled ‘24�’ contains 4, 3, and 2. It is now a straightforward matter to finish solving this

puzzle. (Solutions for the ordinary KenKen puzzles in this article are on p. 178.)

In Figure 23.2, we see a 4 � 4 KenKen puzzle that normally would require a lot of trial

and error to solve. For example, it is clear that the ‘3�’ cage contains a 1 and 4 and the

cage below contains a 2 and 3, but a good deal of experimenting would need to take place

to determine the exact placement of these four numbers. Fortunately, we can use triangular

numbers to avoid a tedious trial and error process.

It is clear that the two numbers a and b in the ‘2�’ cage must be 1 and 2 in some order.

Since the sum of the numbers in the two bottom rows is an even number .20 D 10 C 10/,

we immediately conclude that b is also even (because 9 is odd and the sum of the two

numbers in the ‘1�’ cage must also be odd). Therefore, b D 2, and it is now easy to solve

i

i

“master” — 2012/11/15 — 11:23 — page 175 — #189i

i

i

i

i

i

23. Triangular Numbers, Gaussian Integers, and KenKen 175

2÷

12×3– 1–

1–

4

9+

a

b

(b)

Figure 23.2. Using triangular numbers to find b.

this puzzle without any trial and error (the first column can now be completed and the

‘12�’ cage cannot contain a 1). Note that we could have added all the known quantities in

the bottom two rows and then subtracted the resulting sum, 18, from 20 to find b, but that

takes more work, and often, as in this case, a simple parity argument provides the needed

information.

In Figure 23.3, we have a rather difficult 6 � 6 KenKen that can also be solved with

the aid of triangular numbers. We focus first on the ‘4�’ cage in the bottom two rows. The

question is whether the 4 factors as 4 � 1 � 1 or as 2 � 2 � 1. Note that, in either case, since the

120 in the ‘120�’ cage in the two bottom rows must factor as 6 � 5 � 4, we can conclude that

the ‘2�’ cage in the bottom row does not contain both 2 and 4 (because in the first case

there are already two 4s in the bottom two rows, and in the second case there are already

12×

120×

120×4×

11+

12+

10+

2

3

1–

3–

3÷

3÷2÷

2÷

7+

a

b

c

d

(c)

Figure 23.3. Using triangular numbers to find a; b and c.

i

i

“master” — 2012/11/15 — 11:23 — page 176 — #190i

i

i

i

i

i


two 2s in the bottom two rows). Hence, this ‘2�’ cage contains either 1 and 2, or 3 and 6,

and either way the sum of the two numbers in this cage is odd. Now, 3 is also odd, as is the

sum of the numbers in the ‘120�’ cage (6 C 5 C 4 D 15); however, the sum in the ‘10C’

cage is even, as is the entire sum of the bottom two rows (being twice the triangular number

21). Therefore, we immediately see that a C b C c must be odd. Hence, the numbers in

the ‘4�’ cage are 2, 2, and 1. With this information we can quickly fill in the bottom two

rows (using the fact that the ‘11C’ cage in the top row must contain a 5 and 6), and easily

complete the rest of the puzzle without ever having to resort to trial and error.

More tricks

The puzzle in Figure 23.3 can also be used to illustrate a very sneaky technique I learned

from Barry Cipra in Atlanta at the eighth Gathering for Gardner conference. He used his

idea to solve sudoku puzzles, but the same idea can also be helpful for KenKen. So, let’s

start from scratch on the puzzle in Figure 23.3, and try to determine the value of d based

on an argument involving the assumption that there is a unique solution to this problem.

It is usually implicit, though rarely stated, in puzzles such as KenKen and sudoku, that

there is a unique solution to the puzzle, and this fact can sometimes be invoked as an aid

in finding that solution. Again, we consider the situation in the ‘120�’ cage at the bottom

of the puzzle. This time we look at the ‘11C’ cage in the top row directly above this cage.

Since the ‘11C’ cage must contain 5 and 6, if we assume that there is a unique solution to

this puzzle then there cannot also be a 5 and 6 in the two squares in the bottom row in the

‘120�’ cage (otherwise, simply switching the 5s and 6s in the top and bottom rows yields

two distinct solutions). And, of course, 5 and 6 can’t both appear in the 4th column in the

‘120�’ cage since either a 5 or a 6 appears in that same column in the ‘11C’ cage in the

top row. The only remaining option in the ‘120�’ cage then is if d D 4.

Factorials, the multiplicative analog of triangular numbers, can also be used in solving

KenKen puzzles. For example, in a 6 � 6 puzzle, if there is a straight ‘120�’ cage with

four squares in a row, then the product of the two extra squares in this row is 6 (because6Š

120D 6). So, if this ‘120�’ cage is next to an angled ‘12�’ cage with two squares in the

same row and one square in the row above, then the single square in the row above must

contain a 2.

Gaussian puzzles

One of the most enjoyable features of KenKen puzzles, especially larger ones, is the way

in which the various possibilities that arise from the prime factorization of an integer come

into play when dealing with irregularly shaped cages. Another number system with unique

factorization into primes is the Gaussian integers, all complex numbers of the form a C bi

where a and b are integers. What is fun about the Gaussian integers is that some familiar

numbers such as 2 and 5 are no longer prime since they can be factored: 2 D .1 C i/.1 � i/

and 5 D .1 C 2i/.1 � 2i/.

We conclude this tribute to Martin Gardner with several KenKen puzzles for the reader

to solve featuring the Gaussian integers (solutions are on p. 186). Here are a few hints

to get you started. Consider the ‘4�’ cage in puzzle (d). Since 4 D 2 � 2 and we are not

i

i

“master” — 2012/11/15 — 11:23 — page 177 — #191i

i

i

i

i

i

23. Triangular Numbers, Gaussian Integers, and KenKen 177

4×

2×2×

3+

3+

i –

1+ i – i – i –

i –

1– i –

8×

4×

(d) (e)

Figure 23.4. Solve using the four numbers 1, 1C i , 1 � i , and 2.

allowed to repeat a 2 in a column, there is only one useful way to factor 4 as a product of

three distinct numbers: 4 D 2.1 C i/.1 � i/. Thus, we know which three numbers belong

in this cage (though not their order). This tells us immediately that a 1 goes in the lower

left-hand corner (and then a 2 next to it in the ‘2�’ cage). There are two ‘3C’ cages in

this puzzle, one with three squares and one with two squares; it is easy to see that the only

possibility is for the cage with three squares to contain 1; 1 C i , and 1 � i , and for the cage

with two squares to contain 1 and 2.

What about the ‘i�’ cage in this puzzle? Clearly 2 cannot appear here since 2 minus

any of the other three numbers leaves a 1 behind. Also, this cage cannot contain both 1 C i

and 1� i since .1C i/� .1� i/ D 2i . So, the only options for this cage are for it to contain

1 and 1 C i (since .1 C i/ � 1 D i ), or for it to contain 1 and 1 � i (since 1 � .1 � i/ D i ).

5× 2×

5×4×2+2 i ×

2–2 i ×

1+2 i –

1+2 i–

i –

3 i –

4+ 3+

3+

i ÷

(f)

Figure 23.5. Solve using the six numbers 1, 1C i , 1 � i , 1C 2i , 1 � 2i , and 2.

i

i

“master” — 2012/11/15 — 11:23 — page 178 — #192i

i

i

i

i

i


Similarly, the ‘1 C i�’ cage must contain a 2 (since, otherwise, the 1s completely cancel

during subtraction). Thus, the only possibility is for this cage to contain 2 and 1 � i (since

2 � .1 � i/ D 1 C i ). The Gaussian integers, as lattice points in the complex plane, are

also vectors. For example, 1 C i can be thought of as the vector from the origin .0; 0/ to

.1; 1/. Geometrically, the difference ˛ � ˇ between any two complex numbers ˛ and ˇ is

just the vector from point ˇ to point ˛. Thus, in puzzle (d), it is easy to spot which pairs

of numbers can go in the ‘i�’ cage because the i vector can only be drawn from .1; 0/ to

.1; 1/, or from .1; �1/ to .1; 0/; similarly, we can determine the pair of numbers that must

go in the ‘1 C i�’ cage merely by observing that we can only draw a 1 C i vector from

.1; �1/ to .2; 0/.

Bibliography

[1] M. Gardner, Euler’s spoilers: the discovery of an order-10 Graeco-Latin square, New Mathemat-

ical Diversions, Mathematical Association of America, Washington DC, 1995, 162–172.

[2] The official KENKEN web site with puzzles posted daily; available at www.kenken.com.

[3] The New York Times puzzle web site with puzzles posted daily; available at

www.nytimes.com/kenken.

Solutions to the ordinary KenKen puzzles (a)–(c) on pages

174–175

(a)4 3 2 1 5

3 2 4 5 1

2 5 1 3 4

5 1 3 4 2

1 4 5 2 3

(b)1 3 4 2

4 1 2 3

3 2 1 4

2 4 3 1

(c)3 4 1 5 6 2

5 1 4 3 2 6

6 5 2 1 3 4

4 6 5 2 1 3

1 2 3 6 4 5

2 3 6 4 5 1

www.kenken.com

http://www.kenken.com

www.nytimes.com/kenken

http://www.nytimes.com/kenken

i

i

“master” — 2012/11/15 — 11:23 — page 179 — #193i

i

i

i

i

i

VFurther Puzzles and Games

i

i

“master” — 2012/11/15 — 11:23 — page 180 — #194i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page 181 — #195i

i

i

i

i

i

24Cups and Downs

Ian Stewart

Martin Gardner was fond of mathematically-based magic tricks using simple apparatus

—in particular, pennies. Chapter 2 of his Mathematical Carnival [2, pages 12–26] is en-

tirely about penny puzzles. About five years later, Gardner and Karl Fulves invented a

delightfully simple trick with three pennies (see Demaine [1]). Its explanation involves an

important mathematical idea called a state diagram, useful in many combinatorial prob-

lems.

A related trick, usually described using cups instead of coins, has the same state dia-

gram with ‘heads’ and ‘tails’ replaced by ‘up’ and ‘down.’ This article explores the rela-

tions between the two tricks, develops the deeper mathematics that they have in common,

and discusses a generalization of the cups trick. (For other developments from Gardner’s

penny puzzles, see [1].)

Gardner’s three-penny trick

The trick is performed by a blindfolded magician. A volunteer places three pennies in a

row, and chooses at will whether each coin shows heads or tails. However, both heads and

tails must appear, otherwise the trick ends before it begins. The magician announces that

even though she cannot see the coins, she will give instructions to turn coins over so that

all three coins show the same face, heads or tails.

The instructions are:

1. Flip the left-hand coin.

2. Flip the middle coin.

3. Flip the left-hand coin.

After steps 1 and 2 the magician asks whether all three coins show the same face, and

if the answer is ‘yes’, the trick stops, otherwise the magician requests the third flip. Figure

24.1 shows an example. Although it is plausible that enough flips will eventually get all

coins the same way up, it is a little surprising that at most three flips are needed.

In this case it is straightforward to list all possible starting configurations, and check

that three moves always suffice. However, this is not a very satisfactory explanation. All be-


181

i

i

“master” — 2012/11/15 — 11:23 — page 182 — #196i

i

i

i

i

i

182 Part V. Further Puzzles and Games

(a) Typical start. . . (b) Flip the left. . .

(c) Flip the middle. . . (d) Flip the left.

Figure 24.1.

comes crystal-clear, if we draw the state diagram, which shows all possible configurations,

together with the transitions that occur when the instructions are obeyed.

The diagram itself naturally forms a cube, Figure 24.2a, in which each coin corresponds

to an axis in 3-dimensional space, and H, T (respectively) are the coordinates 0, 1. The

effects of moves 1 and 2 are shown in the figure, and it can easily be checked that starting

anywhere except HHH and TTT, a sequence of at most three moves always leads to HHH

or TTT.

⋂⋃⋂

⋂⋂⋂

⋂⋂⋃

⋂⋃⋃

⋃⋃⋂

⋃⋂⋂

⋃⋂⋃

⋃⋃⋃

THT

TTT

TTH

THH

HHT

HTT

HTH

HHH

flip

middle

flip

middle

flip

middle

flip

middle

flip left

flip left

flip left

flip left

Figure 24.2. (a) State diagram for the coin trick. (b) State diagram for the cup trick.

Three cups

The same state diagram occurs in the analysis of an old trick, which starts by placing three

cups upright on a table or a bar[[[

;

then turning the middle one over:[\[

:

You announce that you will turn all three of them upside down in exactly three moves,

where a move inverts exactly two cups. Like this:

i

i

“master” — 2012/11/15 — 11:23 — page 183 — #197i

i

i

i

i

i

24. Cups and Downs 183

[\[

\[[

[[\

\\\

:

It can be done in one move, but asking for three is a classic piece of misdirection, setting

up the second part of the trick. Which is to casually turn the middle cup upright:

\[\

;

and invite someone to repeat the trick. But now, whatever they do, they can’t get all three

cups upside down.

What they haven’t spotted is that you’ve altered the initial position, and the parity of the

upright cups has changed from even to odd. The original start position has even parity, and

so does the required end position. But the new start position has odd parity. Since inverting

two cups preserves parity, the task becomes impossible.

There are several different ways to think about this puzzle mathematically. One is to

draw its state diagram, which shows all possible states of the three cups and uses connecting

lines to show what happens if you turn cups upside down one at a time. Figure 24.2b shows

this. Of course, you have to invert them two at a time, so you redraw the diagram to show

which states are connected by two 1-cup moves, obtaining Figure 24.3. This falls into two

disconnected pieces: one contains the first starting state and the required finishing one, and

the other contains the second starting state. Obviously you can’t change components by

going along the lines.

⋃⋂⋂ ⋂⋃⋂

⋂⋂⋃

⋃⋃⋃

⋂⋃⋃ ⋃⋂⋃

⋃⋃⋂

⋂⋂⋂

Figure 24.3. State diagram for 2-cup moves. (Each edge represents two or three distinct combinations

of 1-cup moves.)

The problem doesn’t depend on the order in which the cups are arranged, just how

many are up (and hence how many are down). So you can ‘factor out the symmetry’ by

considering only the number of ‘up’ cups, getting Figure 24.4a for 1-cup moves and Figure

24.4b for 2-cup moves. Now the obstacle is even more obvious.

3 2 1 0 3 1 2 0

Figure 24.4. State diagrams identifying symmetrically related states. (a) 1-cup moves. (b) 2-cup

moves.

i

i

“master” — 2012/11/15 — 11:23 — page 184 — #198i

i

i

i

i

i


You can also represent parity by colouring odd numbers grey and even ones white. The

result, for Figure 24.4a, is Figure 24.5. Now any two-step move preserves colour (that is,

parity) so again there can’t be a solution.

3 2 1 0

Figure 24.5. Parity diagram for 1-cup moves.

Matrix solution and generalization

If you prefer algebra to graph theory, you can use the transition matrix. For 1-cup moves

this is a 4 � 4 matrix A whose rows and columns correspond to the four states in Figure

24.4, with entry aij being 1 if there is a move from state j to state i , and 0 if not. Clearly

A D

2

664

0 1 0 0

1 0 1 0

0 1 0 1

0 0 1 0

3

775

:

The transition matrix for 2-cup moves is just the square of A, and

A2 D

2

664

1 0 1 0

0 2 0 1

1 0 2 0

0 1 0 1

3

775

:

The entry 2 shows not just that there is a connection, but that there are two ways to realise

it, but this is information we don’t really need. So let’s introduce a new symbol, 1, subject

to the rules 1 C 1 D 1; 1 � 1 D 1. Then we can use 1 to represent the existence of

a connection, and 0 when there isn’t one. Now

A2 D

2

664

1 0 1 0

0 1 0 11 0 1 0

0 1 0 1

3

775

:

Sequences of 2-cup moves now correspond to powers of A2. But you can use the properties

of 1 to show that .A2/2 D A2, so such sequences can’t achieve anything new. Since the

start and end states are not connected, the problem is impossible.

A few years ago Man-Keung Siu, a mathematician at Hong Kong University, wondered

what happens if you allow more than three cups to start with, and each move inverts a

fixed number of cups. In particular, when can you start with all cups up and end with all

cups down, and what is the shortest sequence of moves that achieves this? Together, we

discovered that this problem has a surprisingly complicated answer. If there are n cups,

and any m of them can be inverted at each move, then this minimal number is shown in

Table 24.1. Here dxe is the ceiling function, the smallest integer � x.

i

i

“master” — 2012/11/15 — 11:23 — page 185 — #199i

i

i

i

i

i

24. Cups and Downs 185

Table 24.1.

n even, m even n odd, m odd n odd, m even n even, m odd

2m � n dn=me 2d.n�m/=2me C 1 no solution 2dn=2me2m > n 1 if m D n; 3 if m < n 1 if m D n; 3 if m < n no solution 2dn=2.n�m/e

The proof uses the transition matrix and is found in [3]. Exploring some of the simpler

cases is highly recommended: you will quickly appreciate that the general pattern is not at

all obvious.

Bibliography

[1] Erik D. Demaine. Puzzles and tricks from Martin Gardner inspire math and science,

American Scientist 98 (2010) 452–456; available at www.americanscientist.org/

issues/pub/2010/6/recreational-computing.

[2] M. Gardner. Penny puzzles, in Mathematical Carnival, Knopf, New York, 1975.

[3] I. Stewart and M.-K. Siu, How to invert n cups m at a time? Math. Today (Southend-on-Sea) 46

(2010) 34–38.

www.americanscientist.org/

http://www.americanscientist.org/issues/pub/2010/6/recreational-computing

issues/pub/2010/6/recreational-computing

http://www.americanscientist.org/issues/pub/2010/6/recreational-computing

i

i

“master” — 2012/11/15 — 11:23 — page 186 — #200i

i

i

i

i

i


Solutions to the Gaussian integer KenKen puzzles (d)–(f)

on page 177

(d)1 C i 1 2 1 � i

2 1 � i 1 C i 1

1 � i 1 C i 1 2

1 2 1 � i 1 C i

(e)1 C i 2 1 � i 1

1 1 C i 2 1 � i

2 1 � i 1 1 C i

1 � i 1 1 C i 2

(f)2 1 � 2i 1 C 2i 1 1 � i 1 C i

1 C 2i 1 � i 1 1 C i 2 1 � 2i

1 � 2i 1 1 C i 1 � i 1 C 2i 2

1 � i 1 C 2i 1 � 2i 2 1 C i 1

1 1 C i 2 1 C 2i 1 � 2i 1 � i

1 C i 2 1 � i 1 � 2i 1 1 C 2i

i

i

“master” — 2012/11/15 — 11:23 — page 187 — #201i

i

i

i

i

i

2530 Years of Bulgarian Solitaire

Brian Hopkins

“Oh, you’re a mathematician! Let me show you something interesting.”

But I’m trying to work on my talk, I thought. As the train sped along, the man sitting

across from me looked eager. OK, let’s get this over with.

“Here are fifteen playing cards. Arrange them into piles; as many piles as you like, each

with as many cards as you like.”

I made five piles with heights 3, 1, 4, 1, 6. The � reference went unnoticed.

“Now take one card from each pile to make a new pile.”

The operation left me with piles of 3 � 1 D 2 cards, 4 � 1 D 3 cards, 6 � 1 D 5 cards,

two empty piles from 1 � 1 D 0, and a new pile of 5 cards. I realized that the order of the

piles does not matter, so for consistency I put them in non-increasing order, 5; 5; 3; 2.

“Now do it again and again. I know what will happen!” He looked away.

Did he already think through the iterations? How long will this go? I was curious now.

Here is the sequence of pile sizes:

.6; 4; 3; 1; 1/ ! .5; 5; 3; 2/ ! .4; 4; 4; 2; 1/ ! .5; 3; 3; 3; 1/

! .5; 4; 2; 2; 2/ ! .5; 4; 3; 1; 1; 1/ ! .6; 4; 3; 2/

Oh no, that’s almost where I started. When will this end? But then, suddenly, it did end:

.6; 4; 3; 2/ ! .5; 4; 3; 2; 1/ ! .5; 4; 3; 2; 1/ again.

“Hmm,” I said.

“You ended with one pile of 5 cards, one of 4, one of 3, one of 2, and one of 1, didn’t

you?” He looked at the cards. “Yes! That’s always what happens. Try again!”

I started with a single pile of 15 cards. It took more moves, but I did indeed end up with

the 5; 4; 3; 2; 1 pattern. Then I started with three piles of 5 cards. It took even more moves,

but ended at the same fixed point.

Three examples is not a proof, but the claim was now reasonable. Why would it al-

ways go there? How long could it take to reach this fixed point? What happens with other

numbers of cards?

“This is interesting,” I admitted.


187

i

i

“master” — 2012/11/15 — 11:23 — page 188 — #202i

i

i

i

i

i


This puzzle was popularized by Martin Gardner in 1983 [9] with the unusual name

Bulgarian solitaire. In this article, we examine the earlier history of the puzzle including

its name, summarize subsequent research, and consider a new two-player variation.

Before Gardner

Around 1980, Konstantin Oskolkov of the Steklov Mathematical Institute in Moscow trav-

eled by train to give a talk in Leningrad (now Saint Petersburg). A man on the train told

him of the problem, although the details of the dialog above are fictional. Oskolkov shared

this with his colleagues at the institute; reportedly when one number theorist heard about it

“his face took a Satanic expression, he ran to his office, closed the door and did not come

out until he solved the problem.”

The puzzle reached Victor Gutenmacher, editor of the journal Kvant. The problem ap-

peared there in 1980, in the context of stacks of books. Andrei Toom, a research scientist

at Moscow State University, published a solution in 1981 [19]. The material was also in-

cluded in a 1981 book on mathematics olympiads [20] whose authors include Gutenmacher

and Toom.

Later in 1980, Anatolii Alexeevich Karatsuba traveled from Steklov to the Institute of

Mathematics of the Bulgarian Academy of Sciences in Sofia. After a lecture on approxi-

mation theory, he shared Oskolkov’s story and the puzzle. Again, it captured the attention

of many mathematicians, and Milko Petkov published it in the “competition problems”

section of a high school mathematics journal in 1980, in the context of heaps of balls. No

student solved the puzzle, so the solution of Petkov’s institute colleague Borislav Bojanov

was published in 1981 [4].

Gert Almkvist of Lund University was visiting Sofia while Karatsuba was there. When

Almkvist returned to Sweden, he shared the puzzle with colleagues, including Henrik

Eriksson of the KTH Royal Institute of Technology, who published a solution also in 1981

[7] with the name “Bulgarisk patiens,” in the context of piles of cards.

Jørgen Brandt, finishing his masters at Aarhus University, somehow learned of the

puzzle as well (without the name). “The problem appeared so pure, that I did not think

much about where it came from,” he explained recently. His thorough solution, in the

setting of integer partitions, was submitted to Proceedings of the American Mathematical

Society in 1981 and published in 1982 [5].

Meanwhile, Eriksson traveled from Stockholm to California, where he called the puzzle

Bulgarian solitaire. He recently explained, “The silly name is my invention, silly because

it is neither Bulgarian nor a solitaire.” Donald Knuth started a fall 1982 programming

and problem-solving seminar with Bulgarian solitaire [12]. Ron Graham passed it on to

Gardner, who used Bulgarian solitaire as the culminating example of “tasks you cannot

help finishing no matter how hard you try to block finishing them” [9]. The silly name

has stuck.

Gardner and beyond

A partition of n is a collection of positive integers � D .�1; : : : ; �t/ whose sum is n; as

the order does not matter, we index the parts in non-increasing order, i.e., �1 � �2 �

i

i

“master” — 2012/11/15 — 11:23 — page 189 — #203i

i

i

i

i

i

25. 30 Years of Bulgarian Solitaire 189

� � � � �t . Bulgarian solitaire can be considered as an operation on partitions, B.�/ D.t; �1 � 1; : : : ; �t � 1/ where there may be zeros to remove and the parts may need to be

reordered. This can also be visualized through a graphic representation of partitions. The

Ferrers diagram of a partition has a column of �1 dots followed by a column of �2 dots, etc.

Figure 25.1 shows the Bulgarian solitaire operation on .6; 4; 3; 1; 1/, choosing to remove

the bottom dot of each column, that is, the bottom row.

�

� � �

� � � �

� � � ! � � �

� � � � � � �

� � � � � � � � �

6 4 3 1 1 5 5 3 2

Figure 25.1. Example of the Bulgarian solitaire operation.

The number of partitions of n grows very quickly as n increases, but it is helpful to

look at the effect of B on all partitions of n for small n. Figure 25.2 shows all partitions of

6 under the operation.

2211 - 411 - 33 -

16

23

-

-

6

313

-

-

51

6

42

�

-

214

321�

Figure 25.2. Bulgarian solitaire on all partitions of 6; exponents denote repetition.

Like the 15 card puzzle of the introduction, Bulgarian solitaire on partitions of 6 leads

to a fixed point, the partition �3 D .3; 2; 1/. Both 6 and 15 are triangular numbers, i.e., can

be written as Tk D 1 C � � �C k D k.k C 1/=2. The effect of Bulgarian solitaire can be very

different on other values of n.

Exercise. Work out Bulgarian solitaire on the 22 partitions of 8. (For the solution, see [13,

Figure 4].)

The three 1981 solutions in European journals [4, 7, 19] use very similar ideas to show

that, for n D Tk , there is indeed just the single fixed point �k D .k; k � 1; : : : ; 2; 1/. Each

author argues that a ball / book / card cannot move to a longer diagonal in the configuration,

and will eventually move into the shortest diagonal possible, filling any gaps and leaving a

triangular shape. Brandt [5] and Toom [19] showed that for other values of n, the “almost

triangular” partitions form cycles. For example, the partitions .4; 2; 2/ and .3; 3; 1; 1/ form

a 2-cycle in one connected component of the partitions of 8 from the exercise; see Figure

25.3. From this characterization, Brandt used Polya enumeration theory to determine the

number of connected components. For instance, the partitions are in a single component

exactly when n is within one of a triangular number. See also [11], which revisits and

generalizes Toom’s solution, and [1], which elaborates on Brandt’s work.

The partitions of Figure 25.3 illustrate another relationship. Reading the row lengths of

.4; 2; 2/ rather than the column heights gives .3; 3; 1; 1/. Considered another way, reflect-

i

i

“master” — 2012/11/15 — 11:23 — page 190 — #204i

i

i

i

i

i


�

� � �

� � � ! � �

� � � � � � �

4 2 2 3 3 1 1

Figure 25.3. A two-cycle among the partitions of 8.

ing the Ferrers diagram for .3; 3; 1; 1/ across the diagonal line y D x gives .4; 2; 2/. These

partitions are said to be conjugates.

Gardner’s article [9] focuses on the fixed point result for triangular numbers, with

a discussion of using 45 cards, and the analog of Figure 25.2 for partitions of 10. He

mentions two other conjectures, which were both proven within a few years. First, for

n D Tk , data suggest that the greatest possible distance to the fixed point is linear in

n, specifically k.k � 1/. Igusa [15] shows that the partition k D .k � 1; k � 1; k � 2;

k � 3; : : : ; 3; 2; 1; 1/ is at distance k.k � 1/ from �k and that this distance is maximal; see

also [3] and [8]. Second, look carefully at the sequence of partitions from

3 D .2; 2; 1; 1/ to �3 D .3; 2; 1/ in Figure 25.2: until the fixed point, the sequence con-

sists of nested conjugate pairs, shown in Figure 25.4. Bentz [3] proves that this occurs

along the “main trunk” of the diagram for all n D Tk , which distinguishes k among the

partitions at maximal length from the fixed point (for k � 4, there are several partitions at

that distance). See also [2].

2211 - 411 - 33 - 23 - 313 - 42 - 321�

Figure 25.4. Conjugate pairs on the “main trunk” of Figure 25.2.

Gardner also mentioned “Eden” partitions that have no predecessor under the opera-

tion, such as 2 D .2; 2; 1; 1/, .2; 1; 1; 1; 1/, and .1; 1; 1; 1; 1; 1/ in Figure 25.2. Recently,

Hopkins and Sellers [14] determined a formula for the number of these partitions for gen-

eral n that resembles a recurrence result of Euler.

There remain many open questions about Bulgarian solitaire, several of which are sur-

veyed in Hopkins and Jones [13]. Griggs and Ho [10] conjectured maximal lengths to a

cycle partition for n ¤ Tk . How many partitions are at given length from a cycle partition

or fixed point? What is the relation between B and conjugation outside the “main trunk”?

When there are multiple components, is there an easy way to tell if two partitions are in

the same component? What can be said about component sizes? There are many questions

about asymptotic and random behavior; some are addressed by Popov [18].

Bulgarian solitaire continues to arise in the teaching literature, such as Nicholson [16]

and Doree [6]. The operation has been rediscovered at least once, see [17]. Several variants

have been considered in the literature, but there is not space here to discuss them.

A new game

Figure 25.1 shows the effect of changing the bottom row of the Ferrers diagram of

.6; 4; 3; 1; 1/ to a column. Suppose you could change any row to a column. Figure 25.5

i

i

“master” — 2012/11/15 — 11:23 — page 191 — #205i

i

i

i

i

i


shows the effect of changing the length 2 row to a column. Under this operation, .6; 4; 3; 1; 1/

could be followed by one of three partitions: .5; 5; 3; 2/ by selecting the bottom row,

.5; 3; 3; 2; 1; 1/ selecting the length 2 or a length 3 row, or .5; 4; 3; 1; 1; 1/ selecting a length

1 row.

�

� �

� � �

� � � ! � � �

� � � � � � �

� � � � � � � � � � �

6 4 3 1 1 5 3 3 2 1 1

Figure 25.5. Changing the length 2 row of .6; 4; 3; 1; 1/ to a column gives .5; 3; 3; 2; 1; 1/.

For a 2-player game, we start with the single-part partition .n/. In turn, each player

chooses a single row to change into a column. The loser is the first player who creates a

partition that has occurred before in play.

The first player can only chose a row of length 1, giving the partition .n � 1; 1/. The

second player then can choose a row of length 1 or 2, making the partition .n � 2; 1; 1/ or

.n � 2; 2/, respectively (for sufficiently large n). All possible moves for partitions of 6 are

shown in Figure 25.6.

16 - 6 - 51

�� ?

411

�� ?

42

??33

?

313

��

?

321�

??23

��

214

- �

2211�B

BBB

BB

BBM

Figure 25.6. All possible moves in the two-player game on partitions of 6. Double arrows indicate

where selecting either of two different row lengths produces the same partition.

Here are some open questions about this game.

� As a finite impartial two-player game of perfect information with no draws, either

the first or second player has a winning strategy. Which player? What is the strategy?

Do the answers depend on n?

� Notice that there is a Hamiltonian cycle in the directed graph of Figure 25.6. That

is, there is a sequence of play where every partition arises before there is repeti-

i

i

“master” — 2012/11/15 — 11:23 — page 192 — #206i

i

i

i

i

i


tion. I conjecture that this occurs for all n. If so, is there a nice way to describe the

corresponding sequence of moves?

Conclusion

A mathematical recreation that reportedly arose from a conversation in a train has gone

on to inspire several lines of research and student activities. This is due largely to Martin

Gardner’s attention to it in his mathematics column from Scientific American. Bulgarian

solitaire is just one example of the impact his work has had on all of us who enjoy mathe-

matics.

Acknowledgment The history of Bulgarian solitaire comes from personal communica-

tion with Borislav Bojanov, Jørgen Brandt, Vesselin Drensky, Victor Gutenmacher, Pencho

Petrushev, Andrei Toom, and especially Henrik Eriksson. Recollections of events thirty

years ago can be imperfect; hopefully the account provided here approaches what occurred.

Aleksandar Nikolov kindly provided translations from Bulgarian. Suggestions from two

anonymous referees improved the article. I would like to thank Eriksson, Suzanne Doree,

and Mizan Khan for their interest and help.

Bibliography

[1] E. Akin and M. Davis, Bulgarian solitaire, Amer. Math. Monthly 92 (1985) 237–250; available

at dx.doi.org/10.2307/2323643.

[2] T. Bending, Bulgarian Solitaire, Eureka 50 (1990) 12–20.

[3] H.-J. Bentz, Proof of the Bulgarian solitaire conjectures, Ars Combin. 23 (1987) 151–170.

[4] B. Bojanov, Problem solution 4, Obuchenieto po matematika (Mathematics Education) 24 (5)

(1981) 59–60.

[5] J. Brandt, Cycles of partitions, Proc. Amer. Math. Soc. 85 (1982) 483–486; available at

dx.doi.org/10.1090/S0002-9939-1982-0656129-5.

[6] S. Doree, Bulgarian Solitaire, Resources for Teaching Discrete Mathematics, B. Hopkins, ed.,

Mathematical Association of America, Washington DC, 2009, 83–92.

[7] H. Eriksson, Bulgarisk patiens, Elementa 64 (4) (1981) 186–188.

[8] G. Etienne, Tableaux de Young et solitaire bulgare. J. Combin. Theory Ser. A 58 (1991) 181–

197; available at dx.doi.org/10.1016/0097-3165(91)90059-P.

[9] M. Gardner, Mathematical Games: Tasks you cannot help finishing no matter how hard

you try to block finishing them, Scientific American 249 (1983) 12–21; available at

dx.doi.org/10.1038/scientificamerican0883-12. Also available as Bulgarian

solitaire & other seemingly endless tasks, The Last Recreations, Springer-Verlag NY, 2007, 27–

43.

[10] J. Griggs and C.-C. Ho, The cycling of partitions and compositions under repeated shifts, Adv. in

Appl. Math. 21 (1998) 205–227; available at dx.doi.org/10.1006/aama.1998.0597.

[11] T. Hart, G. Khan, and M. Khan, Revisiting Toom’s proof of Bulgarian solitaire, Ann. Sci. Math.

Qu., to appear.

[12] J. Hobby and D. Knuth, A Programming and Problem-Solving Seminar, Stanford University

Department of Computer Science report, 1983.

dx.doi.org/10.2307/2323643

http://dx.doi.org/10.2307/2323643

dx.doi.org/10.1090/S0002-9939-1982-0656129-5

http://dx.doi.org/10.1090/S0002-9939-1982-0656129-5

dx.doi.org/10.1016/0097-3165(91)90059-P

http://dx.doi.org/10.1016/0097-3165(91)90059-P

dx.doi.org/10.1038/scientificamerican0883-12


dx.doi.org/10.1006/aama.1998.0597

http://dx.doi.org/10.1006/aama.1998.0597

i

i

“master” — 2012/11/15 — 11:23 — page 193 — #207i

i

i

i

i

i


[13] B. Hopkins and M. Jones, Shift-Induced Dynamical Systems on Partitions and Compositions,

Electr. J. Comb. 13 (2006) R80 19 pp.

[14] B. Hopkins and J. Sellers, Exact Enumeration of Garden of Eden Partitions, Integers 7 (2)

(2007) A19.

[15] K. Igusa, Solution of the Bulgarian solitaire conjecture, Math. Mag. 58 (1985) 259–271.

[16] A. Nicholson, Bulgarian Solitaire, The Mathematics Teacher 83 (1996) 84–86.

[17] H. D. Phan and E. Thierry, Dynamics of the Picking transformation on integer partitions,

Discrete Math. Theor. Comput. Sci. AB(DMCS) (2003) 43–56.

[18] S. Popov, Random Bulgarian Solitaire, Random Struct. Alg. 27 (2005) 310–330; available at

dx.doi.org/10.1002/rsa.20076.

[19] A. Toom, Problem solution M655, Kvant (Quantum) 1981 (7) 28–30.

[20] N. Vasilyev, V. Gutenmacher, J. Rabbot, and A. Toom, Zaochnye matematicheskie olimpiady

(Mathematical olympiads by correspondence), Nauka, Moscow, 1981.

dx.doi.org/10.1002/rsa.20076

http://dx.doi.org/10.1002/rsa.20076

i

i

“master” — 2012/11/15 — 11:23 — page 194 — #208i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page 195 — #209i

i

i

i

i

i

26Congo Bongo

Hsin-Po Wang

An expedition into Congo uncovered a treasure chest in the shape of a regular octagon. At

each corner was a bongo drum. A scroll attached to the chest, written in French, explained

that there was a genie inside each bongo drum. A genie is either standing upright or doing

a handstand. One may strike a number of bongo drums at the same time. When a bongo

drum is struck, the genie inside will change its posture from right side up to upside down,

or vice versa. The treasure chest will open if and only if all genies are right side up, or

all are upside down. However, each time some bongo drums are hit the treasure chest will

spin rapidly on its vertical axis. As the bongo drums are all identical in appearance, after

the rotation it is impossible to tell which of them had just been hit.

Unfortunately, the scroll did not record the exact procedure by which the treasure chest

might be opened. However, it mentioned that such a procedure had been documented.

This document was considered so valuable that it was put inside the treasure chest for

safekeeping. It was a lot safer than was originally thought.

The sponsor of the expedition was definitely not pleased with the current state of affairs,

so she hired a team of mathematicians to try to open it. The task was seemingly hopeless,

and many gave up, until Dr. Jacob Ecco arrived with his sidekick, Professor Justin Scarlet.

“My dear professor,” said Dr. Ecco, “we must approach this challenge systematically.

What is the most basic principle in problem solving?”

“Downsizing,” replied the professor, who was well versed with the methods of Dr.

Ecco. “Suppose there is only one bongo drum. Then the treasure chest will open automat-

ically.”

“What if there are two bongo drums?” asked Dr. Ecco. “Well, if the treasure chest is

not already open, hitting either of the bongo drums will do it.”

“Excellent! As usual, my dear friend, you have provided me with the inspiration that

leads to the solution.”

“How?” cried Professor Scarlet. “I have made but the most trivial observation.”

“Nevertheless, a solution to the case with two bongo drums leads to a solution to the

case with four bongo drums, which in turn leads to a solution to the case with eight bongo

drums,” Dr. Ecco proclaimed.

“I can smell that you are going to use induction, but l still do not see the connection

from one case to the next,” replied Professor Scarlet.

Reprinted from Math Horizons, Vol. 18, No. 1 (Sept. 2010), pp. 18–21.

195

i

i

“master” — 2012/11/15 — 11:23 — page 196 — #210i

i

i

i

i

i


“Be patient. Obviously, we will be treating pairs of bongo drums as a single bongo

drum. How can we identify pairs of bongo drums? Adjacency is unsatisfactory because

after the treasure chest spins around, it is impossible to tell whether a bongo drum is to be

paired with its neighbor to the left or its neighbor to the right.”

“The obvious solution is to identify opposite pairs of bongo drums,” pronounced Pro-

fessor Scarlet.

“What insight!” said Dr. Ecco with a slight smile. “This is fine if the genies in each

opposite pair of bongo drums are both right side up or both upside down. But there is no

guarantee that they start off like that,” his companion pointed out.

“True, but there is a way to make them end up that way. Let 0 or 1 indicate whether

a genie is right side up or upside down, and consider a four-sided chest. If we are really

lucky, the initial state may already be (0,0,0,0) or (1,1,1,1).”

“You may be good enough to be that lucky, but that never works with me,” complained

Professor Scarlet.

“I am not counting on luck. I am just singling out the most favorable scenario. We may

regard (0,0,0,0) and (1,1,1,1) as the same state. Call it state 0. It is also referred to as an

absorbing state, in that once we enter it we do not leave (for the problem is then solved).

How many other states are there?” asked Dr. Ecco.

Professor Scarlet paused. “Well, we have three other states, namely, (0,1,1,1), (0,0,1,1),

and (0,0,0,1).”

“You are correct as far as the number of other states is concerned, but wrong about their

composition,” Dr. Ecco said. “By symmetry, we may consider (0,1,1,1) and (0,0,0,1) as the

same state. We call it state 3. Your (0,0,1,1) is state 2, which is not the same as (0,1,0,1).

This I call state 1.”

“Yes!” exclaimed Professor Scarlet excitedly. “In state 1, the genies in each opposite

pair of bongo drums are both right side up or both upside down. In the case with two bongo

drums, we hit one of them. So here we hit an opposite pair of bongo drums, and the treasure

chest will open.”

“We will call hitting an opposite pair of bongo drums operation A, and as you keenly

observed, performing this operation from state 1 will unlock the chest. What happens when

I perform operation A if we are in state 2 or state 3?”

“Let me see,” Professor Scarlet said. “Ah, we will remain in the same state as before.”

“How can I move from state 2 to state 0 or 1?” Dr. Ecco asked.

“Well, we may hit an adjacent pair of bongo drums. I suppose this will be operation B.

If we hit both zeros, or both ones, we will be in state 0 immediately. If we hit one 0 and

one 1, we will be in state 1.”

“Indeed, you can check that state 3 is left unchanged by operation B, just as it was

unchanged by operation A,” Dr. Ecco agreed.

“The rest is easy now. We perform operation C by striking just one bongo drum. This

will change state 3 into state 0, state 1, or state 2,” Professor Scarlet said triumphantly.

“Not so fast,” Dr. Ecco cauntioned. “After operation B, we may be in state 1, and if we

rush into operation C now, state 1 will become state 3, and we will be going in circles.”

“I was hasty,” admitted Professor Scarlet. “We must perform operation A once more

to clear state 1 before taking care of state 3. After operation C, we will clear the lower

states with the sequence ABA again. So the overall procedure for the four-sided chest is

ABACABA.”

i

i

“master” — 2012/11/15 — 11:23 — page 197 — #211i

i

i

i

i

i

26. Congo Bongo 197

“Excellent. Here, l have drawn a state transition diagram for you,” Dr. Ecco said.

State 0 State 1 State 2 State 3

0

00

0

1

11

1

1

1

1

0

0

0 0

10

0

1

1

1 1

0

0

C

A

A/B

B A

Here is the same drawing, with an emphasis on the structure of the transition opera-

tions:

BB

B

C

C

C AA

A

State 0

State 1

State 2

State 3

“That explains everything clearly!” said the professor. “Now I am confident we are onto

something that will crack the eight-sided treasure chest. Let’s see. We wish to treat each

pair of opposing bongo drums as a single entity. As before, the simplest states are those in

which the genies in each opposite pair of bongo drums are both right side up or both upside

down. Here is my drawing of this part of the state transition diagram. It is really the same

as yours, except that operation A means hitting every other pair of opposing bongo drums;

operation B means hitting any two adjacent pairs of opposing bongo drums; operation C

means hitting any pair of opposing bongo drums. By performing the sequence ABACABA

when starting from any such simple state, the treasure chest will open.”

“Excellent, my dear professor,” exclaimed Dr. Ecco. “These states together form an

expanded absorbing state in the overall diagram below. It is the box marked 4, where the

box marked m contains all states with m matching opposite pairs, for 0 D m D 4.”

i

i

“master” — 2012/11/15 — 11:23 — page 198 — #212i

i

i

i

i

i


0 0

0 01

11

11

1

1

1

00

0

A

BA

A/B

0

00 0

0

000

1 111

11 1

1

C

0

00 0

1

111 0

000

1

1 110

“This diagram looks suspiciously like the ones before, except that states 4 and 0 are no

longer equivalent. Why do you have a state 2 and a state 20? And what are the operations

D, E, F, and G?”

D

E

E

F

F

GG

F

E

G

GD

D

D

F

E 1 or 3

4

0

2

2¢

“The states with 2 matching pairs are classified according to whether those matching

pairs are alternating or adjacent,” Dr. Ecco said. “The former are grouped under state 2

while the latter are grouped under state 20. Operation D hits any four adjacent bongo drums.

Operation E hits any two bongo drums separated by one other drum. Operation F hits any

two adjacent bongo drums. Operation G hits any one bongo drum.”

“Let me see. If we denote the sequence ABACABA by X, then the sequence that will

open the treasure chest is XDXEXDXFXDXEXDXGXDXEXDXFXDXEXDX,” summed

up Professor Scarlet.

“We keep repeating X so that whenever we enter state 4, we will not return to another

state. Whatever the initial state of the treasure chest may be, it will open by the end of this

sequence.”

The sponsor of the expedition was delighted with their analysis. She hired a team of

Congo natives to strike the bongo drums in the prescribed fashion. Lo and behold, the

treasure chest opened right before her eyes. Unable to contain her excitement, she reached

inside, but found only another scroll. It was written in Kituba, but the most prominent line

read something like XDXEXDXFXDXEXDXGXDXEXDXFXDXEXDX.

i

i

“master” — 2012/11/15 — 11:23 — page 199 — #213i

i

i

i

i

i

26. Congo Bongo 199

Later, back in his New York apartment, Dr. Ecco said to Professor Scarlet, “That was

hilarious, wasn’t it? By the way, have you thought of the obvious question?”

“Yes, l have,” replied the professor. “For what number of bongo drums can such a

treasure chest be opened? From our analysis, the general procedure when the number is

a power of two is clear. We treat each opposing pair as a single entity, thereby reducing

to the preceding case. Then we progressively move all states into the expanded absorbing

state. But what happens if the number of drums is not a power of two?”

“Then unlocking the chest is not always possible,” Dr. Ecco replied. “For in this case

the number of bongo drums has an odd prime factor p. Suppose the pagan god Hemba is

having fun with us. He will choose p evenly spaced bongo drums and make sure that the

genies inside are not all right side up and not all upside down. Now ignore all other bongo

drums except these p. The two types of bongo drums are not equal in number since p is

odd. In order for us to succeed, we must strike precisely the bongo drums of one type. But

Hemba will spin the treasure chest so that the bongo drums we plan to hit include at least

one from the opposite group. This way, he can keep us from opening the treasure chest

forever.”

See problem 249 in the Playground below for more fun with these treasure chests.

Further Reading

This problem was posed in the Senior A-Level paper in the 2009 Fall Round of the Inter-

national Mathematics Tournament of the Towns. It is related to a problem posed in Martin

Gardner’s famous “Mathematical Games” column in Scientific American (February 1979),

which is reproduced in his anthology Fractal Music, Hypercards and More Mathematical

Recreations (W.H. Freeman, 1992). Gardner’s version of the problem is also treated in

three other sources:

Ted Lewis and Steve Willard, The rotating table, Math. Mag., 53 (1980): 174–179.

William Laaser and Lyle Ramshaw, Probing the rotating table, in The Mathematical

Gardner, edited by David Klarner, (Wadsworth, 1981) 288–307.

Albert Stanger, Variations on the rotating table problem, J. Recreat. Math., 19 (1987):

307–308 and 20 (1988): 312–314.

Jacob Ecco and Justin Scarlet are fictitional characters created by Dennis Shasha of the

Courant Institute, New York. He is a leading creator of mathematical puzzles. Shasha has

written a number of puzzle books featuring these clever detectives. A good place to start is

his book The Puzzling Adventures of Doctor Ecco (Dover, 1998).

Playground Problem 249

The end of the article explains why no sequence of drum strikes is guaranteed to open

an n-drum chest unless n is a power of 2. However, for other n it may be possible to

have a pretty good chance of opening the chest during a certain sequence of drum strikes,

assuming that the spinning between strikes is truly random. This leads us to Problem 249,

Congo Bongo not so Wrongo: What is the smallest number of drum strikes needed on a

three-drum treasure chest to give you at least a 90 percent chance of opening it? (As a

separate further challenge, which may require computer assistance, try to answer the same

question for a five-drum chest.)

i

i

“master” — 2012/11/15 — 11:23 — page 200 — #214i

i

i

i

i

i


Problem 249 Solution

The Math Club from the University of Minnesota Duluth submitted a correct solution that

6 drum strikes are required; there were two incorrect submissions. First, the club members

realized that there are only two possible states for the chest: either all three of the genies

have the same orientation (state A, which causes the chest to open) or just two of the

genies have the same orientation (state B). Then, they realized that striking either one or

two drums will cause a chest in state B to move to state A with probability 1/3 and stay in

state B with probability 2/3.

Assuming that the chest is initially in state B (otherwise it would open immediately!),

the probability that the chest opens in k or fewer strikes is

1=3 C .2=3/.1=3/ C .2=3/2.1=3/ C � � � C .2=3/k�1.1=3/ D 1 � .2=3/k

which first becomes greater than 0.9 when k D 6.

i

i

“master” — 2012/11/15 — 11:23 — page 201 — #215i

i

i

i

i

i

27Sam Loyd’s Courier Problem

with Diophantus, Pythagoras,

and Martin Gardner

Owen O’Shea

In his classic collection Cyclopedia of Puzzles, published in 1914, Sam Loyd has two

versions of the Courier Problem ([2, p. 315]):

For the reason that many communications are being received relating to a very

ancient problem, the authorship of which has been incorrectly accredited to

me, occasion is taken to present the original version which has led to consid-

erable discussion. It has been reproduced, in many forms, generally accompa-

nied by an absurd statement regarding the impossibility of solving it, which

produced letters of inquiry as well as correct answers from some, who, under

the misapprehension of having mastered a hitherto unsolved problem, desire

to have the same published.

It is a simple and pretty problem which yields readily to ordinary methods,

and can be solved by experimental analysis upon the plan generally adopted

by puzzlists. The trouble is that the terms of the problem are seldom given

correctly and are not generally understood, for which reason,. . . , we will first

look at the ancient version which appears in the oldest mathematical works:

A courier starting from the rear of a moving army, fifty miles long, dashes

forward and delivers a dispatch to the front and returns to his position in the

rear, during the exact time it required the entire army to advance just fifty

miles.

How far did the courier have to travel in delivering the dispatch, and returning

to his previous position in the rear of the army?

A better puzzle is created by the following extension of the theme given as

problem No. 2:

If a square army, fifty miles long by fifty miles wide, advances fifty miles

while a courier makes the complete circuit of the army and returns to the

starting point in the rear, how far does the courier have to travel?

Reprinted from The College Mathematics Journal, Vol. 39, No. 5 (Nov. 2008), pp. 387–391.

201

i

i

“master” — 2012/11/15 — 11:23 — page 202 — #216i

i

i

i

i

i


The problem appeared in print long before Loyd published it. The first known ap-

pearance was in A Companion to the Gentleman’s Diary of 1798, and a year later it was

published in The Gentleman’s Mathematics Companion. It also appeared much later in

William R. Ransom’s One Hundred Mathematical Curiosities in 1955 [3]. Loyd describes

the puzzle as an ancient problem and gives the answers to both parts. However, he does not

indicate how to tackle the problems.

In his book More Mathematical Puzzles of Sam Loyd [1, p. 103], Martin Gardner dis-

cusses both problems and their solutions. The answer to the first question is that the courier

travels 1 Cp

2 times the length of the army, or about 120.71 miles for a fifty-mile-long

army. Here is Gardner’s solution: For convenience, assume that the length of the army is

1 unit, and also assume that the time it takes the army to march its length is 1. Thus its

speed is also 1. Let x denote the total distance traveled by the courier, so that is also his

speed. On his forward trip, the courier’s speed relative to the army is thus x � 1, and on the

return trip it is x C 1. Since each trip is a distance of 1 relative to the moving army and the

courier’s total journey is completed in unit time, it follows that

1

x � 1C 1

x C 1D 1;

or equivalently, x2 � 2x � 1 D 0. Consequently, x D 1 Cp

2. This means that in Loyd’s

problem the courier travels 50�

1 Cp

2�

miles, or, as asserted earlier, about 120.71 miles.

Gardner’s solution to the second question is similar. As before, let the length of the

army and the time it takes to travel its length be 1 unit, so its speed is also 1. Additionally,

we let x be the total distance traveled by the courier and also his speed. On the forward

trip, the courier’s speed relative to the army is x � 1, on the return trip, x C 1, and on each

of the diagonal portions,p

x2 � 1. Since each trip has length 1 relative to the army and the

total journey is completed in unit time, it follows that

1

x � 1C 1

x C 1C 2p

x2 � 1D 1:

i

i

“master” — 2012/11/15 — 11:23 — page 203 — #217i

i

i

i

i

i

27. Sam Loyd’s Courier Problem with Diophantus, Pythagoras, and Martin Gardner 203

This time a fourth-degree equation results: x4 �4x3 �2x2C4x C5 D 0. The only solution

that fits the conditions of the problem is approximately 4.18. Thus, in traversing the entire

army, the courier now travels approximately 4.18 times as far as the army, or about 209

miles.

In the spirit of classroom exercises, it would be nice to have versions of the courier

problems with integer solutions; that is, Diophantine problems. In exploring this possi-

bility, we discovered that certain Pythagorean triples, such as .3; 4; 5/, .5; 12; 13/, and

.7; 24; 25/, that is, those in which the two largest numbers differ by 1, can be used to

generate such problems.

Army in single file

Assume that an army 60 miles long is marching at a constant rate. A courier, also going

at a constant rate, rides from the rear of the army up to the front, delivers a message, and

returns, arriving at the rear just as the army has gone 45 miles. How far did the courier go?

Following Gardner, we again let the speed of the army (but not its length) be 1, and we

let x be the speed of the courier. As before, the speed of the courier relative to the army

is x � 1 on the outward journey and x C 1 on the return journey, so the total time that

he travels is 60x�1

C 60xC1

. This must equal 45, the time it takes the army to go the given

distance. Gardner’s equation is therefore

60

x � 1C 60

x C 1D 45:

This reduces to 3x2 � 8x � 3 D 0, for which the only realistic solution is x D 3. Thus, the

courier travels three times as fast as the army, and therefore rides 135 miles. We thus have

a Diophantine version of the courier problem.

Consider now a general single-file problem, in which the army is ` miles long and

travels d miles during the time that the courier makes the trip. Gardner’s equation becomes

`

x � 1C `

x C 1D d:

Setting ˛ D `d

, we find that this reduces to

x2 � 2˛x � 1 D 0;

which has x D ˛ Cp

˛2 C 1 as its only positive solution.

This is where Pythagorean triples come to the fore. If .a; b; b C 1/ is such a triple and

˛ D ba

, then x D a is a solution to Gardner’s equation. Our example used the triple .3; 4; 5/

with distances multiplied by 15. Since in these triples b D 12.a2 � 1/ and c D 1

2.a2 C 1/,

we can pose a single-file courier problem with an integer solution for any odd positive

integer a and any constant k:

Problem 1 (single file army) An army 12.a2 � 1/k miles long travels a distance of ak

miles during the time that the courier makes his journey. How far does the courier travel?

i

i

“master” — 2012/11/15 — 11:23 — page 204 — #218i

i

i

i

i

i


Army in square formation

We can do something similar for the second of Loyd’s problems. Here is an example (also

based on the .3; 4; 5/ triple):

An army in the formation of a square 36 miles on a side is advancing at a constant

rate. A courier, also moving at a steady rate, travels all the way around the army just as it

advances 256 miles. How far does the courier go?

Again we take the speed of the army to be 1 and that of the courier x, so Gardner’s

equation becomes36

x � 1C 36

x C 1C 2 � 36p

x2 � 1D 256:

In polynomial form, this is

81x4 � 576x3 � 162x3 C 576x C 1105 D 0;

but all we care about is that x D 54

is a root (this is easy to check in the original equation,

not so easy to discover).

The same Pythagorean triples .a; b; c/ as before (with a an odd integer, b D 12.a2 � 1/,

and c D 12.a2 C 1/) all generate square-army courier problems with Diophantine solu-

tions:

Problem 2 (square army) An army b2 miles on a side advances 2a .b C c/ miles during

the time that a courier completes his circuit. How far does the courier go?

Gardner’s equation for this army can be written as

2b2

�x

x2 � 1C 1p

x2 � 1

�

D 2a .b C c/ ;

which has x D ca

as a solution. That is, the courier travels ca

times as fast as the army and

hence goes 2c.b C c/ miles.

Note that the roles of a and b can be interchanged here; that is, the army could be b2

on a side and travel 2b.a C c/ miles while the courier completes his circuit.

Army in rectangular formation

Of course, armies do not always march in either single-file or square formation; sometimes

they are proper rectangles. Suppose that an army is ` miles long and w miles wide and that

it goes a distance d in the time that the courier transverses it, going at a speed x times that

of the army. Gardner’s equation for this situation is

`

x � 1C `

x C 1C 2wp

x2 � 1D d:

For example, assume that the army is 40 miles long and 30 miles wide, and that it

travels 120 miles while the courier completes his circuit. Then the courier rides 5=3 as fast

as the army travels, and hence goes 200 miles.

Thus, Pythagorean triples of the given type can be used in this equation to generate Dio-

phantine solutions for a rectangular army in two ways, with .a; b; c/ again a Pythagorean

triple with c � b D 1.

i

i

“master” — 2012/11/15 — 11:23 — page 205 — #219i

i

i

i

i

i

27. Sam Loyd’s Courier Problem with Diophantus, Pythagoras, and Martin Gardner 205

Problem 3 (rectangular army) An army’s length is a multiple of 12b2 and its width a

multiple of 12b. It advances 2a .b C c/ miles during the time that courier completes his

circuit. How far does the courier travel?

Acknowledgment The author thanks the referees and the editors for helpful suggestions.

Bibliography

[1] Martin Gardner, More Mathematical Puzzles of Sam Loyd, Dover, 2007.

[2] Sam Loyd, Cyclopedia of Puzzles, Lamb Publishing, 1914. www.mathpuzzle.com/loyd/

[3] William R. Ransom, One Hundred Mathematical Curiosities, J. Weston Walch, 1955.

www.mathpuzzle.com/loyd/

http://www.mathpuzzle.com/loyd/

i

i

“master” — 2012/11/15 — 11:23 — page 206 — #220i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page 207 — #221i

i

i

i

i

i

28Retrolife and The Pawns Neighbors

Yossi Elran

Martin Gardner created no less than a revolution in the popularization of mathematics

through his many books, and, in particular, his Scientific American columns. Some of his

most important columns, which spurred not only popular interest but also a wealth of in-

novative research and even practical applications, were his columns describing John Con-

way’s “Game of Life” [5, 6]. Gardner himself said [7],

Probably my most famous column was the one in which I introduced Con-

way’s game of Life. Conway had no idea, when he showed it to me, that it

was going to take off the way it did. He came out on a visit, and he asked

me if I had a Go board. I did have one, and we played Life on the Go board.

He had about 50 other things to talk about besides that. I thought that Life

was wonderful—a fascinating computer game. When I did the first column on

Life, it really took off. There was even an article in Time magazine about it.

“It really took off” is an understatement.

The Game of Life

The “Game of Life” was invented in 1970 by British mathematician John Conway. It is

best described as the archetype of cellular automata. The player (there is only one) places

an initial distribution of checkers on an infinite checkerboard; one checker per square. The

squares are called cells and the initial distribution is called a population. This terminology

reflects one of the original objects of the game—to simulate biological evolution. Cells are

live if they contain a checker and dead if empty. The population evolves in time in discrete

time steps known as generations, according to a simple set of rules, applied simultaneously.

The rules determine the fate of each cell according to the number of live cells in its eight

neighboring cells. If a cell is empty and exactly three of its eight neighboring cells are alive,

the cell itself will be born—a checker is placed there. If a cell is alive and two or three of its

eight neighboring cells are also alive, nothing changes—the cell survives. A live cell dies

(i.e., the checker is removed from the board) in two situations. If a cell is alive and four or

more of its eight neighboring cells are also alive, the cell dies (from overcrowding). A live

cell also dies from isolation, if it has less than two live neighbors. Figure 28.1 shows three

consecutive generations of Life.


207

i

i

“master” — 2012/11/15 — 11:23 — page 208 — #222i

i

i

i

i

i


Figure 28.1. Three generations of the Game of Life.

Although these rules are simple and straightforward, iterating Life for many genera-

tions can result in a surprising variety of intriguing outcomes. A population can die out.

Another possibility is that the population evolves into a stable, constant form which doesn’t

change anymore. A 2-by-2 block of live cells is an example of such a form. Populations

may also cycle, regenerating themselves after a certain time. Such populations come in

forms known as oscillators and gliders. Three isolated live cells in a row are an example

of a simple oscillator.

Populations can also grow indefinitely. In 1970, a team from the Artificial Intelligence

Project at M.I.T. won a 50 dollar prize from Conway for proving that an initial finite popu-

lation can grow without limit. Such populations are generated by shapes variously known

as glider guns, agars and puffer-trains. Martin Gardner wrote: “You will find the popula-

tion constantly undergoing unusual, sometimes beautiful and always unexpected change.”

Stephen Silver’s lexicon [10] lists over 700 terms applied to various forms of Life.

Much work on Life was done as a result of Martin Gardner’s original publication in

1970, in particular by Bill Gosper from the original M.I.T. group, Robert Wainwright

(who managed a newsletter called “Lifeline”) and other “Life-enthusiasts” (as Gardner

calls them). The three articles that Martin Gardner wrote in Scientific American and there-

after reprinted in Wheels, Life and Other Mathematical Amusements [6] have spurred so

much research that it is impossible to summarize it all without doing injustice to some peo-

ple. Life is considered the prototypical computer simulation. Among other things, it is a

universal Turing machine [3]. Paul Rendell proved the universality of Life by showing a di-

rect simulation of counter machines [9], and Dean Hickerson created an initial population

that, when evolved, generates the Prime Numbers [8].

Variations on Life [2] have also been developed. Some interesting games, where only

the rules are changed, are Highlife, which has the same rules as Life and in addition six

live neighbors also cause a birth, and Seeds where a birth is caused when a cell has exactly

2 live neighbors and in all other cases the cell dies. The two-dimensional Moore neigh-

borhood grid can also be changed, resulting in games played on triangular and hexagonal

boards. Andrew Adamatzky recently commemorated Life’s 40th birthday by publishing a

unique collection of works that provide a comprehensive and up-to-date study of Life, with

contributions from many renowned mathematicians and computer scientists [1].

Retrolife

An important aspect of Life is that applying the rules to any given generation results in

a unique outcome. The reverse, however, is not true. In fact, there are an infinite number

of possible population distributions (hereafter referred to as states) in the preceding gen-

i

i

“master” — 2012/11/15 — 11:23 — page 209 — #223i

i

i

i

i

i

28. Retrolife and The Pawns Neighbors 209

eration. Finding a particular state from the variety of possible states is the essence of the

Retrolife puzzles [4].

In its original form, the task in Retrolife is to find predecessors of a given initial state

that comply with some extra constraints. Many constraints can be chosen, each generating

a slightly different puzzle. One might, for instance, demand that all the cells in the initial

state survive, that is, cells live in the puzzle (the initial state in Life) are also live in the

solution (the previous generation). Another constraint is to demand the opposite, that all

the live cells in the initial state are born. Figure 28.2 shows three Retrolife puzzles and

their solution using this last constraint.

Figure 28.2. Three Retrolife puzzles. Top row: puzzles. Bottom row: possible solutions.

To explain the rules of Retrolife, one first needs to explain Life and then suggest the

reverse problem. Needless to say, this is rather cumbersome. Instead, we therefore suggest

the following rephrasing of Retrolife.

The Pawns Neighbors

A player is given an infinite chessboard, any number of pawns, and some tokens. Each

square on the board has eight neighboring squares (in the directions: north, south, west, east

and on the diagonals). The pawns are placed in some pattern on the board. The challenge:

surround each pawn (and no other square on the board) with exactly three tokens so that

no token itself has either two or three tokens as neighbors. A token may have less than

two tokens as neighbors, or more than three. Note also that no empty square in the board

is allowed to be surrounded with three tokens; however it may be surrounded by more or

less. Figure 28.3 shows one setup and a possible solution.

The Pawns Neighbors is really Retrolife in disguise, however it can be described in a

straightforward manner and solved without any prior knowledge of Life or Retrolife.

i

i

“master” — 2012/11/15 — 11:23 — page 210 — #224i

i

i

i

i

i


Figure 28.3. A Pawns Neighbors puzzle and a solution.

Variations on The Pawns Neighbors

The Pawns Neighbors is isomorphic to Retrolife with the constraint that all the cells in the

initial state are born. We can change the rules to create an isomorphic Retrolife problem

with the constraint that all the cells in the initial state survive. In this case, we have to

surround each pawn with tokens so that the pawns have either two or three neighbors,

either pawns or tokens, but no token has two or three neighbors (again, neighbors can be

either pawns or tokens). No empty square on the board is allowed to have three neighbors

(pawns or tokens). Many other variations are possible.

The computational complexity of Pawn Neighbor games is an open question. We leave

the reader the following Pawns Neighbors problem to solve. Surround each pawn (and

no other square on the board) with exactly three tokens. No token can have either 2 or 3

neighbors. The solution is on p. 211.

Figure 28.4. A Pawns Neighbors puzzle.

Martin Gardner—in memoriam

Martin Gardner was particularly fond of Life and related problems. Shortly before he died,

he wrote to me enthusiastically about Retrolife and offered to add a piece about them in

one of his upcoming books. It’s a pity I can’t share with him this article. I’m sure he would

have enjoyed it. I miss Martin a lot, and for all the many books he wrote, I still yearn for

more.

i

i

“master” — 2012/11/15 — 11:23 — page 211 — #225i

i

i

i

i

i

28. Retrolife and The Pawns Neighbors 211

Bibliography

[1] A. Adamatzky (ed.), Game of Life Cellular Automata, Springer-Verlag, London, 2002.

[2] B. Bays, The Game of Life in Non-square Environments, in Game of Life Cellular Automata,

A. Adamatzky (ed.), Springer-Verlag, London, 2002, 319–330.

[3] E. R. Berlekamp, J. H. Conway and R. K. Guy III, Winning Ways (for your Mathematical

Plays), Volume 2, A. K. Peters, Wellesley, MA, 2003.

[4] Y. Elran, Retrolife, in Homage to a Pied Puzzler, E. Demaine, M. Demaine and T. Rodgers

(ed.), A. K. Peters, Wellesley, MA, 2008, 129–136.

[5] M. Gardner, Scientific American 223 (1970) 120–123.

[6] ——, Wheels, Life and Other Mathematical Amusements, W. H. Freeman, New York, 1983.

[7] ——, “Mathematical Games” and Beyond: Part II of an Interview of Martin Gardner, College

Math J. 36 (2005) 301–314.

[8] C. Hickerson, Dean Hickerson’s Game of Life Page; available at radicaleye.com/DRH/

life.html

[9] P. Rendell, Turing Universality of the Game of Life, in Collision Based Computing, A.

Adamatzky (ed.), Springer-Verlag, London, 2002, 513–540.

[10] S. Silver, The Life Lexicon; available at www.argentum.freeserve.co.uk/lex.htm

Solution to the problem on p. 210.

radicaleye.com/DRH/

http://radicaleye.com/DRH/life.html

life.html

http://radicaleye.com/DRH/life.html

www.argentum.freeserve.co.uk/lex.htm

http://www.argentum.freeserve.co.uk/lex.htm

i

i

“master” — 2012/11/15 — 11:23 — page 212 — #226i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page 213 — #227i

i

i

i

i

i

29RATWYT

Aviezri S. Fraenkel

In memory of Martin Gardner, who was and remains enchantingly influential and inspiring.

WYTHOFF

In 1907, the Dutch mathematician, Willem Abraham Wythoff [13] invented this game, later

vividly explained by Martin Gardner in [7].

WYTHOFF is played on a pair of nonnegative integers, .M; N /. A move consists of

either (i) subtracting any positive integer from precisely one of M or N such that the result

remains nonnegative, or (ii) subtracting the same positive integer from both M and N such

that the results remain nonnegative. The first player unable to move loses.

Given the position .3; 3/, say, the next player wins in a single move: .3; 3/ ! .0; 0/.

The position .3; 3/ is called an N -position, because the Next player wins. If M D N D 0,

the next player loses, and the previous player, the one who moved to .0; 0/, wins. Thus

.0; 0/ is a P -position, because the Previous player wins.

If M > 0, it is easy to see that .0; M/ and .M; M/ are N -positions, since the next

player can win in one move. On the other hand, .1; 2/ is a P -position because all its fol-

lowers—positions reached in one move—are N -positions. The first few P -positions are

listed in Table 29.1. Note that every N -position has at least one P -follower, but all fol-

lowers of a P -position are N -positions. From an N -position, in order to win, a player

must move to a P -position. Further, the P - and N -positions partition the set of all game

positions: every game position is either a P -position or an N -position but never both.

Table 29.1. The first few P -positions .An; Bn/ for Wythoff’s game.

n 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27

An 0 1 3 4 6 8 9 11 12 14 16 17 19 21 22 24 25 27 29 30 32 33 35 37 38 40 42 43

Bn 0 2 5 7 10 13 15 18 20 23 26 28 31 34 36 39 41 44 47 49 52 54 57 60 62 65 68 70

The sequences, An and Bn in Table 29.1, each strictly increasing, have remarkable

properties. Note that Bn D An C n for all n. But how is An computed? A study of the

table reveals that for n � 1, An is the smallest positive integer not yet appearing in the

sequences. Thus, the next entries in Table 29.1 are A28 D 45, B28 D 73. It follows that the


213

i

i

“master” — 2012/11/15 — 11:23 — page 214 — #228i

i

i

i

i

i


sequences are complementary: every positive integer appears precisely once in these two,

recursively defined sequences.

Can a take-away game, such as WYTHOFF, be played on the rational numbers, rather

than the integers? We present such a game here.

A rational take-away game

Given a rational number p=q in lowest terms, a step is defined by

p

q!

p � q

q;

if p=q � 1, otherwisep

q! p

q � p:

The game RATWYT is played on a pair of reduced rational numbers .p1=q1, p2=q2/.

A move consists of either (i) doing any positive number of steps to precisely one of the

rationals, or (ii) doing the same number of steps to both. The first player unable to play

(because both numerators are 0) loses.

For example, from .3=5; 2=3/, suppose that Alice moves to .3=2; 2=3/. Then Bob can

do three steps to both rationals: .3=2; 2=3/ ! .1=2; 2=1/ ! .1=1; 1=1/ ! .0=1; 0=1/, and

thereby win. Could Alice have made a better initial move? Can Alice win? Suppose she

does two steps to each of the initial rationals: .3=5; 2=3/ ! .3=2; 2=1/ ! .1=2; 1=1/. If

Bob then moves to .1=1; 1=1/, Alice can move to .0=1; 0=1/ D .0; 0/, winning. We leave

it to the reader to verify that in the three remaining possible moves for Bob, Alice can also

win in one move. Thus .3=5; 2=3/ is an N -position in RATWYT.

Is there a nice winning strategy for RATWYT? If so, what is it?

Continuing with our example, we expand 3=5 into a continued fraction:

3=5 D 0 C 1

1C 1

1C 12

;

or 3=5 D Œ0; 1; 1; 2�, to use the short notation for continued fractions. Similarly, 2=3 DŒ0; 1; 2�.

The sum of the partial quotients: 0 C 1 C 1 C 2 D 4 is called the integer induced by

the corresponding rational. We claim that playing RATWYT on .3=5; 2=3/ is equivalent to

playing WYTHOFF on their induced integers, .3; 4/!

Supposing this established, then, because .3; 4/ is not in Table 1, it is not a

P -position, rather an N -position, in WYTHOFF. Hence .3=5; 2=3/ is an N -position in

RATWYT. So Alice can, in fact, win. Let us re-examine her initial move: .3=5; 2=3/ !.3=2; 2=3/. Now 3=2 D Œ1; 2� � 3 (where � denotes the inducing process),

so .3=2; 2=3/ � .3; 3/. Alas, .3; 3/ is another N -position in WYTHOFF, so Alice missed

her opportunity to move to a P -position! Had she made the two-step move .3=5; 2=3/ !.1=2; 1=1/ D .Œ0; 2�; Œ1�/ � .1; 2/, she could have won, since .1; 2/ is a P -position in

WYTHOFF.

i

i

“master” — 2012/11/15 — 11:23 — page 215 — #229i

i

i

i

i

i

29. RATWYT 215

The CW-tree

To substantiate our claim, that playing RATWYT on .p1=q1; p2=q2/ is equivalent to play-

ing WYTHOFF on their induced integers, we resort to a method of counting the rationals,

described in the short, elegant, and influential paper of Calkin and Wilf [4].

The Calkin Wilf tree, CW-tree for short, is a binary tree whose nodes are all the non-

negative rational numbers without repetition! The root is the fraction 0=1. The rest of the

tree is described inductively by the rule that each vertex i=j has at most two children: a

right child .i C j /=j ; and, if i > 0, a left child i=.i C j /. The first five levels (levels 0 � 4)

are drawn in Figure 29.1. Note that at each step in RATWYT we move towards the root of

the tree by one level, so we are guaranteed to eventually reach 1=1 on level 1, and then the

root on level 0.

0/1

1/1

1/2

1/3

1/4 4/3

3/2

3/5 5/2

2/1

2/3

2/5 5/3

3/1

3/4 4/1

Figure 29.1. The first five levels of the CW-tree of reduced fractions.

Here are some key properties of the CW-tree.

1. The numerator and denominator at each vertex are relatively prime: proved by in-

duction on the level of the tree.

2. The induced integer of every rational on level k of the tree is k. Indeed, an element

r has the right child r C 1. Incrementing by 1 means incrementing the first term

of the continued fraction, hence incrementing the induced integer. The left child is

1=.1 C 1=r/. Since taking the reciprocal of a continued fraction amounts to either

prefixing or removing a leading 0, neither of which changes the induced integer [8],

this also increments the induced integer by one. (See Figure 29.2.)

3. Every rational number p=q in reduced form occurs precisely once in the CW-tree,

namely on level k, where k is the integer induced by p=q. There is a unique path

from p=q to the root whose length is precisely k—the same as for every element

on level k of the tree, in particular of the rational k=1 D k. Hence playing on the

rational p=q is equivalent to playing on its induced integer k, as we set out to show.

More games

In [13], Wythoff also discovered that the sequences An, Bn can be described explicitly:

An D bn�c, Bn D bn�2c, where bxc is the floor function; and � D .1 Cp

5/=2 is the

i

i

“master” — 2012/11/15 — 11:23 — page 216 — #230i

i

i

i

i

i


[0]

[1]

[0,2]

[0,3]

[0,4] [1,3]

[1,2]

[0,1,1,2] [2,2]

[2]

[0,1,2]

[0,2,2] [1,1,2]

[3]

[0,1,3] [4]

Figure 29.2. The first five levels of the CW-tree of continued fractions.

golden ratio. This observation leads to a winning strategy for Wythoff that is polynomial

time in the succinct input size log.xy/ of any given game position .x; y/.

The preceding section explained how playing RATWYT on the rationals is equivalent

to playing on their induced integers. The same holds for any take-away game. Playing on

the integers means restricting travel along the right edge of the CW-tree. Playing on the

integer k is equivalent to playing on any of its 2k�1 rational siblings on level k of the tree

.k � 1/. Here are a few sample take-away games, played on reduced rationals. Keep in

mind that the number of steps performed must always preserve non-negativity, and that the

first player unable to move loses.

Game I. Let t be a fixed positive integer. A position is a pair .p1=q1, p2=q2/ of rationals.

There are two types of moves: (i) do any positive number of steps on precisely one of the

rationals, or (ii) do k > 0 steps on one of the rationals and ` > 0 on the other, such that

jk � `j < t .

This game, if played on the integers induced by p1=q1, p2=q2, is a generalization of

WYTHOFF (the case t D 1 is WYTHOFF). The winning strategies given in [5] apply di-

rectly to every rational number with the same induced integers. P -positions of generalized

WYTHOFF partition the integers, so Game I partitions the rationals. Are there any mean-

ingful partitions of the rationals that transcend tree level?

Game II. Let t be fixed positive integer. A position is a single rational, p=q. A move

consists of performing up to t steps. The P -positions are all the rationals on nonnegative

integer multiples of level .t C 1/.

Game III. A position consists of m rationals .p1=q1; : : : ; pm=qm/. A move consists of

selecting any one of the rationals and performing a positive number of steps. This game

amounts to playing Nim on the integers induced by those rationals.

Though playing on the integer k is equivalent to playing on any of its 2k�1 rational

siblings on level k of the tree .k � 1/, there are games on the rationals with no obvious

integer parallels. For example, one may restrict every move to a single tree direction: either

move up right, or move up left, both for impartial (Wythoff-like) and partizan (chess-like)

games.

i

i

“master” — 2012/11/15 — 11:23 — page 217 — #231i

i

i

i

i

i

29. RATWYT 217

Our game steps amount to single steps in the “slow Euclidean Algorithm.” The game

EUCLID corresponds to playing along the Euclidean Algorithm at a pace chosen by the

player. There is a large bibliography on EUCLID including [3], [11], [6]. For related games,

see [2] and [10].

Like the CW-tree, the older Stern-Brocot tree [12], [1] also contains all the reduced

rationals. The former has the advantage that the transition from any level to its neighbor

is simple; the latter that it is a search tree. Both trees are intimately related to the famous

Stern diatomic sequence available at http://oeis.org/A002487. A construction of

the rationals using recurrence is given in [9].

Final remark In a different context, an anonymous referee recently asked whether there

are any “bridges” between combinatorial game theory (CGT) and classical game theory. In

the latter, of course, the notion of “rationality” plays a prominent role (in the form of the

rational players). Now we see that rationality also plays a role in CGT.

Bibliography

[1] A. Brocot, Calcul des rouages par approximation, nouvelle methode, Revue Chronometrique 6

(1860) 186–194.

[2] G. Cairns and N. B. Ho, Min, a combinatorial game having a connection with prime

numbers, Integers 10 G03 (2010) 765–770; available at dx.doi.org/10.1515/

INTEG.2010.103.

[3] G. Cairns, N. B. Ho, and T. Lengyel, The Sprague-Grundy function of the real game

Euclid, Discrete Math. 311 (2011) 457–462; available at dx.doi.org/10.1016/

j.disc.2010.12.011.

[4] N. Calkin and H. S. Wilf, Recounting the rationals, Amer. Math. Monthly 107 (2000) 360–363;


[5] A. S. Fraenkel, How to beat your Wythoff games’ opponent on three fronts, Amer. Math.

Monthly 89 (1982) 353–361; available at dx.doi.org/10.2307/2321643.

[6] A. S. Fraenkel, Euclid and Wythoff games, Discrete Math. 304 (2005) 65–68; available at

dx.doi.org/10.1016/j.disc.2005.06.022.

[7] M. Gardner, Wythoff’s Nim, Ch. 8, pp. 103–118 in: Penrose Tiles to Trapdoor Ciphers, Revised

Edition, Mathematical Association of America, 1997.

[8] J. Gibbons, D. Lester and R. Bird, Functional pearl: Enumerating the rationals, J.

Funct. Programming 16 (3) (2006) 281–291; available at dx.doi.org/10.1017/

S0956796806005880.

[9] S. P. Glasby, Enumerating the rationals from left to right, Amer. Math. Monthly 118 (2011)

830–835; available at dx.doi.org/10.4169/amer.math.monthly.118.09.830.

[10] S. Hofmann, G. Schuster, and J. Stending, Euclid, Calkin & Wilf—playing with rationals, Elem.

Math V. 63 (2008) 109–117; available at dx.doi.org/10.4171/EM/95.

[11] G. Nivasch, The Sprague-Grundy function of the game Euclid, Discrete Math. 306 (2006)

2798–2800; available at dx.doi.org/10.1016/j.disc.2006.04.020.

[12] M. A. Stern, Uber eine zahlentheoretische Funktion, J. Reine Angew. Math. 55 (1858) 193–220;

available at dx.doi.org/10.1515/ crll.1858.55.193.

[13] W. A. Wythoff, A modification of the game of Nim, Nieuw Arch. Wisk. 7 (1907) 199–202.

http://oeis.org/A002487

dx.doi.org/10.1515/

http://dx.doi.org/10.1515/INTEG.2010.103

INTEG.2010.103

http://dx.doi.org/10.1515/INTEG.2010.103

dx.doi.org/10.1016/


j.disc.2010.12.011


dx.doi.org/10.2307/2589182

http://dx.doi.org/10.2307/2589182

dx.doi.org/10.2307/2321643

http://dx.doi.org/10.2307/2321643

dx.doi.org/10.1016/j.disc.2005.06.022


dx.doi.org/10.1017/

http://dx.doi.org/10.1017/S0956796806005880

S0956796806005880

http://dx.doi.org/10.1017/S0956796806005880

dx.doi.org/10.4169/amer.math.monthly.118.09.830

http://dx.doi.org/10.4169/amer.math.monthly.118.09.830

dx.doi.org/10.4171/EM/95

http://dx.doi.org/10.4171/EM/95

dx.doi.org/10.1016/j.disc.2006.04.020


dx.doi.org/10.1515/

http://dx.doi.org/10.1515/crll.1858.55.193

crll.1858.55.193

http://dx.doi.org/10.1515/crll.1858.55.193

i

i

“master” — 2012/11/15 — 11:23 — page 218 — #232i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page 219 — #233i

i

i

i

i

i

VICards and Probability

i

i

“master” — 2012/11/15 — 11:23 — page 220 — #234i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page 221 — #235i

i

i

i

i

i

30Modeling Mathematics

with Playing Cards

Martin Gardner

Because playing cards have values 1 through 13 (jacks 11, queens 12, kings 13), come

in two colors, four suits, and have fronts and backs, they provide wonderfully convenient

models for hundreds of unusual mathematical problems involving number theory and com-

binatorics. What follows is a choice selection of little known examples.

One of the most surprising of card theorems is known as the Gilbreath principle after

magician Norman Gilbreath who first discovered it. Arrange a deck so the colors alternate.

Cut it so the bottom cards of each half are different colors, and then riffle shuffle the halves

together. Take cards from the top in pairs. Amazingly, every pair will consist of a red and

black card!

Here is a simple proof by induction that this must happen. Assume that the first card

to fail on the table during the shuffle is black. If the next card to fail is the card directly

above it in the same half, that card will be red. This places on the table a red-black pair. If

the next card after the first one comes from the other half, it too will be red and will put a

red-black pair on the table. In either case, after two cards have dropped, the bottom cards

of each half will be of different colors, so the situation is exactly the same as before, and

the same argument applied for the rest of the cards. No matter how careful or careless the

shuffle, it will pile red-black pairs on the table.

Gilbreath’s principle generalizes. Arrange the deck so the suits are in an order, say

spades, hearts, clubs, diamonds, that repeats throughout the pack. Deal as many cards as

you like to form a pile. This of course reverses the order of the suits. When the pile is about

the same size as the remaining portion of the deck, riffle shuffle the two portions together.

If you now take cards in quadruplets from the top of the shuffled pack, you will find that

each set of four contains all four suits. The ultimate generalization is to shuffle together

two decks, one with its cards in the reverse order of the other deck. After the shuffle, divide

the 104-card pack exactly in half. Each half will be a complete deck of 52 different cards!

What mathematician David Gale has called the “non-messing-up theorem” is another

whimsical result. From a shuffled deck, deal the cards face up to form a rectangle of any

proportion. In each row, rearrange the cards so their values do not decrease from left to

right. In other words, each card has a value higher than the one on its left, or two cards


221

i

i

“master” — 2012/11/15 — 11:23 — page 222 — #236i

i

i

i

i

i

222 Part VI. Cards and Probability

of the same value are side by side. After ordering the rows, do the same thing with the

columns. This of course drastically alters the order of cards in the rows. After rearranging

the columns, you may be amazed to find that the rows are still ordered!

The theorem is at least a hundred years old. You will find it proved as the answer to a

problem in the American Mathematical Monthly (70:2, 1963, 212-13), and in a monograph

by Gale and Richard Karp, published in 1971 by the operations research center of the

engineering school of the University of California, Berkeley. Donald Knuth discusses the

theorem in the third volume of The Art of Computer Programming in connection with a

method of sorting called “shellsort.” In my The Last Recreations, Chapter 11, I describe a

clever card trick based on the theorem.

Is it possible to arrange a deck so that if you spell the name of each card by moving a

card from top to bottom for each letter, then turning over the card at the end of the spell

and discarding it, it will always be the card you spelled? For example, can you so arrange

the cards that you can first spell all the spades, taking them in order from ace through

king, then do the same thing with the hearts, clubs, and diamonds? You might imagine it

would take a long time to find out how to arrange the deck, assuming it is possible to do

so, in a way that permits the spelling of all 52 cards. Actually, finding the order is absurdly

easy. First arrange the deck from top down in the order that is the reverse of your spelling

sequence. Take the King of Diamonds from the top of the deck, then take the queen, place

it on top of the king, and spell “Queen of Diamonds” by moving a card at each letter from

bottom to top. In brief, you are reversing the spelling procedure. Continue in this way until

the new deck is formed. You are now all set to spell every card in the predetermined order.

Of course you can do the same thing with smaller packets, such as the thirteen spades, or

with cards bearing pictures, say of animals whose names you spell.

Remember the old brainteaser about two glasses, one filled with water, the other with

wine? You take a drop of water, put it into the wine, stir, then take a drop of the mixture,

move it back to the water, and stir. Is there now more or less wine in the water than water

in the wine? The answer is that the two quantities are exactly equal. The simplest proof is

to realize that, after the transfers, the amounts of liquid in each glass remain the same. So

the quantity missing from the water is replaced by wine, and amount of wine missing from

the other glass is replaced by the same amount of water.

This is easily modeled with cards. Divide the deck into two halves, one of all red cards,

the other of all black. Randomly remover n red cards, insert them anywhere in the black

half, and shuffle. Now randomly remove n cards from the half you just shuffled, put them

back among the reds, and shuffle. Inspection will show that the number of black cards in

the red half exactly equals the number of red cards in the black half. It doesn’t matter in

the least if the red and black portions are not equal at the start.

Closely related to this demonstration is the following trick. Cut a deck exactly in half,

turn over either half and shuffle the two parts together. Cut the mixed-up deck in half

again, and turn over either half. You’ll find that the number of face-down cards in either

half exactly equals the number of face-down cards in the other half. The same is true, of

course, for the face-up cards. Do you see why this is the case? The trick is baffling to

spectators if they don’t know that the deck is initially divided exactly in half, and if you

secretly turn over one half as you spread its cards on the table.

i

i

“master” — 2012/11/15 — 11:23 — page 223 — #237i

i

i

i

i

i

30. Modeling Mathematics with Playing Cards 223

Playing cards provide a wealth of counterintuitive probability questions. The notorious

Monty Hall problem can be modeled with cards. There are three cards face down on the

table and you are told that one card only is an ace. Put a finger on a card. Clearly the

chance you have selected the ace is 1=3. A friend now secretly peeks at all three cards and

turns face up a card that is not the ace. Two cards remain face down, one of which you

know is the ace. What now is the probability your finger is on the ace? Many persons think

the probability has risen from 1=3 to 1=2. A little reflection should convince you that it

remains 1=3 because your friend can always turn a non-ace. Now switch your finger from

the card it is on to the other card. The probability you have now chosen the ace jumps form

1=3 to 2=3. This is obvious from the fact that the card you first selected has the probability

of 1=3 being the ace. Because the ace must be one of the two face-down cards, the two

probabilities must add to 1 or certainty.

A similar seeming paradox also involves three face-down cards dealt from a shuffled

deck. A friend looks at their faces and turns over two that are the same color. What’s the

probability that the remaining face-down card is the same color as the two face-up cards?

You might think it is 1=2. Actually it is 1=4. Here’s the proof. The probability that three

randomly selected cards are the same color is two out of eight equal possibilities, or 1=4.

Subtract 1=4 from 1 (the card must be red or black) and you get 3=4 for the probability

that the face-down card differs in color from the two face-up cards. This is the basis for

an ancient sucker bet. If you are the operator, you can offer even odds that the card is of

opposite color from the two face-up cards, and win the bet three out of four times.

Here’s a neat problem involving a parity check. Take three red cards from the deck.

Push one of them back into the pack and take out three black cards. Push one of them back

into the deck and remove three reds. Continue in this manner. At each step you randomly

select a card of either color, return it to the deck and remove three cards of opposite color.

Continue as long as you like. When you decide to stop you will be holding a mixture of reds

and blacks. Is it possible that the number of black cards you hold will equal the number of

reds? Unless you think of a parity check it might take a while to prove that the answer is

no. After each step you will always have in your hand an odd number of cards, therefore

the two colors can never be equal.

Magicians have discovered the following curiosity. Place cards with values ace through

nine face down in a row in counting order, ace at the left. Remove a card from either end of

the row. Take another card from either end. Finally, take a third card from either end. Add

the values of the three cards, then divide by six to obtain a random number n. Count the

cards in the row from left to right, and turn over the nth card. It will always be the four!

I leave it to readers to figure out why this works and perhaps to generalize it to longer

rows of numbers. For example, use twelve cards with values 2; 3; 4; 5; 6; 7; 8; 9; 10; J; Q; K

to make the row. Take a card three times from either end, divide their sum by 9, and call

the result n. The nth card from the left will always be the five.

A classic card task, going back more than two centuries, is to arrange all the aces,

kings, queens, and jacks—sixteen cards in all—in a square array so that no two cards of

the same value, as well as no two cards of the same suit, are in the same row, column,

or diagonal. Counting the number of different solutions is not trivial. W. W. Rouse Ball,

in his classic Mathematical Recreations and Essays, said there are 72 fundamental solu-

tions, not counting rotations and reflections. This is a mistake that persisted through the

i

i

“master” — 2012/11/15 — 11:23 — page 224 — #238i

i

i

i

i

i


book’s eleventh edition, but was dropped from later editions revised by H. S. M. Coxeter.

Dame Kathleen Ollerenshaw, a noted British mathematician who was once Lord Mayor of

Manchester, found there are twice as many fundamental solutions, 144, making the number

of solutions including rotations and reflections 8 � 144 D 1;152. She recently described a

simple procedure for generating all 1;152 patterns in an article written for the blind. (Dame

Ollerenshaw, now 87, is slowly losing her vision, and energetically learning how to read

Braille.)

This is her procedure. Number the sixteen positions of the square from 1 through 16,

left to right, top down. Place an arbitrary card, say the Ace of Spades, in position 1, the top

left corner. A second ace, say the Ace of Hearts, goes in the second row. It can’t go in the

same column or diagonal as the Ace of Spades, so it must go in either space 7 or 8. Place

it arbitrarily in space 7. Two aces remain to go in rows 3 and 4. Put the Ace of Diamonds

in the third row. It can go only in space 12. The Ace of Clubs is now forced into space 14

of the bottom row. Had the second ace gone in space 8, the last two aces would be forced

into spaces 10 and 15.

Consider the other three spades. They can’t go in the top row or leftmost column, or

in a main diagonal. This forces them into spaces 4, 10, and 15. Arbitrarily place the King

of Spades in 4, the Queen of Spades in 10, and the Jack of Spades in 15. The pattern now

looks like this:

A¨

Kª

Qª

Jª

Aª

A©

A§

The remaining nine cards are forced into spaces that complete the following pattern:

A¨

Kª

Qª

Jª

Aª

A©

A§

Q¨

J¨

K¨ Q©

J©

Q§

K§

K© J§

Multiply the number of choices at each step, 16 � 3 � 2 � 2 � 3 � 2, and you get the total

of 1,152 patterns.

For more examples of mathematical theorems, problems, and tricks with playing cards,

see my Dover paperback Mathematics, Magic, and Mystery, and Karl Fulves’s Self-Working

Card Tricks, also a Dover soft-cover, and the following chapters in my collections of Scien-

tific American columns: The Scientific American Book of Mathematical Puzzles and Diver-

sions, Chapter 10; Mathematical Carnival, Chapter 10 and 15; Mathematical Magic Show,

i

i

“master” — 2012/11/15 — 11:23 — page 225 — #239i

i

i

i

i

i

30. Modeling Mathematics with Playing Cards 225

Chapter 7; Wheels, Life, and Other Mathematical Amusements, Chapter 19; Penrose Tiles

to Trapdoor Ciphers, Chapter 19, and The Last Recreations, Chapter 2.

Now for two puzzles that can be modeled with cards. Solutions appear on page 226.

1. Arrange nine cards as shown in Figure 30.1. Assume the aces have a value of 1. Each

row, each column, and one diagonal has a sum of 6. The task is to alter the positions of

three cards so that the matrix is fully magic for all rows, columns, and diagonals.

A

A

A

2

2

3

3

3

2

Figure 30.1.

A 2 3

4 5 6

7 8 9

Figure 30.2.

2. Nine cards arranged and shown in Figure 30.2 have the property of minimizing the

sum of all absolute differences between each pair of cells that are adjacent vertically and

horizontally. Assume that the matrix is toroidal; that is, it wraps around in both directions.

The sum of the differences is 48. This was proved minimal by Friend Kirstead, Jr., in the

Journal of Recreational Mathematics (18, 1985–86, 301). The challenge is to take nine

cards of distinct values (court cards may be used) and form a toroidal square that will

maximize the sum of all absolute differences.

i

i

“master” — 2012/11/15 — 11:23 — page 226 — #240i

i

i

i

i

i


Solutions to the playing card puzzles above

1. Move the bottom two of three cards to the top, or move the leftmost column to the

right. Either change produces the desired magic square in Figure 30.3a.

A

Y

J

K

2

Z

3

Q

X

2

2

2

3

3

A

A

A

3

(a) (b)

Figure 30.3.

2. Figure 30.3b shows how nine cards of distinct values can be placed in a toroidal

square so as to maximize the sum of absolute differences of adjacent values in rows

and columns. Any value 4 through 10 can go in cells, X, Y, and Z to make a total

of 120. This was proved maximal by Brian Maxwell, of Middlesex, England, in the

Journal of Recreational Mathematics, vol. 18 (1985–86), p. 300.

i

i

“master” — 2012/11/15 — 11:23 — page 227 — #241i

i

i

i

i

i

31The Probability an Amazing

Card Trick Is Dull

Christopher N. Swanson

The Ashland University student chapter of the MAA holds biweekly meetings, typically

consisting of a short business meeting to plan activities that the group is sponsoring fol-

lowed by a social time when the group plays mathematical games and munches on brown-

ies. As a new feature in Fall 2002, I told the students that I would perform a new math-

ematical card trick at each of these meetings. My source for most of these card tricks is

the delightful book [2]. One of the finest tricks described there is due to an amateur New

York magician named Henry Christ. A spectator shuffles a deck of cards several times, and

then the magician deals nine cards face down on a table. The spectator selects one of these

cards, looks at it, and stacks the nine dealt cards on the table with the selected card on

top. The magician places this stack on the bottom of the deck, and hands the deck to the

spectator. The spectator is told to deal the cards out face up in a pile, counting backwards

out loud from ten while dealing the cards. If at some point, the denomination of the card

matches the number the spectator says, then the spectator repeats this procedure with a sec-

ond pile, again counting backwards from ten. If 1 is reached without any cards matching,

the spectator places the next card face down on top of the pile, and repeats this procedure

with the second pile. The spectator does this until four piles are created in this manner,

and then adds the numbers appearing on the cards that are face up. The spectator counts

through the remaining cards of the deck to find the card in the position represented by this

sum, and this card is the one that was originally selected.

The reason why this card trick works is that the final card will always be the 44th card

in the deck. This can be seen by noting that any unmatched pile will contain 11 cards that

are set aside. If a pile has a match with a value i showing, then the match occurred after

counting down 11� i cards. This card with value i will contribute i to the sum determining

how many cards in the final deck to count down, and thus, a total number of 11�i Ci D 11

cards will be set aside for this pile.

After practicing this card trick a few times, I was ready to present it to the students

at our meeting. Unfortunately, when I performed the trick at the meeting, a match did

not occur in any of the four piles. This is unfortunate for two reasons. First, nothing is

done with the pile of cards left in the spectator’s hands, and the magician must tell the


227

i

i

“master” — 2012/11/15 — 11:23 — page 228 — #242i

i

i

i

i

i


spectator to simply turn over the card appearing face down on the final pile. Second, and

more serious from the perspective of the magician, the trick seems quite simplistic when

no matches occur because it is clear that you set aside 44 cards. Thus, a magician may ask,

“What is the probability that no matches occur and my trick is dull?” This question falls

within a class of problems known as counting permutations with restricted positions. For a

more general discussion of this class of problems, see [1]. For a more advanced treatment

of the topic, see [7] or [8], and for additional applications, see [3], [4], [5], [6].

The probability of no matches

Our first step in studying this card trick problem is to note that after the nine cards are

returned to the bottom of the deck, the order of the cards in the deck is one of the possible

52! permutations. Hence, for our purposes, we just need to focus on the latter part of the

card trick. Let D be the number of ways to shuffle the deck of cards such that there is not

a 10 in the 1st, 11th, 21st, or 31st positions, there is not a 9 in the 2nd, 12th, 22nd, or 32nd

positions, etc. Thus, each card from ace and ten has four positions where it cannot occur.

We find D by using the principle of inclusion-exclusion: Given a collection of finite sets

A1; A2; : : : ; An, the number of elements in the union of these sets A1 [ A2 [ � � � [ An is

jA1 [ A2 [ � � � [ Anj DX

1�i�n

jAi j �X

1�i<j �n

jAi \ Aj j C � � �

C .�1/nC1 jA1 \ A2 \ � � � \ Anj:

Essentially this means to find the number of elements that appear in at least one of these

sets, count the number of elements in each set, then subtract off the number of elements

that appear in two sets, then add the number of elements that appear in three sets, and

continue to alternate these additions and subtractions up to the number of elements that

appear in all of the sets. To apply this principle to our problem, we number the forty cards

with restricted positions and let Ai be the set of permutations that have card numbered i

in one of its restricted positions. Then the number of permutations without any cards in

restricted positions is

D D 52Š � jA1 [ A2 [ � � � [ A40j

D 52Š �

0

@X

1�i�40

jAi j �X

1�i<j �40

jAi \ Aj j C � � � � jA1 \ A2 \ � � � \ A40j

1

A

D 52Š C40X

kD1

.�1/kX

1�i1<i2<��<ik�40

jAi1 \ Ai2 \ � � � \ Aik j:

The inner sum of this formula overcounts the number of ways to place at least k cards in

restricted positions. Let rk (sometimes called a rook number due to an interpretation con-

cerning placing non-attacking rooks on a chessboard) be the number of ways of placing k

cards in restricted positions, ignoring the placement of any other cards. Then the remaining

cards can be placed in .52 � k/Š ways, and we see thatX

1�i1<i2<��<ik�40

jAi1 \ Ai2 \ � � � \ Aik j D rk.52 � k/Š:

i

i

“master” — 2012/11/15 — 11:23 — page 229 — #243i

i

i

i

i

i

31. The Probability an Amazing Card Trick is Dull 229

Taking r0 D 1 (as is standard), we see that our formula becomes

D D40X

kD0

.�1/krk.52 � k/Š; (31.1)

for which we need the numbers rk . To calculate rk , we find the generating function (called

the rook polynomial) for this sequence, R.x/ D r0 C r1x C r2x2 C � � � C r40x40. (Note

that ri is simply the coefficient of xi .) We do this by first finding the generating polynomial

P.x/ for the number of ways for having i cards for a particular denomination in a restricted

position, and then observing that R.x/ D .P.x//10 .

For ease of description, we focus on placing 10’s in restricted positions, but clearly the

same holds for each denomination. There are�

41

�

� 4 D 16 ways to place one 10 since there

are�

41

�

ways to choose the 10 and four positions for it. Similarly, for two 10’s, there are�

42

�

ways to choose the cards, and then 4 � 3 ways to place them in restricted positions (with the

suits in alphabetical order say, four choices for the “first” and three for the “second”) for a

total of�42

�

� 4 � 3, or 72. In the same way, we find there are 96 ways to place three 10’s and

24 ways to place all four 10’s in restricted positions. Therefore, with there being just one

way to have no 10’s in restricted positions, P.x/ D 1 C 16x C 72x2 C 96x3 C 24x4.

Now let’s consider the cards of two denominations, say the 10’s and the 9’s. We can

find the number of ways of having various combinations of these eight cards in restricted

positions (without regard to any others) by squaring P.x/ W 1 C 32x C 400x2 C 2496x3 C8304x4C14592x5C12672x6C4608x7C576x8. For example, consider the term 2496x3.

This says there are 2496 ways of having three 9’s and 10’s in restricted positions, arising

from no 9’s and three 10’s (1 � 96 ways), one 9 and two 10’s (16 � 72 ways), two 9’s and

one 10 (72 � 16 ways), and three 9’s and no 10’s (96 � 1 ways). The multiplicative rule is

justified since the restricted positions for different denominations are disjoint. From this

line of reasoning it follows that the coefficient of xk in .P.x//10 is the number of ways of

having k number cards in restricted positions, ignoring the placement of the other cards.

Thus, R.x/ D .1 C 16x C 72x2 C 96x3 C 24x4/10.

We used MAPLE to expand R.x/ to find the numbers rk , and then used equation (31.1)

to get

D D 3286792937574955359428826782591912851876783001405845125895657881600

� 3:29 � 1066:

Therefore, the probability that no card is in a forbidden position is D52Š

� 0:0407. We

see that although it is unlikely that the magician would not get any cards to match while

performing the magic trick, it will happen often enough that the magician better know how

to adjust the trick. (An anonymous referee noted that we can also produce the approximate

value of 0:0407 using double-precision arithmetic.)

Exercises

1. The game Euchre uses only 6 cards .9; 10; Jack; Queen; King; Ace/ from each suit.

Suppose you are at a party and you wish to demonstrate the card trick using such

i

i

“master” — 2012/11/15 — 11:23 — page 230 — #244i

i

i

i

i

i


a deck. You plan to have the spectator create 4 piles while counting backwards

Jack-10-9. How many cards should you first deal? What is the probability that no

match occurs in the four piles?

2. Consider the general version of Henry Christ’s trick. You have a deck of cards con-

sisting of k suits having f face values and in the trick the spectator is to create p

piles while counting backwards from m to 1. What are the possible values for m?

How many cards should you first deal? Find an expression for the probability that

no match occurs in the p piles.

Acknowledgment The author would like to thank the editor and the anonymous referees

for improving the clarity of this article. He would also like to thank the students of Ashland

University’s student chapter of the MAA for putting up with his bad humor and card tricks

during meetings.

Bibliography

[1] R. A. Brualdi, Introductory Combinatorics, 3rd ed., Prentice-Hall, 1999, Ch. 6.

[2] M. Gardner, Mathematics, Magic and Mystery, Dover Publications, 1956.

[3] A. W. Joseph, A problem in derangements, J. Inst. Actuaries Students’ Soc. 6 (1946) 14–22.

[4] F. F. Knudsen and I. Skau, On the asymptotic solution of a card-matching problem, Math. Mag.,

69 (1996) 190–197.

[5] B. H. Margolius, The dinner-diner matching problem, Math. Mag., 76 (2003) 107–118.

[6] S. G. Penrice, Derangements, permanents, and Christmas presents, Amer. Math. Monthly 98

(1991) 617–620.

[7] J. Riordan, An Introduction to Combinatorial Analysis, Princeton Univ. Press, 1978.

[8] R. P. Stanley, Enumerative Combinatorics: Volume I, Cambridge Univ. Press, 1998.

i

i

“master” — 2012/11/15 — 11:23 — page 231 — #245i

i

i

i

i

i

32The Monty Hall Problem,

Reconsidered

Stephen Lucas, Jason Rosenhouse, and Andrew Schepler

In its classical form, the Monty Hall Problem (MHP) is the following:

Version 1. (Classic Monty) You are a player on a game show and are shown three iden-

tical doors. Behind one is a car, behind the other two are goats. Monty Hall, the host of the

show, asks you to choose one of the doors. You do so, but you do not open your chosen

door. Monty, who knows where the car is, now opens one of the doors. He chooses his door

in accordance with the following rules:

1. Monty always opens a door that conceals a goat.

2. Monty never opens the door you initially chose.

3. If Monty can open more than one door without violating rules one and two, then he

chooses his door randomly.

After Monty opens his door, he gives you the choice of sticking with your original choice

or switching to the other unopened door. What should you do to maximize your chances of

winning the car?

In the entire annals of mathematics, you would be hard-pressed to find a problem that

arouses the passions like the MHP. It has a history going back at least to 1959, when Mar-

tin Gardner introduced a version of it in Scientific American [4, 5]. When statistician Fred

Moseteller included it in his 1965 anthology of probability problems [9], he remarked that

it attracted far more mail than any other problem. In his 1968 book Mathematical Ideas in

Biology [18], biologist John Maynard Smith wrote, “This should be called the Serbelloni

problem since it nearly wrecked a conference on theoretical biology at the villa Serbel-

loni in the summer of 1966.” In its modern game show format the problem made its first

appearance in a 1975 issue of the academic journal The American Statistician [16]. Math-

ematician Steve Selvin presented it as an interesting classroom exercise on conditional

probability. Though he presented the correct solution, (that there is a big advantage to be

gained from switching), he found himself strongly challenged by subsequent letters to the

editor [17].

Reprinted from Mathematics Magazine, Vol. 82, No. 5 (Dec. 2009), pp. 332–342.

231

i

i

“master” — 2012/11/15 — 11:23 — page 232 — #246i

i

i

i

i

i


The problem really came into its own when Parade magazine columnist Marilyn vos

Savant responded to a reader’s question regarding it. There followed several rounds of

angry correspondence, in which readers challenged vos Savant’s solution. The challengers

later had to eat crow when it was shown by a Monte Carlo simulation that vos Savant was

correct, but not before the fracas reached the front page of the New York Times [20]. The

whole story is recounted in the books by Rosenhouse and vos Savant [13, 15].

In the end, the situation has been best summed up by cognitive scientist Massimo

Palmatelli-Palmarini who wrote that, “. . . no other statistical puzzle comes so close to fool-

ing all the people all the time. . . The phenomenon is particularly interesting precisely be-

cause of its specificity, its reproducibility, and its immunity to higher education” [10].

Why all the confusion?

The trouble, you see, is that most people argue like this: “Once Monty opens his door only

two doors remain in play. Since these doors are equally likely to be correct, it does not

matter whether you switch or stick.” We will refer to this as the fifty-fifty argument.

This intuition is supported by a well-known human proclivity. A negative consequence

incurred by inaction hurts less than the same negative consequence incurred through some

definite action. In the context of the MHP, people feel worse when they switch and lose

than they do after losing by sticking passively with their initial choice.

There is a large literature in the psychology and cognitive science journals documenting

and explaining the difficulty people have with the MHP. Burns and Wieth [3] summarized

the findings of numerous such studies by writing,

These previous articles reported 13 studies using standard versions of the

MHD, and switch rates ranged from 9% to 23% with a mean of 14.5%. This

consistency is remarkable given that these studies range across large differ-

ences in the wording of the problem, different methods of presentation, and

different languages and cultures.

(Note that MHD stands for “Monty Hall Dilemma.”)

Gilovich, Medvec, and Chen [6] studied people’s reactions to losing by switching ver-

sus their reactions to losing by sticking. They used boxes instead of doors, and crafted an

experimental situation in which players would lose regardless of their decision to switch

or stick. Their findings?

Because action tends to depart from the norm more than inaction, the indi-

vidual is likely to feel more personally responsible for an unfortunate action.

Thus, subjects who switched boxes in our experiment were more likely to

experience a sense of “I brought this on myself,” or “This need not have hap-

pened,” than subjects who decided to keep their initial box.

It would seem the defenders of sticking can point both to a plausible mathematical

argument and to certain fine points of human psychology. How can the forces for switching

fight back?

i

i

“master” — 2012/11/15 — 11:23 — page 233 — #247i

i

i

i

i

i

32. The Monty Hall Problem, Reconsidered 233

Focus on Monty, not the doors

There are a variety of elementary methods for solving the MHP. Working out a tree diagram

for the problem, as in Figure 32.1, establishes that switching wins with probability 23

,

while sticking wins with probability 13

. Consequently, we double our chances of winning

by switching.

MontyopensDoor 2

MontyopensDoor 3

MontyopensDoor 3

MontyopensDoor 2

1

3

1

3

1

21 1

1

2

CarbehindDoor 1

CarbehindDoor 2

CarbehindDoor 3

Switching loses Switching loses Switching wins Switching wins

Probability =16

Probability =16

Probability =13

Probability =13

1

3

Figure 32.1. Probability tree for the classical MHP, when the player initially chooses door one.

Monte Carlo simulations are also effective for establishing the correct answer. The

Monty Hall scenario is readily simulated on a computer. The large advantage to be gained

from switching quickly becomes apparent by playing the game multiple times.

Such methods, however, do little to clarify why the fifty-fifty argument is incorrect.

Practical results obtained from a simulation can show you that something is wrong with

your intuition, but they will not make the correct answer seem natural. The trouble lies in

the difficulty people have in recognizing what is and is not important in reasoning about

conditional probability.

The mantra about focus goes a long way towards pointing people in the right direction.

When Monty opens door X , there is a tendency to think, “I have learned that door X

conceals a goat, but I have learned nothing of relevance about the other two doors.” This

is what we mean by “focusing on the doors.” The proper approach involves focusing on

Monty, specifically on the precise manner in which he chooses his door to open. We should

think, “Monty, who makes his decisions according to strict rules, chose to open door X .

Why this door as opposed to one of the others?”

Let us assume the player initially chose door one and Monty then opened door two.

According to the rules, we can be certain that one of the following two scenarios has

played out:

i

i

“master” — 2012/11/15 — 11:23 — page 234 — #248i

i

i

i

i

i


1. The car is behind door one. Monty chose door two at random from among doors two

and three.

2. The car is behind door three. Since the player initially chose door one, Monty was

now forced to open door two.

The second of these scenarios is more likely than the first. Since the car is behind the

first door one-third of the time, and since Monty then opens door two in one-half of those

cases, we see that scenario one occurs one-sixth of the time. Scenario two, on the other

hand, happens whenever the car is behind door three (and the player has chosen door one).

That happens one-third of the time. Scenario two is twice as likely as scenario one.

Thus, we should think, “I have just witnessed an event that is twice as likely to occur

when the car is behind door three than it is when the car is behind door one. Consequently,

the car is more likely to be behind door three, and I am more likely to win the car by

switching.”

An exotic selection procedure

The general principle here is that anything affecting Monty’s decision-making process is

relevant to updating our probabilities after Monty opens his door. To further illuminate this

point, let us consider an altered version of the problem:

Version 2. (High-Numbered Monty) As before, we have three identical doors conceal-

ing one car and two goats. The player chooses a door that remains unopened. Monty now

opens a door he knows to conceal a goat. This time, however, we stipulate that Monty al-

ways opens the highest-numbered door available to him (keeping in mind that Monty will

never open the door the player chose). Will the player gain any advantage by switching

doors?

For reasons of concreteness, we will assume once more that the player initially chooses

door one.

Any time door one conceals a goat, Monty has no choice regarding which door to open.

He can not open door one (since the player chose that door), and he can not open the door

that conceals the car. This leaves only one door available to him.

The interesting cases occur when door one conceals the car. Unlike Classic Monty,

who now chooses randomly, High-Numbered Monty will always open door three when he

can. It follows that if we see him open door two instead we know for certain that the car is

behind door three.

And if High-Numbered Monty opens door three? Since Monty is certain to open door

three whenever the car is behind door one or door two, we now have no basis for deciding

between them. It really is a fifty-fifty decision in this case.

Take this as a cautionary tale. Whether we are playing Classic Monty or High-Numbered

Monty, it is certain that Monty will open a goat-concealing door. In the former case the

probability that our initial choice concealed the car did not change while in the latter case

it did. This shows that any proposed solution to the MHP failing to pay close attention to

Monty’s selection procedure is incomplete.

i

i

“master” — 2012/11/15 — 11:23 — page 235 — #249i

i

i

i

i

i


Monty meets Bayes

The main point thus far is that the probability that door X conceals the car, given that

Monty has shown us the goat behind door Y , depends on a detailed consideration of

Monty’s selection procedure. More precisely, it depends on the probability that Monty

will open door Y under the assumption that door X conceals the car. The precise manner

in which these probabilities are related is given by Bayes’ theorem.

We denote by Ci the event that the car is behind door i , and by Mj the event that

Monty opens door j to reveal a goat. Also assume the player initially chooses door one,

and Monty then opens door two. The probability that the player’s door conceals the car,

given Monty’s display, can be found using Bayes’ theorem:

P.C1jM2/ DP.C1/P.M2jC1/

P.M2/:

Expanding the bottom of this fraction via the law of total probability leads to

P.C1jM2/ D P.C1/P.M2jC1/

P.C1/P.M2 jC1/ C P.C2/P.M2jC2/ C P.C3/P.M2 jC3/:

In both of our versions of the MHP we have P.M2jC2/ D 0, since it is given that Monty

will never open the door concealing the car. Also, since we are given that the doors are

identical, we have

P.C1/ D P.C2/ D P.C3/ D 1

3:

Making these substitutions leads to

P.C1 jM2/ D P.M2jC1/

P.M2jC1/ C P.M2jC3/:

In both versions of the game we have P.M2jC3/ D 1. That is, when the player chooses

door one and the car is behind door three, Monty is certain to open door two.

In Classic Monty we have P.M2jC1/ D 12

, since Monty chooses at random when

the car is behind the door initially chosen by the player. In High-Numbered Monty we

have P.M2jC1/ D 0, since Monty is required by his rules to open door three. Plugging

everything into Bayes’ Theorem shows that for Classic Monty we now have

P.C1jM2/ D12

12

C 1D

1

3[Classic],

while for High-Numbered Monty we have

P.C1jM2/ D12

0 C 1D 1

2[High-Numbered].

These are precisely the answers we obtained in the previous section.

Let us go one more round:

i

i

“master” — 2012/11/15 — 11:23 — page 236 — #250i

i

i

i

i

i


Version 3. (Random Monty) As always, assume that the player has initially chosen door

one and Monty subsequently opened door two to reveal a goat. This time, however, you

know that Monty chose his door randomly, subject only to the restriction that he not open

the door the player chose. Should we switch?

The novelty here lies in the nonzero probability of Monty opening the door concealing

the car. Intuitively we would reason as follows: Since Monty opened door two after I se-

lected door one, since door two concealed a goat, and since I know Monty chose randomly

between doors two and three, I conclude that one of two scenarios has played out:

1. The car is behind door one, Monty chose door two randomly.

2. The car is behind door three, Monty chose door two randomly.

Since the car is equally likely to be behind doors one and three, these scenarios are equally

likely to occur. The conclusion is that the remaining doors are equiprobable, and therefore

there is no advantage to switching.

Our intuition is confirmed via Bayes’ Theorem. We know that Monty will not open

door one, and we know that door two conceals a goat. We now have

P.C1/ D P.C2/ D P.C3/ D 1

3;

P.M2jC1/ D P.M2jC3/ D 1

2;

P.M2jC2/ D 0:

Bayes’ Theorem now says

P.C1jM2/ D13

�12

�

13

�12

�

C 13.0/ C 1

3

�12

� D 1

2:

The tree diagram in Figure 32.2 might be helpful for visualizing the situation.

Two-player Monty

Three-door versions of the MHP can become remarkably complex. The following version

comes from a paper by philosopher Peter Baumann [1, 2]. For the remainder of the paper

we will refer simply to the probability of door X, thereby avoiding the more cumbersome

expression, “The probability that door X conceals the car.”

Version 4. (Two-Player Monty) We begin with three identical doors concealing two

goats and one car. There are two players in the game. Each player chooses one of the

doors, but does not open it. Each player knows there is another person in the game, but

neither knows which door the other player selected. Monty now opens a door according to

the following procedure.

1. If both players selected the same door, then everything proceeds as in the classical

game. Monty opens a goat-concealing door, choosing randomly if he has a choice.

i

i

“master” — 2012/11/15 — 11:23 — page 237 — #251i

i

i

i

i

i


MontyOpensDoor 2

MontyOpensDoor 3

MontyOpensDoor 2

MontyOpensDoor 3

MontyOpensDoor 2

MontyOpensDoor 3

Switching loses

Probability =16

Switching loses

Probability =16

1

3

1

3

Switching wins

Probability =16

Game ends

Switching wins

Probability =16

Game ends

1

2

1

2

1

2

1

2

1

2

1

2

CarbehindDoor 1

CarbehindDoor 2

CarbehindDoor 3

1

3

Figure 32.2. Probability tree for Random Monty, when the player initially chooses door one.

2. If the players selected different doors, then Monty opens the one remaining door,

regardless of what is behind it.

We assume that both players select their initial doors randomly. If you are one of the players

and you have just seen Monty open a goat-concealing door, should you switch?

This will be a fine test of our new-found intuition. For concreteness, suppose that Player

A initially chose door one, and Monty has now opened the goat-concealing door two. What

do Monty’s actions tell us about Player B’s choice? Initially we consider it equally likely

that Player B chose door one, door two or door three. After seeing Monty open door two

we reason that one of three scenarios has played out:

1. Player B chose door three. In this case Monty was forced to open door two, which

conceals a goat with probability 23

.

2. Player B chose door one and door one conceals the car. In this case Monty opens

door two with probability 12

. Since door one conceals the car with probability 13

, this

scenario occurs with probability 16

.

3. Player B chose door one and door three conceals the car (which happens with prob-

ability 13

). In this case Monty is again forced to open door two.

Combining items two and three above shows that Player B chose door one with probability13

C 16

D 12

. Item one shows that Player B chose door three with probability 23

. We conclude

that the event in which Monty opens the goat-concealing door two after Player A chooses

door one is 43

more likely to occur when Player B has chosen door three than when he has

chosen door one.

i

i

“master” — 2012/11/15 — 11:23 — page 238 — #252i

i

i

i

i

i


It is a consequence of Bayes’ theorem that the probabilities we now assign to “Player B

chose door three,” and “Player B chose door one” must preserve this 4 W 3 ratio. (A proof

of this assertion can be found in the paper by Rosenthal [14].) Consequently, we assign

probabilities of 47

and 37

respectively.

To continue the analysis, note that from Player A’s perspective there are now four

possibilities. Player B could have chosen door one or door three, and the car could be

behind either of those doors. Let us denote these possibilities via ordered pairs of the form

(Player B’s Door, Location of the Car).

Thus, the four remaining possibilities are

.3; 1/; .3; 3/; .1; 1/; .1; 3/:

Consider the first two pairs. If Player B chose door three, then Monty was forced to

open door two. Consequently, we learn nothing regarding the probability of doors one and

three. Since these two scenarios collectively have a probability of 47

, and since they are

equally likely, we now assign the following probabilities:

P.3; 1/ D P.3; 3/ D 2

7:

The remaining two pairs, however, are not equiprobable. Suppose that Player B chose

door one, just as Player A did. If the car is behind door one, then Monty chose door two

randomly, which happens with probability 12

. If the car is behind door three, then Monty

was forced to choose door two. It follows that it is twice as likely that the car is behind door

three than that it is behind door two. Since these scenarios have a collective probability of37

, we assign the following probabilities:

P.1; 1/ D 1

7and P.1; 3/ D 2

7:

Of the four scenarios, the two in which Player A wins by switching are .1; 3/ and .3; 3/.

Since both have probability 27

, this gives a total probability of winning by switching of 47

.

That is our solution.

The really amusing part is that both players will go through this analysis, and both will

decide to switch doors. In those scenarios in which the players chose different doors, this

implies that someone is definitely making the wrong decision. Such are the cruelties of

probability.

Two-player Monty has also been discussed by Baumann [1], Levy [7], Rosenhouse

[13], and Sprenger [19].

Many doors

Ready for the final exam?

Version 5. (Progressive Monty) This time there are n identical doors, concealing one

car and n � 1 goats. The player chooses a door, but does not open it. Monty now opens a

i

i

“master” — 2012/11/15 — 11:23 — page 239 — #253i

i

i

i

i

i


goat-concealing door, choosing randomly from among his options. The player is now given

the choice of sticking or switching. The player makes his choice, but again does not open

his chosen door. Monty opens another goat-concealing door. The player is again given the

opportunity to stick or switch. This continues until Monty has opened n � 2 doors. The

player makes his final selection, and wins whatever is behind his door. What strategy will

maximize his chances of winning the car?

To help us get our bearings, let us try a case study. Suppose we begin with five doors.

At any stage of the game we represent the probabilities of the doors, based on all available

knowledge, via an ordered 5-tuple, which we call the probability vector. As the game

begins, we have probability vector�

1

5;

1

5;

1

5;

1

5;

1

5

�

:

As always, let us assume the player chooses door one and Monty now opens door two.

Our past experience suggests that the probability of our door does not change, and this

is confirmed by Bayes’ theorem. In the following calculation, the notation Ci denotes the

event where the car is not behind door i .

We now compute

P.C1jM2/ DP.C1/P.M2 jC1/

P.C1/P.M2 jC1/ C P.C1 and C2/P.M2jC1 and C2/

D15

�14

�

15

�14

�

C 35

�13

� D 1

5:

Since the other doors are identical and since their probabilities must sum to 45

, we now

have probability vector �1

5; 0;

4

15;

4

15;

4

15

�

:

What if we now switch to door three and then see Monty open door five? Known

probabilities are now

P.C3/ D P.C4/ D P.C5/ D 4

15;

P.M5jC1/ D P.M5jC4/ D 1

2;

P.C1/ D 1

5and P.M5jC3/ D 1

3:

If we use the law of total probability to write

P.M5/ D P.C1/P.M5jC1/ C P.C3/P.M5 jC3/ C P.C4/P.M5jC4/ D29

90;

and plug the results into Bayes’ Theorem, the result is the probability vector

�9

29; 0;

8

29;

12

29; 0

�

:

The probabilities of all the remaining doors went up.

i

i

“master” — 2012/11/15 — 11:23 — page 240 — #254i

i

i

i

i

i


What if Monty had opened door one after we switched to door three? The reader can

supply the details that lead to the vector�

0; 0;1

4;

3

8;

3

8

�

:

Notice that the probability of door three went down, from 415

to 14

. Our chosen door actually

seems less likely as the result of Monty’s actions. A surprising result!

Things get messy indeed in this version. Plainly we need some guidelines to aid our

intuition.

The first principle is simple. Any time Monty chooses not to open a door different from

your present choice, the probability of that door goes up. In our case study, Monty opened

door two after we chose door one. The event, “Monty does not open door three,” is more

likely to happen when the car is behind door three than when it is elsewhere. Consequently,

we will revise upward our probability of door three.

The second principle is that if the doors different from your present choice are equiprob-

able, then the probability of your choice does not change when Monty opens a door. In our

case study, after Monty opened door two, we reason that the event, “Monty does not open

door one,” has probability one regardless of the location of the car. Consequently, we learn

nothing from the occurrence of that event. The calculation in our case study confirms this

intuition.

Why, though, does it matter that the other doors are equiprobable? The answer is that

Monty’s failure to open a door is not the only source of information to which we have

access. The probability of the event, “Monty opens door X ,” depends in part on the prob-

ability of the event, “Door X conceals the car.” Specifically, the more likely a door is to

conceal the car, the less likely Monty is to open that door. Once more returning to our

case study, we switched to door three at a moment when doors three through five were

equiprobable and collectively very likely to conceal the car. By opening door five, Monty

eliminated one element of this collection. This revelation does nothing to shake our confi-

dence that the car is more likely to be found among doors three through five than it is to be

found among any collection of three doors that includes door one. Consequently, we will

revise upward the probability of our chosen door.

Why did the probability of door three go down when Monty opened door one? This one

is harder to explain, but our calculation suggests the proper way to think about it. If all four

doors had been equiprobable at the moment we switched to door three, then we would sim-

ply be playing Classic Monty on four doors. In that case, our chosen door would retain its 14

probability after Monty opens a door. The probability vector we computed for our present

situation is identical to what we would have obtained were we playing four-door Classic

Monty. The implication is that by eliminating door one, Monty has essentially erased the

prior history of the game. We are now faced with three doors that were equiprobable at the

moment we chose among them, and these doors were among an ensemble of four doors,

just as in four-door Classic Monty. That door one had a different probability from the other

doors does not distinguish our situation in a relevant way.

This observation leads to our final clue. If we select a door at a moment when precisely

k doors remain, the probability of that door can never be smaller than 1k

. Even if we have

been careless in extracting the maximum amount of information from Monty’s actions, we

still know the door was chosen from among k possibilities.

i

i

“master” — 2012/11/15 — 11:23 — page 241 — #255i

i

i

i

i

i


As a test of our principles, let us go another round with our case study. We left off with

the player having chosen door three and with probability vector,

�

0; 0;1

4;

3

8;

3

8

�

:

Imagine that we now switch to door four.

If Monty now opens door three then only doors four and five remain in play. We would

reason that these two doors were equiprobable at the moment we switched to door four,

but that door four was selected from among three possibilities. We are, in effect, playing

Classic Monty, and we would expect our updated probability vector to be

�

0; 0; 0;1

3;

2

3

�

:

The calculation is

P.C4jM3/ D P.C4/P.M3 jC4/

P.C4/P.M3 jC4/ C P.C5/P.M3jC5/

D38

�12

�

38

�12

�

C 38.1/

D 1

3:

And if Monty opens door five instead? Our intuition tells us that both doors should see

their probabilities go up: door three, because it might have been opened but was not; door

five, because it was part of an equiprobable ensemble that has decreased in size. Bayes’

Theorem confirms our intuitions. We compute

P.C3 jM5/ DP.C3/P.M5 jC3/

P.C3/P.M5 jC3/ C P.C4/P.M5jC4/D

14.1/

14.1/ C 3

8

�12

� D4

7;

and obtain probability vector�

0; 0;4

7;

3

7; 0

�

:

Remarkably, our arguments to this point are already enough to justify the correct so-

lution to Progressive Monty. Consider the strategy in which we switch at the last minute

(SLM). That is, we will stick with our initial choice until only two doors remain, and then

we will switch. Our initial choice has probability 1n

. Since the other doors are equiprobable,

this probability will not change so long as we keep it as our choice. At the moment when

only two doors remain, the other door will have probability n�1n

. That is the probability

that we win with SLM.

We also know that there will never be a moment in the game when a door has a proba-

bility smaller than 1n

. Thus, at the moment when only two doors remain it is impossible to

produce a door with probability greater than n�1n

. This shows that SLM is optimal.

Very nice. A full, rigorous proof that SLM is, in fact, uniquely optimal can be found

in the book by Rosenhouse [13]. You might also wonder what can be said about other

strategies. For example, what if we are playing with fifty doors and we are absolutely

determined to switch exactly seven times during the game? What is our best strategy? A

consideration of such questions can be found in the paper by Lucas and Rosenhouse [8].

i

i

“master” — 2012/11/15 — 11:23 — page 242 — #256i

i

i

i

i

i


Progressive Monty receives further attention in one paper by Paradis, Viader, and Bibiloni

[11] and another by Rao and Rao [12]. For a variation on the problem, see the paper by

Zorzi in the December 2009 Mathematics Magazine.

It would seem that a bit of clear thinking can steer us through even the densest of

Monty-inspired forests. Once our intuition has been tuned to what is important, it is not so

difficult to ferret out the correct answer.

Bibliography

[1] Peter Baumann, Three doors, two players, and single-case probabilities, American Philosophi-

cal Quarterly 42(1) (January, 2005) 71–79.

[2] ——, Single-case probabilities and the case of Monty Hall: Levy’s view, Synthese 162(2) (May

2008) 265–273. dx.doi.org/10.1007/s11229-007-9185-6

[3] Bruce Burns and Mareike Wieth, The collider principle in causal reasoning: why the Monty

Hall Problem is so hard, Journal of Experimental Psychology, General 103(3) (2004) 436–449.

[4] Martin Gardner, Problems involving questions of probability and ambiguity, Scientific Ameri-

can 201(4) (April, 1959) 174–182.

[5] ——, How three modern mathematicians disproved a celebrated conjecture of Leonhard Euler,

Scientific American 201(5) (May, 1959) 188.

[6] T. Gilovich, V. H. Medvec, and S. Chen, Commission, omission, and dissonance reduction:

coping with regret in the Monty Hall Problem, Personality and Social Psychology Bulletin

21(2) (February 1995) 182–190. dx.doi.org/10.1177/0146167295212008

[7] Ken Levy, Baumann on the Monty Hall Problem and single-case probabilities, Synthese 158(1)

(September 2007) 139–151. dx.doi.org/10.1007/s11229-006-9065-5

[8] Stephen K. Lucas and Jason Rosenhouse, Optimal Strategies for the Progressive Monty Hall

Problem, Math. Gaz. (to appear).

[9] Fred Mosteller, Fifty Challenging Problems in Probability, With Solutions, Addison-Wesley

Inc., Reading, 1965.

[10] Massimo Piattelli-Palmarini, Probability blindness, neither rational nor capricious, Bostonia

(March/April 1991) 28–35.

[11] J. Paradis, P. Viader, and L. Bibiloni, A mathematical excursion: from the three-door

problem to a Cantor-type set, Amer. Math. Monthly 106 (1999) 241–251. dx.doi.org/

10.2307/2589679

[12] V. V. Bapeswara Rao and M. Bhaskara Rao, A three-door game show and some of its variants,

Math. Sci. 17(2) (1992) 89–94.

[13] Jason Rosenhouse, The Monty Hall Problem, Oxford University Press, New York, 2009.

[14] Jeffrey Rosenthal, Monty Hall, Monty fall, Monty crawl, Math Horizons (October 2008) 5–7.

[15] Marilyn vos Savant, The Power of Logical Thinking, St. Martin’s Press, New York, 1996.

[16] Steve Selvin, A problem in probability, (Letter to the Editor), Amer. Statist. 29(1) (1975) 67.

[17] ——, On the Monty Hall Problem, (Letter to the Editor), Amer. Statist. 9(3) (1975) 134.

[18] John Maynard Smith, Mathematical Ideas in Biology, Cambridge University Press, London,

1968.

[19] Jan Sprenger, Probability, rational single-case decisions, and the Monty Hall Problem, Synthese

(to appear).

[20] John Tierney, Behind Monty Hall’s doors, puzzle, debate, and answer? The New York Times,

July 21, 1991, 1A.

dx.doi.org/10.1007/s11229-007-9185-6

http://dx.doi.org/10.1007/s11229-007-9185-6

dx.doi.org/10.1177/0146167295212008

http://dx.doi.org/10.1177/0146167295212008

dx.doi.org/10.1007/s11229-006-9065-5

http://dx.doi.org/10.1007/s11229-006-9065-5

dx.doi.org/

http://dx.doi.org/10.2307/2589679

10.2307/2589679

http://dx.doi.org/10.2307/2589679

i

i

“master” — 2012/11/15 — 11:23 — page 243 — #257i

i

i

i

i

i

33The Secretary Problem from the

Applicant’s Point of View

Darren Glass

Searching for a job is always stressful and, with unemployment rates at their highest levels

in years, never more so than now. Applicants can and should use every advantage at their

disposal to obtain a job which is rewarding, financially and otherwise. While this author

believes a math major gives applicants many advantages as they search for their dream job,

one often overlooked is the ability to strategize and schedule their interviews to maximize

the chance of landing that job.

The secretary problem helps an employer find the optimal candidate for a job out of a

large pool of applicants. The set up is as follows: only one person can be hired, and, for

any pair of applicants, the employer has a strict preference for one of them that they can

discern after seeing both. However, after each interview the employer must either accept

or reject the candidate. If the candidate is the final person, the interviewer simply must

accept them, as rejected candidates cannot be recalled. In the classical formulation of the

problem, the goal is to select the best applicant overall.

What strategy can the employer use to maximize the probability of hiring the best

overall applicant? It is clear that, other than the final candidate, you only hire an applicant

if they are the best applicant you have seen to that point. Otherwise you are certainly not

hiring the best person. It follows that the best strategy is to reject an initial number of

candidates and then hire the first candidate who is better than everyone seen so far. With

a little work, one can show that if you know there will be a total of n candidates, where

n is large, then you should initially reject k � ne

candidates. For a proof, see [3]. A more

recent article discussing this problem and its implications for students’ dating lives can be

found in [6].

While there is some uncertainty regarding the origin of this problem, its first published

appearance was in Martin Gardner’s Mathematical Games column in February of 1960

(reprinted in [2]). A detailed history of the secretary problem and other classical stopping

problems is in [1].

The secretary problem is an outstanding example of how a seed planted by one of

Gardner’s columns has grown and flourished. Over the intervening half-century, people

have studied many variants. For example, how does the answer change if the interviewer

is allowed to recall the last few applicants [7]? Or what if there is a full committee of


243

i

i

“master” — 2012/11/15 — 11:23 — page 244 — #258i

i

i

i

i

i


interviewers rather than a single interviewer [4]? Or how should one deal with the costs

of interviewing and pressures to hire an early applicant to reduce costs [5]? As of this

writing, there are over 130 papers in MathSciNet referring to the Secretary Problem and

its extensions.

Almost all published variants look at the problem from the employer’s perspective.

At a recent senior thesis presentation on one such variant at Gettysburg College, another

senior, clearly concerned about his own job prospects, asked “What does this mean for

applicants? If I know that an employer is going to behave optimally, when should I schedule

my interview?” This is the question we examine here.

When rank is unknown

First we consider the situation of an applicant who has no idea how strong they are com-

pared to the rest of the pool. Throughout this note, we assume that there are n applicants

for a single job, and that the employer’s strategy is initially to reject the first k and then

hire the first subsequent candidate who is better than all candidates already seen, if such

a candidate exists, or the final candidate otherwise. We call this the optimal strategy. We

assume that all nŠ possible ordered rankings of the n candidates are equally likely and ex-

amine the probability that each position in the interview order is the one where the chosen

applicant is found.

Let us define the random variable X to be the interview position of the applicant who

is finally chosen. Using the optimal strategy, the employer rejects the first k applicants, and

therefore P.X D i/ D 0 if 1 � i � k. In order for the .k C 1/st candidate to be chosen,

they must be better than all the candidates in the rejected group. This occurs exactly when

the .k C 1/st candidate is the best among the first k C 1 candidates, which occurs with

probability 1kC1

. More generally, for k C 1 � i � n � 1, the i th candidate will be chosen

if and only if:

� The i th candidate is the best of the first i candidates.

� The best of the first i � 1 candidates is in the initial rejected group of k candidates.

The second condition ensures that we get to the i th candidate without choosing someone

else; the first condition ensures that we then choose the i th candidate. The probability of

these two conditions simultaneously holding is 1i

� ki�1

.

Finally, there are two separate situations in which the last candidate is selected: either

the best candidate overall was in the initial group of rejected candidates, in which case

the final candidate is chosen as a last resort, or the best overall candidate is the last one

interviewed and the second best candidate was in the initial group of rejectees. The first

scenario occurs with probability kn

; the second with probability 1n

� kn�1

. Adding these two

cases together, P.X D n/ D kn

C kn.n�1/

D kn�1

. In summary, we have the following

result.

Theorem 1. The probability that the i th candidate is chosen is

P.i/ D

8

ˆ<

ˆ:

0 if i � k,

ki.i�1/

if k < i < n,

kn�1

if i D n.

i

i

“master” — 2012/11/15 — 11:23 — page 245 — #259i

i

i

i

i

i

33. The Secretary Problem from the Applicant’s Point of View 245

The goal of the applicant is to obtain interview slot i which maximizes P.i/. Note thatk

i.i�1/is a decreasing function in i . Therefore the candidate should target the .k C 1/st slot

or the final slot, depending on whether 1kC1

or kn�1

is larger. An applicant will prefer the

final slot if kn�1

> 1kC1

. This is the case when

k >�1 C

p4n � 3

2; (33.1)

agreeing with the intuition that rejecting a larger number of candidates makes it more

likely that the best candidate is in the rejected group and that the employer accepts the final

candidate.

If the employer uses the optimal strategy with k D ne

, then the applicant must check

whether (33.1) holds, which happens when k2 C .1 � e/k C 1 > 0. A simple calculation

shows this is always the case, and therefore the candidate should choose to be interviewed

in the final slot if they have no information about their relative standing with the other

candidates. Of course, ne

is never an integer, so in reality the employer chooses either

k D bnec or dn

ee. In the latter case, the candidate will still wish to choose the final interview

slot, because if ne

> �1Cp

4n�32

then certainly dnee is as well. If the employer rounds down

then one can check that the candidate will be better off choosing last as long as k > 2:5. In

particular, being interviewed in the final position is optimal if n � 10. One can manually

check that, if the interviewer is going to reject k D bnec applicants, one will be no worse

off being interviewed last except when n D 4; 5, or 8. Thus we have proved

Theorem 2. If the number of applicants to a position is at least nine and an employer

uses the optimal strategy then the probability that they hire the final person interviewed is

higher than any other person.

When rank is known

On the whole, in the absence of information about their standing, a candidate should choose

to be interviewed last. This is also true if you know you are the worst candidate, as you

will never be better than all the candidates in the rejected group, so you can only be chosen

if you are the last resort. On the other hand, if you know you are the strongest candidate,

then you prefer the .k C 1/st slot over all others. You are then guaranteed to be chosen:

you are certainly better than anyone in the rejected group, and in any later slot there is a

chance that someone else is picked before you are interviewed. In general, then, it seems

that stronger applicants want to be interviewed early and weaker applicants later. In this

section, we prove that this is so by considering the point of view of a candidate who knows

they are the j th best in a pool of n candidates.

Assume first that our candidate interviews last. As in the previous section, if j ¤ 1

then the only way that they can be chosen is if the best candidate is in the rejected group,

which happens with probability kn�1

, since there are n � 1 slots remaining for the best

applicant to be in, all equally likely. On the other hand, if j D 1 then the only way to be

chosen from the last slot is if the second best candidate is rejected, which also happens

with probability kn�1

. In other words, the probability that the candidate in the final slot is

chosen is kn�1

independent of how strong they are.

i

i

“master” — 2012/11/15 — 11:23 — page 246 — #260i

i

i

i

i

i


As before, if a candidate is interviewed in slot i , k < i < n, they will be chosen if

and only if they are the best candidate seen so far and the second best candidate to that

point is rejected. Assume that i > 1, which will is the case as long as k > 0, i.e., at

least one candidate is rejected. The probability that the j th best candidate overall is the

best candidate seen so far, when they are interviewed in the i th slot, is the probability that

all i � 1 candidates seen previously come from the n � j candidates who are worse than

them. This happens with probability�

n�ji�1

�

=�

n�1i�1

�

. Moreover, the probability that the best

candidate seen before the i th slot was in the rejected group is ki�1

. Putting this together,

we get the following result:

Theorem 3. Assume the j th best candidate is interviewed in the i th slot. The probability

of getting chosen is:

Pj .i/ D

8

ˆ<

ˆ:

0 if i � k,

.n�ji�1/

.n�1i�1/

ki�1

if k < i < n,

kn�1

if i D n.

Again, Pj .i/ is a decreasing function in the range k < i < n. Explicitly, we note that

if i and i C 1 are in this range, then

Pj .i/

Pj .i C 1/D�

n�ji�1

�

�n�1i�1

�k

i � 1

�n�1

i

�

�n�j

i

�i

k

D .n � i/Š .n � i � j /Š

.n � i � 1/Š .n � i � j C 1/Š

i

i � 1

D n � i

n � i � j C 1

i

i � 1> 1;

and therefore Pj .i/ > Pj .i C 1/, as desired. This means that for any fixed value of j ,

an applicant should either choose to be interviewed in the .k C 1/st slot or in the nth

slot. In order to see which of these options gives the higher probability, we must compare

Pj .k C 1/ D�n�j

k

�

=�n�1

k

�

with Pj .n/ D kn�1

. After some algebraic manipulation, we find

that Pj .k C 1/ > Pj .n/ if and only if�

n�jk

�

>�

n�2k�1

�

. We wish to consider this inequality

separately for different values of j .

If j D 1, then we compare�

n�1k

�

to�

n�2k�1

�

. Since�

n�1k

�

D�

n�2k

�

C�

n�2k�1

�

, the best

candidate will prefer to be interviewed in the .k C 1/st slot, as we argued earlier.


n�2k

�

and�

n�2k�1

�

. The former is larger exactly when k <n�1

2. If the employer chooses k � n=e, then this holds for large values of k, implying that

the second best candidate should also choose the .k C 1/st slot.


n�3k

�

and�

n�2k�1

�

. Looking at the quotient of these terms and

expanding algebraically, we see that the former is larger exactly if

k2 C .5 � 3n/k C .n2 � 3n C 2/ > 0;

which occurs if k < 3n�5�p

5n2�18nC172

. For large n, this is true if k < 3�p

52

n � :38n;

which is (barely) guaranteed if the employer chooses k D n=e � :367n. More specifically,

one can work out that the third-best candidate should choose the .k C 1/st slot for all cases

except for 33 specific values of n, the largest of which is n D 98.

i

i

“master” — 2012/11/15 — 11:23 — page 247 — #261i

i

i

i

i

i

33. The Secretary Problem from the Applicant’s Point of View 247

For j � 4, comparing�n�j

k

�

and�

n�2k�1

�

is difficult. In particular, one can show that the

former term is greater exactly when

�n � k � 1

k

� j �2Y

`D1

n � k � ` � 1

n � ` � 1> 1:

Assuming that n D ek and letting n be large, we see that this holds if and only if.e�1/j �1

ej �2 � 1, which is false for j � 4. In particular, for sufficiently large n one prefers to

be interviewed last if ones ranking is fourth or worse. We summarize the preceding results:

Theorem 4. If you are the j th best applicant then you want to be interviewed in the

.k C 1/st slot for j � 3, and in the final slot if j � 4 and n is sufficiently large.

We also note that for any n and j � 3 and k > n�j2

we will have that�

n�2k�1

�

>�n�jk�1

�

��

n�jk

�

. If the employer sets k D dnee then k will be greater than n�j

2if j

n> e�2

e� :26. In

particular, this says that regardless of n, if a candidate suspects they are not in roughly the

top quarter of candidates, then they would prefer to be interviewed in the last slot.

Final thoughts

Unless you are a truly extraordinary applicant, if your prospective employer uses the clas-

sic strategy, then you should try to be interviewed last. However, attempting to game the

system is likely to backfire as there is no margin of error. Being the final interviewee gives

one the highest probability of being hired, but being the second-to-last has one of the low-

est! Students going on the job market should put their energy into improving their resumes

rather than strategizing interview timing. We note that a student can do both by general-

izing our work to some of the variants of the secretary problem mentioned earlier—and

publishing their work in future issues of the College Mathematics Journal.

Acknowledgment The author thanks the two students who inspired this question, Brian

Lemak and Paul Smith, as well as his colleagues in the math department at Gettysburg

College and the anonymous referees.

Bibliography

[1] T. S. Ferguson, Who solved the secretary problem? Statist. Sci. 4 (1989) 282–296; available at

dx.doi.org/10.1214/ss/1177012493

[2] M. Gardner, New mathematical diversions, Mathematical Association of America, Washington,

DC, revised edition, 1995.

[3] J. P. Gilbert and F. Mosteller, Recognizing the maximum of a sequence, J. Amer. Statist. Assoc.

61 (1966) 35–73; available at 10.2307/2283044

[4] H. Glickman, A best-choice problem with multiple selectors, J. Appl. Probab. 37 (2000) 718–

735; available at 10.1239/jap/1014842831

[5] Z. Govindarajulu, The secretary problem: optimal selection with interview cost, in Proceeding

of the Symposium on Statistics and Related Topics, Carleton Univ., Ottawa, 1975, 19.

dx.doi.org/10.1214/ss/1177012493

http://dx.doi.org/10.1214/ss/1177012493

10.2307/2283044

http://dx.doi.org/10.2307/2283044

10.1239/jap/1014842831

http://dx.doi.org/10.1239/jap/1014842831

i

i

“master” — 2012/11/15 — 11:23 — page 248 — #262i

i

i

i

i

i


[6] K. Merow, The view from here: Finding your match mathematically, Math Horizons 17 (2009)

18–20; available at 10.4169/194762109X468346

[7] O. K. Zakusilo, Optimal choice of the best object with possible returning to previously observed,

Theory Stoch. Process. 10 (2004) 142–149.

10.4169/194762109X468346

http://dx.doi.org/10.4169/194762109X468346

i

i

“master” — 2012/11/15 — 11:23 — page 249 — #263i

i

i

i

i

i

34Lake Wobegon Dice

Jorge Moraleda and David G. Stork

Nontransitive dice, discovered by Bradley Efron, but introduced to the public by Martin

Gardner [2], are sets of three dice which if rolled in pairs, die A most frequently beats

die B , B beats C , and C beats A. This is a statistical analogue of the game Rock, paper,

scissors (also called rochambeau, roshambo, and jan-ken-pon), in which each element or

action beats another and is beaten by yet another. In such a set there is no best or dominant

element. Of course the scalar mean values of the dice cannot exhibit nontransitivity; it is

only in the statistical case of rolls of different pairs of dice where nontransitivity can arise.

For an example of nontransitive dice, see Figure 34.1.

2

2

9

9

44

1

1

8

8

66

3

3

7

7

55

A B C

Figure 34.1. A set of nontransitive dice (after Efron), where in fair rolls of pairs of dice, on average

A beats B , B beats C , and C beats A. In this set, P ŒA > B� D P ŒB > C� D P ŒC > A� D 5=9 D55:56%:

Grinstead [3] explored the case where the dominance of dice alternated with successive

rolls. That is, if pips are summed over an even number of rolls, die A dominates die B;

summed over an odd number of rolls, die B dominates die A. Such nontransitive dominance

or paradoxical dominance can occur in preference for artworks and for political candidates,

as well as in dominance among warrior avatars in some computer games, some athletes,

and among teams, for instance the common claim that “Patriots beat Colts, Colts beat

Broncos, and Broncos beat Patriots.” [7, 6]. One experiment showed that in prospective

marriage partners, college students preferred intelligence over looks, looks over wealth,

and wealth over intelligence [5].

We extend Efron’s notion to the case, raised by David Stork, in which on each roll of

the full set of n dice each die is more likely to roll greater than the average of the set on the

roll than to roll less than that average. Informally, where Efron studied the case where each

die need beat merely one other die, in our case each die beats the average of all the dice.


249

i

i

“master” — 2012/11/15 — 11:23 — page 250 — #264i

i

i

i

i

i


In a specific statistical sense, then, each die is “better than the set average.” We call these

dice Lake Wobegon in honor of the children in the mythical Minnesota town chronicled by

humorist Garrison Keillor, “Where all the women are strong, all the men are good looking,

and all the children are above average” [4].

Notation and definitions

We define the Lake Wobegon dominance of a die in a set of n dice as the probability the

die rolls greater than the set average on that roll minus the probability the die rolls less

than that set average. We further define the Lake Wobegon dominance of the set to be the

dominance of the set’s least dominant die. We prove that such paradoxical dominance is

bounded above by .n � 2/=n regardless of the number of sides s on each die and the

maximum number of pips p on a side. A set achieving this bound is called Lake Wobegon

optimal. We give a constructive proof that Lake Wobegon optimal sets exist for all n � 3,

if one is free to choose s and p. We also show how to construct minimal optimal sets, that

is, a set that requires the smallest range in the number of pips on the faces.

We use the terms s-sided die/dice to refer to integer-valued, multinomial random vari-

ables where the probability of each value is of the form k=s, where k and s are natural

numbers and k � s. Furthermore, the integer value on each side—the number of its pips—

is in the set f1; 2; : : : ; pg, and is possibly the same on different sides. We denote the num-

ber of pips on the sides in non-decreasing order, for instance two s D 6; p D 7 dice are:

f1; 2; 3; 4; 4; 7g and f2; 2; 4; 5; 5; 6g. Occasionally we use the alternative notation of listing

the unique number pips followed by the probability with which they occur in parenthesis.

Using this notation, the two dice are

�

1

�1

6

�

; 2

�1

6

�

; 3

�1

6

�

; 4

�1

3

�

; 7

�1

6

��

and

�

2

�1

3

�

; 4

�1

6

�

; 5

�1

3

�

; 6

�1

6

��

:

Since we consider physical dice as random variables, we consider them distinguish-

able if and only if their set of pip values differ; that is, we do not care about the ar-

rangement of pips on the physical die. The number of distinguishable dice, according

to our convention, is the number of different sets of p objects chosen s at a time with

replacement, that is,�sCp�1

s

�

. A roll of a set of dice is an independent uniformly dis-

tributed random selection of the integer pip value of each die in the set. Let Xi be the

pip values of the individual dice in a roll, and let A denote the set average A on a given

roll. For each die Xi in a set, we define its individual Lake Wobegon dominance, Di ,

as the probability that Xi is greater than the average A minus the probability that it is

smaller:Di D P ŒEi > 0� � P ŒEi < 0�;

where

Ei D nXi �nX

j D1

Xj :

Much as with Efron’s nontransitive dice, we do not care by how much a die is above or

below the set mean, just the probability of those events. The dominance of a die may be

negative.

i

i

“master” — 2012/11/15 — 11:23 — page 251 — #265i

i

i

i

i

i

34. Lake Wobegon Dice 251

We define the Lake Wobegon dominance of a set of dice to be the minimum, D, of the

Lake Wobegon dominance of the individual dice in the set: D D mini Di . It is possible

that the Lake Wobegon dominance of a set of dice is negative.

Finally, a set of dice is said to be Lake Wobegon if its Lake Wobegon dominance D is

greater than zero. Thus, a set of dice is Lake Wobegon if the probability that each die in

the set rolls greater than the set’s average on that roll, A, is greater than the probability that

the die rolls less than A.

A simple example of a Lake Wobegon set is ff1; 2; 2g; f1; 2; 2g; f1; 2; 2gg. Here n D 3,

s D 3, p D 2 and the probability each die (i D 1; 2; 3) exceeds the group average on a roll

is P ŒEi > 0� D 10=27, the probability it is less than the average is P ŒEi < 0� D 8=27 and

the probability it equals the average is P ŒEi D 0� D 9=27 (as can be seen by enumerating

the sn D 27 combinations of faces.) Thus the dominance of each die is Di D 2=27 > 0,

and the dominance of the set is D D 2=27 > 0.

The Lake Wobegon property does not imply nontransitive dominance: the set ff1; 2; 2g,

f1; 2; 2g, f1; 2; 2gg is Lake Wobegon, but clearly not nontransitive, since the dice are iden-

tical. Conversely, a nontransitive set need not be Lake Wobegon, as illustrated by the set

ff1; 1; 6; 6; 8; 8g; f3; 3; 5; 5; 7; 7g; f2; 2; 4; 4; 9; 9gg, shown in Figure 34.1.

If a > 0, it is straightforward to verify that the affine transformation x ! ax C b on

the pip values preserves the Lake Wobegon dominance of each die, as well as the Lake

Wobegon dominance of the set. Such transformations, then, yield a natural partition of

the set of finite sets of multinomial random variables into equivalence classes. Thus, in

the reminder of this paper, and without loss of generality, we demand that a 1 appear

somewhere in every set of dice.

Lake Wobegon optimality

We begin by finding an upper bound for the Lake Wobegon dominance D of a set com-

prised of n dice.

Optimality bound An upper bound for the Lake Wobegon dominance of a set of n

dice is .n � 2/=n.

Proof. In every roll at least one die will be below average, unless all dies equal the average.

(The latter case does not improve the Lake Wobegon dominance of any die above 0, so it

cannot improve the Lake Wobegon dominance of the set above 0 either.) Thus an upper

bound of the Lake Wobegon dominance of the set occurs when in every roll only one die

is below average, n � 1 are above average, and each die is below average in exactly 1=n of

the rolls. This yields an upper bound for Lake Wobegon dominance of

n � 1

n� 1

nD n � 2

n:

A set of dice is Lake Wobegon optimal if the upper bound n�2n

is achieved. We have

proved also that if a set of dice is Lake Wobegon optimal, then exactly one die is below

average on each roll and every die is below average equally often. The next result shows

that Lake Wobegon optimality is achievable.

i

i

“master” — 2012/11/15 — 11:23 — page 252 — #266i

i

i

i

i

i


Optimality existence For n � 3 the set of n dice:

die 1 W˚

pmax � 1 .1/

die 2 W˚

pmax

�12

�

; pmax � n�

12

�

die 3 W˚

pmax

�23

�

; pmax � n.n � 1/�

13

�

� � �die n W

˚

pmax

�n�1

n

�

; pmax � n.n � 1/n�2�

1n

�

;

(34.1)

where

pmax D 1 C n.n � 1/n�2; (34.2)

is Lake Wobegon optimal.

Proof. We prove every die in the set is below the roll average with probability 1=n and

otherwise above the roll average.

Since dice are independent random variables, the probability that die i rolls its lowest

side and all die k, k > i roll their highest side is the product of the probabilities of each

die rolling the given side, with the odds given in parentheses in (34.1),

P�

pi D pmax � n.n � 1/i�2 and pk D pmax for k > i�

D 1

i

i

i C 1� � � n � 1

nD 1

n: (34.3)

Let j < i; j ¤ 1 be the die showing the next lowest value in the roll besides die i . (The

special cases when i D 1 or j D 1 are analogous.) Showing that die j is above average

proves that all dice except for i are above average. The largest mean that this roll can have

occurs when all dice except for i and j show their largest side. This average value is:

A D 1

n

�

.pmax � 1/ C�

pmax � n.n � 1/j �2�

C�

pmax � n.n � 1/i�2�

C .n � 3/pmax

�

D pmax � .n � 1/j �2 � .n � 1/i�2 � 1

n:

We subtract A from the pip value of die j , pmax � n.n � 1/j �2, to confirm that die j is

indeed above the average of the roll:

pmax � n.n � 1/j �2 � A D �.n � 1/j �1 C .n � 1/i�2 C 1

n� 1

n:

Thus all n � 1 dice are above average on this roll, except the single die i .

We show next that the Lake Wobegon optimal set just produced is minimal in the sense

that in any optimal set of n dice at least one side shows a value greater than or equal to

pmax.

Minimal pips For fixed n, every Lake Wobegon optimal set has at least one side show-

ing the pip value pmax D 1 C n.n � 1/n�2 or greater.

i

i

“master” — 2012/11/15 — 11:23 — page 253 — #267i

i

i

i

i

i


Proof. For die i , let li ; ui ; for i 2 f1; : : : ; ng be the minimum (respectively maximum)

number of pips on a side. Moreover let the dice be sorted so that i < j implies li � lj . For

each die i , we define its spread, �i , as

�i D ln � li C 1: (34.4)

Our proof proceeds as follows: First we show that maxj uj > ln. This proves that the

spread of a die bounds the difference between the maximal number of pips in any side and

the number of pips in the smallest side of die i . Afterwards, proving that �1 � pmax � 1

completes our proof.

To verify maxi ui > ln consider the roll fu1; : : : ; un�1; lng. Since the dice are Lake

Wobegon optimal, only one die can be below the average of the roll. Assume die .j ¤ n/

is that one die. This would imply uj < ln which would, in turn, imply that in any roll, die

n would be higher than die j , and thus die n could never be the sole die below the average.

Such a case is not possible because the set is Lake Wobegon optimal, thus die n must be

the one below the average and u1; : : : ; un�1 > ln, that is

minfu1; : : : ; un�1g � ln C 1: (34.5)

We complete the proof by computing a lower bound on �1 recursively. First we compute a

lower bound on �n�1 then compute a lower bound on �i ; i 2 f1; : : : ; n� 2g as a function

of �iC1.

Initial case. Bound on �n�1

Consider the roll fu1; : : : ; un�2; ln�1; lng. The only die below average is die n � 1 since

ln�1 � ln < u1; : : : ; un�1. Thus, since die n is above average, using (34.5) and (34.4) we

obtain the following:

nln > u1 C � � � C un�2 C ln�1 C ln

> .n � 2/.ln C 1/ C ln�1 C ln D nln C .n � 1/ � .ln � ln�1 C 1/;

hence

�n�1 D ln � ln�1 C 1 > n � 1;

or

�n�1 � n: (34.6)

Recursive step. Bound on �i ; i 2 f1; 2 ; : : : ; n � 2g as a function of �iCi

Consider the roll fu1; : : : ; ui�1; li ; liC1; uiC2; : : : ; ung. The only die below average is die

i since li � liC1 � ln � u1; : : : ; un. Thus since die i C 1 is above average, and ln � un,

using (34.5) and (34.4) yields a lower bound on �i as a function of �iCi :

nliC1 > u1 C � � � C ui�1 C li C liC1 C uiC2 C � � � C un

> .n � 2/.ln C 1/ C li C liC1 D liC1 C .n � 1/.ln C 1/ � .ln � li C 1/;

hence

�i D ln � li C 1 > .n � 1/.ln � liC1 C 1/ D .n � 1/�iC1;

i

i

“master” — 2012/11/15 — 11:23 — page 254 — #268i

i

i

i

i

i


or

�i � .n � 1/�iC1: (34.7)

Solving the recursion for �i defined by (34.6) and (34.7) yields

�i � n.n � 1/n�i�1;

and thus, by (34.2), we conclude

�1 � n.n � 1/n�2 D pmax � 1:

Figure 34.2 plots n�2n

and pmax. For large n, n�2n

approaches 1, so for large n, every

die in an optimal set is nearly always above average.

n 2

n

n(n – 1)n–2 + 1

3 4 5 6 7 8 90.

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1.

100

101

102

103

104

105

106

107

108

109

1010

n

n2

n

pm

ax

Figure 34.2. The maximum value of Lake Wobegon dominance of a set .n� 2/=n (left, linear axis)

and the maximum number of pips on any side in the set, pmax (right, logarithmic axis).

If it is not required that all dice in a set have the same number of sides, then having one

die with one side, another with two sides, up to one die with n sides is sufficient to have a

Lake Wobegon optimal set that is minimal. Another way to obtain the required probabilities

is with rotating game wheels. (See Figure 34.3.) There the angles of the sectors correspond

to probabilities, and the number in each sector is the payoff.

Minimal Lake Wobegon optimality

Figure 34.4 shows the optimal set of dice ff6; 6; 6; 6; 6; 6g,f4; 4; 4; 7; 7; 7g,f1; 1; 7; 7; 7; 7gg.

It is the unique optimum set out of the 115,264,382 distinguishable and non-equivalent sets

of three, six-sided dice with up to seven pips per side. It also has three remarkable minimal

properties:

Property 1: No set with fewer than than three dice is Lake Wobegon (and hence Lake

Wobegon optimal), so n D 3 is minimal.

Proof. With only two variables, n D 2, the Lake Wobegon dominance is at most zero

because on every roll either both variables are equal to the average, or one variable is

i

i

“master” — 2012/11/15 — 11:23 — page 255 — #269i

i

i

i

i

i


n D 5

320316

321

241

321

1

321321

301

n D 6

37503745

3751

3721

3751

3601

3751

3001

3751

1

3751

n D 7

54 43254 426

54 433

54 391

54 433

54 181

54 433

52 921

54 433

45 361

54 433

1

54 433

Figure 34.3. Lake Wobegon Optimal game wheels. Each wheel is divided into sectors. The proba-

bilities of the dark-colored sections are 11 ; 1

2 ; 13 ; : : : ; 1

n .

above the average and one variable below the average. Thus the sum of the Lake Wobegon

dominance of both variables is zero, and hence the Lake Wobegon dominance of the set is

non-positive.

Property 2: No set of dice with fewer than seven pips per side is Lake Wobegon optimal,

so p D 7 is minimal.

Proof. The fact that n D 3 is minimal together with the requirement that a side shows at

least pmax pips, show that p D 1 C n.n � 1/n�2 D 7 is minimal.

Property 3: No set of three dice with fewer than six sides is Lake Wobegon optimal, so

s D 6 is minimal.

Proof. If a set of n dice is Lake Wobegon optimal, then exactly 1=n of all roll combinations

have each individual die below the roll average. The total number of roll combinations is

sn. Thus sn � 0 mod n. This means for three Lake Wobegon optimal dice the minimal

number of sides is either three or six. A separate counting argument for each of the 27

possible rolls of three, three-sided dice shows that it is impossible to assign pip values such

that each die is below the roll average for exactly nine rolls.

6

6

6

6

6

7

4

7

7

44

1

1

7

7

776

Figure 34.4. Lake Wobegon optimal set of dice for n D 3; s D 6, and p D 7. This set is minimal in

three distinct senses: n, s and p.

i

i

“master” — 2012/11/15 — 11:23 — page 256 — #270i

i

i

i

i

i


Concluding remarks

We can invert the pip values on a Lake Wobegon set to produce a new set where the Lake

Wobegon dominance of each variable has opposite sign as in the original set. In particular,

inverting a Lake Wobegon optimal set produces a Lake Wobegon pessimal set where each

die rolls below average with probability .n � 1/=n.

An outstanding problem is to find the minimal optimal sets in the sense of the smallest

number of sides s required for a given n. We conjecture these sets may be those in (34.1)

that are also minimal in the p sense and that require s D LCMŒ1; 2; : : : ; n� sides.

Lake Wobegon dice, and Lake Wobegon distributions more generally, could provide

models for non-cooperative, non-zero-sum games [1]. We can imagine that colluding eco-

nomic agents might engage in policies that each agent properly predicts will provide indi-

vidual benefit above the group average.

Acknowledgment The authors would like to thank Brad Efron for words of encourage-

ment.

Bibliography

[1] D. Fudenberg and J. Tirole, Game theory, MIT Press, Cambridge MA, 1991.

[2] M. Gardner, On the paradoxical situations that arise from non-transitive relations,

Scientific American 231, (1974) 120–125; available at dx.doi.org/10.1038/

scientificamerican0574-120.

[3] C. M. Grinstead, On medians of lattice distributions and a game with two dice, Comb. Probab.

Comput. 6, (1997) 273–294; available at dx.doi.org/10.1017/S0963548397003015.

[4] G. Keillor, Lake Wobegon Days, Penguin, New York NY, 1990.

[5] K. O. May, Intransitivity, utility and the aggregation of preference patterns, Econometrica 22

(1954) 1–13; available at dx.doi.org/10.2307/1909827.

[6] R. P. Savage, Jr., The paradox of nontransitive dice, Amer. Math. Monthly 101, (1994) 429–436;


[7] T. L. Tenney and C. C. Foster, Non-transitive dominance, Math. Mag. 49, (1976) 115–120; avail-

able at dx.doi.org/10.2307/2690186.

dx.doi.org/10.1038/


scientificamerican0574-120


dx.doi.org/10.1017/S0963548397003015

http://dx.doi.org/10.1017/S0963548397003015

dx.doi.org/10.2307/1909827

http://dx.doi.org/10.2307/1909827

dx.doi.org/10.2307/2974903

http://dx.doi.org/10.2307/2974903

dx.doi.org/10.2307/2690186

http://dx.doi.org/10.2307/2690186

i

i

“master” — 2012/11/15 — 11:23 — page 257 — #271i

i

i

i

i

i

35Martin Gardner’s Mistake

Tanya Khovanova

Martin Gardner was amazingly accurate and reliable. That he made a mistake is simply

testimonial to the difficulty of this particular problem, which appeared in 1959 and was

republished in [3]:

Mr. Smith has two children. At least one of them is a boy. What is the proba-

bility that both children are boys?

Mr. Jones has two children. The older child is a girl. What is the probability

that both children are girls?

Mr. Jones has failed to stir any controversy, so we ignore him and his two children [5].

Instead, we concentrate on Mr. Smith. Here is the solution that Martin Gardner published

with the problem:

If Smith has two children, at least one of which is a boy, we have three equally

probable cases: boy-boy, boy-girl, girl-boy. In only one case are both children

boys, so the probability that both are boys is 1=3.

The corrected solution

Later Martin Gardner wrote a column titled “Probability and Ambiguity,” which was also

republished (in [4]). In this column Gardner corrects himself, writing “. . . the answer de-

pends on the procedure by which the information “at least one is a boy” is obtained.”

He suggested two potential procedures.

(i) Pick all the families with two children, one of which is a boy. If Mr. Smith is

chosen randomly from this list, then the answer is 1=3.

(ii) Pick a random family with two children; suppose the father is Mr. Smith. Then

if the family has two boys, Mr. Smith says, “At least one of them is a boy.” If he

has two girls, he says, “At least one of them is a girl.” If he has a boy and a girl

he flips a coin to choose one or another of those two sentences. In this case the

probability that both children are the same sex is 1=2.

Thus, the original problem without a specified procedure is ambiguous.


257

i

i

“master” — 2012/11/15 — 11:23 — page 258 — #272i

i

i

i

i

i


More procedures

Let us call procedure (i) “boy-centered,” because from the start we know that we are talking

about boys. Correspondingly, we call (ii) “gender neutral.”

Gardner wanted to emphasize that the problem is ambiguous. For him it was enough

to show two different procedures leading to different answers. However, there are many

procedures. Here are two more that demonstrate the full range of ambiguity.

(iii) Suppose that Mr. Smith wants to brag about his sons and will always mention

as many as he can. In this case the procedure might be the following: If he has

two boys, he says, “I have two boys.” If he has one son, he says “At least one of

them is a boy.” In this case the answer to the problem is 0.

(iv) Suppose Mr. Smith dislikes boys, and wants to de-emphasize the number of

boys he has. In this case the procedure might be the following: If he has two

boys, he says, “At least one of them is a boy.” If he has a boy and a girl, he says,

“I am the proud father of a girl.” In this case the answer is 1.

We challenge the reader to invent a procedure for any given answer between 0 and 1.

Tuesday-child problem

Fast-forward to 2010. In a talk at the 9th Gathering for Gardner, Gary Foshee posed the

following problem:

I have two children. One is a boy born on a Tuesday. What is the probability

that I have two boys?

This is Martin Gardner’s Two-Children problem with an extra twist. Before discussing the

solution, let us agree on some basic assumptions:

� Sons and daughters are equally probable—not exactly true, but a reasonable approx-

imation.

� Twins do not exist. Not only is the proportion of twins in the population small,

but, because they are usually born on the same day, twins might complicate the

calculation.

� All days of the week are equally probable birthdays. While this isn’t actually true—

for example, assisted labors are unlikely to be scheduled for weekends—it is a rea-

sonable approximation.

History repeats itself

Just as occurred with the classical Two-Children problem, many mathematicians have been

fighting for the wrong solution. This is their argument:

Each child can be one of two genders and can be born on one of seven days of the week.

Thus, gender and day present 14 equally probable cases for each child. That, in turn, makes

each two-children family belong to one of 196 equally probable cases. When we restrict

all possible cases to the given information that one child is a boy born on a Tuesday, we

get 27 equally probable cases. We can divide these cases into several groups, as follows:

i

i

“master” — 2012/11/15 — 11:23 — page 259 — #273i

i

i

i

i

i

35. Martin Gardner’s Mistake 259

� 7 cases, where the first child is a son born on a Tuesday and the second child is a

daughter.

� 7 cases, where the second child is a son born on a Tuesday and the first child is a

daughter.

� 6 cases, where the first child is a son born on a Tuesday and the second child is a son

not born on a Tuesday.

� 6 cases, where the second child is a son born on a Tuesday and the first child is a son

not born on a Tuesday.

� 1 case, where both children are sons born on a Tuesday.

These are a total of 27 equally probable cases, 13 of which correspond to two sons.

Thus the probability must be 13=27.

This incorrect solution was widely published, for example, it appeared in Devlin’s An-

gle [1]. But, like Martin Gardner, Devlin corrected himself in his next column [2].

Procedures

The ambiguity that Martin Gardner found in the Two-Children problem is also present in

the Tuesday-Child problem. To resolve the ambiguity, we need to specify the procedure by

which the information was obtained. Here we present just four procedures. The calculations

are based on 196 equally probable cases for different combinations of gender and the day

of the week.

Gender-neutral, day-of-the-week-neutral procedure In this scenario, a father of two

children is picked at random. He is instructed to choose a child by flipping a coin. Then he

has to provide true information about the chosen child in the following format: “I have a

son/daughter born on a Mon/Tue/Wed/Thu/Fri/Sat/Sun.” If his statement is, “I have a son

born on a Tuesday,” what is the probability that the other child is also a son?

The solution is the following: A father has two daughters in 49 cases. Such a father

will make the above statement with probability zero. A father has a son and a daughter

in 98 cases, and will produce the above statement with probability 1=14: with probability

1=2 the son is chosen over the daughter and with probability 1=7 Tuesday is the birthday.

A father has two sons in 49 cases, and he will make the statement with probability 1=7.

The father of two sons is twice as likely to make the statement as the father of a son and a

daughter, but there are half as many such fathers. Thus the probability is 1=2 that the other

child is also a son. This is the most symmetric scenario and produces the most symmetric

answer.

Boy-centered, day-of-the-week-neutral procedure A father of two children is picked

at random. If he has two daughters, he is sent home and another father is picked until one

is found with at least one son. If he has just one son, he is instructed to provide true infor-

mation on his son’s day of birth. If he has two sons, he has to choose one at random. His

statement will be, “I have a son born on a Mon/Tue/Wed/Thu/Fri/Sat/Sun.” If his statement

i

i

“master” — 2012/11/15 — 11:23 — page 260 — #274i

i

i

i

i

i


is, “I have a son born on a Tuesday,” what is the probability that the other child is also

a son?

The solution is as follows. A father has a son and a daughter in 98 cases, and will

produce the statement above with probability 1=7. If he has two sons (49 cases), the prob-

ability of the statement above will likewise be 1=7. The father of two sons is exactly as

likely to make the statement as the father of a son and a daughter, but there are half as

many such fathers. Thus the probability is 1=3 that the other child is also a son.

This scenario corresponds to the original procedure leading to the first solution of Mar-

tin Gardner’s Two-Children problem. Unsurprisingly, the answer is the same.

Boy-centered, Tuesday-centered procedure Now let us consider the third scenario. A

father of two children is picked at random. If he doesn’t have a son born on a Tuesday,

he is sent home and another father is picked at random until one who has a son born on a

Tuesday is found. He is instructed to tell you, “I have a son born on a Tuesday.” What is

the probability that the other child is also a son?

Here is the solution. A father of a boy and a girl has a son born on a Tuesday in 14

cases. He will make the statement in question with probability 1. A father of two sons has

a son born on a Tuesday in 13 cases. He too is guaranteed to make the statement. Thus the

probability is 13=27 that the other child is a son.

This procedure corresponds to the procedure many mathematicians assume while solv-

ing the Tuesday-Child problem. This implicit assumption is the source of the erroneous

solution.

Gender-neutral, Tuesday-centered procedure For completeness let us consider a fourth

scenario. A father of two children is picked at random. If he doesn’t have a child who is

born on a Tuesday, he is sent home and another father is picked at random until one who

has a child born on a Tuesday is found. He is instructed to tell you, “I have a son/daughter

born on a Tuesday.” If both of his children were born on Tuesdays, he has to pick one at

random. If his statement is, “I have a son born on a Tuesday,” what is the probability that

the other child is also a son?

Here is the solution. A father of two daughters will have a child born on a Tuesday in

13 cases. He makes the statement in question with probability 0. A father of a boy and a

girl has a child born on a Tuesday in 26 cases. The probability that he makes the statement

is 1=2. A father of two sons will have a son born on a Tuesday in 13 cases. The probability

that he makes the statement is 1. The father of two sons is twice as likely to make the

statement as the father of a son and a daughter, but there are half as many such fathers.

Thus the probability is 1=2 that the other child is also a son.

In this procedure we added an additional constraint on the families with two children—

that a child was born on a Tuesday—that’s independent of gender. Not surprisingly, the

answer is 1=2.

Back to Gardner

Many people I argue with don’t want to listen to me. They refer to Martin Gardner as the

final authority in support of their wrong solution. Gardner was a great thinker; he corrected

i

i

“master” — 2012/11/15 — 11:23 — page 261 — #275i

i

i

i

i

i

35. Martin Gardner’s Mistake 261

his mistake. I urge those who do not agree with me to trust Martin Gardner. Revisit and

rethink this problem together with him.

Acknowledgment I am grateful to all my friends and colleagues who discussed the prob-

lem with me and supported me in writing about it for my blog [6, 7, 8]. I am especially

grateful to Alexey Radul [9] and Peter Winkler [10] who contributed essays on the subject

to my blog. I also thank Sue Katz and Julie Sussman, P.P.A., for editing.

Bibliography

[1] K. Devlin, Probability can bite, (2010); available at www.maa.org/devlin/

devlin_04_10.html.

[2] K. Devlin, The problem with word problems, (2010); available at www.maa.org/devlin/

devlin_05_10.html.

[3] M. Gardner, Nine more problems in The Second Scientific American Book of Mathematical Puz-

zles and Diversions, Univ. of Chicago Press, 1987, 152–164.

[4] ——, Probability and ambiguity in The Second Scientific American Book of Mathematical Puz-

zles and Diversions, Univ. of Chicago Press, 1987, 220–232.

[5] T. Khovanova, Mr. Jones, (2010); available at blog.tanyakhovanova.com/?p=313.

[6] ——, A son born on Tuesday, (2010); available at blog.tanyakhovanova.com/?p=221.

[7] ——, Sons and Tuesdays, (2010); available at blog.tanyakhovanova.com/?p=233.

[8] ——, A Tuesday quiz, (2010); available at blog.tanyakhovanova.com/?p=247.

[9] A. Radul, Shannon entropy rescues the Tuesday child, (2010); available at

blog.tanyakhovanova.com/?p=254.

[10] P. Winkler, Conditional probability and “He said, she said”, (2010); available at

blog.tanyakhovanova.com/?p=234.

www.maa.org/devlin/

http://www.maa.org/devlin/devlin_04_10.html

devlin_04_10.html


www.maa.org/devlin/


devlin_05_10.html


blog.tanyakhovanova.com/?p=313

http://blog.tanyakhovanova.com/?p=313











i

i

“master” — 2012/11/15 — 11:23 — page 262 — #276i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page 263 — #277i

i

i

i

i

i

VIIOther Aspects of Martin

Gardner

i

i

“master” — 2012/11/15 — 11:23 — page 264 — #278i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page 265 — #279i

i

i

i

i

i

36Against the Odds

Martin Gardner

Luther Washington was a friendly, shy, intelligent boy, the oldest of five children who lived

with their parents in Butterfield, Kansas. He was one of sixteen African Americans who

attended Butterfield Central High. His father owned a small grocery store in the town’s

black district. His mother cooked dinners for one of the town’s bankers.

For some reason, which his parents never understood, Luther was fascinated by num-

bers. One day, when he was ten, he surprised his father by saying: “Dad, I’ve discovered

an easy way to tell if a big number can be divided by four or eight and not have anything

left over.”

The family was having breakfast. “Tell us about it,” said Mr. Washington.

“Well,” said Luther, “if the last two digits”—his grade school teacher had taught him

to distinguish digits from numbers—“can be divided by four with nothing left over, the big

number also can be divided by four. Otherwise, it can’t. And if the big number’s last three

digits are a product of eight, so is the big number.”

“Interesting,” said Mrs. Washington. “But I can’t see how that could be of any use to

anybody.”

“I don’t care whether it’s any use or not. I just think it’s neat.”

“I wish you wouldn’t waste so much time on such things,” said Mr. Washington. “All

the math you’ll ever need in life is knowing how to add, subtract, multiply, and divide, and

how to make change.”

“Maybe so,” said Luther. “But there’s something wonderful and mysterious about num-

bers.”

In high school Luther was quick to learn algebra and geometry even though his teacher,

Miss Perkins, seemed bored by what she taught. The fact is, she really was bored. Her ma-

jor, at a Kansas teacher’s college, was American history, but when one of the school’s two

math teachers quit to take a better paying job as a used car salesman, Miss Perkins was

asked to take over his classes. She reluctantly agreed. By studying the textbook carefully

each night she managed fairly well to stay ahead of her students.

Miss Perkins was a devout Baptist. She felt no hostility toward blacks in general. After

all, they were God’s children and she believed that all persons are equally precious in the

eyes of the Lord. On the other hand, she was convinced that the intelligence of blacks was

considerably below that of whites. It had to be something in their genes.


265

i

i

“master” — 2012/11/15 — 11:23 — page 266 — #280i

i

i

i

i

i

266 Part VII. Other Aspects of Martin Gardner

Among the forty-two students in her geometry class, five were black—three girls and

two boys. They would sit silently at their desks, their dark eyes fixed on hers. They rarely

asked a question. When they did it was usually, in her opinion, a foolish one. If she asked

who had solved a problem, she almost never called on a black student whose hand was

raised.

One morning, after Miss Perkins had demonstrated on the blackboard how Euclid

proved that the three angles of a triangle add to a straight angle, Luther was the only

student to wave a hand. His classmates looked as if they were falling asleep.

“Yes, Luther, do you have a question?”

“No. But I thought of a simpler way to prove the theorem.”

Miss Perkins raised her eyebrow and peered at Luther over her bifocals. “Would you

like to come to the blackboard and explain it to the class?”

Luther stood up and walked to the blackboard. He erased the diagram Miss Perkins had

chalked. After drawing a large triangle with irregular sides, he took a pencil from his shirt

pocket and pressed it against the blackboard along the inside of the triangle’s base.

“As you can see,” he said, “the pencil points to the right.” Luther slid the pencil along

the triangle’s base until its point touched the triangle’s right corner. Keeping the point fixed

at the corner, he swung the pencil clockwise until it was alongside the triangle’s right side.

He then moved the pencil upward along the side until its point touched the triangle’s top

corner. Again he rotated the pencil clockwise until it paralleled the triangle’s left side. He

slid the pencil down to the triangle’s lower left corner, and rotated the pencil once more. It

was now back where it started.

“See,” Luther said as he turned his head to face the class, “the pencil now points left.

Obviously it has rotated a hundred and eighty degrees. This proves that the three angles

add to a straight angle.”

The students were all now wide awake. About a dozen, including the three black girls,

clapped. The others sat silent, waiting for Miss Perkins’ reaction.

Miss Perkins looked a bit flustered. She knew how to demonstrate the theorem by

folding over the corners of paper triangles, but Luther’s method caught her by surprise.

“I’m not sure that’s really a proof,” she said finally.

“Well, it is,” said Luther. “The pencil acts like a vector. The best thing about it is you

can apply it to any polygon, with any number of sides. It proves the inside angles must add

to a multiple of a straight angle. And the same thing is true about any polygon’s exterior

angles.”

Miss Perkins wasn’t sure she grasped what Luther was saying. “What are vectors and

exterior angles?” she thought. Fortunately, the bell rang and the class was over before she

could make any other comments. “That boy is going to be a trouble maker,” she said to

herself. “He’s not nearly as smart as he thinks he is.”

A few days later, during a study period, Miss Perkins walked by Luther’s desk. He

had been doodling on a sheet of paper. Scattered over the sheet were numerous ticktacktoe

patterns, their cells filled with X’s and O’s.

“You’re supposed to be working on your assignment.”

“I’ve already finished it, Miss Perkins. I’m trying to figure out whether the first player

can always win at ticktacktoe if both players do their best. Maybe the game’s a draw. I do

know the second player can never win, no matter how big the field is or how many marks

he has to get in a straight line.”

i

i

“master” — 2012/11/15 — 11:23 — page 267 — #281i

i

i

i

i

i

36. Against the Odds 267

Miss Perkins snatched up the sheet and crumpled it into a ball. “When you’re in my

class,” she said sternly, “I expect you to work on mathematics and nothing else. If you’ve

finished your assignment, come to my desk and I’ll give you some more problems.”

“Yes, Miss Perkins.”

To Miss Perkins’ chagrin, Luther had near perfect scores on every test she gave. But

she was so annoyed by what she considered his “uppity” way of talking to her that she gave

him a B grade for the semester.

In the middle of each school year Butterfield Central High held contests to decide who

among its seniors excelled in each of the school’s subjects. Winners were given a medal

and made members of an honorary society called the B Club.

Luther entered the math contest. Miss Perkins was annoyed when she learned that he

was the first student in the B Club’s history to solve every problem correctly. The test had

been designed and the papers graded by the school’s principal.

Members of the B Club met each Friday, after school. Miss Perkins was the club’s

sponsor. When Luther showed up at the next meeting, she told him he had been appointed

chairman of the club’s peanut-sacking committee.

“Who appointed me?” Luther asked.

“I did.”

“What am I supposed to do?”

“You’ll be in charge of a group of club members. They meet after school every Friday

to make sure all the peanuts are put in paper sacks to sell at Saturday basketball games.”

“How long will that take?”

“About two hours.”

Luther groaned. Time after school was precious. He had found in the town’s library a

book titled Mathematics and the Imagination. It was an exciting book, and he was eager to

get home to finish it.

“I don’t want to sack peanuts,” Luther said. “Am I allowed to resign from the B Club?”

Miss Perkins was so furious when Luther stalked out of the room that next Monday she

sent him to the principal’s office with a note saying he had refused to sack peanuts, and

that he had been very unpleasant in the way he had spoken to her.

Now it so happened that the principal knew a great deal more about mathematics than

Miss Perkins. Math had been his undergraduate major at the University of Michigan before

he went on to do graduate work in education.

The principal had been impressed by the unusual ways Luther solved the problems

on the test he had prepared. After discussing several mathematical topics with Luther, he

realized he was talking to a young man with deep insights into mathematical structures.

“Are you planning on college?”

“No, sir. My parents can’t afford it. Dad wants me to work in his grocery store. When

he’s too old to run the store, he expects me to take it over.”

“How are your other grades?”

“Not so good. I barely passed Latin with a D. And I got a C in English. I didn’t under-

stand any of Shakespeare’s plays. I was so bored by Ivanhoe that I couldn’t finish it.”

“Hmm.” The principal stared at the ceiling for a while before he asked: “Have you

applied for any scholarships?”

i

i

“master” — 2012/11/15 — 11:23 — page 268 — #282i

i

i

i

i

i


Luther shook his head. “My folks don’t want me to go to college. They say it would be

a big waste of time. Dad says if I had a college degree it would only make me overqualified

for any kind of work I could get.”

“What do you make of Miss Perkins as a teacher?”

Luther hesitated for a moment. “She’s okay, I guess. I don’t think she likes mathemat-

ics, though. I know she doesn’t like me.”

Luther told the principal about the time she had grabbed the sheet on which he was

analyzing ticktacktoe, and how she told him he had to work only on math during her study

periods.

The principal laughed, then sighed and looked solemn. “She means well. But she

should have known that ticktacktoe is based on combinatorics and graph theory. Did you

finally solve the game?”

“I did when I got home. I drew a complete game tree. It’s not hard to do if you take

advantage of symmetries like rotations and reflections. The game’s a draw if both sides

make their best moves.”

“That’s right. You may be interested to know that Frank Harary, a well known graph

theorist, generalized the game in an interesting way. Have you head about polyominoes?”

Luther shook his head.

“They’re shapes made by attaching unit squares along their edges. In standard ticktack-

toe the goal is to be the first to form what is called a straight tromino. That’s three little

squares in a row. Harary varied the goal by making it any kind of polyomino on a field of

any size. For example, suppose the goal is to be the first to form the only other tromino,

three squares joined to make a right angle. It’s called the bent tromino. If a bent tromino is

the goal, who wins? Or is it also a draw?”

“I wish I’d thought of that,” said Luther.

“Harary also analyzed reverse games for low-order polyominoes. Who wins if the first

player forced to form a specified polyomino is the loser?”

“I can see how that could get very complicated,” said Luther. “I did think of analyzing

the reverse game for standard ticktacktoe. It’s a draw, too. The first player can’t win but he

can force a draw by going first in the center.”

The principal was interested. “Can you show me how he does that?”

For the next ten minutes Luther and the principal played reverse games of ticktacktoe.

“The chairman of the math department at Stanford University is an old friend,” said the

principal, as he pushed the sheets of paper to one side of his desk. “We were frat brothers at

Michigan. If you don’t mind, I’d like to send him your contest answers. Would you object

if Stanford offered you a full scholarship that included room and board?”

“No.”

“Would your parents object?”

“I don’t know.”

The chairman of Stanford’s math department needed only a glance at Luther’s un-

conventional ways of solving the test problems to know that he wanted Luther in his de-

partment. Even before he had a long phone conversation with the principal of Butterfield

Central High, the chairman had started the process for Luther to be given a full freshman

scholarship.

i

i

“master” — 2012/11/15 — 11:23 — page 269 — #283i

i

i

i

i

i

36. Against the Odds 269

To Luther’s vast surprise and joy, his mother and father agreed to let him accept the

scholarship. He almost cried when his father embraced him and said how proud he was!

Luther’s scholarship was renewed each year, and on through graduate work until he

got his doctorate. Three years later he was made a full professor at Stanford. By then he

had published a dozen technical papers, and was much in demand as a lecturer at math

conventions.

Fast forward ten years to 1970. Luther was awarded the Fields Medal, the equivalent in

mathematics of a Nobel prize. It was for his brilliant solution of a long-standing conjecture

in Diophantine analysis.

A news release about the award went to the Butterfield Times. The paper ran the story

with Luther’s picture on the front page.

Miss Perkins had retired from teaching after she married Mr. Jones, the school’s bas-

ketball coach. During breakfast, her husband handed her the newspaper’s front section.

“It says here that Professor Washington grew up in Butterfield and attended Central

High. He must have been in one of your classes.”

Miss Perkins adjusted her spectacles and studied Dr. Washington’s picture for a full

minute. “I don’t recognize him,” she said, frowning. “There were lots of black boys in my

classes. They all looked so much alike. I wonder if he really deserved that prize. You know

how affirmative action works these days. Would he have been given that award if he’d been

white?”

Note Frank Harary’s generalization of ticktacktoe to polyomino patterns was the topic

of my Scientific American column, April 1979. The column is reprinted as Chapter 17 in

Fractal Music, Hypercards, and More (W. H. Freeman, 1992).

i

i

“master” — 2012/11/15 — 11:23 — page 270 — #284i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page 271 — #285i

i

i

i

i

i

37A Modular Miracle

John Stillwell

The following piece is one of a series of results that I called “modular miracles” in a 2001

article in the Monthly. It depends on an earlier miracle, which I must therefore briefly ex-

plain. An 1857 theorem of Kronecker [3] relates the famous modular function, j , to unique

prime factorization in the field Q.p

�D/, where D is a positive integer. Kronecker’s theo-

rem states that unique prime factorization holds for the integers � in Q.p

�D/ only when

j.�/ is an ordinary integer for each such � .

Take, for example, the Gaussian integers, a C bp

�1 for ordinary integers a and b.

These are the integers of the field Q.p

�1/, and they were shown to possess unique prime

factorization by Gauss in 1832. Gauss also found several other values of D for which

Q.p

�D/ has unique prime factorization, the largest of which was D D 163. (It is now

known that no larger such D exists.) The number .1 Cp

�163/=2 is an example of an

integer in this field. It then follows, by Kronecker’s theorem, that j..1 Cp

�163/=2/ is an

ordinary integer, and indeed

j..1 Cp

�163/=2/ D .�640320/3:

The function j is defined by geometric conditions that I will not explain here; it is only

necessary to know that j has a series expansion beginning

j.�/ D q�1 C 744 C 196884q C 21493760q2 C � � � ;

where q D e2i�� .

The Numerical Miracle

Hermite (1859) noticed a curious numerical consequence of Kronecker’s theorem on the

values of j.�/:

e�p

163 D 262537412640768744;

(an integer!) correct to 12 decimal places [2].

This little known discovery of Hermite was exploited by Martin Gardner in an amus-

ing hoax edition of his column in Scientific American. On 1 April 1975 Gardner [1]

Reprinted from The American Mathematical Monthly, Vol. 108, No. 1 (Jan. 2001), pp. 70–76.

271

i

i

“master” — 2012/11/15 — 11:23 — page 272 — #286i

i

i

i

i

i


announced—among several other “sensational discoveries that have somehow or another

escaped public attention”—that

e�p

163 D 262537412640768744 exactly. (1)

He gave the announcement an extra coat of varnish by claiming that it settled a conjecture

of Ramanujan, supposedly made in a paper of 1914. The paper cited by Gardner does

indeed discuss near integers of the form e�p

n, but without claiming that they could be

integers, and without mentioning e�p

163. Still, in the pocket calculator days of 1975, it

was pretty hard to decide whether e�p

163 is an integer or not.

Its true value, as Hermite and Ramanujan knew, is the integer in (1) minus a very tiny

number (< 10�12).

In fact, putting � D .1 Cp

�163/=2 in q D e2i�� gives the tiny

q D ei��p

163 D �e��p

163;

and putting this q in

j.�/ D q�1 C 744 C 196884q C 21493760q2 C � � �

gives

j..1 Cp

�163/=2/ D �e�p

163 C 744 � tiny number;

and therefore

e�p

163 D integer � tiny number:

Bibliography

[1] M. Gardner, Mathematical Games, Scientific American 232 (April 1975), p. 126.

[2] C. Hermite, Sur la theorie des equations modulaires, C. R. Acad. Sci. Paris Ser. I Math. XLVIII

(1859) 940. Also in Oeuvres de Charles Hermite, Paris, Gauthier-Villars, 1905–17, vol. 2, pp.

38–82.

[3] L. Kronecker, Uber die elliptischen Functionen fur welche complexe Multiplication stattfindet,

Leopold Kronecker’s Werke, Chelsea Pub. Co., New York, 1968, vol. 4, pp. 179–183.

i

i

“master” — 2012/11/15 — 11:23 — page 273 — #287i

i

i

i

i

i

38The Golden Ratio—

A Contrary Viewpoint

Clement Falbo

Introduction

Over the past five centuries, a great deal of nonsense has been written about the golden

ratio, ˆ D 1Cp

52

, its geometry, and the Fibonacci sequence. Many authors make claims

that these mathematical entities are ubiquitous in nature, art, architecture, and anatomy.

Gardner [4] has shown that the admiration for this number seems to have been raised to

cult status. Fortunately, however, there have been some recent papers, including Fischler [2]

in 1981, Markowsky [7] in 1992, Steinbach [9] in 1997, and Fowler [3] in 1982, that

are beginning to set the record straight. For example, Markowsky, in his brilliant paper

“Misconceptions about the Golden Ratio,” speaking about ˆ, says:

Generally, its mathematical properties are correctly stated, but much of what

is presented about it in art, architecture, literature and esthetics is false or

seriously misleading. Unfortunately, these statements about the golden ratio

have achieved the status of common knowledge and are widely repeated. Even

current high school geometry textbooks . . . make many incorrect statements

about the golden ratio. It would take a large book to document all the misin-

formation about the golden ratio, much of which is simply repetition of the

same errors by different authors.

It is remarkable that prior to Fischler’s and Markowsky’s papers, there seemed to have

been no set standards for obtaining measurements of artwork. Often, a proponent of the

golden ratio will choose to frame some part of a work of art in an arbitrary way to create

the appearance that the artist made use of an approximation of ˆ. Markowsky shows an

example in which Bergamini [1] arbitrarily circumscribes a golden rectangle about the

figure of St. Jerome in a painting by Leonardo da Vinci, cutting off the poor fellow’s arm

in order to make the picture fit.

It is frequently asserted that the golden ratio occurs in nature as the shape of spi-

rals in sea shells. We can easily test this claim by first providing a protocol for measur-

ing the spirals. One requirement should be to allow for some error in the measurements.


273

i

i

“master” — 2012/11/15 — 11:23 — page 274 — #288i

i

i

i

i

i


Markowsky [7] suggests an error bar of ˙2%, which seems to be quite adequate. Measur-

ing under this protocol, we find that spirals in sea shells do not generally fit the shape of

the golden ratio. This is true despite the numerous articles on the Internet and elsewhere, in

which pictures apparently have been stretched to fit the ˆ ratio—“stretching the truth”—so

to speak.

The golden ratio is associated with the Fibonacci sequence in a very simple way. The

sequence is an example of a quadratic recursive equation sometimes used to describe var-

ious scientific and natural phenomena such as age-structured population growth. In order

to define the general quadratic recursive formula, let x0, x1, p, and q be fixed positive

numbers, and for any integer n � 2, define xn as

xn D pxn�1 C qxn�2: (38.1)

Murthy [8] provides a number of theorems for this general recursive equation. It is clear

that many of the features that are proclaimed to be unique to the Fibonacci sequence are,

indeed, common to all second-order recursive equations. For example,

limn!1

xnC1

xn

D r; (38.2)

where r is the positive root of the quadratic equation

x2 � px � q D 0; (38.3)

obtained by assuming that xn D xn is a solution to (38.1). One of our objectives in this

paper is to show that if q D 1 and r is the limit in (38.2), then the pair .r; p/ has all of

the geometric and algebraic properties that are often ascribed as being unique to the pair

.ˆ; 1/. For example, we have r � p D 1=r corresponding to the property ˆ � 1 D 1=ˆ:

For equation (38.1) to be useful in describing aged-structured population growth in

plants and animals, the coefficients p and q must be determined by some niche or fecundity

properties of the organism being studied. In other applications, such as phyllotaxy, we may

use a second-order recursive equation such as (38.1) to predict and explain the evolution

of leaf placement on a stem in terms of maximizing the gathering of sunlight. However,

we should not expect the complexities of natural systems to yield to the easy-to-compute

Fibonacci sequence, and there seems to be no unbiased evidence favoring the Fibonacci

sequences over all other possible sequences. If one expends great effort in looking only

for this special sequence, then it may be perceived, whether or not it is there. This is

succinctly illustrated in terms of statistical analysis by Fischler [2], who shows that careless

computations and misused formulas produce the golden number when it isn’t there.

In a popular new book, Livio [6] draws upon the information developed in Markowsky

and others to discuss the protocol violations that are the source of claims that ˆ occurs in

classical architecture, such as the Parthenon and the Egyptian pyramids. Livio presents a

well-written explanation of misleading claims concerning these classics, as well as various

paintings and other art work. Livio calls the advocates for these claims the “golden num-

berists.” It seems, however, that he believes that certain constructions (such as Kepler’s

triangle) or arithmetic equations (such as a continued fraction representation of ˆ) are sig-

nificant and unique enough for him to subtitle his book “The Story of Phi, the World’s

Most Astonishing Number.”

i

i

“master” — 2012/11/15 — 11:23 — page 275 — #289i

i

i

i

i

i

38. The Golden Ratio—A Contrary Viewpoint 275

Origins of the golden ratio

The golden ratio is the solution to a problem given by Euclid (c. 300 BC) in his Elements,

Book VI, Proposition 30:

To cut a given finite line in extreme and mean ratio.

That is, given a segment AE, find the point B for which AE=AB D AB=BE. Euclid had

already solved this in a previous theorem (Book II, Proposition 11). We show his construc-

tion in Figure 38.1 in order to generalize it later.

D C F

A M B E

1

1

1_2

1_2

Figure 38.1. The mean proportion AE W AB WW AB W BE.

Start with the unit square ABCD and let M be the midpoint of AB. Construct the line

segment MD. Draw a circle with center at M and radius MD so that it cuts�!AB at the point

E . So, MD D ME. In modern notation, we have the lengths,

ME Dp

5

2; MB D

1

2; and BE D

p5 � 1

2:

Now, since AE D AB C BE, or AE D 1 C BE, we can write: AE D .1 Cp

5/=2. Thus, we

can easily show that AE=AB D AB=BE.

The length AE, .1 Cp

5/=2, is denoted by ˆ, and is called the golden ratio, or the

divine proportion. In the above figure, the rectangle AEFD is called the golden rectangle.

The history of the golden ratio pre-dates Euclid. As early as 540 BC, the Pythagoreans had

studied it in their work with the pentagon. We discuss some of the ratios that appear in the

pentagon and all other odd polygons later.

The golden ratio ˆ can be used to construct a beautiful logarithmic spiral, shown in

Figure 38.2. This graph can be obtained by fitting the polar equation � D bec� to selected

points on the golden rectangle. We can, however, also plot a logarithmic spiral inscribed in

a rectangle with a ratio other than ˆ. Figure 38.3 shows a spiral with a 1.33 to 1 ratio.

i

i

“master” — 2012/11/15 — 11:23 — page 276 — #290i

i

i

i

i

i


0.8

–0.2

0.2

0.4

0.6

–1 –0.5 0.5

(1.6182 = Ratio)

Figure 38.2. A spiral with the golden ratio.

0.8

–0.2

0.2

0.4

0.6

–0.5 0.5

–0.4

–0.75 –0.25 0.25 0.75 1

(1.333 = Ratio)

Figure 38.3. A spiral with a 1:33 to 1 ratio.

Spirals in sea shells

Basically, the two types of sea shells that many of us are familiar with are the cephalopod

(head-foot) and the gastropod (stomach-foot), or with apologies to my biologist friends,

the octopus and the snail. The nautilus, a cephalopod, is an octopus in a shell that consists

of a series of chambers, each sealed from the previous one. This animal lives in the latest

chamber with its eight tentacles sticking out. In Figure 38.4 is a photo that I took of a

longitudinal section of a nautilus.

The nautilus is definitely not in the shape of the golden ratio. Anyone with access to

such a shell can see immediately that the ratio is somewhere around 4 to 3. In 1999, I

measured shells of Nautilus pompilius, the chambered nautilus, in the collection at the

California Academy of Sciences in San Francisco. The measurements were taken to the

nearest millimeter, which gives them error bars of ˙1 mm. The ratios ranged from 1:24 to

1:43, and the average was 1:33, not ˆ (which is approximately 1:618). Using Markowsky’s

˙2% allowance for ˆ to be as small as 1:59, we see that 1:33 is quite far from this ex-

Figure 38.4. A digital photograph of a longitudinal section of a nautilus.

i

i

“master” — 2012/11/15 — 11:23 — page 277 — #291i

i

i

i

i

i


panded value of ˆ: It seems highly unlikely that there exists any nautilus shell that is within

2% of the golden ratio, and even if one were to be found, I think it would be rare rather

than typical.

A ratio of different color,p

2

A spiral of a different ratio is in the shape of the “silver ratio”p

2 to 1. (Some authors call

1 Cp

2 the silver ratio.) A spiral based onp

2 is considerably closer to the shape of the

nautilus than ˆ is (see Figure 38.5).

0.8

–0.2

0.2

0.4

0.6

–0.4

(1.41421 = Ratio)

–1 –0.5 0.5 1

Figure 38.5. A spiral with the ratiop

2 to 1.

It is well known that the golden rectangle can be subdivided into subrectangles (each

similar to the original) forming a kind of “spiral” of smaller and smaller similar rectangles.

This interesting property happens to be shared by every rectangle (except the square). For

example, if you take ap

2-by-1 rectangle and cut it in two (in the natural way), you get

two rectangles with sides in the same ratio. Continuing this, we get similar rectangles as

shown in Figure 38.6.

Figure 38.6. A rectangle and subrectangles withp

2 to 1.

i

i

“master” — 2012/11/15 — 11:23 — page 278 — #292i

i

i

i

i

i


(This rectangle was actually used in a field study in biology. In an article in Science,

Harte, Kinzig, and Green [5] used Figure 38.6 when they wanted to determine a relation be-

tween species distribution and geographical area. They were interested in studying regions

of varying sizes, but they wanted to maintain similarity in the shape of the region so as not

to introduce a bias against rare species. One of their protocols was to start with a rectangle

of known dimensions and then to generate a sequence of smaller similar subrectangles in

which to collect samples. They chose the rectangle with the ratio ofp

2 to 1:)

The extreme mean ratio, generalized

Figure 38.6 illustrates the fact that the golden rectangle is not the only one that can be sub-

divided into similar subrectangles. Indeed, Fowler shows that we can construct an infinite

sequence of extreme mean ratios. What he does is to generalize the standard procedure for

constructing ˆ. He starts with a unit square ABCD and for any positive integer n, takes

the point Mn on�!AB, at a distance of n=2 from A. Then the segment MnD has lengthp

n2 C 4=2. Now rotating MnD about Mn he gets a circular arc that cuts�!AB at a point

E . (Mn is between A and E .) The length of AE is n Cp

n2 C 4=2, which he calls the

noem, or nth order extreme mean. That is, .MnE C AMn/=AB D AB=.MnE � AMn/.

If n D 1, the noem is ˆ, or the first order extreme mean, while if n D 2, the noem

is 1 Cp

2: In Figure 38.7a, n D 3, AM3 D 32

, and G is a point so that M3 is the

midpoint of the segment AG. AE D .3 Cp

13/=2, (the third order extreme mean) and

GE D .�3 Cp

13/=2, so AE=AB D AB=GE. One property of the nth order extreme mean

is that noem � 1=noem D n.

D C F

A M3 E 1 G B

noem

1

p_2

D C F

A M E B

p

r

p_2

(a) (b)

Figure 38.7. (a) The noem, with n D 3; (b) The poem, for any p > 0.

We can obtain the same result for any positive real number p, not just for an integer n,

as follows. After Fowler, we call our ratio the pth order extreme mean, or the poem. Its

construction is shown in Figure 38.7b. Start with any p-by-1 rectangle ABCD, AB D p.

Select the midpoint M of AB and construct the segment MD; its length isp

p2 C 4=2.

Rotate the radius MD about M until it cuts the line�!AB at the point E , giving the segment

AE with length .p

p2 C 4 C p/=2, denoted by r . Then BE D AE � AB; this length is

i

i

“master” — 2012/11/15 — 11:23 — page 279 — #293i

i

i

i

i

i


.�p Cp

p2 C 4/=2, which is 1=r: Thus, r � p D 1=r: The number r is the poem. Notice

that r is a root of the quadratic equation (38.3) when q D 1. Completing the rectangle

BEFC, as shown, we have that AE=EF D FE=BE.

Self-similar decompositions of rectangles of any ratio

The poem, defined above, has a simple interpretation. For any r > 1, and any rectangle R

with a ratio of r to 1, we can divide R into two smaller rectangles, one of which has the

same ratio r to 1, and this in turn can be so divided etc., as in Figure 38.8. The coordinates

of the vertices A4, A5, A6, are given below.

A4 D�

r � 1

rC 1

r3; 0

�

A5 D�

r � 1

r;

1

r2� 1

r4

�

A6 D�

r � 1

rC 1

r3� 1

r5;

1

r2

�

The vertices A1; A2; A3; : : : converge to a “center-point,”

�r3

1 C r2;

1

1 C r2

�

:

In order to show this, we look at the abscissas of the corners. First notice that the corner

points alternate in returning to a previous x value before moving on to a new x value. So,

skipping those that return to a previous abscissa, we have:

Point A2 A4 A6 . . .

Abscissa r � 1r

r � 1r

C 1

r3 r � 1r

C 1

r3 � 1

r5 . . .

The alternating geometric series

r � 1

rC 1

r3� 1

r5C � � � converges to

r3

1 C r2:

Similarly, the corner points return to a previous ordinate in an alternating pattern and they

converge to 1=.1 C r2/. Notice that, for any r (including ˆ, of course) the center point

determined by these infinite series is the intersection of the line through .0; 1/ and .r; 0/

and the line through the points .r � 1=r; 0/ and .r; 1/.

Other geometric properties

Here are some other common geometric properties that are not unique to ˆ.

Spirals An equiangular spiral can be drawn through the points A1; A2; A3; : : : in Figure

38.8. Just use the center point�

r3

1 C r2;

1

1 C r2

�

i

i

“master” — 2012/11/15 — 11:23 — page 280 — #294i

i

i

i

i

i


(0,1)

A1=(0,0) A4

A6

A5

A2=(r–1/r,1)(r, 1)

(r, 0)

A3=r ,1_

r2

Figure 38.8. A rectangle of dimensions r to 1, with similar subrectangles.

and fit a polar equation for the logarithmic spiral through any two of these points. It is a

general property of all such spirals that the tangents to the spiral at any point make a fixed

angle with the rays from the center point.

Sum of the areas of all of the subrectangles If r > 1 and p D r � 1=r , then the sum

of the areas,

r C 1

rC 1

r3C � � � ;

of all the rectangles in Figure 38.8 is r2=p, and in the special case in which r D ˆ, the

sum is ˆ2 because p D 1.

Kepler’s triangle 4ACD in Figure 38.9 is called Kepler’s triangle; here is how it is

constructed. Let r > 1 and again let p D r � 1=r ; then r2 D rp C 1: (As before, p D 1

when r D ˆ:) Now construct a semicircle with rp C 1 as the diameter AC. Mark the point

B so that AB D rp and BC D 1. Erect a line at B that intersects the circle at D. Then

the length of DC (shown as x here) is r . Apparently this was thought to be amazing when

Kepler did it, but it is not so amazing—it is simply the Euclidean construction for finding

D

A C rp B 1

x

Figure 38.9. Kepler’s triangle.

i

i

“master” — 2012/11/15 — 11:23 — page 281 — #295i

i

i

i

i

i


the square root of a segment AB, that is DB D prp. The hypotenuse of the right triangle

DBC isp

rp C 1 D r .

Pentagons, heptagons, nonagons, . . .

Regular polygons with an odd number of sides have some interesting properties related

to the ratios we have been discussing. Steinbach makes the case for other polygons in his

paper; he opens with this:

One of the best-kept secrets in plane geometry is the family of ratios of di-

agonals to sides in the regular polygons. So much attention has been given to

one member of this family, the golden ratio ˆ in the pentagonal case, that the

others live in undeserved obscurity. But the wealth of material that pours from

the pentagon—proportional sections, recursive sequences, and quasiperiodic

systems—can be matched wonder-for-wonder by any other polygon : : : :

It is well known that the golden ratio occurs in the regular pentagon with unit sides. In

Figure 38.10, the length AC D 2 cos.�=5/, which is ˆ:

1.6

18

108˚ B

A

E

C D

1

1

Figure 38.10. The pentagon and the golden ratio, AC � AB D 1=AC.

Much of the publicity given to ˆ has come from the special place that the pentagon had

among the Pythagoreans. In this case, the equation AC � AB D 1=AC says that ˆ � 1 D1=ˆ. But, if we look at other regular odd polygons, we see that some pair of diagonals also

satisfy the same relationship. Indeed, it is an interesting exercise to prove the following

result.

Let P be any regular odd polygon with n sides, n � 5. If r is the length of a longest

diagonal of P and p is the length of a second longest diagonal, then r � p D 1=r .

Example Behold the heptagon! See Figure 38.11. The diagonals AD and AC have the

relationship described:

AD D 4 cos2.�=7/ � 1 � 2:247 and AC D 2 cos.�=7/ � 1:802

i

i

“master” — 2012/11/15 — 11:23 — page 282 — #296i

i

i

i

i

i


2.2

47

102.86˚

B

A

EC

D

1

128.57˚

G

F

1

11.8

02

Figure 38.11. The heptagon and two related ratios. If AD D r , then AC D r � 1=r .

That is, AD � AC D 1=AD:

Algebraic properties of xn D pxn�1 C xn�2

We want to consider a modest generalization of the Fibonacci sequence, equation (38.4)

below. The paper by Murthy presents substantially more general recursive equations that

include (38.4).

Again, let r > 1 and p D r � 1r

. Let x0 D 1, x1 D 1, and for n � 2, let

xn D pxn�1 C xn�2: (38.4)

Then the general term of the sequence is

xn D c1rn C c2

�

�1

r

�n

; (38.5)

where

r D p Cp

p2 C 4

2;

�1

rD p �

p

p2 C 4

2;

and

c1 D 2 � p Cp

p2 C 4

2p

p2 C 4; c2 D p � 2 C

p

p2 C 4

2p

p2 C 4:

Because of the way that p is defined, r and �1r

are the roots of the quadratic equation

x2 � px � 1 D 0: (38.6)

A few terms of this sequence, obtainable from either equation (38.4) or equation (38.5),

are

x2 D p C 1;

x3 D p2 C p C 1;

x4 D p3 C p2 C 2p C 1:

i

i

“master” — 2012/11/15 — 11:23 — page 283 — #297i

i

i

i

i

i


If p ¤ 1, this is not Fibonacci. We now point out several properties that any such

solution r of (38.6) would have and that the quadratic equation is sufficient to imbue r

with these properties.

Subtracting a number to get the reciprocal From the fact that ˆ is the solution to

equation (38.6) when p D 1, ˆ � 1 D 1=ˆ: Thus, ˆ is the only number whose reciprocal

can be found by subtracting 1. But for any other positive number p, if r is the positive

root of (38.6), then r � p D 1=r , and r is the only number whose reciprocal can be

found by subtracting p. This is simply a relation between the coefficients and the sum of

the roots of any quadratic equation. For example, if p D 12

, and r is the positive root

of x2 � 12x � 1 D 0, then r D .1 C

p17/=4. Subtracting 1

2gives .�1 C

p17/=4, the

reciprocal of .1 Cp

17/=4:

Square of nth term It is easy to prove by induction that for any three consecutive terms

of the sequence in equation (38.4), the square of the middle term is p more or p less than

the product of the other two terms. The Fibonacci special case says that given any three

consecutive terms, the square of the middle one is 1 more or 1 less than the product of the

other two.

Continued fraction and endless square root Two other relations between r and p are

r D p C 1

p C 1

pC 1pC��

and r Dr

1 C p

q

1 C pp

1 C p : : : ;

which specialize when p D 1 to

ˆ D 1 C1

1 C 1

1C 11C��

and ˆ D

r

1 Cq

1 Cp

1 C � � � :

A word in favor of ˆ

We do recognize that the number ˆ, like e and � , has many interesting properties. The

sense in which this is true comes from the fact that ˆ shows up when we try to simplify

certain formulas, such as setting p D 1 in the continued fraction given above. This is

the same sense in which the number e is interesting. If you want to find the derivative of

logb.x/, b > 0 and b ¤ 1; from the definition

limh!0

logb.x C h/ � logb.x/

h;

after some work, you get

1

xlogb

limh!0

�

1 C h

x

�x=h!

:

The indicated limit is e; so we have 1x

logb.e/: Now, it is natural to select b D e; so, the

desired derivative is 1x

: Using e for the logarithm base just makes the formula simpler.

i

i

“master” — 2012/11/15 — 11:23 — page 284 — #298i

i

i

i

i

i


Conclusions

We have shown that in certain instances, the claim that the golden ratio has a special place

among numbers as a valid description of nature is unsupported. Furthermore, it has been

well refuted that this ratio is somehow exceptionally pleasing and that it occurs frequently

in art and architecture. For example, from taking measurements, we find that there is no

basis for saying that ˆ is the ratio that naturally occurs in sea shells. In particular, there is

no basis for the assertion that it occurs in the nautilus. I disagree with Livio’s implication

that ˆ is “The World’s Most Astonishing Number.” The interesting properties of ˆ as a

positive root to the quadratic equation (38.6) (with p D 1) are matched by the positive

root to (38.6) for any positive number, p ¤ 1. The wonderful geometric properties of ˆ

as an extreme mean are matched by the other means discussed here. Also, finding ˆ, as

2 cos.�=5/, in the pentagon is exciting, but no more so than finding 4 cos2.�=7/ � 1 in the

heptagon. Perhaps, ˆ deserves to be included in a list along with e, � , and other numbers

because it can be used to simplify certain formulas. In that sense, it is interesting, perhaps

even very interesting, but not entirely astonishing.

Acknowledgment I would like to thank my wife, Jean, for her help with the many rough

draft versions of this paper. Also, I appreciate the amount of time and energy expended by

the editor of this journal in assisting me in the development of this paper.

Bibliography

[1] D. Bergamini, Mathematics, Time Incorporated, 1963.

[2] R. Fischler, How to find the golden number without really trying, Fibonacci Quart. 19 (1981)

406–410.

[3] D. H. Fowler, A generalization of the golden section, Fibonacci Quart. 20 (1982) 146–158.

[4] M. Gardner, The cult of the golden ratio, Skeptical Inquirer 18 (1994) 243–247.

[5] J. Harte, A. Kinzig, and J. Green, Self-Similarity in the distribution of abundance of species,

Science (April 9, 1999) 334–339.

[6] M. Livio, The Golden Ratio: The Story of Phi, the World’s Most Astonishing Number, Broadway

Books, 2002.

[7] G. Markowsky, Misconceptions about the golden ratio, College Math. J. 23 (1992) 2–19.

[8] P. V. S. Murthy, Generalizations of some problems on Fibonacci numbers, Fibonacci Quart. 20

(1982) 65–66.

[9] P. Steinbach, Golden fields: A case for the heptagon, Math. Mag. 70 (1997) 22–31.

i

i

“master” — 2012/11/15 — 11:23 — page 285 — #299i

i

i

i

i

i

39Review of The Mysterious Mr.

Ammann by Marjorie Senechal

Philip D. Straffin

In 1975, Martin Gardner wrote in his Mathematical Games column in Scientific American

about Roger Penrose’s discovery of a set of tiles that do not tile the plane periodically,

but do tile it nonperiodically. Gardner did not show pictures because Penrose was waiting

for a patent. In response to that column, Gardner received a letter from Robert Ammann,

who wrote, “I am also interested in nonperiodic tiling, and have discovered both a set of

two polygons which tile the plane only nonperiodically and a set of four solids which fill

space only nonperiodically.” Ammann included pictures: he had independently discovered

Penrose tilings and generalized them to three dimensions. He went on to find five more sets

of planar tilings, discover the general organizing principle now known as Ammann bars,

and make many other contributions to the theory of nonperiodic tiling.

No one in the tiling community knew who Ammann was. He described himself as “an

amateur doodler” and he declined all invitations to meetings and conferences. In 1987,

Marjorie Senechal finally succeeded in tracking him down, arranging for him to meet Pen-

rose, Donald Coxeter, and John Conway, and even getting him to speak at two conferences.

After Ammann’s early death from a heart attack in 1994, she learned more about his life.

At age three, he was precociously brilliant. At age four, he stopped talking and withdrew

into his own world. He struggled through childhood and adolescence, “off the charts in-

tellectually but impossible emotionally.” During the years of his tiling discoveries he was

living in a motel and supporting himself by sorting mail in a post office. His mother said

that the mathematical meetings were the high point of his life: “No one else reached out

to him.” Perhaps we should draw the moral that it is important to encourage mathemati-

cal talent wherever we find it, and to welcome into the mathematical community all able

contributors.


285

i

i

“master” — 2012/11/15 — 11:23 — page 286 — #300i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page 287 — #301i

i

i

i

i

i

40Review of PopCo by Scarlett Thomas

Martin Gardner

This is the only novel I know of that is saturated with mathematics. Alice Butler, the 29-

year-old narrator of the story, works for the world’s third-largest toy company, PopCo. Her

job is to create ideas for toys and games, especially products that will appeal to teen-age

girls.

Orphaned when young, Alice is raised by loving and loved grandparents, both of whom

are first-rate mathematicians. Her grandfather writes a monthly column on recreational

math, said to be similar to the one I wrote for Scientific American. He is obsessed with

trying to decode the notorious Voynich manuscript, now widely believed to be a hoax.

Although he fails in this project, he succeeds in decoding a cipher-text that reveals the

location of a buried treasure. The treasure is on a bird sanctuary, and because digging it up

would disturb the birds, he never reveals its location. He does, however, leave cipher clues

to Alice. The grandmother’s obsession on the other hand is trying to solve the Riemann

hypothesis.

Alice has been strongly influenced by her grandfather’s fondness for mathematical

play, and she constantly interrupts her narrative with accurate descriptions of a variety

of mathematical topics. These include Conway’s computer game of Life, Newcomb’s fa-

mous paradox involving decision theory, trapdoor ciphers, and the Monty Hall problem

of the three doors. In addition to recreational topics, Alice provides excellent accounts of

Cantor’s alephs, prime factorization, Godel’s proofs, and even the Riemann hypothesis!

In addition to being extremely knowledgeable about math, Alice plays both Go and

chess well. There is an amusing flashback to her secondary school days when she partici-

pated in a tournament run by her loathed sexist math teacher—the winner of the tournament

gets to play him. Alice of course wins, and when the teacher and Alice sit down to play,

he pompously tells her that the game probably won’t last five minutes. Alice recognizes

his opening as one used by Kasparov in a game that she had once studied intensively (in

fact, her grandfather had written one of his columns about it), so she knows exactly how to

meet the attack. Ten minutes later the teacher is defeated. He is so humiliated that he takes

extensive sick leave.

The novel concludes with a double climax. Expert in cipher-breaking—one of Al-

ice’s toys is a code-breaking kit that includes Thomas Jefferson’s cylinder with rotating

wheels—Alice cracks her grandfather’s cipher about the buried treasure. The treasure,

worth billions, is unearthed, and Alice generously gives it all to the bird sanctuary.


287

i

i

“master” — 2012/11/15 — 11:23 — page 288 — #302i

i

i

i

i

i


The second climax, much more improbable, concerns Alice’s recruitment into a myste-

rious secret organization with a revolutionary agenda—but I’ll not reveal that here in case

a reader is moved to read the book.

Although I enjoyed the story very much, there were a couple of things that bothered

me. One was the frequency with which the characters, especially Alice’s lover Ben, seem

unable to speak without peppering their conversation with meaningless f-words. Do em-

ployees in British toy firms actually talk that way?

The other was Alice’s apparent low opinion of mainline doctors. I can understand how

she might be a vegan, avoiding all food coming from animals, but her devotion to home-

opathy is much harder to comprehend. One wonders whether this reflects the author’s opin-

ion or is just written into her lead character. Surely Alice would know that homeopathic

remedies dilute a drug so thoroughly that at most two or three molecules, if any, remain.

(Somehow, the water is supposed to “remember” the drug’s properties.)

Other features of the book include a table for writing a Vigenere cipher, a postscript

listing the primes less than 1000, a clever crossword puzzle, and a recipe for a cake. Part 1

opens with a quotation from a paper by psychologist Stanley Milgram, Part 2 with an brief

excerpt from Paul Hoffman’s biography of Paul Erdos, and Part 3 starts with one of Piet

Hein’s best “grooks”:

A bit beyond perception’s reach

I sometimes believe I see

That life is two locked boxes, each

Containing the other’s key.

Scarlett Thomas writes captivatingly; her novel is not easy to put down.

i

i

“master” — 2012/11/15 — 11:23 — page 289 — #303i

i

i

i

i

i

41Superstrings and Thelma

Martin Gardner

Several years ago I was a graduate student at the University of Chicago. I was working on

my doctorate in physics, about possible ways to test superstring theory, when my brother

in Tulsa died suddenly from a heart attack. Both parents had earlier passed away. After

the funeral I drove past my past, marveling at the enormous changes that had taken place

since I grew up there. I drove by the red brick building, now an enormous warehouse, that

had once been Tulsa Central High. My grades in history, Latin, and English lit were low,

but I was good in math and had a great physics teacher. He was mainly responsible for my

majoring in physics after a scholarship took me to the University of Chicago.

While I was having dinner at a popular restaurant on the corner of Main and Sixth

streets, the waitress stared at me with a look of surprise. “Are you Michael Brown?”

“That’s me,” I said. She smiled and held out a hand. “I’m Thelma O’Keefe. We were

in the same algebra 101 class.”

We shook hands.

“You won’t remember me,” she said. “I was fat in those days, and shy, and not very

pretty.”

“That’s hard to believe,’ I said. “You look gorgeous now.”

“Well, thank you, kind sir,” she said, smiling. “You were a whiz at algebra. Do you

remember when you caught Mr. Miller in a mistake he made on the blackboard, and how

embarrassed he was?”

“I remember. He was a miserable teacher. I think he hated math.”

“I know I hated it,” Thelma said.

“I’m sorry to hear that. Math can be exciting and beautiful when it’s taught properly.”

After Thelma brought my receipt and credit card, I said, “Any chance I could see you

after work? Maybe you could steer me to a late night bar where we could chat about old

times?”

“I’m free at eleven,” she said.

I followed Thelma’s car to a pleasant little bar on the outskirts of town, near where she

lived. She was divorced, she told me, and had a boy of ten who was probably asleep in her

apartment. The bar served only beer. She said she didn’t drink anything with more alcohol

than beer. Her ex, she added, had an alcohol problem. I didn’t press her for details. Instead,

I fear I talked too much about myself, and mainly about superstrings.

Reprinted from Math Horizons, Vol. 18, No. 1 (Sept. 2010), pp. 6–9.

289

i

i

“master” — 2012/11/15 — 11:23 — page 290 — #304i

i

i

i

i

i


I did my best to explain that strings were inconceivably tiny loops, like rubber bands,

that vibrated at different rates. Their frequencies generated all the properties of the fun-

damental particles, such as electrons and quarks. The simplest string vibration produces

gravitons, conjectured particles that carry gravity waves.

“You have nice dark eyes,” she interrupted.

“Thanks,” I said. “Your eyes aren’t so bad either.”

I tried hard to explain how a famous physicist named Ed Witten had generalized strings

to what he called M-theory. The M stands for membranes, or branes for short. A superstring

is a brane of one dimension. Other branes have higher dimensions. Our universe, I said,

has ten or eleven dimensions, of which six or seven are squeezed into compact tiny spheres

that are attached to every point in our spacetime.

“I didn’t understand a word you spoke,” Thelma said. “It sounds nutty to me. Do you

believe all that?”

“Mostly. I think strings are for real, but I’m not so sure about Witten’s membranes.”

“Is everything made of strings?” Thelma asked.

“Everything.”

“And what are the strings made of?”

“Nothing. They’re just pure mathematical structures.”

“If the universe isn’t made of anything,” she said, “how come it exists?”

“Good question. Nobody knows.”

“Well, maybe God knows,” she said.

Outside the bar, standing by our cars, Thelma invited me to her apartment for some

coffee.

“No,” I said. “I really can’t stay another minute. I have to be up early to catch a plane

to Chicago. It was great getting to know you.”

“Will I see you again?”

“It would be a pleasure,” I said.

Like a fool I failed to ask for her address and phone number. We shook hands. She said

goodbye, then startled me with a quick kiss on the mouth.

Almost a year drifted by. My thesis was published as a book by the University of

Chicago Press. My suggestions for testing string theory were taken seriously by most

stringers. There was hope that some of the tests might actually be made by a new atom

cruncher under construction in Switzerland. There were vague rumors about a Nobel prize.

I was unable to get Thelma out of my head. I kept thinking of her wonderful smile, and

how good she smelled. It wasn’t perfume. Was it her hair? I thought about her more than I

thought about superstrings!

The University of Oklahoma, at Norman, hired me as an assistant professor. A suburb

of Oklahoma City, Norman is only a few hours drive from Tulsa.

None of the waitresses at the restaurant where Thelma had worked knew what had

happened to her. She left her job six months before, and they hadn’t heard from her since.

No Thelma O’Keefe was in the Tulsa phone book. I drove back to Norman feeling

sad and frustrated. Should I hire a detective? The Norman yellow book had a long list of

“investigators” and two detective agencies.

I was planning to call one of the agencies when my telephone rang. It was Thelma!

“I heard you were asking about me,” she said.

i

i

“master” — 2012/11/15 — 11:23 — page 291 — #305i

i

i

i

i

i

41. Superstrings and Thelma 291

“Yes. How did you get my phone number?”

“It’s on the Internet. How are strings?”

“Not so good. It didn’t predict dark matter. It didn’t predict dark energy. It even failed

to pass one of my tests. Lots of stringers are starting to have doubts, including me.”

“If we meet again,” said Thelma, “don’t tell me about it.”

Further Reading

Two recent books attacking string/M theory as pseudoscience are Not Even Wrong, by

mathematician Peter Woit (Basic Books, 2006), and The Trouble with Physics, by Lee

Smolin (Mariner Books, 2007). See Chapter 18, “Is String Theory in Trouble?” in my

book, The Jinn from Hyperspace (Prometheus Books, 2008).

i

i

“master” — 2012/11/15 — 11:23 — page 292 — #306i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 11:23 — page 293 — #307i

i

i

i

i

i

IndexAbe, Gakuho, 163

Adamatzky, Andrew, 208

Aitken, A. C., 39

Almkvist, Gert, 188

Ammann, Robert, 285

Ascher, Marcia, 45

Ball, W. W. Rouse, 223

Bankoff, Leon, 3, 7

Bernhart, Frank, 164

Beverley, William, 154

Bodlaender, Hans, 73

Bojanov, Borislav, 188

Bolt, Brian, 159

Bottomley, Henry, 46

Brams, Steven, 73

Brandt, Jørgen, 188

Brooks, R. L., 143

Carroll, Lewis, vi, 58, 66

Carver, W. B., 26

Catalan, Eugene Charles, 119

Christ, Henry, 227

Chu, I. Ping, 131

Cipra, Barry, 176

Coeter, H. S. M., 3

Conway, John, 207, 285

Coxeter, H. S. M., 224, 285

da Vinci, Leonardo, 273

de Vasa, H. E., 155

Denef, Yann, 154

diophantine, 203, 269

Dodgson, Charles, 58

Dudeney, Henry, 162, 167

Efron, Bradley, 249

Eggleton, Roger, 159

Erdos, Paul, 4, 288

Eriksson, Henrik, 188

Euler, Leonhard, 20

Feynman, Richard, 87, 110

Fibonacci numbers, 51, 143, 274

Fields Medal, 269

Ford, L. R., 26

Foshee, Gary, 258

Four Color Theorem, 67, 73

Frederickson, Greg, 135

Friedman, Erich, 16

Fulves, Karl, 181, 224

Gale, David, 221

games

Go, 78, 207

KenKen, 173

Life, 207, 287

map coloring, 73

Nim, 45

sudoku, 173

ticktacktoe, 266

two-player Bulgarian solitaire, 191

two-player Monty, 236

Wythoff, 213

Garnett, F. M., 26

Gauss, Carl Friederich, 271

Gessel, Ira, 114

Gilbreath, Norman, 221

Gilks, Joe, 159

golden ratio, 123, 273

Golomb, Solomon, 38, 128, 136, 140, 150

Gomory, Ralph, 127

Gosper, Bill, 208

Grunbaum, Branko, 66, 150

Grabarchuk, Serhiy, 139

Graham, Ron, 188

Gutenmacher, Victor, 188

293

i

i

“master” — 2012/11/15 — 11:23 — page 294 — #308i

i

i

i

i

i

294 Index

Guthrie, Francis, 65

Hall, Monty, 223, 231, 287

Hamilton circuit, 68, 191

Harary, Frank, 268

Hardy, G. H., 54

Hein, Piet, 288

Hermite, Charles, 271

Hickerson, Dean, 208

High, Robert, 74

Hirayama, Akira, 163

Hoffman, Paul, 288

Hoggatt, Verner, Jr., 150

Huber, Greg, 33

Jaenisch, C. F., 154

Jefferson, Thomas, 287

Jelliss, George, 154

Jensen, Christopher, 132

Johnsonbaugh, Richard, 131

Jones, Kate, 132

Karatsuba, Anatolii Alexeevich, 188

Karp, Richard, 222

Keillor, Garrison, 250

Khodulev, Andrei, 16

Kim, Scott, 19

Kirstead, Friend, Jr., 225

Klamkin, Murray, 4

Knuth, Donald, 188, 222

Kronecker, Leopold, 271

Kumar, Awani, 155

Ligocki, Terry, 33

Lindgren, Harry, 135

Liouville, Joseph, 37

Loyd, Sam, 201

Mackay, Hughues, 154

magic square, 153, 160, 225

Mahler, Kurt, 38

Marlow, T. W., 154

Maxwell, Brian, 226

Mayrignac, Jean-Charles, 154

McIntosh, Harold, 99

McLean, Bruce, 103

Milgram, Stanely, 288

Miyamoto, Tetsuya, 173

Moseteller, Fred, 231

Murray, H. J. R., 154

Nelsen, Roger, 128

Nieuwland, Pieter, 25

Nobel Prize, 290

O’Beirne, T. H., 13

Ollerenshaw, Kathleen, 224

Online Encyclopedia of Integer Sequences,

46, 107, 217

Oskolkov, Konstantin, 188

Polya Award, 3

Palmatelli-Palmarini, Massimo, 232

partitions, 51, 53, 188

Penrose, Roger, 285

Petkov, Milko, 188

Pomerance, Carl, 150

Poniachik, Jaime, 38

Putnam Competition, 3

Rademacher, Hans, 54

Ramanujan, Srinivasa, 54, 272

Ransom, William, 202

Reiter, Harold, 163

Rendell, Paul, 208

Ritchie, David, 164

Roberts, T. S., 154

Robinson, Raphael, 11

Schattschneider, Doris, 11

Scherer, Karl, 150

Schreck, D. J. E., 25

Selvin, Steve, 231

Senechal, Marjorie, 285

Shapley, Lloyd, 74

Shasha, Dennis, 199

Sherman, Scott, 111

shuffle, 170, 221

Silver, Stephen, 208

Siu, Man-Keung, 184

Smith, C. A. B., 143

Smith, John Maynard, 231

i

i

“master” — 2012/11/15 — 11:23 — page 295 — #309i

i

i

i

i

i

Index 295

Starr, Norton, 133

state diagram, 182, 189, 197

Stertenbrink, Guenter, 154

Stone, Arthur, 87, 103, 109, 113, 143

Stork, David, 249

Susco, Barbara, 19

Theobald, Gavin, 14

Thiel, Von J. Christian, 162

Thomas, Scarlett, 287

Toom, Andrei, 188

tree

Calkin Wilf, 215

Monty Hall, 233

pat, 115

Stern Brocot, 217

ticktacktoe, 268

Tucker, Bryant, 87

Tukey, John, 87, 110

Tutte, William, 143

vos Savant, Marilyn, 232

Wainwright, Robert, 208

Wallis, John, 25

Wenzelides, Karl, 154

Willcocks, T. H., 155

Witten, Ed, 290

Wythoff, Willem Abraham, 213

i

i

“master” — 2012/11/15 — 11:23 — page 296 — #310i

i

i

i

i

i

i

i

“master” — 2012/11/15 — 15:44 — page 297 — #311i

i

i

i

i

i

About the Editors

Michael Henle is a professor of mathematics at Oberlin College in Oberlin, Ohio. He is

the author of several previous books including Which Numbers are Real? which was just

published by the MAA in 2012. Trained as a functional analyst, he has written as well on

combinatorial subjects and geometry. He is serving as editor of The College Mathematics

Journal through 2013.

Brian Hopkins is a professor of mathematics at Saint Peter’s University in Jersey City,

New Jersey. He won, with Robin Wilson, the 2005 George Polya Award, edited the 2008

MAA Notes volume Resources for Teaching Discrete Mathematics, and was given the

2011 MAA New Jersey Section Award for Distinguished College or University Teaching

of Mathematics. Much of his research stems from Bulgarian Solitaire, a topic popularized

by Martin Gardner. Hopkins will be the editor of The College Mathematics Journal from

2014 to 2018.

297

₪ in the twenty- first century

Documents