by Robin Thomas
Every planar map of connected countries can be colored using four colors in such a way that countries with a common boundary segment (not just a point) receive different colors. It is amazing that such a simply stated result resisted proof for one and a quarter centuries, and even today it is not yet fully understood. In this article I concentrate on recent developments: equivalent formulations, a new proof, and progress on some generalizations.
Brief history
The Four-Color Problem dates back to 1852 when Francis Guthrie, while trying to color the map of the counties of England, noticed that four colors sufficed. He asked his brother Frederick if it was true that any map can be colored using four colors in such a way that adjacent regions (i.e., those sharing a common boundary segment, not just a point) receive different colors. Frederick Guthrie then communicated the conjecture to DeMorgan. The first printed reference is by Cayley in 1878.
A year later the first “proof” by Kempe appeared; its incorrectness was pointed out by Heawood eleven years later. Another failed proof was published by Tait in 1880; a gap in the argument was pointed out by Petersen in 1891. Both failed proofs did have some value, though. Kempe proved the five-color theorem (Theorem 2 below) and discovered what became known as Kempe chains, and Tait found an equivalent formulation of the Four-Color Theorem in terms of edge 3-coloring, stated here as Theorem 3.
The next major contribution came in 1913 from G. D. Birkhoff, whose work allowed Franklin to prove in 1922 that the Four-Color Conjecture is true for maps with at most 25 regions. The same method was used by other mathematicians to make progress on the Four-Color Problem. Important here is the work by Heesch, who developed the two main ingredients needed for the ultimate proof — “reducibility” and “discharging”. While the concept of reducibility was studied by other researchers as well, the idea of discharging, crucial for the unavoidability part of the proof, is due to Heesch, and he also conjectured that a suitable development of this method would solve the Four-Color Problem. This was confirmed by Appel and Haken (abbreviated A&H) when they published their proof of the Four-Color Theorem in two 1977 papers, the second one joint with Koch. An expanded version of the proof was later reprinted in [1].
Let me state the result precisely. Rather than trying to define maps, countries, and their boundaries, it is easier to restate Guthrie’s 1852 conjecture using planar duality. For each country we select a capital (an arbitrary point inside that country) and join the capitals of every pair of neighboring countries. Thus we arrive at the notion of a plane graph, which is formally defined as follows.
A graph
While Theorem 1 presented a major challenge for several generations of mathematicians, the corresponding statement for five colors is fairly easy to see. Let us state and prove that result now.
Proof.
Let
If
For future reference it will be useful to sketch another proof of
Theorem 2. The initial part proceeds in the same manner until we reach
the one and only nontrivial step, namely, when we have a vertex
Equivalent formulations

Another attractive feature of the 4CT, apart from the simplicity of its formulation, is that it has many equivalent formulations using the languages of several different branches of mathematics. Indeed, in a 1993 article Kauffman and Saleur write: “While it has sometimes been said that the four-color problem is an isolated problem in mathematics, we have found that just the opposite is the case. The four-color problem and the generalization discussed here is central to the intersection of algebra, topology, and statistical mechanics.”
Saaty
[e1]
presents 29 equivalent formulations of the 4CT. In
this article let me repeat the most classical reformulation and then
mention three new ones. A graph is cubic if every vertex has degree
3, that is, is incident with precisely three edges. An edge
3-coloring of a graph
In this article let me repeat the most classical reformulation and
then mention three new ones. A graph is cubic if every vertex has
degree 3, that is, is incident with precisely three edges. An edge
3-coloring of a graph
The equivalence of Theorems 1 and 3 is not hard to see and can be
found in most texts on graph theory. To illustrate where the
equivalence comes from, let us see that Theorem 1 implies Theorem 3.
Let
Let me now describe three striking reformulations of the 4CT. The
first is similar but deeper than a result found by
Whitney
and discussed in
[e1].
Let
At this point interested readers might try to prove Theorem 4 before reading further. After all, it is only a statement about the vector cross product in a 3-dimensional space, and so it cannot be too hard. Or can it? Kauffman [e3] has also shown:
Let me clarify that Kauffman proves Theorem 4 by reducing it to the Four-Color Theorem, and so despite Theorem 5 he did not obtain a new proof of the 4CT.

Where does Theorem 5 come from? To understand it, we should think of
an association as a rooted tree in the natural sense. Given two
associations
This explains why the 4CT implies Theorem 4. To prove the converse,
one must show that it suffices to prove Theorem 3 for cubic graphs
For the next reformulation let
However, the following theorem of Bar-Natan is
surprising, regardless of what one grew up thinking about the relationship of
Like Theorem 4, Theorem 6 is deduced from the Four-Color Theorem, and hence Theorem 7 does not yield a new proof of the 4CT.
There is an easy hint as to why Theorem 7 holds.
Penrose has shown that
The last reformulation, in terms of divisibility, is due to Matiyasevich [e9].
In fact, Matiyasevich found the functions
Let us try to understand where this theorem
comes from. Let
, if and only if , if and only if , and if and only if .
It is not too difficult to show that every planar
graph arises as the planar graph
By Theorem 2 we can in the above theorem
confine ourselves to discrete maps
An outline
The work of Appel and Haken undoubtedly represents a major breakthrough in mathematics, but, unfortunately, there remains some skepticism regarding the validity of their proof. To illustrate the nature of those concerns, let me quote from a 1986 article by Appel and Haken themselves: “This leaves the reader to face 50 pages containing text and diagrams, 85 pages filled with almost 2500 additional diagrams, and 400 microfiche pages that contain further diagrams and thousands of individual verifications of claims made in the 24 lemmas in the main sections of text. In addition, the reader is told that certain facts have been verified with the use of about 1200 hours of computer time and would be extremely time-consuming to verify by hand. The papers are somewhat intimidating due to their style and length and few mathematicians have read them in any detail.”
A discussion of errors, their correction, and other potential problems may be found in the above article, in [1], and in F. Bernhart’s review of [1]. For the purpose of this survey, let me telescope the difficulties with the A&H proof into two points:
- part of the proof uses a computer and cannot be verified by hand, and
- even the part that is supposedly hand-checkable has not, as far as I know, been independently verified in its entirety.
To my knowledge, the most comprehensive effort to verify the A&H proof was undertaken by Schmidt. According to [1], during the one-year limitation imposed on his master’s thesis Schmidt was able to verify about 40 percent of part I of the A&H proof.
Neil Robertson, Daniel P. Sanders, Paul Seymour, and I tried to verify the Appel–Haken proof, but soon gave up and decided that it would be more profitable to work out our own proof. So we did and came up with the proof that is outlined below. We were not able to eliminate reason (1), but we managed to make progress toward (2).
The basic idea of our proof is the same as Appel and Haken’s. We
exhibit a set of 633 “configurations” and prove each of them is
“reducible”. This is a technical concept that implies that no
configuration with this property can “appear” in a minimal
counterexample to the Four-Color Theorem. It can also be used in an
algorithm, for if a reducible configuration appears in a sufficiently
connected planar graph
Birkhoff showed in 1913 that every minimal counterexample to the Four-Color Theorem is an “internally 6-connected” triangulation. In the second part of the proof we prove that at least one of our 633 configurations appears in every internally 6-connected planar triangulation (not necessarily a minimal counterexample to the 4CT). This is called proving unavoidability and uses the “discharging method” first suggested by Heesch. Here our method differs from that of Appel and Haken.
The main aspects of our proof are as follows. We confirm a conjecture of Heesch that in proving unavoidability a reducible configuration can be found in the second neighborhood of an “over-charged” vertex; this is how we avoid “immersion” problems that were a major source of complication for Appel and Haken. Our unavoidable set has size 633 as opposed to the 1,476-member set of Appel and Haken; our discharging method uses only 32 discharging rules instead of the 487 of Appel and Haken; and we obtain a quadratic algorithm to 4-color planar graphs, an improvement over the quartic algorithm of Appel and Haken. Our proof, including the computer part, has been independently verified, and the ideas have been and are being used to prove more general results. Finally, the main steps of our proof are easier to present, as I will attempt to demonstrate below.
Before I turn to a more detailed discussion of configurations, reducibility, and discharging, let me say a few words about the use of computers in our proof. The theoretical part is completely described in [e6], but it relies on two results that are stated as having been proven by a computer. The rest of [e6] consists of traditional (computer-free) mathematical arguments. There is nothing extraordinary about the theoretical arguments, and so the main burden of verification lies in those two computer proofs. How is one supposed to be convinced of their validity? There are basically two ways. The reader can either write his or her own computer programs to check those results (they are easily seen to be finite problems), or he or she can download our programs along with their supporting documentation and verify that those programs do what we claim they do.
The reducibility part is easier to believe, because we are doing almost the same thing as many authors before us (including Appel and Haken) have done, and so it is possible to compare certain numerical invariants obtained during the computation to gain faith in the results. This is not possible in the unavoidability part, because our approach to it is new. To facilitate checking, we have written this part of the computer proof in a formal language so that it will be machine verifiable. Every line of the proof can be read and checked by a human, and so can (at least theoretically) the whole argument. However, the entire argument occupies about 13,000 lines, and each line takes some thought to verify. Therefore, verifying all of this without a computer would require an amount of persistence and determination my coauthors and I do not possess. The computer data, programs, and documentation are available by anonymous ftp1 and can also be conveniently accessed on the World Wide Web.2
Two of my students independently verified the computer work. Tom Fowler verified the reducibility part (and in fact extended it to obtain a more general result — see later), and Christopher Carl Heckman wrote an independent version of the unavoidability part using a different programming language. Bojan Mohar also informed us that his student Gašper Fijavž wrote independent programs and was able to confirm both parts of the computer proof. The computer verification can be carried out in a matter of several hours on standard commercially available equipment.
I should mention that both our programs use only integer arithmetic, and so we need not be concerned with round-off errors and similar dangers of floating point arithmetic. However, an argument can be made that our “proof” is not a proof in the traditional sense, because it contains steps that most likely can never be verified by humans. In particular, we have not proven the correctness of the compiler on which we compiled our programs, nor have we proven the infallibility of the hardware on which we ran our programs. These have to be taken on faith and are conceivably a source of error. However, from a practical point of view, the chance of a computer error that appears consistently in exactly the same way on all runs of our programs on all the compilers under all the operating systems that our programs run on is infinitesimally small compared to the likelihood of a human error during the same amount of case-checking. Apart from this hypothetical possibility of a computer consistently giving an incorrect answer, the rest of our proof, including the programs, can be checked in the same way as traditional mathematical proofs. My coauthors and I concede, however, that checking a computer program is much more difficult than verifying a mathematical proof of the same length.
Configurations

First we need a result of Birkhoff about connectivity of minimal
counterexamples. Let
Next we need to introduce the concept of a configuration, which is
central to the rest of the proof. Configurations are technical devices
that permit us to capture the structure of a small part of a larger
triangulation. A graph
A near-triangulation is a nonnull connected
plane graph with one region designated as special
such that every region, except possibly the special
region, is a triangle. A configuration
- for every vertex
of , if is not incident with the special region of , then equals , the degree of , and otherwise , and in either case ; - for every vertex
of , has at most two components, and if there are two, then the degree of in is ; has ring-size at least 2, where the ring-size of is defined to be summed over all vertices incident with the special region of such that is connected.
The significance of condition (1) will become clearer from the definition of “appears” below; conditions (2) and (3) are needed for the definition of the “free completion”, discussed in the next section.

When drawing pictures of configurations, one
possibility is to draw the underlying graph and
write the value of
Isomorphism of configurations is defined in
the natural way. Any configuration isomorphic to
one of the 633 configurations exhibited in
[e6]
is called a good configuration. Let
For example, the good configuration in the upper left corner of Figure 4 appears in the center of Figure 3.
From Theorems 10, 11, and 12 it follows that no minimal counterexample exists, and so the 4CT is true. I will discuss the proofs of the latter two theorems in the next two sections.
Reducibility

Reducibility is a technique for proving statements such as Theorem 11. Qualitatively it can be described by saying that it is obtained by pushing to the limit the arguments used in the two proofs of Theorem 2. It was developed from Kempe’s failed attempt at proving the 4CT by Birkhoff, Bernhart, Heesch, Allaire, Swart, Appel and Haken, and others.
I will explain the idea of reducibility by giving
an example; interested readers may find the precise definition in
[e6].
Let us consider the first configuration in Figure 4, and let
us call it
(Here we were lucky — for larger configurations it does not
follow that the free completion is a subgraph of

Now let us consider
The above argument is called D-reducibility in the literature.
Notice the way in which we used the fact that
D- and C-reducibility can be automated and carried out on a computer. In fact, they must be carried out on a computer, because we need to test configurations with ring-size as large as 14, in which case there are almost 200,000 colorings to be checked. At the present time there is little hope of doing this part of the proof by hand. Even writing down the proof in a formal language, as we did for the discharging part, does not seem practical because of the length of the argument.
Discharging

Discharging is a clever and effective use of Euler’s formula, first suggested by Heesch and later used by Appel and Haken and many others since then. In fact, discharging has become a standard tool in graph theory. In what follows I will refer to Figure 7, which some readers may find overwhelming at the first sight. I recommend focusing attention on the first row and thinking of the rest as being of secondary importance. The first row has a geometric interpretation, discussed below. Also, we will make use of Heesch’s notational convention introduced two paragraphs prior to Theorem 11.
Let
This procedure defines a new set of charges with the same total sum.
For instance, if
Since the total sum of the new charges is positive, there is a vertex
The rules corresponding to the first row in Figure 7 were derived from
an elegant method of
Mayer and have the following geometric
interpretation. Let
While this redistribution of charges seems natural, unfortunately it is not true that a reducible configuration appears in the second neighborhood of every vertex of positive charge. Therefore, we had to introduce additional rules to make small changes to this distribution to take care of second neighborhoods of vertices with positive charge in which no reducible configuration appeared. The additional rules were obtained by trial and error; there is a lot of flexibility in their choice, but they were not designed with any geometric intuition behind them.
Beyond the Four-Color Theorem
The new proof of the 4CT gives hope that other more general
conjectures that are known to imply the 4CT could be settled by an
appropriate adaptation of the same methods. Let me start by
discussing a result of my student
Tom Fowler
on unique colorability. A
graph

By Theorem 10 this generalizes the Four-Color Theorem, and it implies the Fiorini–Wilson conjecture by a result of Goldwasser and Zhang, who have shown that a minimal counterexample is internally 6-connected. (The proof of the latter is analogous to that of Theorem 10.)
While the 4CT becomes false very quickly once
we leave the world of planar graphs,
Tutte
noticed
that Theorem 3 remains true for a reasonably large
class of nonplanar graphs. The smallest cubic
graph with no cut-edge and no edge 3-coloring is
the famous Petersen graph depicted in Figure 8.
We say that a graph

Tutte’s conjecture implies the 4CT by Theorem 3 (because the Petersen
graph is not planar, and contraction preserves planarity), but it
seems to be much stronger. However, it now appears that there is a
good chance that Tutte’s conjecture can be proven. Indeed,
Robertson,
Seymour,
and I have reduced the problem to the following two classes
of graphs. We say that a graph
Thus, in order to prove Conjecture 14 it suffices to prove it for apex and doublecross graphs, two classes of graphs that are “almost” planar. In fact, my recent work with Sanders suggests that it might be possible to adapt our proof of the 4CT to show that no apex graph is a counterexample to Tutte’s conjecture. There is an indication that the doublecross case will be easier, but we cannot confirm that yet.
There is a way to generalize the 4CT along the lines of (vertex) 4-coloring. Hadwiger conjectured the following.
This is trivial for
By the 4CT every apex graph has a 5-coloring,
and so Hadwiger’s conjecture for