real numbers | njwildberger: tangential thoughts

There are several approaches to the modern theory of “real numbers”. Unfortunately, none of them makes complete sense. One hundred years ago, there was vigorous discussion about the ambiguities with them and Cantor’s theory of “infinite sets”. As time went by, the debate subsided but the difficulties didn’t really go away.

A largely unquestioning uniformity has settled on the discipline, with most students now only dimly aware of the logical problems with “uncomputable numbers”, “non-measurable functions”, the “Axiom of choice”, “hierarchies of cardinals and ordinals”, and various anomalies and paradoxes that supposedly arise in topology, set theory and measure theory.

Some of the stumbling blocks have been described at length in my Math Foundations series of YouTube videos. In this blog we concentrate on the problems with real numbers and arithmetic with them.

The basic division in mathematics is between the discrete and the continuous. Discrete mathematics studies locally finite collections and patterns, and relies on counting, beginning with the natural numbers 1,2,3,⋯ and then extending to the integers, including 0, as well as -1,-2,-3,⋯, and to rational numbers, or fractions, of the form a/b with a and b integers, subject to the condition that b non-zero and with a particular notion of equality.

Continuous mathematics studies the “continuum” and functions on it, and relies on measurement, which these days involves also “irrational numbers” like √2,√5 and π that the ancients wrestled with, as well as more modern “irrational numbers” such as e and γ arising from integrals and infinite series. But what do these words and objects actually precisely mean and refer to? We should not presume that just because we use a common term or notion familiar from everyday life, that its mathematical meaning has been properly established.

Up to a hundred years ago, the notion of the “continuum” seemed intuitively straightforward, but difficult to pin down precisely. It appeared that we could rely on our intuition of space, following the philosopher I. Kant’s view that somehow humans had an innate understanding of three-dimensional geometry. However with the advent of modern physics, and in particular relativity and quantum mechanics, the true nature of the “continuum” grew increasingly murky: if time is relative and perhaps finite in extent, and space has an inherent graininess which renders it certainly not infinitely divisible, then what exactly are we modelling with our notion of the “infinite number line”?

While engineers and scientists work primarily with finite decimal numbers in an approximate sense, “real numbers” as infinite decimals are idealized objects which attempt to extend the explicit finite but approximate numbers of engineers into a domain where infinite processes can be ostensibly be exactly evaluated. To make this magic work, mathematicians invoke a notion of “equivalence classes of Cauchy sequences of rational numbers”, or as “Dedekind cuts”.

Each view has different difficulties, but always there is the crucial problem of discussing infinite objects without sufficient regard to how to specify them. I have discussed the serious logical difficulties at length around video 80-105 in the Math Foundations series.

For example the video Inconvenient truths with sqrt(2) has generated a lot of discussion. However not everyone approves of casting doubt on the orthodoxy: the video has more than 1000 likes, but also 316 dislikes. I doubt if I am saying anything in this video which is actually incorrect though — you can judge for yourselves.

Let’s return to the safe side of things. A finite sequence such as s = 1,5,9 may be described in quite different ways, for example as the “increasing sequence of possible last digits in an odd integer square”, or as the “sequence of numbers less than 10 which are congruent to 1 modulo 4”, or as the “sequence of digits occurring in the 246-th prime after removing repetitions”. But ultimately there is only one way to specify such a sequence s completely and unambiguously: by explicitly listing all its elements.

When we make the jump to “infinite sequences”, such as a sequence somehow implied by the expression “m = 3,5,7,⋯” the situation changes dramatically. It is never possible to explicitly list “all the elements” of an infinite sequence. Instead we are forced to rely on a rule generating the sequence to specify it. In this case perhaps: m is the list of all odd numbers starting with 3, or perhaps: m is the list of all odd primes. Without such a rule, a definition like “m = 3,5,7,⋯” is really rather meaningless.

We can say the words “infinite sequence”, but what are we actually explicitly talking about??

To a computer scientist, an “infinite sequence” is modelled by a computer program, churning out number after number perhaps onto a hard drive, or in former years onto a long tape. At any given point in time, there are only finite many outputs. As long as you keep supplying more memory, or tape, and electricity the process in principle never stops, but in practice will run short of resources and either grind forwards ever more and more slowly (the next output will take two years, hang on just a while!) or just come to a grinding halt when power or memory is inevitably exhausted.

So in this case the sequence is not to be identified by the “completed output tape”, which is a figment of our imagination, but rather by the computer program that generates it, which is concrete and completely specifiable. However here we come to an essential difficulty with “infinite processes”: the program that generates a given “infinite sequence”: is always far from unique. There is no escape from this inescapable fact, and it colours all meaningful aspects of dealing with “infinity”.

A finite set such as {2 4 6 8} can also be described in many ways, but ultimately it too is only properly specified by showing all its elements. In this case order is not important, so that for example the elements might be scattered over a page. Finite sets whose elements cannot be explicitly shown have not been specified, though we might agree that they have been described.

An example might be: let S be the set of all odd perfect numbers less than 10^{100000}. [A perfect number, like 6 and 28, is the sum of those of its divisors less than itself, i.e. 6=1+2+3 and 28=1+2+4+7+14.] Such a description of S does not deserve to be called a specification of the set, at least not with our current understanding of perfect numbers, which doesn’t even allow us to determine if S is empty or not.

With sets the dichotomy between finite and infinite is much more severe than for sequences, because we do not allow a steady exhibition of the elements through time. It is impossible to exhibit all of the elements of an “infinite set” at once, so the notion is an ideal one that more properly belongs to philosophy—it can only be approximated within mathematics.

The notion of a “completed infinite set” is strongly contrary to classical thinking; since we can’t actually collect together more than a finite number of elements as a completed totality, why pretend that we can? It is the same reason that “Leprechaun heaven” or “hierarchies of angels” are not generally recognized as proper scientific entities. Infinite sets, angels and Leprechauns may very well exist in our universe, but this is a philosophical or religious inquiry, not a mathematical or scientific one.

The idea of “infinity” as an unattainable ideal that can only be approached by an endless sequence of better and better finite approximations is both humble and ancient, and one I would strongly advocate to those wishing to understand mathematics more deeply. This is the position that Archimedes, Newton, Euler and Gauss would have taken, and it is a view that ought to be seriously reconsidered. I believe it is also closer to the view of modern giants such as H. Poincare and H. Weyl, both of whom were skeptical about our uses of “infinity”.

Why is any of this important? The “real numbers” are where Cantor’s “hierarchies of infinities” begins, and much of modern set theory rests, so this is an issue with widespread consequences, even within algebra and combinatorics. Secondly the “real numbers” are the arena where calculus and analysis is developed, so difficulties with their essential arithmetic foundation lead to weakness in the calculus curriculum, confusion with aspects of measure theory, functional analysis and other advanced subjects, and are obstacles in our attempts to understand physics.

In my opinion, it is possible, perhaps even likely, that we need to understand mathematics in the right way before we will be able to unlock the deeper secrets of the universe.

By reorganizing our subject to be more careful and logical, and by removing dubious axiomatic assumptions and unnecessary philosophizing about “real numbers” and “infinite sets” we make it easier for young people to learn, appreciate and contribute.

This also strengthens the relationship between mathematics and computing.

It is time to acknowledge the orthodoxy that silently frames our discipline. We need to learn from our colleagues in physics and computer science, and begin the slow, challenging but important and ultimately rewarding task of restructuring mathematics properly.

A few days ago I had an online conversation with Dr Daniel Rubin who is a mathematician living in the US and who works in analysis, geometry and approximation theory. The topic was one close to my heart: Daniel wanted to hear of my objections to the status quo concerning the foundations of modern analysis: namely my rejection of “real number arithmetic” and why I don’t accept “completed infinite processes”. And naturally he wanted to do his best to rebut them.

Here is a link to our chat:

It is certainly encouraging to see that some analysts are willing to engage with the uncomfortable idea that their discipline might actually be in serious logical difficulties. Most of us are reluctant to accept that something we have been working on for years and years might actually be wrong. I applaud Daniel for the courage to engage with these important ideas, and to consider how they fit, or don’t fit, into his current view on analysis.

When we learn pure mathematics, there are many things that we at first don’t understand, perhaps because they are obscure, or perhaps because we are not smart enough — it is easy not to be sure which. Our usual reaction to that is: let me try to accept the things which are cloudy, and hopefully with further learning things will become clearer. This is a reasonable approach to tackling such a difficult subject. However it does require us to put aside our natural skepticism, and accept what the more established figures are telling us at critical points in the theoretical development, even if we imagine this is only temporary.

A good example is: “analysis is built from axiomatic set theory.” In other words the foundations of “infinite sets” and so the basic logical structure of the “arithmetic of real numbers” is a consequence of work of logicians, and can be taken for granted without much further inquiry. Or to put it less politely: it is not the job of an analyst to work out clearly the foundations of the subject; this is something that can be outsourced.

In this fashion dubious logical sleights of hand can creep into an area, transmitted from generation to generation and strengthened with each repeat. Young academics in pure mathematics are under a lot of pressure to publish to obtain a foothold in the academic ladder. This means they do not often have time to mull over those knotty foundational questions that might have been bugging them secretly at the backs of their minds. They probably don’t spend a lot of time on the history of these problems, many of which go back centuries, and in former times engaged the interest of many prominent mathematicians.

Later in their career, if our young PhD has been lucky enough to score an academic job, they might be in a position to go back over these core problems and think them through more carefully. But even then there is often not a lot of “academic reward” in doing so: their fellows are not particularly interested in endeavors that are critical of the orthodoxy — pure mathematics is quite different in this regard than science or even applied mathematics!

And journals are uniformly not keen on publishing papers on foundational issues, especially ones which challenge accepted beliefs. As pure mathematics rests on a premise of logical correctness, any questioning of that is seen as subversive to the entire discipline.

But maybe some serious consideration and debate of the underlying logical structure is just what the discipline really needs.

I certainly enjoyed our conversation and I think there are valuable points in it. I hope you enjoy it, and look forward to another public YouTube discussion with Daniel.

	njwildberger: tangen… on Insights into Mathematics and…
	Anonymous on Insights into Mathematics and…
	Anonymous on Insights into Mathematics and…
	Anonymous on Insights into Mathematics and…
	Anonymous on Insights into Mathematics and…

njwildberger: tangential thoughts

mathematical ruminations and more

Tag Archives: real numbers

Finite versus “infinite”

My talk with Daniel Rubin on Real Numbers and the Infinite in Analysis