Re-proving theorems, and the trouble with incorrect proofs of true statements

Posted by Catarina July 27, 2013

(Cross-posted at NewAPPS)

“That's the problem with false proofs of true theorems: it's not easy to produce a counterexample.”

This is a comment by Jeffrey Shallit in a post on a purported proof of Fermat’s Last Theorem. (Incidentally, the author of the purported proof comments here at M-Phi occasionally.) In all its apparent simplicity, this remark raises a number of interesting philosophical questions. (Being the pedantic philosopher that I am, I'll change a bit the terminology and use the phrase 'incorrect proof' instead of 'false proof', which I take to be a category mistake.)

First of all, the remark refers to a pervasive but prima facie slightly puzzling feature of mathematical practice: mathematicians often formulate alternative proofs of theorems that have already been proved. This may appear somewhat surprising on the assumption that mathematicians are (solely) in the business of establishing (mathematical) truths; now, if a given truth, a theorem, has already been established, what is the point of going down the same road again? (Or more precisely, going to the same place by taking a different road.) This of course shows that the assumption in question is false: mathematicians are not only interested in theorems; in fact, they are mostly interested in proofs. (This is one of the points of Rav’s thought-provoking paper ‘Why do we prove theorems?’)

There are several reasons why mathematicians look for new proofs of previously established theorems, and John Dawson Jr.’s excellent ‘Why do mathematicians re-prove theorems?’ discusses a number of these reasons. The original proof may be seen as too convoluted or not sufficiently explanatory – ideally, a proof shows not only that P is the case, but also why P is the case (more on this below). Alternatively, the proof may rely on notions and concepts alien to the formulation and understanding of the theorem itself, giving rise to concerns of purity. Indeed, recall that Colin McLarty motivates his search for a new proof of Fermat’s Last Theorem in these terms: “Fermat’s Last Theorem is just about numbers, so it seems like we ought to be able to prove it by just talking about numbers”. This is not the case with the currently available proof by Wiles, which relies on much heavier machinery.

From the point of view of the dialogical conception of proofs that I’ve been developing, as involving a proponent who wants to establish the conclusion and an opponent who seeks to block the derivation of the conclusion (see here and here), an important reason to re-prove theorems would be related to the persuasive function of proofs. Dawson does mention persuasion in his paper, but he does not adopt an explicitly dialogical, multi-agent perspective:

[W]e shall take a proof to be an informal argument whose purpose is to convince those who endeavor to follow it that a certain mathematical statement is true (and, ideally, to explain why it is true). (p. 270)

(In my opinion, this is a fabulous definition of mathematical proofs, except for the fact that it is not explicitly multi-agent.) That is, a given proof, while correct, may still fail to be sufficiently convincing in this sense. I am here reminded of Smale’s original proof of the possibility of eversion of the sphere, which however did not exhibit the process through which the eversion would take place. It was only when Morin built clay models of the stages of the process that it became clear not only that a sphere can be everted, but also how it can be everted. (In mathematics, whys often become hows, i.e. how to construct a given entity, how to realize a given process etc.) In fact, it is now known that there are different ways of everting the sphere.

Still within the dialogical framework, another reason to formulate alternative proofs of theorems is the different commitments and tastes of various audiences. A mathematical proof is a discourse, and even though there is an absolute sense in which a proof is or is not correct, different proofs will be more or less persuasive to different audiences. For example, this observation would explain the search for constructive as well as classical proofs of the same theorems, thus catering for different groups of potential addressees. More generally, different preferences in argumentative styles (not only in theoretical commitments such as those separating classical and constructivist mathematicians) may also create the space for several proofs of the same theorems.

And here is a final, less ‘noble’ reason for re-proving previously established theorems: such proofs are harder to refute. The mathematician’s preferred approach to refuting a proof is to provide a counterexample, i.e. a situation or construction where the premises hold but the conclusion (the theorem) does not. Now, if providing such a counterexample were the only move available to opponent to block the inference of the conclusion by proponent (a thought that I confess to have entertained for a while), then every proof of a true theorem would be a valid proof, no matter how absurd and defective (such as the one motivating Shallit’s comment above).

This is exactly why a proof cannot be a one-step argument going directly from premises to conclusion (which is, in effect, a necessarily truth-preserving move in the case of true theorems): a proof spells out the intermediate steps, which must be individually perspicuous and explanatory – and yes, also necessarily truth-preserving. So incorrect proofs of true theorems require the additional work of delving into the details of the proof in its different steps in order to reveal where the mistake(s) lie(s) – more work, and often tedious work.

Nobody said it was easy being a mathematical opponent.

Comments

gowers, 27 July 2013 at 15:16
For some reason I decided a couple of years ago to allow myself to be drawn into a long email exchange with somebody who wrongly thought that he had proved a major theorem. I ran into exactly the difficulty that you refer to, and that presumably Shallit was talking about: that if you point out that B doesn't follow from A, despite the fact that B is probably true, then you may be challenged to provide a counterexample. To someone who isn't fully steeped in the conventions of mathematical proofs, it is very hard to explain why such a challenge is illegitimate. The technique I attempted to use was to identify the underlying wrong argument and try to prove a statement that was definitely false using the same argument. My hope was to persuade the person I was corresponding with of the truth of the false statement and then reveal that it was false. The trouble was, he had enough knowledge to spot many of the false statements and come back at me with, "That argument doesn't work because the conclusion is false." And if I tried to argue that it was the same argument, he could always point to some superficial difference. Also, even if I did temporarily convince him of a false statement, if I then revealed that it was false he could switch to the same tactic.

The remarkable thing about the whole experience was that eventually, after many failed attempts, I did manage to persuade him that he could use his own logic to prove a statement that was manifestly false. His initial reaction to that was that ZFC must be inconsistent, but I eventually got him to agree that if you start from some correct assumptions, make a not fully justified step, and end up with a false conclusion, it is more likely that the not fully justified step can't be justified than that ZFC is inconsistent.

I found the exchange interesting (or I wouldn't have persisted with it for so long). A model I like of what a proof is, and described in a book I wrote about ten years ago, is that it is a text written at a high level on the understanding that any part of it can be expanded if necessary -- a process that can be iterated. Most of the time, you don't need to bother expanding, since you know your audience when you write, but in principle if there is anything that your reader doesn't accept, you can write it more fully and in a lower-level language. So there is a kind of "implicit" dialogue that only occasionally becomes actual. The exchange I had was by far the most elaborate actualization of this that I had ever experienced, though my opponent didn't always play by the rules I had in mind (since he wasn't used to expressing himself with the precision that I insisted on).
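The technique described in the comment above (reuse the suspect step to derive something manifestly false) can be illustrated with a toy case that is not taken from the exchange itself. The sketch below assumes a hypothetical invalid rule, "cancel the digit shared by numerator and denominator", and the helper name `cancel_shared_digit` is made up for illustration. The rule yields the true statement 16/64 = 1/4, so there is no counterexample to that conclusion, yet the very same rule also yields the false statement 12/24 = 1/4.

```python
from fractions import Fraction


def cancel_shared_digit(num, den):
    """Apply the (invalid) 'cancel the shared digit' step to a fraction
    with a two-digit numerator and denominator.

    Hypothetical helper for illustration only: if the units digit of the
    numerator equals the tens digit of the denominator, strike both out
    and return what is left as a Fraction. Returns None if the step
    does not apply.
    """
    num_tens, num_units = divmod(num, 10)
    den_tens, den_units = divmod(den, 10)
    if num_units == den_tens and den_units != 0:
        return Fraction(num_tens, den_units)
    return None


# The invalid step happens to land on a true conclusion here ...
true_case = cancel_shared_digit(16, 64) == Fraction(16, 64)

# ... but the same step, applied elsewhere, produces a false one.
false_case = cancel_shared_digit(12, 24) == Fraction(12, 24)

print(true_case, false_case)
```

The second check is what refutes the step: the conclusion 16/64 = 1/4 remains true, but the inference that produced it is shown not to be truth-preserving.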
Jeffrey Ketland, 27 July 2013 at 15:46
Hi Catarina,

"a proof spells out the intermediate steps, which must be individually perspicuous and explanatory – and yes, also necessarily truth-preserving."

Isn't the notion of "intermediate step" equivalent to an inference rule (which can involve CUT: i.e., using a previously proved lemma)? Otherwise, why would an informal proof convince anyone?

This isn't about discovery of proofs, but about whether they serve an epistemic justificatory role.

If I understand right, you disagree that an informal proof given in a maths paper should be, in principle, formalizable: in principle, one should be able to translate the sentences into some $L$ and fill in the intuitive "gaps" by principles like "from $A, A \to B$, infer $B$" and "given $\exists x \phi(x)$, one may name a witness $a$, such that $\phi(a)$", etc.

Is that principle of formalizability of informal proofs what you don't agree with?

With you on the importance of conceptual explanatoriness, though. We want to know why something is so, and not just that something is so. But it's a very hard topic! "Proofs from the Book", as Erdős called them.

http://en.wikipedia.org/wiki/Proofs_from_THE_BOOK

Cheers,

Jeff
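As a small illustration of the two gap-filling principles quoted in the comment above, here is how they might look once written out in a proof assistant. This is only a sketch in Lean 4, assuming nothing beyond the core language; the names `A`, `B`, `φ`, `ψ` and `step` are placeholders, and nothing here is meant to settle whether every informal proof can be treated this way.

```lean
-- "From A, A → B, infer B": modus ponens is just function application.
example (A B : Prop) (hA : A) (hAB : A → B) : B :=
  hAB hA

-- "Given ∃ x, φ x, one may name a witness a such that φ a":
-- the match names the witness `a` and its property `ha`,
-- which can then be used in the next step of the argument.
example {α : Type} (φ ψ : α → Prop)
    (h : ∃ x, φ x) (step : ∀ x, φ x → ψ x) : ∃ x, ψ x :=
  match h with
  | ⟨a, ha⟩ => ⟨a, step a ha⟩
```

Each line of such a derivation is checked mechanically, which is the sense in which the intuitive "gaps" of an informal step would have been filled.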
Jeffrey Ketland, 27 July 2013 at 17:08
Catarina,

Suppose we have an informal proof - in English, Punjabi or Hebrew, etc., - a finite sequence of interpreted sentences, say,

$S_1, \dots, S_n$.

(I ignore diagram proofs ... which I think are important and interesting).

We formalize this by a mapping

$^{\circ} : \text{Informal-Mathlish} \to L$

giving:

$(S_1)^{\circ}, \dots, (S_n)^{\circ}$.

This can be expanded into a fully formalized proof, in "machine code".

Is that the picture you do agree with?

Cheers.

Jeff
Jeffrey Ketland, 27 July 2013 at 18:12
Hi Catarina,

Yes, I agree entirely with Dawson and Rav.

Cheers,

Jeff
Jeffrey Ketland, 27 July 2013 at 18:23
Hi Catarina,

To be more exact on our difference: I think you identify a normative, multi-agent, epistemic role for informal proofs, P. Call this role R. Then your view is that a proof is something filling this role.

Whereas I say that a proof is whatever it is, even if it doesn't fill this role R. There may exist proofs that don't satisfy your role criterion. For example, they might be trillions of pages long. A proof might be infinitely long. A proof might involve concepts which humans cannot grasp because of their cognitive limitations.

(Actually, I think of proofs *semantically*: what matters is their meaning.)

For example, Mochizuki's text does not fill this defined normative, multi-agent, epistemic role, as all agree.

On your view, it is therefore "wrong".

http://m-phi.blogspot.co.uk/2013/05/whats-wrong-with-mochizukis-proof-of.html

But, on my view, no one has given a reason for thinking Mochizuki's proof is wrong. It may be right. No one knows; possibly not even Mochizuki.

Cheers,

Jeff
Anonymous, 27 July 2013 at 22:31
"I'll change a bit the terminology and use the phrase 'incorrect proof' instead of 'false proof', which I take to be a category mistake.)"

"Maverick Philosopher" Bill Vallicella calls those alienans adjectives. For example red leather is leather and Corinthian leather is leather, but imitation leather is not leather.

http://maverickphilosopher.typepad.com/maverick_philosopher/2010/01/alienans-adjectives.html

I don't see much difference between a false proof and an incorrect proof. Both are alienans adjectives, negating rather than modifying their associated noun.
Jeffrey Ketland, 31 July 2013 at 13:47
Hi Alan, many thanks!

But Rav's view is the same as mine:

"Let us fix our terminology to understand by proof a conceptual proof of customary mathematical discourse, having an irreducible semantic content, and distinguish it from derivation, which is a syntactic object of some formal system."

That is, proofs have semantic content, and analysing this turns out to be a major problem.

Cheers,

Jeff
Jeffrey Ketland, 31 July 2013 at 14:00
Having said that, and agreed with the importance of semantic content, I do have several points of disagreement with Rav, because he makes some mistakes. For example, Rav does not understand Tarski's analysis of the concept of truth: the notion "truth in a model" does not appear in Tarski's 1936 paper, "On the Concept of Truth in Formalized Languages".

A definition of truth for an interpreted language $L$ is adequate when it implies the T-sentences, which involve a name of the object language sentence $S$ and a translation of it into the metalanguage. This translation must preserve semantic content.

The language $L$ for which truth is defined is, and must be (as Tarski emphasised) interpreted. This confusion about interpreted languages is very common, and is a mistake in Rav's paper.

Cheers,

Jeff
Michael Pershan, 4 August 2013 at 14:15
If proof is simply an informal argument intended to convince an audience, then we shouldn't care much who that intended audience is. As you say, different proofs may be intended to be convincing arguments to different audiences.

But I'm wondering about small children. (I am a classroom teacher.) First, there are many informal arguments that would be convincing to a young student that would be seen as mathematically problematic for a more mature mathematician. Would these arguments be accepted as proofs on your thinking?

More generally: what about bad arguments that are convincing? (e.g. Arguments that contain false premises or incorrect inferences that nonetheless fool an entire audience.) On what grounds can you argue that these arguments aren't proofs, given your definition? (Perhaps by arguing that these proofs fail to be convincing in the broad sense, since eventually someone will find the error and unconvince everyone? Or perhaps you take a pragmatic stand, and call the argument a proof until an error is revealed?)

Thanks in advance for dealing with a noob to your blog!
