Auto formalisation of Chaitin and of the surprise incompleteness Theorem
Abstract
This is a continuation of a previous report on an experiment in autoformalisation of Gödel's second incompleteness theorem in Agda using Claude. Using the framework built in this experiment, Claude could ``automformalise'' Chaitin's proof of the first incompleteness theorem and then the Kritchman-Raz surprise examination paradox version of the second incompleteness. As the first experiment, the project provides a case study of the strengths and limitations of current large language models in mathematics. Since Chaitin's proof involves coding programs, Claude had to represent code as ternary string and could build autonomously a parser and a continuation stack evaluation machine. The fact that we can simulate computations as expected is not completely trivial and we suggested a Gandy/Howard majorisation argument, that Claude had no problem to follow. The resulting formalisation clarifies a number of details left implicit in the original presentation and provides a fully machine-checked proof of these arguments for Church's Basic Recursive Arithmetic.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.