Science Cast

A Language-Agent Approach to Formal Theorem-Proving

Amitayush ThakurOctober 9, 2023 8:44am

Views (61)
Comments (0)

Export Citation

Voice is AI-generated

Connected to paperThis paper is a preprint and has not been certified by peer review

A Language-Agent Approach to Formal Theorem-Proving

arXivPDFOctober 6, 2023 12:00am

Authors

Amitayush Thakur, Yeming Wen, Swarat Chaudhuri

Abstract

Language agents, which use a large language model (LLM) capable of in-context learning to interact with an external environment, have recently emerged as a promising approach to control tasks. We present the first language-agent approach to formal theorem-proving. Our method, COPRA, uses a high-capacity, black-box LLM (GPT-4) as part of a policy for a stateful backtracking search. During the search, the policy can select proof tactics and retrieve lemmas and definitions from an external database. Each selected tactic is executed in the underlying proof framework, and the execution feedback is used to build the prompt for the next policy invocation. The search also tracks selected information from its history and uses it to reduce hallucinations and unnecessary LLM queries. We evaluate COPRA on the miniF2F benchmark for Lean and a set of Coq tasks from the Compcert project. On these benchmarks, COPRA is significantly better than one-shot invocations of GPT-4, as well as state-of-the-art models fine-tuned on proof data, at finding correct proofs quickly.

TwitterandLinkedIn

0 comments

Add comment

A Language-Agent Approach to Formal Theorem-Proving

A Language-Agent Approach to Formal Theorem-Proving

AI-powered Paper ChatBeta

AI-powered Paper ChatBeta

0 comments