Science Cast

AxDafny: Agentic Verified Code Generation in Dafny

librarianJuly 1, 2026 2:27am

Views (38)
Comments (0)

Export Citation

Voice is AI-generated

Connected to paperThis paper is a preprint and has not been certified by peer review

AxDafny: Agentic Verified Code Generation in Dafny

arXivPDFJune 30, 2026 12:00am

Authors

Benjamin Breen, Austin Letson, Borja Requena Pozo, Leopoldo Sarra

Abstract

We study agentic code generation in Dafny, where a model must generate both executable code and the proof artifacts for verification. We present AxDafny, a verifier-guided repair framework that iteratively generates implementations, invariants, assertions, and termination arguments. We also introduce LiveCodeBench-Pro-Dafny (LCB-Pro-Dafny), a benchmark of 250 competition-style programming problems translated into Dafny with formal specifications and a verifier-based evaluation harness. On LCB-Pro-Dafny, AxDafny substantially improves verification success over baseline GPT-5.5 performance. On DafnyBench, AxDafny achieves 92.7\% verification success, outperforming the strongest previously reported proof-hint baseline by 6.5 percentage points. Lastly, we show that verification success and runtime test performance measure different aspects of generated code.

TwitterandLinkedIn

0 comments

Add comment

AxDafny: Agentic Verified Code Generation in Dafny

AxDafny: Agentic Verified Code Generation in Dafny

AI-powered Paper ChatBeta

AI-powered Paper ChatBeta

0 comments