Science Cast

Reasoning Structure of Large Language Models

librarianJune 3, 2026 4:15am

Views (0)
Comments (0)

Export Citation

Voice is AI-generated

Connected to paperThis paper is a preprint and has not been certified by peer review

Reasoning Structure of Large Language Models

arXivPDFJune 2, 2026 12:00am

Authors

Frédéric Berdoz, Luca A. Lanzendörfer, Fabian Farestam, Roger Wattenhofer

Abstract

Large reasoning models (LRMs) are often evaluated using metrics such as final-answer accuracy or token count. However, identical scores on these metrics can hide fundamentally different reasoning structures. To address this limitation, we introduce a scalable LRM benchmark of logic puzzles and a pipeline that converts unstructured traces into verifiable reasoning graphs of claims and dependencies. This turns reasoning into a structured, measurable object whose topology can be quantitatively analyzed. Building on this, we define a reasoning efficiency metric that quantifies how concentrated the model's logical flow is. Our analysis on open-source reasoning models shows that structural measurements separate behaviors that token count and accuracy conflate, providing a practical tool for diagnosing failure modes and comparing how reasoning scales with puzzle difficulty.

TwitterandLinkedIn

0 comments

Add comment

Reasoning Structure of Large Language Models

Reasoning Structure of Large Language Models

AI-powered Paper ChatBeta

AI-powered Paper ChatBeta

0 comments