Science Cast

Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search

librarian, March 7, 2025 2:17am


This paper is a preprint and has not been certified by peer review.


arXiv, March 6, 2025 12:00am

Authors

Kou Misaki, Yuichi Inoue, Yuki Imajuku, So Kuroki, Taishi Nakamura, Takuya Akiba

Abstract

Recent advances demonstrate that increasing inference-time computation can significantly boost the reasoning capabilities of large language models (LLMs). Although repeated sampling (i.e., generating multiple candidate outputs) is a highly effective strategy, it does not leverage external feedback signals for refinement, which are often available in tasks like coding. In this work, we propose Adaptive Branching Monte Carlo Tree Search (AB-MCTS), a novel inference-time framework that generalizes repeated sampling with principled multi-turn exploration and exploitation. At each node in the search tree, AB-MCTS dynamically decides whether to "go wider" by expanding new candidate responses or "go deeper" by revisiting existing ones based on external feedback signals. We evaluate our method on complex coding and engineering tasks using frontier models. Empirical results show that AB-MCTS consistently outperforms both repeated sampling and standard MCTS, underscoring the importance of combining the response diversity of LLMs with multi-turn solution refinement for effective inference-time scaling.
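The "go wider" versus "go deeper" decision described in the abstract can be illustrated with a small sketch. The abstract does not spell out the decision rule, so the version below is an assumption rather than the authors' implementation: it uses Thompson sampling over Beta posteriors, where each node holds one posterior for the "generate a new child here" action and one per existing child. The names `Node`, `step`, `search`, `generate`, and `evaluate` are hypothetical, and `evaluate` stands in for whatever external feedback signal a task provides, assumed here to be a score in [0, 1].

```python
import random
from dataclasses import dataclass, field
from typing import Callable, Iterator, List, Tuple


@dataclass
class Node:
    answer: str
    score: float = 0.0
    children: List["Node"] = field(default_factory=list)
    # Beta pseudo-counts for the "go wider" action at this node (assumed prior: Beta(1, 1)).
    gen_alpha: float = 1.0
    gen_beta: float = 1.0
    # Beta pseudo-counts summarising scores seen at this node and below it.
    alpha: float = 1.0
    beta: float = 1.0


def iter_nodes(node: Node) -> Iterator[Node]:
    """Yield the node and all of its descendants."""
    yield node
    for child in node.children:
        yield from iter_nodes(child)


def step(root: Node, generate: Callable[[str], str],
         evaluate: Callable[[str], float], rng: random.Random) -> Node:
    """Walk down the tree, then add exactly one new node."""
    node, path = root, [root]
    while True:
        # One posterior draw for widening, one per existing child.
        widen_draw = rng.betavariate(node.gen_alpha, node.gen_beta)
        draws = [(rng.betavariate(c.alpha, c.beta), i)
                 for i, c in enumerate(node.children)]
        if not draws or widen_draw >= max(draws)[0]:
            break  # go wider: generate a new child of `node`
        node = node.children[max(draws)[1]]  # go deeper: descend into a child
        path.append(node)

    answer = generate(node.answer)  # "" at the root means a fresh sample
    score = min(1.0, max(0.0, evaluate(answer)))  # clamp feedback to [0, 1]
    child = Node(answer, score)
    child.alpha += score
    child.beta += 1.0 - score
    node.children.append(child)

    # Update the posteriors that led to this expansion.
    node.gen_alpha += score
    node.gen_beta += 1.0 - score
    for visited in path:
        visited.alpha += score
        visited.beta += 1.0 - score
    return child


def search(generate: Callable[[str], str], evaluate: Callable[[str], float],
           budget: int, seed: int = 0) -> Tuple[Node, Node]:
    """Spend `budget` generation calls; return (root, best node found)."""
    rng = random.Random(seed)
    root = Node(answer="")
    best = root
    for _ in range(budget):
        child = step(root, generate, evaluate, rng)
        if child.score > best.score:
            best = child
    return root, best
```

In this sketch, always widening at the root corresponds to plain repeated sampling, while always descending corresponds to sequential refinement of a single answer; the sampled posteriors let the search move between the two. For a coding task, `generate` might wrap an LLM call that refines the parent answer, and `evaluate` might return the fraction of tests passed, though both are placeholders here.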
