Artificial Intelligence

How Uncertainty Estimation Scales with Sampling in Reasoning Models
Avatar
librarian
5 views
D5P4: Partition Determinantal Point Process for Diversity in Parallel Discrete Diffusion Decoding
Avatar
librarian
5 views
cuGenOpt: A GPU-Accelerated General-Purpose Metaheuristic Framework for Combinatorial Optimization
Avatar
Yuyang Liu
5 views
Box Maze: A Process-Control Architecture for Reliable LLM Reasoning
Avatar
Zou Qiang
5 views
dTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language Models
Avatar
librarian
5 views
MANAR: Memory-augmented Attention with Navigational Abstract Conceptual Representation
Avatar
librarian
5 views
Memento-Skills: Let Agents Design Agents
Avatar
librarian
20 views
Reasoning over mathematical objects: on-policy reward modeling and test time aggregation
Avatar
librarian
5 views
Facts as First Class Objects: Knowledge Objects for Persistent LLM Memory
Avatar
Oliver Zahn
4 views
RPMS: Enhancing LLM-Based Embodied Planning through Rule-Augmented Memory Synergy
Avatar
Zhenhang Yuan
5 views
AgentFactory: A Self-Evolving Framework Through Executable Subagent Accumulation and Reuse
Avatar
Zhang Zhang
5 views
When Only the Final Text Survives: Implicit Execution Tracing for Multi-Agent Attribution
Avatar
librarian
6 views
Contrastive Reasoning Alignment: Reinforcement Learning from Hidden Representations
Avatar
Haozheng Luo
10 views
Towards Safer Large Reasoning Models by Promoting Safety Decision-Making before Chain-of-Thought Generation
Avatar
librarian
8 views
Proactive Knowledge Inquiry in Doctor-Patient Dialogue: Stateful Extraction, Belief Updating, and Path-Aware Action Planning
Avatar
librarian
5 views
Proactive Knowledge Inquiry in Doctor-Patient Dialogue: Stateful Extraction, Belief Updating, and Path-Aware Action Planning
Avatar
librarian
6 views
Machines acquire scientific taste from institutional traces
Avatar
librarian
9 views
Anticipatory Planning for Multimodal AI Agents
Avatar
librarian
7 views
Internalizing Agency from Reflective Experience
Avatar
librarian
7 views
SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models
Avatar
librarian
8 views
Via Negativa for AI Alignment: Why Negative Constraints Are Structurally Superior to Positive Preferences
Avatar
Quan Cheng
7 views
Adaptive Theory of Mind for LLM-based Multi-Agent Coordination
Avatar
Chunjiang Mu
7 views
TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas
Avatar
Jian Ai
8 views
Talk, Evaluate, Diagnose: User-aware Agent Evaluation with Automated Error Analysis
Avatar
librarian
10 views
Portfolio of Solving Strategies in CEGAR-based Object Packing and Scheduling for Sequential 3D Printing
Avatar
Pavel Surynek
23 views
Compiling Temporal Numeric Planning into Discrete PDDL+: Extended Version
Avatar
librarian
12 views
Increasing intelligence in AI agents can worsen collective outcomes
Avatar
librarian
21 views
On Information Self-Locking in Reinforcement Learning for Active Reasoning of LLM agents
Avatar
librarian
23 views
TopoBench: Benchmarking LLMs on Hard Topological Reasoning
Avatar
Mayug Maniparambil
23 views
Examining Reasoning LLMs-as-Judges in Non-Verifiable LLM Post-Training
Avatar
librarian
21 views
FAME: Formal Abstract Minimal Explanation for Neural Networks
Avatar
librarian
24 views
Emulating Clinician Cognition via Self-Evolving Deep Clinical Research
Avatar
Ruiyang Ren
31 views