Artificial Intelligence

Learn-by-Wire Training Control Governance: Bounded Autonomous Training Under Stress for Stability and Efficiency
Avatar
Anis Radianis
9 views
Not Every Rubric Teaches Equally: Policy-Aware Rubric Rewards for RLVR
Avatar
Utkarsh Tyagi
7 views
Probing Embodied LLMs: When Higher Observation Fidelity Hurts Problem Solving
Avatar
librarian
4 views
Probabilistic Tiny Recursive Model

Probabilistic Tiny Recursive Model

Artificial Intelligence
Avatar
Amin Sghaier
7 views
GeoX: Mastering Geospatial Reasoning Through Self-Play and Verifiable Rewards
Avatar
librarian
7 views
AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration
Avatar
librarian
7 views
Neurosymbolic Learning for Inference-Time Argumentation
Avatar
librarian
6 views
A Methodology for Selecting and Composing Runtime Architecture Patterns for Production LLM Agents
Avatar
librarian
6 views
OpenComputer: Verifiable Software Worlds for Computer-Use Agents
Avatar
librarian
2 views
Prior Knowledge or Search? A Study of LLM Agents in Hardware-Aware Code Optimization
Avatar
Dmitry Redko
2 views
Learning Quantifiable Visual Explanations Without Ground-Truth
Avatar
librarian
2 views
Efficient Lookahead Encoding and Abstracted Width for Learning General Policies in Classical Planning
Avatar
Michael Aichmüller
2 views
AI for Auto-Research: Roadmap & User Guide
Avatar
librarian
2 views
Position: A Three-Layer Probabilistic Assume-Guarantee Architecture Is Structurally Required for Safe LLM Agent Deployment
Avatar
librarian
3 views
SkillGenBench: Benchmarking Skill Generation Pipelines for LLM Agents
Avatar
librarian
5 views
GIM: Evaluating models via tasks that integrate multiple cognitive domains
Avatar
librarian
3 views
Democratizing Large-Scale Re-Optimization with LLM-Guided Model Patches
Avatar
librarian
3 views
CatalyticMLLM: A Graph-Text Multimodal Large Language Model for Catalytic Materials
Avatar
Yanjie Li
2 views
CAM-Bench: A Benchmark for Computational and Applied Mathematics in Lean
Avatar
librarian
2 views
Is VLA Reasoning Faithful? Probing Safety of Chain-of-Causation
Avatar
librarian
2 views
Case-Based Calibration of Adaptive Reasoning and Execution for LLM Tool Use
Avatar
librarian
13 views
Why Neighborhoods Matter: Traversal Context and Provenance in Agentic GraphRAG
Avatar
librarian
18 views
Orchard: An Open-Source Agentic Modeling Framework
Avatar
librarian
22 views
APWA: A Distributed Architecture for Parallelizable Agentic Workflows
Avatar
librarian
21 views
OpenDeepThink: Parallel Reasoning via Bradley--Terry Aggregation
Avatar
librarian
25 views
Senses Wide Shut: A Representation-Action Gap in Omnimodal LLMs
Avatar
librarian
23 views
Harnessing Agentic Evolution

Harnessing Agentic Evolution

Artificial Intelligence
Avatar
librarian
20 views
History Anchors: How Prior Behavior Steers LLM Decisions Toward Unsafe Actions
Avatar
librarian
27 views
D-VLA: A High-Concurrency Distributed Asynchronous Reinforcement Learning Framework for Vision-Language-Action Models
Avatar
Yucheng Guo
24 views
Differentiable Learning of Lifted Action Schemas for Classical Planning
Avatar
Jonas Reiter
23 views
Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling
Avatar
librarian
26 views
CAAFC: Chronological Actionable Automated Fact-Checker for misinformation / non-factual hallucination detection and correction
Avatar
Islam Eldifrawi
26 views