RPMS: Enhancing LLM-Based Embodied Planning through Rule-Augmented Memory Synergy

0upvotes

By: Zhenhang Yuan, Shenghai Yuan, Lihua Xie

LLM agents often fail in closed-world embodied environments because actions must satisfy strict preconditions -- such as location, inventory, and container states -- and failure feedback is sparse. We identify two structurally coupled failure modes: (P1) invalid action generation and (P2) state drift, each amplifying the other in a degenerative cycle. We present RPMS, a conflict-managed architecture that enforces action feasibility via struct... more

Artificial IntelligenceMarch 19, 2026 2:22am

Comments (0)
Views (23)

AgentFactory: A Self-Evolving Framework Through Executable Subagent Accumulation and Reuse

0upvotes

By: Zhang Zhang, Shuqi Lu, Hongjin Qian, Di He, Zheng Liu

Building LLM-based agents has become increasingly important. Recent works on LLM-based agent self-evolution primarily record successful experiences as textual prompts or reflections, which cannot reliably guarantee efficient task re-execution in complex scenarios. We propose AgentFactory, a new self-evolution paradigm that preserves successful task solutions as executable subagent code rather than textual experience. Crucially, these subagent... more

Artificial IntelligenceMarch 19, 2026 2:22am

Comments (0)
Views (21)

When Only the Final Text Survives: Implicit Execution Tracing for Multi-Agent Attribution

0upvotes

By: Yi Nian, Haosen Cao, Shenzhe Zhu, Henry Peng Zou, Qingqing Luan, Yue Zhao

When a multi-agent system produces an incorrect or harmful answer, who is accountable if execution logs and agent identifiers are unavailable? Multi-agent language systems increasingly rely on structured interactions such as delegation and iterative refinement, yet the final output often obscures the underlying interaction topology and agent contributions. We introduce IET (Implicit Execution Tracing), a metadata-independent framework that en... more

Artificial IntelligenceMarch 19, 2026 1:26am

Comments (0)
Views (36)

Contrastive Reasoning Alignment: Reinforcement Learning from Hidden Representations

0upvotes

By: Haozheng Luo, Yimin Wang, Jiahao Yu, Binghui Wang, Yan Chen

We propose CRAFT, a red-teaming alignment framework that leverages model reasoning capabilities and hidden representations to improve robustness against jailbreak attacks. Unlike prior defenses that operate primarily at the output level, CRAFT aligns large reasoning models to generate safety-aware reasoning traces by explicitly optimizing objectives defined over the hidden state space. Methodologically, CRAFT integrates contrastive representa... more

Artificial IntelligenceMarch 19, 2026 1:02am

Comments (0)
Views (31)

Towards Safer Large Reasoning Models by Promoting Safety Decision-Making before Chain-of-Thought Generation

0upvotes

By: Jianan Chen, Zhifang Zhang, Shuo He, Linan Yue, Lei Feng, Minling Zhang

Large reasoning models (LRMs) achieved remarkable performance via chain-of-thought (CoT), but recent studies showed that such enhanced reasoning capabilities are at the expense of significantly degraded safety capabilities. In this paper, we reveal that LRMs' safety degradation occurs only after CoT is enabled, and this degradation is not observed when CoT is disabled. This observation motivates us to consider encouraging LRMs to make safety ... more

Artificial IntelligenceMarch 19, 2026 1:01am

6 SciCasts by .