Optimization-Free Topological Sort for Causal Discovery via the Schur Complement of Score Jacobians

0upvotes

By: Rui Wu, Hong Xie

Continuous causal discovery typically couples representation learning with structural optimization via non-convex acyclicity penalties, which subjects solvers to local optima and restricts scalability in high-dimensional regimes. We propose a decoupled paradigm that shifts the causal discovery bottleneck from non-convex optimization to statistical score estimation. We introduce the Score-Schur Topological Sort (SSTS), an algorithm that extrac... more

Machine LearningApril 29, 2026 2:11am

5 SciCasts by .

Comments (0)
Views (7)

Toward Scalable Terminal Task Synthesis via Skill Graphs

0upvotes

By: Zhiyuan Fan, Tinghao Yu, Yuanjun Cai, Jiangtao Guan, Yun Yang, Dingxin Hu, Jiang Zhou, Xing Wu, Zhuo Han, Feng Zhang, Lilin Wang

Terminal agents have demonstrated strong potential for autonomous command-line execution, yet their training remains constrained by the scarcity of high-quality and diverse execution trajectories. Existing approaches mitigate this bottleneck by synthesizing large-scale terminal task instances for trajectory sampling. However, they primarily focus on scaling the number of tasks while providing limited control over the diversity of execution tr... more

Artificial IntelligenceApril 29, 2026 2:04am

Comments (0)
Views (10)

SpecRLBench: A Benchmark for Generalization in Specification-Guided Reinforcement Learning

0upvotes

By: Zijian Guo, İlker Işık, H. M. Sabbir Ahmad, Wenchao Li

Specification-guided reinforcement learning (RL) provides a principled framework for encoding complex, temporally extended tasks using formal specifications such as linear temporal logic (LTL). While recent methods have shown promising results, their ability to generalize across unseen specifications and diverse environments remains insufficiently understood. In this work, we introduce SpecRLBench, a benchmark designed to evaluate the general... more

Machine LearningApril 28, 2026 8:26am

Comments (0)
Views (10)

Scalable Hyperparameter-Divergent Ensemble Training with Automatic Learning Rate Exploration for Large Models

0upvotes

By: Hailing Cheng, Tao Huang, Chen Zhu, Antonio Alonso

Training large neural networks with data-parallel stochastic gradient descent allocates N GPU replicas to compute effectively identical updates -- a practice that leaves the rich space of learning rate configurations entirely unexplored during training. We propose Hyperparameter-Divergent Ensemble Training (HDET), a method that repurposes these replicas for simultaneous learning rate exploration at negligible communication overhead. HDET oper... more

Machine LearningApril 28, 2026 3:58am

Comments (0)
Views (10)

The Optimal Sample Complexity of Multiclass and List Learning

0upvotes

By: Chirag Pabbaraju

While the optimal sample complexity of binary classification in terms of the VC dimension is well-established, determining the optimal sample complexity of multiclass classification has remained open. The appropriate complexity parameter for multiclass classification is the DS dimension, and despite significant efforts, a gap of $\sqrt{\text{DS}}$ has persisted between the upper and lower bounds on sample complexity. Recent work by Hanneke ... more

Machine LearningApril 28, 2026 3:58am

Comments (0)
Views (14)

Governing What You Cannot Observe: Adaptive Runtime Governance for Autonomous AI Agents

0upvotes

By: German Marin, Jatin Chaudhary

Autonomous AI agents can remain fully authorized and still become unsafe as behavior drifts, adversaries adapt, and decision patterns shift without any code change. We propose the \textbf{Informational Viability Principle}: governing an agent reduces to estimating a bound on unobserved risk $\hat{B}(x) = U(x) + SB(x) + RG(x)$ and allowing an action only when its capacity $S(x)$ exceeds $\hat{B}(x)$ by a safety margin. The \textbf{Agent Viabil... more

Artificial IntelligenceApril 28, 2026 3:29am