Computer Science

Pencil Puzzle Bench: A Benchmark for Multi-Step Verifiable Reasoning
Avatar
Justin Waugh
0 views
Recursive Models for Long-Horizon Reasoning
Avatar
librarian
0 views
Multi-Head Low-Rank Attention
Avatar
librarian
0 views
Frontier Models Can Take Actions at Low Probabilities
Avatar
Alex Serrano
0 views
Nano-EmoX: Unifying Multimodal Emotional Intelligence from Perception to Empathy
Avatar
Xuechao Yang
1 view
Conformal Policy Control

Conformal Policy Control

Artificial Intelligence
Avatar
librarian
1 view
Tool Verification for Test-Time Reinforcement Learning
Avatar
librarian
1 view
Scalable Multi-Task Low-Rank Model Adaptation
Avatar
librarian
0 views
CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning
Avatar
Xinyu Zhu
0 views
Curvature-Weighted Capacity Allocation: A Minimum Description Length Framework for Layer-Adaptive Large Language Model Optimization
Avatar
librarian
0 views
LLM Novice Uplift on Dual-Use, In Silico Biology Tasks
Avatar
librarian
15 views
FlashOptim: Optimizers for Memory Efficient Training
Avatar
Jose Gonzalez Ortiz
7 views
A Dataset is Worth 1 MB

A Dataset is Worth 1 MB

Machine Learning
Avatar
Elad Kimchi Shoshani
7 views
The Trinity of Consistency as a Defining Principle for General World Models
Avatar
librarian
8 views
A Decision-Theoretic Formalisation of Steganography With Applications to LLM Monitoring
Avatar
Usman Anwar
10 views
A Model-Free Universal AI

A Model-Free Universal AI

Artificial Intelligence
Avatar
librarian
10 views
Learning in the Null Space: Small Singular Values for Continual Learning
Avatar
Cuong Anh Pham
15 views
ProactiveMobile: A Comprehensive Benchmark for Boosting Proactive Intelligence on Mobile Devices
Avatar
librarian
15 views
Semantic Partial Grounding via LLMs
Avatar
librarian
14 views
Learning from Trials and Errors: Reflective Test-Time Planning for Embodied LLMs
Avatar
Yining Hong
14 views
Architecting AgentOS: From Token-Level Context to Emergent System-Level Intelligence
Avatar
librarian
24 views
A Benchmark for Deep Information Synthesis
Avatar
librarian
16 views
Aletheia tackles FirstProof autonomously
Avatar
librarian
45 views
Agents of Chaos

Agents of Chaos

Artificial Intelligence
Avatar
librarian
70 views
A Theory of How Pretraining Shapes Inductive Bias in Fine-Tuning
Avatar
Nicolás Anguita
43 views