Computation and Language

Reinforcement Learning with Metacognitive Feedback Elicits Faithful Uncertainty Expression in LLMs
librarian
11 views
Why Multi-Step Tool-Use Reinforcement Learning Collapses and How Supervisory Signals Fix It
abcdezzy688
32 views
Staying In Character: Perspective-Bounded Memory For Book-Based Role-Playing Agents
Xushuo Tang
39 views
Toward Generalist Autonomous Research via Hypothesis-Tree Refinement
Jiajie Jin
56 views
End-to-End Context Compression at Scale
librarian
51 views
SPADE-Bench: Evaluating Spontaneous Strategic Deception in Agents via Plan-Action Divergence
librarian
67 views
Rethinking Memory as Continuously Evolving Connectivity
librarian
83 views
MeMo: Memory as a Model
Ryan Quek
117 views
The Impossibility Triangle of Long-Context Modeling
librarian
89 views
GiVA: Gradient-Informed Bases for Vector-Based Adaptation
Neeraj Gangwar
105 views
A Multimodal Text- and Graph-Based Approach for Open-Domain Event Extraction from Documents
librarian
122 views
Chat2Workflow: A Benchmark for Generating Executable Visual Workflows with Natural Language
librarian
116 views
CD2CR: Co-reference Resolution Across Documents and Domains
k-m-smit2
114 views
Demystifying OPD: Length Inflation and Stabilization Strategies for Large Language Models
librarian
168 views
ClawBench: Can AI Agents Complete Everyday Online Tasks?
librarian
154 views
Synthetic Sandbox for Training Machine Learning Engineering Agents
Yuhang Zhou
162 views
Grounded Token Initialization for New Vocabulary in LMs for Generative Recommendation
Daiwei Chen
170 views
AstroConcepts: A Large-Scale Multi-Label Classification Corpus for Astrophysics
librarian
126 views
AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents
librarian
146 views
F2LLM-v2: Inclusive, Performant, and Efficient Embeddings for a Multilingual World
librarian
226 views
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation
librarian
212 views
Learning When to Attend: Conditional Memory Access for Long-Context LLMs
Aditya Chattopadhyay
149 views
Knowledge Distillation with Structured Chain-of-Thought for Text-to-SQL
Khushboo Thaker
148 views
SciMDR: Benchmarking and Advancing Scientific Multimodal Document Reasoning
librarian
173 views
Instruction set for the representation of graphs
Ezequiel López-Rubio
143 views
Monitoring Emergent Reward Hacking During Generation via Internal Activations
librarian
153 views
CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning
Xinyu Zhu
163 views
Team of Thoughts: Efficient Test-time Scaling of Agentic Systems through Orchestrated Tool Calling
librarian
164 views
Attention Is All You Need
Dr. Murat ALTUN
173 views
Multi-LLM Thematic Analysis with Dual Reliability Metrics: Combining Cohen's Kappa and Semantic Similarity for Qualitative Research Validation
Nilesh Jain
171 views
UltraLogic: Enhancing LLM Reasoning through Large-Scale Data Synthesis and Bipolar Float Reward
librarian
234 views
Attention Is All You Need
Salman
249 views