Computation and Language

F2LLM-v2: Inclusive, Performant, and Efficient Embeddings for a Multilingual World
Avatar
librarian
6 views
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation
Avatar
librarian
14 views
Learning When to Attend: Conditional Memory Access for Long-Context LLMs
Avatar
Aditya Chattopadhyay
6 views
Knowledge Distillation with Structured Chain-of-Thought for Text-to-SQL
Avatar
Khushboo Thaker
19 views
SciMDR: Benchmarking and Advancing Scientific Multimodal Document Reasoning
Avatar
librarian
19 views
Instruction set for the representation of graphs
Avatar
Ezequiel López-Rubio
20 views
Monitoring Emergent Reward Hacking During Generation via Internal Activations
Avatar
librarian
20 views
CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning
Avatar
Xinyu Zhu
19 views
Team of Thoughts: Efficient Test-time Scaling of Agentic Systems through Orchestrated Tool Calling
Avatar
librarian
34 views
Attention Is All You Need

Attention Is All You Need

Computation and Language
Avatar
Dr. Murat ALTUN
49 views
Multi-LLM Thematic Analysis with Dual Reliability Metrics: Combining Cohen's Kappa and Semantic Similarity for Qualitative Research Validation
Avatar
Nilesh Jain
45 views
UltraLogic: Enhancing LLM Reasoning through Large-Scale Data Synthesis and Bipolar Float Reward
Avatar
librarian
109 views
Attention Is All You Need

Attention Is All You Need

Computation and Language
Avatar
Salman
91 views
Memory in the Age of AI Agents

Memory in the Age of AI Agents

Computation and Language
Avatar
librarian
142 views
Non-Resolution Reasoning: A Framework for Preserving Semantic Ambiguity in Language Models
Avatar
Kei Saito
114 views
Latent Collaboration in Multi-Agent Systems
Avatar
librarian
149 views
Generalist Foundation Models Are Not Clinical Enough for Hospital Operations
Avatar
librarian
150 views
Instella: Fully Open Language Models with Stellar Performance
Avatar
librarian
172 views
Kimi Linear: An Expressive, Efficient Attention Architecture
Avatar
librarian
261 views
Tongyi DeepResearch Technical Report

Tongyi DeepResearch Technical Report

Computation and Language
Avatar
librarian
239 views
Agent Data Protocol: Unifying Datasets for Diverse, Effective
  Fine-tuning of LLM Agents
Avatar
librarian
248 views
FlatQuant: Flatness Matters for LLM Quantization
Avatar
丰辰 何
322 views
Reinforcement Learning on Pre-Training Data
Avatar
librarian
501 views
Attention Is All You Need

Attention Is All You Need

Computation and Language
Avatar
wang tuo
449 views
FlexOlmo: Open Language Models for Flexible Data Use
Avatar
librarian
397 views
Pre-Trained Policy Discriminators are General Reward Models
Avatar
librarian
350 views
MOTIF: Modular Thinking via Reinforcement Fine-tuning in LLMs
Avatar
librarian
354 views
Answer Matching Outperforms Multiple Choice for Language Model
  Evaluation
Avatar
librarian
362 views
SynapseRoute: An Auto-Route Switching Framework on Dual-State Large
  Language Model
Avatar
librarian
405 views
On the Predictive Power of Representation Dispersion in Language Models
Avatar
librarian
423 views
STACK: Adversarial Attacks on LLM Safeguard Pipelines
Avatar
librarian
375 views
The Trilemma of Truth in Large Language Models
Avatar
Germans Savcisens
333 views