Computer Science

RE-IMAGINE: Symbolic Benchmark Synthesis for Reasoning Evaluation
librarian
3 views
Dense SAE Latents Are Features, Not Bugs
librarian
5 views
Exploring and Exploiting the Inherent Efficiency within Large Reasoning Models for Self-Guided Efficiency Enhancement
librarian
3 views
SwarmAgentic: Towards Fully Automated Agentic System Generation via Swarm Intelligence
Yao Zhang
3 views
Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence
librarian
3 views
Doppelgänger Method: Breaking Role Consistency in LLM Agent via Prompt-based Transferable Adversarial Attack
librarian
1 view
GUI-Robust: A Comprehensive Dataset for Testing GUI Agent Robustness in Real-World Anomalies
Unknown Unknown
4 views
TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization
Mingkang Zhu
3 views
On the Hardness of Bandit Learning
librarian
2 views
From Points to Places: Towards Human Mobility-Driven Spatiotemporal Foundation Models via Understanding Places
Mohammad Hashemi
2 views
AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes
librarian
2 views
Optimizing Length Compression in Large Reasoning Models
Tianyi Zhou
2 views
Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model
librarian
4 views
TimeMaster: Training Time-Series Multimodal LLMs to Reason via Reinforcement Learning
Junru Zhang
13 views
Avoiding Obfuscation with Prover-Estimator Debate
librarian
1 view
Weakest Link in the Chain: Security Vulnerabilities in Advanced Reasoning Models
librarian
3 views
PB$^2$: Preference Space Exploration via Population-Based Methods in Preference-Based Reinforcement Learning
librarian
9 views
AutoMind: Adaptive Knowledgeable Agent for Automated Data Science
librarian
29 views
GenPlanX. Generation of Plans and Execution
librarian
24 views