Artificial Intelligence

Blue Data Intelligence Layer: Streaming Data and Agents for Multi-source Multi-modal Data-Centric Applications
Avatar
librarian
12 views
RadAgent: A tool-using AI agent for stepwise interpretation of chest computed tomography
Avatar
librarian
10 views
Diagnosing LLM Judge Reliability: Conformal Prediction Sets and Transitivity Violations
Avatar
librarian
12 views
How Do LLMs and VLMs Understand Viewpoint Rotation Without Vision? An Interpretability Study
Avatar
librarian
16 views
Generalization in LLM Problem Solving: The Case of the Shortest Path
Avatar
librarian
9 views
Discovering Novel LLM Experts via Task-Capability Coevolution
Avatar
librarian
10 views
An Axiomatic Benchmark for Evaluation of Scientific Novelty Metrics
Avatar
librarian
11 views
IG-Search: Step-Level Information Gain Rewards for Search-Augmented Reasoning
Avatar
librarian
11 views
Memory Transfer Learning: How Memories are Transferred Across Domains in Coding Agents
Avatar
librarian
18 views
TREX: Automating LLM Fine-tuning via Agent-Driven Tree-based Exploration
Avatar
librarian
30 views
GeoAgentBench: A Dynamic Execution Benchmark for Tool-Augmented Agents in Spatial Analysis
Avatar
librarian
23 views
AI-Assisted Peer Review at Scale: The AAAI-26 AI Review Pilot
Avatar
Joydeep Biswas
27 views
Hierarchical Reinforcement Learning with Runtime Safety Shielding for Power Grid Operation
Avatar
librarian
25 views
BEAM: Bi-level Memory-adaptive Algorithmic Evolution for LLM-Powered Heuristic Design
Avatar
librarian
20 views
Cycle-Consistent Search: Question Reconstructability as a Proxy Reward for Search Agent Training
Avatar
librarian
19 views
DocSeeker: Structured Visual Reasoning with Evidence Grounding for Long Document Understanding
Avatar
librarian
14 views
Transferable Expertise for Autonomous Agents via Real-World Case-Based Learning
Avatar
librarian
15 views
RePAIR: Interactive Machine Unlearning through Prompt-Aware Model Repair
Avatar
librarian
22 views
RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time
Avatar
Haozhe Wang
25 views
Context Kubernetes: Declarative Orchestration of Enterprise Knowledge for Agentic AI Systems
Avatar
Charafeddine Mouzouni
25 views
Retrieval Is Not Enough: Why Organizational AI Needs Epistemic Infrastructure
Avatar
librarian
19 views
GenTac: Generative Modeling and Forecasting of Soccer Tactics
Avatar
Weidi Xie
21 views
Detecting Safety Violations Across Many Agent Traces
Avatar
librarian
19 views
VeriSim: A Configurable Framework for Evaluating Medical AI Under Realistic Patient Noise
Avatar
Sina Mansouri
15 views
Tracing the Roots: A Multi-Agent Framework for Uncovering Data Lineage in Post-Training LLMs
Avatar
librarian
18 views
From Perception to Planning: Evolving Ego-Centric Task-Oriented Spatiotemporal Reasoning via Curriculum Learning
Avatar
librarian
16 views
Agent^2 RL-Bench: Can LLM Agents Engineer Agentic RL Post-Training?
Avatar
librarian
16 views
From Safety Risk to Design Principle: Peer-Preservation in Multi-Agent LLM Systems and Its Implications for Orchestrated Democratic Discourse Analysis
Avatar
librarian
39 views
Ads in AI Chatbots? An Analysis of How Large Language Models Navigate Conflicts of Interest
Avatar
Addison Wu
21 views
SkillClaw: Let Skills Evolve Collectively with Agentic Evolver
Avatar
librarian
82 views
KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation
Avatar
Zhengxi Lu
22 views
SUPERNOVA: Eliciting General Reasoning in LLMs with Reinforcement Learning on Natural Instructions
Avatar
librarian
20 views