Machine Learning

UMA: A Family of Universal Models for Atoms
librarian
1 view
Optimising 4th-Order Runge-Kutta Methods: A Dynamic Heuristic Approach for Efficiency and Low Storage
Gavin Goodship
8 views
Flow-Based Single-Step Completion for Efficient and Expressive Policy Learning
librarian
446 views
AutoRule: Reasoning Chain-of-thought Extracted Rule-based Rewards Improve Preference Learning
Tevin Wang
13 views
Dense SAE Latents Are Features, Not Bugs
librarian
12 views
TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization
Mingkang Zhu
18 views
On the Hardness of Bandit Learning
librarian
7 views
TimeMaster: Training Time-Series Multimodal LLMs to Reason via Reinforcement Learning
Junru Zhang
19 views
Rethinking Losses for Diffusion Bridge Samplers
librarian
31 views