Scalable Chain of Thoughts via Elastic Reasoning
Avatar
librarian
14 views
Hide & Seek: Transformer Symmetries Obscure Sharpness & Riemannian
  Geometry Finds It
Avatar
Marvin da Silva
10 views
ABKD: Pursuing a Proper Allocation of the Probability Mass in Knowledge
  Distillation via $α$-$β$-Divergence
Avatar
Zhiyong Yang
11 views
Fight Fire with Fire: Defending Against Malicious RL Fine-Tuning via
  Reward Neutralization
Avatar
Wenjun Cao
11 views
Testing Juntas Optimally with Samples
Avatar
librarian
14 views
Automating the Discovery of Partial Differential Equations in Dynamical
  Systems
Avatar
rui-carvalho
26 views
Automatically identifying ordinary differential equations from data
Avatar
rui-carvalho
27 views
Large Language Models Are Human-Level Prompt Engineers
Avatar
Nicolas Borensztein
66 views
Multi-typed Objects Multi-view Multi-instance Multi-label Learning
Avatar
Hadil Otay
116 views
Deep Learning Statistical Arbitrage
Avatar
zulee1711
390 views
Network Deconvolution

Network Deconvolution

Machine Learning
Avatar
thurst
230 views
Transformer Dissection: A Unified Understanding of Transformer's
  Attention via the Lens of Kernel
Avatar
ScienceCast Board
201 views
Predicting COVID-19 pandemic by spatio-temporal graph neural networks: A
  New Zealand's study
Avatar
Arwa Alnajashi
231 views
Backprop Diffusion is Biologically Plausible
Avatar
Isaak Bruno
523 views
Adaptive Attention Span in Transformers
Avatar
ScienceCast Board
198 views
Regression with Linear Factored Functions
Avatar
evgenii-klebanov
221 views
Is Attention All What You Need? -- An Empirical Investigation on
  Convolution-Based Active Memory and Self-Attention
Avatar
legendblackguardian
217 views
Deep Learning of Representations: Looking Forward
Avatar
Isaak Bruno
187 views
Attention that does not Explain Away
Avatar
levymoshe16
207 views
Neural Networks Regularization Through Class-wise Invariant
  Representation Learning
Avatar
sysbit-technology
185 views
Deep Learning in Computational Biology: Advancements, Challenges, and
  Future Outlook
Avatar
sureshkumar
291 views
Improving the Knowledge Gradient Algorithm
Avatar
Le Yang
231 views
A Global Multi-Unit Calibration as a Method for Large Scale IoT
  Particulate Matter Monitoring Systems Deployments
Avatar
Saverio De Vito
230 views
Lifting the Veil: Unlocking the Power of Depth in Q-learning
Avatar
Shao-Bo Lin
197 views
Trustworthy Edge Machine Learning: A Survey
Avatar
Xiaojie Wang
204 views