Fairness under uncertainty in sequential decisions

By: Michelle Seng Ah Lee, Kirtan Padh, David Watson, Niki Kilbertus, Jatinder Singh

Fair machine learning (ML) methods help identify and mitigate the risk that algorithms encode or automate social injustices. Algorithmic approaches alone cannot resolve structural inequalities, but they can support socio-technical decision systems by surfacing discriminatory biases, clarifying trade-offs, and enabling governance. Although fairness is well studied in supervised learning, many real ML applications are online and sequential, wit...

Machine Learning | April 24, 2026 11:26pm

Transferable Physics-Informed Representations via Closed-Form Head Adaptation

By: Jian Cheng Wong, Isaac Yin Chung Lai, Pao-Hsiung Chiu, Chin Chun Ooi, Abhishek Gupta, Yew-Soon Ong

Physics-informed neural networks (PINNs) have garnered significant interest for their potential in solving partial differential equations (PDEs) that govern a wide range of physical phenomena. By incorporating physical laws into the learning process, PINN models have demonstrated the ability to learn physical outcomes reasonably well. However, current PINN approaches struggle to predict or solve new PDEs effectively when there is a lack of tr...

Machine Learning | April 24, 2026 10:12am

Low-Rank Adaptation Redux for Large Models

By: Bingcong Li, Yilang Zhang, Georgios B. Giannakis

Low-rank adaptation (LoRA) has emerged as the de facto standard for parameter-efficient fine-tuning (PEFT) of foundation models, enabling the adaptation of billion-parameter networks with minimal computational and memory overhead. Despite its empirical success and the rapid proliferation of variants, it remains unclear which architectural choices, optimization techniques, and deployment constraints should guide practical method selection. This ov...

Machine Learning | April 24, 2026 2:29am

The Sample Complexity of Multicalibration

By: Natalie Collina, Jiuyao Lu, Georgy Noarov, Aaron Roth

We study the minimax sample complexity of multicalibration in the batch setting. A learner observes $n$ i.i.d. samples from an unknown distribution and must output a (possibly randomized) predictor whose population multicalibration error, measured by Expected Calibration Error (ECE), is at most $\varepsilon$ with respect to a given family of groups. For every fixed $\kappa > 0$, in the regime $|G| \le \varepsilon^{-\kappa}$, we prove that $\widetilde{\Theta}(\v...

Machine Learning | April 24, 2026 2:29am

Temporal Taskification in Streaming Continual Learning: A Source of Evaluation Instability

By: Nicolae Filat, Ahmed Hussain, Konstantinos Kalogiannis, Elena Burceanu

Streaming Continual Learning (CL) typically converts a continuous stream into a sequence of discrete tasks through temporal partitioning. We argue that this temporal taskification step is not a neutral preprocessing choice, but a structural component of evaluation: different valid splits of the same stream can induce different CL regimes and therefore different benchmark conclusions. To study this effect, we introduce a taskification-level fr...

Machine Learning | April 24, 2026 2:28am

FedSIR: Spectral Client Identification and Relabeling for Federated Learning with Noisy Labels

By: Sina Gholami, Abdulmoneam Ali, Tania Haghighi, Ahmed Arafa, Minhaj Nur Alam

Federated learning (FL) enables collaborative model training without sharing raw data; however, the presence of noisy labels across distributed clients can severely degrade learning performance. In this paper, we propose FedSIR, a multi-stage framework for robust FL under noisy labels. Unlike existing approaches that mainly rely on designing noise-tolerant loss functions or exploiting loss dynamics during training, our method leve...

Machine Learning | April 23, 2026 11:57am

Relative Entropy Estimation in Function Space: Theory and Applications to Trajectory Inference

By: Chao Wang, Luca Nepote, Giulio Franzese, Pietro Michiardi

Trajectory Inference (TI) seeks to recover latent dynamical processes from snapshot data, where only independent samples from time-indexed marginals are observed. In applications such as single-cell genomics, destructive measurements make path-space laws non-identifiable from finitely many marginals, leaving held-out marginal prediction as the dominant but limited evaluation protocol. We introduce a general framework for estimating the Kullba...

Machine Learning | April 23, 2026 5:56am

Closing the Domain Gap in Biomedical Imaging by In-Context Control Samples

By: Ana Sanchez-Fernandez, Thomas Pinetz, Werner Zellinger, Günter Klambauer

The central problem in biomedical imaging is batch effects: systematic technical variations unrelated to the biological signal of interest. These batch effects critically undermine experimental reproducibility and are the primary cause of failure of deep learning systems on new experimental batches, preventing their practical use in the real world. Despite years of research, no method has succeeded in closing this performance gap for deep le...

Machine Learning | April 23, 2026 4:05am

ParetoSlider: Diffusion Models Post-Training for Continuous Reward Control

By: Shelly Golan, Michael Finkelson, Ariel Bereslavsky, Yotam Nitzan, Or Patashnik

Reinforcement Learning (RL) post-training has become the standard for aligning generative models with human preferences, yet most methods rely on a single scalar reward. When multiple criteria matter, the prevailing practice of "early scalarization" collapses rewards into a fixed weighted sum. This commits the model to a single trade-off point at training time, providing no inference-time control over inherently conflicting goals -- such as...

Machine Learning | April 23, 2026 3:01am

Stream-CQSA: Avoiding Out-of-Memory in Attention Computation via Flexible Workload Scheduling

By: Yiming Bian, Joshua M. Akey

The scalability of long-context large language models is fundamentally limited by the quadratic memory cost of exact self-attention, which often leads to out-of-memory (OOM) failures on modern hardware. Existing methods improve memory efficiency to near-linear complexity, while assuming that the full query, key, and value tensors fit in device memory. In this work, we remove this assumption by introducing CQS Divide, an operation derived from...

Machine Learning | April 23, 2026 3:00am