Expressive Score-Based Priors for Distribution Matching with Geometry-Preserving Regularization

0upvotes

By: Ziyu Gong, Jim Lim, David I. Inouye

Distribution matching (DM) is a versatile domain-invariant representation learning technique that has been applied to tasks such as fair classification, domain adaptation, and domain translation. Non-parametric DM methods struggle with scalability and adversarial DM approaches suffer from instability and mode collapse. While likelihood-based methods are a promising alternative, they often impose unnecessary biases through fixed priors or requ... more

Machine LearningJune 18, 2025 2:57am

Comments (0)
Views (8)

TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization

0upvotes

By: Mingkang Zhu, Xi Chen, Zhongdao Wang, Bei Yu, Hengshuang Zhao, Jiaya Jia

Recent advancements in reinforcement learning from human feedback have shown that utilizing fine-grained token-level reward models can substantially enhance the performance of Proximal Policy Optimization (PPO) in aligning large language models. However, it is challenging to leverage such token-level reward as guidance for Direct Preference Optimization (DPO), since DPO is formulated as a sequence-level bandit problem. To address this challen... more

Machine LearningJune 18, 2025 2:53am

Comments (0)
Views (6)

Towards Desiderata-Driven Design of Visual Counterfactual Explainers

0upvotes

By: Sidney Bender, Jan Herrmann, Klaus-Robert Müller, Grégoire Montavon

Visual counterfactual explainers (VCEs) are a straightforward and promising approach to enhancing the transparency of image classifiers. VCEs complement other types of explanations, such as feature attribution, by revealing the specific data transformations to which a machine learning model responds most strongly. In this paper, we argue that existing VCEs focus too narrowly on optimizing sample quality or change minimality; they fail to cons... more

Machine LearningJune 18, 2025 2:53am

5 SciCasts by .

Comments (0)
Views (18)

TimeMaster: Training Time-Series Multimodal LLMs to Reason via Reinforcement Learning

0upvotes

By: Junru Zhang, Lang Feng, Xu Guo, Yuhan Wu, Yabo Dong, Duanqing Xu

Time-series reasoning remains a significant challenge in multimodal large language models (MLLMs) due to the dynamic temporal patterns, ambiguous semantics, and lack of temporal priors. In this work, we introduce TimeMaster, a reinforcement learning (RL)-based method that enables time-series MLLMs to perform structured, interpretable reasoning directly over visualized time-series inputs and task prompts. TimeMaster adopts a three-part structu... more

Machine LearningJune 17, 2025 2:39am

Comments (0)
Views (9)

Attribution-guided Pruning for Compression, Circuit Discovery, and Targeted Correction in LLMs

0upvotes

By: Sayed Mohammad Vakilzadeh Hatefi, Maximilian Dreyer, Reduan Achtibat, Patrick Kahardipraja, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin

Large Language Models (LLMs) are central to many contemporary AI applications, yet their extensive parameter counts pose significant challenges for deployment in memory- and compute-constrained environments. Recent works in eXplainable AI (XAI), particularly on attribution methods, suggest that interpretability can also enable model compression by identifying and removing components irrelevant to inference. In this paper, we leverage Layer-wi... more

Machine LearningJune 17, 2025 2:38am

Comments (0)
Views (13)

Discrete Diffusion in Large Language and Multimodal Models: A Survey

0upvotes

By: Runpeng Yu, Qi Li, Xinchao Wang

In this work, we provide a systematic survey of Discrete Diffusion Language Models (dLLMs) and Discrete Diffusion Multimodal Language Models (dMLLMs). Unlike autoregressive (AR) models, dLLMs and dMLLMs adopt a multi-token, parallel decoding paradigm using full attention and a denoising-based generation strategy. This paradigm naturally enables parallel generation, fine-grained output controllability, and dynamic, response-aware perception. T... more

Machine LearningJune 17, 2025 2:38am