Science Cast

Efficient Dataset Distillation through Alignment with Smooth and High-Quality Expert Trajectories

Jiyuan Shen, October 17, 2023 8:05am

This paper is a preprint and has not been certified by peer review.

arXiv, October 16, 2023 12:00am

Authors

Jiyuan Shen, Wenzhuo Yang, Kwok-Yan Lam

Abstract

Training a large, state-of-the-art machine learning model typically requires large-scale datasets, which makes training and parameter tuning expensive and time-consuming. Dataset Distillation (DD) is a data-efficient alternative: it distils the information in a real-world dataset into a tiny, compact synthetic dataset that can still train a well-performing model. Despite recent progress in this field, existing methods still underperform and cannot effectively replace large datasets. In this paper, unlike previous methods that focus solely on improving the efficacy of student distillation, we are the first to recognize the important interplay between expert and student. We argue that the smoothness of the expert has a significant impact on subsequent dataset distillation when more potent expert trajectories are employed. Based on this, we integrate a clipping loss and a gradient penalty to regulate the rate of parameter change in expert trajectories. Furthermore, because distillation is sensitive to randomly initialized variables, we propose representative initialization for the synthetic dataset and a balanced inner-loop loss. Finally, we present two enhancement strategies, intermediate matching loss and weight perturbation, to mitigate the accumulation of errors. We conduct extensive experiments on datasets of different scales, sizes, and resolutions. The results demonstrate that the proposed method significantly outperforms prior methods.
