Guiding Yourself with Your Own Insights: Student-Driven Knowledge Distillation

Jun 30, 2025·
Dacheng Qi
Dacheng Qi
,
Huayu Zhang
,
Yufeng Wang
,
Shuangkang Fang
,
Zehao Zhang
,
Zesheng Wang
,
Wenrui Ding
· 1 min read
PDF
Oral Presentation
Image credit: Unsplash
Abstract
We recognize the significance of preserving structural consistency to enhance knowledge transfer efficiency between networks. Leveraging this insight, we introduce a novel approach termed Student-Driven Knowledge Distillation (SDKD), which integrates a proxy teacher intermediary between the primary teacher and student model. Specifically, we construct the architecture of the proxy teacher entirely based on the student network to generate logits that closely align with the distribution of the student network. Besides, we propose a Feature Fusion Block (FFB) to integrate features from the teacher network into the proxy teacher. FFB can not only provide high-quality feature-based knowledge for distillation but also impart response-based knowledge to facilitate the learning process.
Type
Publication
2025 IEEE International Conference on Multimedia and Expo
Status
Peer-reviewed Open access
Awards
Oral Presentation
ICME · 2025
License
CC-BY-4.0
publications

Please visit the official ICME page for details.

Dacheng Qi
Authors
Dacheng Qi (he/him)
PhD Student

I am a Ph.D. student in Computing & Data Science at The University of Hong Kong, supervised by Prof. Yi Ma and Prof. Shenghua Gao. I received my M.S. degree from Beihang University under the supervision of Prof. Wenrui Ding and Researcher Yufeng Wang.

My research interests lie in generative methods for 3D design and manufacturing, especially parametric CAD synthesis and its integration with multimodal large language models. Before this, I worked on 3D generation.