Citation: Zhang C, Wang HZ, Liu HW et al. Fine-tuning channel-pruned deep model via knowledge distillation. Journal of Computer Science and Technology, 39(6): 1238–1247, Nov. 2024. DOI: 10.1007/s11390-023-2386-8.
Deep convolutional neural networks with high performance are difficult to deploy in many real-world applications, since the computing resources of edge devices such as smartphones or embedded GPUs are limited. To alleviate this hardware limitation, compressing deep neural networks on the model side becomes important. As one of the most popular methods, channel pruning can effectively remove redundant convolutional channels from a CNN (convolutional neural network) without significantly degrading the network's performance. Existing methods focus on the pruning design, i.e., on evaluating the importance of the different convolutional filters in the CNN model, whereas a fast and effective fine-tuning method that restores the accuracy lost during pruning is still urgently needed. In this paper, we propose a fine-tuning method, KDFT (Knowledge Distillation Based Fine-Tuning), which improves the accuracy of fine-tuned models with almost negligible training overhead by introducing knowledge distillation. Extensive experimental results on benchmark datasets with representative CNN models show that up to 4.86% accuracy improvement and 79% time saving can be obtained.
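The core idea described above is to guide the fine-tuning of the channel-pruned network with the soft predictions of the original, unpruned network. The following is a minimal sketch of one common way to realize such distillation-based fine-tuning in PyTorch; it is not the authors' implementation of KDFT, and the temperature T, the weight alpha, and the function names are illustrative assumptions.

```python
# Sketch: fine-tuning a channel-pruned "student" model with knowledge
# distillation from the original unpruned "teacher" model.
# T (temperature) and alpha are illustrative hyper-parameter assumptions.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Weighted sum of the soft-target KL term and the hard-label cross-entropy."""
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=1),
        F.softmax(teacher_logits / T, dim=1),
        reduction="batchmean",
    ) * (T * T)  # rescale so gradient magnitude matches the hard loss
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard

def fine_tune_epoch(student, teacher, loader, optimizer, device="cuda"):
    student.train()
    teacher.eval()  # the unpruned model only provides soft targets
    for images, labels in loader:
        images, labels = images.to(device), labels.to(device)
        with torch.no_grad():
            teacher_logits = teacher(images)
        loss = distillation_loss(student(images), teacher_logits, labels)
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
```

In this formulation the teacher runs only in inference mode, which is one reason the extra cost of the distillation term during fine-tuning can remain small.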