Multi-exit self-distillation with appropriate teachers  

在线阅读下载全文

作  者:Wujie SUN Defang CHEN Can WANG Deshi YE Yan FENG Chun CHEN 

机构地区:[1]College of Computer Science and Technology,Zhejiang University,Hangzhou 310000,China

出  处:《Frontiers of Information Technology & Electronic Engineering》2024年第4期585-599,共15页信息与电子工程前沿(英文版)

基  金:supported by the National Natural Science Foundation of China(No.U1866602);the Starry Night Science Fund of Zhejiang University Shanghai Institute for Advanced Study,China(No.SN-ZJU-SIAS-001)。

摘  要:Multi-exit architecture allows early-stop inference to reduce computational cost,which can be used in resource-constrained circumstances.Recent works combine the multi-exit architecture with self-distillation to simultaneously achieve high efficiency and decent performance at different network depths.However,existing methods mainly transfer knowledge from deep exits or a single ensemble to guide all exits,without considering that inappropriate learning gaps between students and teachers may degrade the model performance,especially in shallow exits.To address this issue,we propose Multi-exit self-distillation with Appropriate TEachers(MATE)to provide diverse and appropriate teacher knowledge for each exit.In MATE,multiple ensemble teachers are obtained from all exits with different trainable weights.Each exit subsequently receives knowledge from all teachers,while focusing mainly on its primary teacher to keep an appropriate gap for efficient knowledge transfer.In this way,MATE achieves diversity in knowledge distillation while ensuring learning efficiency.Experimental results on CIFAR-100,TinyImageNet,and three fine-grained datasets demonstrate that MATE consistently outperforms state-of-the-art multi-exit self-distillation methods with various network architectures.

关 键 词:Multi-exit architecture Knowledge distillation Learning gap 

分 类 号:TP181[自动化与计算机技术—控制理论与控制工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象