EAT-NAS: elastic architecture transfer for accelerating large-scale neural architecture search  (Cited by: 2)


Authors: Jiemin FANG, Yukang CHEN, Xinbang ZHANG, Qian ZHANG, Chang HUANG, Gaofeng MENG, Wenyu LIU, Xinggang WANG

Affiliations: [1] Institute of Artificial Intelligence, Huazhong University of Science and Technology, Wuhan 430074, China; [2] School of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan 430074, China; [3] Horizon Robotics, Beijing 100089, China; [4] National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China

Source: Science China (Information Sciences), 2021, Issue 9, pp. 99-111 (13 pages)

Funding: This work was supported in part by the National Natural Science Foundation of China (NSFC) (Grant Nos. 61876212, 61976208, 61733007), Zhejiang Lab (Grant No. 2019NB0AB02), and the HUST-Horizon Computer Vision Research Center.

Abstract: Neural architecture search (NAS) methods have been proposed to relieve human experts from tedious architecture engineering. However, most current methods are constrained to small-scale search owing to their huge computational resource consumption. Meanwhile, directly applying architectures searched on small datasets to large datasets often carries no performance guarantee because of the discrepancy between datasets. This limitation impedes the wide use of NAS on large-scale tasks. To overcome this obstacle, we propose an elastic architecture transfer mechanism for accelerating large-scale NAS (EAT-NAS). In our implementation, architectures are first searched on a small dataset, e.g., CIFAR-10, and the best one is chosen as the basic architecture. The search process on a large dataset, e.g., ImageNet, is then initialized with the basic architecture as the seed, which accelerates the large-scale search. We propose not only a NAS method but also a mechanism for architecture-level transfer learning. In our experiments, we obtain two final models, EATNet-A and EATNet-B, which achieve competitive accuracies of 75.5% and 75.6%, respectively, on ImageNet. Both models also surpass models searched from scratch on ImageNet under the same settings. In terms of computational cost, EAT-NAS takes fewer than 5 days on 8 TITAN X GPUs, which is significantly less than the consumption of state-of-the-art large-scale NAS methods.
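To make the seed-based transfer idea concrete, the following is a minimal, hypothetical Python sketch of an evolutionary search whose initial population is derived from a basic architecture found on a small dataset. The search-space encoding, the mutation scheme, and all names (random_architecture, perturb, evaluate, seeded_evolutionary_search) are illustrative assumptions, not the paper's actual implementation; in EAT-NAS the evaluation step would train or fine-tune candidate networks on the large dataset rather than return a random score.

```python
import random

# Toy search-space encoding (assumed): each architecture is a list of
# (operation, kernel_size, width_multiplier) tuples, one per block.
OPS = ["mbconv3", "mbconv5", "mbconv7", "skip"]
KERNELS = [3, 5, 7]
WIDTHS = [0.75, 1.0, 1.25]
NUM_BLOCKS = 12


def random_architecture():
    """Sample an architecture uniformly from the toy search space."""
    return [(random.choice(OPS), random.choice(KERNELS), random.choice(WIDTHS))
            for _ in range(NUM_BLOCKS)]


def perturb(arch, mutation_prob=0.2):
    """Mutate an architecture around a seed: each block is re-sampled with a
    small probability, so offspring stay close to the seed architecture."""
    child = []
    for op, k, w in arch:
        if random.random() < mutation_prob:
            child.append((random.choice(OPS),
                          random.choice(KERNELS),
                          random.choice(WIDTHS)))
        else:
            child.append((op, k, w))
    return child


def evaluate(arch):
    """Placeholder fitness; a real NAS pipeline would train or fine-tune the
    candidate on the large dataset (e.g., ImageNet) and return accuracy."""
    return random.random()


def seeded_evolutionary_search(basic_arch, population_size=32, generations=10):
    """Initialize the large-scale search population from the architecture
    found on the small dataset, then evolve it."""
    # Seeding with mutated copies of the basic architecture (instead of
    # random samples) is what accelerates the large-scale search.
    population = [(evaluate(basic_arch), basic_arch)]
    population += [(evaluate(a), a)
                   for a in (perturb(basic_arch)
                             for _ in range(population_size - 1))]

    for _ in range(generations):
        # Keep the fitter half as parents and refill with their mutations.
        population.sort(key=lambda pair: pair[0], reverse=True)
        parents = population[: population_size // 2]
        children = [perturb(arch) for _, arch in parents]
        population = parents + [(evaluate(c), c) for c in children]

    best_acc, best_arch = max(population, key=lambda pair: pair[0])
    return best_arch, best_acc


if __name__ == "__main__":
    # The basic architecture would come from a prior search on CIFAR-10;
    # here we stand in with a random sample for demonstration.
    basic = random_architecture()
    arch, acc = seeded_evolutionary_search(basic)
    print("best architecture:", arch)
```

The key design point this sketch illustrates is that the large-scale population never starts from scratch: every individual is an elastic perturbation of the transferred seed, so early generations already lie in a promising region of the search space.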

Keywords: architecture transfer; neural architecture search; evolutionary algorithm; large-scale dataset

Classification: TP18 [Automation and Computer Technology: Control Theory and Control Engineering]

 
