Review of Image Data Augmentation in Computer Vision (机器视觉应用中的图像数据增广综述)

Cited by: 30

Authors: LIN Chengchuang (林成创); SHAN Chun (单纯); ZHAO Gansen (赵淦森) [1,4,5]; YANG Zhirong (杨志荣); PENG Jing (彭璟); CHEN Shaojie (陈少洁); HUANG Runhua (黄润桦) [1,4,5]; LI Zhuangwei (李壮伟); YI Xusheng (易序晟); DU Jiahua (杜嘉华); LI Shuangyin (李双印); LUO Haoyu (罗浩宇); FAN Xiaomao (樊小毛); CHEN Bingchuan (陈冰川)

Affiliations: [1] School of Computer Science, South China Normal University, Guangzhou 510631, China [2] School of Electronics and Information, Guangdong Polytechnic Normal University, Guangzhou 510665, China [3] Norwegian University of Science and Technology, Trondheim 17491, Norway [4] Key Lab on Cloud Security and Assessment Technology of Guangzhou, Guangzhou 510631, China [5] South China Normal University & VeChain Joint Lab on BlockChain Technology and Application, Guangzhou 510631, China [6] School of Statistics and Mathematics, Guangdong University of Finance and Economics, Guangzhou 510320, China

Source: Journal of Frontiers of Computer Science and Technology (《计算机科学与探索》), 2021, Issue 4, pp. 583-611 (29 pages)

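Funding: National Key Research and Development Program of China (2018YFB1404402, 2018YFB1802402); Key-Area Research and Development Program of Guangdong Province (2019B010137003); Science and Technology Planning Project of Guangdong Province (2016B030305006, 2018A07071702); Science and Technology Program of Guangzhou (201804010314, 201802030004); VeChain Foundation project (SCNU-2018-01).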

Abstract: Deep learning is currently the state-of-the-art solution for computer vision, and large, high-quality training datasets are the basic guarantee for applying it to computer vision problems. Collecting and accurately labeling image datasets is an extremely time-consuming and expensive process, and the problem becomes more pronounced as computer vision applications spread. Image augmentation is an effective technique for training deep learning models on small or low-quality training data, and it has developed alongside deep learning and computer vision. This paper systematically reviews current image augmentation research and analyzes its research paradigms from four perspectives: augmentation objects, augmentation spaces, label processing, and augmentation strategy generation. Guided by these paradigms, it proposes a taxonomy of existing image augmentation methods and presents the representative work in each category. Finally, it summarizes existing image augmentation research, points out the problems in current work, and outlines future trends.

Keywords: deep learning; computer vision; image augmentation; data augmentation; image enhancement

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]
