Authors: LI Jia'nan (李嘉楠), HAN Lin (韩林), CHAI Yunda (柴赟达)
Affiliations: [1] School of Information Engineering, Zhengzhou University, Zhengzhou 450000, China; [2] National Supercomputing Center in Zhengzhou, Zhengzhou 450000, China
Source: Computer Engineering (《计算机工程》), 2022, No. 1, pp. 142-148 (7 pages)
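Funding: National Key Research and Development Program of China, "Research on Key Technologies of a Management and Sharing Service System for Global Earth Observation Results" (2018YFB0505000).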
Abstract: Automatic vectorization is an important means of exploiting SIMD extension units and is already implemented in the LLVM compiler. However, differences in vector length and instruction set functionality mean that, on domestic (Chinese-developed) platforms, automatic vectorization tends to miss vectorization opportunities or to produce a slowdown after vectorization. To make full use of SIMD, this paper refines the instruction cost information according to the instruction set features of domestic platforms, improving the accuracy of the profitability analysis so that automatic vectorization generates concise, efficient vector instructions that the back end supports. On this basis, it proposes an improved control-flow vectorization method that adds instruction cost information to make automatic vectorization more adaptable, yielding an LLVM-based automatic vectorization system for domestic platforms. Experimental results show that, compared with the state before automatic vectorization was ported, porting and optimizing with this method improves overall performance on the SPEC tests by 10.8%, and raises the speedup on the TSVC test suite by 16%, the speedup under precise cost guidance by 42%, and the speedup under control-flow vectorization by 51%.
Keywords: automatic vectorization; vectorization benefit; porting; LLVM compiler; domestic platform
Classification number: TP314 [Automation and Computer Technology: Computer Software and Theory]
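Illustrative note: the abstract's point about instruction cost information and profitability analysis can be sketched with a toy cost model. This is only a minimal sketch, not the paper's method and not LLVM's actual API. The operation names, the cost tables, and every number in them are hypothetical, chosen to show how an underestimated vector cost can make a slowdown look profitable and how a platform-tuned cost can reverse that decision.

```python
from typing import Dict, List

# Hypothetical per-operation costs (arbitrary units); none come from the paper.
SCALAR_COST: Dict[str, float] = {"load": 1.0, "gather": 1.0, "mul": 1.0, "store": 1.0}

# A generic vector cost table that assumes a cheap hardware gather.
GENERIC_VECTOR_COST: Dict[str, float] = {"load": 1.0, "gather": 2.0, "mul": 1.0, "store": 1.0}

# A platform-tuned table for an imagined target with no native gather,
# where the back end would have to scalarize it.
TUNED_VECTOR_COST: Dict[str, float] = {"load": 1.0, "gather": 12.0, "mul": 1.0, "store": 1.0}


def loop_cost(ops: List[str], table: Dict[str, float]) -> float:
    """Sum the cost of one loop-body iteration under the given cost table."""
    return sum(table[op] for op in ops)


def estimated_speedup(ops: List[str], vector_table: Dict[str, float], vf: int) -> float:
    """Cost of vf scalar iterations divided by the cost of one vector iteration."""
    scalar_total = loop_cost(ops, SCALAR_COST) * vf
    vector_total = loop_cost(ops, vector_table)
    return scalar_total / vector_total


def should_vectorize(ops: List[str], vector_table: Dict[str, float], vf: int) -> bool:
    """Vectorize only when the model predicts a speedup greater than 1."""
    return estimated_speedup(ops, vector_table, vf) > 1.0


if __name__ == "__main__":
    body = ["gather", "mul", "store"]  # hypothetical loop body
    for name, table in (("generic", GENERIC_VECTOR_COST), ("tuned", TUNED_VECTOR_COST)):
        print(name, round(estimated_speedup(body, table, vf=4), 3),
              should_vectorize(body, table, vf=4))
```

With the generic table the model predicts a speedup and vectorizes the loop; with the tuned table the same loop is rejected, which is the kind of slowdown the abstract says more precise cost information helps avoid.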