异构平台多流编程机制的性能模型研究  

Performance modeling of multi-stream programming mechanism on heterogeneous platforms

在线阅读下载全文

作  者:彭林[1] 张鹏[1] 方建滨[1] 黄春[1] 唐滔[1] PENG Lin;ZHANG Peng;FANG Jian-bin;HUANG Chun;TANG Tao(School of Computer,National University of Defense Technology,Changsha 410073,China)

机构地区:[1]国防科技大学计算机学院

出  处:《计算机工程与科学》2019年第7期1145-1154,共10页Computer Engineering & Science

基  金:国家重点研发计划(2017YFB0202004)

摘  要:多流编程机制为异构众核加速器提供流水、资源划分等多种资源使用方式,但如何选择有效使用方式目前缺乏指导。基于异构众核处理器Intel MIC上的hStreams,提出了针对单应用多流程序多硬件分区执行的性能模型,分析不同配置下多流程序性能差异的原因,指出了影响多流程序性能的关键因素,提出多流程序划分优化策略,同时所提性能模型能够帮助判断算法实现的效果。实验结果表明,性能模型与多流配置实际测试结果误差小于1%,根据性能模型指导调优稠密矩阵乘的多流程序,比单流程序获得了5.83%的性能提升。Multi-stream programming mechanism can fully provide a variety of resource utilization methods such as pipelining and resource partitioning for heterogeneous many-core accelerators,but there is currently no effective guidance on how to choose effective resource utilization methods.Based on hStreams on heterogeneous many-core processor Intel MIC,we design a performance model for multi-stream program’s multi-hardware partitioning execution.Based on our performance model,we can identify the reasons for the performance difference of multi-stream programs under varied configurations,find out key factors that affect the performance,and provide a partitioning optimization strategy for multi-stream programs.In addition,it can also judge the effect of algorithm implementation.Our evaluation results show that the error between the estimated results of the performance model and the actual test results of multi-stream configuration is within 1%.Compared to the single-streamed version,our model also realizes a 5.83%performance improvement when guiding multi-stream programs of the dense matrix multiplication.

关 键 词:多流编程 流水线 资源划分 hStreams 异构平台 

分 类 号:TP302[自动化与计算机技术—计算机系统结构]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象