片上多核中一种共享感知的数据主动推送Cache技术  被引量:3

A Sharing-Aware Active Pushing Cache Technology on Chip-Multiprocessor

在线阅读下载全文

作  者:王得利[1] 高德远[1] 

机构地区:[1]西北工业大学计算机学院,西安710072

出  处:《西安交通大学学报》2010年第10期18-23,共6页Journal of Xi'an Jiaotong University

基  金:国家自然科学基金资助项目(60773223;60736012);国家"863计划"资助项目(2009AA01Z110)

摘  要:针对片上多核处理器的二级Cache访问延时持续增加以及并行程序在运行时线程间执行速率差异大的问题,提出了一种基于共享感知的数据主动推送Cache技术(SAAPC).SAAPC技术充分考虑并行程序的系统性能由速度最慢的线程所决定这一重要特性,根据并行线程间读数据共享程度高以及共享读数据访问局部性好的特征,采用基于指令的方法来预测共享读数据流,在后行线程需要共享数据之前将其主动推送至该线程的一级Cache中去,从而减少较慢线程的数据访问延时,提高执行速率,降低较慢线程与先行线程间执行速率的差异.SAAPC技术避免了预取技术所带来的额外片外带宽增加的缺点.使用SESC模拟器对来自于SPLASH2测试程序集的5个存储敏感型并行程序进行了测试仿真,结果表明,与传统的共享Cache相比,使用SAAPC技术减少了并行线程间执行速率的差异,系统的每周期指令数平均提高了7%,最高达到13.1%.A sharing-aware active pushing Cache technology(SAAPC)is proposed to solve the problems of the increasing L2Cache latency and the high deviation of progressive rates among the simultaneous threads in parallel applications when they are running on chip multi-processors.SAAPC fully takes the important characteristic into consideration that the whole system performances of parallel applications are constrained by the slowest thread in parallel phases.Based on the high share degree of read Cache blocks among different threads and the locality of the shared read accesses,SAPPC exploits the program counter to predict the shared data streams.The shared data are actively pushed to the L1Caches of slower threads before it is needed so that the data access latencies for the slower threads are reduced and the progressive rates are increased.Therefore,the deviation of progressive rates is decreased.SAAPC avoids the problem caused by increasing off-chip bandwidth demand of the prefetch technique due to its inaccuracy.5memory intensive parallel programs from SPLASH2benchmark suit are simulated using the simulator called SESC.Experimental results and comparisons with conventional shared Cache show that the SAAPC reduces the progressive rates deviation,and the average system instruction per cycle improvement is 7%and can be up to 13.1%.

关 键 词:片上多核处理器 并行程序 共享感知 主动推送 执行速率 

分 类 号:TP302[自动化与计算机技术—计算机系统结构]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象