PartitionSim:一个面向众核结构的并行模拟器  被引量:2

PartitionSim: A Parallel Simulator for Many-Cores

在线阅读下载全文

作  者:焦帅[1,2] 徐卫志[1,2] 唐士斌[1,2] 范东睿[1] 孙凝晖[1] 

机构地区:[1]中国科学院计算技术研究所系统结构重点实验室,北京100190 [2]中国科学院研究生院,北京100049

出  处:《计算机学报》2011年第11期2084-2092,共9页Chinese Journal of Computers

基  金:国家自然科学基金重点项目(60736012);国家"九七三"重点基础研究发展规划项目基金(2011CB302500);国家"八六三"高技术研究发展计划项目基金(2009AA01Z103);国家杰出青年科学基金(60925009);国家自然科学基金创新研究群体科学基金(60921002);北京市自然科学基金(4092044);"核高基"国家科技重大专项(2011ZX01028-001-002)资助~~

摘  要:该文提出了一个面向众核处理器的并行模拟器:PartitionSim.PartitionSim采用了一种新颖的方法——时序分割来加速众核结构模拟.时序分割的提出基于这样的观察:在众核结构中,有些模块之间频繁交互而有的模块之间没有交互.有鉴于此,该方法将目标结构分割成两部分:交互部分和非交互部分.当模拟交互部分时,主机线程严格同步,维持时序精确.当模拟非交互部分时,主机线程通过异步运行,提高模拟速度,并且产生较小的时序损失.文中所述工作在一个16核的SMP机器上用PartitionSim模拟了千核规模的Godson-T众核结构.实验结果显示,PartitionSim展示出良好的加速比,达到最高25MIPS的模拟速度,时序损失平均值为0.92%.This paper introduces PartitionSim,a parallel simulator for future thousand-core processors with software-managed cache coherence.The purpose of PartitionSim is to improve the simulation performance of many-core architectures at the expense of little accuracy sacrifice.To achieve this goal,we propose a novel technique: timing partition.Timing partition is based on such an observation: in a target system,interacting components communicate with each other and impose simulation synchronization while non-interacting components don't communicate with each other and allow asynchronous simulation.It divides the target timing models into two groups: non-interacting group and interacting group.Non-interacting timing models are simulated by host threads that synchronize little with each other to improve speed and hurt little accuracy,while interacting timing models are simulated by host threads that synchronize strictly with each other to preserve accuracy.Using PartitionSim,We have simulated a target composed of thousands of cores on a 16-core SMP machine.The evaluation results show that PartitionSim scales well with near linear speedup and has considerable performance(up to 25MIPS) at the expense of little accuracy sacrifice(average 0.92%).

关 键 词:并行模拟 众核 Godson-T 时序分割 

分 类 号:TP302[自动化与计算机技术—计算机系统结构]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象