Authors: Jiao Shuai (焦帅) [1,2], Xu Weizhi (徐卫志) [1,2], Tang Shibin (唐士斌) [1,2], Fan Dongrui (范东睿) [1], Sun Ninghui (孙凝晖) [1]
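Affiliations: [1] Key Laboratory of Computer System and Architecture, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190; [2] Graduate University of Chinese Academy of Sciences, Beijing 100049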
Source: Chinese Journal of Computers (《计算机学报》), 2011, No. 11, pp. 2084-2092 (9 pages)
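Funding: Key Program of the National Natural Science Foundation of China (60736012); National Basic Research Program of China ("973" Program) (2011CB302500); National High Technology Research and Development Program of China ("863" Program) (2009AA01Z103); National Science Fund for Distinguished Young Scholars (60925009); Science Fund for Creative Research Groups of the National Natural Science Foundation of China (60921002); Beijing Natural Science Foundation (4092044); National Science and Technology Major Project "Core Electronic Devices, High-End Generic Chips and Basic Software" (2011ZX01028-001-002).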
Abstract: This paper introduces PartitionSim, a parallel simulator for many-core processors, aimed at future thousand-core designs with software-managed cache coherence. PartitionSim is intended to speed up many-core architecture simulation at a small cost in accuracy. To achieve this, we propose a novel technique: timing partition. Timing partition rests on the observation that, in a many-core target, some components interact frequently while others do not interact at all. Interacting components force the simulation to synchronize, whereas non-interacting components can be simulated asynchronously. The technique therefore divides the target timing models into two groups: an interacting group and a non-interacting group. Interacting timing models are simulated by host threads that synchronize strictly with one another, which preserves timing accuracy. Non-interacting timing models are simulated by host threads that run asynchronously with little synchronization, which raises simulation speed and introduces only a small timing error. Using PartitionSim on a 16-core SMP machine, we simulated the Godson-T many-core architecture at thousand-core scale. The evaluation results show that PartitionSim scales well, with near-linear speedup, and reaches a simulation speed of up to 25 MIPS with an average timing error of 0.92%.
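The two synchronization policies described in the abstract can be illustrated with a toy sketch. This is not PartitionSim's implementation, which the abstract does not detail; the `TimingModel` class, the `simulate` function, and the "core"/"router" names are hypothetical and chosen only for illustration. For simplicity the sketch runs each host thread's non-interacting models to completion first and then steps the interacting models in lock-step, whereas a real simulator would presumably interleave the two.

```python
import threading


class TimingModel:
    """Hypothetical timing model that only counts simulated cycles."""

    def __init__(self, name, interacting):
        self.name = name
        self.interacting = interacting  # True: needs strict synchronization
        self.cycle = 0

    def tick(self):
        self.cycle += 1


def simulate(models_per_thread, cycles):
    """Simulate `cycles` target cycles, one host thread per model list.

    Returns how many barrier waits each host thread performed.
    """
    n = len(models_per_thread)
    barrier = threading.Barrier(n)
    waits = [0] * n  # each thread writes only its own slot

    def host_thread(tid):
        local = [m for m in models_per_thread[tid] if not m.interacting]
        shared = [m for m in models_per_thread[tid] if m.interacting]

        # Non-interacting group: advance with no synchronization at all.
        for m in local:
            for _ in range(cycles):
                m.tick()

        # Interacting group: every host thread advances one target cycle,
        # then waits for all the others before starting the next cycle.
        for _ in range(cycles):
            for m in shared:
                m.tick()
            barrier.wait()
            waits[tid] += 1

    threads = [threading.Thread(target=host_thread, args=(t,)) for t in range(n)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return waits


if __name__ == "__main__":
    models = [
        [TimingModel("core0", False), TimingModel("router0", True)],
        [TimingModel("core1", False), TimingModel("router1", True)],
    ]
    print(simulate(models, cycles=100))
```

In this sketch the only synchronization cost is the per-cycle barrier paid by the interacting group; the non-interacting models never wait on another thread, which is the source of the speedup the abstract attributes to asynchronous simulation.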
Classification: TP302 [Automation and Computer Technology: Computer System Architecture]