检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:李战辉[1] 刘畅[1] 孟建熠[2] 严晓浪[1,2]
机构地区:[1]浙江大学超大规模集成电路设计研究所,杭州310027 [2]专用集成电路与系统国家重点实验室(复旦大学),上海201203
出 处:《计算机研究与发展》2015年第9期2105-2113,共9页Journal of Computer Research and Development
基 金:中央高校基本科研业务费专项基金项目(2012QNA5004)
摘 要:针对动态翻译时指令和数据高速缓存访问负荷大幅增加且增幅不均衡导致翻译器性能下降的问题,提出基于指令高速缓存与数据高速缓存访问负荷动态均衡的软硬件协同翻译方法.该方法为处理器设计高速缓存负荷平衡状态,该状态将数据高速缓存分为普通区和负荷平衡区(load balancing area,LBA),普通区缓存正常的程序数据,负荷平衡区通过负荷转化通道(load transforming channel,LTC)吸收动态翻译器调度器地址空间转换操作在指令高速缓存上产生的部分负荷,以提高数据高速缓存利用率.EEMBC(embedded microprocessor benchmark consortium)测试基准实验结果表明,在同等处理器资源的情况下,该方法将指令高速缓存访问次数平均减少35%,数据高速缓存访问次数平均减少58%,动态翻译器综合性能提高171%.Based on the fact that the highly increasing load of instruction cache and data cache has led to great performance loss for DBT(dynamic binary translator),and the out-of-balance increasing rate between instruction cache and data cache makes the situation worse,this paper proposes a hardwaresoftware-codesigned DBT acceleration mechanism that speeds up DBT performance by dynamically balancing load of instruction cache to data cache.The key idea of this mechanism is the design of the cache load balancing state for microprocessors. When microprocessor working in this state,the instruction cache stays the same as usual and the data cache is divided into two areas:normalaccessing-area and load-balancing-area.The normal-accessing-area caches regular program data just as the traditional data cache does.However,the load-balancing-area is quite different.It doesn't cache regular program data,but supports load-transforming-channel,which is used to transform and assimilate most of the instruction cache load caused by scheduler of the DBT.Main work of the scheduler is converting jump target address from source machine code space to target machine code space.Experimental results based on EEMBC(embedded microprocessor benchmark consortium)benchmarks show that the access load of instruction cache is reduced by 35%,data cache is reduced by58%,and overall performance of the QEMU(quick emulator)DBT is improved by 171%.
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.179