检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:冯国富[1] 董小社[1] 丁彦飞[1] 王旭昊[1]
机构地区:[1]西安交通大学计算机科学与技术系,西安710049
出 处:《西安交通大学学报》2009年第2期1-5,共5页Journal of Xi'an Jiaotong University
基 金:国家高技术研究发展计划资助项目(2006AA01A109;2004AA111110);国家自然科学基金资助项目(60773118)
摘 要:针对Cell宽带引擎架构(CBEA)多核高性能处理器要求软件显式地对分层存储结构进行管理,带来架构的可编程性及性能等问题,提出了一种基于CBEA的异构多核访存技术.将CBEA访存分为批量访存和按需访存;通过合理部署数据缓冲区来减小批量访存计算中的片内访存开销,利用支持粗粒度访问的软件管理cache及数据预取来降低按需访存的片外访存开销;以访存接口库的方式来改善软件的可编程性.实验结果表明,所提技术的访存接口库在批量访存方式下的性能比ALF和CellSs提高了30%~50%,按需访存中软件管理cache性能比CBE软件开发工具包提高了20%~30%,4路数据预取访存比单路缓存的性能提高约50%.A memory access technology is proposed to solve the problems of programming and performance caused by explicit management for multi-level memory hierarchies of Cell broadband engine architecture (CBEA). The access to main memory on CBEA is classified into two major types: bulk data transfer and on-demand data transfer. For bulk data transfer, the overhead of on-chip memory access is reduced by proper organization of data buffer. Meanwhile, a softwaremanaged cache supporting coarse-grained access and data pre-fetch are adopted to enhance the performance of on-demand off-chip data transfer. In order to facilitate programming, a memory access library based on the proposed methods is implemented. Experimental results show that the performance of applications based on the proposed bulk data transfer technology is about 30%- 50% higher than that of applications based on ALF and CellSs. For on-demand data transfer, the proposed software-managed cache performs about 20%-30% better than that of CBE SDK does, and the data prefetch technology based on four-buffer gets about 50% performance increase compared to single-buffer cache.
关 键 词:异构多核 访存技术 分层存储结构 Cell宽带引擎架构
分 类 号:TP393[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.249