Authors: Wang Kai (王凯)[1,2,3], Chen Fei (陈飞)[1,2,3], Li Qiang (李强)[1,2,3], Li Xiaomin (李晓民)[1,2], An Xuejun (安学军)[1,2], Sun Ninghui (孙凝晖)[1,2]
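Affiliations: [1] Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190; [2] Key Laboratory of Computer System and Architecture (Institute of Computing Technology, Chinese Academy of Sciences), Beijing 100190; [3] Graduate University of Chinese Academy of Sciences, Beijing 100049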
Source: Journal of Computer Research and Development (《计算机研究与发展》), 2011, No. 1, pp. 1-8 (8 pages)
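Funding: Key Program of the National Natural Science Foundation of China (60633040); National High Technology Research and Development Program of China ("863" Program) (2006AA01A102)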
Abstract: A traditional high performance computer (HPC) consists of two parts: nodes and an interconnection network. Each node can be further divided into a processing unit and a node controller. The processing unit usually adopts a symmetric multi-processor (SMP) or non-uniform memory access (NUMA) structure with cache coherence. To maintain cache coherence efficiently, the number of processors in a processing unit must be kept very small. As a result, an HPC with petaflops-level performance would have tens of thousands of nodes, which places very high demands on both the latency and the bandwidth of the interconnection network. The hyper-node controller introduced in this paper can connect several processing units simultaneously, and together they form a hyper-node. Using hyper-nodes reduces the scale of the interconnection network, which lowers its design complexity and helps guarantee its performance. The key techniques in the hyper-node controller, including support for a global address space, direct memory access, remote load/store, a global hardware lock, and a multi-rail interconnection network, lower communication latency, provide sufficient bandwidth, and improve synchronization performance. The hyper-node controller is implemented on an FPGA, and a prototype system is built. Test results show that a cluster built from hyper-nodes has very low communication latency and that its communication bandwidth scales well.
Keywords: high performance computer; hyper-node controller; global address space; direct memory access; remote load/store
Classification: TP303 [Automation and Computer Technology: Computer System Architecture]