The Sunway TaihuLight supercomputer: system and applications  Cited by: 63

Authors: Haohuan FU, Junfeng LIAO, Jinzhe YANG, Lanning WANG, Zhenya SONG, Xiaomeng HUANG, Chao YANG, Wei XUE, Fangfang LIU, Fangli QIAO, Wei ZHAO, Xunqiang YIN, Chaofeng HOU, Chenglong ZHANG, Wei GE, Jian ZHANG, Yangang WANG, Chunbo ZHOU, Guangwen YANG

Affiliations: [1] Ministry of Education Key Laboratory for Earth System Modeling, and Center for Earth System Science, Tsinghua University, Beijing 100084, China [2] Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China [3] National Supercomputing Center in Wuxi, Wuxi 214072, China [4] College of Global Change and Earth System Science, Beijing Normal University, Beijing 100875, China [5] Institute of Software, Chinese Academy of Sciences, Beijing 100190, China [6] First Institute of Oceanography, State Oceanic Administration, Qingdao 266061, China [7] Institute of Process Engineering, Chinese Academy of Sciences, Beijing 100190, China [8] Computer Network Information Center, Chinese Academy of Sciences, Beijing 100190, China

Source: Science China (Information Sciences), 2016, Issue 7, pp. 109-124 (16 pages in total); 中国科学(信息科学)(英文版) (English edition)

Abstract: The Sunway TaihuLight supercomputer is the world's first system with a peak performance greater than 100 PFlops. In this paper, we provide a detailed introduction to the TaihuLight system. In contrast with other existing heterogeneous supercomputers, which include both CPU processors and PCIe-connected many-core accelerators (NVIDIA GPU or Intel Xeon Phi), the computing power of TaihuLight is provided by a homegrown many-core SW26010 CPU that includes both the management processing elements (MPEs) and computing processing elements (CPEs) in one chip. With 260 processing elements in one CPU, a single SW26010 provides a peak performance of over three TFlops. To alleviate the memory bandwidth bottleneck in most applications, each CPE comes with a scratch pad memory, which serves as a user-controlled cache. To support the parallelization of programs on the new many-core architecture, in addition to the basic C/C++ and Fortran compilers, the system provides a customized Sunway OpenACC tool that supports the OpenACC 2.0 syntax. This paper also reports our preliminary efforts on developing and optimizing applications on the TaihuLight system, focusing on key application domains, such as earth system modeling, ocean surface wave modeling, atomistic simulation, and phase-field simulation.
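The "user-controlled cache" idea in the abstract can be illustrated with a small, portable sketch in plain C. This is not Sunway code and does not use any Sunway API: the function name `axpy_tiled`, the tile size `TILE`, and the use of `memcpy` to stand in for transfers between main memory and a scratch pad are all assumptions made for illustration. The point it shows is only that the program, rather than the hardware, decides which data is staged into the small fast buffer and when it is written back.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical tile size (elements staged per pass). The actual scratch pad
   capacity is not stated in the abstract. */
#define TILE 256

/* Computes y = a*x + y one tile at a time. The two small local arrays stand
   in for a software-managed scratch pad; memcpy stands in for the explicit
   transfers a real system would perform. */
void axpy_tiled(size_t n, double a, const double *x, double *y)
{
    double x_local[TILE];
    double y_local[TILE];

    assert(x != NULL && y != NULL);

    for (size_t start = 0; start < n; start += TILE) {
        /* The last tile may be shorter than TILE. */
        size_t len = (n - start < TILE) ? (n - start) : TILE;

        /* Explicit copy-in: the program chooses what gets staged. */
        memcpy(x_local, x + start, len * sizeof(double));
        memcpy(y_local, y + start, len * sizeof(double));

        /* Compute only on the staged data. */
        for (size_t i = 0; i < len; ++i)
            y_local[i] = a * x_local[i] + y_local[i];

        /* Explicit copy-out of the finished tile. */
        memcpy(y + start, y_local, len * sizeof(double));
    }
}
```

Calling `axpy_tiled(n, a, x, y)` on two arrays of length `n` updates `y` in place. On a machine with a hardware cache the staging copies are pure overhead; the sketch is meant only to show the programming pattern that a scratch pad memory implies.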

Keywords: supercomputer; many-core; high performance computing; scientific computing; computer architecture

Classification number: TP338 [Automation and Computer Technology: Computer System Architecture]

 
