supported in part by the U.S.National Science Foundation under Grant Nos.CCF-2008907 and CCF-2029014;the Chinese Academy of Sciences Project for Young Scientists in Basic Research under Grant No.YSBR-029;the Chinese Academy of Sciences Project for Youth Innovation Promotion Association.
Graph processing is a vital component of many AI and big data applications.However,due to its poor locality and complex data access patterns,graph processing is also a known performance killer of AI and big data appli...
supported by the Royal Society International Exchanges Cost Share Award of UK under Grant No.RP202G0230,the Medical Research Council Confidence in Concept Award of UK under Grant No.MC_PC_17171;the Hope Foundation for Cancer Research of UK under Grant No.RM60G0680;the British Heart Foundation Accelerator Award of UK under Grant No.A A/18/3/34220;Sino-UK Industrial Fund under Grant No.RP202G0289;the Global Challenges Research Fund(GCRF)of UK under Grant No.P202PF11;the Fundamental Research Funds for the Central Universities of China under Grant No.CDLS-2020-03;the Key Laboratory of Child Development and Learning Science(Southeast University),Ministry of Education of China,Henan Key Research and Development Project of China,under Grant No.182102310629;the National Natural Science Foundation of China under Grant Nos.U19B2032 and 61772511.
COVID-19 is a contagious infection that has severe effects on the global economy and our daily life.Accurate diagnosis of COVID-19 is of importance for consultants,patients,and radiologists.In this study,we use the de...
supported by the National Key Research and Development Program of China under Grant No.2020YFB2010900;the Fundamental Research Funds for the Central Universities(Zhejiang University NGICS Platform)of China under Grant No.TC190A449.
Programmable logic controllers(PLCs)play a critical role in many industrial control systems,yet face increasingly serious cyber threats.In this paper,we propose a novel PLC-compatible software-based defense mechanism,...
This work was supported in part by the National Key Research and Development Program of China under Grant No.2018YFC1504104;the Fundamental Research Funds for the Central Universities of China under Grant No.WK6030000109;the National Natural Science Foundation of China under Grant No.61877056.
In order to conduct optical neurophysiology experiments on a freely swimming zebrafish,it is essential to quantify the zebrafish head to determine exact lighting positions.To efficiently quantify a zebrafish head's be...
The work was supported by the National Key Research and Development Program of China under Grant No. 2018YFB0204102。
The short-range pair interaction consumes most of the CPU time in molecular dynamics(MD)simulations.The inherent computation sparsity makes it challenging to achieve high-performance kernel on the emerging many-core a...
The National Key Research and Development Program of China under Grant No.2018YFB0204301;the National Natural Science Foundation of China under Grant Nos.61972408 and 61602501.
This article presents a comprehensive performance evaluation of Phytium 2000+,an ARMv8-based 64-core architecture.We focus on the cache and memory subsystems,analyzing the characteristics that impact the high-performa...
This work was supported by the National Key Research and Development Program of China under Grant No. 2016YFB0200501, the National Natural Science Foundation of China under Grant Nos. 61332009 and 61521092, the Open Project Program of State Key Laboratory of Mathematical Engineering and Advanced Computing under Grant No. 2016A04, and the Beijing Municipal Science and Technology Commission under Grant No. Z15010101009.
Double buffering is an effective mechanism to hide the latency of data transfers between on-chip and off-chip memory. However, in dataflow architecture, the swapping of two buffers during the execution of many tiles d...
We have witnessed the tremendous momentum of the second spring of parallel computing in recent years. But, we should remember the low points of the field more than 20 years ago and review the lesson that has led to th...
The construction of large software systems is always achieved through assembly of independently written components -- program modules. For these software components to work together, they must share a common set of da...
This work was supported by the National High Technology Research and Development 863 Program of China under Grant No. 2015AA01A301, the National Natural Science Foundation of China under Grant No. 61332009, the National HeGaoJi Project of China under Grant No. 2013ZX0102-8001-001-001, and the Beijing Municipal Science and Technology Commission under Grant Nos. Z15010101009 and Z151100003615006.
Dataflow architecture has shown its advantages in many high-performance computing cases. In dataflow computing, a large amount of data are frequently transferred among processing elements through the network-on-chip ...