A visual analytics system for optimizing the performance of large-scale networks in supercomputing systems  


Authors: Takanori Fujiwara, Jianping Kelvin Li, Misbah Mubarak, Caitlin Ross, Christopher D. Carothers, Robert B. Ross, Kwan-Liu Ma

Affiliations: [1] University of California, Davis, United States; [2] Argonne National Laboratory, United States; [3] Rensselaer Polytechnic Institute, United States

Source: Visual Informatics (English-language journal), 2018, Issue 1, pp. 98-110 (13 pages)

Funding: This research was sponsored by the Advanced Scientific Computing Research Program, the Office of Science, U.S. Department of Energy, through grants DE-SC0014917, DE-SC0012610, and DE-AC02-06CH11357.

Abstract: The overall efficiency of an extreme-scale supercomputer largely relies on the performance of its network interconnects. Several state-of-the-art supercomputers use networks based on the increasingly popular Dragonfly topology. It is crucial to study the behavior and performance of different parallel applications running on Dragonfly networks in order to make optimal system configurations and design choices, such as job scheduling and routing strategies. However, studying this temporal network behavior requires a tool to analyze and correlate numerous sets of multivariate time-series data collected from the Dragonfly's multi-level hierarchies. This paper presents such a tool, a visual analytics system, that uses the Dragonfly network to investigate the temporal behavior and optimize the communication performance of a supercomputer. We coupled interactive visualization with time-series analysis methods to help reveal hidden patterns in the network behavior with respect to different parallel applications and system configurations. Our system also provides multiple coordinated views for connecting behaviors observed at different levels of the network hierarchies, which effectively supports visual analysis tasks. We demonstrate the effectiveness of the system with a set of case studies. Our system and findings can help improve not only the communication performance of supercomputing applications but also the network performance of next-generation supercomputers.

Keywords: Supercomputing; Parallel communication network; Dragonfly networks; Time-series data; Performance analysis; Visual analytics

Classification code: TP3 [Automation and Computer Technology: Computer Science and Technology]

 
