一种并行作业任务启动模型及其可扩展性分析  

A parallel job launch model and its scalability analysis

在线阅读下载全文

作  者:宋长明[1] 龚道永[1] 张宏宇[1] 

机构地区:[1]江南计算技术研究所,江苏无锡214000

出  处:《计算机工程与科学》2013年第11期182-186,共5页Computer Engineering & Science

摘  要:随着高性能计算机系统规模的不断扩大,作业启动的时间越来越长,大作业的启动时间逐渐成为影响系统规模扩展的一个重要因素。同时,元器件数目快速增长带来的更频繁的故障也使大规模并行应用在完成前可能经历多次反复提交,因此作业任务的启动效率也直接影响着系统计算资源的有效利用率和用户使用体验。通过设计一种层次式并行作业任务启动模型,并对其在不同作业规模下的性能进行测试、分析与优化,经过优化后该模型能够支持一个大规模系统的作业任务启动与控制,并具备较好的可扩展性。With the expansion of the scale of high performance computer systems, job launch con- sumes more and more time, and the task start time gradually becomes an important factor affecting sys- tem scalability. Meanwhile, the rapid increase of the number of components brings failures more fre- quently, resulting in repeated submission of parallel applications before their completion. Therefore, the task start efficiency has a direct impact on the effective utilization of computing resources and user expe- rience. A hierarchical parallel job launch model is designed and its performance under different job scales is tested, analyzed and optimized. After optimization, the proposed model can support a large-scale sys- tem with tasks start and control efficiently and have good scalability.

关 键 词:作业任务启动 层次式管理 虚拟化 网络数据优化 扩展性 

分 类 号:TP316[自动化与计算机技术—计算机软件与理论]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象