Incremental learning method for fault diagnosis in large-scale InfiniBand network  Cited by: 2

Original title: 大规模InfiniBand网络自学习的故障诊断方法


Authors: HU Yinhui (胡银辉)[1], CHEN Lin (陈琳)[1]

Affiliation: [1] College of Computer, National University of Defense Technology, Changsha 410073

Source: Journal of Computer Applications (《计算机应用》), 2015, No. 11, pp. 3092-3096 (5 pages)

Funding: National High Technology Research and Development Program of China (863 Program) (2012AA01A50606)

Abstract: Large-scale data center networks raise the problem of how to effectively monitor abnormal network events and locate performance bottlenecks and potential points of failure. Building on an in-depth analysis of the characteristics of InfiniBand (IB) networks, and introducing a feature selection strategy and an incremental learning strategy, the paper proposes IL_Bayes, an incremental-learning fault diagnosis method for large-scale IB networks. The method is based on Bayesian classification and adds an incremental learning mechanism, which can effectively improve fault classification accuracy. The diagnostic accuracy and misdiagnosis rate of the algorithm were verified in the real network environment of Tianhe-2. The results show that IL_Bayes achieves relatively high fault classification accuracy and a relatively low misdiagnosis rate.
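The abstract does not give the details of IL_Bayes, so the following is only a minimal sketch of the general idea it names: a Bayesian classifier whose statistics are updated as new labelled fault samples arrive, without retraining from scratch. It is a generic categorical naive Bayes with Laplace smoothing, not the authors' algorithm. The class name `IncrementalNaiveBayes`, the feature values, and the fault labels are all hypothetical, and the paper's feature selection step is not modelled.

```python
import math
from collections import defaultdict


class IncrementalNaiveBayes:
    """Categorical naive Bayes whose counts are updated one sample at a time.

    Illustrative only: this is not the IL_Bayes algorithm from the paper.
    """

    def __init__(self, alpha=1.0):
        self.alpha = alpha                      # Laplace smoothing constant
        self.class_counts = defaultdict(int)    # label -> samples seen
        # (label, feature index) -> {feature value -> count}
        self.value_counts = defaultdict(lambda: defaultdict(int))
        self.feature_values = defaultdict(set)  # feature index -> distinct values
        self.total = 0

    def update(self, features, label):
        """Fold one labelled sample into the counts (the incremental step)."""
        self.class_counts[label] += 1
        self.total += 1
        for i, value in enumerate(features):
            self.value_counts[(label, i)][value] += 1
            self.feature_values[i].add(value)

    def predict(self, features):
        """Return the label with the highest smoothed log-posterior."""
        best_label, best_score = None, -math.inf
        for label, n_label in self.class_counts.items():
            score = math.log(n_label / self.total)  # log prior
            for i, value in enumerate(features):
                count = self.value_counts[(label, i)].get(value, 0)
                n_values = max(len(self.feature_values[i]), 1)
                score += math.log(
                    (count + self.alpha) / (n_label + self.alpha * n_values)
                )
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical samples: (error-counter level, link state) -> fault label
model = IncrementalNaiveBayes()
model.update(("high_errors", "link_down"), "port_fault")
model.update(("high_errors", "link_up"), "cable_fault")
model.update(("low_errors", "link_up"), "normal")
model.update(("low_errors", "link_up"), "normal")
```

Because the model stores only counts, each call to `update` costs time proportional to the number of features, which is the property that makes this style of classifier a natural fit for a stream of newly diagnosed events.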

Keywords: data center; InfiniBand; fault diagnosis; Bayesian classification; incremental learning

Classification number: TP393.07 [Automation and Computer Technology: Computer Application Technology]

 
