Machine learning job failure analysis and prediction model for the cloud environment  

在线阅读下载全文

作  者:Harikrishna Bommala Uma Maheswari V. Rajanikanth Aluvalu Swapna Mudrakola 

机构地区:[1]Department of CSE,KG Reddy College of Engineering&Technology,Hyderabad 500075,India [2]Department of CSE,Chaitanya Bharathi Institute of Technology,Hyderabad 500075,India [3]Department of IT,Chaitanya Bharathi Institute of Technology,Hyderabad 500075,India [4]Department of CSE,Matrusri College of Engineering,Hyderabad 500059,India

出  处:《High-Confidence Computing》2023年第4期73-86,共14页高置信计算(英文)

摘  要:Reliable and accessible cloud applications are essential for the future of ubiquitous computing,smart appliances,and electronic health.Owing to the vastness and diversity of the cloud,a most cloud services,both physical and logical services have failed.Using currently accessible traces,we assessed and characterized the behaviors of successful and unsuccessful activities.We devised and implemented a method to forecast which jobs will fail.The proposed method optimizes cloud applications more efficiently in terms of resource usage.Using Google Cluster,Mustang,and Trinity traces,which are publicly available,an in-depth evaluation of the proposed model was conducted.The traces were also fed into several different machine learning models to select the most reliable model.Our efficiency analysis proves that the model performs well in terms of accuracy,F1-score,and recall.Several factors,such as failure of forecasting work,design of scheduling algorithms,modification of priority criteria,and restriction of task resubmission,may increase cloud service dependability and availability.

关 键 词:Failure prediction Mustang trace Cloud computing Trinity trace Random forest Google cluster trace Fault tolerance 

分 类 号:TP181[自动化与计算机技术—控制理论与控制工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象