检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:王圣节 张庆红[1] WANG Shengjie;ZHANG Qinghong(College of Statistics and Data Science,Xinjiang University of Finance and Economics,Urumqi 830012,China)
机构地区:[1]新疆财经大学统计与数据科学学院,新疆乌鲁木齐830012
出 处:《电信科学》2024年第7期121-133,共13页Telecommunications Science
基 金:国家自然科学基金资助项目(No.72164034)。
摘 要:在电信行业中,客户流失的准确预测对于相关企业维持市场竞争力和增加收益至关重要。为此提出一个结合CatBoost算法和SHAP(shapley additive explanations)模型的客户流失预测框架,旨在提高预测的准确性,同时增强模型的可解释性。利用新疆某通信公司的实际营业数据,通过数据预处理及特征工程,构建预测模型,选取5种主要关键性能指标评估模型性能。实验结果显示,所提出模型在选取的评价指标上均优于当前主流机器学习预测模型。最后引入SHAP框架增强模型可解释性,揭示影响客户流失的关键因素,并提供具体的因素影响程度,为电信企业制定针对性的客户保留策略提供了科学依据。In the telecom industry,accurate prediction of customer churn is crucial for the companies involved to maintain market competitiveness and increase revenue.To this end,a customer churn prediction framework combining CatBoost algorithm and SHAP model was proposed,aiming to improve the accuracy of prediction and enhance the interpretability of the model.Using the actual business data of a communication company in Xinjiang,the prediction model was constructed through data preprocessing and feature engineering,and five major key performance indicators were selected to evaluate the model performance.The experimental results show that the proposed model outperforms the current mainstream machine learning prediction models in all the above evaluation indicators.Finally,the SHAP framework was introduced to enhance the model interpretability,reveal the key factors affecting customer churn,and provide the specific influence degree of the factors,which provided a scientific basis for telecommunications enterprises to formulate targeted customer retention strategies.
关 键 词:机器学习 CatBoost算法 SHAP 预测模型 电信行业
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.170