检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:柴旭清[1,2,3] 乔一航 范黎林 CHAI Xu-qing;QIAO Yi-hang;FAN Li-lin(College of Computer and Information Engineering,Henan Normal University,Xinxiang 453007;High Performance Computing Center,Henan Normal University,Xinxiang 453007;Henan Engineering Laboratory of Intelligent Commerce and Internet of Things Technology,Xinxiang 453007,China)
机构地区:[1]河南师范大学计算机与信息工程学院,河南新乡453007 [2]河南师范大学高性能计算中心,河南新乡453007 [3]智慧商务与物联网技术河南省工程实验室,河南新乡453007
出 处:《计算机工程与科学》2024年第7期1218-1228,共11页Computer Engineering & Science
基 金:国家自然科学基金(12274117);河南省优秀青年科学基金(202300410226);河南省高校科技创新计划(20HASTIT026)。
摘 要:高性能应用程序的传统性能分析方法因分析过程存在额外开销和分析结果不准确等缺陷,致使用户耗费更多的时间和领域知识。为解决以上问题,将程序的性能分析问题转化成高维特征下非平衡小样本数据集的多分类问题,采集500条包含程序运行时进程切换次数、内存利用率、磁盘I/O负载等7种性能数据,经PCA降维等数据预处理后,使用随机森林分类器训练程序性能问题分析模型。实验验证该模型可识别出内存利用率过高、磁盘I/O负载过重等5类性能问题。为评估模型的指导有效性,分别采集HotSpot3D程序和LU-Decomposition程序运行时产生的性能数据,并根据模型输出结果指导,分别基于运行级和编译级优化2个验证程序运行。实验结果表明,所提方法可有效指导优化程序的运行性能,2个验证程序的加速比分别为1.056和5.657。Traditional performance analysis methods for high performance applications have shortcomings such as additional overhead during the analysis process and inaccurate analysis results,resulting in users spending more time and domain knowledge.To address these issues,this paper transforms the problem of program performance analysis into a multi-classification problem of unbalanced small sample datasets under high-dimensional features.By collecting 500 pieces of performance data that include seven types of metrics such as the number of process switches,memory utilization,and disk I/O load during program runtime,after data preprocessing such as PCA dimensionality reduction,a program performance problem analysis model is trained using a random forest classifier.Experimental validation shows that the model can identify five types of performance issues,including excessive memory utilization and heavy disk I/O load.To evaluate the effectiveness of the model s guidance,this paper collects performance data generated by the HotSpot3D program and the LU-Decomposition program during runtime.Based on the model s output guidance,the two validation programs are optimized at the runtime level and the compilation level.Experimental results indicate that the proposed method can effectively guide the optimization of program performance,with speedup ratios of 1.056 and 5.657 for the two programs,respectively.
关 键 词:Nmon 性能分析 变分自编码器 聚类 随机森林
分 类 号:TP302[自动化与计算机技术—计算机系统结构]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.4