检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:姜伟华[1] 梅超[1] 郭一[1] 朱嘉华[1] 臧斌宇[1] 朱传琪[1]
出 处:《计算机学报》2005年第8期1255-1266,共12页Chinese Journal of Computers
基 金:国家自然科学基金(60273046);上海市科学技术委员会重点基础项目基金(02JC14013);Intel公司大学合作项目(IntelNetBurst微体系结构的编译优化)资助
摘 要:自动向量化编译是利用处理器的多媒体扩展指令集提升多媒体程序性能的理想工具.但目前的研究不能有效加速实际程序.其主要原因是:普通算术操作的向量化不一定有性能提升;而多媒体典型操作因为其在源代码中表现形式多样而不能充分向量化.为了解决这一问题,文章对经典向量化算法[1]进行改进以灵活统一地向量化这两类操作.主要的改进是增加了两个步骤:统一操作的不同表现形式和识别有价值的向量化操作.改进后的算法可以充分利用指令集生成高效代码,从而对实际多媒体程序拥有良好效果.此外,该算法可扩展性也很强.Almost all vendors have added multimedia extensions (MME) to their processors to speedup multimedia applications. However, researches on automatic vectorization of compiler so far have not fully utilized these MMEs to boost the performance of real-life multimedia applications. This results from their focus on vectorization for normal arithmetic operations which rarely have speedup and their failure to fully exploit benefits from MME support for multimedia specific operations. These multimedia specific operations have various forms in source code, especially those expressed in multiple statements and scattered in program. This fact greatly hindered their vectorization. In this paper, the authors resolve this problem by enhancing the classic vectorization algorithm to flexibly and uniformly vectorize beneficial normal arithmetic and multimedia specific operations. The authors mainly added two extra steps: one to uniform the appearance of operations and the other to recognize vectorizable operations. The experiment shows that above algorithm has satisfactory performance improvement in several real-life multimedia applications.The results reach 43.9 % maximum and 7.4 % average speedup for Accelerating Suite of Berkeley Multimedia Workload. Furthermore, any system based on the algorithm the authors proposed can be extended to vectorize more complicate cases by simply adding corresponding rules.
关 键 词:自动向量化编译技术 多媒体扩展指令集 多媒体典型操作
分 类 号:TP303[自动化与计算机技术—计算机系统结构]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.222