An Automatic Vectorization Method for Multimedia Extension Instruction Sets and Real-Life Multimedia Programs  Cited by: 3

Vectorization for Real-Life Multimedia Applications on Processors’ Multimedia Extensions

Authors: Jiang Weihua (姜伟华)[1], Mei Chao (梅超)[1], Guo Yi (郭一)[1], Zhu Jiahua (朱嘉华)[1], Zang Binyu (臧斌宇)[1], Zhu Chuanqi (朱传琪)[1]

Affiliation: [1] Parallel Processing Institute, Fudan University, Shanghai 200433

Source: Chinese Journal of Computers (《计算机学报》), 2005, No. 8, pp. 1255-1266 (12 pages)

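Funding: National Natural Science Foundation of China (60273046); Key Basic Research Project Fund of the Science and Technology Commission of Shanghai Municipality (02JC14013); Intel university cooperation program (compiler optimization for the Intel NetBurst microarchitecture)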

Abstract: Automatic vectorizing compilation is, in principle, an ideal tool for using processors' multimedia extensions (MME) to speed up multimedia applications, and almost all vendors have added such extensions to their processors. However, research on automatic vectorization has so far failed to accelerate real-life multimedia applications effectively. There are two main reasons: vectorizing ordinary arithmetic operations rarely yields a speedup, and multimedia-specific operations cannot be fully vectorized because they take various forms in source code, in particular forms that span multiple statements scattered through the program. To address this problem, the paper enhances the classic vectorization algorithm [1] so that it flexibly and uniformly vectorizes both kinds of operations. The main change is two additional steps: one that unifies the different forms of an operation, and one that recognizes the operations worth vectorizing. The enhanced algorithm can make full use of the instruction set to generate efficient code and therefore performs well on real-life multimedia applications. Experiments on the Accelerating Suite of Berkeley Multimedia Workload show a maximum speedup of 43.9% and an average speedup of 7.4%. The algorithm is also highly extensible: a system based on it can be extended to vectorize more complicated cases simply by adding the corresponding rules.
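To make concrete the abstract's point that a multimedia-specific operation can take several forms in source code, here is a minimal C sketch. The abstract does not name any particular operation; unsigned saturating byte addition is chosen here as an assumed example because multimedia extensions typically provide it as a single instruction (for instance `paddusb` on x86 MMX/SSE2). All function names are hypothetical and are not taken from the paper.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Form 1: widen, add, then clamp in a separate if-statement. */
uint8_t sat_add_branch(uint8_t a, uint8_t b) {
    int t = a + b;
    if (t > 255)
        t = 255;
    return (uint8_t)t;
}

/* Form 2: the same clamp written as one conditional expression. */
uint8_t sat_add_ternary(uint8_t a, uint8_t b) {
    int t = a + b;
    return (uint8_t)(t > 255 ? 255 : t);
}

/* Form 3: wrap-around add; overflow is detected by comparing
   the wrapped sum with one of the operands. */
uint8_t sat_add_wrap(uint8_t a, uint8_t b) {
    uint8_t s = (uint8_t)(a + b);
    return (uint8_t)(s < a ? 255 : s);
}

/* A typical loop a vectorizer would see: element-wise saturating add.
   If the body is recognized as one saturating-add operation, each
   group of bytes can map to a single SIMD instruction. */
void add_pixels(uint8_t *dst, const uint8_t *x, const uint8_t *y, size_t n) {
    for (size_t i = 0; i < n; i++)
        dst[i] = sat_add_branch(x[i], y[i]);
}

/* Exhaustively confirm that the three forms compute the same function. */
void check_forms_agree(void) {
    for (int a = 0; a < 256; a++) {
        for (int b = 0; b < 256; b++) {
            uint8_t r = sat_add_branch((uint8_t)a, (uint8_t)b);
            assert(r == sat_add_ternary((uint8_t)a, (uint8_t)b));
            assert(r == sat_add_wrap((uint8_t)a, (uint8_t)b));
        }
    }
}
```

The three functions are semantically identical but syntactically different, which is the kind of variation a "unify the forms" step would presumably need to normalize before a recognition step could match the operation against an instruction.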

Keywords: automatic vectorizing compilation; multimedia extension instruction set; multimedia-specific operations

Classification: TP303 [Automation and Computer Technology: Computer System Architecture]

 
