检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:胡玉贵[1] HU Yu-gui (College of Computer Science,Guangdong Institute of Science and Technology, Zhuhai 519090,China)
机构地区:[1]广东科学技术职业学院计算机学院,广东珠海519090
出 处:《电脑知识与技术》2012年第12期8248-8250,共3页Computer Knowledge and Technology
基 金:广东省自然科学基金项目(S2011010002537)
摘 要:随着越来越多的公司和企业使用GPU来作为加速计算设备,对并行程序的需求也越来越大,目前我们一般都使用CUDA或OPENCL等底层API进程序开发,但是使用这些底层API来进行开发效率都不高,目前出现OPENACC指令就是针对这个问题提出来的,在该文里,我们针对高斯模糊算法,分别使用CPU,OPENACC,CUDA进行实现,比较他们的效率,发现在虽然OPENACC相对于CUDA性能要低一些,但相对其陡峭的学习曲线和低下的开发效率,OPENACC有着不错的性价比,而且随着编译器和硬件技术的发展,OPENACC有着广阔的发展空间。Computer systems become increasingly widespread use of GPGPU devices to speed up the calculation, but, using the low-level API for accelerated calculation is Cumbersome and inefficient To solve this problem, Based on the instruction of the higher level of abstraction for programming to solve this problem, in this article, Gaussian blur algorithm experience OPENACC just usage OPENACC through instruction in C or C + + in the calculation close code transfer to GUP in conduct, we compare the respectively use the CPU, OPENACC, CUDA achieve Gaussian mode Lake algorithm performance performance. Although CUDA have a good performance, but relative to the low efficiency of the development and the steep learning curve, OPENACC used only a dozen lines of code can achieve significant speedup, and with the development of the compiler and hard- ware technology, the command guidance compile getting more the larger development space.
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.149.253.148