检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
出 处:《软件学报》2013年第8期1775-1785,共11页Journal of Software
基 金:国家高技术研究发展计划(863)(2009AA012201);国家科技重大专项(核高基)(2009ZX01036-001-001);河南省重大科技攻关专项(092101210501)
摘 要:SPMD翻译是指将一种特定类型的SPMD程序编译到多种设备上,当前的细粒度SPMD翻译研究建立在线程之间相互独立的假定上,线程之间只通过显式同步进行通信.但线程之间还隐含存在着各种数据依赖,如隐式同步,这导致了SPMD翻译在处理隐式同步时的正确性缺陷.为了对隐式同步进行处理,对细粒度SPMD模型CUDA中的隐式同步进行了系统的分析,指出了当前翻译CUDA程序到多核平台的相关研究在处理隐式同步上的不足,提出了基于依赖分析的隐式同步检测方法.在检测出隐式同步的基础上,设计了循环重排序的优化处理算法,对显式同步和隐式同步进行了统一处理.实验结果表明,与现有的SPMD翻译方法相比,该检测及处理算法能够正确而快速地检测并翻译CUDA中的各种隐式同步,代价较小,有助于编译器产生正确而有效的翻译结果.SPMD translation compiles programs of one SPMD-threaded programming model to multi devices.The current researches base on the supposition that different threads are independent except in communication with explicit synchronizations.However,the data dependence relation between threads such as implicit synchronizations results in the correctness pitfalls in SPMD translation.In order to deal with implicit synchronizations,the implicit synchronizations in fine-grained SPMD programming model CUDA are analyzed systematically.The correctness pitfalls in existing SPMD translation from CUDA to Multi-core are revealed in which this paper proposes a method of detecting implicit synchronizations based on dependence analysis.On the basis of implicit synchronizations detecting,an optimized treatment algorithm is designed to treat explicit and implicit synchronizations synthetically by the loop reorder.The experimental results show that compared with existing SPMD translation,the detecting and optimized algorithm could treat kinds of implicit synchronizations in fine grained SPMD translation correctly and quickly by small expense,which helps compiler produces correct and efficient result.
关 键 词:SPMD翻译 显式同步 隐式同步 依赖分析 线程循环 循环重排序
分 类 号:TP311[自动化与计算机技术—计算机软件与理论]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.40