检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:黄鹏[1,2] 张雄[1,3] 马上[1] 王汉奎[1]
机构地区:[1]清华大学航天航空学院,北京100084 [2]中国工程物理研究院总体工程研究所,绵阳621900 [3]大连理工大学工业装备结构分析国家重点实验室,大连116024
出 处:《计算力学学报》2010年第1期21-27,共7页Chinese Journal of Computational Mechanics
基 金:国家自然科学基金(10872107);国家重点基础研究发展计划(2010CB32101)资助项目
摘 要:基于OpenMP技术开发了三维显式物质点并行程序MPM3DMP。为了避免节点更新阶段的数据竞争,采用区域分解法将背景网格分解为均匀的子域,每个线程负责一个子域的节点变量更新,然后将更新后的节点变量装配到整体。在质点更新阶段采用了循环分解方法进行并行。针对Taylor杆碰撞的三种计算模型,在双Intel Woodcrest 4核CPU服务器下进行了测试:粗模型在4核下加速比为3.82,在8核下为6.23,中模型在4核下加速比为3.79,在8核下加速比为6.23;细模型在4核下加速比为3.75,8核下加速比为6.26。因此,本文的并行程序具有较好的并行效率和可扩展性。Based on the OpenMP technique, a parallel 3D explicit material point method (MPM) code, MPM3DMP, is developed in this paper. The domain decomposition method is presented for avoiding ta races in updat each thread deal ng wi nodal variables. The background grid is decomposed into some uniform patches, da- and a patch in domain decomposition method. After updating nodes in all patches, their nodal variables are assembled into the global one. The code for updating particle variables can he parallelized using the loop splitting method directly. To test the performance of the developed code, the Taylor bar impact is simulated by using three models with different number of particles on a computer with dual quad-core Intel Woodcrest processors. A speedup of 3.82 is achieved for the coarse model in 4 cores, and 6.23 in 8 cores, while a speedup of 3.79 is achieved for the medium-sized model in 4 cores, and 6.23 in 8 cores. A speedup of 3.75 is achieved for the fine model in 4 cores, and 6.26 in 8 cores. Thus, MPM3DMP has good parallel efficiency and extension ability.
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.114