GPUS

Works: 50 | Citations: 67 | H-index: 4
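Related fields: Automation and Computer Technology
Related authors: 明付仁 (Ming Furen), 张阿漫 (Zhang Aman), 刘金硕 (Liu Jinshuo), 邓娟 (Deng Juan), 眭海刚 (Sui Haigang)
Related institutions: Xi'an Jiaotong University, Harbin Engineering University, University of Aberdeen, Wuhan University
Related journals: Earthquake Science, Science China Chemistry, 医疗卫生装备 (Chinese Medical Equipment Journal), International Journal of Digital Earth
Related funding: National Natural Science Foundation of China, Doctoral Program Foundation of the Ministry of Education of China, National Basic Research Program of China, National High Technology Research and Development Program of China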

Portable Software Environment for Ultrahigh-Resolution ELM Development on GPUs
Journal of Computer and Communications, 2025, No. 2, pp. 28-36 (9 pages). Dali Wang, Peter Schwartz, Fengming Yuan, Franklin Eaglebarge, Danial Riccuito, Peter Thornton, Chris Layton, Qinglei Cao
This paper presents our endeavors in developing the large-scale, ultra-high-resolution E3SM Land Model (uELM), specifically designed for exascale computers furnished with accelerators such as Nvidia GPUs. The uELM is ...
Keywords: E3SM Land Model; Ultrahigh-Resolution ELM; Portable Software Environment; GPU-Ready Environment
MuxFlow: efficient GPU sharing in production-level clusters with more than 10000 GPUs
Science China (Information Sciences), 2024, No. 12, pp. 119-135 (17 pages). Xuanzhe LIU, Yihao ZHAO, Shufan LIU, Xiang LI, Yibo ZHU, Xin LIU, Xin JIN
Supported by the National Natural Science Foundation of China (Grant Nos. 62325201, 62172008); the National Natural Science Fund for the Excellent Young Scientists Fund Program (Overseas); and the PKU-ByteDance Joint-Lab Program.
Large-scale GPU clusters are widely used to speed up both latency-critical (online) and best-effort (offline) deep learning (DL) workloads. However, similar to the common practice, the DL clusters at ByteDance dedicate each GPU...
Keywords: GPU cluster; deep learning workload; cluster management; GPU sharing; deployed system
A survey on dynamic graph processing on GPUs: concepts, terminologies and systems
Frontiers of Computer Science, 2024, No. 4, pp. 1-23 (23 pages). Hongru GAO, Xiaofei LIAO, Zhiyuan SHAO, Kexin LI, Jiajie CHEN, Hai JIN
Supported by the National Natural Science Foundation of China (Grant Nos. 61972444, 61825202, 62072195, and 61832006) and Zhejiang Lab (2022P10AC02).
Graphs that are used to model real-world entities with vertices and relationships among entities with edges have proven to be a powerful tool for describing real-world problems in applications. In most real-world scena...
Keywords: dynamic graphs; graph processing; graph algorithms; GPUs
Optimized CUDA Implementation to Improve the Performance of Bundle Adjustment Algorithm on GPUs
Journal of Software Engineering and Applications, 2024, No. 4, pp. 172-201 (30 pages). Pranay R. Kommera, Suresh S. Muknahallipatna, John E. McInroy
The 3D reconstruction pipeline uses the Bundle Adjustment algorithm to refine the camera and point parameters. The Bundle Adjustment algorithm is a compute-intensive algorithm, and many researchers have improved its p...
Keywords: Scene Reconstruction; Bundle Adjustment; Levenberg-Marquardt; Non-Linear Least Squares; Memory Throughput; Computational Throughput; Contiguous Memory Access; CUDA Optimization
PySAGES: flexible, advanced sampling methods accelerated with GPUs
npj Computational Materials, 2024, No. 1, pp. 2877-2888 (12 pages). Pablo F. Zubieta Rico, Ludwig Schneider, Gustavo R. Pérez-Lemus, Riccardo Alessandri, Siva Dasetty, Trung D. Nguyen, Cintia A. Menéndez, Yiheng Wu, Yezhi Jin, Yinan Xu, Samuel Varner, John A. Parker, Andrew L. Ferguson, Jonathan K. Whitmer, Juan J. de Pablo
Supported by the Department of Energy, Basic Energy Sciences, Materials Science and Engineering Division, through the Midwest Integrated Center for Computational Materials (MICCoM); by the Dutch Research Council (NWO Rubicon 019.202EN.028); and by the U.S. Department of Energy, Office of Science, Office of Advanced Scientific Computing Research, Department of Energy Computational Science Graduate Fellowship under Award Number DE-SC0022158.
Molecular simulations are an important tool for research in physics, chemistry, and biology. The capabilities of simulations can be greatly expanded by providing access to advanced sampling methods and techniques that pe...
Keywords: PYTHON; SAGE; COLLECTIVE
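Analysis and Application of the Optimized Transmission Line Finite Element Method in Electromagnetic Fields
东北电力技术 (Northeast Electric Power Technology), 2024, No. 1, pp. 37-42 (6 pages). 方锦 (Fang Jin), 阎秀恪 (Yan Xiuke), 钟立国 (Zhong Liguo), 任自艳 (Ren Ziyan), 张殿海 (Zhang Dianhai)
To improve the solution efficiency of the transmission line model-finite element method (TLM-FEM), the solution procedures of its incident and reflection phases were optimized. In the reflection phase, an optimized relaxation method is used to accelerate the solution of the nonlinear port voltages, and the computation of the element coefficient matrices as well as the global...
Keywords: optimized relaxation method; parallel computing; transmission line method; finite element; GPUs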
Improving Accuracy and Computational Burden of Bundle Adjustment Algorithm Using GPUs
Engineering (Scientific Research), 2023, No. 10, pp. 663-690 (28 pages). Pranay R. Kommera, Suresh S. Muknahallipatna, John E. McInroy
Bundle adjustment is a camera and point refinement technique in a 3D scene reconstruction pipeline. The camera parameters and the 3D points are refined by minimizing the difference between computed projection and obse...
Keywords: Bundle Adjustment; Levenberg-Marquardt; Scene Reconstruction; Radial Distortion Coefficient; Explicit Jacobian; CUDA Optimization
Kohn–Sham time-dependent density functional theory with Tamm–Dancoff approximation on massively parallel GPUs
npj Computational Materials, 2023, No. 1, pp. 1556-1567 (12 pages). Inkoo Kim, Daun Jeong, Won-Joon Son, Hyung-Jin Kim, Young Min Rhee, Yongsik Jung, Hyeonho Choi, Jinkyu Yim, Inkook Jang, Dae Sin Kim
This work was in part supported by the National Research Foundation (NRF) of Korea (Grant Nos. 2020R1A5A1019141 and 2021R1A2C2094153). Computational resources were provided by the Supercomputing Center of Samsung Electronics.
We report a high-performance multi graphics processing unit (GPU) implementation of the Kohn–Sham time-dependent density functional theory (TDDFT) within the Tamm–Dancoff approximation. Our algorithm on massively paralle...
Keywords: GPUs; GRAPHICS; MASSIVE
Efficient Knowledge Graph Embedding Training Framework with Multiple GPUs (Cited by: 1)
Tsinghua Science and Technology, 2023, No. 1, pp. 167-175 (9 pages). Ding Sun, Zhen Huang, Dongsheng Li, Min Guo
When training a large-scale knowledge graph embedding (KGE) model with multiple graphics processing units (GPUs), the partition-based method is necessary for parallel training. However, existing partition-based training met...
Keywords: knowledge graph embedding; parallel algorithm; partitioning graph; framework; graphics processing unit (GPU)
BADF: Bounding Volume Hierarchies Centric Adaptive Distance Field Computation for Deformable Objects on GPUs (Cited by: 1)
Journal of Computer Science & Technology, 2022, No. 3, pp. 731-740 (10 pages). Xiao-Rui Chen, Min Tang, Cheng Li, Dinesh Manocha, Ruo-Feng Tong
Supported by the National Key Research and Development Program of China under Grant No. 2018AAA0102703 and the National Natural Science Foundation of China under Grant Nos. 61972341, 61972342, and 61732015.
We present a novel algorithm BADF (Bounding Volume Hierarchy Based Adaptive Distance Fields) for accelerating the construction of ADFs (adaptive distance fields) of rigid and deformable models on graphics processing units...
Keywords: distance field; deformable object; graphics processing unit (GPU); octree; bounding volume hierarchy