This paper presents our endeavors in developing the large-scale, ultra-high-resolution E3SM Land Model (uELM), specifically designed for exascale computers furnished with accelerators such as Nvidia GPUs. The uELM is ...
Supported by the National Natural Science Foundation of China (Grant Nos. 62325201, 62172008); the National Natural Science Fund for the Excellent Young Scientists Fund Program (Overseas); and the PKU-ByteDance Joint-Lab Program.
Large-scale GPU clusters are widely used to speed up both latency-critical (online) and best-effort (offline) deep learning (DL) workloads. However, similar to the common practice, the DL clusters at ByteDance dedicate each GPU...
National Natural Science Foundation of China (Grant Nos. 61972444, 61825202, 62072195, and 61832006); Zhejiang Lab (2022P10AC02).
Graphs, which model real-world entities as vertices and the relationships among those entities as edges, have proven to be a powerful tool for describing real-world problems in applications. In most real-world scena...
The 3D reconstruction pipeline uses the Bundle Adjustment algorithm to refine the camera and point parameters. The Bundle Adjustment algorithm is a compute-intensive algorithm, and many researchers have improved its p...
Supported by the Department of Energy, Basic Energy Sciences, Materials Science and Engineering Division, through the Midwest Integrated Center for Computational Materials (MICCoM); supported by the Dutch Research Council (NWO Rubicon 019.202EN.028); supported by the U.S. Department of Energy, Office of Science, Office of Advanced Scientific Computing Research, Department of Energy Computational Science Graduate Fellowship under Award Number DE-SC0022158.
Molecular simulations are an important tool for research in physics, chemistry, and biology. The capabilities of simulations can be greatly expanded by providing access to advanced sampling methods and techniques that pe...
Bundle adjustment is a camera and point refinement technique in a 3D scene reconstruction pipeline. The camera parameters and the 3D points are refined by minimizing the difference between computed projection and obse...
This work was supported in part by the National Research Foundation (NRF) of Korea (Grant Nos. 2020R1A5A1019141 and 2021R1A2C2094153). Computational resources were provided by the Supercomputing Center of Samsung Electronics.
We report a high-performance multi-graphics processing unit (GPU) implementation of the Kohn–Sham time-dependent density functional theory (TDDFT) within the Tamm–Dancoff approximation. Our algorithm on massively paralle...
When training a large-scale knowledge graph embedding (KGE) model with multiple graphics processing units (GPUs), the partition-based method is necessary for parallel training. However, existing partition-based training met...
The National Key Research and Development Program of China under Grant No. 2018AAA0102703; the National Natural Science Foundation of China under Grant Nos. 61972341, 61972342, and 61732015.
We present a novel algorithm, BADF (Bounding Volume Hierarchy Based Adaptive Distance Fields), for accelerating the construction of ADFs (adaptive distance fields) of rigid and deformable models on graphics processing units...