面向大模型预训练的智算网络技术研究  被引量:2

Research on intelligent computing network technology for large-scale pre-trained models

在线阅读下载全文

作  者:王学聪 冀思伟 李聪 WANG Xuecong;JI Siwei;LI Cong(Research Institute of China Telecom Co.,Ltd.,Beijing 102209,China)

机构地区:[1]中国电信股份有限公司研究院,北京102209

出  处:《电信科学》2024年第6期160-172,共13页Telecommunications Science

基  金:国家重点研发计划项目(No.2023YFB2904100)。

摘  要:随着人工智能的发展,大规模预训练模型在自然语言处理和计算机视觉等领域都取得了显著成果,促进了智算中心的建设。针对面向大模型预训练的智算网络关键技术展开研究,系统梳理了智算网络国内外最新的标准化进展,提出了一种面向智算网络的目标架构,探讨了智算网络关键技术的原理,包括远程直接内存访问(RDMA)、IB(InfiniBand)、基于以太网的RDMA(RoCE)、集合通信等,同时也分析了智算网络目前存在的问题以及未来的发展趋势,在推动智算网络技术发展、指导智算中心建设等方面具有重要意义。With the development of artificial intelligence,significant achievements are made in various fields such as natural language processing and computer vision through the utilization of large-scale pre-trained models,which promotes the construction of intelligent computing centers.Key technologies related to large-scale pre-trained models in intelligent computing networks were studied.The latest standardization progress of intelligent computing network at home and abroad was systematically reviewed.A target architecture for intelligent computing network was proposed,and the principles of key technologies,including remote direct memory access(RDMA),IB,RoCE,and collective communication,were explored.Moreover,the current issues and future development trends of intelligent computing networks were analyzed.This research holds crucial importance in advancing the development of intelligent computing network technology and providing guidance for the establishment of intelligent computing centers.

关 键 词:智算网络 远程直接内存访问 大模型 

分 类 号:TP393[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象