基于可重构架构的数据中心异构加速软硬件系统级平台  

Data Center Heterogeneous Acceleration Software-Hardware System-Level Platform Based on Reconfigurable Architecture

在线阅读下载全文

作  者:王彦伟 李仁刚 徐冉[1] 刘钧锴 Wang Yanwei;Li Rengang;Xu Ran;Liu Junkai(Inspur(Beijing)Electronic Information Industry Co.,Ltd.,Beijing 100095;Department of Computer Science and Technology,Tsinghua University,Beijing 100084;Inspur Electronic Information Industry Co.,Ltd.,Jinan 250101)

机构地区:[1]浪潮(北京)电子信息产业有限公司,北京100095 [2]清华大学计算机科学与技术系,北京100084 [3]浪潮电子信息产业股份有限公司,济南250101

出  处:《计算机研究与发展》2025年第4期963-977,共15页Journal of Computer Research and Development

基  金:新一代人工智能国家科技重大专项(2021ZD0113004);山东省自然科学基金项目(ZR2023LZH010)。

摘  要:构建数据中心加速服务的软硬件系统级原型平台,需要考虑高计算能力、扩展性、灵活性和低成本等因素.为了提高数据中心的能力,从软硬件协同的角度研究数据中心异构计算在云平台架构、硬件实现、高速互连和应用等方面的创新,研究设计并构建了一个可重构组合的软硬件加速原型系统,简化了现有以处理器为中心的系统级计算平台构建方法,实现目标软硬件设计的快速部署与系统级原型验证.针对以上目标,通过解耦的可重构架构设备虚拟化和远程映射等方法,发掘独立计算单元的潜力,构建了一套ISOF(independent system of FPGA(field programmable gate arrays))软硬件计算平台系统,可使其超越普通服务器设计所能提供的能力,实现计算单元低成本高效扩展,使客户端可灵活使用外设资源,并且为满足系统级通信挑战,设计了一套计算单元之间的通信硬件平台和交互机制.此外,为提升软硬件系统级平台的敏捷性,ISOF提供了灵活统一的调用接口.最后,通过对平台目标系统级的分析评估,验证了该平台在满足了当下计算与加速需求下,保证了高速、低延时的通信,以及良好的吞吐率和弹性扩容效率,另外在高速通信的基础上改进的拥塞避免和丢包恢复机制,满足了数据中心规模通信的稳定性需求.Constructing a software and hardware system-level prototype platform for accelerating data center services requires the consideration of factors such as high computing power,scalability,flexibility,and low cost.To enhance data center capabilities,research from the perspective of software-hardware synergy has been conducted on the innovation of heterogeneous computing in cloud platform architecture,hardware implementation,high-speed interconnection,and applications.A reconfigurable and combinable software-hardware acceleration prototype system is designed and built to simplify existing processor-centric system-level computing platform construction methods,enabling rapid deployment and system-level prototype validation of target software-hardware designs.To achieve these objectives,methods such as decoupled reconfigurable architecture device virtualization and remote mapping are utilized to uncover the potential of independent computing units.An ISOF(independent system of FPGA)softwarehardware computing platform system is constructed to surpass the capabilities of conventional server designs,enabling low-cost and efficient expansion of computing units while allowing clients to flexibly utilize peripheral resources.To address system-level communication challenges,a communication hardware platform and interaction mechanism between computing units are designed.Additionally,to enhance the agility of the software-hardware system-level platform,ISOF provides a flexible and unified invocation interface.Finally,through the analysis and evaluation of the system-level objectives of the platform,it has been verified that the platform meets the current computing and acceleration requirements,ensuring high-speed,low-latency communication,as well as good throughput and efficient elastic scalability.In addition,improvements have been made in congestion avoidance and packet recovery mechanisms based on high-speed communication,meeting the stability requirements of communication at data center scale.

关 键 词:异构计算 加速平台 原型系统 高速互连 可重构架构 FPGA 

分 类 号:TP303[自动化与计算机技术—计算机系统结构]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象