检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:黄曼蒂 李韬[1] 杨惠[1] 李成龙 张毓涛 孙志刚[1] Huang Mandi;Li Tao;Yang Hui;Li Chenglong;Zhang Yutao;Sun Zhigang(College of Computer Science and Technology,National University of Defense Technology,Changsha 410073;Defense Innovation Institute,Academy of Military Sciences,Beijing 100071)
机构地区:[1]国防科技大学计算机学院,长沙410073 [2]军事科学院国防科技创新研究院,北京100071
出 处:《计算机研究与发展》2025年第5期1262-1289,共28页Journal of Computer Research and Development
基 金:国家国防科技工业局重点实验室基金项目(WDZC20245250113)。
摘 要:目前数据中心规模迅速扩大和网络带宽大幅度提升,传统软件网络协议栈的处理器开销较大,并且难以满足众多数据中心应用程序在吞吐、延迟等方面的需求.远程直接内存访问(remote direct memory access,RDMA)技术采用零拷贝、内核旁路和处理器功能卸载等思想,能够高带宽、低延迟地读写远端主机内存数据.兼容以太网的RDMA技术正在数据中心领域展开应用,以太网RDMA网卡作为主要功能承载设备,对其部署发挥重要作用.综述从架构、优化和实现评估3个方面进行分析:1)对以太网RDMA网卡的通用架构进行了总结,并对其关键功能部件进行了介绍;2)重点阐述了存储资源、可靠传输和应用相关3方面的优化技术,包括面向网卡缓存资源的连接可扩展性和面向主机内存资源的注册访问优化,面向有损以太网实现可靠传输的拥塞控制、流量控制和重传机制优化,面向分布式存储中不同存储类型、数据库系统、云存储系统以及面向数据中心应用的多租户性能隔离、安全性、可编程性等方面的优化工作;3)调研了不同实现方式、评估方式.最后,给出总结和展望.With the rapid expansion of data center and the significant increase in network bandwidth,traditional software network protocol stack has high processor overhead and is difficult to meet the needs of many data center applications in terms of throughput,latency and other aspects.Remote direct memory access(RDMA)technology uses the ideas of zero copy,kernel bypass and processor function offloading to read and write remote host memory data with high bandwidth and low latency.Ethernet-compatible RDMA technology is being applied in data centers,and Ethernet RDMA NIC plays a crucial role in its deployment as the main functional bearer device.This overview analyzes from three aspects:architecture,optimization,and implementation evaluation.1)We summarize the general architecture of Ethernet RDMA NIC and introduce the key functional components;2)We focus on the optimization techniques in storage resources,reliable transmission and application-related aspects,including optimization of both connection scalability for NIC cache resources and registration access for host memory resources,optimization of congestion control,flow control and retransmission mechanism for lossy Ethernet to achieve reliable transmission,and optimization of different storage types in distributed storage,database system,cloud storage system,and multi-tenant performance isolation,security and programmability for data center applications;3)Then we investigate different implementation and evaluation methods.Finally,the summary and outlook are given.
关 键 词:远程直接内存
分 类 号:TP393[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.49