Authors: 安仲奇 (An Zhongqi), 杜昊 (Du Hao), 李强 (Li Qiang) [1], 霍志刚 (Huo Zhigang) [1], 马捷 (Ma Jie) [1]
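Affiliations: [1] High Performance Computer Research Center, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190; [2] School of Computer and Control Engineering, University of Chinese Academy of Sciences, Beijing 100049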
Source: Journal of Computer Research and Development (《计算机研究与发展》), 2018, No. 4, pp. 864-874 (11 pages)
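Funding: National Key Research and Development Program of China (2016YFB0200204; 2016YFB0200300); National Natural Science Foundation of China, Young Scientists Fund (61402444; 61502454)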
Abstract: Existing in-memory object caching systems are bottlenecked by the high latency of traditional Ethernet and by the limited amount of DRAM that can be deployed within a server. Modern high-performance I/O technologies such as RDMA and NVMe offer a promising way to improve performance and expand capacity. This paper focuses on the data path of in-memory object caching systems and, taking the widely deployed Memcached as an example, uses high-performance I/O to accelerate its communication and extend its storage. First, the communication protocol is redesigned on remote direct memory access (RDMA) semantics, and different strategies are applied according to the Memcached operation type and message payload size to reduce communication latency. Second, Memcached is extended with high-performance NVMe SSDs to expand storage capacity. A circular log structure manages the two-level hierarchy of DRAM and SSD, and the SSD is accessed directly from user space through a user-level driver to reduce software overhead. Finally, a high-performance caching system with JVM support, named U2cache, is presented. U2cache significantly reduces data access overhead by bypassing both the OS kernel and the JVM runtime, and it hides further latency by pipelining and overlapping memory copy, RDMA transfer, and SSD access. Experimental results indicate that the communication latency of U2cache approaches the performance of the underlying RDMA hardware. For large messages, performance improves by more than 20% over the unoptimized version. For data located on the SSD, access latency is reduced by up to 31% compared with access through the kernel I/O software stack.
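The abstract states that a circular log structure manages the two-level DRAM and SSD hierarchy but gives no implementation details. The following is a minimal sketch of that general idea, under stated assumptions, and not the U2cache design: the class name `TwoLevelLogCache`, the entry-count capacity `dram_entries`, and the plain dictionary standing in for the SSD tier are all hypothetical. Writes are appended at the log tail, and when the in-memory log is full, the oldest entry is spilled to the slower tier if it is still the live version of its key.

```python
from collections import deque


class TwoLevelLogCache:
    """Toy two-level cache: a bounded in-memory log backed by a slower tier.

    Illustrative only. All names and the capacity model are assumptions.
    """

    def __init__(self, dram_entries):
        self.dram_entries = dram_entries  # hypothetical DRAM log capacity, in entries
        self.log = deque()                # in-memory log of (seq, key); head is the oldest
        self.index = {}                   # key -> (seq, value) for versions live in DRAM
        self.ssd = {}                     # stand-in for the SSD tier (no real NVMe I/O here)
        self.seq = 0

    def set(self, key, value):
        # Every write is appended at the log tail; an older version of the
        # same key stays in the log as a stale record until the head passes it.
        self.seq += 1
        self.log.append((self.seq, key))
        self.index[key] = (self.seq, value)
        self.ssd.pop(key, None)           # a copy on the slower tier is now outdated
        while len(self.log) > self.dram_entries:
            self._evict_head()

    def _evict_head(self):
        seq, key = self.log.popleft()
        entry = self.index.get(key)
        if entry is not None and entry[0] == seq:
            # The head record is the live version: spill it to the slower tier.
            self.ssd[key] = entry[1]
            del self.index[key]
        # Otherwise the record was stale and is simply dropped.

    def get(self, key):
        if key in self.index:
            return self.index[key][1], "dram"
        if key in self.ssd:
            return self.ssd[key], "ssd"
        return None, "miss"


cache = TwoLevelLogCache(dram_entries=2)
cache.set("a", 1)
cache.set("b", 2)
cache.set("c", 3)          # log is full, so the oldest live entry ("a") spills over
print(cache.get("a"))
print(cache.get("c"))
```

In this sketch stale records still occupy log slots until the head reaches them, which is the usual trade-off of log-structured designs: writes stay sequential, and space is reclaimed lazily.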
Keywords: Memcached; remote direct memory access (RDMA); NVMe SSD; Java virtual machine (JVM); user-level I/O
Classification: TP316 [Automation and Computer Technology: Computer Software and Theory]