一种基于安全多方计算的快速Transformer安全推理方案被引量：1

A Fast and Secure Transformer Inference Scheme with Secure Multi-Party Computation

作　　者：刘伟欣管晔玮霍嘉荣丁元朝郭华李博[2] Liu Weixin;Guan Yewei;Huo Jiarong;Ding Yuanchao;Guo Hua;Li Bo(School of Cyber Science and Technology,Beihang University,Beijing 100191;State Key Laboratory of Complex&Critical Software Environment(Beihang University),Beijing 100878)

机构地区：[1]北京航空航天大学网络空间安全学院,北京100191 [2]复杂关键软件环境全国重点实验室(北京航空航天大学),北京100878

出　　处：《计算机研究与发展》2024年第5期1218-1229,共12页Journal of Computer Research and Development

基　　金：国家重点研发计划(2021YFB2700200);国家自然科学基金项目(U21B2021,61972018,61932014)。

摘　　要：Transformer模型在自然语言处理、计算机视觉等众多领域得到了广泛应用,并且有着突出的表现.在Transformer的推理应用中用户的数据会被泄露给模型提供方.随着数据隐私问题愈发得到公众的关注,上述数据泄露问题引发了学者们对Transformer安全推理的研究,使用安全多方计算(secure multi-party computation,MPC)实现Transformer模型的安全推理是当前的一个研究热点.由于Transformer模型中存在大量非线性函数,因此使用MPC技术实现Transformer安全推理会造成巨大的计算和通信开销.针对Transformer安全推理过程中开销较大的Softmax注意力机制,提出了2种MPC友好的注意力机制Softmax freeDiv Attention和2Quad freeDiv Attention.通过将Transformer模型中的Softmax注意力机制替换为新的MPC友好的注意力机制,同时结合激活函数GeLU的替换以及知识蒸馏技术,提出了一个MPC友好的Transformer转换框架,通过将Transformer模型转化为MPC友好的Transformer模型,提高Transformer安全推理的效率.在局域网环境下使用安全处理器(secure processing unit,SPU)提供的隐私计算协议,基于所提出的MPC友好的Transformer转换框架,在SST-2上使用Bert-Base进行安全推理.测试结果表明,在保持推理准确率与无近似模型一致的情况下,安全推理计算效率提高2.26倍.Transformer has been widely used in many fields such as natural language processing and computer vision,and has outstanding performance.The users’data will be leaked to the Transformer model provider during inference.With the increasing public attention on data privacy,the above data leakage problem has triggered researchers’study on secure Transformer inference.Implementing secure Transformer inference with secure multi-party computation(MPC)is today’s hot topic.Due to the widely existence of non-linear functions in Transformer,it is hard to use MPC to implement secure Transformer inference,which leads to huge computation and communication cost.We focus on Softmax attention,bottleneck in secure Transformer inference,and propose two kinds of MPC-friendly attention mechanism,Softmax freeDiv Attention and 2Quad freeDiv Attention.By replacing the Softmax attention in Transformer with the MPC-friendly attention mechanism proposed,combining with the replacement of activation function GeLU and knowledge distillation,we propose an MPC-friendly Transformer convert framework,which can convert Transformer model to an MPC-friendly one,so as to improve the performance of secure Transformer inference later.Based on the proposed MPC-friendly Transformer convert framework,we perform secure Bert-Base inference on SST-2 in the LAN setting,using privacy computing protocols provided by secure processing unit(SPU).The result shows that the secure inference achieves 2.26 times speedup while maintaining the accuracy with nonapproximation model.

关键词：安全推理 TRANSFORMER 安全多方计算安全处理器知识蒸馏

分类号：TP309[自动化与计算机技术—计算机系统结构]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

一种基于安全多方计算的快速Transformer安全推理方案被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

一种基于安全多方计算的快速Transformer安全推理方案 被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

一种基于安全多方计算的快速Transformer安全推理方案被引量：1