检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:黄林 林健 徐驰 罗明宇 王武双 鲁晓丹 HUANG Lin;LIN Jian;XU Chi;LUO Mingyu;WANG Wushuang;LU Xiaodan(Oriental Mind(Wuhan)Computing Technology Co.,Ltd.,Wuhan 430200,China)
机构地区:[1]东云睿连(武汉)计算技术有限公司,湖北武汉430200
出 处:《软件导刊》2024年第7期87-95,共9页Software Guide
基 金:武汉东湖高新区第十三批“3551光谷人才计划”项目(M165);Intel Marketing Exchange项目(581394)。
摘 要:虚拟数字人是人工智能与元宇宙应用的交叉点,也是当今线上与线下人机交互的新兴渠道之一。虚拟数字人涉及控制引擎、自然语言处理、3D图形渲染、语音识别与合成等技术领域,需要软硬件栈多层次的协同设计。为此,基于一体机部署模式的OMHuman虚拟数字人解决方案提出一套松耦合式控制引擎,采用独立显卡实现图形渲染,并通过自研算法在Intel OpenVINO计算引擎上实现人工智能模型推理,解决了传统方案在语音—动作协同控制等诸多方面的不足,同时兼顾了最终用户体验、开发成本与部署成本。比较测试表明,OMHuman虚拟数字人模型推理性能为传统引擎的2~3倍,图形渲染效率为核芯显卡的2倍,能够以自然的方式满足人机交互需求,目前已在虚拟主持人、智能数据分析师等场景得到成功应用。Virtual digital humans are the intersection of artificial intelligence and metaverse applications,involving technology fields such as control engines,natural language processing,3D graphics rendering,speech recognition and synthesis,and require multi-level collaborative design of software and hardware stacks.To this end,a loosely coupled control engine is proposed for the OMHuman virtual digital human solu-tion based on the all-in-one machine deployment mode.It uses an independent graphics card to achieve graphics rendering and implements ar-tificial intelligence model inference on the Intel OpenVINO computing engine through self-developed algorithms.This solves the shortcomings of traditional solutions in voice action collaborative control and other aspects,while also taking into account the end user experience,develop-ment costs,and deployment costs.Comparative tests have shown that the reasoning performance of the OMHuman virtual digital human model is 2-3 times that of traditional engines,and the graphics rendering efficiency is twice that of core graphics cards.It can meet human-computer interaction needs in a natural way and has been successfully applied in scenarios such as virtual hosts and intelligent data analysts.
关 键 词:虚拟数字人 人工智能 一体机 控制引擎 自然语言处理 图形渲染
分 类 号:TP37[自动化与计算机技术—计算机系统结构]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.171