检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:孙保胜 谢东亮 SUN Baosheng;XIE Dongliang(School of Computer Science,Beijing University of Posts and Telecommunications,Beijing 100876,China)
出 处:《北京邮电大学学报》2023年第4期58-63,共6页Journal of Beijing University of Posts and Telecommunications
摘 要:为了促进汉语唇读的快速发展和实际应用,提出了一种基于交错组卷积和空洞卷积组合的轻量化唇读模型。所提模型通过分组卷积学习不同特征间的相关性,通过空洞卷积扩展模型视野,在大幅度降低模型参数量和复杂度的同时提高模型识别精度。针对汉语唇读数据集较少的问题,在可控制环境下录制了一个句子级汉语唇读数据集。在录制数据集和公开数据集上对轻量化唇读模型适用性进行实验验证,证明了模型的有效性。并通过热图可视化的方法分析了模型对视频帧和文本映射关系的学习能力。In order to promote the rapid development and practical application of Chinese lipreading,a lightweight lipreading model is proposed based on the combination of interleaved group convolution and dilated convolution.In the proposed model,the interleaved group convolution is taken to learn the correlation between different features and the dilated convolution is taken to expand the model receptive field,which greatly reduces the amount and complexity of model parameter and improves the accuracy of model recognition.In addition,the largest sentence-level Chinese lipreading dataset is recorded in a controlled environment to enrich the Chinese lipreading dataset.The applicability of the lightweight lipreading model is verified on the recorded datasets and public datasets.The learning ability of the model to the video frame and text mapping relationship is analyzed visually through the heatmap.
分 类 号:TN911.73[电子电信—通信与信息系统]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.49