Authors: ZHANG Rongze (张荣泽); WANG Xiuhui (王修晖) (College of Information Engineering, China Jiliang University, Hangzhou 310018, China)
Source: Computer Engineering and Applications (《计算机工程与应用》), 2024, No. 15, pp. 143-149 (7 pages)
Funding: National Key Research and Development Program of China, project 2021YFC3340402.
Abstract: In live-streaming e-commerce, analyzing the bullet-screen comments (danmaku) sent by consumers can indicate, to a certain extent, whether the actual evaluation of a product matches the streamer's description, which offers important guidance for regulating counterfeit and shoddy products in the live-streaming industry. To address the particular characteristics of bullet-screen text recognition, this paper proposes a real-time bullet-screen recognition network based on an improved CRNN (convolutional recurrent neural network), aiming to solve the problem that the CRNN algorithm extracts text feature information incompletely against complex backgrounds. The designed network adopts an encoder-decoder structure to strengthen the feature extraction module, which addresses the loss of features during extraction caused by the small pixel area that bullet-screen text occupies. A Transformer model builds long-range global feature relationships over the input frames to strengthen the network's ability to capture bullet-screen information. The extracted features are then sequence-modeled and transcribed to obtain the specific bullet-screen semantic information. Experimental results show that the designed network reaches a detection accuracy of 0.926 on the test set, and that the average precision value improves by 0.101.
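Keywords: text recognition; deep learning; convolutional recurrent network; Transformer model
CLC number: TP391.43 [Automation and Computer Technology: Computer Application Technology]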
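The abstract says that extracted features are "transcribed" into text but does not say how. In the original CRNN design, transcription uses connectionist temporal classification (CTC), so the following is a minimal sketch of greedy CTC decoding under the assumption that the improved network keeps that step. The function name `ctc_greedy_decode`, the blank index `BLANK = 0`, and the score layout are hypothetical choices for illustration, not details taken from the paper.

```python
from typing import List, Sequence

BLANK = 0  # assumption: class index 0 is the CTC blank symbol


def ctc_greedy_decode(frame_scores: Sequence[Sequence[float]], charset: str) -> str:
    """Collapse per-frame class scores into text with greedy CTC decoding.

    frame_scores[t][k] is the score of class k at time step t.
    Class 0 is the blank; class k >= 1 maps to charset[k - 1].
    """
    decoded: List[str] = []
    previous = BLANK
    for scores in frame_scores:
        # Pick the highest-scoring class for this frame.
        best = max(range(len(scores)), key=lambda k: scores[k])
        # Emit a character only if it is not blank and not a repeat
        # of the class chosen for the previous frame.
        if best != BLANK and best != previous:
            decoded.append(charset[best - 1])
        previous = best
    return "".join(decoded)


# Five frames over the hypothetical charset "ab": a, a, blank, a, b.
frames = [
    [0.1, 0.8, 0.1],
    [0.1, 0.8, 0.1],
    [0.9, 0.05, 0.05],
    [0.1, 0.8, 0.1],
    [0.1, 0.1, 0.8],
]
text = ctc_greedy_decode(frames, "ab")
```

The repeated "a" in the first two frames collapses into one character, while the blank frame lets the second "a" be emitted separately; this is how CTC can represent doubled characters in a comment.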