检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:曾祥玖 刘达维 刘逸凡 赵志滨 柳秀梅 任酉贵 ZENG Xiangjiu;LIU Dawei;LIU Yifan;ZHAO Zhibin;LIU Xiumei;REN Yougui(School of Computer Science and Engineering,Northeastern University,Shenyang 110169,China;Service Center of Natural Resource Affairs of Liaoning Province,Shenyang 110001,China)
机构地区:[1]东北大学计算机科学与工程学院,沈阳110169 [2]辽宁省自然资源事务服务中心,沈阳110001
出 处:《计算机工程与应用》2023年第14期107-113,共7页Computer Engineering and Applications
基 金:全国高等院校计算机基础教育研究会计算机基础教育教学研究项目(2022-AFCEC-236)。
摘 要:视频分类是理解、归纳和检索视频数据的一个重要环节。新闻短视频具有音频信息比图像信息更能完整地描述新闻事件的特点,但传统视频分类模型常常只考虑图像信息或融合了音频和图像的多模态信息,并没有考虑模态信息之间的主辅关系。针对上述问题,采用以音频模态为主,图像模态为辅的融合机制,提出了融合多模态特征的新闻短视频分类模型。为进一步利用音频为主的特点,采用两阶段训练方式,使用音频模态单独训练,音频和图像模态联合训练,利用图像信息修正分类结果,提升新闻短视频分类的准确率。为训练和评价模型,采集了10304个新闻联播短视频作为实验数据集,总时长约为240 h。实验结果表明,所提模型的分类效果优于传统的新闻短视频分类模型。Video classification is an important part of understanding,summarizing and retrieving video data.News short video has the feature that audio information can describe news events more completely than image information,while traditional video classification models often only consider image information or fuse multimodal information of audio and image,which do not consider the primary-secondary relationship between modal information.To address the above problems,a news short video classification model fusing multimodal feature is proposed.It is designed with the fusion mechanism of audio modality as the main and image modality as the auxiliary.In order to make further use of the audio-dominated feature,a two-stage training mode is adopted.Firstly,the audio mode is trained separately,and then the audio and image modes are trained jointly.The image information is used to correct the classification results,so as to improve the accuracy of news short video classification.For the purpose of the model in training and evaluation,10304 news broadcast short videos have been collected as experimental dataset,with a total time of about 240 hours.The experimental results show that the classification effect of the proposed model is better than the traditional news short video classification model.
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.16.161.16