以图像视频为中心的跨媒体分析与推理  被引量:4

Image video centered cross-media analysis and reasoning

在线阅读下载全文

作  者:黄庆明 王树徽[2] 许倩倩[2] 李亮[2] 蒋树强 HUANG Qingming;WANG Shuhui;XU Qianqian;LI Liang;JIANG Shuqiang(School of Computer Science and Technology,University of Chinese Academy of Sciences,Beijing 100049,China;Key Lab of Intelligent Information Processing,Institute of Computing Technology,Chinese Academy of Sciences,Beijing 100190,China)

机构地区:[1]中国科学院大学计算机科学与技术学院,北京100049 [2]中国科学院计算技术研究所智能信息处理实验室,北京100190

出  处:《智能系统学报》2021年第5期834-848,共15页CAAI Transactions on Intelligent Systems

基  金:科技创新2030-新一代人工智能重大项目(2018AAA0102000);国家自然科学基金项目(62022083,61976202,61771457,61732007).

摘  要:如何跨越从跨媒体数据到跨媒体知识所面临的“异构鸿沟”和“语义鸿沟”,对体量巨大的跨媒体数据进行有效管理与利用,是发展新一代人工智能亟待突破的瓶颈问题。针对以图像视频为代表的海量网络跨媒体内容,借鉴人类感知与认知机理,本文对跨媒体内容统一表征与符号化表征、跨媒体深度关联理解、类人跨媒体智能推理等关键技术开展研究。基于上述关键技术,着力于解决发展新一代人工智能的知识匮乏共性难题,开展大规模跨媒体知识图谱的构建及人机协同标注技术研究,为跨媒体感知进阶到认知提供关键支撑,进一步为跨媒体理解、检索、内容转换生成等跨媒体内容管理与服务热点应用领域提供了可行思路。How to surpass the heterogeneity gap and semantic gap between the cross-media content and cross-media knowledge,and how to manage and utilize the huge amount of cross-media data effectively are urgent bottleneck prob-lems of developing a new generation of artificial intelligence.Aiming at massive online cross-media content represen-ted by image video and by referring to human perception and cognition mechanisms,this paper undertakes studies on such key technologies as unified representation and symbolic representation of cross-media content,deep correlative un-derstanding of cross-media and human-like cross-media intelligent reasoning.Based on the above technologies,this pa-per focuses on solving the common problem of knowledge shortage in the development of a new generation of artificial intelligence and carries out a research on the construction of large-scale cross-media knowledge graph and the human-machine cooperation based labeling technology,to provide strong support for the advancement from cross-media per-ception to cognition and further provide feasible solutions towards cross-media content management and popular ser-vice applications,e.g.,cross-media content understanding,retrieval,content transformation and generation,etc.

关 键 词:跨媒体 图像视频 统一表征 关联理解 可解释推理 人机协同 知识图谱 内容管理与服务 

分 类 号:TP37[自动化与计算机技术—计算机系统结构]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象