Visual knowledge in the bigmodel era:retrospect and prospect  

在线阅读下载全文

作  者:Wenguan WANG Yi YANG Yunhe PAN 

机构地区:[1]College of Computer Science and Technology,Zhejiang University,Hangzhou 310027,China†E-mail:yangyics@zju.edu.cn

出  处:《Frontiers of Information Technology & Electronic Engineering》2025年第1期1-19,共19页信息与电子工程前沿(英文版)

基  金:supported by“Pioneer”and“Leading Goose”R&D Program of Zhejiang Province,China(No.2024C01161);the National Science and Technology Major Project of China(No.2023ZD0121300);the National Natural Science Foundation of China(No.62372405);the Fundamental Research Funds for the Central Universities,China。

摘  要:Visual knowledge is a new form of knowledge representation that can encapsulate visual concepts and their relations in a succinct,comprehensive,and interpretable manner,with a deep root in cognitive psychology.As the knowledge of the visual world has been identified as an indispensable component of human cognition and intelligence,visual knowledge is poised to have a pivotal role in establishing machine intelligence.With the recent advance of artificial intelligence(AI)techniques,large AI models(or foundation models)have emerged as a potent tool capable of extracting versatile patterns from broad data as implicit knowledge,and abstracting them into an outrageous amount of numeric parameters.To pave the way for creating visual knowledge empowered AI machines in this coming wave,we present a timely review that investigates the origins and development of visual knowledge in the pre-big-model era,and accentuates the opportunities and unique role of visual knowledge in the big model era.

关 键 词:Visual knowledge Artificial intelligence Foundation model Deep learning 

分 类 号:TP3[自动化与计算机技术—计算机科学与技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象