MULTI-MODAL

Works: 173; Citations: 350; H-index: 9
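Related fields: Automation and Computer Technology
Related authors: 陈光慧, 曲扬, 顾伟红, 晏克非, 陶晓峰
Related institutions: 广东柯内特环境科技有限公司, Jinan University, Shenyang University of Chemical Technology, Zhejiang Normal University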
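Related funding: National Natural Science Foundation of China, Doctoral Program Foundation of the Ministry of Education of China, National Basic Research Program of China, National Social Science Fund of China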
Filter:
  • Journal = Machine Intelligence Research
Showing records 1-6:
Editorial for Special Issue on Multi-modal Representation Learning
Machine Intelligence Research, 2024, Issue 4, pp. 615-616 (2 pages). Deng-Ping Fan, Nick Barnes, Ming-Ming Cheng, Luc Van Gool
The past decade has witnessed the impressive and steady development of single-modal AI technologies in several fields, thanks to the emergence of deep learning. Less studied, however, is multi-modal AI, commonly considered...
Keywords: MODAL, utilize, INCOMPLETE
Boosting Multi-modal Ocular Recognition via Spatial Feature Reconstruction and Unsupervised Image Quality Estimation
Machine Intelligence Research, 2024, Issue 1, pp. 197-214 (18 pages). Zihui Yan, Yunlong Wang, Kunbo Zhang, Zhenan Sun, Lingxiao He
This work was supported by the National Natural Science Foundation of China (Nos. 62006225, 61906199 and 62071468); the Strategic Priority Research Program of Chinese Academy of Sciences (CAS), China (No. XDA 27040700); and sponsored by the Beijing Nova Program, China (Nos. Z201100006820050 and Z211100002121010).
In the daily application of an iris-recognition-at-a-distance (IAAD) system, many ocular images of low quality are acquired. As the iris part of these images is often not qualified for the recognition requirements, the mor...
Keywords: iris recognition, periocular recognition, spatial feature reconstruction, fully convolutional network, flexible matching, unsupervised iris quality assessment, adaptive weight fusion
Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey (cited by: 14)
Machine Intelligence Research, 2023, Issue 4, pp. 447-482 (36 pages). Xiao Wang, Guangyao Chen, Guangwu Qian, Pengcheng Gao, Xiao-Yong Wei, Yaowei Wang, Yonghong Tian, Wen Gao
Supported by the National Natural Science Foundation of China (Nos. 61872256 and 62102205); the Key-Area Research and Development Program of Guangdong Province, China (No. 2021B0101400002); the Peng Cheng Laboratory Key Research Project, China (No. PCL 2021A07); and Multi-source Cross-platform Video Analysis and Understanding for Intelligent Perception in Smart City, China (No. U20B2052).
With the urgent demand for generalized deep models, many pre-trained big models are proposed, such as bidirectional encoder representations (BERT), vision transformer (ViT), generative pre-trained transformers (GPT), etc. Insp...
Keywords: multi-modal (MM), pre-trained model (PTM), information fusion, representation learning, deep learning
Visual Superordinate Abstraction for Robust Concept Learning
Machine Intelligence Research, 2023, Issue 1, pp. 79-91 (13 pages). Qi Zheng, Chao-Yue Wang, Dadong Wang, Da-Cheng Tao
Supported in part by the Australian Research Council (ARC) (Nos. FL-170100117, DP-180103424, IC-190100031 and LE-200100049).
Concept learning constructs visual representations that are connected to linguistic semantics, which is fundamental to vision-language tasks. Although promising progress has been made, existing concept learners are st...
Keywords: concept learning, visual question answering, weakly-supervised learning, multi-modal learning, curriculum learning
Causal Reasoning Meets Visual Representation Learning: A Prospective Study (cited by: 4)
Machine Intelligence Research, 2022, Issue 6, pp. 485-511 (27 pages). Yang Liu, Yu-Shen Wei, Hong Yan, Guan-Bin Li, Liang Lin
Supported in part by the National Natural Science Foundation of China (Nos. 62002395, 61976250 and U1811463); the National Key R&D Program of China (No. 2021ZD0111601); and the Guangdong Basic and Applied Basic Research Foundation, China (Nos. 2021A15150123 and 2020B1515020048).
Visual representation learning is ubiquitous in various real-world applications, including visual comprehension, video understanding, multi-modal analysis, human-computer interaction, and urban computing. Due to the emergen...
Keywords: causal reasoning, visual representation learning, reliable artificial intelligence, spatial-temporal data, multi-modal analysis
Exploring the Brain-like Properties of Deep Neural Networks: A Neural Encoding Perspective (cited by: 1)
Machine Intelligence Research, 2022, Issue 5, pp. 439-455 (17 pages). Qiongyi Zhou, Changde Du, Huiguang He
Supported by the National Natural Science Foundation of China (Nos. 61976209 and 62020106015); the CAS International Collaboration Key Project, China (No. 173211KYSB20190024); and the Strategic Priority Research Program of CAS, China (No. XDB32040000).
Nowadays, deep neural networks (DNNs) have been equipped with powerful representation capabilities. The deep convolutional neural networks (CNNs) that draw inspiration from the visual processing mechanism of the primate ear...
Keywords: convolutional neural network (CNN), vision transformer (ViT), multi-modal networks, spatial-temporal networks, visual neural encoding, brain-like neural networks