MULTIMODAL

作品数:530被引量:677H指数:10
导出分析报告
相关领域:医药卫生更多>>
相关作者:陈伟王群丁彰雄顾曰国施鹏飞更多>>
相关机构:西安外国语大学华中师范大学安徽师范大学华南理工大学更多>>
相关期刊:更多>>
相关基金:国家自然科学基金中国博士后科学基金国家重点基础研究发展计划北京市自然科学基金更多>>
-

检索结果分析

结果分析中...
选择条件:
  • 期刊=Machine Intelligence Researchx
条 记 录,以下是1-9
视图:
排序:
Cogeneration of Innovative Audio-visual Content: A New Challenge for Computing Art
《Machine Intelligence Research》2024年第1期4-28,共25页Mengting Liu Ying Zhou Yuwei Wu Feng Gao 
This work was supported by National Natural Science Foundation of China(No.62176006);the National Key Research and Development Program of China(No.2022YFF0902302).
In recent years,computing art has developed rapidly with the in-depth cross study of artificial intelligence generated con-tent(AIGC)and the main features of artworks.Audio-visual content generation has gradually been...
关键词:Artificial intelligence(AI)art AUDIO-VISUAL artificial intelligence generated content(AIGC) MULTIMODAL artistic evalu-ation 
Multimodal Biometric Fusion Algorithm Based on Ranking Partition Collision Theory
《Machine Intelligence Research》2023年第6期884-896,共13页Zhuorong Li Yunqi Tang 
This work was supported by Double First-Class Innovation Research Project for People’s Public Security University of China(No.2023SYL06).
Score-based multimodal biometric fusion has been shown to be successful in addressing the problem of unimodal techniques’vulnerability to attack and poor performance in low-quality data.However,difficulties still exi...
关键词:Image processing convolutional neural network MULTIMODAL BIOMETRICS FUSION 
Transformer: A General Framework from Machine Translation to Others被引量:2
《Machine Intelligence Research》2023年第4期514-538,共25页Yang Zhao Jiajun Zhang Chengqing Zong 
supported by Natural Science Foundation of China(Nos.62006224 and 62122088).
Machine translation is an important and challenging task that aims at automatically translating natural language sentences from one language into another.Recently,Transformer-based neural machine translation(NMT)has a...
关键词:Neural machine translation TRANSFORMER document neural machine translation(NMT) multimodal NMT low-resource NMT 
Federated Learning on Multimodal Data:A Comprehensive Survey被引量:1
《Machine Intelligence Research》2023年第4期539-553,共15页Yi-Ming Lin Yuan Gao Mao-Guo Gong Si-Jia Zhang Yuan-Qiao Zhang Zhi-Yuan Li 
supported by the National Natural Science Foundation of China(No.62036006);the Fundamental Research Funds for the Central Universities,China;the Innovation Fund of Xidian University,China.
With the growing awareness of data privacy,federated learning(FL)has gained increasing attention in recent years as a major paradigm for training models with privacy protection in mind,which allows building models in ...
关键词:Federated learning multimodal learning heterogeneous data edge computing collaborative learning 
Cross-modal Contrastive Learning for Generalizable and Efficient Image-text Retrieval
《Machine Intelligence Research》2023年第4期569-582,共14页Haoyu Lu Yuqi Huo Mingyu Ding Nanyi Fei Zhiwu Lu 
Cross-modal image-text retrieval is a fundamental task in bridging vision and language. It faces two main challenges that are typically not well addressed in previous works. 1) Generalizability: Existing methods often...
关键词:Image-text retrieval multimodal modeling contrastive learning weak correlation computer vision 
Vision Enhanced Generative Pre-trained Language Model for Multimodal Sentence Summarization
《Machine Intelligence Research》2023年第2期289-298,共10页Liqiang Jing Yiren Li Junhao Xu Yongcan Yu Pei Shen Xuemeng Song 
Multimodal sentence summarization(MMSS)is a new yet challenging task that aims to generate a concise summary of a long sentence and its corresponding image.Although existing methods have gained promising success in MM...
关键词:Multimodal sentence summarization(MMSS) generative pre-trained language model(GPLM) natural language generation deep learning artificial intelligence 
Multimodal Pretraining from Monolingual to Multilingual被引量:1
《Machine Intelligence Research》2023年第2期220-232,共13页Liang Zhang Ludan Ruan Anwen Hu Qin Jin 
supported by the National Natural Science Foundation of China(No.62072462);the National Key R&D Program of China(No.2020AAA0108600);the Large-scale Pretraining Program 468 of Beijing Academy of Artificial Intelligence(BAAI).
Multimodal pretraining has made convincing achievements in various downstream tasks in recent years.However,since the majority of the existing works construct models based on English,their applications are limited by ...
关键词:Multilingual pretraining multimodal pretraining cross-lingual transfer multilingual generation cross-modal retrieval 
VLP:A Survey on Vision-language Pre-training被引量:8
《Machine Intelligence Research》2023年第1期38-56,共19页Fei-Long Chen Du-Zhen Zhang Ming-Lun Han Xiu-Yi Chen Jing Shi Shuang Xu Bo Xu 
supported by the Key Research Program of the Chinese Academy of Sciences(No.ZDBSSSW-JSC006);the Strategic Priority Research Program of the Chinese Academy of Sciences(No.XDA 27030300).
In the past few years,the emergence of pre-training models has brought uni-modal fields such as computer vision(CV)and natural language processing(NLP)to a new era.Substantial works have shown that they are beneficial...
关键词:Vision and language pre-training TRANSFORMERS multimodal learning representation learning 
A Dynamic Resource Allocation Strategy with Reinforcement Learning for Multimodal Multi-objective Optimization被引量:3
《Machine Intelligence Research》2022年第2期138-152,共15页Qian-Long Dang Wei Xu Yang-Fei Yuan 
Many isolation approaches, such as zoning search, have been proposed to preserve the diversity in the decision space of multimodal multi-objective optimization(MMO). However, these approaches allocate the same computi...
关键词:Multimodal multi-objective optimization(MMO) dynamic resource allocating strategy(DRAS) reinforcement learning(RL) decision space partition zoning search 
检索报告 对象比较 聚类工具 使用帮助 返回顶部