云南高校图书馆联盟文献共享服务平台- AUDIO-VISUAL

AUDIO-VISUAL: 作品数：40被引量：15H指数：2; 导出分析报告; 相关领域：电子电信更多>>; 相关作者：李瑶王若虹曾雪梅更多>>; 相关机构：杭州电子科技大学湖南对外经济贸易职业学院大连工业大学重庆师范大学更多>>; 相关期刊：《Chinese Journal of Electronics》《Annals of Eye Science》《语言与文化研究》《Cultural and Religious Studies》更多>>; 相关基金：国家自然科学基金国家高技术研究发展计划北京市自然科学基金更多>>

Integrating Zhuang Culture Into College English Audio-Visual Speaking Course:A Multicultural Perspective: 《Cultural and Religious Studies》2024年第12期801-805,共5页LUO Mei CHEN Yingzhu; supported by Guangxi University of Chinese Medicine School-Level Education and Teaching Reform and Research Project:Integration and Innovative Practice of Ideological and Political Education and Zhuang Ethnic Culture in College English Audio-Visual Speaking Course(Project No.2022B073).; Zhuang culture,a representative of the native ethnic culture of Guangxi,China,is of great significance to Chinese culture.In order to promote traditional culture,enrich the teaching content of College English Audio-Vi...; 关键词：Zhuang culture College English Audio-Visual Speaking Course classroom practice multicultural perspective

A Recurrent Neural Network for Multimodal Anomaly Detection by Using Spatio-Temporal Audio-Visual Data: 《Computers, Materials & Continua》2024年第11期2493-2515,共23页Sameema Tariq Ata-Ur-Rehman Maria Abubakar Waseem Iqbal Hatoon S.Alsagri Yousef A.Alduraywish Haya Abdullah AAlhakbani; supported and funded by the Deanship of Scientific Research at Imam Mohammad Ibn Saud Islamic University(IMSIU)(grant number IMSIU-RG23148).; In video surveillance,anomaly detection requires training machine learning models on spatio-temporal video sequences.However,sometimes the video-only data is not sufficient to accurately detect all the abnormal activi...; 关键词：Acoustic-visual anomaly detection sequence-to-sequence autoencoder reconstruction error late fusion regularity score

Self-supervised Learning for Speech Emotion Recognition Task Using Audio-visual Features and Distil Hubert Model on BAVED and RAVDESS Databases: 《Journal of Systems Science and Systems Engineering》2024年第5期576-606,共31页Karim Dabbabi Abdelkarim Mars; Existing pre-trained models like Distil HuBERT excel at uncovering hidden patterns and facilitating accurate recognition across diverse data types, such as audio and visual information. We harnessed this capability to...; 关键词：Wav2vec 2.0 Distil HuBERT HuBERT SER audio and audio-visual features

Audio-visual keyword transformer for unconstrained sentence-level keyword spotting: 《CAAI Transactions on Intelligence Technology》2024年第1期142-152,共11页Yidi Li Jiale Ren Yawei Wang Guoquan Wang Xia Li Hong Liu; Science and Technology Plan of Shenzhen,Grant/Award Number:JCYJ20200109140410340;National Natural Science Foundation of China,Grant/Award Number:62073004。; As one of the most effective methods to improve the accuracy and robustness of speech tasks,the audio-visual fusion approach has recently been introduced into the field of Keyword Spotting(KWS).However,existing audio-...; 关键词：artificial intelligence multimodal approaches natural language processing neural network speech processing

Cogeneration of Innovative Audio-visual Content: A New Challenge for Computing Art: 《Machine Intelligence Research》2024年第1期4-28,共25页Mengting Liu Ying Zhou Yuwei Wu Feng Gao; This work was supported by National Natural Science Foundation of China(No.62176006);the National Key Research and Development Program of China(No.2022YFF0902302).; In recent years,computing art has developed rapidly with the in-depth cross study of artificial intelligence generated con-tent(AIGC)and the main features of artworks.Audio-visual content generation has gradually been...; 关键词：Artificial intelligence(AI)art AUDIO-VISUAL artificial intelligence generated content(AIGC) MULTIMODAL artistic evalu-ation

On the Presentation of Spectacle Aesthetics in Films: 《US-China Education Review(A)》2023年第5期240-244,共5页WU Liuming; Spectacle is instrumental in the artistic expression of movies.It is also a seismic factor that influences people to watch films,as an aesthetic activity.When the concept of spectacle was first proposed,it referred,in...; 关键词：film spectacle aesthetics AUDIO-VISUAL STORYTELLING NARRATIVE

Emotion-Aware Music Driven Movie Montage: 《Journal of Computer Science & Technology》2023年第3期540-553,共14页刘伍琴林敏轩黄海斌马重阳宋玉董未名徐常胜; supported by the National Key Research and Development Program of China under Grant No.2020AAA0106200 and the National Natural Science Foundation of China under Grant No.61832016.; In this paper, we present Emotion-Aware Music Driven Movie Montage, a novel paradigm for the challenging task of generating movie montages. Specifically, given a movie and a piece of music as the guidance, our method ...; 关键词：movie montage emotion analysis audio-visual modality contrastive learning

Concert of ‘Charm of the Yellow River’ When Traditional ChineseFolk Music Meets Shanxi A Wonderful Audio-Visual Feast: 《China & The World Cultural Exchange》2022年第10期9-11,共3页Lu Feng; As the mother river of the Chinese nation,the Yellow River has been surging for thousands of years,nurturing the Chinese people and giving birth to the Chinese civilization.Cultural heritage carries the profound histo...; 关键词：mother Yellow SHANXI

Dynamic Audio-Visual Biometric Fusion for Person Recognition被引量：1: 《Computers, Materials & Continua》2022年第4期1283-1311,共29页Najlaa Hindi Alsaedi Emad Sami Jaha; Biometric recognition refers to the process of recognizing a person’s identity using physiological or behavioral modalities,such as face,voice,fingerprint,gait,etc.Such biometric modalities are mostly used in recogni...; 关键词：BIOMETRICS dynamic fusion feature fusion identification multimodal biometrics occluded face recognition quality-based recognition verification voice recognition

Sensitivity of N400 Effect During Speech Comprehension Under the Uni-and Bi-Modality Conditions被引量：1: 《Tsinghua Science and Technology》2022年第1期141-149,共9页Yanfei Lin Zhiwen Liu Xiaorong Gao; supported by the National Natural Science Foundation of China (Nos. 61601028 and 61431007);the Key R&D Program of Guangdong Province of China (No.2018B030339001);the National Key R&D Program of China (No. 2017YFB1002505)。; N400 is an objective electrophysiological index in semantic processing for brain.This study focuses on the sensitivity of N400 effect during speech comprehension under the uni-and bi-modality conditions.Varying the Si...; 关键词：audio-visual speech auditory noise audio-visual integration Signal-to-Noise Ratio(SNR)

AUDIO-VISUAL