supported by Guangxi University of Chinese Medicine School-Level Education and Teaching Reform and Research Project:Integration and Innovative Practice of Ideological and Political Education and Zhuang Ethnic Culture in College English Audio-Visual Speaking Course(Project No.2022B073).
Zhuang culture,a representative of the native ethnic culture of Guangxi,China,is of great significance to Chinese culture.In order to promote traditional culture,enrich the teaching content of College English Audio-Vi...
supported and funded by the Deanship of Scientific Research at Imam Mohammad Ibn Saud Islamic University(IMSIU)(grant number IMSIU-RG23148).
In video surveillance,anomaly detection requires training machine learning models on spatio-temporal video sequences.However,sometimes the video-only data is not sufficient to accurately detect all the abnormal activi...
Existing pre-trained models like Distil HuBERT excel at uncovering hidden patterns and facilitating accurate recognition across diverse data types, such as audio and visual information. We harnessed this capability to...
Science and Technology Plan of Shenzhen,Grant/Award Number:JCYJ20200109140410340;National Natural Science Foundation of China,Grant/Award Number:62073004。
As one of the most effective methods to improve the accuracy and robustness of speech tasks,the audio-visual fusion approach has recently been introduced into the field of Keyword Spotting(KWS).However,existing audio-...
This work was supported by National Natural Science Foundation of China(No.62176006);the National Key Research and Development Program of China(No.2022YFF0902302).
In recent years,computing art has developed rapidly with the in-depth cross study of artificial intelligence generated con-tent(AIGC)and the main features of artworks.Audio-visual content generation has gradually been...
Spectacle is instrumental in the artistic expression of movies.It is also a seismic factor that influences people to watch films,as an aesthetic activity.When the concept of spectacle was first proposed,it referred,in...
supported by the National Key Research and Development Program of China under Grant No.2020AAA0106200 and the National Natural Science Foundation of China under Grant No.61832016.
In this paper, we present Emotion-Aware Music Driven Movie Montage, a novel paradigm for the challenging task of generating movie montages. Specifically, given a movie and a piece of music as the guidance, our method ...
As the mother river of the Chinese nation,the Yellow River has been surging for thousands of years,nurturing the Chinese people and giving birth to the Chinese civilization.Cultural heritage carries the profound histo...
Biometric recognition refers to the process of recognizing a person’s identity using physiological or behavioral modalities,such as face,voice,fingerprint,gait,etc.Such biometric modalities are mostly used in recogni...
supported by the National Natural Science Foundation of China (Nos. 61601028 and 61431007);the Key R&D Program of Guangdong Province of China (No.2018B030339001);the National Key R&D Program of China (No. 2017YFB1002505)。
N400 is an objective electrophysiological index in semantic processing for brain.This study focuses on the sensitivity of N400 effect during speech comprehension under the uni-and bi-modality conditions.Varying the Si...