1 Introduction In recent years,foundation Vision-Language Models(VLMs),such as CLIP[1],which empower zero-shot transfer to a wide variety of domains without fine-tuning,have led to a significant shift in machine learn...
In multimodal learning, Vision-Language Models (VLMs) have become a critical research focus, enabling the integration of textual and visual data. These models have shown significant promise across various natural lang...
National Key Research and Development Program of China,Grant/Award Number:2022YEB4700400;National Natural Science Foundation of China,Grant/Award Numbers:U22B2041,62173056;Education Department of Hainan Province,Grant/Award Number:Hnky2024ZD‐19。
In the past decades,substantial progress has been made in human action recognition.However,most existing studies and datasets for human action recognition utilise still images or videos as the primary modality.Image-b...
supported by National Key Research and Development Program of China (Grant No.2022ZD0160403);National Natural Science Foundation of China (Grant No.62176178)。
Driven by the expansion of foundation models and the increasing variety of downstream tasks,parameter-efficient fine-tuning(PEFT) methods have exhibited remarkable efficacy in the unimodal domain,effectively mitigatin...
supported in part by the United States Department of Agriculture(USDA)National Institute of Food and Agriculture(NIFA);Award Number 2023-67021-40614.
The transformation of age-old farming practices through the integration of digitization and automation has sparked a revolution in agriculture that is driven by cutting-edge computer vision and artificial intelligence...
supported by the Science and Technology Major Project of Fujian Province of China(Grant No.2022HZ028018);the National Natural Science Foundation of China(Grant No.51907030).
The state of health(SOH)and remaining useful life(RUL)of lithium-ion batteries are crucial for health management and diagnosis.However,most data-driven estimation methods heavily rely on scarce labeled data,while trad...