Task-specific Part Discovery for Fine-grained Few-shot Classification  

在线阅读下载全文

作  者:Yongxian Wei Xiu-Shen Wei 

机构地区:[1]School of Computer Science and Technology,Nanjing University of Science and Technology,Nanjing,210094,China

出  处:《Machine Intelligence Research》2024年第5期954-965,共12页机器智能研究(英文版)

基  金:supported by National Natural Science Foundation of China(No.62272231);Natural Science Foundation of Jiangsu Province of China(No.BK 20210340);National Key R&D Program of China(No.2021YFA1001100);the Fundamental Research Funds for the Central Universities,China(No.NJ2022028);CAAI-Huawei MindSpore Open Fund,China.

摘  要:Localizing discriminative object parts(e.g.,bird head)is crucial for fine-grained classification tasks,especially for the more challenging fine-grained few-shot scenario.Previous work always relies on the learned object parts in a unified manner,where they attend the same object parts(even with common attention weights)for different few-shot episodic tasks.In this paper,we propose that it should adaptively capture the task-specific object parts that require attention for each few-shot task,since the parts that can distinguish different tasks are naturally different.Specifically for a few-shot task,after obtaining part-level deep features,we learn a task-specific part-based dictionary for both aligning and reweighting part features in an episode.Then,part-level categorical prototypes are generated based on the part features of support data,which are later employed by calculating distances to classify query data for evaluation.To retain the discriminative ability of the part-level representations(i.e.,part features and part prototypes),we design an optimal transport solution that also utilizes query data in a transductive way to optimize the aforementioned distance calculation for the final predictions.Extensive experiments on five fine-grained benchmarks show the superiority of our method,especially for the 1-shot setting,gaining 0.12%,8.56%and 5.87%improvements over state-of-the-art methods on CUB,Stanford Dogs,and Stanford Cars,respectively.

关 键 词:Fine-grained image recognition few-shot learning transductive learning visual dictionary part feature discovery 

分 类 号:TP39[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象