基于高分辨率网络的大熊猫姿态估计方法  被引量:3

Giant panda pose estimation method based on high resolution net

在线阅读下载全文

作  者:漆愚 苏菡[1] 侯蓉 刘鹏 陈鹏 臧航行 张志和 QI Yu;SU Han;HOU Rong;LIU Peng;CHEN Peng;ZANG Hangxing;ZHANG Zhihe(School of Computer Science,Sichuan Normal University,Chengdu 610101,China;Chengdu Research Base of Giant Panda Breeding,Sichuan Key Laboratory of Conservation Biology for Endangered Wildlife,Chengdu 610086,China;Sichuan Academy of Giant Panda,Chengdu 610081,China)

机构地区:[1]四川师范大学计算机科学学院,成都610101 [2]成都大熊猫繁育研究基地,四川省濒危野生动物保护生物学重点实验室,成都610086 [3]四川省大熊猫科学研究院,成都610081

出  处:《兽类学报》2022年第4期451-460,共10页Acta Theriologica Sinica

基  金:四川省科技厅创新苗子工程项目(2021008);成都大熊猫繁育研究基地课题(2021CPB-B06,2020CPB-C09,CPB2018-02)。

摘  要:对圈养大熊猫(Ailuropoda melanoleuca)开展长期行为监测能及时了解其所处生理周期和健康状况,有助于繁殖饲养机构迅速采取相应繁育保护措施提高饲养管理水平,但目前无法对大熊猫进行24 h监控并及时地获得相应的行为信息。准确的动物姿态估计是动物行为研究的关键,也是诸多下游应用的基础。了解大熊猫的姿态可以促进大熊猫行为研究并提升保护管理水平。为了提高复杂环境下大熊猫姿态估计的准确率,本文以高分辨率网络(High resolution net,HRNet)为基础网络架构提出了一种大熊猫姿态估计方法:针对大熊猫不同部位尺度差异较大的问题,在HRNet-32中引入了空洞空间金字塔池化(Atrous spatial pyramid pooling,ASPP)模块,在提升特征感受野的同时捕获多尺度信息;同时对大熊猫身体关键点进行分组,引入基于部位的多分支结构来学习特定于每个部位组的表征。多次对比实验结果表明本文所用模型具有较高的检测精度:在PCK@0.05中所用模型精度达到了81.51%。本文提出的方法可为大熊猫的行为分析和健康评估提供技术支撑。Long-term behavioral monitoring of captive giant pandas(Ailuropoda melanoleuca)can help animal managers better understand the panda’s physiological cycle and health status in a timely manner,and help breeding facilities quickly take corresponding husbandry actions to improve breeding management.At present,neither animal managers nor scientists can monitor giant pandas 24 hours a day and obtain corresponding behavioral information on time.Accurate animal pose estimation is an important factor in animal behavior research and is also the basis for many downstream applications.Understanding the pose of giant pandas can greatly promote the research of panda behavior and improve its conservation and management.In order to improve the accuracy of giant panda pose estimation in complex environments,this paper proposed a pose estimation method based on the high-resolution network HRNet-32.To solve the problem of large-scale differences in different parts of the giant pandas,an atrous spatial pyramid pooling module was introduced in HRNet-32,which used dilated convolution with different dilated rates to form a similar pyramid form,so as to capture multi-scale information while enhancing the feature’s receptive field.Meanwhile,the giant panda pose estimation was regarded as a homogeneous multi-task learning problem,the joint points of the giant panda were grouped,and the part-based multi-branch structure was introduced to learn the representations specific to each part group.The results of several comparison experiments show that the model proposed in this paper,PCK@0.05,had a high detection accuracy(81.51%).The method proposed in this paper can provide technical support for the behavioral analysis and health assessment of giant pandas.

关 键 词:大熊猫 姿态估计 图像分析 深度学习 

分 类 号:Q958.1[生物学—动物学]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象