机构地区:[1]Key Laboratory of Road and Traffic Engineering of the Ministry of Education,Tongji University,Cao’an Road 4800,Shanghai 201804,China [2]School of Transportation,Southeast University,Southeast University Road 2,Jiangning District,Nanjing 211189,China [3]State Key Laboratory of Coastal and Offshore Engineering,Dalian University of Technology,No.2 Linggong Road,Ganjingzi District,Dalian 116023,Liaoning Province,China [4]Smart Transportation Application and Research Laboratory,Department of Civil and Environmental Engineering,University of Washington,135 More Hall,Seattle,WA 98195,US
出 处:《International Journal of Transportation Science and Technology》2024年第2期73-86,共14页交通科学与技术(英文)
基 金:supported by the Scientific Research Project of Shanghai Science and Technology Commission of China(No.21DZ1200601);the National Natural Science Foundation of China(No.NSFC52108411).
摘 要:Vehicle model recognition(VMR)benefits the parking,surveillance,and tolling system by automatically identifying the exact make and model of the passing vehicles.Edge computing technology enables the roadside facilities and mobile cameras to achcieve VMR in realtime.Current work generally relies on a specific view of the vehicle or requires huge calculation capability to deploy the end-to-end deep learning network.This paper proposes a lightweight two-stage identification method based on object detection and image retrieval techniques,which empowers us the ability of recognizing the vehicle model from an arbitrary view.The first-stage model estimates the vehicle posture using object detection and similarity matching,which is cost-efficient and suitable to be programmed in the edge computing devices;the second-stage model retrieves the vehicle’s label from the dataset based on gradient boosting decision tree(GBDT)algorithm and VGGNet,which is flexible to the changing dataset.More than 8000 vehicle images are labeled with their components’information,such as headlights,windows,wheels,and logos.The YOLO network is employed to detect and localize the typical components of a vehicle.The vehicle postures are estimated by the spatial relationship between different segmented components.Due to the variety of the perspectives,a 7-dimensional vector is defined to represent the relative posture of the vehicle and screen out the images with a similar photographic perspective.Two algorithms are used to extract the features from each image patch:(1)the scale invariant feature transform(SIFT)combined with the bag-of-features(BoF)and(2)pre-trained deep neural network.The GBDT is applied to evaluate the weight of each component regarding its impact on VMR.The descriptors of each component are then aggregated to retrieve the best matching image from the database.The results showed its advantages in terms of accuracy(89.2%)and efficiency,demonstrating the vast potential of applying this method to large-scale vehicle model recogni
关 键 词:Vehicle model recognition Edging computing Convolutional neural network Posture estimation Image retrieval
分 类 号:TP3[自动化与计算机技术—计算机科学与技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...