检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]北京航空航天大学宇航学院图像处理中心,北京100191
出 处:《红外与激光工程》2014年第7期2354-2361,共8页Infrared and Laser Engineering
基 金:国家863计划(2011AA0641,2013AA041201-7)
摘 要:目标跟踪中的伺服系统需要极低的跟踪延时,由于粒子滤波跟踪算法固有的庞大计算量使得目标跟踪的精度大受影响。提出了一种粒子滤波跟踪算法在多核DSP系统中的快速实现方法。首先,利用DSP片上的包加速器来降低以太网相机的采集延时以及CPU占用率,CPU占用率从31%降低到10%;其次,通过手动操作高速缓存的刷新和实效,解决了多核同时共享图像数据带来的存储器一致性问题,多个核能通过高速缓存快速获取图像数据;最后,通过在多核核心上设置代理任务的方法,建立了一种多核并行计算的机制。粒子滤波算法中计算复杂度高的运算阶段被分配到多个核心上同时运算,实现了算法的低延时。实验结果显示8核加速比达到7倍以上,优于开放多处理标准OpenMP的并行优化效果。The object tracking servo system requires a low delay from an object moving to starting of rotations while the inherent computational complexity of PF (Particle Filter) affects the tracking precision. In this paper, a multicore DSP parallel implementation strategy for particle filter object tracking was proposed. Firstly, the PA module on chip was used to reduce the GigE image capturing delay and the CPU occupancy. The CPU load was considerably reduced from 31% to 10%. Secondly, by manually FLUSH after writing and INVALID before reading, the memory consistency problem was addressed and cacheable shared image data can be accessed at high efficiency. Finally, a mechanism of parallel computing on multi-core processor was introduced by adding proxy task. The computational intensive stages of particle filter were dispatched to 8 cores to eliminate system delay. Experimental results show that the tracking response time was decreased and algorithmic speedup runs up to 7 and exceeds OpenMP.
分 类 号:TP391.4[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.229