检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:张好聪 李涛 邢立冬 潘风蕊 ZHANG Haocong;LI Tao;XING Lidong;PAN Fengrui(School of Electronic Engineering,Xi’an University of Posts&Telecommunications,Xian 710121,China;School of Computer Science&Technology,Xian University of Posts&Telecommunications,Xian 710121,China)
机构地区:[1]西安邮电大学电子工程学院,西安710121 [2]西安邮电大学计算机学院,西安710121
出 处:《计算机科学与探索》2022年第7期1583-1593,共11页Journal of Frontiers of Computer Science and Technology
基 金:陕西省科技统筹项目(2015KTCQ013);陕西省教育厅协同创新中心项目(17JF032);陕西省教育厅科研计划项目(20JY058)。
摘 要:针对数字图像处理计算量大、串行结构计算速度慢等特点,完成了最新的开源OpenVX计算机视觉加速规范1.3中底层特征抽取核函数的并行实现,使用自主设计的OpenVX可编程并行处理器进行了验证。在对图像的底层特征提取中,前期滤波及平滑处理选择OpenVX规范1.3中基本像素点处理函数ColorConvert(颜色转换)和局部图像处理函数GaussianFilter(高斯滤波)、MedianFilter(中值滤波)等,核心的特征抽取操作选择HarrisCorners(哈里斯角点检测)和CannyEdgeDetector(坎尼边缘检测)核函数,通过将计算量大的复杂结点拆分为多个简单结点,构建不同的基于图的执行模型,并映射在OpenVX并行处理器上,分别实现图像的边缘检测和特征点抽取。使用Verilog语言设计整体硬件电路,经Xilinx公司的FPGA芯片xcvu440-flga-2892-2-e综合验证,与串行映射结构相比,所选核函数在OpenVX可编程并行处理器上的并行加速比最高可达14.269。实验结果表明,OpenVX规范1.3中的核函数尤其是复杂核函数能够在本并行处理结构上达到预期的加速效果,且并行与串行结构加速比呈线性增长。Aiming at the mass computing and slow speed of serial structure calculation of digital image processing,parallel implementation of underlying feature extraction kernel functions in the latest open source OpenVX specifi-cation 1.3 is completed,and the verification is carried out with the self-designed OpenVX programmable parallel processor.In the underlying feature extraction of the image,the basic pixel processing function Color Convert,the local image processing functions Gaussian Filter and Median Filter of OpenVX specification 1.3 are selected for fil-tering and smoothing.Harris Corners and Canny Edge Detector are selected for feature extraction.By dividing the complex nodes with large amount of computation into several simple nodes,different graph execution models are constructed and mapped on the OpenVX parallel processor to realize image edge detection and feature point extraction respectively.Verilog is used to design the hardware circuit,and the FPGA chip xcvu440-flga-2892-2-e of Xilinx has comprehensively verified that,compared with the serial mapping structure,the parallel acceleration ratio of the se-lected kernel function on the OpenVX programmable parallel processor can be up to 14.269.Experimental results show that the kernel functions in OpenVX specification 1.3,especially the complex kernel functions,can achieve ex-pected acceleration effect in this parallel processing structure,and the speedup ratio of parallel and serial structures increases linearly.
关 键 词:OpenVX规范1.3 计算机视觉函数 底层特征抽取 图执行模型 并行处理器
分 类 号:TP302[自动化与计算机技术—计算机系统结构]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.171