宽带信号匹配滤波的GPU实现及性能优化  被引量:2

Implementation and optimization of the wideband matched filter on the GPU

在线阅读下载全文

作  者:周航[1] 蔡志明[1] 王希敏[1] 

机构地区:[1]海军工程大学电子工程学院,湖北武汉430033

出  处:《西安电子科技大学学报》2015年第3期135-140,191,共7页Journal of Xidian University

基  金:国家自然科学基金资助项目(51009146)

摘  要:从宽带相关的角度推导了基于小波变换的匹配滤波算法及基于快速傅里叶变换(FFT)算法,并分析了算法复杂度,提出了基于图形处理器(GPU)的可配置宽带匹配滤波的软件实现和理论预测与函数实测结合的优化方法.通过优化线程块的维度、绑定纹理寄存器来改进内核函数性能,再使用计算统一设备架构(CUDA)库来降低FFT与极值搜索的时延,并进行了性能优化设计.在性能测试中,文中方法在GPU平台的实现相比8核CPU平台的实现具有3.3倍加速比,其处理时延能够满足宽带匹配滤波的实时性需求.The fine estimation of wideband ambiguity, which has a sharp main ridge, requires large amounts of searching on the time-scale. That desperately needs the well-optimized software on high performance hardware. In terms of wideband correlation, the matched filter based on the CWT and its fast algorithm based on the FFT are studied, and furthermore its complexity is analyzed. Then a reconfigurable implementation on the GPU is proposed, and a method of optimization that combines analysis with testing is proposed. By optimizing the dimension of the thread block and utilizing texture memory, the time of the kernel is reduced; the CUDA library is introduced, so the delays of the FFT and maximum searching are reduced. In comparison with the method in the 8-core CPU, the proposed method improves the overall performance up to 3.3 times. The speed can meet the challenge of real-time processing of the wideband matched filter.

关 键 词:信号处理 并行计算 图形处理器 程序优化 连续小波变换 

分 类 号:TP311[自动化与计算机技术—计算机软件与理论]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象