检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:郝亚喆 张远 张为 HAO Yazhe;ZHANG Yuan;ZHANG Wei(School of Microelectronics, Tianjin University, Tianjin 300072, China;Science and Technology Information Department of Qinhuangdao Public Security Bureau, Qinhuangdao, Hebei 066000, China)
机构地区:[1]天津大学微电子学院,天津300072 [2]秦皇岛市公安局科技信息化处,河北秦皇岛066000
出 处:《西安交通大学学报》2022年第1期177-183,共7页Journal of Xi'an Jiaotong University
基 金:光电信息控制和安全技术重点实验室资助项目(JCKY2019210C053)。
摘 要:针对二维9/7离散小波变换硬件架构中数据缓存需求高的问题,提出了一种基于提升算法的低存储架构。通过调整提升算法数据计算顺序,设计了一种动态计算二维小波变换的新型迭代分步计算方法。根据行、列变换的不同,对其分别做一维变换架构设计,其中行滤波器结构通过将输入数据进行三序列分裂,有效减少了寄存器数;列滤波器结构通过单行输入处理消除转置存储器,同时实现了乘法器和加法器的复用。整体二维变换采用并行和流水线混合架构设计,关键路径延时减小到一个乘法器延迟。实验结果表明,对于1024像素×1024像素的图像,与其他提升结构相比,本结构片上内存使用减少了11.1%,硬件效率提高了8.2%以上;与基于卷积的迭代计算方法相比,计算周期减少为现有结构的1/9。在型号为Xilinx Kintex7 XC7K325T的现场可编程逻辑门阵列上实现,吞吐率达到460 MB/s,且具有明显的硬件资源优势。In the two-dimensional 9/7 discrete wavelet transform hardware architecture,high data storage requirements is always the problem to be solved.A lifting-based memory-efficient architecture is proposed.First,the data calculation sequence of the lifting algorithm is modified,and a new iterative step-by-step calculation method is designed to dynamically calculate the two-dimensional wavelet transform.According to the difference of row and column transforms,one-dimensional transform architecture design is performed on them.The row filter structure effectively reduces the number of registers by splitting the input data into three sequences.The column filter structure eliminates the data buffer of the transposed module through single-line input processing.At the same time,the multiplexing of multipliers and adders is realized.With parallel and pipeline hybrid architecture,the critical path delay of two-dimensional transform structure is reduced to a multiplier delay.Experimental results show that for an image of 1024 pixel×1024 pixel,the on-chip memory requirement is reduced by 11.1%,and the hardware efficiency is raised by 8.24%in contrast to the existing lifting-based architectures.Compared with the iterative calculation method based on convolution,the calculation time is reduced to 1/9 of the existing structure.The proposed architecture is implemented on Xilinx FPGA Kintex7 XC7K325T,achieving a throughput of 460 MB/s and obvious hardware resource advantages.
关 键 词:离散小波变换 低存储架构 转置存储器 现场可编程逻辑门阵列
分 类 号:TN47[电子电信—微电子学与固体电子学]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.137.210.133