检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:李慧祥 张会福[1,2] 胡勇华[1,2] 张鑫 王书盈[1,2] LI Huixiang;ZHANG Huifu;HU Yonghua;ZHANG Xin;WANG Shuying(School of Computer Science and Engineering,Hunan University of Science and Technology,Xiangtan 411201,China;Hunan Provincial Key Laboratory for Service Computing and Novel Software Technology,Hunan University of Science and Technology,Xiangtan 411201,China)
机构地区:[1]湖南科技大学计算机科学与工程学院,湖南湘潭411201 [2]湖南科技大学服务计算与软件服务新技术湖南省重点实验室,湖南湘潭411201
出 处:《计算机集成制造系统》2024年第7期2444-2452,共9页Computer Integrated Manufacturing Systems
基 金:湖南省教育厅科研资助项目(20B242,19A169);湖南省自然科学基金资助项目(2023JJ50019)。
摘 要:为提高快速傅里叶变换的处理速度,提升相关应用领域系统的性能,针对向量超长指令字(VLIW)架构处理器,提出一种适用于数据涉及的采样点数量为2的整数次幂的分组整序方法。该方法对输入数据按照一定规模分成若干个组,在每个分组内部进行混洗整序,以及通过直接存储器访问(DMA)传输将每组中的数据依次传输到结果数组,有效减少逆序数的计算需求和消除了单个数据寻址的要求。此外,针对硬件的“乒乓”存储功能提出了数据的向量混洗和DMA传输的并行处理方法,进一步提升分组整序算法的执行效率。通过在FT-M7002处理器上的具体算法实现和实验结果表明,该方法适用于向量VLIW架构处理器,运算结果正确,且有效地提高了整序的执行速度。Realignment is a key bottleneck of the performance of fast Fourier transform operation process.For the vector Very Long Instruction Word(VLIW)architecture processor,to accelerate the processing of fast Fourier transform and improve the performance of the application systems in related fields,a group realigning method suitable for arbitrary integer power bases of 2 was proposed.This method included three aspects:dividing the input data into several groups of certain scale,shuffling and realigning within each group,and transferring the data in each group to the destination array in turn through Direct Memory Access(DMA).The method had the advantage of effectively reducing the calculation requirement of inverse sequence numbers and avoiding the requirement of finding index for each single data.Moreover,for architectures with“ping-pong”storage mechanism,a parallel method of shuffling and DMA was proposed based on this method.The specific algorithm implementation and experimental verification were carried out on the FT-M7002 processor.Experimental results showed that the method was not only correct,but also could effectively improve the execution speed of realigning in vector processors,which was very suitable for vector VLIW processors.
分 类 号:TP301[自动化与计算机技术—计算机系统结构]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.7