检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]国防科技大学计算机学院,湖南长沙410073
出 处:《计算机工程与科学》2009年第4期130-133,共4页Computer Engineering & Science
基 金:国家自然科学基金资助项目(60473079)
摘 要:一些数字信号处理程序存在强数据相关性,在将这些数字信号处理程序划分到多核DSP上时,需要开发细粒度并行性,而细粒度并行性的开发需要快速的核间通信机制支持。本文提出了一种新的面向多核DSP的快速核间通信机制:标记式共享寄存器文件TSRF,TSRF由所有的DSP核共享,寄存器文件中的每个寄存器同一个有效标记位关联,该标记位提供了核间通信同步支持。本文构建了集成TSRF机制的多核DSP原型的周期精确模拟器,该多核DSP原型包含的处理器核数目为4个。通过详细模拟,我们使用数据相关性较强的数字信号处理算法:IIR滤波和ADPCM编解码,对TSRF机制的性能进行了测试,与单核DSP相比,TSDB机制性能提升分别为1.8、1.2和1.9左右。Some digital signal processing algorithms have dependence between relatively very small amounts of computations, so partitioning these algorithms across multiple cores needs to exploit fine-grained parallelism. However, the capability of exploiting fine-grained parallelism is constrained by the fast inter-core communication mechanism. In this paper, we propose a new inter-core communication mechanism for multi-core DSP: TSRF (Tagged Share Register File). TSRF is shared by all DSP cores, and each register in TSRF is correlated with a tag bit which provides the low cost synchronization support for inter-core communication. We construct a cycle-accurate architecture simulator for a multi-core DSP which integrates TSRF. The number of DSP cores is four. Through detailed simulation, we evaluate the efficiency of TSRF with typical digital signal processing algorithms: IIR filter and ADPCM encoder/decoder. Compared to the result on the singlecore DSP, the multi-core DSP which integrates TSRF attains the speedup of about 1.8, 1.2 and 1.9 respectively.
分 类 号:TP368.1[自动化与计算机技术—计算机系统结构]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.222