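Affiliation: [1] State Key Laboratory of ASIC and System, Fudan University, Shanghai 201203, China
Source: Journal of Chinese Computer Systems (《小型微型计算机系统》), 2008, No. 4, pp. 741-745 (5 pages)
Funding: Supported by the Huawei Science and Technology Fund (YJCB2005019BA)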
Abstract: Advances in video technology call for motion estimation algorithms that are faster and better suited to hardware implementation. This paper proposes a butterfly motion estimation algorithm that combines a butterfly-shaped search pattern, a halfway-stop technique, and motion vector prediction. Compared with the diamond search algorithm, it is 43.26% to 80% faster and yields better picture quality. A VLSI architecture for the algorithm is then built from tree-structured adders and parallel on-chip memories. Two data mapping methods for the parallel memories, Latin square mapping and 4 × 4 block mapping, let the architecture cope with the irregular data access of fast search algorithms while also saving external memory bandwidth. With a 27 MHz system clock and a 16-bit data bus, the required external memory bandwidth is only 4.57 Mbit/s. Compared with other hardware implementations, the architecture uses fewer processing elements (PEs) and smaller buffers, yet achieves higher search speed and greater flexibility.
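The halfway-stop idea mentioned in the abstract can be illustrated with a minimal sketch. It assumes that "halfway-stop" means abandoning the sum of absolute differences (SAD) for a candidate block as soon as the partial sum can no longer beat the best match found so far; the abstract does not spell out the exact rule, so this is an interpretation rather than the paper's method. The function names, the row-by-row check, and the caller-supplied `candidates` list are hypothetical. The butterfly-shaped search pattern itself is not given in the abstract, so no pattern coordinates are assumed here.

```python
def sad_with_halfway_stop(cur, ref, best_so_far):
    """Accumulate the SAD row by row.

    Returns None as soon as the partial sum reaches best_so_far
    (assumed meaning of "halfway-stop"), otherwise the full SAD.
    """
    total = 0
    for cur_row, ref_row in zip(cur, ref):
        total += sum(abs(a - b) for a, b in zip(cur_row, ref_row))
        if total >= best_so_far:
            return None  # this candidate cannot beat the current best
    return total


def best_match(cur, frame, x0, y0, candidates):
    """Evaluate candidate motion vectors (dx, dy) around (x0, y0).

    cur        : square block from the current frame (list of rows)
    frame      : reference frame (list of rows)
    candidates : offsets to test; supplied by the caller, NOT the
                 paper's butterfly pattern
    Returns (best_mv, best_sad).
    """
    n = len(cur)
    best_mv, best_sad = None, float("inf")
    for dx, dy in candidates:
        x, y = x0 + dx, y0 + dy
        # Skip candidates whose block would fall outside the frame.
        if x < 0 or y < 0 or y + n > len(frame) or x + n > len(frame[0]):
            continue
        ref = [row[x:x + n] for row in frame[y:y + n]]
        sad = sad_with_halfway_stop(cur, ref, best_sad)
        if sad is not None:
            best_mv, best_sad = (dx, dy), sad
    return best_mv, best_sad
```

Once a good match has been found, most later candidates are rejected after only one or two rows, which is where a search of this kind would save work.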
Classification number: TN919.8 [Electronics and Telecommunications: Communication and Information Systems]