H.264编码器的SSE2指令级优化  被引量:1

Instruction-level optimization of H.264 encoder using SSE2 instructions

在线阅读下载全文

作  者:王琰[1] 向校萱[1] 祁燕[1] 

机构地区:[1]沈阳理工大学信息科学与工程学院,沈阳110159

出  处:《计算机工程与应用》2012年第10期217-221,共5页Computer Engineering and Applications

基  金:辽宁省高等学校科技计划项目(No.1810162)

摘  要:H.264视频编码标准采用了很多新技术,具有更优越的编码效率,同时也增加了计算复杂度,无法满足实时应用。由于单指令多数据扩展指令集2(SSE2)的并行运算能力可以提高计算机对多媒体数据的实时处理。文中主要采用了SSE2对H.264中的一些耗时较多的关键模块,例如整数像素运动估计中计算SAD、整数DCT变换、量化、Hadamard变换以及亚像素运动估计中计算SATD进行了指令级优化。实验结果表明,经过优化后,在保持视频图像质量的前提下,相应模块运行速度得到了提高,使H.264编码器整体的编码速度较好地满足实时要求。H.264 video encoding standard adopts lots of new techniques. It has a significant performance benefit comparing with older standards in compression performance. However, it also has a considerable increase in encoder complexity, which limits the application it can be used for. For the Streaming SIMD Extensions 2(SSE2)instruction set which has the parallel computing power to improve the computer’s real-time processing of multimedia data. In this paper, instruction-level optimization of H.264 encoder is proposed by exploiting SSE2 instructions. The key time-consuming modules such as computing the sum of absolute difference(SAD)in integer pixel motion estimation, integer transform, quantization and computing the Hadamard transform of difference matrix and computing the sum of absolute transformed difference(SATD)in sub-pixel motion estimation. The experimental results show that the speed of corresponding module increases after being optimized while the same picture quality is achieved compared with the original encoder in H.264. The post-optimized coding speed of the encoder can satisfy the real-time requirement.

关 键 词:H.264编码 绝对误差和 整数DCT变换 变换绝对差值总和 单指令多数据扩展指令集2(SSE2) 

分 类 号:TN919.81[电子电信—通信与信息系统]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象