A fast VVC intra-coding algorithm based on graph neural network and statistical analysis


Authors: LI Tiansong (黎天送), LIU Haokun (刘昊坤), CUI Shaoguo (崔少国), LIU Shucen (刘姝岑), CHEN Yan (陈艳) [1], WANG Hongkui (王鸿奎) (Chongqing Normal University, Chongqing 401331, China; Hangzhou Dianzi University, Hangzhou 310018, China)

Affiliations: [1] Chongqing Normal University, Chongqing 401331, China; [2] Hangzhou Dianzi University, Hangzhou, Zhejiang 310018, China

Source: Telecommunications Science (《电信科学》), 2024, No. 9, pp. 109-122 (14 pages)

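Funding: Natural Science Foundation Project of the Chongqing Science and Technology Bureau (No. CSTB2022NSCQ-MSX1231); Youth Project of the Chongqing Municipal Education Commission (No. KJQN202200519); Chongqing Normal University Talent Fund Project (No. 21XLB031).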

Abstract: Versatile video coding (VVC), the latest generation of video coding standard, further improves coding performance by introducing a variety of efficient coding tools. However, VVC adopts the quadtree plus multi-type tree (QTMT) partition structure and expands the number of intra prediction modes from 35 to 67, which sharply increases coding complexity. To reduce the intra-coding complexity of VVC, a fast intra coding unit (CU) partition algorithm based on a graph neural network is first proposed; it uses an efficient graph neural network model to directly predict the optimal partition mode of a CU, thereby skipping the redundant traversal of CU partitions. Second, a fast intra mode selection algorithm based on spatial correlation and texture features is proposed; it uses the average direction variance and the Sobel gradient operator to determine the texture direction and skip some angular prediction modes, and it exploits the correlation between prediction modes to prune the rate-distortion mode list. Experimental results show that the proposed algorithm saves 64.04% of encoding time at the cost of a 2.29% increase in Bjøntegaard delta bit rate (BDBR).

Keywords: versatile video coding; intra coding; coding unit partition; intra angular mode

Classification number: TP391 [Automation and Computer Technology: Computer Application Technology]
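
The abstract states that the texture direction of a CU is determined with the Sobel gradient operator so that some angular prediction modes can be skipped, but it does not give the decision rule. The following is only a minimal sketch of that general idea, not the authors' method: the function names `sobel_energy` and `candidate_angular_modes`, the threshold `ratio`, and the choice of which mode ranges to keep are all assumptions made for illustration. It relies only on the VVC mode numbering, in which modes 2 to 66 are angular, mode 18 is horizontal, mode 34 is diagonal, and mode 50 is vertical. The average direction variance step and the graph neural network are not modeled here.

```python
def sobel_energy(block):
    """Sum |Gx| and |Gy| over the interior pixels of a 2-D list of luma samples."""
    h, w = len(block), len(block[0])
    ex = ey = 0
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            # Horizontal Sobel response: intensity change along x.
            gx = (block[y - 1][x + 1] + 2 * block[y][x + 1] + block[y + 1][x + 1]
                  - block[y - 1][x - 1] - 2 * block[y][x - 1] - block[y + 1][x - 1])
            # Vertical Sobel response: intensity change along y.
            gy = (block[y + 1][x - 1] + 2 * block[y + 1][x] + block[y + 1][x + 1]
                  - block[y - 1][x - 1] - 2 * block[y - 1][x] - block[y - 1][x + 1])
            ex += abs(gx)
            ey += abs(gy)
    return ex, ey


def candidate_angular_modes(block, ratio=2.0):
    """Return the angular modes kept for rate-distortion checking.

    `ratio` is a hypothetical threshold, not a value from the paper.
    """
    ex, ey = sobel_energy(block)
    if ex > ratio * ey:
        # Change is mostly along x, so edges run vertically:
        # keep the vertical group (modes 34..66, centered on 50).
        return list(range(34, 67))
    if ey > ratio * ex:
        # Change is mostly along y, so edges run horizontally:
        # keep the horizontal group (modes 2..34, centered on 18).
        return list(range(2, 35))
    # No dominant direction: keep every angular mode.
    return list(range(2, 67))
```

In this sketch the planar (0) and DC (1) modes would always remain candidates, since they are not angular; a block of vertical stripes keeps mode 50 and drops mode 18, while a flat block keeps all 65 angular modes.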

 
