基于民航数据特性的重删固定长度分块方法  被引量:1

Deduplication fixed-length block method based on characteristics of civil aviation data

在线阅读下载全文

作  者:丁建立[1] 李慧 曹卫东[1] DING Jianli;LI Hui;CAO Weidong(College of Computer Science and Technology,CAUC,Tianjin 300300,China)

机构地区:[1]中国民航大学计算机科学与技术学院,天津300300

出  处:《中国民航大学学报》2022年第4期32-37,共6页Journal of Civil Aviation University of China

基  金:国家自然科学基金项目(U1833114);民航安全能力建设资金项目(SA2020280)。

摘  要:针对民航数据在容灾备份时存在存储数据重复的问题,提出一种基于民航数据特性的重删固定长度分块方法。该方法根据民航数据类型的一致性,结合固定长度分块与可变长度分块的优势,设计了一种分块策略索引表的数据结构,为同种类型的数据提供分块策略,节省了分块时寻找数据块边界的时间,将备份时重复数据的模拟重删率提高到97.8%~99.3%,比固定长度分块方法高11.8%~12.5%,比可变长度分块方法高2.5%~3.0%;同时,为新的数据类型建立新的分块策略,便于后续数据流匹配,提高命中精度。Aiming at the problem of data duplication in the storage of civil aviation data during disaster recovery and backup.A deduplication fixed-length block method based on the characteristics of civil aviation data is proposed.According to the consistency of civil aviation data types,this method combines the advantages of fixed-length block and variable-length block,and designs a data structure of the block strategy index table.A block strategy is provided for the same type of data,and time is saved to find the boundary of the data block during block division,the simulated data deduplication rate during backup increases to 97.8%~99.3%,which is 11.8%~12.5%higher than that of the fixed-length block method,and 2.5%~3.0%higher than that of the variable length block method;at the same time,a new block strategy is established for new data types to facilitate subsequent data stream matching and improve hit accuracy.

关 键 词:民航数据 容灾备份 重复数据删除 类型一致性 分块策略 模拟重删率 

分 类 号:V351.392[航空宇航科学与技术—人机与环境工程] TP392[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象