Highly reliable and efficient encoding systems for hexadecimal polypeptide-based data storage  被引量:2

在线阅读下载全文

作  者:Yubin Ren Yi Zhang Yawei Liu Qinglin Wu Hong-Gang Hu Jingjing Li Chunhai Fan Dong Chen Kai Liu Hongjie Zhang 

机构地区:[1]Department of Chemistry,Tsinghua University,Beijing 100084,China [2]State Key Laboratory of Rare Earth Resource Utilization,Changchun Institute of Applied Chemistry,Chinese Academy of Sciences,Changchun 130022,China [3]Institute of Process Equipment,College of Energy Engineering and State Key Laboratory of Fluid Power and Mechatronic Systems,Zhejiang University,Hangzhou 310027,China [4]Institute of Translational Medicine,Shanghai University,Shanghai 200444,China [5]Frontiers Science Center for Transformative Molecules,School of Chemistry and Chemical Engineering,and Institute of Molecular Medicine,Renji Hospital,School of Medicine,Shanghai Jiao Tong University,Shanghai 200240,China

出  处:《Fundamental Research》2023年第2期298-304,共7页自然科学基础研究(英文版)

基  金:supported by the National Key Research and Development Program of China (2018YFA0902600,2021YFF1200300,and 2020YFA0712102);the National Natural Science Foundation of China (21877104,21834007,22107097,21878258,22020102003,and 22125701);K.C.Wong Education Foundation (GJTD-2018-09);the Youth Innovation Promotion Association of CAS (2021226);the Zhejiang Provincial Natural Science Foundation of China (Y20B060027).

摘  要:Polypeptides consisting of amino acid(AA)sequences are suitable for high-density information storage.However,the lack of suitable encoding systems,which accommodate the characteristics of polypeptide synthesis,storage and sequencing,impedes the application of polypeptides for large-scale digital data storage.To address this,two reliable and highly efficient encoding systems,i.e.RaptorQ-Arithmetic-Base64-Shuffle-RS(RABSR)and RaptorQArithmetic-Huffman-Rotary-Shuffle-RS(RAHRSR)systems,are developed for polypeptide data storage.The two encoding systems realized the advantages of compressing data,correcting errors of AA chain loss,correcting errors within AA chains,eliminating homopolymers,and pseudo-randomized encrypting.The coding efficiency without arithmetic compression and error correction of audios,pictures and texts by the RABSR system was 3.20,3.12 and 3.53 Bits/AA,respectively.While that using the RAHRSR system reached 4.89,4.80 and 6.84 Bits/AA,respectively.When implemented with redundancy for error correction and arithmetic compression to reduce redundancy,the coding efficiency of audios,pictures and texts by the RABSR system was 4.43,4.36 and 5.22 Bits/AA,respectively.This efficiency further increased to 7.24,7.11 and 9.82 Bits/AA by the RAHRSR system,respectively.Therefore,the developed hexadecimal polypeptide-based systems may provide a new scenario for highly reliable and highly efficient data storage.

关 键 词:Biomaterial POLYPEPTIDE Data storage HEXADECIMAL Encoding system 

分 类 号:TP333[自动化与计算机技术—计算机系统结构]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象