Data storage using peptide sequences

被引:26
|
作者
Ng, Cheuk Chi A. [1 ,2 ,3 ,4 ]
Tam, Wai Man [5 ]
Yin, Haidi [1 ,2 ,3 ,4 ]
Wu, Qian [1 ,2 ,3 ,4 ]
So, Pui-Kin [6 ]
Wong, Melody Yee-Man [7 ]
Lau, Francis C. M. [5 ]
Yao, Zhong-Ping [1 ,2 ,3 ,4 ]
机构
[1] Hong Kong Polytech Univ, Res Inst Future Food, State Key Lab Chem Biol & Drug Discovery, Hung Hom,Kowloon, Hong Kong, Peoples R China
[2] Hong Kong Polytech Univ, Dept Appl Biol & Chem Technol, Hung Hom, Kowloon, Hong Kong, Peoples R China
[3] Hong Kong Polytech Univ, Shenzhen Res Inst, State Key Lab Chinese Med & Mol Pharmacol Incubat, Shenzhen, Guangdong, Peoples R China
[4] Hong Kong Polytech Univ, Shenzhen Res Inst, Shenzhen Key Lab Food Biol Safety Control, Shenzhen, Guangdong, Peoples R China
[5] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Hung Hom, Kowloon, Hong Kong, Peoples R China
[6] Hong Kong Polytech Univ, Univ Res Facil Life Sci, Hung Hom, Kowloon, Hong Kong, Peoples R China
[7] Hong Kong Polytech Univ, Univ Res Facil Chem & Environm Anal, Hung Hom, Kowloon, Hong Kong, Peoples R China
关键词
DIGITAL INFORMATION; SENSITIVE ANALYSIS; MASS-SPECTROMETRY; DNA; CAPACITY; ROBUST;
D O I
10.1038/s41467-021-24496-9
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Humankind is generating digital data at an exponential rate. These data are typically stored using electronic, magnetic or optical devices, which require large physical spaces and cannot last for a very long time. Here we report the use of peptide sequences for data storage, which can be durable and of high storage density. With the selection of suitable constitutive amino acids, designs of address codes and error-correction schemes to protect the order and integrity of the stored data, optimization of the analytical protocol and development of a software to effectively recover peptide sequences from the tandem mass spectra, we demonstrated the feasibility of this method by successfully storing and retrieving a text file and the music file Silent Night with 40 and 511 18-mer peptides respectively. This method for the first time links data storage with the peptide synthesis industry and proteomics techniques, and is expected to stimulate the development of relevant fields. Finding durable, high-density media for data storage is necessary to support the ever-expanding generation of digital data. Here, the authors use peptide sequences to store digital data and retrieve them using tandem mass spectrometry, proving that peptides can be used as a storage medium.
引用
收藏
页数:10
相关论文
共 50 条
  • [11] Holographic data storage using photopolymer
    Hsu, KY
    Lin, SH
    Whang, WT
    Chen, WZ
    PHOTOREFRACTIVE FIBER AND CRYSTAL DEVICES: MATERIALS, OPTICAL PROPERTIES, AND APPLICATIONS V, 1999, 3801 : 66 - 74
  • [12] Holographic data storage using bacteriorhodopsin
    Gary, C
    Timucin, D
    ODS - 1997 OPTICAL DATA STORAGE TOPICAL MEETING, CONFERENCE DIGEST, 1997, : 62 - 63
  • [13] A Motif Detection and Classification Method for Peptide Sequences Using Genetic Programming
    Tomita, Yasuyuki
    Kato, Ryuji
    Okochi, Mina
    Honda, Hiroyuki
    JOURNAL OF BIOSCIENCE AND BIOENGINEERING, 2008, 106 (02) : 154 - 161
  • [14] Automated Detection of Conformational Epitopes Using Phage Display Peptide Sequences
    Negi, Surendra S.
    Braun, Werner
    BIOINFORMATICS AND BIOLOGY INSIGHTS, 2009, 3 : 71 - 81
  • [15] Identification of peptide sequences that target to the brain using in vivo phage display
    Li, Jingwei
    Zhang, Qizhi
    Pang, Zhiqing
    Wang, Yuchen
    Liu, Qingfeng
    Guo, Liangran
    Jiang, Xinguo
    AMINO ACIDS, 2012, 42 (06) : 2373 - 2381
  • [16] Identification of peptide sequences that target to the brain using in vivo phage display
    Jingwei Li
    Qizhi Zhang
    Zhiqing Pang
    Yuchen Wang
    Qingfeng Liu
    Liangran Guo
    Xinguo Jiang
    Amino Acids, 2012, 42 : 2373 - 2381
  • [17] Quantitative prediction of MHC-II peptide binding affinity using global description of peptide sequences
    Zhang, Wen
    Liu, Juan
    Niu, Yanqing
    Wang, Lian
    Zhang, Zhi
    BMEI 2008: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOL 1, 2008, : 352 - +
  • [18] Simulating cropping sequences using earth observation data
    Sharp, Ryan T.
    Henrys, Peter A.
    Jarvis, Susan G.
    Whitmore, Andrew P.
    Milne, Alice E.
    Coleman, Kevin
    Mohankumar, Sajeev Erangu Purath
    Metcalfe, Helen
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2021, 188
  • [19] Classification of biological sequences by using a Data Mining approach
    Maddouri, M
    Elloumi, M
    METMBS'01: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MATHEMATICS AND ENGINEERING TECHNIQUES IN MEDICINE AND BIOLOGICAL SCIENCES, 2001, : 54 - 60
  • [20] Mining Repetitive Sequences Using A Big Data Ecosystem
    Phinney, Michael
    Cao, Hongfei
    Dhroso, Andi
    Shyu, Chi-Ren
    2013 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2013,