An Information Theory for Out-of-Order Media With Applications in DNA Data Storage

被引:0
|
作者
Ravi, Aditya Narayan [1 ]
Vahid, Alireza [2 ]
Shomorony, Ilan [1 ]
机构
[1] Univ Illinois, Elect & Comp Engn Dept, Urbana, IL 61801 USA
[2] Rochester Inst Technol, Elect & Microelect Engn Dept, Rochester, NY 14623 USA
基金
美国国家科学基金会;
关键词
DNA; Sequential analysis; Out of order; Encoding; Codes; Channel capacity; Symbols; Biological information theory; Channel coding; Decoding; DIGITAL INFORMATION; CAPACITY; CODES; CHANNEL; RECONSTRUCTION; ROBUST; BOUNDS;
D O I
10.1109/TMBMC.2024.3403759
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recent advancements in DNA-based storage prototypes focus on encoding information across multiple DNA molecules. This approach utilizes high-throughput sequencing technologies, leading to outputs that are out-of-order. We study the shuffling channel, where input codewords are split into fixed-size fragments. We show that achieving channel capacity uses index-based coding, which assigns unique indices to each fragment. We also introduce two more complex channels, which aim to model popular sequencing strategies in DNA sequencing. In the torn-paper channel, the input codeword is torn up into fragments of random sizes, while in the shotgun sequencing channel, fixed-length random substrings of the input codeword are observed at the output. In both of these channels, the lack of ordering cannot be circumvented by simply adding unique indices to the fragments. We show how the capacity of both of these channels can be achieved using random codes. We introduce and analyze code constructions based on index sequences. While these codes are computationally efficient, they are not capacity-achieving, and we leave the questions of finding efficient capacity-achieving codes for these settings as open problems.
引用
收藏
页码:334 / 348
页数:15
相关论文
共 50 条
  • [21] BeaconGNN: Large-Scale GNN Acceleration with Out-of-Order Streaming In-Storage Computing
    Wang, Yuyue
    Pan, Xiurui
    An, Yuda
    Zhang, Jie
    Reinman, Glenn
    2024 IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, HPCA 2024, 2024, : 330 - 344
  • [22] Real-Time Centralized and Decentralized Out-of-Order Data Transfer Scheduling Techniques
    Andreica, Mugurel Ionut
    Dragomir, Eduard-Marius
    Tapus, Nicolae
    9TH ROEDUNET IEEE INTERNATIONAL CONFERENCE, 2010, : 228 - 233
  • [23] Algorithm of Handling Out-of-Order Delivery for Multithreaded UDP-based Data Transport
    Syzov, Dmytro
    Kachan, Dmitry
    Siemens, Eduard
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON APPLIED INNOVATIONS IN IT, 2017, 5 : 17 - 23
  • [24] DSSP: Stream Split Processing Model for High Correctness of Out-of-Order Data Processing
    Sun, Donghan
    Hwang, Soochan
    2018 IEEE FIRST INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND KNOWLEDGE ENGINEERING (AIKE), 2018, : 193 - 197
  • [25] Quality-Driven Continuous Query Execution over Out-of-Order Data Streams
    Ji, Yuanzhen
    Zhou, Hongjin
    Jerzak, Zbigniew
    Nica, Anisoara
    Hackenbroich, Gregor
    Fetzer, Christof
    SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, : 889 - 894
  • [26] Space-efficient Online Approximation of Time Series Data: Streams, Amnesia, and Out-of-order
    Gandhi, Sorabh
    Foschini, Luca
    Suri, Subhash
    26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING ICDE 2010, 2010, : 924 - 935
  • [27] Separation or Not: On Handing Out-of-Order Time-Series Data in Leveled LSM-Tree
    Kang, Yuyuan
    Huang, Xiangdong
    Song, Shaoxu
    Zhang, Lingzhe
    Qiao, Jialin
    Wang, Chen
    Wang, Jianmin
    Feinauer, Julian
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 3340 - 3352
  • [28] Next Generation Information Storage Technology - DNA Data Storage
    Liu, Shuguang
    Ye, Zhenxing
    Chen, Maolong
    ICMLC 2020: 2020 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2018, : 213 - 217
  • [29] Distributed Low-Latency Out-of-Order Event Processing for High Data Rate Sensor Streams
    Mutschler, Christopher
    Philippsen, Michael
    IEEE 27TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2013), 2013, : 1133 - 1144
  • [30] A method for detecting complex events over out-of-order RFID (radio frequency identification) data streams
    Liu, Hailong
    Li, Zhanhuai
    Chen, Qun
    Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University, 2009, 27 (04): : 449 - 454