Block Storage Optimization and Parallel Data Processing and Analysis of Product Big Data Based on the Hadoop Platform

被引:1
|
作者
Wang, Yajun [1 ]
Cheng, Shengming [1 ]
Zhang, Xinchen [1 ]
Leng, Junyu [1 ]
Liu, Jun [1 ]
机构
[1] Dalian Polytech Univ, Sch Mech Engn & Automat, Dalian 116034, Peoples R China
关键词
ENTROPY;
D O I
10.1155/2021/3839800
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The traditional distributed database storage architecture has the problems of low efficiency and storage capacity in managing data resources of seafood products. We reviewed various storage and retrieval technologies for the big data resources. A block storage layout optimization method based on the Hadoop platform and a parallel data processing and analysis method based on the MapReduce model are proposed. A multireplica consistent hashing algorithm based on data correlation and spatial and temporal properties is used in the parallel data processing and analysis method. The data distribution strategy and block size adjustment are studied based on the Hadoop platform. A multidata source parallel join query algorithm and a multi-channel data fusion feature extraction algorithm based on data-optimized storage are designed for the big data resources of seafood products according to the MapReduce parallel frame work. Practical verification shows that the storage optimization and data-retrieval methods provide supports for constructing a big data resource-management platform for seafood products and realize efficient organization and management of the big data resources of seafood products. The execution time of multidata source parallel retrieval is only 32% of the time of the standard Hadoop scheme, and the execution time of the multichannel data fusion feature extraction algorithm is only 35% of the time of the standard Hadoop scheme.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Analysis of Big Data Storage Tools for Data Lakes based on Apache Hadoop Platform
    Belov, Vladimir
    Nikulchev, Evgeny
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (08) : 551 - 557
  • [2] Optimization of Management and Processing of Big Data on a Platform for Distributed Data Storage
    Nerić, Vedrana
    Sarajlić, Nermin
    Hadžić, Đulaga
    Elektrotehniski Vestnik/Electrotechnical Review, 2024, 91 (05): : 272 - 283
  • [3] Optimization of Management and Processing of Big Data on a Platform for Distributed Data Storage
    Neric, Vedrana
    Sarajlic, Nermin
    Hadzic, Dulaga
    ELEKTROTEHNISKI VESTNIK, 2024, 91 (05): : 272 - 283
  • [4] Huge Data Analysis and Processing Platform based on Hadoop
    Li, Yuanbin
    Chen, Rong
    PROCEEDINGS OF THE 2017 2ND INTERNATIONAL CONFERENCE ON MATERIALS SCIENCE, MACHINERY AND ENERGY ENGINEERING (MSMEE 2017), 2017, 123 : 267 - 271
  • [5] Storage optimization and parallel processing of condition monitoring big data of transmission and transforming equipment based on cloud platform
    Song, Yaqi
    Zhou, Guoliang
    Zhu, Yongli
    Li, Li
    Wang, Liuwang
    Wang, Dewen
    Zhongguo Dianji Gongcheng Xuebao/Proceedings of the Chinese Society of Electrical Engineering, 2015, 35 (02): : 255 - 267
  • [6] Research on Industry Data Analysis Model Based on Hadoop Big Data Platform
    Xu, Hongsheng
    Fan, Ganglong
    Li, Ke
    PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON EDUCATION, MANAGEMENT, INFORMATION AND COMPUTER SCIENCE (ICEMC 2017), 2017, 73 : 783 - 787
  • [7] Analysis of Big Data Platform with OpenStack and Hadoop
    Li, Xiaoyan
    Lu, Zhihui
    Wang, Nini
    Wu, Jie
    Huang, Shalin
    ADVANCES IN SERVICES COMPUTING, 2016, 10065 : 375 - 390
  • [8] Big data storage optimization and parallel processing technology for power equipment surveillance under cloud platform
    Li, Tianli
    JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2019, 19 (S1) : S277 - S284
  • [9] Processing and Analysis of Seismic data in Hadoop Platform
    Chen, Zhuang
    Zhang, Ti
    2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE & COMMUNICATION TECHNOLOGY (CICT), 2017,
  • [10] Performance optimization of computing task scheduling based on the Hadoop big data platform
    Li, Yang
    Hei, Xinhong
    NEURAL COMPUTING & APPLICATIONS, 2022,