Block Storage Optimization and Parallel Data Processing and Analysis of Product Big Data Based on the Hadoop Platform

被引:1
|
作者
Wang, Yajun [1 ]
Cheng, Shengming [1 ]
Zhang, Xinchen [1 ]
Leng, Junyu [1 ]
Liu, Jun [1 ]
机构
[1] Dalian Polytech Univ, Sch Mech Engn & Automat, Dalian 116034, Peoples R China
关键词
ENTROPY;
D O I
10.1155/2021/3839800
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The traditional distributed database storage architecture has the problems of low efficiency and storage capacity in managing data resources of seafood products. We reviewed various storage and retrieval technologies for the big data resources. A block storage layout optimization method based on the Hadoop platform and a parallel data processing and analysis method based on the MapReduce model are proposed. A multireplica consistent hashing algorithm based on data correlation and spatial and temporal properties is used in the parallel data processing and analysis method. The data distribution strategy and block size adjustment are studied based on the Hadoop platform. A multidata source parallel join query algorithm and a multi-channel data fusion feature extraction algorithm based on data-optimized storage are designed for the big data resources of seafood products according to the MapReduce parallel frame work. Practical verification shows that the storage optimization and data-retrieval methods provide supports for constructing a big data resource-management platform for seafood products and realize efficient organization and management of the big data resources of seafood products. The execution time of multidata source parallel retrieval is only 32% of the time of the standard Hadoop scheme, and the execution time of the multichannel data fusion feature extraction algorithm is only 35% of the time of the standard Hadoop scheme.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Design and Implementation of Meteorological Big Data Platform Based on Hadoop and Elasticsearch
    Yin, He
    Deng Fengdong
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA), 2019, : 705 - 710
  • [32] A Hadoop/MapReduce based platform for supporting health big data analytics
    Kuo A.
    Chrimes D.
    Qin P.
    Zamani H.
    Studies in Health Technology and Informatics, 2019, 257 : 229 - 235
  • [33] Development and Application of Personal Hadoop-Based Big Data Platform
    Wu G.
    Lin F.
    Chang W.-Y.
    Tsai W.-F.
    Lin S.-C.
    Yang C.-T.
    Journal of the Chinese Institute of Civil and Hydraulic Engineering, 2018, 30 (02): : 107 - 120
  • [34] Hadoop and Spark for Data Management, Processing and Analysis of Astronomical Big Data: Applicability and Performance
    Harischandra, Lloyd
    ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XXV, 2017, 512 : 41 - 44
  • [35] Data Storage Adapter in Big Data Platform
    Minh Chau Nguyen
    Won, Hee Sun
    2015 8TH INTERNATIONAL CONFERENCE ON DATABASE THEORY AND APPLICATION (DTA), 2015, : 6 - 9
  • [36] Online Data Processing on Cloud and Hadoop Platform
    Akhtar, Ayesha
    Shakir, Muhammad Sohaib
    2017 FOURTH HCT INFORMATION TECHNOLOGY TRENDS (ITT), 2017, : 25 - 29
  • [37] The Hadoop Technology Applies in Power Big Data Platform
    Hu, Jianyong
    Chen, Jilin
    Xie, Mei
    Gao, Bo
    Yu, Zhihong
    Yan, Jianfeng
    Lv, Ying
    PROCEEDINGS OF THE 2017 2ND INTERNATIONAL CONFERENCE ON AUTOMATION, MECHANICAL AND ELECTRICAL ENGINEERING (AMEE 2017), 2017, 87 : 113 - 116
  • [38] Performance Challenges and Solutions in Big Data Platform Hadoop
    Singh B.
    Verma H.K.
    Madaan V.
    Recent Advances in Computer Science and Communications, 2023, 16 (09)
  • [39] Performance Modeling and Analysis of a Hadoop Cluster for Efficient Big Data Processing
    Lim, JongBeom
    Ahnh, Jong-Suk
    Lee, Kang-Woo
    ADVANCED SCIENCE LETTERS, 2016, 22 (09) : 2314 - 2319
  • [40] PDM: A parallel data analysis system based on Hadoop
    Duan, Song-Qing
    Wu, Bin
    Yu, Le
    Wang, Bai
    Hunan Daxue Xuebao/Journal of Hunan University Natural Sciences, 2012, 39 (10): : 87 - 92