HDFS Improvement Using Shortest Path Algorithms

被引:0
|
作者
Eddoujaji, Mohamed [1 ]
Samadi, Hassan [2 ]
Bouhorma, Mohammed [3 ]
机构
[1] Abdelamlek ESSADI Univ, Natl Sch Appl Sci, Doctoral Studies Res Ctr Engn Sci & Technol, Tangier, Morocco
[2] Abdelamlek ESSADI Univ, Natl Sch Appl Sci, Doctoral Studies Res Ctr, Tangier, Morocco
[3] Abdelamlek ESSADI Univ, Doctoral Studies Res Ctr, Fac Sci & Tech, Tangier, Morocco
来源
EMERGING TRENDS IN INTELLIGENT SYSTEMS & NETWORK SECURITY | 2023年 / 147卷
关键词
Hadoop; HDFS; Algorithms; Small files; Dijkstra; Shortest path;
D O I
10.1007/978-3-031-15191-0_25
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the previous version of the article "data processing on distributed systems: storage challenge" we presented a new approach for the storage, management and exploitation of distributed data in the form of small files, such as the case of message exchanges real-time localization in port activity. During this approach, we managed to optimize more than 30% of information management using the classic HADOOP/YARN/HDFS architecture [11]. Considering that in aHADOOPecosystem with several data processing nodes, access to the right node, containing the desired data, in the optimal time presents a major challenge and very important research avenues for researchers and scientists [15]. In this paper, we will see together that the marriage between mathematical algorithms and computer magic can give us very encouraging and very important results. Indeed, one of the principle that manifests itself is the theory of graphs, especially the calculation of the shortest path to optimally reach the data on a few nodes in an architecture of a few hundred nodes or even thousands [16]. After several research and comparison, Dijkstra's algorithm is the chosen algorithm for calculating the shortest path in a HADOOP/HDFS system.
引用
收藏
页码:253 / 269
页数:17
相关论文
共 50 条