An optimized intelligent open-source MLaaS framework for user-friendly clustering and anomaly detection

被引:1
|
作者
Eldahshan, Kamal A. [1 ]
Abutaleb, Gaber E. [1 ]
Elemary, Berihan R. [2 ]
Ebeid, Ebeid A. [1 ]
Alhabshy, AbdAllah A. [1 ]
机构
[1] Al Azhar Univ, Fac Sci, Math Dept, Cairo 11511, Egypt
[2] Damietta Univ, Fac Commerce, Dept Appl Math & Actuarial Stat, Dumyat 34511, Egypt
来源
JOURNAL OF SUPERCOMPUTING | 2024年 / 80卷 / 18期
关键词
Machine learning as a service; Unsupervised machine learning; Business analysis; Clustering; Anomaly detection; Fraud detection; ALGORITHM;
D O I
10.1007/s11227-024-06420-2
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As data grow exponentially, the demand for advanced intelligent solutions has become increasingly urgent. Unfortunately, not all businesses have the expertise to utilize machine learning algorithms effectively. To bridge this gap, the present paper introduces a cost-effective, user-friendly, dependable, adaptable, and scalable solution for visualizing, analyzing, processing, and extracting valuable insights from data. The proposed solution is an optimized open-source unsupervised machine learning as a service (MLaaS) framework that caters to both experts and non-experts in machine learning. The framework aims to assist companies and organizations in solving problems related to clustering and anomaly detection, even without prior experience or internal infrastructure. With a focus on several clustering and anomaly detection techniques, the proposed framework automates data processing while allowing user intervention. The proposed framework includes default algorithms for clustering and outlier detection. In the clustering category, it features three algorithms: k-means, hierarchical clustering, and DBScan clustering. For outlier detection, it includes local outlier factor, K-nearest neighbors, and Gaussian mixture model. Furthermore, the proposed solution is expandable; it may include additional algorithms. It is versatile and capable of handling diverse datasets by generating separate rapid artificial intelligence models for each dataset and facilitating their comparison rapidly. The proposed framework provides a solution through a representational state transfer application programming interface, enabling seamless integration with various systems. Real-world testing of the proposed framework on customer segmentation and fraud detection data demonstrates that it is reliable, efficient, cost-effective, and time-saving. With the innovative MLaaS framework, companies may harness the full potential of business analysis.
引用
收藏
页码:26658 / 26684
页数:27
相关论文
共 50 条
  • [31] Linien: A versatile, user-friendly, open-source FPGA-based tool for frequency stabilization and spectroscopy parameter optimization
    Wiegand, B.
    Leykauf, B.
    Joerdens, R.
    Krutzik, M.
    REVIEW OF SCIENTIFIC INSTRUMENTS, 2022, 93 (06):
  • [32] Nelly: A User-Friendly and Open-Source Implementation of Tree-Based Complex Refractive Index Analysis for Terahertz Spectroscopy
    Tayvah, Uriel
    Spies, Jacob A.
    Neu, Jens
    Schmuttenmaer, Charles A.
    ANALYTICAL CHEMISTRY, 2021, 93 (32) : 11243 - 11250
  • [33] Predictive Framework Development for User-Friendly On-Site Glucose Detection
    Kishnani, Vinay
    Gupta, Ankur
    ACS APPLIED BIO MATERIALS, 2023, 6 (10) : 4336 - 4344
  • [34] On-the-Fly Audio Source Separation-A Novel User-Friendly Framework
    El Badawy, Dalia
    Duong, Ngoc Q. K.
    Ozerov, Alexey
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (02) : 261 - 272
  • [35] Forum 4.0: An Open-Source User Comment Analysis Framework
    Haering, Mario
    Andersen, Jakob Smedegaard
    Biemann, Chris
    Loosen, Wiebke
    Milde, Benjamin
    Pietz, Tim
    Stoecker, Christian
    Wiedemann, Gregor
    Zukunft, Olaf
    Maalej, Walid
    EACL 2021: THE 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: PROCEEDINGS OF THE SYSTEM DEMONSTRATIONS, 2021, : 63 - 70
  • [36] ChimericSeq: An open-source, user-friendly interface for analyzing NGS data to identify and characterize viral-host chimeric sequences
    Shieh, Fwu-Shan
    Jongeneel, Patrick
    Steffen, Jamin D.
    Lin, Selena
    Jain, Surbhi
    Song, Wei
    Su, Ying-Hsiu
    PLOS ONE, 2017, 12 (08):
  • [37] GenomeGraphR: A user-friendly open-source web application for foodborne pathogen whole genome sequencing data integration, analysis, and visualization
    Sanaa, Moez
    Pouillot, Regis
    Vega, Francisco Garces
    Strain, Errol
    van Doren, Jane M.
    PLOS ONE, 2019, 14 (02):
  • [38] An Index for User-Friendly Proximal Detection of Water Requirements to Optimized Irrigation Management in Vineyards
    Fernandes de Oliveira, Ana
    Mameli, Massimiliano Giuseppe
    Lo Cascio, Mauro
    Sirca, Costantino
    Satta, Daniela
    AGRONOMY-BASEL, 2021, 11 (02):
  • [39] EvoCluster: An Open-Source Nature-Inspired Optimization Clustering Framework
    Qaddoura R.
    Faris H.
    Aljarah I.
    Castillo P.A.
    SN Computer Science, 2021, 2 (3)
  • [40] A Framework to Represent Antecedents of User Interest in Open-Source Software Projects
    Ghapanchi, Amir Hossein
    BUSINESS TRANSFORMATION THROUGH INNOVATION AND KNOWLEDGE MANAGEMENT: AN ACADEMIC PERSPECTIVE, VOLS 1-2, 2010, : 542 - 553