Visiting Time Prediction Using Machine Learning Regression Algorithm

被引:0
|
作者
Hapsari, Indri [1 ]
Surjandari, Isti [1 ]
Komarudin [1 ]
机构
[1] Univ Indonesia, Ind Engn, Depok, Indonesia
来源
2018 6TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICOICT) | 2018年
关键词
visiting time; clustering; classification; machine learning; regression;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Smart tourists cannot be separated with mobile technology. With the gadget, tourist can find information about the destination, or supporting information like transportation, hotel, weather and exchange rate. They need prediction of traveling and visiting time, to arrange their journey. If traveling time has predicted accurately by Google Map using the location feature, visiting time has another issue. Until today, Google detects the user's position based on crowdsourcing data from customer visits to a specific location over the last several weeks. It cannot be denied that this method will give a valid information for the tourists. However, because it needs a lot of data, there are many destinations that have no information about visiting time. From the case study that we used, there are 626 destinations in East Java, Indonesia, and from that amount only 224 destinations or 35.78% has the visiting time. To complete the information and help tourists, this research developed the prediction model for visiting time. For the first data is tested statistically to make sure the model development was using the right method. Multiple linear regression become the common model, because there are six factors that influenced the visiting time, i.e. access, government, rating, number of reviews, number of pictures, and other information. Those factors become the independent variables to predict dependent variable or visiting time. From normality test as the linear regression requirement, the significant value was less than p that means the data cannot pass the statistic test, even though we transformed the data based on the skewness. Because of three of them are ordinal data and the others are interval data, we tried to exclude and include the ordinal by transform it to interval. We also used the Ordinal Logistic Regression by transform the interval data in dependent variable into ordinal data using Expectation Maximization, one of clustering algorithm in machine learning, but the model still did not fit even though we used 5 functions. Then we used the classification algorithm in machine learning by using 5 top algorithm which are Linear Regression, k-Nearest Neighbors, Decision Tree, Support Vector Machines, and Multi-Layer Perceptron. Based on maximum correlation coefficient and minimum root mean square error, Linear Regression with 6 independent variables has the best result with the correlation coefficient 20.41% and root mean square error 48.46%. We also compared with model using 3 independent variable, the best algorithm was still the same but with less performance. Then, the model was loaded to predict the visiting time for other 402 destinations.
引用
收藏
页码:495 / 500
页数:6
相关论文
共 50 条
  • [41] Advancing Breast Cancer Prediction using Logistic Regression and Machine Learning Techniques
    Bhuria, Ruchika
    Gill, Kanwarpartap Singh
    Malhotra, Sonal
    Singh, Mukesh
    2ND INTERNATIONAL CONFERENCE ON SUSTAINABLE COMPUTING AND SMART SYSTEMS, ICSCSS 2024, 2024, : 1374 - 1377
  • [42] Prediction of the natural frequencies of various beams using regression machine learning models
    Das, Oguzhan
    SIGMA JOURNAL OF ENGINEERING AND NATURAL SCIENCES-SIGMA MUHENDISLIK VE FEN BILIMLERI DERGISI, 2023, 41 (02): : 302 - 321
  • [43] Crop Pests Prediction Method using Regression and Machine Learning Technology: Survey
    Kim, Yun Hwan
    Yoo, Seong Joon
    Gu, Yeong Hyeon
    Lim, Jin Hee
    Han, Dongil
    Baik, Sung Wook
    2013 INTERNATIONAL CONFERENCE ON FUTURE SOFTWARE ENGINEERING AND MULTIMEDIA ENGINEERING (ICFM 2013), 2014, 6 : 52 - 56
  • [44] Evaluation and prediction of groundwater quality for irrigation using regression and machine learning models
    Shaw, Souvick Kumar
    Sharma, Anurag
    WATER QUALITY RESEARCH JOURNAL, 2025, 60 (01) : 260 - 297
  • [45] Bandgap prediction of metal halide perovskites using regression machine learning models
    Vakharia, V.
    Castelli, Ivano E.
    Bhavsar, Keval
    Solanki, Ankur
    PHYSICS LETTERS A, 2022, 422
  • [46] Bandgap prediction of metal halide perovskites using regression machine learning models
    Vakharia, V.
    Castelli, Ivano E.
    Bhavsar, Keval
    Solanki, Ankur
    Physics Letters, Section A: General, Atomic and Solid State Physics, 2022, 422
  • [47] Prediction of preterm birth in nulliparous women using logistic regression and machine learning
    Belaghi, Reza Arabi
    Beyene, Joseph
    McDonald, Sarah D.
    PLOS ONE, 2021, 16 (06):
  • [48] Machine Learning Fails To Improve Marathon Time Prediction Compared To Multiple Linear Regression
    Foreman, Nicholas
    Hesse, Anton
    Lundstrom, Chris
    MEDICINE AND SCIENCE IN SPORTS AND EXERCISE, 2021, 53 (08): : 49 - 49
  • [49] Sensitive time series prediction using extreme learning machine
    Hong-Bo Wang
    Xi Liu
    Peng Song
    Xu-Yan Tu
    International Journal of Machine Learning and Cybernetics, 2019, 10 : 3371 - 3386
  • [50] Sensitive time series prediction using extreme learning machine
    Wang, Hong-Bo
    Liu, Xi
    Song, Peng
    Tu, Xu-Yan
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2019, 10 (12) : 3371 - 3386