Construction of Tourism Attraction Knowledge Graph Based on Web Text and Transfer Learning

Authors
Gao J. [1, 2]
Lu F. [1, 2, 3, 4]
Peng P. [1]
Xu Y. [1, 2]
Affiliations
[1] State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing
[2] College of Resources and Environment, University of Chinese Academy of Sciences, Beijing
[3] Fujian Collaborative Innovation Center for Big Data Applications in Governments, Fuzhou
[4] Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application, Nanjing
Funding
National Natural Science Foundation of China
Keywords
knowledge graph; tourism management; transfer learning; web text mining
DOI
10.13203/j.whugis20220120
Abstract
Objectives: The rapid development of information and communication technology has driven the growth of online tourism services and produced massive volumes of web text, which offers a new opportunity for tourism sector planning and personalized recommendation. However, because web text is semantically vague and has a low signal-to-noise ratio, it is difficult to use directly. How to integrate knowledge engineering, natural language processing, and machine learning to form a formalized domain knowledge graph from abundant tourism text has therefore attracted much attention.
Methods: This paper proposes a tourism knowledge graph construction method based on a tourism domain ontology and transfer learning. First, an ontology of tourist attractions is defined from domain specifications and standards, which supports a comprehensive and systematic description of the semantic characteristics of attractions. Second, transfer learning is used to turn a pre-trained language model into a customized knowledge extractor that accurately acquires knowledge triples from web text. These triples are then integrated with scattered tourism-related information, including tourist check-ins and POI (point of interest) attributes, to build a systematic knowledge graph.
Results: Experimental results show that, compared with the common LDA (latent Dirichlet allocation) model, the proposed knowledge extractor improves the accuracy (average area under the curve) and integrity (the number of semantic characteristics) of semantic knowledge acquisition by 50.7% and 670%, respectively. The constructed knowledge graph of tourist attractions contains 77 039 entities, 16 types of relationships, and 10 971 810 triples in total.
Conclusions: By organizing knowledge uniformly as triples, the study fuses and integrates multi-source heterogeneous tourism data and addresses the potential systemic risk of basing decisions on a single data source. The authors argue that the constructed knowledge graph can fully capture the real tourism scene, support in-depth analysis of tourist behaviors and demands at different scales and granularities, and provide decision support for the sustainable development of tourist destinations. © 2022 Wuhan University. All rights reserved.
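The triple-based fusion described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `Triple` type, the helper functions, and every entity and relation name below (such as `hasFeature` and `tourist_001`) are hypothetical and chosen only to show how facts from web text, POI attributes, and check-ins could share one (head, relation, tail) form.

```python
from collections import defaultdict
from typing import NamedTuple


class Triple(NamedTuple):
    """One knowledge-graph fact: (head entity, relation, tail entity)."""
    head: str
    relation: str
    tail: str


# Hypothetical sample facts, one list per data source named in the abstract.
text_triples = [Triple("West Lake", "hasFeature", "lake scenery")]
poi_triples = [
    Triple("West Lake", "locatedIn", "Hangzhou"),
    Triple("West Lake", "hasFeature", "lake scenery"),  # duplicate of a text fact
]
checkin_triples = [Triple("tourist_001", "visited", "West Lake")]


def merge_triples(*sources):
    """Union triples from several sources, dropping exact duplicates
    while keeping first-seen order."""
    seen = dict.fromkeys(t for source in sources for t in source)
    return list(seen)


def index_by_head(triples):
    """Group (relation, tail) pairs under each head entity for lookup."""
    index = defaultdict(list)
    for t in triples:
        index[t.head].append((t.relation, t.tail))
    return dict(index)


graph = merge_triples(text_triples, poi_triples, checkin_triples)
by_head = index_by_head(graph)
```

Because every source is reduced to the same three-field record, merging is a plain set union, and a query such as `by_head["West Lake"]` returns facts regardless of which source supplied them.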
Pages: 1191-1200, 1219