Detection for Approximately Duplicate Records Based on Fuzzy Comprehensive Evaluation

Authors
Zhou, Lijuan [1]
Xiao, Zhe [1]
Affiliation
[1] Hunan Univ Technol, Coll Sci & Technol, Zhuzhou 412008, Hunan, Peoples R China
Keywords
Approximately Duplicate Records; Attribute Weight; Fuzzy Comprehensive Evaluation; Record Grouping; Similarity;
DOI
10.4028/www.scientific.net/AMM.397-400.2464
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203; 0835;
Abstract
To address the problem of determining attribute weights when detecting approximately duplicate records, we propose a method based on fuzzy comprehensive evaluation for deriving the weight of each attribute in a data set. We first analyze the factors that make up each attribute and then evaluate their rank. Finally, we determine the attribute weights using the fuzzy comprehensive evaluation method and detect approximately duplicate records on that basis. Theoretical analysis and experimental results show that the method can objectively determine the weights of all attributes and effectively detect approximately duplicate records in massive data sets.
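
The pipeline outlined in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the composition factors, factor weights, grade scores, membership values, field-similarity measure (`difflib.SequenceMatcher`), and the 0.85 threshold are all hypothetical choices made for illustration. It uses the common weighted-average form of fuzzy comprehensive evaluation (B = A · R), normalizes the resulting scores into attribute weights, and compares two records by a weighted sum of field similarities.

```python
from difflib import SequenceMatcher

# All values below are assumptions for illustration, not data from the paper.
FACTOR_WEIGHTS = [0.5, 0.3, 0.2]  # weight vector A over three composition factors
GRADE_SCORES = [1.0, 0.6, 0.2]    # numeric scores for grades: high, medium, low

# Membership matrix R per attribute: R[i][j] is the degree to which
# factor i of that attribute belongs to grade j.
MEMBERSHIPS = {
    "name": [[0.8, 0.2, 0.0], [0.7, 0.2, 0.1], [0.6, 0.3, 0.1]],
    "city": [[0.1, 0.3, 0.6], [0.2, 0.3, 0.5], [0.2, 0.4, 0.4]],
}


def fce_score(membership):
    """Compute B = A . R (weighted-average operator) and defuzzify it."""
    grade_vector = [
        sum(w * row[j] for w, row in zip(FACTOR_WEIGHTS, membership))
        for j in range(len(GRADE_SCORES))
    ]
    return sum(b * s for b, s in zip(grade_vector, GRADE_SCORES))


def attribute_weights(memberships):
    """Normalize per-attribute evaluation scores so the weights sum to 1."""
    scores = {attr: fce_score(m) for attr, m in memberships.items()}
    total = sum(scores.values())
    return {attr: score / total for attr, score in scores.items()}


def record_similarity(a, b, weights):
    """Weighted sum of per-field string similarities in [0, 1]."""
    return sum(
        w * SequenceMatcher(None, a[attr], b[attr]).ratio()
        for attr, w in weights.items()
    )


def is_approx_duplicate(a, b, weights, threshold=0.85):
    """Flag a pair as approximately duplicate above a hypothetical threshold."""
    return record_similarity(a, b, weights) >= threshold


weights = attribute_weights(MEMBERSHIPS)
```

With these assumed memberships, `name` receives a larger weight than `city`, so a disagreement in the name field lowers the record similarity more than one in the city field.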
Pages: 2464-2468
Number of pages: 5