A Novel Architecture for Search Engine using Domain Based Web Log Data

被引:1
|
作者
Sharma, Prem [1 ]
Yadav, Divakar [2 ]
机构
[1] Veer Madho Singh Bhandari Uttarakhand Tech Univ, Comp Sci & Engn, Sudhowala, India
[2] Indira Gandhi Natl Open Univ, Sch Comp & Informat Sci, New Delhi, India
关键词
Search engine; information retrieval; web usage mining; content mining; RANKING; USAGE;
D O I
10.34028/iajit/20/1/10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Search engines, an information retrieval tool are the main source of information for users' information need now a day. For every query, the search engine explores its repository and/or indexer to find the relevant documents/URLs for that query. Page ranking algorithms rank the Uniform Resource Locator in abstract section (URLs) according to its relevancy with respect to users' query. It is analyzed that many of the queries fired by users on search engines are duplicate. There is a scope to improve the performance of search engine to reduce its efforts for duplicate queries. In this paper a proxy server is created that keep store the search results of user queries in web log. The proposed proxy server uses this web log to find results faster for duplicate queries fired next time. The proposed scheme has been tested and found prominent. The proposed architecture tested for ten duplicate user queries. it return all relevant web pages for duplicate user query (if query is found in web log at proxy server) from a particular domain instead of entire database. It reduces the perceived latency for duplicate query and also improves the value of precession and accuracy up to 81.8% and 99% respectively for all duplicate user queries.
引用
收藏
页码:92 / 101
页数:10
相关论文
共 50 条
  • [31] Overview of an agent based search engine architecture
    de la Mata, J
    Olivas, JA
    Serrano-Guerrero, J
    IC-AI '04 & MLMTA'04 , VOL 1 AND 2, PROCEEDINGS, 2004, : 62 - 67
  • [32] EUREKA: A Web Based Search Engine for Hospitals
    Guidi, Gabriele
    Luschi, Alessio
    Miniati, Roberto
    Iadanza, Ernesto
    6TH EUROPEAN CONFERENCE OF THE INTERNATIONAL FEDERATION FOR MEDICAL AND BIOLOGICAL ENGINEERING, 2015, 45 : 625 - 628
  • [33] FPGA-BASED ACCELERATION OF NEURAL NETWORK FOR RANKING IN WEB SEARCH ENGINE WITH A STREAMING ARCHITECTURE
    Yan, Jing
    Xu, Ning-Yi
    Cai, Xiong-Fei
    Gao, Rui
    Wang, Yu
    Luo, Rong
    Hsu, Feng-Hsiung
    FPL: 2009 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS, 2009, : 662 - +
  • [34] A domain-based intelligent search engine
    Zhong, Minjuan
    Lu, Xingdong
    COMPUTATIONAL INTELLIGENCE, PT 2, PROCEEDINGS, 2006, 4114 : 461 - 467
  • [35] Domain-Based Search Engine Evaluation
    Bajpai, Nidhi
    Arora, Deepak
    PROGRESS IN ADVANCED COMPUTING AND INTELLIGENT ENGINEERING, VOL 2, 2018, 564 : 711 - 720
  • [36] Web Site Auditing Using Web Access Log Data
    He, Si
    Balecel, Nabil
    Hamam, Habib
    Bouslimani, Yassine
    2009 7TH ANNUAL COMMUNICATION NETWORKS AND SERVICES RESEARCH CONFERENCE, 2009, : 94 - +
  • [37] A FRAMEWORK OF CONTENT-BASED WEB IMAGE SEARCH ENGINE USING MAPREDUCE
    Li Junyi
    Li Jianhua
    Li Xiang
    2011 INTERNATIONAL CONFERENCE ON INSTRUMENTATION, MEASUREMENT, CIRCUITS AND SYSTEMS (ICIMCS 2011), VOL 2: FUTURE COMMUNICATION AND NETWORKING, 2011, : 311 - 314
  • [38] Semantic Similarity Measures in the Biomedical Domain by Leveraging a Web Search Engine
    Hsieh, Sheau-Ling
    Chang, Wen-Yung
    Chen, Chi-Huang
    Weng, Yung-Ching
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2013, 17 (04) : 853 - 861
  • [39] Semantic Similarity Measure in Biomedical Domain Leverage Web Search Engine
    Chen, Chi-Huang
    Hsieh, Sheau-Ling
    Weng, Yung-Ching
    Chang, Wen-Yung
    Lai, Feipei
    2010 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2010, : 4436 - 4439
  • [40] When the Web is your Data Lake: Creating a Search Engine for Datasets on the Web
    Noy, Natasha
    SIGMOD'20: PROCEEDINGS OF THE 2020 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2020, : 801 - 801