Giving life to dead: role of WayBack Machine in recovery of dead URLs

被引:1
|
作者
Loan, Fayaz Ahmad [1 ,2 ]
Khan, Aasif Mohammad [1 ]
Andrabi, Syed Aasif Ahmad [1 ]
Sozia, Sozia Rashid [1 ]
Parray, Umer Yousuf [1 ,3 ]
机构
[1] Univ Kashmir, Ctr Cent Asian Studies, Srinagar, India
[2] Islamic Univ Sci & Technol, Kashmir, Jammu & Kashmir, India
[3] Sher e Kashmir Univ Agr Sci & Technol Kashmir SKUA, Srinagar, Jammu & Kashmir, India
关键词
URL persistence; URL decay; Dead references; Link rot; Error codes; WayBack Machine; INFORMATION-SCIENCE; WEB CITATIONS; INTERNET CITATIONS; DECAY; PERSISTENCE; RESOURCE; AVAILABILITY; JOURNALS; LINK;
D O I
10.1108/DTA-06-2022-0242
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
PurposeThe purpose of the present study is to identify the active and dead links of uniform resource locators (URLs) associated with web references and to compare the effectiveness of Chrome, Google and WayBack Machine in retrieving the dead URLs.Design/methodology/approachThe web references of the Library Hi Tech from 2004 to 2008 were selected for analysis to fulfill the set objectives. The URLs were extracted from the articles to verify their accessibility in terms of persistence and decay. The URLs were then executed directly in the internet browser (Chrome), search engine (Google) and Internet Archive (WayBack Machine). The collected data were recorded in an excel file and presented in tables/diagrams for further analysis.FindingsFrom the total of 1,083 web references, a maximum number was retrieved by the WayBack Machine (786; 72.6 per cent) followed by Google (501; 46.3 per cent) and the lowest by Chrome (402; 37.1 per cent). The study concludes that the WayBack Machine is more efficient, retrieves a maximum number of missing web citations and fulfills the mission of preservation of web sources to a larger extent.Originality/valueA good number of studies have been conducted to analyze the persistence and decay of web-references; however, the present study is unique as it compared the dead URL retrieval effectiveness of internet explorer (Chrome), search engine giant (Google) and WayBack Machine of the Internet Archive.Research limitations/implicationsThe web references of a single journal, namely, Library Hi Tech, were analyzed for 5 years only. A major study across disciplines and sources may yield better results.Practical implicationsURL decay is becoming a major problem in the preservation and citation of web resources. The study has some healthy recommendations for authors, editors, publishers, librarians and web designers to improve the persistence of web references.
引用
收藏
页码:201 / 213
页数:13
相关论文
共 50 条