Extending Web Mining to Digital Forensics Text Mining

被引:0
|
作者
Hicks, Chelsea [1 ]
Beebe, Nicole Lang
Haliscak, Brandi [1 ]
机构
[1] Univ Texas San Antonio, San Antonio, TX 78249 USA
来源
关键词
Digital forensics; text mining; string search; ranking algorithm; PageRank; PAGERANK;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
As digital devices become increasingly integral to our daily lives, so too is the prevalence of digital evidence in the form of unstructured, textual data. Such data exists in both cyber and non-cyber crime cases. As a result, text mining is an important forensic technique to digital forensic investigators. However, text mining in the digital forensics domain is a non-trivial task, as investigators must locate relevant search hits amongst millions of investigatively non-relevant hits that are in-fact responsive to the search query. This emergent research tackles the problem of exceedingly poor information retrieval overhead by reviewing extant web mining ranking algorithms, explaining why they cannot be simply extended to digital forensic text mining, and proposing a new digital forensic text mining ranking algorithm, using PageRank as its basis. Future work is on-going and focused on lexical ontology development and validating the proposed algorithm.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] Web text mining method with word familiarity database
    Akihiro, K
    Tsutomu, F
    KNOWLEDGE-BASED INTELLIGENT INFORMATION ENGINEERING SYSTEMS & ALLIED TECHNOLOGIES, PTS 1 AND 2, 2001, 69 : 1415 - 1419
  • [42] Text-Mining-Methoden im Semantic Web
    Gerold Schneider
    Heinrich Zimmermann
    Wirtschaftsinformatik & Management, 2011, 3 (3) : 28 - 35
  • [43] Text-Mining-Methoden im Semantic Web
    Gerold Schneider
    Heinrich Zimmermann
    HMD Praxis der Wirtschaftsinformatik, 2010, 47 (1) : 35 - 46
  • [44] Analyzing Customer Complaints: A Web Text Mining Application
    Ozyirmidokuz, Esra Kahya
    Ozyirmidokuz, Mustafa Hakan
    INTERNATIONAL CONFERENCE ON EDUCATION AND SOCIAL SCIENCES (INTCESS14), VOLS I AND II, 2014, : 507 - 515
  • [45] Fuzzy Classification of Web Reports with Linguistic Text Mining
    Dedek, Jan
    Vojtas, Peter
    2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 3, 2009, : 167 - 170
  • [46] Mining subtopics from text fragments for a web query
    Qinglei Wang
    Yanan Qian
    Ruihua Song
    Zhicheng Dou
    Fan Zhang
    Tetsuya Sakai
    Qinghua Zheng
    Information Retrieval, 2013, 16 : 484 - 503
  • [47] A text mining based approach for web service classification
    Rozina Nisa
    Usman Qamar
    Information Systems and e-Business Management, 2015, 13 : 751 - 768
  • [48] News item extraction for text mining in web newspapers
    Norvåg, K
    Oyri, R
    International Workshop on Challenges in Web Information Retrieval and Integration, Proceedings, 2005, : 195 - 204
  • [49] Application of An Improved DBSCAN Algorithm in Web Text Mining
    Xie Ping
    Zhang Lin
    Wang Ying
    Li Qinqian
    PROCEEDINGS OF THE 1ST INTERNATIONAL WORKSHOP ON CLOUD COMPUTING AND INFORMATION SECURITY (CCIS 2013), 2013, 52 : 400 - 403
  • [50] frances: A Deep Learning NLP and Text Mining Web Tool to Unlock Historical Digital Collections
    Filgueira, Rosa
    2022 IEEE 18TH INTERNATIONAL CONFERENCE ON E-SCIENCE (ESCIENCE 2022), 2022, : 246 - 255