Extended Compact Web Graph Representations

Authors
Claude, Francisco [1]
Navarro, Gonzalo [2]
Affiliations
[1] Univ Waterloo, David R Cheriton Sch Comp Sci, Waterloo, ON N2L 3G1, Canada
[2] Univ Chile, Dept Comp Sci, Santiago, Chile
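Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
DOI
Not available
Chinese Library Classification
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract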
Many relevant Web mining tasks translate into classical algorithms on the Web graph. Compact Web graph representations allow running these tasks on larger graphs within main memory. These representations at least provide fast navigation (to the neighbors of a node), yet more sophisticated operations are desirable for several Web analyses. We present a compact Web graph representation that, in addition, supports reverse navigation (to the nodes pointing to the given one). The standard approach to achieve this is to represent the graph and its transpose, which basically doubles the space requirement. Our structure, instead, represents the adjacency list using a compact sequence representation that allows finding the positions where a given node v is mentioned, and answers reverse navigation using that primitive. This is combined with a previous proposal based on grammar compression of the adjacency list. The combination yields interesting algorithmic problems. As a result, we achieve the smallest graph representation reported in the literature that supports direct and reverse navigation, and also obtain other variants that occupy relevant niches in the space/time tradeoff.
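The reverse-navigation idea in the abstract can be illustrated with a small sketch. It is not the authors' structure: the class `SequenceGraph`, its fields, and the plain dictionary that stands in for the compact sequence representation are all assumptions made for illustration, and no compression (grammar-based or otherwise) is attempted. It only shows how finding the positions where a node v is mentioned in the concatenated adjacency lists is enough to answer reverse navigation without storing the transpose.

```python
from bisect import bisect_right
from collections import defaultdict


class SequenceGraph:
    """Toy graph stored as one concatenated adjacency sequence (hypothetical)."""

    def __init__(self, adjacency):
        # adjacency[u] is the list of out-neighbors of node u (nodes are 0..n-1).
        self.seq = []      # all adjacency lists, concatenated in node order
        self.starts = []   # starts[u] = index in seq where the list of u begins
        for targets in adjacency:
            self.starts.append(len(self.seq))
            self.seq.extend(targets)
        # Assumption: a plain dictionary replaces the "find the positions where
        # v is mentioned" primitive of a compact sequence representation.
        self.positions = defaultdict(list)
        for pos, v in enumerate(self.seq):
            self.positions[v].append(pos)

    def successors(self, u):
        # Direct navigation: slice the part of the sequence owned by u.
        end = self.starts[u + 1] if u + 1 < len(self.starts) else len(self.seq)
        return self.seq[self.starts[u]:end]

    def predecessors(self, v):
        # Reverse navigation: each occurrence of v lies in the list of exactly
        # one source node, found by binary search over the list boundaries.
        return [bisect_right(self.starts, pos) - 1
                for pos in self.positions.get(v, [])]


graph = SequenceGraph([[1, 2], [], [0, 1], [2]])
print(graph.successors(2))    # out-neighbors of node 2
print(graph.predecessors(1))  # nodes pointing to node 1
```

The dictionary of positions costs as much space as the graph itself, so this sketch gives none of the space savings the abstract reports; it only mirrors the query logic.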
Pages: 77 / +
Page count: 3
Related papers
50 in total
  • [1] Fast and Compact Web Graph Representations
    Claude, Francisco
    Navarro, Gonzalo
    ACM TRANSACTIONS ON THE WEB, 2010, 4 (04)
  • [2] Overlapping Spaces for Compact Graph Representations
    Shevkunov, Kirill
    Prokhorenkova, Liudmila
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [3] Compact Representations of Extended Causal Models
    Halpern, Joseph Y.
    Hitchcock, Christopher
    COGNITIVE SCIENCE, 2013, 37 (06) : 986 - 1010
  • [4] Graph representations for Web document clustering
    Schenker, A
    Last, M
    Bunke, H
    Kandel, A
    PATTERN RECOGNITION AND IMAGE ANALYSIS, PROCEEDINGS, 2003, 2652 : 935 - 942
  • [5] A fast and compact web graph representation
    Claude, Francisco
    Navarro, Gonzalo
    STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2007, 4726 : 118 - 129
  • [6] Approximating the Graph Edit Distance with Compact Neighborhood Representations
    Bause, Franka
    Permann, Christian
    Kriege, Nils M.
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT V, ECML PKDD 2024, 2024, 14945 : 300 - 318
  • [7] Compact Path Representations for Graph Database Pattern Matching
    Martens, Wim
    Niewerth, Matthias
    Popp, Tina
    Rojas, Carlos
    Vansummeren, Stijn
    Vrgoc, Domagoj
    2024 IEEE 40TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, ICDEW, 2024, : 379 - 380
  • [8] Compact representation of Web graphs with extended functionality
    Brisaboa, Nieves R.
    Ladra, Susana
    Navarro, Gonzalo
    Information Systems, 2014, 39 (01) : 152 - 174
  • [9] Engineering Wavelet Tree Implementations for Compressed Web Graph Representations
    He, Meng
    Miao, Chen
    2016 DATA COMPRESSION CONFERENCE (DCC), 2016, : 603 - 603