Hardware Accelerator for Full-Text Search (HAFTS) with Succinct Data Structure

Cited by: 2
Authors
Tanida, Naoki [1]
Inaba, Mary [1]
Hiraki, Kei [1]
Yoshino, Takeshi [2]
Affiliations
[1] Univ Tokyo, Tokyo, Japan
[2] Google Japan Inc, Tokyo, Japan
Source
2009 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS | 2009
Keywords
full-text search; succinct data structure; compressed suffix array; FPGA
DOI
10.1109/ReConFig.2009.74
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Efficient use of massive data, for example through full-text search, has become important given the growing needs of Web search and genome analysis. Handling massive amounts of data requires both high-speed search and small storage space. High-speed search generally requires a data structure such as an index, which needs additional storage space. Recently, the compressed suffix array has been proposed: a data structure with an indexable dictionary that can compress data to its information-theoretic lower bound. Its distinctive feature is that data can be retrieved directly from the compressed data without decompression. Furthermore, in theory, the computational complexity of data retrieval is the same for compressed and uncompressed data, assuming that the rank operation on the bit vector can be executed in constant time; this rank operation returns the number of occurrences of smaller elements. In practice, the rank operation involves many bit manipulations and random memory accesses. Hence, this constant time is not negligible, and data retrieval using a compressed suffix array is slower than retrieval using a plain suffix array. Although compression to create an indexable dictionary is performed only once, data retrieval queries occur repeatedly. Hence, high-speed rank operations on bit vectors are essential for compressed suffix arrays. We propose an FPGA-based hardware accelerator for full-text search (HAFTS) with a compressed suffix array. The FPGA helps speed up the rank operation for the compressed suffix array by performing many bit calculations simultaneously and by controlling the order of memory accesses. We conduct performance simulations of HAFTS, taking as our model a development board on which the FPGA is connected to DDR2-800 SDRAM by a 64-bit bus. We evaluate the performance of HAFTS by comparing it with that of a software implementation. We conclude that the search speed of FPGA-based HAFTS is seven times faster than that of the software implementation.
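The rank operation that the abstract identifies as the bottleneck can be illustrated with a small software sketch. It assumes the usual bit-vector definition, where rank1(i) is the number of 1 bits in positions [0, i), and a 64-bit block size. The class name `RankBitVector` and its layout are hypothetical and are not taken from the paper; the sketch only shows why each query costs one directory lookup (a memory access) plus a masked population count (bit manipulation), the two costs the abstract says an FPGA can reduce.

```python
class RankBitVector:
    """Bit vector with a per-block directory of cumulative 1-counts.

    Illustrative only: block size and layout are assumptions, not the
    design described in the paper.
    """

    BLOCK_BITS = 64  # assumed block size (one 64-bit word per block)

    def __init__(self, bits):
        self.n = len(bits)
        self.words = []          # packed blocks, least significant bit first
        self.block_ranks = [0]   # block_ranks[k] = number of 1s before block k
        for start in range(0, self.n, self.BLOCK_BITS):
            word = 0
            for offset, bit in enumerate(bits[start:start + self.BLOCK_BITS]):
                if bit:
                    word |= 1 << offset
            self.words.append(word)
            ones = bin(word).count("1")  # population count of this block
            self.block_ranks.append(self.block_ranks[-1] + ones)

    def rank1(self, i):
        """Return the number of 1 bits in positions [0, i)."""
        if not 0 <= i <= self.n:
            raise IndexError("rank position out of range")
        block, offset = divmod(i, self.BLOCK_BITS)
        if offset == 0:
            return self.block_ranks[block]
        mask = (1 << offset) - 1  # keep only the bits below position i
        return self.block_ranks[block] + bin(self.words[block] & mask).count("1")


if __name__ == "__main__":
    bv = RankBitVector([1, 0, 1, 1, 0, 0, 1])
    print(bv.rank1(4))  # counts the 1s among the first four bits
```

With the directory precomputed once, every query does a fixed amount of work regardless of the vector length, which matches the constant-time assumption stated in the abstract. In software, the directory lookup and the word fetch are random memory accesses and the population count is a sequence of bit operations.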
Pages: 155 / +
Page count: 2