An approach for document fragment retrieval and its formatting issue in engineering information management

被引:0
|
作者
Liu, Shaofeng [1 ]
McMahon, Chris A. [1 ]
Darlington, Mansur J. [1 ]
Culley, Steve J. [1 ]
Wild, Peter J. [1 ]
机构
[1] Univ Bath, Dept Mech Engn, IMRC, Bath BA2 7AY, Avon, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper discusses engineering document fragment mark-up supported by the use of the eXstensible Stylesheet Language - Formatting Objects (XLS-FO). XLS-FO can be used to convert the native format representation of such documents as Word, Excel and PDF into XML. Once in XML, documents fragments can be retrieved at will in response to a search query. In the paper the process of a document fragment retrieval - based on the authors' decomposition scheme approach - has been modelled and the issue of converting documents into XML addressed. Additionally, the use of document templates is discussed as a means of ensuring that the transformed XML documents are compliant with the decomposition schemes. Automating the reformatting of documents into XML and the use of templates helps make implementation of a document-fragment approach to retrieval more resource efficient, so making its adoption in industry more practicable.
引用
收藏
页码:279 / 287
页数:9
相关论文
共 50 条
  • [1] A review of structured document retrieval (SDR) technology to improve information access performance in engineering document management
    Liu, S.
    McMahon, C. A.
    Culley, S. J.
    COMPUTERS IN INDUSTRY, 2008, 59 (01) : 3 - 16
  • [2] A computational framework for retrieval of document fragments based on decomposition schemes in engineering information management
    Liu, S.
    McMahon, C. A.
    Darlington, M. J.
    Culley, S. J.
    Wild, P. J.
    ADVANCED ENGINEERING INFORMATICS, 2006, 20 (04) : 401 - 413
  • [3] Information retrieval and document management in the multimedia age
    Fugmann, R
    KNOWLEDGE ORGANIZATION, 1998, 25 (03): : 119 - 120
  • [4] A personalized query expansion approach for engineering document retrieval
    Hahm, Gyeong June
    Yi, Mun Yong
    Lee, Jae Hyun
    Suh, Hyo Won
    ADVANCED ENGINEERING INFORMATICS, 2014, 28 (04) : 344 - 359
  • [5] Using metadata for information retrieval in document management systems
    Andric, MA
    Hall, W
    EUROCON 2005: THE INTERNATIONAL CONFERENCE ON COMPUTER AS A TOOL, VOL 1 AND 2 , PROCEEDINGS, 2005, : 1093 - 1096
  • [6] A Context Sensitive Document Indexing Approach for Information Retrieval
    Vanishree, M.
    Sudha, R.
    2014 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2014,
  • [7] The Ontological Approach to the Identification of Information in Tasks of Document Retrieval
    Golitsyna, O. L.
    Maksimov, N. V.
    Okropishina, O. V.
    Strogonov, V. I.
    AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS, 2012, 46 (03) : 125 - 132
  • [8] INTERACTIVE DOCUMENT DISPLAY AND ITS USE IN INFORMATION-RETRIEVAL
    BOVEY, JD
    BROWN, PJ
    JOURNAL OF DOCUMENTATION, 1987, 43 (02) : 125 - 137
  • [9] Semantic relation based personalized ranking approach for engineering document retrieval
    Hahm, Gyeong June
    Lee, Jae Hyun
    Suh, Hyo Won
    ADVANCED ENGINEERING INFORMATICS, 2015, 29 (03) : 366 - 379
  • [10] From Word Embeddings To Document Similarities for Improved Information Retrieval in Software Engineering
    Ye, Xin
    Shen, Hui
    Ma, Xiao
    Bunescu, Razvan
    Liu, Chang
    2016 IEEE/ACM 38TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2016, : 404 - 415