Analysing performance in a word prediction system with multiple prediction methods

被引:1
|
作者
Vayrynen, Pertti Alvar [1 ]
Noponen, Kai [1 ]
Seppanen, Tapio [1 ]
机构
[1] Tietokonetekniikan Lab, Oulum Yliopisto Sahko Ja Tietotekniikan Osasto, Oulum Yliopisto 90014, Finland
来源
COMPUTER SPEECH AND LANGUAGE | 2007年 / 21卷 / 03期
关键词
D O I
10.1016/j.csl.2006.09.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article, we investigate the performance of a hybrid prediction system with a phrase prediction utility in English word prediction from two viewpoints. From the application user's point of view, measures of effort savings are important in word prediction. Global performance measures such as the average percentage of keystroke or character savings, however, hide rather than display the details of the functioning of the prediction system as a whole. In the present study, we analysed in detail the performance of a prediction system with a phrase prediction utility along with single word prediction. Our preliminary results with a corpus of 383 lexical bundles show that, from a technological viewpoint, the following three parameters affect the practical utility of the phrase prediction method in a hybrid prediction system: (1) cost of selecting an appropriate prediction mode for single word prediction and phrase prediction; (2) token frequency of phrases in the text predicted, and (3) coverage of the phrasal lexicon. We found that all three affect the phrase prediction performance in different proportions. When the percent of ambiguous search keys finding both phrases and single words is 20%, phrase frequency 35%, and coverage of the phrasal lexicon 98%, the character savings percentage for the whole text will be improved by 6% points under optimal conditions. The system is practically useful as long as an appropriate prediction mode can be selected automatically or the cost of disambiguation of a prediction mode is not too high. (c) 2006 Elsevier Ltd. All rights reserved.
引用
收藏
页码:479 / 491
页数:13
相关论文
共 50 条
  • [41] Word prediction and word completion for the German language
    Zagler, WL
    ASSISTIVE TECHNOLOGY ON THE THRESHOLD OF THE NEW MILLENNIUM, 1999, 6 : 191 - 196
  • [42] Prediction of system reliability for multiple component repairs
    Sun, Yong
    Ma, Lin
    Mathew, Joseph
    2007 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT, VOLS 1-4, 2007, : 1186 - 1190
  • [43] Multiple architecture system for wind speed prediction
    Bouzgou, Hassen
    Benoudjit, Nabil
    APPLIED ENERGY, 2011, 88 (07) : 2463 - 2471
  • [44] Computational methods in the development of a knowledge-based system for the prediction of solid catalyst performance
    Procelewska, Joanna
    Galilea, Javier Llamas
    Clerc, Frederic
    Farrusseng, David
    Schueth, Ferdi
    COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2007, 10 (01) : 37 - 50
  • [45] Feasibility of Word Difficulty Prediction
    Baeza-Yates, Ricardo
    Mayo-Casademont, Marti
    Rello, Luz
    STRING PROCESSING AND INFORMATION RETRIEVAL (SPIRE 2015), 2015, 9309 : 362 - 373
  • [46] Posgram Driven Word Prediction
    Spiccia, Carmelo
    Augello, Agnese
    Pilato, Giovanni
    2015 7TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (IC3K), 2015, : 589 - 596
  • [47] Neural evidence of word prediction
    Aristia, Jane
    NATURE REVIEWS PSYCHOLOGY, 2024, 3 (02): : 71 - 71
  • [48] A classification approach to word prediction
    Even-Zohar, Y
    Roth, D
    6TH APPLIED NATURAL LANGUAGE PROCESSING CONFERENCE/1ST MEETING OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE AND PROCEEDINGS OF THE ANLP-NAACL 2000 STUDENT RESEARCH WORKSHOP, 2000, : A124 - A131
  • [49] Neural evidence of word prediction
    Jane Aristia
    Nature Reviews Psychology, 2024, 3 : 71 - 71
  • [50] Corpus Studies in Word Prediction
    Trnka, Keith
    McCoy, Kathleen F.
    ASSETS'07: PROCEEDINGS OF THE NINTH INTERNATIONAL ACM SIGACCESS CONFERENCE ON COMPUTERS AND ACCESSIBILITY, 2007, : 195 - 202