Objective and automated protocols for the evaluation of biomedical search engines using No Title Evaluation protocols

被引：3

作者：

Campagne, Fabien ^{[1
,2
]}

机构：

[1] Cornell Univ, Weill Med Coll, HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsau, New York, NY 10021 USA

[2] Cornell Univ, Weill Med Coll, Dept Physiol & Biophys, New York, NY 10021 USA

来源：

BMC BIOINFORMATICS | 2008年 / 9卷 / 1期

关键词：

Genome - Medical applications - Search engines - Information retrieval;

D O I：

10.1186/1471-2105-9-132

中图分类号：

Q5 [生物化学];

学科分类号：

071010 ; 081704 ;

摘要：

Background: The evaluation of information retrieval techniques has traditionally relied on human judges to determine which documents are relevant to a query and which are not. This protocol is used in the Text Retrieval Evaluation Conference (TREC), organized annually for the past 15 years, to support the unbiased evaluation of novel information retrieval approaches. The TREC Genomics Track has recently been introduced to measure the performance of information retrieval for biomedical applications. Results: We describe two protocols for evaluating biomedical information retrieval techniques without human relevance judgments. We call these protocols No Title Evaluation (NT Evaluation). The first protocol measures performance for focused searches, where only one relevant document exists for each query. The second protocol measures performance for queries expected to have potentially many relevant documents per query (high-recall searches). Both protocols take advantage of the clear separation of titles and abstracts found in Medline. We compare the performance obtained with these evaluation protocols to results obtained by reusing the relevance judgments produced in the 2004 and 2005 TREC Genomics Track and observe significant correlations between performance rankings generated by our approach and TREC. Spearman's correlation coefficients in the range of 0.79-0.92 are observed comparing bpref measured with NT Evaluation or with TREC evaluations. For comparison, coefficients in the range 0.86-0.94 can be observed when evaluating the same set of methods with data from two independent TREC Genomics Track evaluations. We discuss the advantages of NT Evaluation over the TRels and the data fusion evaluation protocols introduced recently. Conclusion: Our results suggest that the NT Evaluation protocols described here could be used to optimize some search engine parameters before human evaluation. Further research is needed to determine if NT Evaluation or variants of these protocols can fully substitute for human evaluations.

引用

页数：14

共 50 条

[1] Objective and automated protocols for the evaluation of biomedical search engines using No Title Evaluation protocols
Fabien Campagne
BMC Bioinformatics, 9
[2] Evaluation of chromogenic factor IX assays by automated protocols
Kershaw, G. W.
Dissanayake, K.
Chen, V. M.
Khoo, T. -L.
HAEMOPHILIA, 2018, 24 (03) : 492 - 501
[3] Automated evaluation of secure route discovery in MANET protocols
Andel, Todd R.
Yasinsac, Alec
MODEL CHECKING SOFTWARE, PROCEEDINGS, 2008, 5156 : 26 - +
[4] Evaluation of sampling, cookery, and shear force protocols for objective evaluation of lamb longissimus tenderness
Shackelford, SD
Wheeler, TL
Koohmaraie, A
JOURNAL OF ANIMAL SCIENCE, 2004, 82 (03) : 802 - 807
[5] Simple conformation space search protocols for the evaluation of enantioselectivity of lipases
Orrenius, C
van Heusden, C
van Ruiten, J
Overbeeke, PLA
Kierkels, H
Duine, JA
Jongejan, JA
PROTEIN ENGINEERING, 1998, 11 (12): : 1147 - 1153
[6] A SIMULATOR USING TRANSPUTERS FOR EVALUATION OF MULTILAYERED PROTOCOLS
SAITO, T
AIDA, H
FUJITA, H
SAADAN, Z
ELECTRONICS AND COMMUNICATIONS IN JAPAN PART I-COMMUNICATIONS, 1995, 78 (08): : 23 - 31
[7] Search Engines Evaluation
Kumar, Rakesh
Suri, P. K.
Chauhan, R. K.
DESIDOC JOURNAL OF LIBRARY & INFORMATION TECHNOLOGY, 2005, 25 (02): : 3 - 10
[8] Implementation and evaluation of a protocol management system for automated review of CT protocols
Grimes, Joshua
Leng, Shuai
Zhang, Yi
Vrieze, Thomas
McCollough, Cynthia
JOURNAL OF APPLIED CLINICAL MEDICAL PHYSICS, 2016, 17 (05): : 523 - 533
[9] Evaluation of Four Automated Protocols for Extraction of DNA from FTA Cards
Stangegaard, Michael
Borsting, Claus
Ferrero-Miliani, Laura
Frank-Hansen, Rune
Poulsen, Lena
Hansen, Anders J.
Morling, Niels
JALA, 2013, 18 (05): : 404 - 410
[10] Development of automated multiplex immunofluorescence protocols for tumor microenvironment evaluation.
Masabanda, Julio S.
Wang, Sherry
Vargas, Joseph
Ramos, Jason
CANCER RESEARCH, 2021, 81 (13)

← 1 2 3 4 5 →