Solving the multiple instance problem with axis-parallel rectangles

Cited by: 1757
Authors
Dietterich, TG
Lathrop, RH
Lozano-Pérez, T
Affiliations
[1] UNIV CALIF IRVINE, DEPT INFORMAT & COMP SCI, IRVINE, CA 92697 USA
[2] ARRIS PHARMACEUT CORP, San Francisco, CA 94080 USA
[3] MIT, ARTIFICIAL INTELLIGENCE LAB, CAMBRIDGE, MA 02139 USA
Keywords
machine learning; drug design; structure-activity relationships
DOI
10.1016/S0004-3702(96)00034-3
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The multiple instance problem arises in tasks where the training examples are ambiguous: a single example object may have many alternative feature vectors (instances) that describe it, and yet only one of those feature vectors may be responsible for the observed classification of the object. This paper describes and compares three kinds of algorithms that learn axis-parallel rectangles to solve the multiple instance problem. Algorithms that ignore the multiple instance problem perform very poorly. An algorithm that directly confronts the multiple instance problem (by attempting to identify which feature vectors are responsible for the observed classifications) performs best, giving 89% correct predictions on a musk odor prediction task. The paper also illustrates the use of artificial data to debug and compare these algorithms.
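The setting described in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithms: the names `in_rect`, `classify_bag`, and `bounding_rect`, the toy data, and the hand-picked "tight" rectangle are all assumptions made for illustration. The sketch treats each example object as a bag of feature vectors, labels a bag positive when at least one of its instances falls inside an axis-parallel rectangle, and contrasts this with a naive rectangle that bounds every instance of every positive bag and so ignores the ambiguity.

```python
from typing import List, Sequence, Tuple

Instance = Sequence[float]
Bag = List[Instance]                    # one example object = many instances
Rect = Tuple[List[float], List[float]]  # (lower bounds, upper bounds) per feature


def in_rect(x: Instance, rect: Rect) -> bool:
    """True if every feature of x lies within the rectangle's bounds."""
    lo, hi = rect
    return all(l <= v <= h for v, l, h in zip(x, lo, hi))


def classify_bag(bag: Bag, rect: Rect) -> bool:
    """A bag is positive if at least one of its instances is inside rect."""
    return any(in_rect(x, rect) for x in bag)


def bounding_rect(bags: List[Bag]) -> Rect:
    """Naive baseline (illustrative): bound ALL instances of the given bags."""
    points = [x for bag in bags for x in bag]
    lo = [min(col) for col in zip(*points)]
    hi = [max(col) for col in zip(*points)]
    return lo, hi


# Toy data (hypothetical): each positive bag has one instance near (1, 1),
# which is assumed to be the instance responsible for the positive label.
positive_bags = [[(1.0, 1.0), (5.0, 5.0)], [(1.2, 0.9), (-3.0, 4.0)]]
negative_bag = [(4.0, 4.0), (0.0, 3.0)]

naive = bounding_rect(positive_bags)  # covers the irrelevant instances too
tight = ([0.9, 0.8], [1.3, 1.1])      # hand-picked around the shared region

naive_says_positive = classify_bag(negative_bag, naive)
tight_says_positive = classify_bag(negative_bag, tight)
```

On this toy data the naive rectangle also covers the negative bag, while the tight rectangle covers one instance from each positive bag and none from the negative bag. Finding such a rectangle automatically is the learning problem the paper addresses.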
Pages: 31-71
Number of pages: 41