Active learning using adaptable task-based prioritisation

被引:2
|
作者
Saeed, Shaheer U. [1 ,2 ]
Ramalhinho, Joao [1 ,2 ]
Pinnock, Mark [1 ,2 ]
Shen, Ziyi [1 ,2 ]
Fu, Yunguan [1 ,2 ,3 ]
Montana-Brown, Nina [1 ,2 ]
Bonmati, Ester [1 ,2 ,4 ]
Barratt, Dean C. [1 ,2 ]
Pereira, Stephen P. [1 ,2 ,5 ]
Davidson, Brian [1 ,2 ,6 ]
Clarkson, Matthew J. [1 ,2 ]
Hu, Yipeng [1 ,2 ]
机构
[1] UCL, Ctr Med Image Comp, Wellcome EPSRC Ctr Intervent & Surg Sci, London, England
[2] UCL, Dept Med Phys & Biomed Engn, London, England
[3] InstaDeep, London, England
[4] Univ Westminster, Sch Comp Sci & Engn, London, England
[5] UCL, Inst Liver & Digest Hlth, London, England
[6] UCL, Ctr Surg Innovat Organ Regenerat & Transplantat CI, Div Surg & Intervent Sci, London, England
基金
英国工程与自然科学研究理事会; 美国国家卫生研究院;
关键词
Active Learning; Medical Image Quality; Segmentation;
D O I
10.1016/j.media.2024.103181
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Supervised machine learning-based medical image computing applications necessitate expert label curation, while unlabelled image data might be relatively abundant. Active learning methods aim to prioritise a subset of available image data for expert annotation, for label-efficient model training. We develop a controller neural network that measures priority of images in a sequence of batches, as in batch-mode active learning, for multi- class segmentation tasks. The controller is optimised by rewarding positive task-specific performance gain, within a Markov decision process (MDP) environment that also optimises the task predictor. In this work, the task predictor is a segmentation network. A meta-reinforcement learning algorithm is proposed with multiple MDPs, such that the pre-trained controller can be adapted to a new MDP that contains data from different institutes and/or requires segmentation of different organs or structures within the abdomen. We present experimental results using multiple CT datasets from more than one thousand patients, with segmentation tasks of nine different abdominal organs, to demonstrate the efficacy of the learnt prioritisation controller function and its cross-institute and cross-organ adaptability. We show that the proposed adaptable prioritisation metric yields converging segmentation accuracy for a new kidney segmentation task, unseen in training, using between approximately 40% to 60% of labels otherwise required with other heuristic or random prioritisation metrics. For clinical datasets of limited size, the proposed adaptable prioritisation offers a performance improvement of 22.6% and 10.2% in Dice score, for tasks of kidney and liver vessel segmentation, respectively, compared to random prioritisation and alternative active sampling strategies.
引用
收藏
页数:13
相关论文
共 50 条