In the recent years, multicue visual tracking frameworks have been preferred over single cue visual tracking approaches to address critical environmental challenges. In literature, it has been well accepted that combining multiple complementary cues extracted from single sensor or multiple sensors, deep features and features extracted from different layers of deep learning architecture enhance tracking performance and accuracy. In this paper, we have categorized the multi-cue object tracking work based on the exploited appearance model into traditional architecture and deep learning based trackers. The categorized work have been tabulated to provide detailed overview of the representative work and to list out the new trends in the domain. Also, we have briefly analyzed the various tracking benchmark and tabulated their substantial parameters. Our review work analyze the recent trends in the field of object tracking alongwith the latest tracking benchmark to indicate the future directions to the researchers. In addition, we have experimentally evaluated the state-of-the-arts on OTB-15, UAV123, VOT2017 and LaSOT datasets under various tracking challenges. (C) 2020 Elsevier Ltd. All rights reserved.