Integrating diverse data sources to predict disease risk in dairy cattle-a machine learning approach

被引:8
|
作者
Lasser, Jana [1 ,2 ,3 ]
Matzhold, Caspar [1 ,3 ]
Egger-Danner, Christa [4 ]
Fuerst-Waltl, Birgit [5 ]
Steininger, Franz [4 ]
Wittek, Thomas [6 ]
Klimek, Peter [1 ,3 ]
机构
[1] Med Univ Vienna, Ctr Med Stat Informat & Intelligent Syst, Sect Sci Complex Syst, A-1090 Vienna, Austria
[2] Graz Univ Technol, Inst Interact Syst & Data Sci, A-8010 Graz, Austria
[3] Complex Sci Hub Vienna, A-1080 Vienna, Austria
[4] ZuchtData EDV Dienstleistungen GmbH, A-1200 Vienna, Austria
[5] Univ Nat Resources & Life Sci, Div Livestock Sci, A-1180 Vienna, Austria
[6] Vetmeduni Vienna, Univ Clin Ruminants, A-1210 Vienna, Austria
关键词
data integration; disease prediction; machine learning; precision livestock farming; LAMENESS SCORING SYSTEM; BODY CONDITION SCORE; TEST DAY MILK; COWS; MASTITIS; HEALTH; YIELD; ASSOCIATION; KETOSIS; TRAITS;
D O I
10.1093/jas/skab294
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
Livestock farming is currently undergoing a digital revolution and becoming increasingly data-driven. Yet, such data often reside in disconnected silos making them impossible to leverage their full potential to improve animal well-being. Here, we introduce a precision livestock farming approach, bringing together information streams from a variety of life domains of dairy cattle to study whether including more and diverse data sources improves the quality of predictions for eight diseases and whether using more complex prediction algorithms can, to some extent, compensate for less diverse data. Using three machine learning approaches of varying complexity (from logistic regression to gradient boosted trees) trained on data from 5,828 animals in 165 herds in Austria, we show that the prediction of lameness, acute and chronic mastitis, anestrus, ovarian cysts, metritis, ketosis (hyperketonemia), and periparturient hypocalcemia (milk fever) from routinely available data gives encouraging results. For example, we can predict lameness with high sensitivity and specificity (F1= 0.74). An analysis of the importance of individual variables to prediction performance shows that disease in dairy cattle is a product of the complex interplay between a multitude of life domains, such as housing, nutrition, or climate, that including more and diverse data sources increases prediction performance, and that the reuse of existing data can create actionable information for preventive interventions. Our findings pave the way toward data-driven point-of-care interventions and demonstrate the added value of integrating all available data in the dairy industry to improve animal well-being and reduce disease risk.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Integrating heterogeneous data sources for traffic flow prediction through extreme learning machine
    Zhang, Qingqing
    Jian, Darren
    Xu, Rui
    Dai, Wei
    Liu, Ying
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 4189 - 4194
  • [22] An Approach with Machine Learning for Heart Disease Risk Prediction
    Jeribi, Fathe
    Kaur, Chamandeep
    Pawar, A. B.
    2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 1474 - 1479
  • [23] A Machine Learning Approach for Risk Prediction of Cardiovascular Disease
    Panda, Shovna
    Palei, Shantilata
    Samartha, Mullapudi Venkata Sai
    Jena, Biswajit
    Saxena, Sanjay
    COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT II, 2024, 2010 : 313 - 323
  • [24] INTEGRATING COUNTY-LEVEL HEALTH, HUMAN SERVICES, AND CRIMINAL JUSTICE DATA TO PREDICT RISK OF OPIOID OVERDOSE AMONG MEDICAID BENEFICIARIES: A MACHINE LEARNING APPROACH
    Lo-Ciganic, W.
    Donohue, J. M.
    Hulsey, E.
    Barnes, S.
    Li, Y.
    Kuza, C.
    Yang, Q.
    Buchanich, J.
    Huang, J.
    Gellad, W.
    VALUE IN HEALTH, 2020, 23 : S206 - S206
  • [25] Improving streamflow predictions across CONUS by integrating advanced machine learning models and diverse data
    Tayal, Kshitij
    Arvind, Renganathan
    Lu, Dan
    ENVIRONMENTAL RESEARCH LETTERS, 2024, 19 (10):
  • [26] Using machine learning methods to predict dry matter intake from milk mid-infrared spectroscopy data on Swedish dairy cattle
    Salleh, Suraya Mohamad
    Danielsson, Rebecca
    Kronqvist, Cecilia
    JOURNAL OF DAIRY RESEARCH, 2023, 90 (01) : 5 - 8
  • [27] Reconciling schemas of disparate data sources: A machine-learning approach
    Doan, AH
    Domingos, P
    Halevy, A
    SIGMOD RECORD, 2001, 30 (02) : 509 - 520
  • [28] Decoding Wilson disease: a machine learning approach to predict neurological symptoms
    Yang, Yulong
    Wang, Gang-Ao
    Fang, Shuzhen
    Li, Xiang
    Ding, Yufeng
    Song, Yuqi
    He, Wei
    Rao, Zhihong
    Diao, Ke
    Zhu, Xiaolei
    Yang, Wenming
    FRONTIERS IN NEUROLOGY, 2024, 15
  • [29] Maritime accident risk prediction integrating weather data using machine learning
    Brandt, Peter
    Munim, Ziaul Haque
    Chaal, Meriam
    Kang, Hooi-Siang
    TRANSPORTATION RESEARCH PART D-TRANSPORT AND ENVIRONMENT, 2024, 136
  • [30] Utilization of deep learning models to predict calving time in dairy cattle from tail acceleration data
    Yang, Lingling
    Zhao, Jizheng
    Ying, Xiaoyi
    Lu, Cheng
    Zhou, Xinyi
    Gao, Yannian
    Wang, Lei
    Liu, Han
    Song, Huaibo
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2024, 225