Integrating diverse data sources to predict disease risk in dairy cattle-a machine learning approach

被引:8
|
作者
Lasser, Jana [1 ,2 ,3 ]
Matzhold, Caspar [1 ,3 ]
Egger-Danner, Christa [4 ]
Fuerst-Waltl, Birgit [5 ]
Steininger, Franz [4 ]
Wittek, Thomas [6 ]
Klimek, Peter [1 ,3 ]
机构
[1] Med Univ Vienna, Ctr Med Stat Informat & Intelligent Syst, Sect Sci Complex Syst, A-1090 Vienna, Austria
[2] Graz Univ Technol, Inst Interact Syst & Data Sci, A-8010 Graz, Austria
[3] Complex Sci Hub Vienna, A-1080 Vienna, Austria
[4] ZuchtData EDV Dienstleistungen GmbH, A-1200 Vienna, Austria
[5] Univ Nat Resources & Life Sci, Div Livestock Sci, A-1180 Vienna, Austria
[6] Vetmeduni Vienna, Univ Clin Ruminants, A-1210 Vienna, Austria
关键词
data integration; disease prediction; machine learning; precision livestock farming; LAMENESS SCORING SYSTEM; BODY CONDITION SCORE; TEST DAY MILK; COWS; MASTITIS; HEALTH; YIELD; ASSOCIATION; KETOSIS; TRAITS;
D O I
10.1093/jas/skab294
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
Livestock farming is currently undergoing a digital revolution and becoming increasingly data-driven. Yet, such data often reside in disconnected silos making them impossible to leverage their full potential to improve animal well-being. Here, we introduce a precision livestock farming approach, bringing together information streams from a variety of life domains of dairy cattle to study whether including more and diverse data sources improves the quality of predictions for eight diseases and whether using more complex prediction algorithms can, to some extent, compensate for less diverse data. Using three machine learning approaches of varying complexity (from logistic regression to gradient boosted trees) trained on data from 5,828 animals in 165 herds in Austria, we show that the prediction of lameness, acute and chronic mastitis, anestrus, ovarian cysts, metritis, ketosis (hyperketonemia), and periparturient hypocalcemia (milk fever) from routinely available data gives encouraging results. For example, we can predict lameness with high sensitivity and specificity (F1= 0.74). An analysis of the importance of individual variables to prediction performance shows that disease in dairy cattle is a product of the complex interplay between a multitude of life domains, such as housing, nutrition, or climate, that including more and diverse data sources increases prediction performance, and that the reuse of existing data can create actionable information for preventive interventions. Our findings pave the way toward data-driven point-of-care interventions and demonstrate the added value of integrating all available data in the dairy industry to improve animal well-being and reduce disease risk.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] MLMDA: a machine learning approach to predict and validate MicroRNA–disease associations by integrating of heterogenous information sources
    Kai Zheng
    Zhu-Hong You
    Lei Wang
    Yong Zhou
    Li-Ping Li
    Zheng-Wei Li
    Journal of Translational Medicine, 17
  • [2] Machine Learning to Predict Pregnancy in Dairy Cows: An Approach Integrating Automated Activity Monitoring and On-Farm Data
    Marques, Thaisa Campos
    Marques, Leticia Ribeiro
    Fernandes, Patrick Bezerra
    de Lima, Fabio Soares
    Paim, Tiago do Prado
    Leao, Karen Martins
    ANIMALS, 2024, 14 (11):
  • [3] A Machine Learning Approach for Rainfall Estimation Integrating Heterogeneous Data Sources
    Guarascio, Massimo
    Folino, Gianluigi
    Chiaravalloti, Francesco
    Gabriele, Salvatore
    Procopio, Antonio
    Sabatino, Pietro
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [4] MLMDA: a machine learning approach to predict and validate MicroRNA-disease associations by integrating of heterogenous information sources
    Zheng, Kai
    You, Zhu-Hong
    Wang, Lei
    Zhou, Yong
    Li, Li-Ping
    Li, Zheng-Wei
    JOURNAL OF TRANSLATIONAL MEDICINE, 2019, 17 (01)
  • [5] Evaluating machine learning algorithms to predict lameness in dairy cattle
    Neupane, Rajesh
    Aryal, Ashrant
    Haeussermann, Angelika
    Hartung, Eberhard
    Pinedo, Pablo
    Paudyal, Sushil
    PLOS ONE, 2024, 19 (07):
  • [6] Integrating data sources to improve hydraulic head predictions: A hierarchical machine learning approach
    Michael, WJ
    Minsker, BS
    Tcheng, D
    Valocchi, AJ
    Quinn, JJ
    WATER RESOURCES RESEARCH, 2005, 41 (03) : 1 - 14
  • [7] MasPA: A Machine Learning Application to Predict Risk of Mastitis in Cattle from AMS Sensor Data
    Ghafoor, Naeem Abdul
    Sitkowska, Beata
    AGRIENGINEERING, 2021, 3 (03): : 575 - 583
  • [8] Toward a Model to Predict Cardiovascular Disease Risk Using a Machine Learning Approach
    Slime, Khaoula
    Maizate, Abderrahim
    Hassouni, Larbi
    Mouine, Najat
    IAENG International Journal of Computer Science, 2024, 51 (05) : 519 - 527
  • [9] Application of machine-learning algorithms to predict calving difficulty in Holstein dairy cattle
    Avizheh, Mahdieh
    Dadpasand, Mohammad
    Dehnavi, Elena
    Keshavarzi, Hamideh
    ANIMAL PRODUCTION SCIENCE, 2023, 63 (11) : 1095 - 1104
  • [10] Integrating human services and criminal justice data with claims data to predict risk of opioid overdose among Medicaid beneficiaries: A machine-learning approach
    Lo-Ciganic, Wei-Hsuan
    Donohue, Julie M.
    Hulsey, Eric G.
    Barnes, Susan
    Li, Yuan
    Kuza, Courtney C.
    Yang, Qingnan
    Buchanich, Jeanine
    Huang, James L.
    Mair, Christina
    Wilson, Debbie L.
    Gellad, Walid F.
    PLOS ONE, 2021, 16 (03):