Integrating diverse data sources to predict disease risk in dairy cattle-a machine learning approach

被引:8
|
作者
Lasser, Jana [1 ,2 ,3 ]
Matzhold, Caspar [1 ,3 ]
Egger-Danner, Christa [4 ]
Fuerst-Waltl, Birgit [5 ]
Steininger, Franz [4 ]
Wittek, Thomas [6 ]
Klimek, Peter [1 ,3 ]
机构
[1] Med Univ Vienna, Ctr Med Stat Informat & Intelligent Syst, Sect Sci Complex Syst, A-1090 Vienna, Austria
[2] Graz Univ Technol, Inst Interact Syst & Data Sci, A-8010 Graz, Austria
[3] Complex Sci Hub Vienna, A-1080 Vienna, Austria
[4] ZuchtData EDV Dienstleistungen GmbH, A-1200 Vienna, Austria
[5] Univ Nat Resources & Life Sci, Div Livestock Sci, A-1180 Vienna, Austria
[6] Vetmeduni Vienna, Univ Clin Ruminants, A-1210 Vienna, Austria
关键词
data integration; disease prediction; machine learning; precision livestock farming; LAMENESS SCORING SYSTEM; BODY CONDITION SCORE; TEST DAY MILK; COWS; MASTITIS; HEALTH; YIELD; ASSOCIATION; KETOSIS; TRAITS;
D O I
10.1093/jas/skab294
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
Livestock farming is currently undergoing a digital revolution and becoming increasingly data-driven. Yet, such data often reside in disconnected silos making them impossible to leverage their full potential to improve animal well-being. Here, we introduce a precision livestock farming approach, bringing together information streams from a variety of life domains of dairy cattle to study whether including more and diverse data sources improves the quality of predictions for eight diseases and whether using more complex prediction algorithms can, to some extent, compensate for less diverse data. Using three machine learning approaches of varying complexity (from logistic regression to gradient boosted trees) trained on data from 5,828 animals in 165 herds in Austria, we show that the prediction of lameness, acute and chronic mastitis, anestrus, ovarian cysts, metritis, ketosis (hyperketonemia), and periparturient hypocalcemia (milk fever) from routinely available data gives encouraging results. For example, we can predict lameness with high sensitivity and specificity (F1= 0.74). An analysis of the importance of individual variables to prediction performance shows that disease in dairy cattle is a product of the complex interplay between a multitude of life domains, such as housing, nutrition, or climate, that including more and diverse data sources increases prediction performance, and that the reuse of existing data can create actionable information for preventive interventions. Our findings pave the way toward data-driven point-of-care interventions and demonstrate the added value of integrating all available data in the dairy industry to improve animal well-being and reduce disease risk.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Integrating satellite and climate data to predict wheat yield in Australia using machine learning approaches
    Cai, Yaping
    Guan, Kaiyu
    Lobell, David
    Potgieter, Andries B.
    Wang, Shaowen
    Peng, Jian
    Xu, Tianfang
    Asseng, Senthold
    Zhang, Yongguang
    You, Liangzhi
    Peng, Bin
    AGRICULTURAL AND FOREST METEOROLOGY, 2019, 274 : 144 - 159
  • [32] A machine learning approach to predict the most and the least feed-efficient groups in beef cattle
    Shirzadifar, Alimohammad
    Miar, Younes
    Plastow, Graham
    Basarab, J. A. John
    Li, Changxi
    Fitzsimmons, Carolyn
    Riazi, Mohammad
    Manafiazar, Ghader
    SMART AGRICULTURAL TECHNOLOGY, 2023, 5
  • [33] Development of a machine learning tool to predict the risk of incident chronic kidney disease using health examination data
    Yoshizaki, Yuki
    Kato, Kiminori
    Fujihara, Kazuya
    Sone, Hirohito
    Akazawa, Kohei
    FRONTIERS IN PUBLIC HEALTH, 2024, 12
  • [34] Letter to the editor: accuracy required in data entry when using machine learning techniques to predict risk of disease
    Korat, Akash
    Dobariya, Dhaval
    Bavaliya, Imtiyaz
    Mali, Vikram
    FUTURE MICROBIOLOGY, 2024, 19 (13) : 1109 - 1110
  • [35] Integrating multiple data sources for improved flight delay prediction using explainable machine learning
    Pineda-Jaramillo, Juan
    Munoz, Claudia
    Mesa-Arango, Rodrigo
    Gonzalez-Calderon, Carlos
    Lange, Anne
    RESEARCH IN TRANSPORTATION BUSINESS AND MANAGEMENT, 2024, 56
  • [36] Optimizing Micropropagation and Rooting Protocols for Diverse Lavender Genotypes: A Synergistic Approach Integrating Machine Learning Techniques
    Simsek, Ozhan
    Sekerci, Akife Dalda
    Isak, Musab A.
    Bulut, Fatma
    Izgu, Tolga
    Tutuncu, Mehmet
    Donmez, Dicle
    HORTICULTURAE, 2024, 10 (01)
  • [37] A Machine Learning Approach to Predict Groundwater Levels in California Reveals Ecosystems at Risk
    Rohde, Melissa M.
    Biswas, Tanushree
    Housman, Ian W.
    Campbell, Leah S.
    Klausmeyer, Kirk R.
    Howard, Jeanette K.
    FRONTIERS IN EARTH SCIENCE, 2021, 9
  • [38] A Machine Learning Approach to Predict Volatile Substance Abuse for Drug Risk Analysis
    Nath, Priyanka
    Kilam, Sumran
    Swetapadma, Aleena
    2017 THIRD IEEE INTERNATIONAL CONFERENCE ON RESEARCH IN COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (ICRCICN), 2017, : 255 - 258
  • [39] A machine learning approach to predict radioxenon isotopes concentrations using experimental data
    Azimi, Sepideh Alsadat
    Afarideh, Hossein
    Chai, Jong-Seo
    Kalinowski, Martin
    RADIATION PHYSICS AND CHEMISTRY, 2023, 213
  • [40] A Machine Learning Model and Data Visualization Approach to Predict In-Hospital Sepsis
    Kan, Chin Fung Kelvin
    Kotter, Adam
    Wan, Yik-Ki Jacob
    Morgan, Keaton
    Abdelrahman, Samir
    Madaras-Kelly, Karl
    Del Fiol, Guilherme
    ANESTHESIA AND ANALGESIA, 2024, 139 (05): : 952 - 953