Evaluation of Different Machine Learning Approaches to Predict Antigenic Distance Among Newcastle Disease Virus (NDV) Strains

Giovanni Franzo, Alice Fusaro, Chantal J. Snoeck, Aleksandar Dodovski, Steven Van Borm, Mieke Steensels, Vasiliki Christodoulou, Iuliana Onita, Raluca Burlacu, Azucena Sánchez Sánchez, Ilya A. Chvala, Mia Kim Torchetti, Ismaila Shittu, Mayowa Olabode, Ambra Pastori, Alessia Schivo, Angela Salomoni, Silvia Maniero, Ilaria Zambon, Francesco BonfanteIsabella Monne, Mattia Cecchinato, Alessio Bortolami

Research output: Contribution to journalArticlepeer-review

Abstract

Newcastle disease virus (NDV) continues to present a significant challenge for vaccination due to its rapid evolution and the emergence of new variants. Although molecular and sequence data are now quickly and inexpensively produced, genetic distance rarely serves as a good proxy for cross-protection, while experimental studies to assess antigenic differences are time consuming and resource intensive. In response to these challenges, this study explores and compares several machine learning (ML) methods to predict the antigenic distance between NDV strains as determined by hemagglutination-inhibition (HI) assays. By analyzing F and HN gene sequences alongside corresponding amino acid features, we developed predictive models aimed at estimating antigenic distances. Among the models evaluated, the random forest (RF) approach outperformed traditional linear models, achieving a predictive accuracy with an R2 value of 0.723 compared to only 0.051 for linear models based on genetic distance alone. This significant improvement demonstrates the usefulness of applying flexible ML approaches as a rapid and reliable tool for vaccine selection, minimizing the need for labor-intensive experimental trials. Moreover, the flexibility of this ML framework holds promise for application to other infectious diseases in both animals and humans, particularly in scenarios where rapid response and ethical constraints limit conventional experimental approaches.

Original languageEnglish (US)
Article number567
JournalViruses
Volume17
Issue number4
DOIs
StatePublished - Apr 2025
Externally publishedYes

Keywords

  • antigenic cartography
  • cross-protection
  • hemagglutination inhibition
  • machine learning
  • NDV
  • sequencing

ASJC Scopus subject areas

  • Infectious Diseases
  • Virology

Fingerprint

Dive into the research topics of 'Evaluation of Different Machine Learning Approaches to Predict Antigenic Distance Among Newcastle Disease Virus (NDV) Strains'. Together they form a unique fingerprint.

Cite this