RIKEN PREDICTION

RIKEN website

Event Information

24th Prediction Science Seminar

Date
April 11th, 2024 (Thu.) 13:00-14:30 (JST)
Language
Engish
Place
C107(R-CCS) or online(zoom)

To join the seminar, please contact the Prediction Science Seminar Office: prediction-seminar[remove here]@ml.riken.jp

Program

Time Content Speaker
13:00-14:00 A Protein Language Model for Exploring Viral Fitness Landscapes Associate Prof. Jumpei ITO (Division of Systems Virology, Department of Microbiology and Immunology, The Institute of Medical Science, The University of Tokyo)
14:00-14:30 Discussion -

Abstract

Successively emerging SARS-CoV-2 variants lead to repeated epidemic surges through escalated transmissibility (i.e., fitness) in the human population. Modeling genotype-fitness relationship enables us to pinpoint the mutations boosting viral fitness and flag high-risk variants immediately after their detection. Here, we introduce CoVFit, a protein language model able to predict the fitness of variants based solely on their spike protein sequences. CoVFit was trained with genotype-fitness data derived from viral genome surveillance and functional mutation data related to immune evasion. CoVFit identified 549 fitness elevation events throughout SARS-CoV-2 evolution until late 2023. CoVFit successfully predicted the higher fitness of variants that emerged after training data creation through sequential or non-sequential evolution. Furthermore, a CoVFit-based simulation successfully predicted which mutations would be acquired next by the exisiting variants. Our study provides both insight into the SARS-CoV-2 fitness landscape and a novel tool potentially transforming viral genome surveillance.

Organizer

  • Prediction Science Laboratory (RIKEN CPR)
  • RIKEN Center for Biosystems Dynamics Research (BDR)

Co-organizer

  • Data Assimilation Research Team (R-CCS)
  • RIKEN Interdisciplinary Theoretical and Mathematical Sciences Program
  • Environmental Metabolic Analysis Research Team (RIKEN CSRS)
  • Computational Climate Science Research Team (R-CCS)
  • Medical Data Deep Learning Team (R-IH)
  • Medical Data Mathematical Reasoning Team (R-IH)
  • Laboratory for Physical Biology (RIKEN BDR)

PS Seminar Series