24th Prediction Science Seminar
- Date
- April 11th, 2024 (Thu.) 13:00-14:30 (JST)
- Language
- Engish
- Place
- C107(R-CCS) or online(zoom)
To join the seminar, please contact the Prediction Science Seminar Office: prediction-seminar[remove here]@ml.riken.jp
Program
Time | Content | Speaker |
---|---|---|
13:00-14:00 | A Protein Language Model for Exploring Viral Fitness Landscapes | Associate Prof. Jumpei ITO (Division of Systems Virology, Department of Microbiology and Immunology, The Institute of Medical Science, The University of Tokyo) |
14:00-14:30 | Discussion | - |
Abstract
Successively emerging SARS-CoV-2 variants lead to repeated epidemic surges through escalated transmissibility (i.e., fitness) in the human population. Modeling genotype-fitness relationship enables us to pinpoint the mutations boosting viral fitness and flag high-risk variants immediately after their detection. Here, we introduce CoVFit, a protein language model able to predict the fitness of variants based solely on their spike protein sequences. CoVFit was trained with genotype-fitness data derived from viral genome surveillance and functional mutation data related to immune evasion. CoVFit identified 549 fitness elevation events throughout SARS-CoV-2 evolution until late 2023. CoVFit successfully predicted the higher fitness of variants that emerged after training data creation through sequential or non-sequential evolution. Furthermore, a CoVFit-based simulation successfully predicted which mutations would be acquired next by the exisiting variants. Our study provides both insight into the SARS-CoV-2 fitness landscape and a novel tool potentially transforming viral genome surveillance.
Organizer
- Prediction Science Laboratory (RIKEN CPR)
- RIKEN Center for Biosystems Dynamics Research (BDR)
Co-organizer
- Data Assimilation Research Team (R-CCS)
- RIKEN Interdisciplinary Theoretical and Mathematical Sciences Program
- Environmental Metabolic Analysis Research Team (RIKEN CSRS)
- Computational Climate Science Research Team (R-CCS)
- Medical Data Deep Learning Team (R-IH)
- Medical Data Mathematical Reasoning Team (R-IH)
- Laboratory for Physical Biology (RIKEN BDR)