<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3-mathml3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:ali="http://www.niso.org/schemas/ali/1.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="review-article" dtd-version="1.3" xml:lang="EN">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">J. Pharm. Pharm. Sci.</journal-id>
<journal-title-group>
<journal-title>Journal of Pharmacy &#x26; Pharmaceutical Sciences</journal-title>
<abbrev-journal-title abbrev-type="pubmed">J. Pharm. Pharm. Sci.</abbrev-journal-title>
</journal-title-group>
<issn pub-type="epub">1482-1826</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">16146</article-id>
<article-id pub-id-type="doi">10.3389/jpps.2026.16146</article-id>
<article-version article-version-type="Version of Record" vocab="NISO-RP-8-2008"/>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Mini Review</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Artificial intelligence directed computational protein design: lessons from COVID-19 for pandemic-ready vaccines and antibody therapeutics</article-title>
<alt-title alt-title-type="left-running-head">Kaushik and Re</alt-title>
<alt-title alt-title-type="right-running-head">
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/jpps.2026.16146">10.3389/jpps.2026.16146</ext-link>
</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Kaushik</surname>
<given-names>Rahul</given-names>
</name>
<xref ref-type="aff" rid="aff1"/>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1027183"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Re</surname>
<given-names>Suyong</given-names>
</name>
<xref ref-type="aff" rid="aff1"/>
</contrib>
</contrib-group>
<aff id="aff1">
<institution>Laboratory of In-Silico Design, Artificial Intelligence Center for Health and Biomedical Research, National Institutes of Biomedical Innovation, Health and Nutrition (NIBN)</institution>, <city>Osaka</city>, <country country="JP">Japan</country>
</aff>
<author-notes>
<corresp id="c001">
<label>&#x2a;</label>Correspondence: Rahul Kaushik, <email xlink:href="mailto:rkaushik@nibn.go.jp">rkaushik@nibn.go.jp</email>
</corresp>
</author-notes>
<pub-date publication-format="electronic" date-type="pub" iso-8601-date="2026-05-08">
<day>08</day>
<month>05</month>
<year>2026</year>
</pub-date>
<pub-date publication-format="electronic" date-type="collection">
<year>2026</year>
</pub-date>
<volume>29</volume>
<elocation-id>16146</elocation-id>
<history>
<date date-type="received">
<day>26</day>
<month>12</month>
<year>2025</year>
</date>
<date date-type="rev-recd">
<day>15</day>
<month>04</month>
<year>2026</year>
</date>
<date date-type="accepted">
<day>24</day>
<month>04</month>
<year>2026</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2026 Kaushik and Re.</copyright-statement>
<copyright-year>2026</copyright-year>
<copyright-holder>Kaushik and Re</copyright-holder>
<license>
<ali:license_ref start_date="2026-05-08">https://creativecommons.org/licenses/by/4.0/</ali:license_ref>
<license-p>This is an open-access article distributed under the terms of the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution License (CC BY)</ext-link>. The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</license-p>
</license>
</permissions>
<abstract>
<p>Artificial intelligence (AI) directed computational protein design has emerged as a transformative force in modern therapeutic discovery, reshaping how vaccines and antibody-based interventions are conceived, optimized, and deployed against emerging infectious diseases. The COVID-19 pandemic served as an unprecedented real-world stress test for these technologies, highlighting their potential to accelerate antigen design, guide antibody optimization, and anticipate viral evolution in near real time. AI driven approaches contributed to faster characterization of viral variants, supported vaccine and broadly neutralizing antibodies developments. Despite the significant contributions, the pandemic also revealed important limitations that must be addressed before such approaches can be relied upon as cornerstones of global preparedness. Challenges related to data bias, model interpretability, experimental validation bottlenecks, and integration with existing regulatory frameworks became increasingly apparent. In several cases, the gap between computational promise and translational readiness underscored the need for closer coupling between <italic>in silico</italic> design, laboratory experimentation, and clinical evaluation. Moreover, the rapid pace of AI innovation often outstripped established regulatory pathways, raising questions about standardization, validation, and long-term safety. This mini review provides a focused overview of recent advances in AI enabled computational protein design, with an emphasis on applications relevant to pandemic response. Drawing on lessons from COVID-19 case studies, it examines translational and regulatory considerations, highlights unresolved controversies, and identifies critical research gaps. Collectively, these insights outline a path toward transitioning AI designed vaccines and antibody therapeutics from reactive emergency tools into proactive, scalable infrastructures for future pandemic preparedness.</p>
</abstract>
<kwd-group>
<kwd>AI regulations</kwd>
<kwd>artificial intelligence</kwd>
<kwd>biologics</kwd>
<kwd>COVID-19</kwd>
<kwd>pandemic preparedness</kwd>
</kwd-group>
<funding-group>
<funding-statement>The author(s) declared that financial support was not received for this work and/or its publication.</funding-statement>
</funding-group>
<counts>
<fig-count count="1"/>
<table-count count="1"/>
<equation-count count="0"/>
<ref-count count="45"/>
<page-count count="9"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>Introduction</title>
<p>The growing frequency of emerging and re-emerging infectious diseases has revealed fundamental limitations in traditional pharmaceutical development pipelines. Conventional vaccine and antibody discovery processes typically rely on sequential experimental screening, optimization, and validation, often requiring years to progress from concept to clinical deployment [<xref ref-type="bibr" rid="B1">1</xref>&#x2013;<xref ref-type="bibr" rid="B5">5</xref>]. Such timelines are poorly suited to rapidly spreading pathogens capable of global dissemination within months, particularly in a highly interconnected global landscape. The COVID-19 pandemic starkly illustrated this mismatch, as unprecedented scientific mobilization was required to counter a rapidly evolving viral threat. At the same time, COVID-19 catalyzed the widespread adoption of artificial intelligence (AI)-directed computational approaches as tools for accelerating therapeutic discovery. Advances in data availability, computing infrastructure, and machine learning algorithms converged to enable rapid analysis of viral genomes, structural modeling of antigens, and prioritization of therapeutic candidates at a scale that was not previously feasible. These developments signaled a shift away from purely empirical discovery toward data-driven and design-oriented paradigms [<xref ref-type="bibr" rid="B2">2</xref>, <xref ref-type="bibr" rid="B6">6</xref>&#x2013;<xref ref-type="bibr" rid="B10">10</xref>].</p>
<p>AI-enabled protein design represents a fundamental rethinking of how biologics are conceptualized and developed. By leveraging large-scale biological data and machine learning models capable of learning complex sequence&#x2013;structure&#x2013;function relationships, these approaches allow systematic exploration of protein space at a scale unattainable by experimental methods alone. Importantly, AI-driven design does not replace experimental science; rather, it reframes it by prioritizing hypotheses, narrowing candidate space, and enabling rational iteration under severe time constraints [<xref ref-type="bibr" rid="B3">3</xref>, <xref ref-type="bibr" rid="B5">5</xref>, <xref ref-type="bibr" rid="B11">11</xref>&#x2013;<xref ref-type="bibr" rid="B13">13</xref>]. During the COVID-19 pandemic, AI-driven pipelines contributed to antibody discovery, antigen stabilization, variant impact assessment, and scaffold design, compressing early discovery timelines and enabling rapid iteration in response to viral evolution. These applications spanned multiple methodological paradigms, including sequence-based learning, structure-aware modeling, and generative optimization [<xref ref-type="bibr" rid="B7">7</xref>&#x2013;<xref ref-type="bibr" rid="B9">9</xref>, <xref ref-type="bibr" rid="B14">14</xref>&#x2013;<xref ref-type="bibr" rid="B16">16</xref>]. The diversity of successful approaches highlighted both the flexibility of AI-enabled methods and the importance of aligning computational strategies with specific biological questions [<xref ref-type="bibr" rid="B1">1</xref>&#x2013;<xref ref-type="bibr" rid="B3">3</xref>, <xref ref-type="bibr" rid="B13">13</xref>, <xref ref-type="bibr" rid="B17">17</xref>].</p>
<p>However, the pandemic also demonstrated that computational acceleration does not automatically translate into clinical or public health impact. Challenges related to reproducibility, model interpretability, experimental validation, data availability, manufacturing scalability, and regulatory acceptance remain significant barriers. Addressing these challenges requires a balanced and critical assessment of what AI-directed protein design can currently deliver, where it falls short, and how it can be responsibly integrated into sustainable pharmaceutical development pipelines.</p>
<p>This mini-review adopts a narrative synthesis approach to evaluate recent advances in AI-directed computational protein design, examining evidence-based lessons from COVID-19 to inform the development of truly pandemic-ready vaccines and antibody therapeutics. As schematically summarized in <xref ref-type="fig" rid="F1">Figure 1</xref>, this framework integrates the methodological, empirical, and regulatory themes explored throughout the following sections to provide a holistic view of the current biodefense landscape.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>Integrated Framework for AI-Directed Pandemic Preparedness. The schematic provides a multi-quadrant synthesis of the review&#x2019;s core sections. Upper Left: Technical paradigms in sequence, structure, and generative modeling (referencing Section <italic>AI-directed computational protein design in therapeutics development</italic>; <xref ref-type="table" rid="T1">Table 1</xref>). Upper Right: Lessons from COVID-19 applications, including rapid antibody discovery and variant surveillance (referencing Section <italic>Methodological paradigms in protein design</italic>). Lower Right: Future trajectories toward broad-spectrum vaccines and &#x201c;One Health&#x201d; integration (referencing Section <italic>Translational and regulatory considerations</italic>). Lower Left: Critical translational barriers, including experimental validation and regulatory frameworks (referencing Section <italic>Future directions: Toward pandemic-ready biologics</italic>). The central axis illustrates the convergence of these strategies to accelerate the delivery of next-generation biologics. (Created with <ext-link ext-link-type="uri" xlink:href="http://BioRender.com">BioRender.com</ext-link>).</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="jpps-29-16146-g001.tif">
<alt-text content-type="machine-generated">Infographic illustrating the cycle of AI-driven protein design, learnings from COVID-19, future directions, and translational and regulatory challenges in antibody and vaccine development, using visual icons for each process.</alt-text>
</graphic>
</fig>
</sec>
<sec id="s2">
<title>Literature selection and methodology</title>
<p>This mini review adopts a narrative synthesis approach to evaluate the integration of artificial intelligence in protein design, focusing on the transition from traditional structural biology to modern generative paradigms. To ensure transparency and reproducibility in our literature selection, we performed systematic searches across PubMed, Google Scholar, and preprint servers (bioRxiv and medRxiv).</p>
<sec id="s2-1">
<title>Search strategy and identification</title>
<p>The search strategy utilized targeted strings designed to capture the intersection of machine learning and structural virology, including: AI-directed protein design, machine learning for antibody optimization, generative modeling for SARS-CoV-2 antigens, and <italic>de novo</italic> protein design COVID-19.</p>
</sec>
<sec id="s2-2">
<title>Inclusion and exclusion criteria</title>
<p>Studies were primarily included based on their contribution to the methodological shift toward data-driven paradigms or their direct impact on the accelerated development of vaccines and antibody therapeutics published between 2020 and late 2025. Of the 45 total references cited in this review, 17 are utilized in the Introduction to establish the broader scientific and historical context. The remaining 28 references were specifically selected and analyzed to describe the technical evolution of AI/ML-based strategies in the field of computational protein design. To maintain a focused scope, articles focusing exclusively on clinical outcomes or epidemiological trends without a substantial computational design or structural modeling component were excluded.</p>
</sec>
<sec id="s2-3">
<title>Quality assessment and synthesis</title>
<p>The quality and technical rigor of the selected literature were assessed based on three primary pillars: Peer-review status, clarity of the machine learning architectures employed, and experimental or biophysical validation to ground the reported computational predictions. The insights gathered from this literature selection are synthesized to discuss how these advancements inform the development of truly pandemic-ready biologicals, as schematically summarized in <xref ref-type="fig" rid="F1">Figure 1</xref>.</p>
</sec>
</sec>
<sec id="s3">
<title>AI-directed computational protein design in therapeutics development</title>
<p>Protein-based vaccines and antibody therapeutics rely fundamentally on precise molecular recognition. Traditional experimental approaches, such as phage display, animal immunization, and high-throughput screening are resource-intensive and time-consuming. Computational protein design offers a complementary strategy by enabling rational exploration of vast sequence and structural spaces. Recent advances in artificial intelligence (AI) and computational modeling have transformed both vaccine and antibody therapeutic development, enabling more systematic, scalable, and predictive design workflows [<xref ref-type="bibr" rid="B7">7</xref>&#x2013;<xref ref-type="bibr" rid="B9">9</xref>, <xref ref-type="bibr" rid="B11">11</xref>, <xref ref-type="bibr" rid="B18">18</xref>, <xref ref-type="bibr" rid="B19">19</xref>]. Across these applications, sequence-based and structure-informed AI models facilitate the identification, optimization, and functional evaluation of biologics, complementing experimental approaches and reducing reliance on iterative laboratory screening.</p>
<sec id="s3-1">
<title>Sequence-based modeling</title>
<p>Sequence-based modeling constitutes the foundation of many AI-driven protein design approaches. By learning statistical patterns from large collections of natural protein sequences, these models capture evolutionary constraints associated with folding, stability, and function. Recent deep learning architectures have enabled sequence models to represent long-range dependencies, contextual relationships, and epistatic interactions that were previously difficult to capture using classical bioinformatics methods [<xref ref-type="bibr" rid="B6">6</xref>, <xref ref-type="bibr" rid="B10">10</xref>, <xref ref-type="bibr" rid="B20">20</xref>, <xref ref-type="bibr" rid="B21">21</xref>]. During the COVID-19 pandemic, sequence-based analyses were widely applied to rapidly screen antibody repertoires, identify conserved viral regions, and prioritize targets less susceptible to immune escape [<xref ref-type="bibr" rid="B1">1</xref>, <xref ref-type="bibr" rid="B2">2</xref>, <xref ref-type="bibr" rid="B5">5</xref>, <xref ref-type="bibr" rid="B12">12</xref>, <xref ref-type="bibr" rid="B22">22</xref>]. These approaches were particularly valuable in the earliest phases of the outbreak, when structural information was scarce but viral genomic sequences were rapidly accumulating. In this context, sequence-based models enabled rapid hypothesis generation and target prioritization at a time when experimental resources were limited.</p>
<p>Sequence-based approaches have also proven useful for assessing mutational tolerance and identifying regions of functional constraint across viral proteins. Such analyses informed the selection of epitopes likely to remain stable under immune pressure, thereby supporting the design of more durable vaccines and antibody therapeutics [<xref ref-type="bibr" rid="B11">11</xref>, <xref ref-type="bibr" rid="B23">23</xref>, <xref ref-type="bibr" rid="B24">24</xref>]. However, these predictions are inherently probabilistic and depend strongly on the diversity and representativeness of training data.</p>
<p>Despite their utility, sequence-only models have inherent limitations. They may fail to fully capture three-dimensional interactions, conformational dynamics, post-translational modifications, and context-dependent effects that strongly influence binding and stability [<xref ref-type="bibr" rid="B4">4</xref>, <xref ref-type="bibr" rid="B7">7</xref>, <xref ref-type="bibr" rid="B10">10</xref>, <xref ref-type="bibr" rid="B15">15</xref>, <xref ref-type="bibr" rid="B16">16</xref>, <xref ref-type="bibr" rid="B25">25</xref>]. Consequently, sequence-based predictions are most effective when used as an initial filtering step, guiding subsequent structure-aware analyses rather than serving as standalone decision tools. Recognizing these strengths and limitations is essential for their effective translational application.</p>
</sec>
<sec id="s3-2">
<title>Structure-based modeling</title>
<p>Structure-based modeling leverages three-dimensional information to predict protein folding, binding interactions, and conformational changes that underlie biological function. The availability of high-quality experimental structures, combined with major advances in deep learning&#x2013;based structure prediction, dramatically expanded the applicability of structure-guided design during the COVID-19 pandemic [<xref ref-type="bibr" rid="B4">4</xref>, <xref ref-type="bibr" rid="B7">7</xref>, <xref ref-type="bibr" rid="B11">11</xref>, <xref ref-type="bibr" rid="B19">19</xref>, <xref ref-type="bibr" rid="B21">21</xref>]. Structural models of the SARS-CoV-2 spike protein and its complexes with neutralizing antibodies enabled rational identification of binding hotspots, informed affinity maturation strategies, and supported the design of stabilized antigens [<xref ref-type="bibr" rid="B3">3</xref>, <xref ref-type="bibr" rid="B6">6</xref>, <xref ref-type="bibr" rid="B13">13</xref>, <xref ref-type="bibr" rid="B26">26</xref>]. Beyond static representations, structure-based approaches facilitated comparative analysis of antibody binding modes and epitope accessibility, providing mechanistic explanations for differences in neutralization potency. Importantly, structure-based modeling also contributed to understanding why certain antibodies retained activity against emerging variants while others rapidly lost efficacy. By visualizing how specific mutations altered local geometry or electrostatic interactions, these models provided actionable insights for therapeutic redesign and cocktail formulation [<xref ref-type="bibr" rid="B18">18</xref>, <xref ref-type="bibr" rid="B22">22</xref>].</p>
<p>Nevertheless, significant uncertainties persist, particularly flexible regions, transient conformations, glycosylated surfaces, and multimeric assemblies. These limitations underscore the need for cautious interpretation of structural predictions and for integrating multiple modeling approaches with experimental validation. In translational settings, structure-based predictions must therefore be viewed as probabilistic guides rather than definitive answers.</p>
</sec>
<sec id="s3-3">
<title>Generative and optimization approaches</title>
<p>Beyond screening existing biological sequences, generative AI models enable <italic>de novo</italic> protein design by learning to sample novel structures and sequences optimized for predefined objectives. These approaches allow for the simultaneous consideration of multi-objective properties, including binding affinity, stability, solubility, and manufacturability [<xref ref-type="bibr" rid="B10">10</xref>, <xref ref-type="bibr" rid="B16">16</xref>, <xref ref-type="bibr" rid="B25">25</xref>, <xref ref-type="bibr" rid="B27">27</xref>]. During the COVID-19 response, generative methods were applied to design mini-proteins and optimized antibody scaffolds, demonstrating their potential to expand the functional space of therapeutics beyond evolutionary precedents [<xref ref-type="bibr" rid="B13">13</xref>, <xref ref-type="bibr" rid="B15">15</xref>, <xref ref-type="bibr" rid="B26">26</xref>].</p>
<sec id="s3-3-1">
<title>Concepts and technical architectures</title>
<p>Modern generative efforts have transitioned from purely statistical sequence modeling to structure-first paradigms driven by Denoising Diffusion Probabilistic Models (DDPMs) and Flow-Matching architectures. These models treat protein design as the reversal of a physical corruption process; a model is trained to denoise a random cloud of points into a biologically plausible protein fold. Unlike older autoregressive models that treat proteins as 1D token streams, diffusion-based frameworks such as RFdiffusion, explicitly construct 3D coordinates (backbone C&#x3b1; atoms and residue frames) throughout the generative sampling process [<xref ref-type="bibr" rid="B13">13</xref>, <xref ref-type="bibr" rid="B28">28</xref>]. This allows the AI to navigate the complex equivariant space of protein geometry, ensuring that global symmetry and fold-topology are maintained with atomic-level intent.</p>
</sec>
<sec id="s3-3-2">
<title>Evaluation and designability</title>
<p>A critical component of this generative process is the <italic>in silico</italic> validation of hallucinated structures before they reach the wet lab. To ensure biological plausibility, generated backbones are subjected to a Self-Consistency (scRMSD) evaluation. This involves using an independent, high-fidelity structure predictor (e.g., AlphaFold2 or ESMFold) to forward-fold the AI-designed sequence. A design is considered feasible only if the predicted structure matches the original generative 3D backbone with high precision (scRMSD &#x3c;2&#xa0;&#xc5;).</p>
<p>This iterative workflow, combining generative sampling with structural consistency oracles and biophysical filtering, reinforces the importance of hybrid computational-experimental pipelines in achieving the rapid iteration required for pandemic preparedness [<xref ref-type="bibr" rid="B15">15</xref>, <xref ref-type="bibr" rid="B28">28</xref>].</p>
</sec>
</sec>
<sec id="s3-4">
<title>Immune escape and variant prediction</title>
<p>A defining feature of the COVID-19 pandemic was the rapid emergence of viral variants with altered transmissibility and immune evasion properties. AI-driven models integrating sequence evolution, structural constraints, and mutational scanning data were widely used to assess how specific substitutions might impact antibody binding and vaccine efficacy. These approaches supported real-time evaluation of emerging variants and informed therapeutic updates [<xref ref-type="bibr" rid="B1">1</xref>, <xref ref-type="bibr" rid="B2">2</xref>, <xref ref-type="bibr" rid="B5">5</xref>, <xref ref-type="bibr" rid="B26">26</xref>, <xref ref-type="bibr" rid="B29">29</xref>]. Immune escape prediction represents one of the most challenging and consequential applications of AI-directed protein design. While short-term mutational effects can often be captured with reasonable accuracy, forecasting longer-term evolutionary trajectories under immune pressure is considerably more complex. Viral evolution is shaped by trade-offs between transmissibility, immune evasion, replication efficiency, and fitness costs, factors that are difficult to model comprehensively [<xref ref-type="bibr" rid="B2">2</xref>, <xref ref-type="bibr" rid="B6">6</xref>, <xref ref-type="bibr" rid="B16">16</xref>, <xref ref-type="bibr" rid="B25">25</xref>, <xref ref-type="bibr" rid="B26">26</xref>].</p>
<p>The COVID-19 experience highlighted the need for cautious interpretation of immune escape predictions and for continuous model updating as new data become available. Rather than serving as definitive forecasts, these models are best viewed as tools for scenario exploration and risk prioritization, informing preparedness strategies rather than dictating fixed design decisions.</p>
</sec>
<sec id="s3-5">
<title>Translational impact</title>
<p>Experience from COVID-19 highlighted that computational approaches have moved beyond supportive roles to become essential tools in pharmaceutical development. While experimental validation remains indispensable, AI-enabled modeling provides a structured framework for narrowing candidate spaces, integrating diverse datasets, and accelerating the discovery and optimization of both vaccines and antibody therapeutics under conditions requiring speed, adaptability, and scalability [<xref ref-type="bibr" rid="B2">2</xref>, <xref ref-type="bibr" rid="B3">3</xref>, <xref ref-type="bibr" rid="B11">11</xref>, <xref ref-type="bibr" rid="B12">12</xref>, <xref ref-type="bibr" rid="B26">26</xref>]. Thus, the success of AI driven computational approaches depends heavily on experimental validation, manufacturability considerations, and regulatory pathways, highlighting the importance of integration across the drug development pipeline.</p>
<p>While these applications illustrate the functional scope of AI-directed computational protein design, their performance is ultimately determined by the underlying machine learning paradigms used to represent, learn, and optimize protein sequence&#x2013;structure&#x2013;function relationships. Understanding these methodological foundations is essential for interpreting both the strengths and limitations of AI-enabled design pipelines.</p>
</sec>
</sec>
<sec id="s4">
<title>Methodological paradigms in protein design</title>
<p>Methodologically, recent advances in AI-driven protein design are supported by three complementary paradigms: sequence-based protein language models, structure-informed machine learning, and multimodal generative frameworks. These paradigms, supported by the representative computational tools summarized in <xref ref-type="table" rid="T1">Table 1</xref>, have transitioned the field from descriptive modeling to predictive and generative engineering.</p>
<table-wrap id="T1" position="float">
<label>TABLE 1</label>
<caption>
<p>Representative AI/ML-based computational tools available for protein structure prediction, antibody engineering, vaccine antigen design, and immune escape analysis. The key features for each tool include its application, datasets used in training, AI/ML architecture, and over contribution to the field in a sequential manner.</p>
</caption>
<table>
<tbody valign="top">
<tr>
<td colspan="2" align="left">A. Protein structural characterization</td>
</tr>
<tr>
<td align="left">Tool: <italic>AlphaFold3</italic> [<xref ref-type="bibr" rid="B7">7</xref>]<break/>Access: <ext-link ext-link-type="uri" xlink:href="https://alphafoldserver.com/">https://alphafoldserver.com</ext-link>
<break/>Predicts biomolecular complex structures<break/>Protein sequence and structures data<break/>Diffusion-based deep learning<break/>Structure-guided therapeutic design</td>
<td align="left">Tool: <italic>RoseTTAFold</italic> [<xref ref-type="bibr" rid="B8">8</xref>]<break/>Access: <ext-link ext-link-type="uri" xlink:href="https://robetta.bakerlab.org/">https://robetta.bakerlab.org</ext-link>
<break/>Predicts protein structures and complexes<break/>Sequences, MSAs, and structural data<break/>Three-track neural network<break/>Reliable and interpretable structure prediction</td>
</tr>
<tr>
<td align="left">Tool: <italic>ColabFold</italic> [<xref ref-type="bibr" rid="B14">14</xref>]<break/>Access: <ext-link ext-link-type="uri" xlink:href="https://github.com/sokrypton/ColabFold">https://github.com/sokrypton/ColabFold</ext-link>
<break/>Accelerates protein structure prediction<break/>Protein sequences with fast MSA generation<break/>AlphaFold2-based deep neural architecture<break/>Democratized large-scale structure prediction</td>
<td align="left">Tool: OpenFold [<xref ref-type="bibr" rid="B9">9</xref>]<break/>Access: <ext-link ext-link-type="uri" xlink:href="https://github.com/aqlaboratory/openfold">https://github.com/aqlaboratory/openfold</ext-link>
<break/>Reimplements AlphaFold2 for open research<break/>Public protein sequence and structure datasets<break/>Transformer-based attention architecture<break/>Enabled transparency and benchmarking</td>
</tr>
<tr>
<td colspan="2" align="left">B. Protein-ligand and binding prediction</td>
</tr>
<tr>
<td align="left">Tool: <italic>DiffDock</italic> [<xref ref-type="bibr" rid="B30">30</xref>]<break/>Access: <ext-link ext-link-type="uri" xlink:href="https://github.com/gcorso/DiffDock">https://github.com/gcorso/DiffDock</ext-link>
<break/>Predicts protein&#x2013;ligand binding poses<break/>Protein&#x2013;ligand complex datasets<break/>Geometric deep learning diffusion models<break/>Advanced structure-based drug discovery</td>
<td align="left">Tool: <italic>GNINA</italic> [<xref ref-type="bibr" rid="B31">31</xref>]<break/>Access: <ext-link ext-link-type="uri" xlink:href="https://github.com/gnina/gnina">https://github.com/gnina/gnina</ext-link>
<break/>Scores protein&#x2013;ligand docking poses<break/>Labeled docking and binding affinity data<break/>Convolutional neural networks<break/>Improved docking-based virtual screening</td>
</tr>
<tr>
<td colspan="2" align="left">C. Generative protein design</td>
</tr>
<tr>
<td align="left">Tool: <italic>ProteinMPNN</italic> [<xref ref-type="bibr" rid="B15">15</xref>]<break/>Access: <ext-link ext-link-type="uri" xlink:href="https://github.com/dauparas/ProteinMPNN">https://github.com/dauparas/ProteinMPNN</ext-link>
<break/>Designs sequences on protein backbones<break/>Experimentally determined protein structures<break/>Graph neural network architecture<break/>Robust <italic>de novo</italic> protein sequence design</td>
<td align="left">Tool: <italic>RFDiffusion</italic> [<xref ref-type="bibr" rid="B28">28</xref>] <ext-link ext-link-type="uri" xlink:href="https://github.com/RosettaCommons/RFdiffusion">https://github.com/RosettaCommons/RFdiffusion</ext-link>
<break/>Generates protein backbones and binders<break/>Protein structural coordinates<break/>Denoising diffusion probabilistic models<break/>Introduced diffusion-based protein design</td>
</tr>
<tr>
<td align="left">Tool: <italic>ProGen2</italic> [<xref ref-type="bibr" rid="B25">25</xref>]<break/>Access: <ext-link ext-link-type="uri" xlink:href="https://github.com/salesforce/progen">https://github.com/salesforce/progen</ext-link>
<break/>Generates functional protein sequences<break/>Large-scale protein sequence databases<break/>Autoregressive transformer language model<break/>Scalable generative protein modeling</td>
<td align="left">Tool: <italic>Chroma</italic> [<xref ref-type="bibr" rid="B13">13</xref>]<break/>Access: <ext-link ext-link-type="uri" xlink:href="https://github.com/generatebio/chroma">https://github.com/generatebio/chroma</ext-link>
<break/>Designs functional proteins<break/>Protein structures and sequences<break/>Diffusion-based generative model<break/>Expanded controllable protein design</td>
</tr>
<tr>
<td colspan="2" align="left">D. Protein language models (PLMs)</td>
</tr>
<tr>
<td align="left">Tool: <italic>ESM-2</italic> [<xref ref-type="bibr" rid="B27">27</xref>]<break/>Access: <ext-link ext-link-type="uri" xlink:href="https://github.com/facebookresearch/esm">https://github.com/facebookresearch/esm</ext-link>
<break/>Protein representations and mutation effects<break/>Hundreds of millions of protein sequences<break/>Transformer-based protein language model<break/>Structure, function, and evolution modeling</td>
<td align="left">Tool: <italic>ProtT5</italic> [<xref ref-type="bibr" rid="B16">16</xref>]<break/>Access: <ext-link ext-link-type="uri" xlink:href="https://huggingface.co/Rostlab">https://huggingface.co/Rostlab</ext-link>
<break/>Generates embeddings for protein annotation<break/>Large unlabeled protein sequence datasets<break/>Encoder&#x2013;decoder transformer architecture<break/>Enabled transfer learning in protein biology</td>
</tr>
<tr>
<td colspan="2" align="left">E. Variant effect and evolutionary modeling</td>
</tr>
<tr>
<td align="left">Tool: <italic>EVE/EVEscape</italic> [<xref ref-type="bibr" rid="B26">26</xref>]<break/>Access: <ext-link ext-link-type="uri" xlink:href="https://github.com/OATML/EVE">https://github.com/OATML/EVE</ext-link>
<break/>Predicts variant fitness and immune escape<break/>Evolutionary sequence variation (MSAs)<break/>Bayesian variational autoencoder<break/>Linked sequence, disease and immune escape</td>
<td align="left">Tool: <italic>MAVE-NN</italic> [<xref ref-type="bibr" rid="B29">29</xref>]<break/>Access: <ext-link ext-link-type="uri" xlink:href="https://github.com/fhalab/MLDE">https://github.com/fhalab/MLDE</ext-link>
<break/>Learns genotype&#x2013;phenotype relationships<break/>Deep mutational scanning data<break/>Supervised neural networks<break/>Interpreted experimental variant effects</td>
</tr>
<tr>
<td colspan="2" align="left">F. Immunoinformatics and vaccine design</td>
</tr>
<tr>
<td align="left">Tool: <italic>NetMHCpan/NetMHCIIpan</italic> [<xref ref-type="bibr" rid="B24">24</xref>]<break/>Access: <ext-link ext-link-type="uri" xlink:href="https://services.healthtech.dtu.dk/">https://services.healthtech.dtu.dk</ext-link>
<break/>Predicts peptide&#x2013;MHC class I and II binding<break/>Peptide binding and eluted ligand data<break/>Neural network ensemble models<break/>State of the art for epitope discovery</td>
<td align="left">Tool: <italic>IEDB Analysis Resource</italic> [<xref ref-type="bibr" rid="B23">23</xref>]<break/>Access: <ext-link ext-link-type="uri" xlink:href="https://tools.iedb.org/main">https://tools.iedb.org/main</ext-link>
<break/>Epitope prediction and analysis tools<break/>Curated immune epitope experimental data<break/>Multiple ML and statistical models<break/>Platform for immunoinformatics</td>
</tr>
<tr>
<td align="left">Tool: <italic>DeepAb</italic> [<xref ref-type="bibr" rid="B32">32</xref>]<break/>Access:<ext-link ext-link-type="uri" xlink:href="https://github.com/RosettaCommons/DeepAb">https://github.com/RosettaCommons/DeepAb</ext-link>
<break/>Antibody variable-region structures<break/>Antibody sequences and structures<break/>Deep learning with attention mechanisms<break/>Improved CDR loop modeling</td>
<td align="left">Tool: <italic>IgFold</italic> [<xref ref-type="bibr" rid="B11">11</xref>]<break/>Access: <ext-link ext-link-type="uri" xlink:href="https://github.com/Graylab/IgFold">https://github.com/Graylab/IgFold</ext-link>
<break/>Antibody structures prediction<break/>Curated antibody structural datasets<break/>End-to-end deep learning framework<break/>Enabled fast therapeutic antibody modeling</td>
</tr>
</tbody>
</table>
</table-wrap>
<sec id="s4-1">
<title>Sequence-based protein language models (PLMs)</title>
<p>Protein language models (PLMs) apply self-supervised learning to large-scale sequence datasets, treating amino acid sequences as contextual token streams to learn latent representations that encode evolutionary constraints and functional signatures without requiring explicit structural input [<xref ref-type="bibr" rid="B15">15</xref>, <xref ref-type="bibr" rid="B25">25</xref>]. Tools such as ESM-2 and ProtT5 exemplify this approach, leveraging transformer-based architectures to predict mutation effects and functional annotations across hundreds of millions of sequences [<xref ref-type="bibr" rid="B16">16</xref>, <xref ref-type="bibr" rid="B27">27</xref>]. This scalability enables rapid downstream applications, such as the variant fitness and immune escape predictions performed by EVE/EVEscape, making PLMs particularly valuable for early-stage discovery and tracking rapidly evolving pathogens [<xref ref-type="bibr" rid="B26">26</xref>].</p>
</sec>
<sec id="s4-2">
<title>Structure-informed and geometric machine learning</title>
<p>Structure-driven approaches explicitly incorporate three-dimensional information, encoding spatial relationships such as backbone geometry and intermolecular interfaces using graph neural networks (GNNs) or geometric deep learning [<xref ref-type="bibr" rid="B7">7</xref>, <xref ref-type="bibr" rid="B8">8</xref>]. AlphaFold3 and RoseTTAFold have established the benchmark for biomolecular complex prediction, enabling structure-guided therapeutic design with high fidelity [<xref ref-type="bibr" rid="B7">7</xref>, <xref ref-type="bibr" rid="B8">8</xref>]. These structural insights are extended by specialized docking and scoring tools like DiffDock and GNINA, which utilize diffusion models and convolutional neural networks to predict protein-ligand binding poses [<xref ref-type="bibr" rid="B30">30</xref>, <xref ref-type="bibr" rid="B31">31</xref>]. By enforcing physical and spatial constraints, these models achieve high predictive accuracy for tasks dominated by structural determinants, including antibody-antigen interface modeling as seen in platforms like DeepAb and IgFold [<xref ref-type="bibr" rid="B11">11</xref>, <xref ref-type="bibr" rid="B32">32</xref>].</p>
</sec>
<sec id="s4-3">
<title>Generative and multimodal integrative frameworks</title>
<p>Building on these predictive foundations, generative AI frameworks now integrate heterogeneous data sources&#x2014;including sequence, structure, and experimental mutational data&#x2014;within unified architectures. Denoising diffusion probabilistic models, such as RFdiffusion and Chroma, allow for the <italic>de novo</italic> generation of functional protein backbones and binders tailored to specific viral epitopes [<xref ref-type="bibr" rid="B13">13</xref>, <xref ref-type="bibr" rid="B28">28</xref>]. These are often coupled with sequence-design tools like ProteinMPNN or autoregressive models like ProGen2 to ensure fold-stability and biological function [<xref ref-type="bibr" rid="B15">15</xref>, <xref ref-type="bibr" rid="B25">25</xref>].</p>
<p>Furthermore, the maturation of immune-informatics resources, such as the IEDB Analysis Resource and NetMHCpan, facilitates the identification of high-affinity epitopes for vaccine antigen design [<xref ref-type="bibr" rid="B23">23</xref>, <xref ref-type="bibr" rid="B24">24</xref>]. While these multimodal models improve robustness, experience from the COVID-19 pandemic highlighted the need to validate computational predictions under real-world experimental constraints. <xref ref-type="table" rid="T1">Table 1</xref> summarizes the key features, architecture, and contributions of these representative tools. It should be noted that these listings are not intended as a definitive catalogue, as numerous additional methods and resources exist beyond those included to address the complexities of protein engineering.</p>
</sec>
</sec>
<sec id="s5">
<title>Lessons from COVID-19: case studies and applications</title>
<p>The COVID-19 pandemic provided an unprecedented stress test for AI-directed protein design, moving computational workflows from theoretical frameworks to experimental and clinical reality. The following subsections highlight key evidence-based studies that demonstrate the efficacy of these approaches.</p>
<sec id="s5-1">
<title>De novo design of hyper-stable miniprotein binders</title>
<p>A pivotal shift in the pandemic response was the move from stabilizing viral proteins to the <italic>de novo</italic> design of synthetic binders. Researchers utilized Rosetta-based and early diffusion models to design small (approx. 55-residue) proteins targeting the SARS-CoV-2 Spike Protein Receptor Binding Domain (RBD) [<xref ref-type="bibr" rid="B33">33</xref>]. These designed miniproteins, such as LCB1, demonstrated picomolar neutralization potency, exceeding that of many monoclonal antibodies. Crucially, experimental validation through cryo-EM confirmed that the binders engaged the target exactly as predicted in the computational model [<xref ref-type="bibr" rid="B33">33</xref>, <xref ref-type="bibr" rid="B34">34</xref>]. Their superior thermal stability and ability to be produced in <italic>E. coli</italic> provided a scalable alternative to traditional biologics.</p>
</sec>
<sec id="s5-2">
<title>Computational design of self-assembling nanoparticle vaccines</title>
<p>AI played a critical role in the development of multivalent antigens, which provide broader protection than monomeric proteins. The design of the GBP510 (Ruvaxivid) vaccine utilized computational tools to create a self-assembling protein nanoparticle (icosahedral symmetry) decorated with 60 copies of the SARS-CoV-2 RBD [<xref ref-type="bibr" rid="B35">35</xref>, <xref ref-type="bibr" rid="B36">36</xref>]. Clinical trial data showed that this AI-scaffolded design elicited significantly higher neutralizing antibody titters compared to the monomeric RBD or mRNA-based benchmarks in certain populations. This provides concrete evidence that computational geometry can directly enhance the human immune response.</p>
</sec>
<sec id="s5-3">
<title>Generative AI for rapid antibody optimization</title>
<p>As variants like Omicron emerged, AI was deployed to future-proof antibody therapeutics by identifying conserved epitopes. Studies employing Deep Mutational Scanning (DMS) data combined with Graph Neural Networks (GNNs) allowed for the rapid optimization of antibodies like Sotrovimab derivatives. By predicting escape trajectories, researchers were able to engineer antibodies that maintained high-affinity binding across multiple variants of concern [<xref ref-type="bibr" rid="B37">37</xref>&#x2013;<xref ref-type="bibr" rid="B39">39</xref>]. Experimental assays confirmed that these AI-optimized candidates retained neutralization against the BA.2 and BA.5 subvariants when many traditional antibodies failed [<xref ref-type="bibr" rid="B40">40</xref>, <xref ref-type="bibr" rid="B41">41</xref>].</p>
<p>Importantly, these case studies reveal that the success of AI-enabled approaches depends not only on algorithms but also on high-quality data availability and experimental throughput. Regions with limited sequencing or experimental capacity were less able to benefit from these advances, highlighting equity and infrastructure considerations that must be addressed for future pandemic preparedness.</p>
</sec>
</sec>
<sec id="s6">
<title>Translational and regulatory considerations</title>
<p>Despite their promise, AI-enabled protein design approaches face significant challenges in pharmaceutical translation. Regulatory acceptance requires transparent, reproducible, and validated computational models, as well as clear documentation of how predictions are generated and used. During COVID-19, variability in model predictions underscored the need for standardized benchmarking, reporting practices, and version control [<xref ref-type="bibr" rid="B3">3</xref>, <xref ref-type="bibr" rid="B12">12</xref>, <xref ref-type="bibr" rid="B42">42</xref>].</p>
<p>Integration with experimental pipelines proved essential for successful translation. Hybrid workflows combining <italic>in silico</italic> screening with targeted wet-lab validation reduced attrition and improved confidence in AI-generated candidates. Such integration also facilitated early assessment of manufacturability, stability, and developability, factors that are critical for clinical success but often overlooked in purely computational studies [<xref ref-type="bibr" rid="B1">1</xref>&#x2013;<xref ref-type="bibr" rid="B3">3</xref>, <xref ref-type="bibr" rid="B6">6</xref>, <xref ref-type="bibr" rid="B12">12</xref>, <xref ref-type="bibr" rid="B42">42</xref>].</p>
<p>Current regulatory frameworks were not designed with AI-driven discovery in mind. While the pandemic prompted limited flexibility in review processes, long-term adoption of AI-designed biologics will require clearer guidance on model validation, lifecycle management, update policies, and post-approval monitoring. Developing such frameworks will be essential to balance innovation with patient safety.</p>
</sec>
<sec id="s7">
<title>Future directions: toward pandemic-ready biologics</title>
<p>Preparing for future pandemics, including hypothetical Disease X scenarios, requires a shift from reactive therapeutic development to anticipatory design. Pre-emptive development of broad-spectrum vaccines and antibodies targeting conserved viral elements represents a central goal of this paradigm, offering the potential for baseline protection against entire viral families [<xref ref-type="bibr" rid="B1">1</xref>, <xref ref-type="bibr" rid="B19">19</xref>, <xref ref-type="bibr" rid="B24">24</xref>, <xref ref-type="bibr" rid="B26">26</xref>, <xref ref-type="bibr" rid="B43">43</xref>]. Curated libraries of ready-to-deploy biologics could further accelerate response timelines by enabling rapid selection and adaptation of pre-validated candidates. Such libraries would need to be continuously updated as new data emerge, integrating insights from viral surveillance, structural biology, and immunology. Integration of One Health data spanning human, animal, and environmental surveillance will be critical for identifying zoonotic risks before spillover occurs [<xref ref-type="bibr" rid="B1">1</xref>, <xref ref-type="bibr" rid="B2">2</xref>, <xref ref-type="bibr" rid="B4">4</xref>, <xref ref-type="bibr" rid="B44">44</xref>, <xref ref-type="bibr" rid="B45">45</xref>]. Coupled with global data-sharing platforms that enable real-time model updating, these approaches could support coordinated and proactive responses to emerging threats. Realizing this vision will require sustained investment in data infrastructure, governance frameworks, and international collaboration.</p>
</sec>
<sec id="s8">
<title>Strengths, limitations, and challenges</title>
<p>The rapid evolution of AI-directed protein design necessitates a critical appraisal of the current methodological landscape and the scope of this synthesis. The primary strength of this mini review is its focus on the transition from theoretical <italic>in silico</italic> models to experimentally validated evidence-based outcomes, capturing the shift toward generative paradigms through late 2025. However, this study is limited by the inherent black-box nature of many deep learning architectures, which complicates a purely mechanistic interpretation of design successes. Furthermore, the rapid pace of the field means that some proprietary models may lack the open-source data required for a full technical audit.</p>
<sec id="s8-1">
<title>High computational costs and resource disparity</title>
<p>A significant challenge in modern protein design is the escalating computational cost associated with training and deploying state-of-the-art models. The high-fidelity diffusion models and large-scale molecular dynamics (MD) simulations require massive GPU clusters, creating a compute divide that favors well-funded institutions. Further, the energy requirements for these iterative design cycles raise concerns regarding the environmental sustainability of always-on pandemic surveillance systems.</p>
</sec>
<sec id="s8-2">
<title>Data privacy and biosecurity</title>
<p>As AI models are increasingly trained on vast repositories of viral sequences and human immune repertoires, data privacy and biosecurity have emerged as paramount concerns. The use of patient-derived monoclonal antibody data requires rigorous anonymization and secure handling to prevent the exposure of sensitive clinical information. Further, there is an ongoing debate regarding the dual-use potential of generative AI, where the same tools used to design vaccines could theoretically be repurposed to enhance viral fitness or escape, necessitating robust international governance and red-teaming protocols.</p>
</sec>
<sec id="s8-3">
<title>Experimental bottleneck</title>
<p>The primary operational challenge remains the disparity between the speed of AI-generated designs and the throughput of wet-lab validation. Even with high-affinity predictions, the design-build-test cycle is often slowed by traditional synthesis and purification timelines, highlighting the need for more integrated, automated bio-foundries to achieve true pandemic readiness.</p>
<p>Addressing these integrated challenges, from high computational overhead to the ethical management of biological data, remains a prerequisite for the maturation of the field. Only by harmonizing rapid algorithmic innovation with sustainable infrastructure and robust biosecurity protocols can AI-directed design transition from an experimental urgency into a dependable pillar of public health. Ultimately, navigating these multifaceted limitations will define the trajectory of the next-generation of pandemic-ready therapeutics.</p>
</sec>
</sec>
<sec sec-type="conclusion" id="s9">
<title>Conclusion</title>
<p>AI-directed computational protein design has fundamentally reshaped the landscape of vaccine and antibody development, a shift catalyzed by the urgent demands of the COVID-19 pandemic. This mini review has highlighted how the transition from reactive to proactive design paradigms, moving from the stabilization of viral antigens to the <italic>de novo</italic> generation of hyper-stable binders, has moved computational workflows into the realm of clinical reality. While the potential of these technologies to accelerate discovery and expand therapeutic breadth is well-established, the field remains balanced by critical considerations regarding model interpretability and predictive reliability. The pandemic underscored that the success of AI-driven biologics is inextricably linked to the availability of high-quality experimental data and the presence of standardized validation frameworks.</p>
<p>Ultimately, the maturation of AI-enabled protein design depends on its integration as a supportive component of global pandemic preparedness, complementing established therapeutic development frameworks rather than replacing them. By grounding computational innovation in rigorous experimental validation and transparent regulatory oversight, these technologies offer a robust path toward ensuring that the global scientific community is better equipped for future infectious disease challenges.</p>
</sec>
</body>
<back>
<sec sec-type="author-contributions" id="s10">
<title>Author contributions</title>
<p>RK: Conceptualization, Data curation, Formal analysis, Writing &#x2013; Original draft. SR: Writing &#x2013; Original draft, review and editing. Both authors approved the published version. </p>
</sec>
<ack>
<title>Acknowledgements</title>
<p>The figure in this review article was created with <ext-link ext-link-type="uri" xlink:href="http://BioRender.com">BioRender.com</ext-link>.</p>
</ack>
<sec sec-type="COI-statement" id="s12">
<title>Conflict of interest</title>
<p>The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="ai-statement" id="s13">
<title>Generative AI statement</title>
<p>The author(s) declared that generative AI was not used in the creation of this manuscript.</p>
<p>Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<label>1.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Thadani</surname>
<given-names>NN</given-names>
</name>
<name>
<surname>Gurev</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Notin</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Youssef</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Rollins</surname>
<given-names>NJ</given-names>
</name>
<name>
<surname>Ritter</surname>
<given-names>D</given-names>
</name>
<etal/>
</person-group> <article-title>Learning from prepandemic data to forecast viral escape</article-title>. <source>Nature</source> (<year>2023</year>) <volume>622</volume>(<issue>7984</issue>):<fpage>818</fpage>&#x2013;<lpage>25</lpage>. <pub-id pub-id-type="doi">10.1038/s41586-023-06617-0</pub-id>
<pub-id pub-id-type="pmid">37821700</pub-id>
</mixed-citation>
</ref>
<ref id="B2">
<label>2.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hie</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Zhong</surname>
<given-names>ED</given-names>
</name>
<name>
<surname>Berger</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Bryson</surname>
<given-names>B</given-names>
</name>
</person-group>. <article-title>Learning the language of viral evolution and escape</article-title>. <source>Science</source> (<year>2021</year>) <volume>371</volume>(<issue>6526</issue>):<fpage>284</fpage>&#x2013;<lpage>8</lpage>. <pub-id pub-id-type="doi">10.1126/science.abd7331</pub-id>
<pub-id pub-id-type="pmid">33446556</pub-id>
</mixed-citation>
</ref>
<ref id="B3">
<label>3.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Malik</surname>
<given-names>YS</given-names>
</name>
<name>
<surname>Sircar</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Bhat</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Ansari</surname>
<given-names>MI</given-names>
</name>
<name>
<surname>Pande</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Kumar</surname>
<given-names>P</given-names>
</name>
<etal/>
</person-group> <article-title>How artificial intelligence may help the Covid-19 pandemic: pitfalls and lessons for the future</article-title>. <source>Rev Med Virol</source> (<year>2021</year>) <volume>31</volume>(<issue>5</issue>):<fpage>e2205</fpage>&#x2013;<lpage>11</lpage>. <pub-id pub-id-type="doi">10.1002/rmv.2205</pub-id>
<pub-id pub-id-type="pmid">33476063</pub-id>
</mixed-citation>
</ref>
<ref id="B4">
<label>4.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kaushik</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Kumar</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>KYJ</given-names>
</name>
<name>
<surname>Srivastava</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Bhatia</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Malik</surname>
<given-names>YS</given-names>
</name>
</person-group>. <article-title>A novel structure-based approach for identification of vertebrate susceptibility to SARS-CoV-2: implications for future surveillance programmes</article-title>. <source>Environ Res</source> (<year>2022</year>) <volume>212</volume>:<fpage>113303</fpage>. <pub-id pub-id-type="doi">10.1016/j.envres.2022.113303</pub-id>
<pub-id pub-id-type="pmid">35460633</pub-id>
</mixed-citation>
</ref>
<ref id="B5">
<label>5.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kumar</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Kaushik</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>KYJ</given-names>
</name>
<name>
<surname>Uversky</surname>
<given-names>VN</given-names>
</name>
<name>
<surname>Sahu</surname>
<given-names>U</given-names>
</name>
<name>
<surname>Sood</surname>
<given-names>R</given-names>
</name>
<etal/>
</person-group> <article-title>A novel consensus-based computational pipeline for screening of antibody therapeutics for efficacy against SARS-CoV-2 variants of concern including omicron variant</article-title>. <source>Proteins: Struct Funct Bioinformatics</source> (<year>2023</year>) <volume>91</volume>(<issue>6</issue>):<fpage>798</fpage>&#x2013;<lpage>806</lpage>. <pub-id pub-id-type="doi">10.1002/prot.26467</pub-id>
<pub-id pub-id-type="pmid">36629264</pub-id>
</mixed-citation>
</ref>
<ref id="B6">
<label>6.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ito</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Strange</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Joas</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Lytras</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Sato</surname>
<given-names>K</given-names>
</name>
<etal/>
</person-group> <article-title>A protein language model for exploring viral fitness landscapes</article-title>. <source>Nat Commun</source> (<year>2025</year>) <volume>16</volume>(<issue>1</issue>):<fpage>4236</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-025-59422-w</pub-id>
<pub-id pub-id-type="pmid">40360496</pub-id>
</mixed-citation>
</ref>
<ref id="B7">
<label>7.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Abramson</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Adler</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Dunger</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Evans</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Green</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Pritzel</surname>
<given-names>A</given-names>
</name>
<etal/>
</person-group> <article-title>Accurate structure prediction of biomolecular interactions with AlphaFold 3</article-title>. <source>Nature</source> (<year>2024</year>) <volume>630</volume>(<issue>8016</issue>):<fpage>493</fpage>&#x2013;<lpage>500</lpage>. <pub-id pub-id-type="doi">10.1038/s41586-024-07487-w</pub-id>
<pub-id pub-id-type="pmid">38718835</pub-id>
</mixed-citation>
</ref>
<ref id="B8">
<label>8.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Krishna</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Ahern</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Sturmfels</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Venkatesh</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Kalvet</surname>
<given-names>I</given-names>
</name>
<etal/>
</person-group> <article-title>Generalized biomolecular modeling and design with RoseTTAFold all-atom</article-title>. <source>Science</source> (<year>2024</year>) <volume>384</volume>(<issue>6693</issue>):<fpage>eadl2528</fpage>. <pub-id pub-id-type="doi">10.1126/science.adl2528</pub-id>
<pub-id pub-id-type="pmid">38452047</pub-id>
</mixed-citation>
</ref>
<ref id="B9">
<label>9.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ahdritz</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Bouatta</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Floristean</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Kadyan</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Xia</surname>
<given-names>Q</given-names>
</name>
<name>
<surname>Gerecke</surname>
<given-names>W</given-names>
</name>
<etal/>
</person-group> <article-title>OpenFold: retraining AlphaFold2 yields new insights into its learning mechanisms and capacity for generalization</article-title>. <source>Nat Methods</source> (<year>2024</year>) <volume>21</volume>(<issue>8</issue>):<fpage>1514</fpage>&#x2013;<lpage>24</lpage>. <pub-id pub-id-type="doi">10.1038/s41592-024-02272-z</pub-id>
<pub-id pub-id-type="pmid">38744917</pub-id>
</mixed-citation>
</ref>
<ref id="B10">
<label>10.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mall</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Kaushik</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Martinez</surname>
<given-names>ZA</given-names>
</name>
<name>
<surname>Thomson</surname>
<given-names>MW</given-names>
</name>
<name>
<surname>Castiglione</surname>
<given-names>F</given-names>
</name>
</person-group>. <article-title>Benchmarking protein language models for protein crystallization</article-title>. <source>Sci Rep</source> (<year>2025</year>) <volume>15</volume>(<issue>1</issue>):<fpage>2381</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-025-86519-5</pub-id>
<pub-id pub-id-type="pmid">39827171</pub-id>
</mixed-citation>
</ref>
<ref id="B11">
<label>11.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ruffolo</surname>
<given-names>JA</given-names>
</name>
<name>
<surname>Chu</surname>
<given-names>L-S</given-names>
</name>
<name>
<surname>Mahajan</surname>
<given-names>SP</given-names>
</name>
<name>
<surname>Gray</surname>
<given-names>JJ</given-names>
</name>
</person-group>. <article-title>Fast, accurate antibody structure prediction from deep learning on massive set of natural antibodies</article-title>. <source>Nat Commun</source> (<year>2023</year>) <volume>14</volume>(<issue>1</issue>):<fpage>2389</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-023-38063-x</pub-id>
<pub-id pub-id-type="pmid">37185622</pub-id>
</mixed-citation>
</ref>
<ref id="B12">
<label>12.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kaushik</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Kant</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Christodoulides</surname>
<given-names>M</given-names>
</name>
</person-group>. <article-title>Artificial intelligence in accelerating vaccine development - current and future perspectives</article-title>. <source>Front Bacteriol</source> (<year>2023</year>) <volume>2</volume>:<fpage>1258159</fpage>. <pub-id pub-id-type="doi">10.3389/fbrio.2023.1258159</pub-id>
</mixed-citation>
</ref>
<ref id="B13">
<label>13.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ingraham</surname>
<given-names>JB</given-names>
</name>
<name>
<surname>Baranov</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Costello</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Barber</surname>
<given-names>KW</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Ismail</surname>
<given-names>A</given-names>
</name>
<etal/>
</person-group> <article-title>Illuminating protein space with a programmable generative model</article-title>. <source>Nature</source> (<year>2023</year>) <volume>623</volume>(<issue>7989</issue>):<fpage>1070</fpage>&#x2013;<lpage>8</lpage>. <pub-id pub-id-type="doi">10.1038/s41586-023-06728-8</pub-id>
<pub-id pub-id-type="pmid">37968394</pub-id>
</mixed-citation>
</ref>
<ref id="B14">
<label>14.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mirdita</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Schutze</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Moriwaki</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Heo</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Ovchinnikov</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Steinegger</surname>
<given-names>M</given-names>
</name>
</person-group>. <article-title>ColabFold: making protein folding accessible to all</article-title>. <source>Nat Methods</source> (<year>2022</year>) <volume>19</volume>(<issue>6</issue>):<fpage>679</fpage>&#x2013;<lpage>82</lpage>. <pub-id pub-id-type="doi">10.1038/s41592-022-01488-1</pub-id>
<pub-id pub-id-type="pmid">35637307</pub-id>
</mixed-citation>
</ref>
<ref id="B15">
<label>15.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dauparas</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Anishchenko</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Bennett</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Bai</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Ragotte</surname>
<given-names>RJ</given-names>
</name>
<name>
<surname>Milles</surname>
<given-names>LF</given-names>
</name>
<etal/>
</person-group> <article-title>Robust deep learning&#x2013;based protein sequence design using ProteinMPNN</article-title>. <source>Science</source> (<year>2022</year>) <volume>378</volume>(<issue>6615</issue>):<fpage>49</fpage>&#x2013;<lpage>56</lpage>. <pub-id pub-id-type="doi">10.1126/science.add2187</pub-id>
<pub-id pub-id-type="pmid">36108050</pub-id>
</mixed-citation>
</ref>
<ref id="B16">
<label>16.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Elnaggar</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Heinzinger</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Dallago</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Rehawi</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Jones</surname>
<given-names>L</given-names>
</name>
<etal/>
</person-group> <article-title>ProtTrans: toward understanding the language of life through self-supervised learning</article-title>. <source>IEEE Trans Pattern Anal Machine Intelligence</source> (<year>2021</year>) <volume>44</volume>(<issue>10</issue>):<fpage>7112</fpage>&#x2013;<lpage>27</lpage>. <pub-id pub-id-type="doi">10.1109/TPAMI.2021.3095381</pub-id>
<pub-id pub-id-type="pmid">34232869</pub-id>
</mixed-citation>
</ref>
<ref id="B17">
<label>17.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kaushik</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>KYJ</given-names>
</name>
</person-group>. <article-title>An integrated protein structure fitness scoring approach for identifying native-like model structures</article-title>. <source>Comput Struct Biotechnol J</source> (<year>2022</year>) <volume>20</volume>:<fpage>6467</fpage>&#x2013;<lpage>72</lpage>. <pub-id pub-id-type="doi">10.1016/j.csbj.2022.11.032</pub-id>
<pub-id pub-id-type="pmid">36467582</pub-id>
</mixed-citation>
</ref>
<ref id="B18">
<label>18.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Swanson</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Bulaong</surname>
<given-names>NL</given-names>
</name>
<name>
<surname>Pak</surname>
<given-names>JE</given-names>
</name>
<name>
<surname>Zou</surname>
<given-names>J</given-names>
</name>
</person-group>. <article-title>The virtual lab of AI agents designs new SARS-CoV-2 nanobodies</article-title>. <source>Nature</source> (<year>2025</year>) <volume>646</volume>(<issue>8085</issue>):<fpage>716</fpage>&#x2013;<lpage>23</lpage>. <pub-id pub-id-type="doi">10.1038/s41586-025-09442-9</pub-id>
<pub-id pub-id-type="pmid">40730228</pub-id>
</mixed-citation>
</ref>
<ref id="B19">
<label>19.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kaushik</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>KYJ</given-names>
</name>
</person-group>. <article-title>ProFitFun: a protein tertiary structure fitness function for quantifying the accuracies of model structures</article-title>. <source>Bioinformatics</source> (<year>2022</year>) <volume>38</volume>(<issue>2</issue>):<fpage>369</fpage>&#x2013;<lpage>76</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btab666</pub-id>
<pub-id pub-id-type="pmid">34542606</pub-id>
</mixed-citation>
</ref>
<ref id="B20">
<label>20.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kaushik</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>KYJ</given-names>
</name>
</person-group>. <article-title>A protein sequence fitness function for identifying natural and nonnatural proteins</article-title>. <source>Proteins</source> (<year>2020</year>) <volume>88</volume>(<issue>10</issue>):<fpage>1271</fpage>&#x2013;<lpage>84</lpage>. <pub-id pub-id-type="doi">10.1002/prot.25900</pub-id>
<pub-id pub-id-type="pmid">32415863</pub-id>
</mixed-citation>
</ref>
<ref id="B21">
<label>21.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Baek</surname>
<given-names>M</given-names>
</name>
<name>
<surname>DiMaio</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Anishchenko</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Dauparas</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Ovchinnikov</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>GR</given-names>
</name>
<etal/>
</person-group> <article-title>Accurate prediction of protein structures and interactions using a three-track neural network</article-title>. <source>Science</source> (<year>2021</year>) <volume>373</volume>(<issue>6557</issue>):<fpage>871</fpage>&#x2013;<lpage>6</lpage>. <pub-id pub-id-type="doi">10.1126/science.abj8754</pub-id>
<pub-id pub-id-type="pmid">34282049</pub-id>
</mixed-citation>
</ref>
<ref id="B22">
<label>22.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kumar</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Kaushik</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Singh</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Uversky</surname>
<given-names>VN</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>KYJ</given-names>
</name>
<name>
<surname>Sahu</surname>
<given-names>U</given-names>
</name>
<etal/>
</person-group> <article-title>Bayesian molecular dating analyses combined with mutational profiling suggest an independent origin and evolution of SARS-CoV-2 omicron BA.1 and BA.2 sub-lineages</article-title>. <source>Viruses</source> (<year>2022</year>) <volume>14</volume>(<issue>12</issue>):<fpage>2764</fpage>. <pub-id pub-id-type="doi">10.3390/v14122764</pub-id>
<pub-id pub-id-type="pmid">36560768</pub-id>
</mixed-citation>
</ref>
<ref id="B23">
<label>23.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vita</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Blazeska</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Marrama</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Duesing</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Bennett</surname>
<given-names>J</given-names>
</name>, <collab>IEDB Curation Team Members</collab>
<etal/>
</person-group> <collab>IEDB Curation Team Members</collab>, <article-title>The immune epitope database (IEDB): 2024 update</article-title>. <source>Nucleic Acids Res</source> (<year>2025</year>) <volume>53</volume>(<issue>D1</issue>):<fpage>D436</fpage>&#x2013;<lpage>D443</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkae1092</pub-id>
<pub-id pub-id-type="pmid">39558162</pub-id>
</mixed-citation>
</ref>
<ref id="B24">
<label>24.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Reynisson</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Alvarez</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Paul</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Peters</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Nielsen</surname>
<given-names>M</given-names>
</name>
</person-group>. <article-title>NetMHCpan-4.1 and NetMHCIIpan-4.0: improved predictions of MHC antigen presentation by concurrent motif deconvolution and integration of MS MHC eluted ligand data</article-title>. <source>Nucleic Acids Res</source> (<year>2020</year>) <volume>48</volume>(<issue>W1</issue>):<fpage>W449</fpage>&#x2013;<lpage>W454</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkaa379</pub-id>
<pub-id pub-id-type="pmid">32406916</pub-id>
</mixed-citation>
</ref>
<ref id="B25">
<label>25.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nijkamp</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Ruffolo</surname>
<given-names>JA</given-names>
</name>
<name>
<surname>Weinstein</surname>
<given-names>EN</given-names>
</name>
<name>
<surname>Naik</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Madani</surname>
<given-names>A</given-names>
</name>
</person-group>. <article-title>ProGen2: exploring the boundaries of protein language models</article-title>. <source>Cell Syst</source> (<year>2023</year>) <volume>14</volume>(<issue>11</issue>):<fpage>968</fpage>&#x2013;<lpage>78.e3</lpage>. <pub-id pub-id-type="doi">10.1016/j.cels.2023.10.002</pub-id>
<pub-id pub-id-type="pmid">37909046</pub-id>
</mixed-citation>
</ref>
<ref id="B26">
<label>26.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Frazer</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Notin</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Dias</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Gomez</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Min</surname>
<given-names>JK</given-names>
</name>
<name>
<surname>Brock</surname>
<given-names>K</given-names>
</name>
<etal/>
</person-group> <article-title>Disease variant prediction with deep generative models of evolutionary data</article-title>. <source>Nature</source> (<year>2021</year>) <volume>599</volume>(<issue>7883</issue>):<fpage>91</fpage>&#x2013;<lpage>5</lpage>. <pub-id pub-id-type="doi">10.1038/s41586-021-04043-8</pub-id>
<pub-id pub-id-type="pmid">34707284</pub-id>
</mixed-citation>
</ref>
<ref id="B27">
<label>27.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lin</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Akin</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Rao</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Hie</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Lu</surname>
<given-names>W</given-names>
</name>
<etal/>
</person-group> <article-title>Evolutionary-scale prediction of atomic-level protein structure with a language model</article-title>. <source>Science</source> (<year>2023</year>) <volume>379</volume>(<issue>6637</issue>):<fpage>1123</fpage>&#x2013;<lpage>30</lpage>. <pub-id pub-id-type="doi">10.1126/science.ade2574</pub-id>
<pub-id pub-id-type="pmid">36927031</pub-id>
</mixed-citation>
</ref>
<ref id="B28">
<label>28.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Watson</surname>
<given-names>JL</given-names>
</name>
<name>
<surname>Juergens</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Bennett</surname>
<given-names>NR</given-names>
</name>
<name>
<surname>Trippe</surname>
<given-names>BL</given-names>
</name>
<name>
<surname>Yim</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Eisenach</surname>
<given-names>HE</given-names>
</name>
<etal/>
</person-group> <article-title>
<italic>De novo</italic> design of protein structure and function with RFdiffusion</article-title>. <source>Nature</source> (<year>2023</year>) <volume>620</volume>(<issue>7976</issue>):<fpage>1089</fpage>&#x2013;<lpage>100</lpage>. <pub-id pub-id-type="doi">10.1038/s41586-023-06415-8</pub-id>
<pub-id pub-id-type="pmid">37433327</pub-id>
</mixed-citation>
</ref>
<ref id="B29">
<label>29.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tareen</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Kooshkbaghi</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Posfai</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Ireland</surname>
<given-names>WT</given-names>
</name>
<name>
<surname>McCandlish</surname>
<given-names>DM</given-names>
</name>
<name>
<surname>Kinney</surname>
<given-names>JB</given-names>
</name>
</person-group>. <article-title>MAVE-NN: learning genotype-phenotype maps from multiplex assays of variant effect</article-title>. <source>Genome Biol</source> (<year>2022</year>) <volume>23</volume>(<issue>1</issue>):<fpage>98</fpage>. <pub-id pub-id-type="doi">10.1186/s13059-022-02661-7</pub-id>
<pub-id pub-id-type="pmid">35428271</pub-id>
</mixed-citation>
</ref>
<ref id="B30">
<label>30.</label>
<mixed-citation publication-type="book">
<person-group person-group-type="author">
<name>
<surname>Corso</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Stark</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Jing</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Barzilay</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Jaakkola</surname>
<given-names>TS</given-names>
</name>
</person-group>. <article-title>DiffDock: diffusion steps, twists, and turns for molecular docking</article-title>. In: <source>International conference on learning representations</source> (<year>2023</year>). <comment>Available online at: <ext-link ext-link-type="uri" xlink:href="https://openreview.net/forum?id=kKF8_K-mBbS">https://openreview.net/forum?id&#x3d;kKF8_K-mBbS</ext-link> (Accessed December 24, 2025).</comment>
</mixed-citation>
</ref>
<ref id="B31">
<label>31.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>McNutt</surname>
<given-names>AT</given-names>
</name>
<name>
<surname>Francoeur</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Aggarwal</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Masuda</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Meli</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Ragoza</surname>
<given-names>M</given-names>
</name>
<etal/>
</person-group> <article-title>GNINA 1.0: molecular docking with deep learning</article-title>. <source>J Cheminformatics</source> (<year>2021</year>) <volume>13</volume>:<fpage>43</fpage>. <pub-id pub-id-type="doi">10.1186/s13321-021-00522-2</pub-id>
<pub-id pub-id-type="pmid">34108002</pub-id>
</mixed-citation>
</ref>
<ref id="B32">
<label>32.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ruffolo</surname>
<given-names>JA</given-names>
</name>
<name>
<surname>Sulam</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Gray</surname>
<given-names>JJ</given-names>
</name>
</person-group>. <article-title>Antibody structure prediction using interpretable deep learning</article-title>. <source>Patterns</source> (<year>2022</year>) <volume>3</volume>(<issue>2</issue>):<fpage>100406</fpage>. <pub-id pub-id-type="doi">10.1016/j.patter.2021.100406</pub-id>
<pub-id pub-id-type="pmid">35199061</pub-id>
</mixed-citation>
</ref>
<ref id="B33">
<label>33.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cao</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Goreshnik</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Coventry</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Case</surname>
<given-names>JB</given-names>
</name>
<name>
<surname>Miller</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Kozodoy</surname>
<given-names>L</given-names>
</name>
<etal/>
</person-group> <article-title>
<italic>De novo</italic> design of picomolar SARS-CoV-2 miniprotein inhibitors</article-title>. <source>Science</source> (<year>2020</year>) <volume>370</volume>:<fpage>426</fpage>&#x2013;<lpage>31</lpage>. <pub-id pub-id-type="doi">10.1126/science.abd9909</pub-id>
<pub-id pub-id-type="pmid">32907861</pub-id>
</mixed-citation>
</ref>
<ref id="B34">
<label>34.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kalita</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Tripathi</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Padhi</surname>
<given-names>AK</given-names>
</name>
</person-group>. <article-title>Computational protein design for COVID-19 research and emerging therapeutics</article-title>. <source>ACS Cent Sci</source> (<year>2023</year>) <volume>9</volume>(<issue>4</issue>):<fpage>602</fpage>&#x2013;<lpage>13</lpage>. <pub-id pub-id-type="doi">10.1021/acscentsci.2c01513</pub-id>
<pub-id pub-id-type="pmid">37122454</pub-id>
</mixed-citation>
</ref>
<ref id="B35">
<label>35.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Walls</surname>
<given-names>AC</given-names>
</name>
<name>
<surname>Fiala</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Schafer</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Wrenn</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Pham</surname>
<given-names>MN</given-names>
</name>
<name>
<surname>Murphy</surname>
<given-names>M</given-names>
</name>
<etal/>
</person-group> <article-title>Elicitation of potent neutralizing antibody responses by designed protein nanoparticle vaccines for SARS-CoV-2</article-title>. <source>Cell</source> (<year>2020</year>) <volume>183</volume>(<issue>5</issue>):<fpage>1367</fpage>&#x2013;<lpage>82</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2020.10.043</pub-id>
<pub-id pub-id-type="pmid">33160446</pub-id>
</mixed-citation>
</ref>
<ref id="B36">
<label>36.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Song</surname>
<given-names>JY</given-names>
</name>
<name>
<surname>Choi</surname>
<given-names>WS</given-names>
</name>
<name>
<surname>Heo</surname>
<given-names>JY</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>EJ</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>JS</given-names>
</name>
<name>
<surname>Jung</surname>
<given-names>DS</given-names>
</name>
<etal/>
</person-group> <article-title>Immunogenicity and safety of SARS-CoV-2 recombinant protein nanoparticle vaccine GBP510 adjuvanted with AS03: interim results of a randomised, active-controlled, observer-blinded, phase 3 trial</article-title>. <source>eClinicalMedicine</source> (<year>2023</year>) <volume>64</volume>:<fpage>102140</fpage>. <pub-id pub-id-type="doi">10.1016/j.eclinm.2023.102140</pub-id>
<pub-id pub-id-type="pmid">37711219</pub-id>
</mixed-citation>
</ref>
<ref id="B37">
<label>37.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Case</surname>
<given-names>JB</given-names>
</name>
<name>
<surname>Mackin</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Errico</surname>
<given-names>JM</given-names>
</name>
<name>
<surname>Chong</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Madden</surname>
<given-names>EA</given-names>
</name>
<name>
<surname>Whitener</surname>
<given-names>B</given-names>
</name>
<etal/>
</person-group> <article-title>Resilience of S309 and AZD7442 monoclonal antibody treatments against infection by SARS-CoV-2 Omicron lineage strains</article-title>. <source>Nat Commun</source> (<year>2022</year>) <volume>13</volume>:<fpage>3824</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-022-31615-7</pub-id>
<pub-id pub-id-type="pmid">35780162</pub-id>
</mixed-citation>
</ref>
<ref id="B38">
<label>38.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shan</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Luo</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Hong</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Su</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Ding</surname>
<given-names>F</given-names>
</name>
<etal/>
</person-group> <article-title>Deep learning guided optimization of human antibody against SARS-CoV-2 variants with broad neutralization</article-title>. <source>Proc Natl Acad Sci U S A</source> (<year>2022</year>) <volume>119</volume>(<issue>11</issue>):<fpage>e2122954119</fpage>. <pub-id pub-id-type="doi">10.1073/pnas.2122954119</pub-id>
<pub-id pub-id-type="pmid">35238654</pub-id>
</mixed-citation>
</ref>
<ref id="B39">
<label>39.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Starr</surname>
<given-names>TN</given-names>
</name>
<name>
<surname>Greaney</surname>
<given-names>AJ</given-names>
</name>
<name>
<surname>Hilton</surname>
<given-names>SK</given-names>
</name>
<name>
<surname>Ellis</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Crawford</surname>
<given-names>KHD</given-names>
</name>
<name>
<surname>Dingens</surname>
<given-names>AS</given-names>
</name>
<etal/>
</person-group> <article-title>Deep mutational scanning of SARS-CoV-2 receptor binding domain reveals constraints on folding and ACE2 binding</article-title>. <source>Cell</source> (<year>2020</year>) <volume>182</volume>(<issue>5</issue>):<fpage>1295</fpage>&#x2013;<lpage>310.e20</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2020.08.012</pub-id>
<pub-id pub-id-type="pmid">32841599</pub-id>
</mixed-citation>
</ref>
<ref id="B40">
<label>40.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Desautels</surname>
<given-names>TA</given-names>
</name>
<name>
<surname>Arrildt</surname>
<given-names>KT</given-names>
</name>
<name>
<surname>Zemla</surname>
<given-names>AT</given-names>
</name>
<name>
<surname>Lau</surname>
<given-names>EY</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Ricci</surname>
<given-names>D</given-names>
</name>
<etal/>
</person-group> <article-title>Computationally restoring the potency of a clinical antibody against omicron</article-title>. <source>Nature</source> (<year>2024</year>) <volume>629</volume>:<fpage>878</fpage>&#x2013;<lpage>85</lpage>. <pub-id pub-id-type="doi">10.1038/s41586-024-07385-1</pub-id>
<pub-id pub-id-type="pmid">38720086</pub-id>
</mixed-citation>
</ref>
<ref id="B41">
<label>41.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhu</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Rajan</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Hayes</surname>
<given-names>CF</given-names>
</name>
<name>
<surname>Kwong</surname>
<given-names>KY</given-names>
</name>
<name>
<surname>Goncalves</surname>
<given-names>AR</given-names>
</name>
<name>
<surname>Zemla</surname>
<given-names>AT</given-names>
</name>
<etal/>
</person-group> <article-title>Preemptive optimization of a clinical antibody for broad neutralization of SARS-CoV-2 variants and robustness against viral escape</article-title>. <source>Sci Adv</source> (<year>2025</year>) <volume>11</volume>(<issue>13</issue>):<fpage>eadu0718</fpage>. <pub-id pub-id-type="doi">10.1126/sciadv.adu0718</pub-id>
<pub-id pub-id-type="pmid">40153503</pub-id>
</mixed-citation>
</ref>
<ref id="B42">
<label>42.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Singh</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Paxton</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Auclair</surname>
<given-names>J</given-names>
</name>
</person-group>. <article-title>Regulating the AI-enabled ecosystem for human therapeutics</article-title>. <source>Commun Med</source> (<year>2025</year>) <volume>5</volume>(<issue>1</issue>):<fpage>181</fpage>. <pub-id pub-id-type="doi">10.1038/s43856-025-00910-x</pub-id>
<pub-id pub-id-type="pmid">40382515</pub-id>
</mixed-citation>
</ref>
<ref id="B43">
<label>43.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nuqui</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Casalino</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Shehata</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Tse</surname>
<given-names>AL</given-names>
</name>
<etal/>
</person-group> <article-title>Simulation-driven design of stabilized SARS-CoV-2 spike S2 immunogens</article-title>. <source>Nat Commun</source> (<year>2024</year>) <volume>15</volume>(<issue>1</issue>):<fpage>7370</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-024-50976-9</pub-id>
<pub-id pub-id-type="pmid">39191724</pub-id>
</mixed-citation>
</ref>
<ref id="B44">
<label>44.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Raharinirina</surname>
<given-names>NA</given-names>
</name>
<name>
<surname>Gubela</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Bornigen</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Smith</surname>
<given-names>MR</given-names>
</name>
<name>
<surname>Oh</surname>
<given-names>D-Y</given-names>
</name>
<name>
<surname>Budt</surname>
<given-names>M</given-names>
</name>
<etal/>
</person-group> <article-title>SARS-CoV-2 evolution on a dynamic immune landscape</article-title>. <source>Nature</source> (<year>2025</year>) <volume>639</volume>(<issue>8053</issue>):<fpage>196</fpage>&#x2013;<lpage>204</lpage>. <pub-id pub-id-type="doi">10.1038/s41586-024-08477-8</pub-id>
<pub-id pub-id-type="pmid">39880955</pub-id>
</mixed-citation>
</ref>
<ref id="B45">
<label>45.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sikkema</surname>
<given-names>RS</given-names>
</name>
<name>
<surname>Koopmans</surname>
<given-names>M</given-names>
</name>
</person-group>. <article-title>Viral emergence and pandemic preparedness in a one health framework</article-title>. <source>Nat Rev Microbiol (2026)</source> (<year>2026</year>) <volume>24</volume>:<fpage>29</fpage>&#x2013;<lpage>44</lpage>. <pub-id pub-id-type="doi">10.1038/s41579-025-01243-1</pub-id>
<pub-id pub-id-type="pmid">41044181</pub-id>
</mixed-citation>
</ref>
</ref-list>
<fn-group>
<fn fn-type="custom" custom-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1953628/overview">Ashish Kumar Sarangi</ext-link>, Centurion University of Technology and Management, India</p>
</fn>
</fn-group>
</back>
</article>