REVIEW

Br. J. Biomed. Sci., 10 June 2026

Volume 83 - 2026 | https://doi.org/10.3389/bjbs.2026.16141

Artificial intelligence approaches in biological age prediction: current status and challenges

  • GW

    Guangjun Wang 1

  • PD

    Pengcheng Ding 1,2

  • ZL

    Zihui Li 1

  • QT

    Qingfeng Tang 1,2*

  • LT

    Liping Tu 2

  • TJ

    Tao Jiang 2

  • LZ

    Liangliang Zhang 1

  • BS

    Benyue Su 1,3

  • JX

    Jiatuo Xu 2

  • HA

    Hui An 4*

  • 1. Digital and Intelligent Health Research Center, Anqing Normal University, Anqing, China

  • 2. School of Traditional Chinese Medicine, Shanghai University of Traditional Chinese Medicine, Shanghai, China

  • 3. School of Mathematics and Computer Science, Tongling University, Tongling, China

  • 4. Health Management and Physical Examination Center, Xiangyang Central Hospital, Affiliated Hospital of Hubei University of Arts and Science, Xiangyang, China

Abstract

Biological age (BA) prediction has emerged as a critical research frontier for evaluating individual health status and aging trajectories beyond chronological age (CA). Recent advances in artificial intelligence (AI) have substantially accelerated this field by enabling the integration and interpretation of complex, multimodal biological data. This review provides a systematic overview of AI-driven approaches to BA prediction, covering key components including biomarker selection, feature engineering, model development, bias correction, and performance evaluation. We further highlight the growing recognition of asynchronous aging, a phenomenon in which different organs or physiological systems age at distinct rates, and discuss how AI—particularly deep learning and multimodal fusion—offers powerful tools for capturing such system-specific aging patterns. We summarize current methodologies ranging from traditional machine learning algorithms to advanced neural architectures capable of modeling nonlinear and heterogeneous aging processes. The expanding applications of AI-based BA models in disease risk assessment, geriatric evaluation, and population health monitoring are also examined. Despite rapid methodological progress, significant challenges persist, including data heterogeneity, limited model generalizability, insufficient interpretability, and barriers to clinical translation. Addressing these issues will require standardized methodological practices, robust validation across diverse populations, and the development of interpretable and equitable AI systems. Future research should prioritize the integration of multi-omics and longitudinal datasets with AI-driven analytical frameworks to establish reliable, system-level, and clinically actionable BA prediction models that account for asynchronous aging.

Introduction

Aging is a universal yet highly heterogeneous biological process characterized by progressive functional decline and increased susceptibility to disease []. Chronological age (CA), defined simply as time elapsed since birth, fails to capture the substantial inter-individual variability in aging rates or the cumulative influence of genetic, environmental, and lifestyle factors on biological deterioration []. Individuals of identical CA often exhibit markedly different physiological states and disease risks, limiting the utility of CA for health assessment, risk stratification, and prognosis []. These limitations have motivated the search for alternative metrics that more accurately reflect the biological processes underlying aging.

Biological age (BA) emerged to address this gap as an objective, quantitative measure of physiological aging that integrates multidimensional biomarkers—including DNA methylation patterns, cellular senescence indicators, and organ-level functional parameters—into a unified framework []. Empirical evidence shows that individuals with the same CA may differ in BA by up to two decades, underscoring the inadequacy of CA as a surrogate for biological function []. Unlike CA, which is linear and irreversible, BA is dynamic and potentially modifiable, reflecting both current biological state and responsiveness to behavioral or therapeutic interventions. By capturing person-specific variation attributable to genetic predisposition (estimated to explain approximately 20– of aging variability) and environmental exposures, BA correlates more strongly than CA with functional decline and mortality outcomes []. Since its conceptual articulation by Comfort in 1969, BA has reframed aging assessment from time elapsed to biological function, enabling more precise evaluation of health status and intervention efficacy [].

Methodological advances have rapidly expanded BA estimation from single-domain biomarkers to integrative models informed by multi-omics and physiological data. DNA methylation–based clocks represent a major milestone in this evolution, demonstrating strong associations with cardiovascular risk, disease incidence, and long-term outcomes []. Longitudinal cohort studies further show that BA models outperform CA-based baselines in identifying individuals at elevated risk for age-related diseases []. At the same time, aging is increasingly recognized as an asynchronous process, in which different organs, tissues, and biological systems age at distinct rates within the same individual []. This intrinsic heterogeneity challenges traditional single-value BA estimates and necessitates modeling approaches capable of capturing nonlinear, system-specific aging dynamics. Artificial intelligence (AI) and machine learning techniques enable the integration of dozens to hundreds of biomarkers, offering powerful tools to model complex aging trajectories and improve prediction of healthspan and lifespan [].

In this review, we synthesize the methodological landscape of BA estimation with a particular focus on AI-driven approaches and their current limitations. As illustrated in Figure 1, we organize BA modeling into five interconnected components: (1) biomarker identification and validation, including emerging senescence-related markers; (2) feature representation and selection for high-dimensional omics data; (3) predictive modeling using regression, machine learning, and deep learning techniques; (4) model refinement to mitigate overfitting, bias, and population heterogeneity; and (5) evaluation and translation using standardized metrics for clinical applicability. By framing BA estimation within this structured pipeline, we highlight both the transformative potential of AI-based aging clocks and the key challenges that must be addressed to enable robust, interpretable, and clinically actionable BA assessment in precision medicine and public health [].

FIGURE 1

Aging biomarkers

Aging biomarkers provide the biological foundation for estimating BA beyond CA by quantifying molecular damage, systemic dysregulation, and organ-level structural or functional decline. In line with the heterogeneous and asynchronous nature of aging, contemporary BA research increasingly adopts a multimodal perspective, integrating biomarkers across molecular, physiological, and imaging scales. Rather than treating these markers in isolation, AI–based frameworks enable their joint modeling to capture nonlinear interactions and system-specific aging trajectories. Table 1 presents a tiered summary of representative aging biomarkers across these scales, highlighting their complementary roles in characterizing biological aging from molecular alterations to organ-level phenotypes.

TABLE 1

BiomarkerKey characteristicsCommon AI modelsReferences
Molecular-level biomarkers
DNA methylationHigh predictive accuracy; cross-tissue clocks (horvath, hannum, PhenoAge, GrimAge); limited interpretabilityElastic-net, ML, DL[, ]
Telomere lengthReplicative aging marker; associated with disease and mortality; lifestyle-sensitiveKDM, ML[28, 29, 31, 62]
RNA-seqGenome-wide expression profiling; captures pathway-level aging signalsML, DL[63, 64]
MetabolomicsFunctional downstream readout; environmentally sensitive; standardization challengesPCA, KDM, ML[34, 65]
System-level biomarkers
Blood testsMulti-organ integration; inflammation, metabolic and hematologic indicatorsKDM, MLR, DL[, 41]
Imaging-based biomarkers
Brain MRIStructural and functional brain aging; well-validated brain-age paradigmCNN, transformer[49, 51]
X-raysBone and dental structural aging; high availabilityML, CNN[53, 54]
Facial imagingNon-invasive; reflects lifestyle and cutaneous agingCNN, transformer[57, 58]
Retinal imagingMicrovascular and neural aging; scalable population screeningCNN, DNN[60, 61]

Representative aging biomarkers organized by biological tier for biological age prediction.

At the molecular level, biomarkers reflect core cellular and biochemical processes underlying aging. DNA methylation (DNAm)–based epigenetic clocks, including Horvath’s, Hannum’s, PhenoAge, and GrimAge, represent the most accurate single-modality estimators of BA, showing robust associations with mortality, disease risk, and healthspan [, ]. However, their mechanistic interpretability remains limited, partly due to the integration of diverse environmental and genetic influences. For example, longevity-associated loci such as APOE modulate epigenetic age acceleration, with the 4 allele linked to faster biological aging and 2 showing relative protection [, ]. Telomere length reflects replicative senescence and mortality risk while exhibiting substantial inter-individual variability [2831]. In parallel, conserved regulators such as FOXO3 in the insulin/IGF-1 pathway influence oxidative stress resistance, inflammation, and telomere maintenance, thereby shaping molecular aging trajectories [32, 33]. Complementary transcriptomic and metabolomic signatures further capture age-related shifts in gene regulation and metabolism, and machine learning models trained on multi-omic data achieve BA prediction errors of approximately 4–5 years [3436].

At the system level, blood-based biomarkers offer a clinically scalable and integrative view of organismal aging [37]. Inflammatory markers such as C-reactive protein and interleukins capture the phenomenon of inflammaging, while metabolic, hepatic, renal, and hematologic indices jointly reflect multi-organ functional decline [3840]. Statistical approaches such as multiple linear regression and the Klemera–Doubal method (KDM), as well as deep learning–based “blood clocks,” transform multivariate laboratory panels into BA estimates with prediction errors typically within approximately 5 years [, 4143]. Importantly, composite system-level models acknowledge asynchronous aging across physiological systems, improving robustness and interpretability when compared with single-marker approaches [36, 44]. Large-scale electronic health record (EHR) have recently enabled the development of longitudinal, clinically deployable biological aging clocks that integrate routine healthcare data and predict adverse outcomes at population scale [37, 45].

In addition to molecular and system-level biomarkers, genetic background contributes substantially to inter-individual variability in aging trajectories, with heritability estimates for lifespan commonly reported at approximately 20%–30% in population-based studies [46]. Variants in longevity-associated genes such as APOE and FOXO3 have been consistently linked to aging-related phenotypes. The APOE4 allele confers increased risk of neurodegenerative disease and reduced survival, whereas FOXO3 polymorphisms show reproducible associations with human longevity across diverse populations [32, 46]. Beyond inherited susceptibility, longitudinal clinical and biomarker trajectories are strongly associated with morbidity and mortality risk. Biological age acceleration measures and dynamic changes in physiological markers have been shown to predict healthspan and lifespan outcomes [, 47]. However, most current AI-based BA models rely primarily on biomarker patterns aligned with chronological age and rarely incorporate genotype information or structured longitudinal clinical context into model architectures. Integrating polygenic risk scores (PRS), gene–environment interactions, and harmonized longitudinal clinical data into BA frameworks may improve personalization and help distinguish intrinsic biological aging from disease-driven acceleration [36].

Imaging biomarkers provide a complementary organ-specific perspective on aging by quantifying structural and functional changes at the tissue level [48]. Brain MRI–based aging models capture age-related alterations in cortical thickness, gray and white matter integrity, and network organization, with deep learning architectures achieving mean absolute errors as low as 2–3 years and showing strong associations with cognitive decline and neurodegenerative risk [4952]. Other imaging modalities extend BA estimation beyond the brain: dental and skeletal X-rays quantify mineralized tissue aging [5355], facial imaging captures cutaneous and craniofacial aging influenced by lifestyle and environmental exposure [5658], and retinal imaging provides a non-invasive window into microvascular and neural aging [5961]. These modalities are particularly well suited for AI-based analysis due to their high dimensionality and scalability.

Collectively, aging biomarkers span a continuum from molecular damage and genetic susceptibility to system-wide physiological dysregulation and organ-level structural decline. Although epigenetic clocks and brain-age models provide high predictive accuracy within individual modalities, aging is inherently multidimensional and asynchronous. AI-driven multimodal fusion that integrates molecular, imaging, genetic, and longitudinal clinical features may therefore yield more comprehensive and clinically meaningful BA estimates. Continued progress will require standardized data processing pipelines, rigorous longitudinal validation, improved model interpretability, and careful handling of genetic and clinical heterogeneity. With these advances, BA may transition from a descriptive aging indicator to a clinically actionable metric supporting personalized healthy aging strategies.

Feature engineering

Feature engineering is central to BA modeling because it determines how heterogeneous aging biomarkers are converted into representations that are robust, informative, and suitable for downstream learning. Unlike conventional risk modeling, BA estimation must integrate signals across molecular mechanisms, systemic physiology, and organ-level phenotypes; moreover, aging is increasingly recognized as an asynchronous process, in which organs and biological systems may age at distinct rates within the same individual. This intrinsic heterogeneity makes feature engineering not merely a preprocessing step, but a key design decision that shapes whether BA models can capture system-specific aging trajectories while remaining generalizable and clinically interpretable [, , 66].

In practice, feature engineering for BA prediction can be viewed as a pipeline consisting of (1) modality-aware preprocessing and representation construction, (2) feature screening and dimensionality reduction for stability and interpretability, and (3) multimodal fusion to support holistic and system-level BA estimation. At the molecular level, biomarkers such as DNA methylation loci, transcriptomic profiles, metabolomic signatures, and telomere length encode fundamental cellular aging mechanisms but are typically high-dimensional and noisy. These data commonly require normalization (e.g., z-score or quantile normalization) and batch-effect mitigation to ensure comparability across samples and platforms [, 36]. Dimensionality reduction techniques such as principal component analysis (PCA) and, increasingly, autoencoder-based representation learning are then used to extract compact latent factors that summarize dominant age-related variation while suppressing technical noise [37, 6669]. At the system level, blood- and physiology-derived indicators offer scalable clinical features that reflect multi-organ regulatory decline; however, they often exhibit redundancy and confounding, making feature selection essential for interpretability and transportability [42, 43, 70]. For imaging modalities (e.g., brain MRI, retinal images, facial photographs), deep neural networks provide end-to-end feature extraction by encoding complex morphological patterns into low-dimensional embeddings that can be aligned with aging outcomes [51, 52, 58, 61, 71, 72].

Classical statistical screening methods remain widely used because they provide transparent and clinically intuitive feature panels. Correlation with chronological age (CWCA) is often adopted when BA ground truth is unavailable, using CA as a proxy to filter biomarkers with monotonic age trends [, 73, 74]. Cox proportional hazards regression further prioritizes biomarkers most predictive of survival or health outcomes, thereby linking BA construction to clinically meaningful endpoints [, 75]. Multicollinearity diagnostics such as the variance inflation factor (VIF) reduce redundancy and improve interpretability, and have been used extensively in NHANES-style BA modeling to refine biomarker sets [, 7678]. PCA offers a complementary strategy by projecting correlated biomarkers into orthogonal components and extracting latent aging factors, which can improve stability in high-dimensional settings and facilitate model transfer [66, 67, 79, 80]. These methods collectively establish a baseline feature engineering toolkit for BA modeling; however, their reliance on linear assumptions may limit the capture of nonlinear and hierarchical interactions among biomarkers.

Recent advances in artificial intelligence have shifted feature engineering from manual screening toward representation learning and multimodal fusion. Autoencoders and variational autoencoders (VAEs) learn compact and noise-robust embeddings from omics or imaging data, preserving nonlinear structure that conventional reductions may overlook [69, 81]. Graph neural networks provide an additional mechanism for modeling biomarker dependencies and biological structure, such as gene co-expression networks or pathway connectivity, enabling relational reasoning beyond independent-feature assumptions [82]. Attention-based models and Transformers further support cross-feature interaction modeling and facilitate interpretability by learning feature importance weights dynamically, making them increasingly relevant for heterogeneous biomarker integration [51, 83]. Importantly, these AI approaches are well suited to the challenge of asynchronous aging: they can encode organ- or system-specific representations and then fuse them into a unified BA estimate, allowing the model to reflect differential aging rates across biological subsystems [52, 58, 61].

Despite their promise, AI-based feature engineering introduces several unresolved challenges that directly affect model validity and translation. First, multimodal datasets remain limited and heterogeneous, and representation learning models can be sensitive to batch effects, missing modalities, and population shifts, reducing generalizability across cohorts [, 36]. Second, feature embeddings learned by deep models may lack biological interpretability, complicating mechanistic understanding and clinical trust [, ]. Third, bias can be introduced when feature construction pipelines are optimized for prediction accuracy but fail to account for confounding structure (e.g., sex, ancestry, socioeconomic factors), leading to systematic error in BA estimates. Addressing these challenges will require standardized preprocessing protocols, robust validation across diverse populations, and the development of interpretable and bias-aware feature learning strategies that can support equitable, clinically actionable BA prediction.

Genetic variants and structured clinical history require specialized representation strategies within AI-based BA pipelines. PRS provide a compact representation of inherited susceptibility to age-related diseases and have been increasingly used to stratify aging risk and healthspan outcomes [84]. Meanwhile, longitudinal EHR-derived features can encode cumulative multimorbidity and treatment exposure, offering dynamic context for BA modeling [, 85]. Advanced AI architectures—including graph neural networks and multimodal transformer models—are particularly suited to modeling gene–environment interactions and nonlinear dependencies between inherited risk and physiological aging signals [82, 86]. Systematically integrating genetic susceptibility and clinical trajectories into BA models may improve robustness and enable clearer differentiation between intrinsic aging processes and pathology-related acceleration.

Prediction models

A broad range of models have been developed for BA prediction, progressing from regression-based statistical frameworks to modern deep learning architectures. In practice, model selection is primarily determined by data modality, feature dimensionality, and the trade-off between interpretability and predictive performance. To better reflect this evolution, we categorize BA prediction models into two major groups: machine learning approaches (including traditional regression-based methods) and deep learning approaches.

Machine learning approaches

Machine learning (ML) provides a flexible data-driven framework for BA prediction and encompasses both classical statistical methods and modern nonlinear algorithms. Early BA estimators are often derived from multivariate regression frameworks that combine clinical biomarkers into a single composite index. A representative example is the KDM, which estimates BA through a weighted multivariate regression scheme, assigning each biomarker a weight inversely proportional to its measurement error [42, 87]. KDM-based BA acceleration has been shown to associate with morbidity, mortality, and lifestyle-related risk factors in large-scale cohorts [88, 89]. Despite its interpretability, such regression-based models are constrained by linear assumptions and limited adaptability to heterogeneous or high-dimensional data.

Modern ML models extend beyond linear constraints by capturing nonlinear relationships among complex biomarker sets. Algorithms such as elastic net regression, random forests, and support vector machines (SVMs) have been widely applied to high-dimensional omics, proteomic, and imaging datasets [90]. For instance, Ganaie et al. employed an improved least-squares twin support vector regression to estimate brain age from MRI data, achieving a mean absolute error (MAE) of approximately 3 years [72]. Similarly, proteomic clocks trained using ML models have demonstrated strong predictive accuracy and revealed systemic aging patterns across tissues and organ systems [91]. Overall, ML approaches offer improved flexibility and scalability relative to traditional regression methods, but they often rely on handcrafted features, careful hyperparameter tuning, and may have limited interpretability in high-dimensional settings.

Deep learning and emerging trends

Deep learning (DL) has increasingly become central to BA prediction due to its capability for hierarchical representation learning and end-to-end feature extraction. Neural architectures including convolutional neural networks (CNNs), recurrent neural networks (RNNs), and variational autoencoders (VAEs) have demonstrated strong performance in modeling nonlinear aging patterns in omics, imaging, and physiological signals [92, 93]. Unlike conventional ML pipelines, DL models can automatically learn latent aging representations without manual feature engineering. For example, Galkin et al. developed DeepMAge, a neural network-based epigenetic clock trained on over 4,000 blood methylomes, achieving an MAE of 2.77 years and outperforming regression-based baselines [69]. In neuroimaging, He et al. proposed a global–local transformer framework integrating global contextual information with local fine-grained features, reducing MAE to 2.7 years in brain age estimation [51].

Recent methodological advances highlight a shift toward multimodal and self-supervised paradigms. Transformer-based models are increasingly adopted for capturing long-range dependencies and cross-modal interactions, enabling unified latent representations across diverse biological inputs. Multimodal transformer architectures integrating methylation, transcriptomic, and imaging signals have shown improved prediction accuracy and interpretability [86, 94, 95]. Meanwhile, contrastive learning and other self-supervised objectives have emerged as powerful strategies to disentangle aging signals from confounding noise and enhance robustness across datasets [96, 97]. Additionally, multitask learning and adversarial autoencoder frameworks have been proposed to jointly predict BA and auxiliary outcomes such as sex, disease risk, or mortality, improving regularization and domain transferability [98].

Overall, BA prediction has transitioned from biomarker-based regression estimators toward scalable, multimodal deep architectures that learn biologically grounded aging representations. Future research should emphasize multimodal integration, interpretability, and validation across diverse populations to ensure translational reliability of biological aging models.

Model correction

In statistical and machine learning–based BA estimation, model correction serves as a crucial post-prediction process that improves accuracy, reduces systematic bias, and enhances generalizability across heterogeneous populations. Biological and environmental variability—such as genetic background, ethnicity, and lifestyle—can introduce consistent deviations between predicted BA and CA. Without correction, these biases may propagate into downstream analysis, leading to inaccurate or inequitable outcomes across demographic subgroups [99]. For example, race- and sex-specific biomarker distributions may alter regression slopes or shift prediction intercepts, motivating population-aware calibration. Correction is therefore essential for ensuring fairness, interpretability, and translational reliability of BA estimation in real-world settings.

A second motivation relates to dataset transferability. Models trained on a specific cohort often generalize poorly to external datasets due to overfitting to cohort-specific noise, measurement protocols, or confounding variables [100]. Correction frameworks mitigate this issue by recalibrating model outputs to new distributions while preserving biologically meaningful deviations. Importantly, even when the overall predictive error is low, systematic deviations may persist—most notably overestimation in younger individuals and underestimation in older ones, often referred to as the age–delta correlation (ADC) or regression-to-the-mean problem [101]. Model correction, applied either during training or post hoc, aims to align predicted and true age distributions and remove these structural biases. Representative correction strategies are summarized in Table 2.

TABLE 2

MethodConceptual basisReferences
Skewed loss functionReplaces symmetric regression losses with asymmetric, age-dependent penalties to eliminate bias during training[101103]
AgeMLApplies linear recalibration of predicted vs. true age to remove regression-to-the-mean effects[104106]
Z-score correctionStandardizes biomarkers or predicted ages to population-level mean and variance, minimizing demographic bias[, 81, 107]
Risk-differenceUses reference risk functions or expected age indices (e.g., vascular age) for relative BA estimation[107109]

Representative model correction methods for biological age estimation.

Loss function–based correction

Loss function modification provides an intrinsic correction mechanism during model training. Conventional regression objectives such as mean absolute error or mean squared error treat under- and overestimation symmetrically, which can reinforce age-dependent biases in BA prediction. To address this, recent approaches introduce skewed or asymmetric losses that apply age-dependent weighting to penalize systematic underestimation in older individuals and overestimation in younger ones [101, 103].

In the AccelerAge framework, Wang et al. proposed an adaptive penalty strategy in which the magnitude of the gradient is adjusted according to both the subject’s age and the direction of the prediction error. This design explicitly targets the ADC phenomenon by discouraging structured error patterns and driving the correlation between prediction residuals and CA toward zero. Evaluations on neuroimaging datasets such as Cam-CAN and ABIDE demonstrated that asymmetric-loss training substantially reduced systematic bias while maintaining competitive accuracy. Similar bias-aware loss formulations have also been applied in MRI-based BA modeling [102], improving stability at the extremes of the age spectrum and supporting fairer performance across age groups.

AgeML linear recalibration

AgeML provides an interpretable and statistically grounded post-processing correction. It assumes that predicted age and true CA exhibit an approximately linear relationship and performs a regression-based recalibration to remove regression-to-the-mean bias frequently observed in uncorrected BA estimates [104, 105]. In practice, AgeML fits a linear model relating predicted age to CA on the reference dataset and then uses the fitted slope and intercept to adjust the predicted values. After correction, the adjusted predictions are designed to be unbiased with respect to CA, while the remaining residual reflects age acceleration or deceleration relative to normative expectations.

This approach is attractive because it is transparent, computationally inexpensive, and applicable across different BA modalities. Condado et al. showed that AgeML can remove bias consistently across sex and disease stratified cohorts when a healthy reference population is available [105]. As a result, linear recalibration has become a widely used component in clinical BA pipelines, particularly for epigenetic and imaging-based clocks. A recent extension of this approach was applied in an ECG-based age prediction study, where linear recalibration was first used to remove the global age-related bias, followed by an age-stratified mean-centering step to eliminate residual nonlinear associations with CA, achieving a fully age-independent deviation metric [106].

Z-score correction

Z-score correction is among the simplest and most generalizable calibration techniques. It rescales biomarkers or predicted BA values by centering them to a reference mean and scaling them by the corresponding standard deviation, yielding standardized outputs that are comparable across cohorts. In BA modeling, z-score–based correction is commonly applied to reduce demographic and cohort-dependent bias by normalizing interindividual variability and harmonizing distributions across populations [81].

Peters et al. [] and Oh et al. [107] demonstrated that such normalization improves robustness and cross-population transferability, particularly for biomarkers with high variance and strong population stratification. Recent work has further extended z-score calibration to multi-organ aging frameworks, enabling standardized comparison of organ-specific BA metrics across demographic groups and supporting downstream risk stratification [107].

Risk-difference

Beyond purely statistical normalization, risk-based calibration methods define BA in clinically interpretable terms by referencing expected age-dependent physiological profiles [110]. Tang et al. proposed vascular aging frameworks in which predicted physiological parameters are compared to their expected reference values at a given CA [108, 109]. The deviation between observed and expected values is then interpreted as accelerated or decelerated vascular aging and used to construct a corrected vascular age metric.

Such formulations enable risk-normalized calibration by explicitly mapping deviations in physiological function to interpretable BA adjustments, thereby aligning BA prediction with clinically meaningful outcomes [111]. Recent extensions have generalized this idea to organ-specific aging indices, including hepatic and renal aging, yielding unified risk-referenced frameworks that support precision medicine applications and cross-organ comparisons [107].

Result evaluation

Rigorous evaluation of BA prediction models is essential for assessing their reliability, interpretability, and translational value. Unlike conventional regression tasks, BA estimation lacks a definitive biological ground truth, making evaluation inherently multidimensional. High numerical accuracy alone does not guarantee that a model captures meaningful aging processes or generalizes across populations. Accordingly, BA evaluation is commonly conducted from two complementary perspectives: a data-oriented statistical perspective and a knowledge-oriented biological perspective, as summarized in Table 3.

TABLE 3

PerspectiveEvaluation indicatorPurposeReferences
Data perspectiveMAE, MSE, RMSEQuantify prediction error relative to CA[51, 52, 112, 113]
Assess variance explained and model fit[, , 114, 115]
Knowledge perspectiveCross-sectional analysisIdentify health differences across individuals or subgroups[44, 47, 116, 117]
Longitudinal cohort analysisTrack individual aging trajectories and outcomes over time[118120]

Common biological age validity indicators.

Data perspective

From the data-oriented perspective, BA prediction is typically evaluated using statistical regression metrics that quantify the deviation between predicted BA and CA, which serves as a proxy reference in the absence of a biological ground truth. Commonly used indicators include MAE, mean squared error (MSE), root mean squared error (RMSE), and the coefficient of determination [45, 121]. These metrics provide standardized and easily comparable measures of predictive performance across models and datasets.

Among these indicators, MAE is particularly favored for its interpretability and robustness to outliers, reflecting the average deviation of predictions in age-equivalent units. RMSE and MSE emphasize larger errors and are useful for identifying models prone to extreme misestimation, which is especially relevant in clinical risk stratification. Deep learning–based BA models, particularly in neuroimaging and multimodal settings, now routinely achieve RMSE values below 3–4 years, indicating substantial progress in predictive accuracy [51, 115].

The complements error-based metrics by quantifying how much inter-individual variation in aging is captured by the model. Higher values suggest stronger consistency between predicted and observed age distributions and better modeling of population-level aging variability. However, high statistical performance does not necessarily imply biological validity, motivating the need for complementary evaluation frameworks.

Knowledge perspective

The knowledge-oriented perspective evaluates whether BA estimates reflect biologically meaningful aging processes rather than merely fitting chronological age. This perspective emphasizes interpretability, physiological relevance, and consistency with known aging patterns, and is commonly assessed through cross-sectional and longitudinal analyses.

Cross-sectional evaluation examines BA differences across individuals or subgroups at a single time point. A widely used indicator is the BA delta, defined as the deviation between predicted BA and CA. Positive deviations are interpreted as accelerated aging, while negative deviations indicate relative biological youth. Cross-sectional analyses have been used to identify health disparities associated with disease status, frailty, lifestyle factors, and socioeconomic conditions [44, 117]. Despite their scalability and practicality, cross-sectional studies rely on the assumption that associations between biomarkers and CA reflect true aging processes, an assumption that may be violated by cohort effects or confounding factors [113].

Longitudinal cohort studies provide a stronger biological validation framework by tracking intra-individual BA trajectories over time [122]. Fixed cohorts, in particular, enable direct assessment of whether BA acceleration predicts future health decline, disease onset, or mortality. Numerous studies have demonstrated that longitudinal BA changes are associated with cognitive impairment, multimorbidity, and survival outcomes, supporting BA as a prognostic aging marker [118, 120]. For example, cohort analyses of centenarian populations have revealed stable molecular and physiological signatures associated with exceptional longevity, confirming that longitudinal evaluation captures genuine biological aging trajectories [119]. EHR further extend longitudinal validation by providing real-world clinical evidence [123]. Large-scale EHR data allow assessment of whether BA acceleration predicts incident disease, multimorbidity, and mortality across heterogeneous populations [37]. Demonstrated associations between elevated BA and adverse outcomes support its prognostic and translational relevance beyond CA. Despite challenges such as missing data and coding bias, EHR-based analyses strengthen the link between biological aging signals and tangible health consequences [85].

Taken together, data-oriented metrics provide standardized benchmarks for model comparison, while knowledge-oriented evaluation establishes biological plausibility and translational relevance. Robust BA assessment therefore requires the integration of statistical accuracy and biologically informed validation, further strengthened by real-world clinical evidence derived from longitudinal cohorts and EHR-based outcome analyses.

Challenges

BA is an indicator that estimates the degree of physiological aging by assessing an individual’s biological state. Compared with CA, BA more accurately reflects health status and aging rate, and therefore plays a crucial role in aging research and clinical practice. With the rapid development of AI and high-throughput biomedical technologies, BA prediction has gradually shifted from traditional statistical models to data-driven AI-based approaches. These advances have significantly improved predictive performance and enabled the integration of multi-modal data. However, the increasing complexity of AI-based BA models has also introduced new methodological and conceptual challenges. In particular, issues related to biomarker selection, model interpretability, validation strategies, generalizability, and the characterization of asynchronous aging remain unresolved. This section discusses the key challenges of BA research from five interconnected perspectives, providing a reference for future in-depth studies.

How to select representative aging markers under multi-modal and asynchronous aging frameworks

Recent AI-based BA studies leverage a wide range of biomarkers, including blood tests, DNA methylation, transcriptomics, metabolomics, medical imaging (e.g., brain MRI, X-rays), facial and retinal images, and wearable sensor data [124]. Deep learning models are particularly effective at extracting latent features from high-dimensional data, enabling multi-modal fusion for BA prediction. However, the impact of biomarker selection and combination on BA estimation remains unclear [125].

Moreover, aging is increasingly recognized as an asynchronous and organ-specific process, in which different tissues and systems age at distinct rates within the same individual [107]. Current AI models often aggregate heterogeneous biomarkers into a single BA output, potentially obscuring organ-level aging patterns and biological heterogeneity. In addition, substantial variability in biomarker panels across studies limits reproducibility and cross-cohort comparability. Therefore, a key challenge lies in developing AI-driven biomarker selection strategies that are not only predictive but also biologically interpretable, capable of capturing asynchronous aging processes across organs and systems.

How to analyze the logical structure of BA to improve the interpretability of AI-based models

Existing BA prediction methods range from traditional models such as the KDM and epigenetic clocks (e.g., Horvath Clock) to machine learning and deep learning approaches. While AI models excel at handling complex, nonlinear relationships in large-scale datasets, they often function as “black boxes,” lacking explicit representation of the biological logic underlying BA [126]. This opacity limits their acceptance in clinical settings, where interpretability and causal insight are critical.

Additionally, the absence of a universally accepted gold standard for BA poses a fundamental challenge. Most supervised AI models use CA as the training label, implicitly assuming that deviations from CA reflect biological aging. This assumption may bias models toward chronological patterns rather than true aging mechanisms and increases the risk of overfitting [101]. Consequently, improving BA prediction requires not only algorithmic innovation but also a clearer conceptual framework that links AI-derived features to biological aging pathways and asynchronous aging structures.

How to scientifically validate AI-based BA models beyond chronological age prediction

BA remains a latent construct without a direct measurement standard. Although CA is commonly used as a proxy for model evaluation, this approach fails to capture health-related deviations that define biological aging. Different AI models may yield substantially different BA estimates for the same individual, complicating model comparison and selection [127].

Current evaluation metrics such as MSE, RMSE, MAE, and primarily assess numerical agreement with CA and do not adequately reflect biomedical relevance. Longitudinal validation using morbidity, mortality, or functional decline as endpoints is more informative but requires extensive time and resources. Cross-sectional studies, which dominate current AI-based BA research, cannot fully capture temporal aging trajectories or asynchronous aging dynamics [108]. Therefore, establishing a systematic and multi-level validation framework that integrates clinical outcomes, longitudinal changes, and organ-specific aging indicators remains a major challenge.

How to enhance the accuracy, robustness, and generalizability of AI-driven BA models

To facilitate the clinical translation of AI-based BA models, future research must move beyond incremental performance gains and address foundational challenges in model design, data integration, and validation. Incorporating established aging mechanisms—such as inflammation, metabolic dysregulation, cellular senescence, and epigenetic drift—can improve biological interpretability and reduce spurious associations [45]. Aging should also be modeled as a multidimensional and asynchronous process, with multi-omics and multi-modal data integration enabling the capture of organ- and system-specific aging trajectories []. Interpretable and explainable AI approaches are essential for clinical trust and adoption, while robust generalization requires large-scale, longitudinal, and demographically diverse cohorts [128]. Finally, standardized protocols for data processing, model training, and evaluation are critical for reproducibility and cross-study comparability.

By systematically addressing these challenges, AI-based BA models can move beyond mere CA prediction toward a more biologically grounded and clinically actionable understanding of human aging.

Integration of genetic susceptibility and medical history

Although genetic variation is estimated to explain a substantial proportion of aging heterogeneity, most AI-driven BA models do not explicitly model genotype or inherited susceptibility. Instead, models trained against chronological age risk conflating disease burden with intrinsic aging processes. Recent reviews emphasize that biological aging should be conceptualized as a multidimensional process shaped by genetic background, environmental exposure, and stochastic factors [, 127].

Incorporating genotype information (e.g., APOE status, FOXO3 variants, polygenic risk scores) alongside longitudinal medical history requires harmonized genomic and clinical datasets, careful control for population stratification, and bias-aware modeling strategies [36]. Developing unified frameworks that jointly model multimodal biomarkers, genetic predisposition, and clinical trajectories remains a major frontier in precision aging research.

Statements

Author contributions

GW drafted the manuscript and contributed to review and editing; PD and ZL drafted the manuscript; QT provided resources and contributed to review and editing; LT and TJ contributed to review and editing; LZ acquired funding; BS supervised the study and contributed to review and editing; JX acquired funding and supervised the study; HA administered the project. All authors contributed to the article and approved the submitted version.

Funding

The author(s) declared that financial support was received for this work and/or its publication. This work was supported by National Key R & D Plan Project of China (No. 2017YFC1703301), National Nature Science Foundation of China (No. 62302014, No. 62502007), High level Key Discipline Construction Project of Traditional Chinese Medicine of China (No. ZYYZDXK-2023069), Shanghai Biomedical Technology Support Special Project (No.22S31901100).

Conflict of interest

The authors(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

References

  • 1.

    LiuCMKuoMJKuoCYWuICChenPFHsuWTet alReclassification of the conventional risk assessment for aging-related diseases by electrocardiogram-enabled biological age. npj Aging (2025) 11:7. 10.1038/s41514-025-00198-0

  • 2.

    NakamuraEMiyaoK. Further evaluation of the basic nature of the human biological aging process based on a factor analysis of age-related physiological variables. The Journals Gerontol Ser A: Biol Sci Med Sci (2003) 58:B196B204. 10.1093/gerona/58.3.b196

  • 3.

    LiGChengLWongINYinYChenJLiuLet alPredicting healthspan and disease risks through biological age. Trends Mol Med (2025) 32:35469. 10.1016/j.molmed.2025.10.006

  • 4.

    KlemeraPDoubalS. A new approach to the concept and computation of biological age. Mech Ageing Development (2006) 127:2408. 10.1016/j.mad.2005.10.004

  • 5.

    LevineMELuATQuachAChenBHAssimesTLBandinelliSet alAn epigenetic biomarker of aging for lifespan and healthspan. Aging (Albany NY) (2018) 10:57391. 10.18632/aging.101414

  • 6.

    JeeHParkJ. Selection of an optimal set of biomarkers and comparative analyses of biological age estimation models in korean females. Arch Gerontol Geriatr (2017) 70:8491. 10.1016/j.archger.2017.01.005

  • 7.

    PetersMJJoehanesRPillingLCSchurmannCConneelyKNPowellJet alThe transcriptional landscape of age in human peripheral blood. Nat Commun (2015) 6:8570. 10.1038/ncomms9570

  • 8.

    MitnitskiABGrahamJEMogilnerAJRockwoodK. Frailty, fitness and late-life mortality in relation to chronological and biological age. BMC Geriatr (2002) 2:1. 10.1186/1471-2318-2-1

  • 9.

    WangCGuanXBaiYFengYWeiWLiHet alA machine learning–based biological aging prediction and its associations with healthy lifestyles: the dongfeng–tongji cohort. Ann New York Acad Sci (2022) 1507:10820. 10.1111/nyas.14685

  • 10.

    ChenBHMarioniREColicinoEPetersMJWard-CavinessCKTsaiPCet alDna methylation-based measures of biological age: meta-analysis predicting time to death. Aging (Albany NY) (2016) 8:184465. 10.18632/aging.101020

  • 11.

    ComfortA. Test-battery to measure ageing-rate in man. The Lancet (1969) 294:14115. 10.1016/S0140-6736(69)90950-7

  • 12.

    ZurbuchenRvon DänikenAJankaHvon WolffMStuteP. Methods for the assessment of biological age – a systematic review. Maturitas (2025) 195:108215. 10.1016/j.maturitas.2025.108215

  • 13.

    ElliottMLCaspiAHoutsRMAmblerABroadbentJMHancoxRJet alDisparities in the pace of biological aging among midlife adults of the same chronological age have implications for future frailty risk and policy. Nat Aging (2021) 1:295308. 10.1038/s43587-021-00044-4

  • 14.

    HorvathS. Dna methylation age of human tissues and cell types. Genome Biol (2013) 14:3156. 10.1186/gb-2013-14-10-r115

  • 15.

    LiXPlonerAWangYMagnussonPKReynoldsCFinkelDet alLongitudinal trajectories, correlations and mortality associations of nine biological ages across 20-years follow-up. eLife (2020) 9:e51507. 10.7554/eLife.51507

  • 16.

    RandoTAWyss-CorayT. Asynchronous, contagious and digital aging. Nat Aging (2021) 1:2935. 10.1038/s43587-020-00015-1

  • 17.

    ZhavoronkovAMamoshinaPVanhaelenQScheibye-KnudsenMMoskalevAAliperA. Artificial intelligence for aging and longevity research: recent advances and perspectives. Ageing Res Rev (2019) 49:4966. 10.1016/j.arr.2018.11.003

  • 18.

    JohnsonAAShokhirevMNWyss-CorayTLehallierB. Systematic review and analysis of human proteomics aging studies unveils a novel proteomic aging clock and identifies key processes that change with age. Ageing Res Rev (2020) 60:101070. 10.1016/j.arr.2020.101070

  • 19.

    ReaverAHewlingsSWestermanKBlanderGSchmellerTHeerMet alA randomized, placebo-controlled, double-blind crossover study to assess a unique phytosterol ester formulation in lowering ldl cholesterol utilizing a novel virtual tracking tool. Nutrients (2019) 11:2108. 10.3390/nu11092108

  • 20.

    SandersJLNewmanAB. Telomere length in epidemiology: a biomarker of aging, age-related disease, both, or neither?Epidemiologic Rev (2013) 35:11231. 10.1093/epirev/mxs008

  • 21.

    HannumGGuinneyJZhaoLZhangLHughesGSaddaSet alGenome-wide methylation profiles reveal quantitative views of human aging rates. Mol Cell (2013) 49:35967. 10.1016/j.molcel.2012.10.016

  • 22.

    LevineME. Modeling the rate of senescence: can estimated biological age predict mortality more accurately than chronological age?The Journals Gerontol Ser A (2012) 68:66774. 10.1093/gerona/gls233

  • 23.

    LuATQuachAWilsonJGReinerAPAvivARajKet alDna methylation grimage strongly predicts lifespan and healthspan. Aging (Albany NY) (2019) 11:30327. 10.18632/aging.101684

  • 24.

    McCroryCFioritoGHernandezBPolidoroSO’HalloranAMHeverAet alGrimage outperforms other epigenetic clocks in the prediction of age-related clinical phenotypes and all-cause mortality. The Journals Gerontol Ser A (2020) 76:7419. 10.1093/gerona/glaa286

  • 25.

    TayJHBarrosDWangWWaznyVKMaierAB. Biological age measured by dna methylation clocks and frailty: a systematic review and meta-analysis. The Lancet Healthy Longevity (2025) 6:100773. 10.1016/j.lanhl.2025.100773

  • 26.

    EllisDWatanabeKWilmanskiTLustgartenMSKoratAVAGlusmanGet alAPOE genotype and biological age impact inter-omic associations related to bioenergetics. Aging (2025) 17:110538. 10.18632/aging.206243

  • 27.

    FöhrTWallerKViljanenASanchezROllikainenMRantanenTet alDoes the epigenetic clock grimage predict mortality independent of genetic influences: an 18 year follow-up study in older female twin pairs. Clin Epigenetics (2021) 13:128. 10.1186/s13148-021-01112-7

  • 28.

    GreiderCW. Telomere length regulation. Annu Review Biochemistry (1996) 65:33765. 10.1146/annurev.bi.65.070196.002005

  • 29.

    ZglinickiTMartin-RuizC. Telomeres as biomarkers for ageing and age-related diseases. Curr Molecular Medicine (2005) 5:197203. 10.2174/1566524053586545

  • 30.

    MatherKAJormAFParslowRAChristensenH. Is telomere length a biomarker of aging? A review. The Journals Gerontol Ser A (2010) 66A:20213. 10.1093/gerona/glq180

  • 31.

    BlascoMA. Telomere length, stem cells and aging. Nat Chem Biol (2007) 3:6409. 10.1038/nchembio.2007.38

  • 32.

    WillcoxBJDonlonTAHeQChenRGroveJSYanoKet alFoxo3a genotype is strongly associated with human longevity. Proc Natl Acad Sci (2008) 105:1398792. 10.1073/pnas.0801030105

  • 33.

    TorigoeTHWillcoxDCShimabukuroMHigaMGerschensonMAndrukhivAet alNovel protective effect of the foxo3 longevity genotype on mechanisms of cellular aging in okinawans. npj Aging (2024) 10:18. 10.1038/s41514-024-00142-8

  • 34.

    EarlsJCRappaportNHeathLWilmanskiTMagisATSchorkNJet alMulti-omic biological age estimation and its correlation with wellness and disease phenotypes: a longitudinal study of 3,558 individuals. The Journals Gerontol Ser A (2019) 74:S52S60. 10.1093/gerona/glz220

  • 35.

    RobinsonOChadeau HyamMKaramanIClimaco PintoRAla-KorpelaMHandakasEet alDeterminants of accelerated metabolomic and epigenetic aging in a UK cohort. Aging Cell (2020) 19:e13149. 10.1111/acel.13149

  • 36.

    RutledgeJOhHWyss-CorayT. Measuring biological age using omics data. Nat Rev Genet (2022) 23:71527. 10.1038/s41576-022-00511-7

  • 37.

    WangKLiuFWuWHuCShenXWangMet alA full life cycle biological clock based on routine clinical data and its impact in health and diseases. Nat Med (2025) 31:422535. 10.1038/s41591-025-04006-w

  • 38.

    SilvaNRajadoATEstevesFBritoDApolónioJRobertoVPet alMeasuring healthy ageing: current and future tools. Biogerontology (2023) 24:84566. 10.1007/s10522-023-10041-2

  • 39.

    ReaIMGibsonDSMcGilliganVMcNerlanSEAlexanderHDRossOA. Age and age-related diseases: role of inflammation triggers and cytokines. Front Immunol (2018) 9:586. 10.3389/fimmu.2018.00586

  • 40.

    Puzianowska-KuźnickaMOwczarzMWieczorowska-TobisKNadrowskiPChudekJSlusarczykPet alInterleukin-6 and c-reactive protein, successful aging, and mortality: the polsenior study. Immun & Ageing (2016) 13:21. 10.1186/s12979-016-0076-x

  • 41.

    MamoshinaPKochetovKLaneECorteseFAliperALeeWSet alPopulation specific biomarkers of human aging: a big data study using south korean, canadian, and eastern european patient populations. The Journals Gerontol Ser A (2018) 73:148290. 10.1093/gerona/gly005

  • 42.

    SagersLMelas-KyriaziLPatelCJManraiAK. Prediction of chronological and biological age from laboratory data. Aging (Albany NY) (2020) 12:762638. 10.18632/aging.102900

  • 43.

    ChenLZhangYYuCGuoYSunDPangYet alModeling biological age using blood biomarkers and physical measurements in chinese adults. eBioMedicine (2023) 89:104458. 10.1016/j.ebiom.2023.104458

  • 44.

    MitnitskiAHowlettSERockwoodK. Heterogeneity of human aging and its assessment. The Journals Gerontol Ser A (2016) 72:87784. 10.1093/gerona/glw089

  • 45.

    JeongCULeibyJSKimDChoeEK. Artificial intelligence-driven biological age prediction model using comprehensive health checkup data: development and validation study. JMIR Aging (2025) 8:e64473. 10.2196/64473

  • 46.

    ShadyabAHLaCroixAZ. Genetic factors associated with longevity: a review of recent findings. Ageing Res Rev (2015) 19:17. 10.1016/j.arr.2014.10.005

  • 47.

    PyrkovTVAvchaciovKTarkhovAEMenshikovLIGudkovAVFedichevPO. Longitudinal analysis of blood markers reveals progressive loss of resilience and predicts human lifespan limit. Nat Commun (2021) 12:2765. 10.1038/s41467-021-23014-1

  • 48.

    BontempiDZalayOBittermanDSBirkbakNShyrDHauggFet alFaceage, a deep learning system to estimate biological age from face photographs to improve prognostication: a model development and validation study. The Lancet Digital Health (2025) 7:100870. 10.1016/j.landig.2025.03.002

  • 49.

    RazNLindenbergerURodrigueKMKennedyKMHeadDWilliamsonAet alRegional brain changes in aging healthy adults: general trends, individual differences and modifiers. Cereb Cortex (2005) 15:167689. 10.1093/cercor/bhi044

  • 50.

    DriscollIHamiltonDAPetropoulosHYeoRABrooksWMBaumgartnerRNet alThe aging hippocampus: cognitive, biochemical and structural findings. Cereb Cortex (2003) 13:134451. 10.1093/cercor/bhg081

  • 51.

    HeSGrantPEOuY. Global-local transformer for brain age estimation. IEEE Trans Med Imaging (2022) 41:21324. 10.1109/TMI.2021.3108910

  • 52.

    ChengJLiuZGuanHWuZZhuHJiangJet alBrain age estimation from mri using cascade networks with ranking loss. IEEE Trans Med Imaging (2021) 40:340012. 10.1109/TMI.2021.3085948

  • 53.

    DemirjianAGoldsteinHTannerJM. A new system of dental age assessment. Hum Biol (1973) 45:21127. Available online at: http://www.jstor.org/stable/41459864.

  • 54.

    GalibourgACussat-BlancSDumoncelJTelmonNMonsarratPMaretD. Comparison of different machine learning approaches to predict dental age using demirjian’s staging approach. Int J Leg Med (2021) 135:66575. 10.1007/s00414-020-02489-5

  • 55.

    ZhangDYangJDuSBuWGuoY. An uncertainty-aware and sex-prior guided biological age estimation from orthopantomogram images. IEEE J Biomed Health Inform (2023) 27:492637. 10.1109/JBHI.2023.3297610

  • 56.

    CampicheRTrevisanSSéroulPRawlingsAVAdnetCImfeldDet alAppearance of aging signs in differently pigmented facial skin by a novel imaging system. J Cosmet Dermatol (2019) 18:61427. 10.1111/jocd.12806

  • 57.

    BobrovEGeorgievskayaAKiselevKSevastopolskyAZhavoronkovAGurovSet alPhotoageclock: deep learning algorithms for development of non-invasive visual biomarkers of aging. Aging (Albany NY) (2018) 10:324959. 10.18632/aging.101629

  • 58.

    XiaXChenXWuGLiFWangYChenYet alThree-dimensional facial-image analysis to predict heterogeneity of the human ageing rate and the impact of lifestyle. Nat Metab (2020) 2:94657. 10.1038/s42255-020-00270-x

  • 59.

    JohnsonTE. Recent results: biomarkers of aging. Exp Gerontol (2006) 41:12436. 10.1016/j.exger.2006.09.006

  • 60.

    LongstrethJWTLarsenEKMKleinRWongTYSharrettARLefkowitzDet alAssociations between findings on cranial magnetic resonance imaging and retinal photography in the elderly: the cardiovascular health study. Am J Epidemiol (2006) 165:7884. 10.1093/aje/kwj350

  • 61.

    AhadiSWilsonKABabenkoBMcLeanCYBryantDPritchardOet alLongitudinal fundus imaging and its genome-wide association analysis provide evidence for a human retinal aging clock. eLife (2023) 12:e82364. 10.7554/eLife.82364

  • 62.

    AllsoppRCVaziriHPattersonCGoldsteinSYounglaiEVFutcherABet alTelomere length predicts replicative capacity of human fibroblasts. Proc Natl Acad Sci (1992) 89:101148. 10.1073/pnas.89.21.10114

  • 63.

    FleischerJGSchulteRTsaiHHTyagiSIbarraAShokhirevMNet alPredicting age from the transcriptome of human dermal fibroblasts. Genome Biol (2018) 19:221. 10.1186/s13059-018-1599-6

  • 64.

    HolzscheckNFalckenhaynCSöhleJKristofBSiegnerRWernerAet alModeling transcriptomic age using knowledge-primed artificial neural networks. Npj Aging Mech Dis (2021) 7:15. 10.1038/s41514-021-00068-5

  • 65.

    JovéMPortero-OtínMNaudíAFerrerIPamplonaR. Metabolomics of human brain aging and age-related neurodegenerative diseases. J Neuropathol & Exp Neurol (2014) 73:64057. 10.1097/NEN.0000000000000091

  • 66.

    ZhangQVallergaCLWalkerRMLinTHendersAKMontgomeryGWet alImproved precision of epigenetic clock estimates across tissues and its implication for biological ageing. Genome Med (2019) 11:54. 10.1186/s13073-019-0667-1

  • 67.

    ArmstrongNJMatherKAThalamuthuAWrightMJTrollorJNAmesDet alAging, exceptional longevity and comparisons of the hannum and horvath epigenetic clocks. Epigenomics (2017) 9:689700. 10.2217/epi-2016-0179

  • 68.

    PutinEMamoshinaPAliperAKorzinkinMMoskalevAKolosovAet alDeep biomarkers of human aging: application of deep neural networks to biomarker development. Aging (Albany NY) (2016) 8:102133. 10.18632/aging.100968

  • 69.

    GalkinFMamoshinaPKochetovKSidorenkoDZhavoronkovA. Deepmage: a methylation aging clock developed with deep learning. Aging Dis (2021) 12:125262. 10.14336/AD.2020.1202

  • 70.

    BahourNCortezBPanHShahHDoriaAAguayo-MazzucatoC. Diabetes mellitus correlates with increased biological age as indicated by clinical biomarkers. GeroScience (2022) 44:41527. 10.1007/s11357-021-00469-0

  • 71.

    ArmaniousKAbdulatifSShiWSalianSKüstnerTWeiskopfDet alAge-net: an mri-based iterative framework for brain biological age estimation. IEEE Trans Med Imaging (2021) 40:177891. 10.1109/TMI.2021.3066857

  • 72.

    GanaieMATanveerMBeheshtiI. Brain age prediction with improved least squares twin svr. IEEE J Biomed Health Inform (2023) 27:16619. 10.1109/JBHI.2022.3147524

  • 73.

    BaeCYImYLeeJParkCSKimMKwonHet alComparison of biological age prediction models using clinical biomarkers commonly measured in clinical practice settings: ai techniques vs. traditional statistical methods. Front Anal Sci (2021) 1:709589. 10.3389/frans.2021.709589

  • 74.

    ZhongXLuYGaoQNyuntMSZFulopTMonterolaCPet alEstimating biological age in the Singapore longitudinal aging study. The Journals Gerontol Ser A (2019) 75:191320. 10.1093/gerona/glz146

  • 75.

    BortzJGuarigliaAKlaricLTangDWardPGeerMet alBiological age estimation using circulating blood biomarkers. Commun Biol (2023) 6:1089. 10.1038/s42003-023-05456-z

  • 76.

    MarcoulidesKMRaykovT. Evaluation of variance inflation factors in regression models using latent variable modeling methods. Educ Psychological Measurement (2019) 79:87482. 10.1177/0013164418817803

  • 77.

    ZhangWGBaiXJSunXFCaiGYBaiXYZhuSYet alConstruction of an integral formula of biological age for a healthy chinese population using principle component analysis. The J Nutrition, Health Aging (2014) 18:13742. 10.1007/s12603-013-0345-8

  • 78.

    LiXZhangJSunCZhangYCaiRFuSet alApplication of biological age assessment of chinese population in potential anti-ageing technology. Immun & Ageing (2018) 15:33. 10.1186/s12979-018-0140-9

  • 79.

    NakamuraEMiyaoKOzekiT. Assessment of biological age by principal component analysis. Mech Ageing Development (1988) 46:118. 10.1016/0047-6374(88)90109-1

  • 80.

    TaguchiYMurakamiY. Principal component analysis based feature extraction approach to identify circulating microrna biomarkers. PLOS ONE (2013) 8:112. 10.1371/journal.pone.0066714

  • 81.

    ProszAPipekOBörcsökJPallaGSzallasiZSpisakSet alBiologically informed deep learning for explainable epigenetic clocks. Scientific Rep (2024) 14:1306. 10.1038/s41598-023-50495-5

  • 82.

    KhemaniBPatilSKotechaKTanwarS. A review of graph neural networks: concepts, architectures, techniques, challenges, datasets, applications, and future directions. J Big Data (2024) 11:18. 10.1186/s40537-023-00876-4

  • 83.

    YinCImmsPChengMAmgalanAChowdhuryNFMassettRJet alAnatomically interpretable deep learning of brain age captures domain-specific cognitive impairment. Proc Natl Acad Sci (2023) 120:e2214634120. 10.1073/pnas.2214634120

  • 84.

    TimmersPRMounierNLallKFischerKNingZFengXet alGenomics of 1 million parent lifespans implicates novel pathways and common diseases and distinguishes survival chances. eLife (2019) 8:e39856. 10.7554/eLife.39856

  • 85.

    HanCJRoskoAEPlascakJJTanANoonanAMBurdCE. Biological aging and chemotoxicity in patients with colorectal cancer: a secondary data analysis using ehr data. Curr Oncol (2025) 32:438. 10.3390/curroncol32080438

  • 86.

    UrbanASidorenkoDZagirovaDKozlovaEKalashnikovAPushkovSet alMultimodal transformer-based transfer learning for aging clock development and target discovery (precious1gpt). Aging (Albany NY) (2023) 15:464966. 10.18632/aging.204788

  • 87.

    WangKGaoJLiuYLiuZLiYChenSet alBiological age construction for prediction of mortality in the chinese population. GeroScience (2025) 47:586980. 10.1007/s11357-025-01612-x

  • 88.

    ChanMSArnoldMOfferAHammamiIMafhamMArmitageJet alA biomarker-based biological age in UK biobank: composition and prediction of mortality and hospital admissions. The Journals Gerontol Ser A (2021) 76:1295302. 10.1093/gerona/glab069

  • 89.

    LiuYKangMWeiWHuiJGouYLiuCet alDietary diversity score and the acceleration of biological aging: a population-based study of 88,039 participants. The J Nutrition, Health Aging (2024) 28:100271. 10.1016/j.jnha.2024.100271

  • 90.

    OddenMCMelzerD. Machine learning in aging research. The Journals Gerontol Ser A (2019) 74:19012. 10.1093/gerona/glz074

  • 91.

    LehallierBShokhirevMNWyss-CorayTJohnsonAA. Data mining of human plasma proteins generates a multitude of highly predictive aging clocks that reflect different aspects of aging. Aging Cell (2020) 19:e13256. 10.1111/acel.13256

  • 92.

    GuoYLiuYOerlemansALaoSWuSLewMS. Deep learning for visual understanding: a review. Neurocomputing (2016) 187:2748. 10.1016/j.neucom.2015.09.116

  • 93.

    AngermuellerCLeeHJReikWStegleO. Deepcpg: accurate prediction of single-cell dna methylation states using deep learning. Genome Biol (2017) 18:67. 10.1186/s13059-017-1189-z

  • 94.

    WangJGaoYWangFZengSLiJMiaoHet alAccurate estimation of biological age and its application using multimodal transformer architectures combining facial, tongue, and retinal images. Proc Natl Acad Sci (Pnas) (2024) 121:e2308812120. 10.1073/pnas.2308812120

  • 95.

    RibeiroRMoraesAMorenoMFerreiraPG. Integration of multi-modal datasets to estimate human aging. Machine Learn (2024) 113:7293317. 10.1007/s10994-024-06588-x

  • 96.

    BarbanoCADufumierBDuchesnayEGrangettoMGoriP. Contrastive learning for regression in multi-site brain age prediction. IN IEEE 20th International Symposium on Biomedical Imaging (ISBI), Cartagena, Colombia (2023). 14. 10.1109/ISBI53787.2023.10230733

  • 97.

    Arango-ArgotyGBikielDESunGJKipkogeiESmithKMCarrascoPSet alAi-driven predictive biomarker discovery with contrastive learning to improve clinical trial outcomes. Cancer Cell (2025) 43:87590.e8. 10.1016/j.ccell.2025.03.029

  • 98.

    Anonfe. Deeply supervised multitask autoencoder for brain age estimation. arXiv Preprint (2025). 10.48550/arXiv.2508.01565

  • 99.

    XiongMLinLJinYKangWWuSSunS. Comparison of machine learning models for brain age prediction using six imaging modalities on middle-aged and older adults. Sensors (2023) 23:3622. 10.3390/s23073622

  • 100.

    AshiqurRSGiacobbiPPylesLMullettCDorettoGAdjerohDA. Deep learning for biological age estimation. Brief Bioinform (2020) 22:176781. 10.1093/bib/bbaa021

  • 101.

    WangHTrederMSMarshallDJonesDKLiY. A skewed loss function for correcting predictive bias in brain age prediction. IEEE Trans Med Imaging (2023) 42:157789. 10.1109/tmi.2022.3231730

  • 102.

    ZhangJWangSLiuB. New insights into the genetics and epigenetics of aging plasticity. Genes (2023) 14:329. 10.3390/genes14020329

  • 103.

    SluiskesMGoemanJBeekmanMSlagboomEvan den AkkerEPutterHet alThe accelerage framework: a new statistical approach to predict biological age based on time-to-event data. Eur J Epidemiol (2024) 39:119. 10.1007/s10654-024-01114-8

  • 104.

    de LangeAMGColeJH. Commentary: correction procedures in brain-age prediction. NeuroImage: Clin (2020) 26:102229. 10.1016/j.nicl.2020.102229

  • 105.

    CondadoJGTellaetxe-ElorriagaJMCortesJMErramuzpeA. Ageml: age modeling with machine learning. IEEE J Biomed Health Inform (2025) 29: 37723781. 10.1109/JBHI.2025.3531017

  • 106.

    BarthelsMVerhofstadtEDelgadoIBGruwezHPisonLPierletNet alArtificial intelligence-predicted ecg age gap as a biomarker: bias-adjusted correlation with mortality and cardiovascular risk factors. Eur Heart J - Digital Health (2025) 7:ztaf137. 10.1093/ehjdh/ztaf137

  • 107.

    OhHSHRutledgeJNachunDPálovicsRAbioseOMoran-LosadaPet alOrgan aging signatures in the plasma proteome track health and disease. Nature (2023) 624:16472. 10.1038/s41586-023-06802-1

  • 108.

    TangQTaoCPanZWangGLiuKPanZet alA novel method for vascular age estimation via pressure pulse wave of radial artery. Biomed Signal Process Control (2022) 78:103904. 10.1016/j.bspc.2022.103904

  • 109.

    TangQPanZTaoCJiangJSuBAnHet alVascular age acquired from the pulse signal: a new index to screen early vascular aging. Comput Biol Med (2022) 151:106355. 10.1016/j.compbiomed.2022.106355

  • 110.

    FongSKohWPGruberJ. Reframing biological age as risk-equivalent age. Nat Aging (2025) 6:25. 10.1038/s43587-025-01038-2

  • 111.

    LiYYeBYangJLeiZYuanLKongMet alEstimation of biological age and age-related outcomes with easily accessible parameters in chinese. GeroScience (2025). 10.1007/s11357-025-01940-y

  • 112.

    de Lima CamilloLPLapierreLRSinghR. A pan-tissue DNA-methylation epigenetic clock based on deep learning. npj Aging (2022) 8:4. 10.1038/s41514-022-00085-y

  • 113.

    SluiskesMGoemanJBeekmanMSlagboomPPutterHRodríguez-GirondoM. Clarifying the biological and statistical assumptions of cross-sectional biological age predictors: an elaborate illustration using synthetic and real data. BMC Med Res Methodol (2024) 24:58. 10.1186/s12874-024-02181-x

  • 114.

    FongSPabisKLatumaleaDDugersurenNUnfriedMTolwinskiNet alPrincipal component-based clinical aging clocks identify signatures of healthy aging and targets for clinical intervention. Nat Aging (2024) 4:113752. 10.1038/s43587-024-00646-8

  • 115.

    LiXHaoZLiDJinQTangZYaoXet alBrain age prediction via cross-stratified ensemble learning. NeuroImage (2024) 299:120825. 10.1016/j.neuroimage.2024.120825

  • 116.

    PyrkovTVSokolovISFedichevPO. Deep longitudinal phenotyping of wearable sensor data reveals independent markers of longevity, stress, and resilience. Aging (Albany NY) (2021) 13:790013. 10.18632/aging.202816

  • 117.

    ZhangWJiaLCaiGShaoFLinHLiuZet alModel construction for biological age based on a cross-sectional study of a healthy chinese han population. The J Nutrition, Health Aging (2017) 21:12339. 10.1007/s12603-017-0874-7

  • 118.

    LiuZ. Development and validation of 2 composite aging measures using routine clinical biomarkers in the chinese population: analyses from 2 prospective cohort studies. The Journals Gerontol Ser A (2021) 76:162732. 10.1093/gerona/glaa238

  • 119.

    ZhangWLiZNiuYZheFLiuWFuSet alThe biological age model for evaluating the degree of aging in centenarians. Arch Gerontol Geriatr (2024) 117:105175. 10.1016/j.archger.2023.105175

  • 120.

    YooJHurJYooJJurivichDLeeK. A novel approach to quantifying individual’s biological aging using korea’s national health screening program toward precision public health. GeroScience (2024) 46:3387403. 10.1007/s11357-024-01079-2

  • 121.

    ZhangKChenPCHuangYTzouSJWuSTChuTWet alPrediction of biological age using machine learning. PLOS ONE (2025) 20:131. 10.1371/journal.pone.0330184

  • 122.

    YangMChaoKWangZXueRZhangXWangD. Accelerated biological aging and risk of atrial fibrillation: a cohort study. Heart Rhythm (2025) 22:250714. 10.1016/j.hrthm.2024.11.017

  • 123.

    PavlukDTheurlFProellSSchreinlecherMHoferFRockenschaubPet alAi-ecg-derived biological age as a predictor of mortality in cardiovascular and acute care patients. Eur Heart J - Digital Health (2025) 6:120415. 10.1093/ehjdh/ztaf109

  • 124.

    MitnitskiACollertonJMartin-RuizCJaggerCvon ZglinickiTRockwoodKet alAge-related frailty and its association with biological markers of ageing. BMC Medicine (2015) 13:19. 10.1186/s12916-015-0400-x

  • 125.

    BafeiSECShenC. Biomarkers selection and mathematical modeling in biological age estimation. npj Aging (2023) 9:13. 10.1038/s41514-023-00110-8

  • 126.

    MengDZhangSHuangYMaoKHanJDJ. Application of ai in biological age prediction. Curr Opin Struct Biol (2024) 85:102777. 10.1016/j.sbi.2024.102777

  • 127.

    FerrucciLGonzalez-FreireMFabbriESimonsickETanakaTMooreZet alMeasuring biological aging in humans: a quest. Aging Cell (2020) 19:e13080. 10.1111/acel.13080

  • 128.

    SrourLBejaouiYSheJAlamTEl HajjN. Deep aging clocks: Ai-powered strategies for biological age estimation. Ageing Res Rev (2025) 112:102889. 10.1016/j.arr.2025.102889

Summary

Keywords

age, aging, aging biomarker, biological age, chronological age, predication model

Citation

Wang G, Ding P, Li Z, Tang Q, Tu L, Jiang T, Zhang L, Su B, Xu J and An H (2026) Artificial intelligence approaches in biological age prediction: current status and challenges. Br. J. Biomed. Sci. 83:16141. doi: 10.3389/bjbs.2026.16141

Received

25 December 2025

Revised

24 February 2026

Accepted

28 May 2026

Published

10 June 2026

Volume

83 - 2026

Updates

Copyright

*Correspondence: Hui An, ; Qingfeng Tang,

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article