<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3-mathml3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:ali="http://www.niso.org/schemas/ali/1.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="1.3" xml:lang="EN">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">J. Pharm. Pharm. Sci.</journal-id>
<journal-title-group>
<journal-title>Journal of Pharmacy &#x26; Pharmaceutical Sciences</journal-title>
<abbrev-journal-title abbrev-type="pubmed">J. Pharm. Pharm. Sci.</abbrev-journal-title>
</journal-title-group>
<issn pub-type="epub">1482-1826</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">16155</article-id>
<article-id pub-id-type="doi">10.3389/jpps.2026.16155</article-id>
<article-version article-version-type="Version of Record" vocab="NISO-RP-8-2008"/>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Original Research</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Integrative deep learning strategies to enhance early-stage drug discovery: optimizing computational structure&#x2013;activity modeling for pharmacotherapeutic innovation</article-title>
<alt-title alt-title-type="left-running-head">Rezazi et al.</alt-title>
<alt-title alt-title-type="right-running-head">
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/jpps.2026.16155">10.3389/jpps.2026.16155</ext-link>
</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Rezazi</surname>
<given-names>Sarah</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/3332106"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Si-Moussa</surname>
<given-names>Cherif</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Hanini</surname>
<given-names>Salah</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
</contrib-group>
<aff id="aff1">
<label>1</label>
<institution>Faculty of Sciences, Tipaza University</institution>, <city>Tipaza</city>, <country country="DZ">Algeria</country>
</aff>
<aff id="aff2">
<label>2</label>
<institution>Laboratory of Biomaterials and Transport Phenomena (LBMPT), University of Medea</institution>, <city>M&#xe9;d&#xe9;a</city>, <country country="DZ">Algeria</country>
</aff>
<author-notes>
<corresp id="c001">
<label>&#x2a;</label>Correspondence: Sarah Rezazi, <email xlink:href="mailto:rezazi.sarah@gmail.com">rezazi.sarah@gmail.com</email>
</corresp>
</author-notes>
<pub-date publication-format="electronic" date-type="pub" iso-8601-date="2026-03-11">
<day>11</day>
<month>03</month>
<year>2026</year>
</pub-date>
<pub-date publication-format="electronic" date-type="collection">
<year>2026</year>
</pub-date>
<volume>29</volume>
<elocation-id>16155</elocation-id>
<history>
<date date-type="received">
<day>29</day>
<month>12</month>
<year>2025</year>
</date>
<date date-type="rev-recd">
<day>29</day>
<month>01</month>
<year>2026</year>
</date>
<date date-type="accepted">
<day>04</day>
<month>02</month>
<year>2026</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2026 Rezazi, Si-Moussa and Hanini.</copyright-statement>
<copyright-year>2026</copyright-year>
<copyright-holder>Rezazi, Si-Moussa and Hanini</copyright-holder>
<license>
<ali:license_ref start_date="2026-03-11">https://creativecommons.org/licenses/by/4.0/</ali:license_ref>
<license-p>This is an open-access article distributed under the terms of the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution License (CC BY)</ext-link>. The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</license-p>
</license>
</permissions>
<abstract>
<p>The integration of computational intelligence into therapeutic development is increasingly important for accelerating early-stage drug discovery and improving compound prioritization. In this study, we developed an optimized neural network&#x2013;based predictive framework to support the identification of bioactive compounds with analgesic potential. A dataset of 532 structurally diverse molecules described by 227 molecular descriptors was analyzed, and a stepwise feature elimination procedure reduced the descriptor set to 105 informative variables, improving model robustness and reducing redundancy. The optimized artificial neural network, trained using the Levenberg&#x2013;Marquardt algorithm, achieved a correlation coefficient of 95.9% with a prediction error of 0.433%, outperforming conventional statistical approaches reported for comparable QSAR tasks. Additional analysis links key descriptor groups, including connectivity and polarity parameters, to physicochemical properties relevant to analgesic activity, improving interpretability for medicinal chemistry applications. The framework is intended to support computational screening and candidate prioritization prior to experimental validation, thereby contributing to more efficient pharmacotherapeutic discovery workflows. This work highlights how data-driven modeling can complement translational strategies aimed at accelerating drug discovery pipelines.</p>
</abstract>
<abstract abstract-type="graphical">
<title>Graphical Abstract</title>
<p>
<fig>
<graphic xlink:href="JPPS_jpps-2026-16155_wc_abs.tif" position="anchor">
<alt-text content-type="machine-generated">Graphic showing a molecular structure feeding into a neural network model, which predicts therapeutic activity such as analgesic effects with 96 percent correlation, highlighting that deep learning outperforms conventional methods.</alt-text>
</graphic>
</fig>
</p>
</abstract>
<kwd-group>
<kwd>computational drug discovery</kwd>
<kwd>deep learning</kwd>
<kwd>neural networks</kwd>
<kwd>predictive modeling</kwd>
<kwd>structure&#x2013;activity relationship</kwd>
</kwd-group>
<funding-group>
<funding-statement>The author(s) declared that financial support was not received for this work and/or its publication.</funding-statement>
</funding-group>
<counts>
<fig-count count="4"/>
<table-count count="2"/>
<equation-count count="2"/>
<ref-count count="25"/>
<page-count count="8"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>Introduction</title>
<p>Artificial intelligence (AI) has become a driving force in computational drug discovery, offering efficient tools for modeling the complex relationships between molecular structure and biological activity. Traditional experimental screening methods are costly, time-consuming, and often limited by chemical diversity. In contrast, AI-based models enable rapid <italic>in silico</italic> prediction of pharmacological properties, thereby accelerating the early stages of drug development and reducing experimental workloads [<xref ref-type="bibr" rid="B1">1</xref>, <xref ref-type="bibr" rid="B2">2</xref>].</p>
<p>Among various AI approaches, Artificial Neural Networks (ANNs) have emerged as particularly effective for modeling nonlinear dependencies between molecular descriptors and biological responses. Their adaptive learning capability allows them to approximate highly complex functions, making them powerful tools for Quantitative Structure&#x2013;Activity Relationship (QSAR) analysis [<xref ref-type="bibr" rid="B3">3</xref>]. ANNs can integrate diverse molecular features, such as: topological, constitutional, and physicochemical descriptors, into predictive models that estimate biological activities with remarkable accuracy [<xref ref-type="bibr" rid="B4">4</xref>].</p>
<p>Despite their success, the use of ANNs in predicting analgesic activity remains relatively underexplored compared to other therapeutic domains such as anticancer or anti-inflammatory modeling. In many reported studies, model development suffers from insufficient feature optimization and ambiguous problem formulation, particularly regarding whether the predictive task is regression (predicting continuous activity values) or classification (discriminating between active and inactive compounds). This lack of clarity affects both model interpretability and evaluation consistency [<xref ref-type="bibr" rid="B5">5</xref>].</p>
<p>To address these challenges, the present work aims to develop a unified ANN-based framework capable of predicting analgesic activity using molecular descriptors while explicitly distinguishing between regression and classification tasks. The model was constructed using a dataset of 532 structurally diverse compounds, each characterized by 227 molecular descriptors. A stepwise feature elimination technique was employed to identify the most informative descriptors, thereby enhancing model interpretability and reducing redundancy. The optimized ANN architecture was trained using the Levenberg&#x2013;Marquardt algorithm, ensuring fast convergence and high predictive accuracy.</p>
<p>This study contributes to the field of AI-assisted molecular design by demonstrating how properly optimized neural architectures can yield accurate, interpretable, and computationally efficient models for analgesic activity prediction. Furthermore, it emphasizes the importance of rigorous feature selection and methodological transparency in constructing reliable QSAR models for modern drug discovery.</p>
</sec>
<sec sec-type="materials|methods" id="s2">
<title>Materials and methods</title>
<sec id="s2-1">
<title>Dataset</title>
<p>The dataset used in this study comprised 532 structurally diverse compounds with experimentally reported analgesic activity values collected from published literature sources. Each compound was numerically characterized by 227 molecular descriptors computed using cheminformatics tools. These descriptors covered multiple categories, including constitutional, topological, geometrical, electronic, and physicochemical properties.</p>
<p>The target variable in this work represents the analgesic activity score expressed on a continuous scale. For classification purposes, a binary activity label (active or inactive) was assigned using a statistically derived threshold value of the continuous variable. Prior to modeling, the dataset was normalized using min&#x2013;max scaling to ensure that all input features contributed equally during network training.</p>
</sec>
<sec id="s2-2">
<title>Feature selection</title>
<p>To prevent overfitting and improve computational efficiency, a stepwise feature elimination approach was employed. This technique iteratively removes low-importance descriptors while monitoring model performance at each step, ensuring that only the most informative features correlated with analgesic activity are retained.</p>
<p>The selection criterion combined the Pearson correlation coefficient (R) and relative prediction error (ERAM%), which allowed the identification of descriptors contributing most significantly to the prediction accuracy.</p>
</sec>
<sec id="s2-3">
<title>Evaluation metrics</title>
<p>Performance was assessed using two statistical indicators: the correlation coefficient (R) and the average absolute relative error (ERAM).</p>
<p>The correlation coefficient (R) quantifies the linear relationship between predicted (<inline-formula id="inf1">
<mml:math id="m1">
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>) and experimental (<inline-formula id="inf2">
<mml:math id="m2">
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>exp</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>) values, as given by <xref ref-type="disp-formula" rid="e1">Equation 1</xref>:<disp-formula id="e1">
<mml:math id="m3">
<mml:mrow>
<mml:mi>R</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:msubsup>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>N</mml:mi>
</mml:msubsup>
</mml:mstyle>
<mml:mrow>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>exp</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mi>exp</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:msubsup>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>N</mml:mi>
</mml:msubsup>
</mml:mstyle>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>exp</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mi>exp</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
<mml:mstyle displaystyle="true">
<mml:msubsup>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>N</mml:mi>
</mml:msubsup>
</mml:mstyle>
<mml:msup>
<mml:mrow>
<mml:mfenced open="(" close=")" separators="&#x7c;">
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
<label>(1)</label>
</disp-formula>
</p>
<p>The average absolute relative error (ERAM) evaluates the average percentage deviation of predicted values from the experimental ones, as expressed in <xref ref-type="disp-formula" rid="e2">Equation 2</xref>:<disp-formula id="e2">
<mml:math id="m4">
<mml:mrow>
<mml:mi>E</mml:mi>
<mml:mi>R</mml:mi>
<mml:mi>A</mml:mi>
<mml:mi>M</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>100</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:mfrac>
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>n</mml:mi>
</mml:munderover>
</mml:mstyle>
<mml:mrow>
<mml:mfenced open="|" close="|" separators="&#x7c;">
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>exp</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>exp</mml:mi>
</mml:msub>
</mml:mfrac>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
<label>(2)</label>
</disp-formula>
</p>
<p>Here <inline-formula id="inf3">
<mml:math id="m5">
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>exp</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> represents the experimental value, <inline-formula id="inf4">
<mml:math id="m6">
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> the predicted value, <inline-formula id="inf5">
<mml:math id="m7">
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mi>exp</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf6">
<mml:math id="m8">
<mml:mrow>
<mml:msub>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo>&#xaf;</mml:mo>
</mml:mover>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>l</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> their respective means, and n the number of observations.</p>
</sec>
<sec id="s2-4">
<title>ANN architecture</title>
<p>The Artificial Neural Network (ANN) was implemented and trained in MATLAB&#xae; using a feed-forward multilayer perceptron architecture. The model design followed a systematic optimization strategy involving iterative adjustment of architectural and training parameters.</p>
<p>The ANN model was implemented with:<list list-type="bullet">
<list-item>
<p>Input layer: optimized reduced descriptor set</p>
</list-item>
<list-item>
<p>Hidden layers: multiple fully connected layers with activation functions</p>
</list-item>
<list-item>
<p>Output layer: regression node predicting activity score</p>
</list-item>
<list-item>
<p>Training strategy: backpropagation with stochastic gradient descent and dropout regularization</p>
</list-item>
</list>
</p>
<p>Hyperparameter tuning was carried out through a combination of grid search and performance-based evaluation. The number of hidden neurons, learning rate, and regularization factors were optimized to minimize the average absolute relative error (ERAM%) between predicted and experimental outputs.</p>
</sec>
<sec id="s2-5">
<title>Model optimization workflow</title>
<p>The overall modeling process is summarized in <xref ref-type="fig" rid="F1">Figure 1</xref>, which presents the flowchart of the ANN optimization procedure. The workflow begins with descriptor calculation and preprocessing, followed by feature selection, network initialization, and iterative optimization of model parameters. Each iteration involved evaluating model performance on a validation subset to ensure stability and reproducibility.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>Optimized ANN architecture determination procedure.</p>
</caption>
<graphic xlink:href="jpps-29-16155-g001.tif">
<alt-text content-type="machine-generated">Flowchart illustrating the process of developing an artificial neural network, including steps for collecting a dataset, preprocessing data, defining architecture, training, evaluating performance, optimizing hyperparameters, selecting the best architecture, and ending the process.</alt-text>
</graphic>
</fig>
<p>This structured approach ensures that the final ANN model is both predictively accurate and computationally efficient, providing a reproducible framework for modeling analgesic activity in structurally diverse molecular datasets.</p>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<sec id="s3-1">
<title>Effect of descriptor reduction</title>
<p>The impact of descriptor reduction on model performance was systematically evaluated to determine the optimal set of molecular features influencing analgesic activity prediction. As illustrated in <xref ref-type="fig" rid="F2">Figure 2</xref>, progressive elimination of descriptors with minimal correlation to biological response resulted in a steady improvement in the mean absolute relative error (ERAM%) until reaching a subset of 105 descriptors, beyond which predictive performance began to decline.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Average absolute relative error (ERAM) during dataset size optimization.</p>
</caption>
<graphic xlink:href="jpps-29-16155-g002.tif">
<alt-text content-type="machine-generated">Bar chart showing ERAM percentage on the vertical axis and descriptor number on the horizontal axis, with ERAM values decreasing from descriptor 135 to 70, then slightly increasing at descriptors 65 and 60.</alt-text>
</graphic>
</fig>
<p>This indicates that the retained descriptors capture the most informative structural and physicochemical characteristics relevant to analgesic bioactivity. These include parameters describing molecular connectivity, polarity, and hydrogen-bonding potential&#x2014;factors known to modulate receptor binding affinity and pharmacodynamic behavior [<xref ref-type="bibr" rid="B6">6</xref>, <xref ref-type="bibr" rid="B7">7</xref>].</p>
<p>The observed behavior aligns with previous QSAR and deep learning studies reporting that redundant or weakly correlated features introduce noise and reduce model generalization [<xref ref-type="bibr" rid="B8">8</xref>]. Feature reduction not only improved accuracy but also enhanced computational efficiency, mitigating overfitting, one of the key challenges in high, dimensional neural modeling [<xref ref-type="bibr" rid="B9">9</xref>, <xref ref-type="bibr" rid="B10">10</xref>].</p>
</sec>
<sec id="s3-2">
<title>Optimized network topology and convergence</title>
<p>Following descriptor optimization, a series of ANN architectures were systematically evaluated to identify the configuration offering the best balance between predictive accuracy and complexity. The Levenberg&#x2013;Marquardt (LM) algorithm was selected for training due to its rapid convergence and robustness in nonlinear parameter optimization.</p>
<p>The final optimized topology (<xref ref-type="table" rid="T1">Table 1</xref>), consisted of three hidden layers, each employing a tansig activation function, followed by a single linear output neuron. This configuration achieved a correlation coefficient of R &#x3d; 0.959 and an ERAM of 0.433%, demonstrating a nearly perfect alignment between predicted and experimental analgesic activity values.</p>
<table-wrap id="T1" position="float">
<label>TABLE 1</label>
<caption>
<p>Optimized ANN architecture (MATLAB implementation).</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Layer</th>
<th align="center">Number of neurons</th>
<th align="center">Activation function</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">Input layer</td>
<td align="center">72</td>
<td align="center">---</td>
</tr>
<tr>
<td align="left">1st hidden layer</td>
<td align="center">22</td>
<td align="center">Hyperbolic tangent sigmoid (<italic>tansig</italic> function, MATLAB<sup>&#xae;</sup>)</td>
</tr>
<tr>
<td align="left">2nd hidden layer</td>
<td align="center">9</td>
<td align="center">Hyperbolic tangent sigmoid (<italic>tansig</italic> function, MATLAB<sup>&#xae;</sup>)</td>
</tr>
<tr>
<td align="left">3rd hidden layer</td>
<td align="center">2</td>
<td align="center">Hyperbolic tangent sigmoid (<italic>tansig</italic> function, MATLAB<sup>&#xae;</sup>)</td>
</tr>
<tr>
<td align="left">Output layer</td>
<td align="center">1</td>
<td align="center">Linear (<italic>purelin</italic> function, MATLAB<sup>&#xae;</sup>)</td>
</tr>
<tr>
<td align="left">Learning algorithm</td>
<td align="center">&#x2014;</td>
<td align="center">Levenberg&#x2013;Marquardt backpropagation (<italic>trainlm</italic> function, MATLAB<sup>&#xae;</sup>)</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Regression plots (<xref ref-type="fig" rid="F3">Figure 3</xref>) show near-linear correlations for both training and validation datasets, confirming excellent generalization without bias toward specific chemical subgroups. The LM-based training achieved convergence within fewer than 50 iterations, outperforming conventional gradient-descent approaches in both speed and stability [<xref ref-type="bibr" rid="B11">11</xref>].</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>Linear regression of predicted versus experimental analgesic activity using the optimized ANN model: <bold>(a)</bold> testing phase; <bold>(b)</bold> generalization phase.</p>
</caption>
<graphic xlink:href="jpps-29-16155-g003.tif">
<alt-text content-type="machine-generated">Two side-by-side scatter plots labeled (a) and (b) compare outputs versus targets with a regression line and reference line y &#x003D; t. Plot (a) shows R equals 0.86214 with a linear fit equation Y equals 0.87T plus 0.072, while plot (b) shows R equals 0.95789 with a linear fit equation Y equals 0.96T plus 0.02. Both plots display data points, the best linear fit in red, and the y equals t line as a dashed line.</alt-text>
</graphic>
</fig>
<p>Comparable studies have generally reported lower accuracies when employing shallow or single-hidden-layer ANNs for QSAR prediction, with correlation coefficients (R) typically ranging between 0.85 and 0.93 [<xref ref-type="bibr" rid="B12">12</xref>]. The superior performance achieved in the present work can be attributed to the joint optimization of both the network architecture and the feature selection process, which significantly improved the model&#x2019;s capacity to capture complex nonlinear structure&#x2013;activity relationships [<xref ref-type="bibr" rid="B13">13</xref>, <xref ref-type="bibr" rid="B14">14</xref>]. <xref ref-type="table" rid="T2">Table 2</xref> presents a comparative summary between our findings and previously published studies, highlighting key aspects such as dataset size (number of compounds), model type and topology, descriptor or feature set employed, and the corresponding reported performance metrics.</p>
<table-wrap id="T2" position="float">
<label>TABLE 2</label>
<caption>
<p>Comparative summary with previously published studies.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="center">Study (year) &#x2014; ref</th>
<th align="center">Target/task</th>
<th align="center">Dataset size (compounds)</th>
<th align="center">Model and topology</th>
<th align="center">Descriptors/feature set</th>
<th align="center">Reported performance</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="center">Tsou et al., 2020 [<xref ref-type="bibr" rid="B15">15</xref>]</td>
<td align="center">TNBC inhibitor/GPCR (MOR) hit discovery</td>
<td align="center">6,069 (train)/1,061 (test); MOR example: 63 (train)</td>
<td align="center">Deep neural networks (DNN) &#x2014; 3 hidden layers (80 neurons)</td>
<td align="center">613 descriptors (AlogP_count, ECFP_4, FCFP_4)</td>
<td align="center">DNN: Predicted R<sup>2</sup> &#x2248; 0.84&#x2013;0.94 (higher than PLS/MLR)</td>
</tr>
<tr>
<td align="center">Liu et al., 2024 [<xref ref-type="bibr" rid="B16">16</xref>]</td>
<td align="center">Classification &#x2014; lung surfactant inhibitors (QSAR classification)</td>
<td align="center">43 test chemicals (panel of 43)</td>
<td align="center">Multilayer perceptron</td>
<td align="center">&#x223c;1,826 descriptors (Mordred)</td>
<td align="center">Accuracy &#x3d; 96%, F1 &#x3d; 0.97 for best MLP configuration</td>
</tr>
<tr>
<td align="center">Deeb and Drabh, 2010 [<xref ref-type="bibr" rid="B17">17</xref>]</td>
<td align="center">Analgesic compounds (QSAR regression)</td>
<td align="center">95 heterogeneous analgesic compounds</td>
<td align="center">Principal-component ANN (PC-ANN)</td>
<td align="center">PCA of multiple descriptor families (reduced PC inputs)</td>
<td align="center">Reported robust regression models with good predictive ability</td>
</tr>
<tr>
<td align="center">Traor&#xe9; et al., 2019 [<xref ref-type="bibr" rid="B18">18</xref>]</td>
<td align="center">Tri-substituted pyrimidine derivatives &#x2014; analgesic activity</td>
<td align="center">20 compounds</td>
<td align="center">Multiple linear regression (MLR) (6-variable model)</td>
<td align="center">Selected physicochemical descriptors</td>
<td align="center">
<bold>R &#x3d; 0.93</bold>, MAE &#x2248; 0.003, Q<sup>2</sup>_CV &#x2248; 0.90 (MLR fit on small congeneric set)</td>
</tr>
<tr>
<td align="center">This work (present study)</td>
<td align="center">Analgesic activity &#x2014; regression &#x2b; classification</td>
<td align="center">(Dataset used in this study; chemically diverse set)</td>
<td align="center">ANN, 3 hidden layers (tansig) &#x2b; linear output; trained with levenberg&#x2013;Marquardt</td>
<td align="center">105 selected descriptors after stepwise elimination</td>
<td align="center">R &#x3d; 0.959, ERAM &#x3d; 0.433%, classification accuracy &#x3d; 98.01% (excellent regression and classification metrics; fast LM convergence)</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>The results obtained demonstrate the capacity of neural network&#x2013;based QSAR modeling to predict analgesic activity with high precision while supporting compound prioritization in early drug discovery. In line with current advances in cheminformatics, the combination of descriptor selection and neural network optimization improves predictive performance while maintaining a degree of interpretability necessary for medicinal chemistry applications [<xref ref-type="bibr" rid="B6">6</xref>, <xref ref-type="bibr" rid="B7">7</xref>].</p>
<p>Feature elimination techniques, including stepwise and recursive strategies, effectively reduced descriptor dimensionality while preserving relevant structural information governing bioactivity [<xref ref-type="bibr" rid="B8">8</xref>]. Consistent with recent studies, integrating descriptor reduction with neural architectures improves model stability and predictive performance in pharmacological QSAR modeling [<xref ref-type="bibr" rid="B9">9</xref>, <xref ref-type="bibr" rid="B10">10</xref>]. In the present study, reduction from 227 to 105 descriptors significantly limited redundancy and multicollinearity, contributing to improved generalization capability.</p>
<p>The use of the Levenberg&#x2013;Marquardt (LM) optimization algorithm facilitated rapid convergence and reduced training error, consistent with prior studies employing LM-ANN frameworks in physicochemical and pharmacokinetic predictions [<xref ref-type="bibr" rid="B11">11</xref>, <xref ref-type="bibr" rid="B12">12</xref>]. Compared with classical gradient-based training, the LM approach enables smoother error minimization and more stable weight optimization within nonlinear solution spaces.</p>
<p>The observed generalization performance across chemically diverse compounds suggests that the model is suitable for supporting virtual screening campaigns aimed at prioritizing candidate molecules [<xref ref-type="bibr" rid="B13">13</xref>, <xref ref-type="bibr" rid="B14">14</xref>]. The combined regression and classification validation strategy further aligns with contemporary modeling trends that integrate potency prediction and categorical activity assessment within unified computational frameworks [<xref ref-type="bibr" rid="B19">19</xref>, <xref ref-type="bibr" rid="B20">20</xref>].</p>
<p>The descriptor analysis has been expanded to more clearly relate the selected features to chemical determinants of analgesic activity. Descriptor classes retained in the optimized model, including molecular connectivity indices, polarity-related parameters, hydrogen bonding capacity, and lipophilicity descriptors, are known to influence key factors such as receptor accessibility, membrane permeability, and ligand&#x2013;target interaction potential, thereby contributing to variations in analgesic potency [<xref ref-type="bibr" rid="B21">21</xref>&#x2013;<xref ref-type="bibr" rid="B23">23</xref>].</p>
<p>These relationships provide medicinal chemistry insight, allowing the model to function not merely as a predictive engine but also as a supportive tool for rational compound optimization. Nevertheless, due to the nonlinear nature of neural networks, complete mechanistic interpretability remains limited, and future incorporation of explainable AI methods or fragment-level attribution analysis may further enhance structural insight.</p>
<p>Comparative evidence indicates that neural network approaches coupled with careful descriptor selection and stable optimization strategies frequently outperform classical QSAR approaches such as MLR or PLS when applied to chemically diverse datasets [<xref ref-type="bibr" rid="B15">15</xref>, <xref ref-type="bibr" rid="B16">16</xref>]. However, it is also recognized that modern ensemble machine-learning methods, including Random Forest and gradient boosting models, can sometimes achieve competitive performance on moderate-sized datasets while offering improved interpretability. Although benchmarking against such models was beyond the scope of the present study, this limitation is now acknowledged, and future comparative evaluation is planned to better position the proposed framework within the broader machine-learning landscape. Despite this, the present model achieves high regression fidelity (R &#x3d; 0.959) and strong classification performance (98.01%), placing it among the best-performing approaches reported for similar datasets while maintaining descriptor-based interpretability [<xref ref-type="bibr" rid="B15">15</xref>&#x2013;<xref ref-type="bibr" rid="B18">18</xref>].</p>
<p>It is important to note that the present work remains computational and does not replace experimental validation. Instead, the framework is intended to reduce experimental burden by prioritizing compounds before laboratory testing. Prospective validation on newly designed or synthesized compounds represents an important next step and is proposed as future work to strengthen translational applicability.</p>
<p>Finally, this optimized ANN-based QSAR framework supports the principles of computationally efficient and sustainable drug discovery by reducing dependence on exhaustive laboratory screening. Similar approaches have been recommended to promote eco-efficient pharmacological innovation, minimizing experimental waste while enabling broader chemical space exploration [<xref ref-type="bibr" rid="B24">24</xref>, <xref ref-type="bibr" rid="B25">25</xref>]. Overall, the results indicate that data-driven modeling can effectively complement experimental workflows and accelerate early pharmacotherapeutic discovery while remaining compatible with translational drug development strategies.</p>
<sec id="s4-1">
<title>Conclusion</title>
<p>This study successfully developed and optimized an Artificial Neural Network (ANN) framework for the accurate prediction of analgesic activity, achieving excellent regression (R &#x3d; 0.959) and classification (accuracy &#x3d; 98.01%) performance metrics. By integrating a rigorous descriptor reduction strategy with systematic ANN topology optimization, the proposed model effectively balanced predictive accuracy, interpretability, and computational efficiency. The resulting framework outperformed traditional QSAR and machine learning methods highlighting the superiority of deep ANN architectures in capturing nonlinear structure&#x2013;activity relationships.</p>
<p>The inclusion of 105 carefully selected molecular descriptors enhanced both model transparency and robustness, establishing direct links between topological, electronic, and physicochemical features and their biological relevance. This balance between interpretability and performance underscores the growing potential of AI-assisted modeling to accelerate rational drug discovery and virtual screening in pharmacology.</p>
<p>Beyond methodological advances, this work contributes to the broader objectives of green and sustainable drug development by reducing experimental dependence and resource consumption. Future perspectives include extending the optimized ANN framework to other pharmacological classes, implementing transfer learning for multitarget predictions, and integrating explainable AI tools to further elucidate mechanistic insights.</p>
<p>Overall, the proposed hybrid feature selection&#x2013;ANN modeling strategy represents a reliable, interpretable, and eco-efficient approach to predictive pharmacology, paving the way for more intelligent, data-driven discovery of next-generation analgesics.</p>
</sec>
</sec>
</body>
<back>
<sec sec-type="data-availability" id="s5">
<title>Data availability statement</title>
<p>The datasets presented in this article are not readily available because the dataset used in this study incorporates compound structures, activity values, and molecular descriptors obtained under license and contractual agreements that prohibit redistribution. Due to intellectual property and data-use restrictions, the raw and processed datasets cannot be shared externally. Access is therefore limited to authorized users under the original licensing terms. Requests to access the datasets should be directed to rezezi.sa@gmail.com.</p>
</sec>
<sec sec-type="author-contributions" id="s6">
<title>Author contributions</title>
<p>All authors listed have made a substantial, direct, and intellectual contribution to the work and approved it for publication.</p>
</sec>
<ack>
<title>Acknowledgements</title>
<p>The authors gratefully acknowledge the support of their research laboratory (Laboratory of Biomaterials and Transport Phenomena, University of Medea) and colleagues for valuable discussions and technical assistance throughout this study. Special thanks are extended to the academic supervisors and collaborators who provided constructive feedback on experimental design and interpretation.</p>
</ack>
<sec sec-type="COI-statement" id="s8">
<title>Conflict of interest</title>
<p>The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="ai-statement" id="s9">
<title>Generative AI statement</title>
<p>The author(s) declared that generative AI was not used in the creation of this manuscript.</p>
<p>Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<label>1.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ekins</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Puhl</surname>
<given-names>AC</given-names>
</name>
<name>
<surname>Zorn</surname>
<given-names>KM</given-names>
</name>
<name>
<surname>Lane</surname>
<given-names>TR</given-names>
</name>
</person-group>. <article-title>Artificial intelligence for drug discovery: are we there yet?</article-title> <source>Front Genet</source> (<year>2024</year>) <volume>15</volume>:<fpage>1450529</fpage>. <pub-id pub-id-type="doi">10.1146/annurev-pharmtox-040323-040828</pub-id>
<pub-id pub-id-type="pmid">39290983</pub-id>
</mixed-citation>
</ref>
<ref id="B2">
<label>2.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Tan</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Zheng</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>L</given-names>
</name>
</person-group>. <article-title>A comprehensive review of artificial intelligence for pharmacology research</article-title>. <source>Front Genet</source> (<year>2024</year>) <volume>15</volume>:<fpage>1450529</fpage>. <pub-id pub-id-type="doi">10.3389/fgene.2024.1450529</pub-id>
<pub-id pub-id-type="pmid">39290983</pub-id>
</mixed-citation>
</ref>
<ref id="B3">
<label>3.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ma</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Sheridan</surname>
<given-names>RP</given-names>
</name>
<name>
<surname>Liaw</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Dahl</surname>
<given-names>GE</given-names>
</name>
<name>
<surname>Svetnik</surname>
<given-names>V</given-names>
</name>
</person-group>. <article-title>Deep neural nets as a method for quantitative structure&#x2013;activity relationships</article-title>. <source>J Chem Inf Model</source> (<year>2015</year>) <volume>55</volume>(<issue>2</issue>):<fpage>263</fpage>&#x2013;<lpage>274</lpage>. <pub-id pub-id-type="doi">10.1021/ci500747n</pub-id>
<pub-id pub-id-type="pmid">25635324</pub-id>
</mixed-citation>
</ref>
<ref id="B4">
<label>4.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>J</given-names>
</name>
</person-group>. <article-title>Deep learning approaches for drug discovery and design</article-title>. <source>Front Chem</source> (<year>2024</year>) <volume>12</volume>:<fpage>1408740</fpage>. <pub-id pub-id-type="doi">10.3389/fchem.2024.1408740</pub-id>
<pub-id pub-id-type="pmid">38882215</pub-id>
</mixed-citation>
</ref>
<ref id="B5">
<label>5.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vamathevan</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Clark</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Czodrowski</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Dunham</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Ferran</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>G</given-names>
</name>
<etal/>
</person-group> <article-title>Applications of machine learning in drug discovery and development</article-title>. <source>Nat Rev Drug Discov</source> (<year>2019</year>) <volume>18</volume>(<issue>6</issue>):<fpage>463</fpage>&#x2013;<lpage>477</lpage>. <pub-id pub-id-type="doi">10.1038/s41573-019-0024-5</pub-id>
<pub-id pub-id-type="pmid">30976107</pub-id>
</mixed-citation>
</ref>
<ref id="B6">
<label>6.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cherkasov</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Muratov</surname>
<given-names>EN</given-names>
</name>
<name>
<surname>Fourches</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Varnek</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Baskin</surname>
<given-names>II</given-names>
</name>
<name>
<surname>Cronin</surname>
<given-names>M</given-names>
</name>
<etal/>
</person-group> <article-title>QSAR modeling: where have you been? where are you going to?</article-title> <source>J Med Chem</source> (<year>2014</year>) <volume>57</volume>(<issue>12</issue>):<fpage>4977</fpage>&#x2013;<lpage>5010</lpage>. <pub-id pub-id-type="doi">10.1021/jm4004285</pub-id>
<pub-id pub-id-type="pmid">24351051</pub-id>
</mixed-citation>
</ref>
<ref id="B7">
<label>7.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Svetnik</surname>
<given-names>V</given-names>
</name>
<name>
<surname>Liaw</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Tong</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Culberson</surname>
<given-names>JC</given-names>
</name>
<name>
<surname>Sheridan</surname>
<given-names>RP</given-names>
</name>
<name>
<surname>Feuston</surname>
<given-names>BP</given-names>
</name>
</person-group>. <article-title>Random forest: a classification and regression tool for compound classification and QSAR modeling</article-title>. <source>J Chem Inf Comput Sci</source> (<year>2003</year>) <volume>43</volume>(<issue>6</issue>):<fpage>1947</fpage>&#x2013;<lpage>1958</lpage>. <pub-id pub-id-type="doi">10.1021/ci034160g</pub-id>
<pub-id pub-id-type="pmid">14632445</pub-id>
</mixed-citation>
</ref>
<ref id="B8">
<label>8.</label>
<mixed-citation publication-type="book">
<person-group person-group-type="author">
<name>
<surname>Dablander</surname>
<given-names>M</given-names>
</name>
</person-group>. <source>Investigating graph neural networks and classical feature-extraction techniques in activity-cliff and molecular property prediction</source> (<year>2024</year>) <fpage>2411</fpage>. <pub-id pub-id-type="doi">10.48550/arXiv.2411.13688</pub-id>
</mixed-citation>
</ref>
<ref id="B9">
<label>9.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mayr</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Klambauer</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Unterthiner</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Hochreiter</surname>
<given-names>S</given-names>
</name>
</person-group>. <article-title>DeepTox: toxicity prediction using deep learning</article-title>. <source>Front Environ Sci</source> (<year>2016</year>) <volume>3</volume>:<fpage>80</fpage>. <pub-id pub-id-type="doi">10.3389/fenvs.2015.00080</pub-id>
</mixed-citation>
</ref>
<ref id="B10">
<label>10.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hinton</surname>
<given-names>GE</given-names>
</name>
<name>
<surname>Salakhutdinov</surname>
<given-names>RR</given-names>
</name>
</person-group>. <article-title>Reducing the dimensionality of data with neural networks</article-title>. <source>Science</source> (<year>2006</year>) <volume>313</volume>(<issue>5786</issue>):<fpage>504</fpage>&#x2013;<lpage>507</lpage>. <pub-id pub-id-type="doi">10.1126/science.1127647</pub-id>
<pub-id pub-id-type="pmid">16873662</pub-id>
</mixed-citation>
</ref>
<ref id="B11">
<label>11.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shi</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Ni</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Yuan</surname>
<given-names>X</given-names>
</name>
</person-group>. <article-title>Machine learning-driven QSAR modeling for predicting short-term exposure limits of hydrocarbons and their derivatives</article-title>. <source>Processes</source>. (<year>2025</year>) <volume>13</volume>(<issue>12</issue>):<fpage>4025</fpage>. <pub-id pub-id-type="doi">10.3390/pr13124025</pub-id>
</mixed-citation>
</ref>
<ref id="B12">
<label>12.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Melville</surname>
<given-names>JL</given-names>
</name>
<name>
<surname>Burke</surname>
<given-names>EK</given-names>
</name>
<name>
<surname>Hirst</surname>
<given-names>JD</given-names>
</name>
</person-group>. <article-title>Machine learning in virtual screening</article-title>. <source>Comb Chem High Throughput Screen</source> (<year>2009</year>) <volume>12</volume>(<issue>4</issue>):<fpage>332</fpage>&#x2013;<lpage>343</lpage>. <pub-id pub-id-type="doi">10.2174/138620709788167980</pub-id>
<pub-id pub-id-type="pmid">19442063</pub-id>
</mixed-citation>
</ref>
<ref id="B13">
<label>13.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lavecchia</surname>
<given-names>A</given-names>
</name>
</person-group>. <article-title>Deep learning in drug discovery: opportunities, challenges and future prospects</article-title>. <source>Drug Discov Today</source> (<year>2019</year>) <volume>24</volume>(<issue>10</issue>):<fpage>2017</fpage>&#x2013;<lpage>2032</lpage>. <pub-id pub-id-type="doi">10.1016/j.drudis.2019.07.006</pub-id>
<pub-id pub-id-type="pmid">31377227</pub-id>
</mixed-citation>
</ref>
<ref id="B14">
<label>14.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lenselink</surname>
<given-names>EB</given-names>
</name>
<name>
<surname>ten Dijke</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Bongers</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Papadatos</surname>
<given-names>G</given-names>
</name>
<name>
<surname>van Vlijmen</surname>
<given-names>HWT</given-names>
</name>
<name>
<surname>Kowalczyk</surname>
<given-names>W</given-names>
</name>
<etal/>
</person-group> <article-title>Beyond the hype: deep neural networks outperform established methods using a ChEMBL bioactivity benchmark set</article-title>. <source>J Cheminform</source> (<year>2017</year>) <volume>9</volume>:<fpage>45</fpage>. <pub-id pub-id-type="doi">10.1186/s13321-017-0232-0</pub-id>
<pub-id pub-id-type="pmid">29086168</pub-id>
</mixed-citation>
</ref>
<ref id="B15">
<label>15.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tsou</surname>
<given-names>LK</given-names>
</name>
<name>
<surname>Yeh</surname>
<given-names>S-H</given-names>
</name>
<name>
<surname>Ueng</surname>
<given-names>S-H</given-names>
</name>
<name>
<surname>Chang</surname>
<given-names>C-P</given-names>
</name>
<name>
<surname>Song</surname>
<given-names>J-S</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>M-H</given-names>
</name>
<etal/>
</person-group> <article-title>Comparative study between deep learning and QSAR classifications for TNBC inhibitors and novel GPCR agonist discovery</article-title>. <source>Scientific Rep</source> (<year>2020</year>) <volume>10</volume>(<issue>1</issue>):<fpage>16771</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-020-73681-1</pub-id>
<pub-id pub-id-type="pmid">33033310</pub-id>
</mixed-citation>
</ref>
<ref id="B16">
<label>16.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>JY</given-names>
</name>
<name>
<surname>Peeples</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Sayes</surname>
<given-names>CM</given-names>
</name>
</person-group>. <article-title>Evaluation of machine learning-based QSAR models for the classification of lung surfactant inhibitors</article-title>. <source>Environ Health</source> (<year>2024</year>) <volume>2</volume>(<issue>12</issue>):<fpage>912</fpage>&#x2013;<lpage>917</lpage>. <pub-id pub-id-type="doi">10.1021/envhealth.4c00118</pub-id>
<pub-id pub-id-type="pmid">39722839</pub-id>
</mixed-citation>
</ref>
<ref id="B17">
<label>17.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Deeb</surname>
<given-names>O</given-names>
</name>
<name>
<surname>Drabh</surname>
<given-names>M</given-names>
</name>
</person-group>. <article-title>Exploring QSARs of some analgesic compounds by PC-ANN</article-title>. <source>Chem Biol Drug Des</source> (<year>2010</year>) <volume>76</volume>(<issue>3</issue>):<fpage>255</fpage>&#x2013;<lpage>262</lpage>. <pub-id pub-id-type="doi">10.1111/j.1747-0285.2010.01004.x</pub-id>
<pub-id pub-id-type="pmid">20626409</pub-id>
</mixed-citation>
</ref>
<ref id="B18">
<label>18.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kon&#xe9;</surname>
<given-names>GR</given-names>
</name>
<name>
<surname>Traore</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Ouattara</surname>
<given-names>O</given-names>
</name>
<name>
<surname>Ziao</surname>
<given-names>N</given-names>
</name>
</person-group>. <article-title>QSAR approach to estimating the analgesic activity of a series of tri-substituted pyrimidine derivatives</article-title>. <source>SDRP J Comput Chem Mol Model</source> (<year>2019</year>) <volume>3</volume>:<fpage>1</fpage>&#x2013;<lpage>10</lpage>. <pub-id pub-id-type="doi">10.25177/jccmm.3.1.2</pub-id>
</mixed-citation>
</ref>
<ref id="B19">
<label>19.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>L</given-names>
</name>
</person-group>. <article-title>Computational model for drug research</article-title>. <source>Brief Bioinform</source> (<year>2024</year>) <volume>25</volume>(<issue>3</issue>). <pub-id pub-id-type="doi">10.1093/bib/bbae158</pub-id>
<pub-id pub-id-type="pmid">38581423</pub-id>
</mixed-citation>
</ref>
<ref id="B20">
<label>20.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xu</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>G</given-names>
</name>
</person-group>. <article-title>Integrated classification&#x2013;regression deep models for pharmacological prediction</article-title>. <source>Bioinform Adv</source> (<year>2022</year>) <volume>2</volume>(<issue>1</issue>):<fpage>vbab072</fpage>. <pub-id pub-id-type="doi">10.1093/bioadv/vbab072</pub-id>
</mixed-citation>
</ref>
<ref id="B21">
<label>21.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lipinski</surname>
<given-names>CA</given-names>
</name>
<name>
<surname>Lombardo</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Dominy</surname>
<given-names>BW</given-names>
</name>
<name>
<surname>Feeney</surname>
<given-names>PJ</given-names>
</name>
</person-group>. <article-title>Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings</article-title>. <source>Adv Drug Deliv Rev</source> (<year>2001</year>) <volume>46</volume>(<issue>1&#x2013;3</issue>):<fpage>3</fpage>&#x2013;<lpage>26</lpage>. <pub-id pub-id-type="doi">10.1016/S0169-409X(00)00129-0</pub-id>
<pub-id pub-id-type="pmid">11259830</pub-id>
</mixed-citation>
</ref>
<ref id="B22">
<label>22.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Engkvist</surname>
<given-names>O</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Olivecrona</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Blaschke</surname>
<given-names>T</given-names>
</name>
</person-group>. <article-title>The rise of deep learning in drug discovery</article-title>. <source>Drug Discov Today</source> (<year>2018</year>) <volume>23</volume>(<issue>6</issue>):<fpage>1241</fpage>&#x2013;<lpage>1250</lpage>. <pub-id pub-id-type="doi">10.1016/j.drudis.2018.01.039</pub-id>
<pub-id pub-id-type="pmid">29366762</pub-id>
</mixed-citation>
</ref>
<ref id="B23">
<label>23.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Polishchuk</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Tinkov</surname>
<given-names>O</given-names>
</name>
<name>
<surname>Khristova</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Ognichenko</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Kosinskaya</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Varnek</surname>
<given-names>A</given-names>
</name>
<etal/>
</person-group> <article-title>Structural and physico-chemical interpretation (SPCI) of QSAR models and its comparison with matched molecular pair analysis</article-title>. <source>J Chem Inf Model</source> (<year>2016</year>) <volume>56</volume>(<issue>8</issue>):<fpage>1455</fpage>&#x2013;<lpage>1469</lpage>. <pub-id pub-id-type="doi">10.1021/acs.jcim.6b00371</pub-id>
<pub-id pub-id-type="pmid">27419846</pub-id>
</mixed-citation>
</ref>
<ref id="B24">
<label>24.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kumar</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Maurya</surname>
<given-names>A</given-names>
</name>
</person-group>. <article-title>Green chemistry techniques for sustainable pharmaceutical synthesis</article-title>. <source>J Drug Discov Health Sci</source> (<year>2024</year>) <volume>1</volume>:<fpage>187</fpage>&#x2013;<lpage>200</lpage>. <pub-id pub-id-type="doi">10.21590/jddhs.01.04.02</pub-id>
</mixed-citation>
</ref>
<ref id="B25">
<label>25.</label>
<mixed-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Walters</surname>
<given-names>WP</given-names>
</name>
<name>
<surname>Murcko</surname>
<given-names>M</given-names>
</name>
</person-group>. <article-title>Assessing the impact of generative AI on medicinal chemistry</article-title>. <source>Nat Biotechnol</source> (<year>2020</year>) <volume>38</volume>(<issue>2</issue>):<fpage>143</fpage>&#x2013;<lpage>145</lpage>. <pub-id pub-id-type="doi">10.1038/s41587-020-0418-2</pub-id>
<pub-id pub-id-type="pmid">32001834</pub-id>
</mixed-citation>
</ref>
</ref-list>
<fn-group>
<fn fn-type="custom" custom-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/2185996/overview">Cristian Sandoval</ext-link>, University of La Frontera, Chile</p>
</fn>
</fn-group>
</back>
</article>