QSRR modeling of the chromatographic retention behavior of some quinolone and sulfonamide antibacterial agents using firefly algorithm coupled to support vector machine

Fouad, Marwa A.; Serag, Ahmed; Tolba, Enas H.; El-Shal, Manal A.; El Kerdawy, Ahmed M.

doi:10.1186/s13065-022-00874-2

Research
Open access
Published: 03 November 2022

Volume 16, article number 85, (2022)
BMC Chemistry

Abstract

Quinolones and sulfonamides are two classes of antibacterial agents with a rich medicinal chemistry history; their structural features shape their antibacterial spectrum, efficacy, pharmacokinetics, and adverse effect profiles. The urgent need for these drugs, combined with the escalating rate of resistance to them, necessitates suitable analytical methods that accelerate and facilitate their analysis. In this study, the firefly algorithm (FFA) coupled with support vector regression (SVR) was used to select the most significant descriptors and to construct two quantitative structure-retention relationship (QSRR) models, one for a series of 11 quinolones and one for 13 sulfonamides, to predict their retention behavior in HPLC. Specifically, the effect of the mobile phase pH on the retention of the quinolones and the effect of the acetonitrile content of the mobile phase on the retention of the sulfonamides were studied. The obtained QSRR models performed well in both internal and external validation, demonstrating their robustness and predictive ability. Y-randomization validation showed that the models did not arise from chance correlation. Moreover, the results shed light on the molecular features that influence the retention behavior of these two classes under the current chromatographic conditions.

Introduction

Antibacterial resistance is a major public health concern worldwide, owing primarily to the uncontrolled use of antibacterial agents, particularly in countries lacking standard treatment guidelines [1]. Among these agents, fluoroquinolones, a fluoro-substituted series derived from nalidixic acid, showed an escalating rate of resistance after dominating therapeutic practice for some time, particularly against gram-negative pathogens [2,3,4]. The use of such compounds and their abundance in the environment must therefore be carefully monitored. Consequently, from an analytical viewpoint, quick, simple, economical, and accurate methods for detecting and analyzing these drugs are essential.

A review of the literature revealed that quinolones have been determined extensively by high-performance liquid chromatography (HPLC) in various matrices, including biological fluids and tissues [5,6,7,8,9,10,11], milk and food of animal origin [12,13,14,15,16,17], marine products [18], honey [19], wastewater [20,21,22], and many pharmaceutical formulations [23,24,25,26,27,28]. Furthermore, the relationship between the retention factors and lipophilicity of quinolones has been analyzed using RP-TLC [29,30,31] and HPLC on human serum albumin and α1-acid glycoprotein stationary phases [32]. Additionally, Wu et al. [33] investigated the retention factor-activity relationship of some quinolones using micellar chromatography.

Sulfonamides are another class of synthetic antimicrobial agents to which resistance is unfortunately widespread, so they are now used infrequently as antibacterials. However, the application of sulfonamides has expanded beyond their original indication as antimicrobial agents to other medical uses, including anticancer, antiglaucoma, cyclooxygenase-2 (COX-2) and lipoxygenase inhibitory, anticonvulsant, and hypoglycemic activities [34]. Regarding the analytical tools used for their detection, a review of the literature revealed that reversed-phase liquid chromatography also dominates the determination of this class [35,36,37,38]. Regarding their retention mechanisms, Cazenave-Gassiot et al. [39] studied the correlation between the sulfonamides’ retention factors and the proportion of the organic modifier in the mobile phase using supercritical fluid chromatography. However, the separation behavior of this class in reversed-phase liquid chromatography still needs to be investigated.

Among the various models and theories used to describe the retention of analytes in reversed-phase liquid chromatography, the quantitative structure-retention relationship (QSRR) is useful not only for elucidating how analytes differ in their retention but also for predicting their retention in chromatographic systems relatively well [40, 41]. This approach provides a powerful alternative to conventional trial and error, with significant savings in experiment time and cost.

These mathematical models correlate the chemical structures of compounds, represented by their descriptors, with their retention data in various chromatographic systems. The number of molecular descriptors that can be obtained for a single analyte is enormous, with some software capable of calculating up to 5000 descriptors per analyte [42]. Such high dimensionality, together with the incorporation of some nonempirical features, could affect the performance of QSRR models. Consequently, feature selection (variable selection) methods are necessary to determine which descriptors are important for the retention of the compounds of interest. These methods range from classical ones like forward selection and backward elimination to advanced nature-inspired ones like the genetic algorithm (GA), particle swarm optimization (PSO), and the firefly, flower pollination, grasshopper, and ant colony algorithms [43,44,45,46,47,48,49,50,51,52].

Furthermore, various chemometric and machine learning algorithms, such as partial least squares (PLS), multiple linear regression (MLR), artificial neural networks (ANNs), and support vector regression (SVR), have proven effective in building reliable QSAR and QSRR models owing to their ability to extract the maximal chemical information while capturing the possible relationship between the chemical structure and the target property of interest [53,54,55]. QSRR models have been applied to various chemical families in reversed-phase liquid chromatography, such as non-steroidal anti-inflammatory drugs [56], azole antifungal agents [57], and some analgesics [58].

Support vector regression (SVR), a machine learning algorithm, was first reported by Vapnik, Chervonenkis, and colleagues [59]. The algorithm identifies a linear function that explains most of the variation in the response while capturing the nonlinear relationship between the input and the target data [60]. Compared to conventional regression and neural network algorithms, SVR has several advantages, including good generalization ability, global optimization, and dimensional independence [61]. Because it can model possible nonlinear relationships between molecular descriptors and retention time, it has been used in developing powerful QSRR models [62, 63].

QSRR models can be classified into local and universal models: local models focus on a specific class of chemical compounds, whereas universal models handle diverse classes in chemical space. Owing to their specificity, local models perform better than universal models, which are characterized by generality [64, 65].

Previously, our group developed two QSRR models that captured the retention behavior of some β-lactam antibiotics using MLR models combined with the forward or firefly variable selection algorithms [55]. Continuing our study of the QSRR modeling of antibacterial agents, this work investigates the quantitative structure-retention relationship in the quinolone and sulfonamide antibacterial classes, highlighting their reversed-phase chromatographic retention mechanisms. We further investigate the quinolones’ retention behavior with respect to their different ionization states and the organic modifier percentage, as well as the sulfonamides’ retention behavior with respect to the organic modifier percentage. Because of the complexity of the generated data, an advanced variable selection technique coupled with a machine learning algorithm was needed. Consequently, the firefly algorithm coupled with SVR was used to develop the target QSRR models. The obtained models were then assessed for their predictive ability using strict validation criteria, so that they could be used to predict the retention behavior of potential degradation products and even metabolites of these compounds.

Experimental

Solvents, chemicals, sample preparation, and instrumentation

The quinolones (Fig. S1) and sulfonamides (Fig. S2) under investigation were supplied by different pharmaceutical companies. HPLC-grade acetonitrile, methanol, and dimethylsulfoxide were supplied by Scharlau (Barcelona, Spain). The other chemicals used in this study, including ortho-phosphoric acid, trifluoroacetic acid, sodium dihydrogen orthophosphate, and sodium hydroxide, were supplied by Honeywell Riedel-de Haën (Seelze, Germany).

The instruments used in this study were a Jenway 3510 pH meter (Essex, UK) equipped with a glass electrode and an Agilent 1260 series HPLC-UV system.

A stock solution of each drug (2 mg mL⁻¹) was prepared in a suitable solvent (methanol, dimethylsulfoxide, water, or acetonitrile). These solutions were stored at 4 °C and diluted with the mobile phase to sample concentrations of 0.05–1 mg mL⁻¹ before analysis.

Chromatographic conditions

The quinolones were eluted on an Inertsil® C18 column (250 mm × 4.6 mm, 5 μm) and detected at 275 nm. Five mobile phases were prepared according to the experimental plan and run in gradient mode as programmed in Table 1, using acetonitrile and 28 mM sodium dihydrogen orthophosphate buffer adjusted to pH 2.2, 3.5, 5.2, 6.5, or 8.2 with ortho-phosphoric acid or sodium hydroxide. The pH was measured again after mixing the buffer with acetonitrile and was found to be 3.2, 4.4, 5.9, 7.32, and 8.9, respectively. The flow rate was set to 1 mL min⁻¹. After each injection, the system was reconditioned by returning to the initial ratio and holding it for 3 min. Data acquisition was performed using the Agilent LC ChemStation software.

Table 1 Gradient elution system used in quinolone separation

Sulfonamides were separated on a Hypersil C18 column (150 mm × 4.6 mm, 5 μm) using isocratic elution with a mobile phase consisting of acetonitrile and water acidified with trifluoroacetic acid (1 mL L⁻¹) in ratios of 50:50, 45:55, or 30:70 (v/v) at a flow rate of 0.8 mL min⁻¹. A ratio of 15:85 (v/v) was initially included but was not considered further because many compounds were strongly retained on the column. The analyses were performed at ambient temperature, with detection at 270 nm. Data acquisition was performed using the Agilent LC ChemStation software.

QSRR modeling

Drawing structures and molecular descriptors calculation and filtration

The major microspecies of the studied quinolones at each pH of interest were estimated using MarvinSketch (6.0.3) [66], generating 21 ions. The canonical SMILES of these ions were imported into the Molecular Operating Environment (MOE, 2020.0901) software, where they were converted into 3D structures and energy-minimized to an RMSD gradient of 0.05 kcal mol⁻¹ Å⁻¹ with the MMFF94x force field. The partial charges were calculated automatically. Finally, MOE molecular mechanical descriptors were computed for all compounds, generating a descriptor pool of 313 descriptors. This initial pool was reduced by removing zero-value and constant descriptors, leaving 293 descriptors.

For the sulfonamides, canonical SMILES were taken from the PubChem database [67, 68] and imported into MOE, where they were converted to 3D structures and energy-minimized using the same parameters as for the quinolones. Afterward, MOE molecular mechanical descriptors were computed for all compounds, generating a descriptor pool of 313 descriptors. This initial pool was reduced by removing zero-value and constant descriptors, leaving 112 descriptors; in addition, the acetonitrile percentage was incorporated as a descriptor.
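The filtration step described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the descriptor table is a plain dictionary, the function name `filter_descriptors` and the tolerance `tol` are hypothetical, and the toy values are invented rather than taken from the MOE output.

```python
from typing import Dict, List


def filter_descriptors(table: Dict[str, List[float]],
                       tol: float = 1e-12) -> Dict[str, List[float]]:
    """Drop descriptors that are constant across all compounds.

    An all-zero column is a special case of a constant column, so a single
    range check covers both filters mentioned in the text.
    """
    kept = {}
    for name, values in table.items():
        if max(values) - min(values) <= tol:
            continue  # no variation across compounds: carries no information
        kept[name] = values
    return kept


# Toy descriptor table (invented values; "zero_desc" and "const_desc" are hypothetical).
toy_table = {
    "SMR": [8.1, 9.4, 7.7],
    "zero_desc": [0.0, 0.0, 0.0],
    "const_desc": [1.5, 1.5, 1.5],
}
filtered = filter_descriptors(toy_table)
print(sorted(filtered))
```

Only descriptors that vary across the compounds survive, which is the property the later variable selection step relies on.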

Training set and test set generation

The 21 major quinolone microspecies were divided into a calibration (training) set of 16 molecules and a test set of five molecules. For the sulfonamides, measuring the 13 compounds at three mobile phase ratios yielded a total of 39 experimental retention factors, which were used to build the QSRR model. These observations were split into a training set of 30 observations and an external validation test set of nine observations. The calibration and validation compounds of quinolones and sulfonamides were selected so that both sets maintained the same retention factor value distribution.
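One simple way to keep the retention factor distribution similar in both sets is to rank the observations by k and pick test samples at evenly spaced ranks. The sketch below assumes this ranking scheme; the text does not state the exact procedure the authors used, and the function name `split_by_response` is hypothetical.

```python
from typing import List, Sequence, Tuple


def split_by_response(k_values: Sequence[float],
                      n_test: int) -> Tuple[List[int], List[int]]:
    """Return (train_idx, test_idx) with test samples spread over the k range."""
    order = sorted(range(len(k_values)), key=lambda i: k_values[i])
    n = len(order)
    # Take the sample at the centre of each of n_test equal slices of the ranking.
    picks = sorted({int((j + 0.5) * n / n_test) for j in range(n_test)})
    test_set = {order[p] for p in picks}
    train_idx = sorted(i for i in range(n) if i not in test_set)
    return train_idx, sorted(test_set)


# Toy example with 21 invented retention factors, mirroring the 16/5 quinolone split.
toy_k = [0.1 * i for i in range(21)]
train_idx, test_idx = split_by_response(toy_k, n_test=5)
print(len(train_idx), len(test_idx))
```

Because the test samples come from the low, middle, and high parts of the ranking, neither set is confined to a narrow range of retention factors.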

Descriptor selection and modeling

The linearity of the datasets was tested using augmented partial residual plots (APARP) together with the Durbin–Watson (DW) test [69,70,71]. The test was conducted using a custom script written in MATLAB (R2016a) [72, 73]. The descriptors that survived the initial filtration were then used to build the QSRR models. The firefly algorithm, an advanced nature-inspired algorithm, was implemented in MATLAB and used for descriptor selection, with the RMSE_CV of the SVR model serving as the fitness function for both datasets. The selected descriptors were then used to build the final SVR model. The algorithm’s parameters were optimized by varying them over intervals in specific increments; in each optimization iteration, one parameter was varied while the others were held constant.

Model validation

Model validation was performed to evaluate the reliability, robustness, and applicability of the generated models. In the current study, the models were validated both internally and externally, and potential chance correlation was tested using Y-scrambling, a method commonly used for this purpose.

Internal validation was conducted using leave-one-out cross-validation (CV_LOO) for the quinolone QSRR model and leave-10%-out cross-validation (CV_L10%O) for the sulfonamide QSRR model. External validation was conducted by applying the obtained QSRR models to an external validation set of five quinolone microspecies and nine sulfonamide observations. The statistical quality of the models was assessed by calculating the root mean square error (RMSE) of prediction and the coefficient of determination.
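The leave-one-out procedure and the two quality metrics can be sketched as follows. The model used here is an ordinary least-squares line on one descriptor, a deliberate stand-in for the SVR models of the study; all function names and data values are illustrative assumptions.

```python
import math
from typing import Callable, List


def rmse(y_true: List[float], y_pred: List[float]) -> float:
    """Root mean square error between observed and predicted values."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def r_squared(y_true: List[float], y_pred: List[float]) -> float:
    """Coefficient of determination, 1 - SS_res / SS_tot."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def loo_predictions(x: List[float], y: List[float],
                    fit_predict: Callable[[List[float], List[float], float], float]) -> List[float]:
    """Predict each sample from a model trained on all the other samples."""
    return [fit_predict(x[:i] + x[i + 1:], y[:i] + y[i + 1:], x[i])
            for i in range(len(y))]


def line_fit_predict(x_train: List[float], y_train: List[float], x_new: float) -> float:
    """Stand-in model: least-squares line (NOT the SVR used in the study)."""
    n = len(x_train)
    mx, my = sum(x_train) / n, sum(y_train) / n
    slope = (sum((a - mx) * (b - my) for a, b in zip(x_train, y_train))
             / sum((a - mx) ** 2 for a in x_train))
    return my + slope * (x_new - mx)


# Invented toy data: one descriptor and a roughly linear response.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
y = [1.1, 1.9, 3.2, 3.9, 5.1, 5.8]
y_loo = loo_predictions(x, y, line_fit_predict)
print(rmse(y, y_loo), r_squared(y, y_loo))
```

Applying `r_squared` to the cross-validated predictions gives a q²-type statistic; leave-10%-out follows the same pattern with several samples held out per round instead of one.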

In Y-randomization validation for the two datasets, the compounds’ output retention factors were shuffled randomly, whereas the compounds’ descriptors remained unscrambled. The resulting datasets were used to build FFA-SVR models using the same protocol as the original models, and the correlation and predictive ability of the resulting models were determined. The entire procedure was repeated 100 times for both datasets.

Hotelling’s T² and Williams plot methods were used to determine the applicability domains (AD) of the developed models, as described in our previous work [55].

Results and discussion

Optimization of the FFA and SVR parameters for developing the QSRR models

The firefly algorithm (FFA) was used as a feature selection method to find the relevant descriptors for building reliable QSRR models. The algorithm parameters were first optimized for proper descriptor selection. As in our previous study [55], the RMSE_CV computed by the SVR model was used as the fitness function to evaluate the models’ performance. A critical FFA parameter is the absorption coefficient “γ”: because it regulates the light intensity, and hence the fireflies’ attractiveness, it has a significant impact on the speed of convergence and the overall behavior of the algorithm. Another important parameter is “α”, which introduces random movements that keep the search from getting stuck in local optima. Finally, the exploration phase of the FFA was controlled by the number of fireflies used, whereas the exploitation phase was controlled by the number of generations. The adjusted FFA parameters obtained through combinatorial optimization are presented in Table 2.
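To make the roles of γ, α, the number of fireflies, and the number of generations concrete, here is a minimal binary firefly sketch for feature selection. It is not the authors’ MATLAB implementation: the thresholding of continuous positions at 0.5, the default parameter values, and the toy fitness function (a stand-in for the SVR RMSE_CV) are all assumptions made for illustration.

```python
import math
import random
from typing import Callable, List, Tuple


def to_mask(position: List[float]) -> List[bool]:
    """A feature counts as selected when its coordinate exceeds 0.5."""
    return [v > 0.5 for v in position]


def binary_firefly(n_features: int, fitness: Callable[[List[bool]], float],
                   n_fireflies: int = 10, n_generations: int = 30,
                   gamma: float = 1.0, alpha: float = 0.2, beta0: float = 1.0,
                   seed: int = 0) -> Tuple[List[bool], float]:
    """Minimise fitness(mask) over feature masks; returns (best_mask, best_cost)."""
    rng = random.Random(seed)
    pos = [[rng.random() for _ in range(n_features)] for _ in range(n_fireflies)]
    cost = [fitness(to_mask(p)) for p in pos]
    best = min(range(n_fireflies), key=lambda i: cost[i])
    best_mask, best_cost = to_mask(pos[best]), cost[best]
    for _ in range(n_generations):
        for i in range(n_fireflies):
            for j in range(n_fireflies):
                if cost[j] < cost[i]:  # firefly j is "brighter" (lower error)
                    dist2 = sum((a - b) ** 2 for a, b in zip(pos[i], pos[j]))
                    # Attractiveness decays with distance; gamma sets how fast.
                    beta = beta0 * math.exp(-gamma * dist2)
                    # Move i toward j, plus an alpha-scaled random step.
                    pos[i] = [min(1.0, max(0.0, a + beta * (b - a)
                                           + alpha * (rng.random() - 0.5)))
                              for a, b in zip(pos[i], pos[j])]
                    cost[i] = fitness(to_mask(pos[i]))
                    if cost[i] < best_cost:
                        best_mask, best_cost = to_mask(pos[i]), cost[i]
    return best_mask, best_cost


# Toy fitness: distance to a hidden "informative" subset (stand-in for RMSE_CV).
hidden = [True, False, True, False, False, True]


def toy_fitness(mask: List[bool]) -> float:
    return float(sum(m != h for m, h in zip(mask, hidden)))


mask, cost = binary_firefly(len(hidden), toy_fitness)
print(mask, cost)
```

In a real QSRR workflow, `toy_fitness` would be replaced by a function that trains the regression model on the selected descriptors and returns its cross-validated RMSE.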

Table 2 Parameters of the firefly algorithm used for variable selection in QSRR modeling

Concerning SVR, different kernel types were assessed as basis function expansions, including polynomial, radial basis function (RBF), and sigmoid kernels. The kernel function was first examined by evaluating the performance of the developed FFA-SVR models, and the RBF was selected as the best kernel for modeling the nonlinearity of the data. The RBF kernel parameter regulates the width of the Gaussian function and influences the SVR’s generalization ability. Furthermore, two parameters that determine the quality of the SVR model were optimized: the error penalty (C), which controls the trade-off between the complexity of the decision rule and the frequency of error, and the parameter of the ɛ-insensitive loss function (ɛ), a precision factor expressing the radius of the tube placed around the regression function f(x). To optimize these parameters, their values were varied systematically in the training step via CV_LOO and CV_L10%O for quinolones and sulfonamides, respectively, while the models’ RMSE_CV was monitored. The SVR was first trained with different ɛ values with C fixed at 1; once the optimal ɛ was found, C was optimized in turn. The best models were obtained with the RBF kernel, C = 1, and ɛ = 0.01 for both datasets. The final FFA-SVR models were used to predict the retention factors of the test set molecules for quinolones and sulfonamides, respectively.
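The two building blocks named above, the RBF kernel and the ɛ-insensitive loss, are easy to write down directly. The sketch below uses their standard textbook definitions; note that the kernel parameter `gamma` here is the usual RBF width parameter and is unrelated to the FFA absorption coefficient γ. The value of `gamma` is an assumption, since the text reports only C and ɛ.

```python
import math
from typing import Sequence


def rbf_kernel(x: Sequence[float], z: Sequence[float], gamma: float) -> float:
    """Gaussian (RBF) kernel: exp(-gamma * ||x - z||^2)."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(x, z)))


def eps_insensitive_loss(y_true: float, y_pred: float, epsilon: float = 0.01) -> float:
    """Errors inside the epsilon tube cost nothing; outside it, cost grows linearly."""
    return max(0.0, abs(y_true - y_pred) - epsilon)


# Identical points have kernel value 1; the value falls toward 0 with distance.
print(rbf_kernel([0.0, 0.0], [0.0, 0.0], gamma=0.5))
print(rbf_kernel([0.0, 0.0], [3.0, 4.0], gamma=0.5))

# With epsilon = 0.01 (the value reported for both datasets), a prediction
# error of 0.005 is ignored, while larger errors are penalised.
print(eps_insensitive_loss(1.0, 1.005))
print(eps_insensitive_loss(1.0, 1.5))
```

A small ɛ such as 0.01 makes the tube narrow, so nearly every training error contributes to the objective, while C weights those contributions against model flatness.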

QSRR modeling of quinolones in their different ionization states

To elucidate the chromatographic behavior of the quinolones studied, it is important to first understand the relationship between the mobile phase pH and the ionization state of each compound (Fig. S3). Some compounds behave as expected from their ionization state. For example, moxifloxacin exists as a (polar) cation at acidic pHs (2.2 and 3.5) but as a neutral (hydrophobic) compound at higher pHs (6.5 and 8.2), which rationalizes its higher retention factor at the higher pHs. Ciprofloxacin, lomefloxacin, and norfloxacin exist in different ionization states at pHs 5.2 and 8.2, which explains the fluctuation in their retention factors over these pHs. Nadifloxacin exists as a neutral compound at acidic pHs (2.2, 3.5, and 5.2), which explains its higher retention factors at these pH values, whereas at pH 8.2 it exists as an anion, resulting in rapid elution and a lower retention factor. On the other hand, ofloxacin and danofloxacin behave differently: their cationic forms, present at acidic pHs (2.2 and 3.5), show lower retention factors, whereas their anionic forms, present at higher pHs (6.5 and 8.2), show higher retention factors. Additionally, gatifloxacin and gemifloxacin show stable retention factors although they can exist in different ionization states across the pH range (2.2–8.2). The calculated retention factors of the eluted quinolones are presented in Table 3. (The raw retention times ± SD are listed in Table S1 in the supporting material.)
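For readers less familiar with the quantity being modeled, the retention factor is conventionally computed from the retention time and the column dead time as k = (t_R − t_0)/t_0. The sketch below assumes this usual definition; the text does not give the formula or the dead time used, so the numbers are invented.

```python
def retention_factor(t_r: float, t_0: float) -> float:
    """Retention factor k = (t_R - t_0) / t_0, with both times in the same unit."""
    if t_0 <= 0:
        raise ValueError("dead time t_0 must be positive")
    return (t_r - t_0) / t_0


# Hypothetical example: analyte elutes at 6.0 min on a column with a 1.5 min dead time.
print(retention_factor(6.0, 1.5))
```

Because k is normalized by the dead time, it is less sensitive than the raw retention time to column dimensions and flow rate.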

Table 3 List of quinolones’ chromatographic retention factors (k)*

Based on these observations, the behavior of quinolone compounds cannot be predicted from their ionization state alone, and a more in-depth analysis is needed to predict their behavior successfully. It is worth noting that, at a specific pH, a compound can exist in several ionization states in different proportions, making it difficult to predict the retention behavior based on a single microspecies. To address this issue, we selected the major microspecies as a representative of each molecule at the given pH, while avoiding selecting the same microspecies at different pHs or assigning different retention factors to the same ionization state. This approach allows a simple, interpretable QSRR model to be derived that can predict the retention factors of quinolones in their various ionization states.

The first step in generating the quinolone QSRR model was to check the linearity of the data. Consequently, augmented partial residual plots (APARP) and the DW test were used to examine the residuals’ correlation [69,70,71]. The associated probability was 0.045 (< 0.05), indicating the significance of the test and the nonlinearity of the data; thus, nonlinear models such as ANN and SVR were tried for data modeling, with SVR yielding the best results.
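The Durbin–Watson statistic underlying this test measures serial correlation in an ordered sequence of residuals. The sketch below computes only the statistic from its standard definition; how the residuals are ordered and how the associated probability is obtained in the authors’ MATLAB script are not described in the text and are not reproduced here.

```python
from typing import Sequence


def durbin_watson(residuals: Sequence[float]) -> float:
    """DW = sum of squared successive differences / sum of squared residuals.

    Values near 2 suggest uncorrelated residuals; values near 0 or 4 suggest
    positive or negative serial correlation, respectively.
    """
    numerator = sum((residuals[i] - residuals[i - 1]) ** 2
                    for i in range(1, len(residuals)))
    denominator = sum(e * e for e in residuals)
    return numerator / denominator


# Invented residuals that drift smoothly, the pattern a linear fit leaves
# behind when the true relationship is curved.
curved = [0.4, 0.2, -0.1, -0.3, -0.2, 0.1, 0.3]
print(durbin_watson(curved))
```

Smoothly drifting residuals give a statistic well below 2, which is the signature of unmodeled curvature that motivates moving to a nonlinear model.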

Five descriptors were chosen by the FFA and combined in building the SVR model: SMR, GCUT_SLOGP_1, VSA, Vsurf_EWmin2, and Vsurf_IW6. SMR is a 2D descriptor of molecular refractivity, including implicit hydrogens [74]; it is calculated with an atomic contribution model that assumes the correct protonation state. GCUT_SLOGP_1 is a 2D descriptor that uses the atomic contribution to logP in place of the partial charge. VSA is a 3D descriptor related to the surface area, volume, and shape of molecules; it represents the van der Waals surface area [75]. Vsurf_EWmin2 is a 3D descriptor that represents the second lowest hydrophilic energy. Vsurf_IW6 is a 3D descriptor that represents the hydrophilic integy moment at (−4.0). Considering the selected descriptors, the model indicates that quinolone retention depends on molecular size and hydrophobic/hydrophilic nature, which is consistent with the main factors influencing retention in reversed-phase liquid chromatography.

Regarding the performance of the developed QSRR model for the quinolones, the agreement between the experimental and predicted retention factors demonstrates the model’s good predictive capability, as shown in Table 4. The proximity between the training set predictions and the cross-validation results indicates that the resulting QSRR model is robust and not overfitted. The results in Table 5 likewise demonstrate the good prediction capability of the obtained model. The correlations between the experimental and predicted retention factors for the training set, test set, and CV_LOO results are presented in the supporting material (Figs. S4 and S5). The Spearman rank correlation coefficient (ρ) was also calculated and found to be 0.976, 0.982, and 0.900 for the training set prediction (ρ_cal), CV_LOO (ρ_LOO), and the external test set (ρ_pred), respectively (Table 5). The closeness of ρ to 1 indicates that the generated model reproduces the experimental retention factor ranking with reasonable accuracy (Fig. 1).
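The Spearman coefficient is the Pearson correlation of the ranks of the two series. A dependency-free sketch follows; the function names are illustrative, ties receive average ranks, and the data are invented rather than taken from Table 4.

```python
import math
from typing import List, Sequence


def average_ranks(values: Sequence[float]) -> List[float]:
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        for k in range(i, j + 1):
            ranks[order[k]] = (i + j) / 2.0 + 1.0
        i = j + 1
    return ranks


def spearman_rho(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation computed on the ranks of x and y."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = math.sqrt(sum((a - mx) ** 2 for a in rx))
    sy = math.sqrt(sum((b - my) ** 2 for b in ry))
    return cov / (sx * sy)


# Five invented experimental/predicted pairs in which two neighbours swap order.
experimental = [0.5, 1.0, 1.5, 2.0, 2.5]
predicted = [0.6, 0.9, 1.4, 2.6, 2.2]
print(spearman_rho(experimental, predicted))
```

With only five observations, ρ can take few distinct values: a single swap of two adjacent ranks already lowers it from 1 to 0.9, which is worth keeping in mind when reading ρ for a five-member test set.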

Table 4 Experimental and predicted retention factors (k) of quinolone compounds in the training set, cross-validation, and test set prediction

Table 5 Quinolones and sulfonamides FFA-SVR model performance evaluation parameters

QSRR modeling of sulfonamides using different organic modifiers

QSRR modeling of sulfonamides was implemented to study the association between the retention factors of the examined compounds, eluted using different percentages of acetonitrile in the mobile phase (50%, 45%, and 30%; see Fig. S6), and their calculated constitutional, geometrical, physicochemical, and electronic descriptors (independent variables). The raw retention times ± SD and the calculated retention factors of the eluted sulfonamides are shown in Table S2 and Table 6, respectively.

The linearity of the data was first examined using the same procedure as for the quinolone dataset; the associated probability of 3.2 × 10⁻¹⁷ (< 0.05) indicated the nonlinearity of the data. The FFA-SVR approach was therefore used, and it selected two descriptors, in addition to the acetonitrile percentage, for building the QSRR model. The selected descriptors (Vsurf_D2 and Vsurf_W2) are 3D descriptors related to the molecular hydrophobic and hydrophilic volumes, respectively. The QSRR model indicates that, in addition to the influence of the third descriptor (the acetonitrile percentage in the mobile phase), the retention of the sulfonamide analytes depends on their hydrophobic/hydrophilic nature, a common factor that plays an important role in the differential elution of analytes in reversed-phase liquid chromatography.

Table 6 List of sulfonamides chromatographic retention factors (k) *

The results also demonstrate the good prediction capability of the obtained model, as shown in Tables 5 and 7. The correlations between the experimental and predicted retention factors for the training and test sets are presented in the supporting material (Fig. S7), and those for the CV_L10%O results in Fig. S8, indicating the good correlation and generalizability of the developed sulfonamide QSRR model. The Spearman rank correlation coefficient (ρ) was calculated for the training set prediction (ρ_cal), CV_L10%O (ρ_L10%O), and the external test set (ρ_pred) and was found to be 0.988, 0.941, and 0.883, respectively (Fig. 2). The proximity of ρ to 1 indicates that the generated model reproduces the experimental retention factor ranking of the compounds under investigation with reasonable accuracy.

Furthermore, the residual plots for both classes show the differences between the predicted and the experimental retention factors (residuals) for the various compounds. The random dispersion of the residuals around the horizontal axis confirmed the models’ prediction ability (Figs. S9 and S10 in the supporting material). The prediction accuracy of the generated local models is acceptable and comparable to that of generalized universal models [64, 65].

Table 7 Experimental and predicted retention factors (k) of sulfonamide compounds in the training set, cross-validation, and test set

Y-scrambling validation

The Y-randomization (permutation) test is another criterion used to validate the findings of this study, especially given the small number of observations, to ensure that the obtained models reflect a true correlation between the selected descriptors and the target retention factors rather than statistical chance. The original QSRR model is expected to be significant if there is a solid link between the selected descriptors and the original response variables. Y-randomization was repeated 100 times; if the statistical attributes of the randomized models are significantly lower than those of the original one, it can be concluded that the model is sensible and was not obtained by chance. The equation below was used to evaluate the quality of the models obtained from the 100 randomized matrices and to compare it with the quality of the original model. ^cR_p² should be above 0.5 to ensure that the original model was not obtained by chance [76].

$${}^{c}R_{p}^{2} = R \times \sqrt{R^{2} - R_{y}^{2}}$$

where R² is the squared correlation coefficient of the original model and R_y² is the average squared correlation coefficient of the randomized models; ^cR_p² thus reflects how far the R² of the original model lies above that of the randomized models.
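The quantities in this equation can be computed with a short script. In the sketch below, the squared Pearson correlation of a single descriptor stands in for the R² of a full FFA-SVR model, the data are invented, and clamping a negative difference to zero is a choice of this sketch rather than something stated in the text.

```python
import math
import random
from typing import List, Sequence


def pearson_r2(x: Sequence[float], y: Sequence[float]) -> float:
    """Squared Pearson correlation, used here as a stand-in for model R^2."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return (sxy * sxy) / (sxx * syy)


def mean_scrambled_r2(x: Sequence[float], y: Sequence[float],
                      n_rounds: int = 100, seed: int = 0) -> float:
    """Average R^2 over n_rounds random permutations of y (descriptors untouched)."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_rounds):
        y_perm: List[float] = list(y)
        rng.shuffle(y_perm)
        total += pearson_r2(x, y_perm)
    return total / n_rounds


def c_rp2(r2_model: float, r2_random_mean: float) -> float:
    """cRp^2 = R * sqrt(R^2 - R_y^2); a negative difference is clamped to 0 (assumption)."""
    return math.sqrt(r2_model) * math.sqrt(max(0.0, r2_model - r2_random_mean))


# Invented toy data with a strong descriptor-response relationship.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
y = [0.9, 2.1, 2.9, 4.2, 4.8, 6.1, 7.2, 7.9]
r2 = pearson_r2(x, y)
r2_y = mean_scrambled_r2(x, y)
print(round(r2, 3), round(r2_y, 3), round(c_rp2(r2, r2_y), 3))
```

In the full protocol, each scrambled round would rerun descriptor selection and model fitting rather than reuse a fixed descriptor, as the text describes for the FFA-SVR models.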

The statistical parameters of the scrambled models clustered symmetrically around zero for both datasets (Fig. 3), indicating that the scrambled models are of extremely low quality. The ^cR_p² values calculated for cross-validation were 0.687 and 0.791 (above 0.5) for the quinolone and sulfonamide QSRR models, respectively, which rules out chance correlation as the origin of the obtained models.

Applicability domain of both QSRR models

The applicability domain (AD) of a QSPR model is the structural, physicochemical, or biological space covered by the model’s training set, within which the model can make reliable predictions for new compounds. In the Williams plots of the FFA-SVR models, the AD is the square region bounded by ± 3 standard deviations of the standardized residuals and by a leverage threshold h* of 1.125 and 0.4 for quinolones and sulfonamides, respectively. Predictions are considered reliable only for compounds that fall within this AD. All compounds (training and test sets) fall within this region, with no outliers (Fig. 4).
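The two leverage thresholds quoted above are consistent with the commonly used rule h* = 3(p + 1)/n, where p is the number of model descriptors and n the number of training observations. The text does not state the formula, so the sketch below assumes it; the function names and the example leverage and residual values are hypothetical.

```python
def leverage_threshold(n_descriptors: int, n_training: int) -> float:
    """Warning leverage h* = 3 (p + 1) / n (assumed rule, not stated in the text)."""
    return 3.0 * (n_descriptors + 1) / n_training


def inside_domain(leverage: float, std_residual: float,
                  h_star: float, k: float = 3.0) -> bool:
    """A point is inside the AD if its leverage and standardized residual are in bounds."""
    return leverage <= h_star and abs(std_residual) <= k


# Quinolones: 5 descriptors, 16 training microspecies.
print(leverage_threshold(5, 16))
# Sulfonamides: 2 descriptors plus acetonitrile percentage, 30 training observations.
print(leverage_threshold(3, 30))

# Hypothetical compound with leverage 0.25 and standardized residual -1.2.
print(inside_domain(0.25, -1.2, leverage_threshold(3, 30)))
```

Under this assumption, the formula gives 1.125 and 0.4 for the two training sets, matching the values reported for the Williams plots.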

Conclusion

Two QSRR models were generated for predicting the retention behavior of quinolones and sulfonamides in HPLC. The influence of the mobile phase pH on the ionization state, and hence the retention factor, of each quinolone, as well as the effect of the acetonitrile content of the mobile phase on the retention factors of sulfonamides, were investigated, resulting in datasets of 21 major quinolone microspecies and 39 sulfonamide observations (13 compounds at three mobile phase compositions). In both classes, significant descriptors related to retention behavior in the chromatographic system were selected using the FFA and then used to build the QSRR models with the SVR algorithm. The two models performed well at both the training and the validation levels. For quinolones, the coefficients of determination of the training set prediction (R²_cal), CV_LOO (q²_LOO), and the external test set (R²_pred) were 0.931 (R²_adjusted = 0.926), 0.808, and 0.879, respectively, with RMSE values of 0.114, 0.163, and 0.148, respectively. For sulfonamides, the coefficients of determination of the training set prediction (R²_cal), CV_L10%O (q²_L10%O), and the external test set (R²_pred) were 0.900 (R²_adjusted = 0.896), 0.812, and 0.820, respectively, with RMSE values of 0.240, 0.450, and 0.328, respectively. In the Y-randomization validation test, the two models had ^cR_p² values of 0.687 and 0.791 for quinolones and sulfonamides, respectively, indicating that both models are significant and were not obtained by chance.

Data availability

All data generated or analyzed during this study are included in this published article and its supplementary information files.

Acknowledgments

Not applicable.

Funding

No funding received.

Open access funding provided by The Science, Technology & Innovation Funding Authority (STDF) in cooperation with The Egyptian Knowledge Bank (EKB).

Author information

Authors and Affiliations

Pharmaceutical Chemistry Department, Faculty of Pharmacy, Cairo University, Kasr El-Aini St, P.O. Box 11562, Cairo, Egypt
Marwa A. Fouad & Ahmed M. El Kerdawy
Department of Pharmaceutical Chemistry, School of Pharmacy, Newgiza University (NGU), Newgiza, km 22 Cairo–Alexandria Desert Road, Cairo, Egypt
Marwa A. Fouad
Pharmaceutical Analytical Chemistry Department, Faculty of Pharmacy, Al-Azhar University, 11751, Cairo, Egypt
Ahmed Serag
Egyptian Drug Authority (Former National Organization for Drug Control and Research), Cairo, Egypt
Enas H. Tolba & Manal A. El-Shal

Contributions

Marwa A. Fouad: Conceptualization, Writing - Review & Editing, Supervision. Enas H. Tolba: Methodology, Investigation, Writing - Original Draft. Manal A. El-Shal: Supervision. Ahmed Serag: Methodology, Software, Writing - Original Draft. Ahmed M. El Kerdawy: Conceptualization, Methodology, Software, Writing - Review & Editing.

Corresponding author

Correspondence to Marwa A. Fouad.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors have no competing interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Supplementary Material 2

Supplementary Material 3

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

About this article

Cite this article

Fouad, M.A., Serag, A., Tolba, E.H. et al. QSRR modeling of the chromatographic retention behavior of some quinolone and sulfonamide antibacterial agents using firefly algorithm coupled to support vector machine. BMC Chemistry 16, 85 (2022). https://doi.org/10.1186/s13065-022-00874-2

Received: 03 May 2022
Accepted: 04 October 2022
Published: 03 November 2022
DOI: https://doi.org/10.1186/s13065-022-00874-2
