### Abstract

Purpose. A quantitative structure-property relationship (QSPR) was developed to predict drug solubility in binary mixtures of polyethylene glycol (PEG) 400 and water. The ability of the QSPR model to predict solubility was assessed and compared to the classic log-linear cosolvency model. Methods. The solubility of 122 drugs, ranging in log P from -2.4 to 7.5, was determined in 0%, 25%, 50%, and 75% PEG (v/v in water) by the shake-flask method. Solubility data from 84 drugs were fit by linear regression using the following molecular descriptors: molecular weight, volume, radius of gyration, density, number of rotatable bonds, hydrogen-bond donors, and hydrogen-bond acceptors. The multiple linear regression model was optimized by a genetic algorithm guided selection method. The remaining 38 compounds were used to test the predictability of the model. Results. QSPR-based models developed at each volume fraction with the training set compounds showed a reasonable correlation coefficient (r) of ∼0.9 and a root mean square (rms) error of <0.5 log unit. The model predicted solubility values of ∼78% of the testing set compounds within 1 log unit. The log-linear model was as effective as the QSPR-based model in predicting the testing set solubilities; however, many drugs, as expected, showed significant deviation from log-linearity. Conclusions. The QSPR model requires only the chemical structure of the drug and has utility for guiding vehicle identification for early preclinical in vivo studies, especially when compound availability is limited and experimental data such as aqueous solubility and melting point are unknown. When experimental data are available, the log-linear model was verified to be a useful predictive tool.

Original language | English (US) |
---|---|

Pages (from-to) | 237-244 |

Number of pages | 8 |

Journal | Pharmaceutical Research |

Volume | 21 |

Issue number | 2 |

DOIs | |

State | Published - Feb 2004 |

Externally published | Yes |

### Fingerprint

### Keywords

- Cosolvent
- in silico
- PEG 400
- Prediction
- QSPR model
- Solubility

### ASJC Scopus subject areas

- Chemistry(all)
- Pharmaceutical Science
- Pharmacology

### Cite this

*Pharmaceutical Research*,

*21*(2), 237-244. https://doi.org/10.1023/B:PHAM.0000016237.06815.7a

**A Quantitative Structure-Property Relationship for Predicting Drug Solubility in PEG 400/Water Cosolvent Systems.** / Rytting, Erik; Lentz, Kimberley A.; Chen, Xue Qing; Qian, Feng; Venkatesh, Srini.

Research output: Contribution to journal › Article

*Pharmaceutical Research*, vol. 21, no. 2, pp. 237-244. https://doi.org/10.1023/B:PHAM.0000016237.06815.7a

}

TY - JOUR

T1 - A Quantitative Structure-Property Relationship for Predicting Drug Solubility in PEG 400/Water Cosolvent Systems

AU - Rytting, Erik

AU - Lentz, Kimberley A.

AU - Chen, Xue Qing

AU - Qian, Feng

AU - Venkatesh, Srini

PY - 2004/2

Y1 - 2004/2

N2 - Purpose. A quantitative structure-property relationship (QSPR) was developed to predict drug solubility in binary mixtures of polyethylene glycol (PEG) 400 and water. The ability of the QSPR model to predict solubility was assessed and compared to the classic log-linear cosolvency model. Methods. The solubility of 122 drugs, ranging in log P from -2.4 to 7.5, was determined in 0%, 25%, 50%, and 75% PEG (v/v in water) by the shake-flask method. Solubility data from 84 drugs were fit by linear regression using the following molecular descriptors: molecular weight, volume, radius of gyration, density, number of rotatable bonds, hydrogen-bond donors, and hydrogen-bond acceptors. The multiple linear regression model was optimized by a genetic algorithm guided selection method. The remaining 38 compounds were used to test the predictability of the model. Results. QSPR-based models developed at each volume fraction with the training set compounds showed a reasonable correlation coefficient (r) of ∼0.9 and a root mean square (rms) error of <0.5 log unit. The model predicted solubility values of ∼78% of the testing set compounds within 1 log unit. The log-linear model was as effective as the QSPR-based model in predicting the testing set solubilities; however, many drugs, as expected, showed significant deviation from log-linearity. Conclusions. The QSPR model requires only the chemical structure of the drug and has utility for guiding vehicle identification for early preclinical in vivo studies, especially when compound availability is limited and experimental data such as aqueous solubility and melting point are unknown. When experimental data are available, the log-linear model was verified to be a useful predictive tool.

AB - Purpose. A quantitative structure-property relationship (QSPR) was developed to predict drug solubility in binary mixtures of polyethylene glycol (PEG) 400 and water. The ability of the QSPR model to predict solubility was assessed and compared to the classic log-linear cosolvency model. Methods. The solubility of 122 drugs, ranging in log P from -2.4 to 7.5, was determined in 0%, 25%, 50%, and 75% PEG (v/v in water) by the shake-flask method. Solubility data from 84 drugs were fit by linear regression using the following molecular descriptors: molecular weight, volume, radius of gyration, density, number of rotatable bonds, hydrogen-bond donors, and hydrogen-bond acceptors. The multiple linear regression model was optimized by a genetic algorithm guided selection method. The remaining 38 compounds were used to test the predictability of the model. Results. QSPR-based models developed at each volume fraction with the training set compounds showed a reasonable correlation coefficient (r) of ∼0.9 and a root mean square (rms) error of <0.5 log unit. The model predicted solubility values of ∼78% of the testing set compounds within 1 log unit. The log-linear model was as effective as the QSPR-based model in predicting the testing set solubilities; however, many drugs, as expected, showed significant deviation from log-linearity. Conclusions. The QSPR model requires only the chemical structure of the drug and has utility for guiding vehicle identification for early preclinical in vivo studies, especially when compound availability is limited and experimental data such as aqueous solubility and melting point are unknown. When experimental data are available, the log-linear model was verified to be a useful predictive tool.

KW - Cosolvent

KW - in silico

KW - PEG 400

KW - Prediction

KW - QSPR model

KW - Solubility

UR - http://www.scopus.com/inward/record.url?scp=1242314687&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=1242314687&partnerID=8YFLogxK

U2 - 10.1023/B:PHAM.0000016237.06815.7a

DO - 10.1023/B:PHAM.0000016237.06815.7a

M3 - Article

VL - 21

SP - 237

EP - 244

JO - Pharmaceutical Research

JF - Pharmaceutical Research

SN - 0724-8741

IS - 2

ER -