Logo Kérwá
 

Prediction of toluene/water partition coefficient in the SAMPL9 blind challenge: assessment of machine learning and IEF-PCM/MST continuum solvation models

dc.creatorZamora Ramírez, William J.
dc.creatorViayna Gaza, Antonio
dc.creatorPinheiro, Silvana De Souza
dc.creatorCurutchet, Carles
dc.creatorBisbal, Laia
dc.creatorRuiz, Rebeca
dc.creatorRàfols Pérez, Clara
dc.creatorLuque del Villar, Francisco Javier
dc.date.accessioned2026-01-09T16:43:52Z
dc.date.issued2023-03-30
dc.description.abstractIn recent years the use of partition systems other than the widely used biphasic n-octanol/water has received increased attention to gain insight into the molecular features that dictate the lipophilicity of compounds. Thus, the difference between n-octanol/water and toluene/water partition coefficients has proven to be a valuable descriptor to study the propensity of molecules to form intramolecular hydrogen bonds and exhibit chameleon-like properties that modulate solubility and permeability. In this context, this study reports the experimental toluene/water partition coefficients (logPtol/w) for a series of 16 drugs that were selected as an external test set in the framework of the Statistical Assessment of the Modeling of Proteins and Ligands (SAMPL) blind challenge. This external set has been used by the computational community to calibrate their methods in the current edition (SAMPL9) of this contest. Furthermore, the study also investigates the performance of two computational strategies for the prediction of logPtol/w. The first relies on the development of two machine learning (ML) models, which are built up by combining the selection of 11 molecular descriptors in conjunction with either multiple linear regression (MLR) and random forest regression (RFR) models to target a dataset of 252 experimental logPtol/w values. The second consists of the parametrization of the IEF-PCM/MST continuum solvation model from B3LYP/6-31G(d) calculations to predict the solvation free energies of 163 compounds in toluene and benzene. The performance of the ML and IEF-PCM/MST models has been calibrated against external test sets, including the compounds that define the SAMPL9 logPtol/w challenge. The results are used to discuss the merits and weaknesses of the two computational approaches.
dc.description.procedenceUCR::Vicerrectoría de Docencia::Ciencias Básicas::Facultad de Ciencias::Escuela de Química
dc.description.procedenceUCR::Vicerrectoría de Investigación::Unidades de Investigación::Ciencias de la Salud::Laboratorio de Ensayos Biológicos (LEBI)
dc.description.sponsorshipUniversidad de Costa Rica/[115-C2126]/UCR/Costa Rica
dc.description.sponsorshipUniversidad de Costa Rica/[908-C3610]/UCR/Costa Rica
dc.identifier.codproyecto115-C2126
dc.identifier.codproyecto908-C3610
dc.identifier.doihttps://doi.org/10.26434/chemrxiv-2023-fg64s
dc.identifier.issn2573-2293
dc.identifier.urihttps://hdl.handle.net/10669/103566
dc.language.isoeng
dc.rightsacceso abierto
dc.sourcePhysical Chemistry 25(27)
dc.subjectSAMPL9
dc.subjectToluene/Water Partition Coefficient
dc.subjectHydrophobicity
dc.subjectMachine Learning
dc.subjectContinuum Solvation Models
dc.subjectmultiple linear regression
dc.subjectrandom forest regression
dc.subjectIEF-PCM/MST
dc.subjectlogP
dc.titlePrediction of toluene/water partition coefficient in the SAMPL9 blind challenge: assessment of machine learning and IEF-PCM/MST continuum solvation models
dc.typeartículo preliminar

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
prediction-of-toluene-water-partition-coefficient-in-the-sampl9-blind-challenge-assessment-of-machine-learning-and-ief-pcm-mst-continuum-solvation-models.pdf
Size:
3.61 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
3.5 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections