Kernel learning for ligand-based virtual screening: discovery of a new PPARgamma agonist

  • Poster presentation at 5th German Conference on Cheminformatics: 23. CIC-Workshop Goslar, Germany. 8-10 November 2009 We demonstrate the theoretical and practical application of modern kernel-based machine learning methods to ligand-based virtual screening by successful prospective screening for novel agonists of the peroxisome proliferator-activated receptor gamma (PPARgamma) [1]. PPARgamma is a nuclear receptor involved in lipid and glucose metabolism, and related to type-2 diabetes and dyslipidemia. Applied methods included a graph kernel designed for molecular similarity analysis [2], kernel principle component analysis [3], multiple kernel learning [4], and, Gaussian process regression [5]. In the machine learning approach to ligand-based virtual screening, one uses the similarity principle [6] to identify potentially active compounds based on their similarity to known reference ligands. Kernel-based machine learning [7] uses the "kernel trick", a systematic approach to the derivation of non-linear versions of linear algorithms like separating hyperplanes and regression. Prerequisites for kernel learning are similarity measures with the mathematical property of positive semidefiniteness (kernels). The iterative similarity optimal assignment graph kernel (ISOAK) [2] is defined directly on the annotated structure graph, and was designed specifically for the comparison of small molecules. In our virtual screening study, its use improved results, e.g., in principle component analysis-based visualization and Gaussian process regression. Following a thorough retrospective validation using a data set of 176 published PPARgamma agonists [8], we screened a vendor library for novel agonists. Subsequent testing of 15 compounds in a cell-based transactivation assay [9] yielded four active compounds. The most interesting hit, a natural product derivative with cyclobutane scaffold, is a full selective PPARgamma agonist (EC50 = 10 ± 0.2 microM, inactive on PPARalpha and PPARbeta/delta at 10 microM). We demonstrate how the interplay of several modern kernel-based machine learning approaches can successfully improve ligand-based virtual screening results.

Download full text files

Export metadata

Additional Services

Share in Twitter Search Google Scholar
Metadaten
Author:Matthias Rupp, Timon Schroeter, Ramona Steri, Ewgenij ProschakORCiDGND, Katja Hansen, Heiko Zettl, Oliver Rau, Manfred Schubert-ZsilaveczGND, Klaus-Robert Müller, Gisbert SchneiderORCiDGND
URN:urn:nbn:de:hebis:30-77302
DOI:https://doi.org/10.1186/1758-2946-2-S1-P27
Parent Title (German):Journal of Cheminformatics
Publisher:BioMed Central
Place of publication:London
Document Type:Article
Language:English
Year of Completion:2010
Year of first Publication:2010
Publishing Institution:Universitätsbibliothek Johann Christian Senckenberg
Release Date:2010/05/19
Volume:2
Issue:(Suppl 1):P27
Note:
© 2010 Rupp et al; licensee BioMed Central Ltd.
Source:Journal of Cheminformatics 2010, 2(Suppl 1):P27 ; doi:10.1186/1758-2946-2-S1-P27 ; http://www.jcheminf.com/content/2/S1/P27
HeBIS-PPN:223693987
Institutes:Biochemie, Chemie und Pharmazie / Biochemie und Chemie
Biochemie, Chemie und Pharmazie / Pharmazie
Dewey Decimal Classification:5 Naturwissenschaften und Mathematik / 57 Biowissenschaften; Biologie / 570 Biowissenschaften; Biologie
Sammlungen:Sammlung Biologie / Sondersammelgebiets-Volltexte
Licence (German):License LogoDeutsches Urheberrecht