Chemical language models enable de novo drug design without the requirement for explicit molecular construction rules. While such models have been applied to generate novel compounds with desired bioactivity, the actual prioritization and selection of the most promising computational designs remains challenging. Herein, we leveraged the probabilities learnt by chemical language models with the beam search algorithm as a model-intrinsic technique for automated molecule design and scoring. Prospective application of this method yielded novel inverse agonists of retinoic acid receptor-related orphan receptors (RORs). Each design was synthesizable in three reaction steps and presented low-micromolar to nanomolar potency towards RORγ. This model-intrinsic sampling technique eliminates the strict need for external compound scoring functions, thereby further extending the applicability of generative artificial intelligence to data-driven drug discovery.
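The abstract above describes using beam search over the token probabilities of a chemical language model to generate and rank designs at the same time. A minimal sketch of that idea follows. It assumes a toy first-order model (`TOY_MODEL`, a hypothetical table of next-token probabilities with made-up values) in place of a trained network, and it is not the authors' implementation. The cumulative log-probability of each finished string doubles as its score, so no external scoring function is needed.

```python
import math

# Hypothetical toy "language model": maps the previous token to a
# probability distribution over the next token. A real chemical language
# model would condition on the whole prefix; the values here are invented.
TOY_MODEL = {
    "^": {"C": 0.6, "N": 0.3, "O": 0.1},   # "^" marks the start of a string
    "C": {"C": 0.5, "O": 0.3, "$": 0.2},   # "$" marks the end of a string
    "N": {"C": 0.7, "$": 0.3},
    "O": {"C": 0.4, "$": 0.6},
}


def beam_search(model, width=3, max_len=6, start="^", end="$"):
    """Return finished strings with their log-probabilities, best first."""
    beams = [([start], 0.0)]  # (tokens so far, cumulative log-probability)
    finished = []
    for _ in range(max_len):
        # Extend every open beam by every possible next token.
        candidates = []
        for tokens, score in beams:
            for tok, p in model[tokens[-1]].items():
                candidates.append((tokens + [tok], score + math.log(p)))
        # Keep only the `width` most probable extensions.
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = []
        for tokens, score in candidates[:width]:
            if tokens[-1] == end:
                # Strip the start and end markers before reporting.
                finished.append(("".join(tokens[1:-1]), score))
            else:
                beams.append((tokens, score))
        if not beams:
            break
    return sorted(finished, key=lambda c: c[1], reverse=True)


if __name__ == "__main__":
    for design, log_p in beam_search(TOY_MODEL):
        print(f"{design}\tlog p = {log_p:.3f}")
```

The beam width trades search breadth against cost: a wider beam keeps more partial strings alive per step and so explores more of the model's probability mass.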
Background: Threonine Aspartase 1 (Taspase1) mediates cleavage of the mixed lineage leukemia (MLL) protein and leukemia-provoking MLL fusions. In contrast to other proteases, the understanding of Taspase1's (patho)biological relevance and function is limited, since neither small-molecule inhibitors nor cell-based functional assays for Taspase1 are currently available. Methodology/Findings: Efficient cell-based assays to probe Taspase1 function in vivo are presented here. The biosensors are composed of glutathione S-transferase, autofluorescent protein variants, Taspase1 cleavage sites and rational combinations of nuclear import and export signals. They localize predominantly to the cytoplasm, whereas expression of biologically active Taspase1, but not of inactive Taspase1 mutants or of the protease Caspase3, triggers their proteolytic cleavage and nuclear accumulation. Compared to in vitro assays using recombinant components, the in vivo assay was highly efficient. Employing an optimized nuclear translocation algorithm, the triple-color assay could be adapted to a high-throughput microscopy platform (Z'-factor = 0.63). Automated high-content data analysis was used to screen a focused compound library, selected by an in silico pharmacophore screening approach, as well as a collection of fungal extracts. Screening identified two compounds, N-[2-[(4-amino-6-oxo-3H-pyrimidin-2-yl)sulfanyl]ethyl]benzenesulfonamide and 2-benzyltriazole-4,5-dicarboxylic acid, which partially inhibited Taspase1 cleavage in living cells. Additionally, the assay was exploited to probe endogenous Taspase1 in solid tumor cell models and to identify an improved consensus sequence for efficient Taspase1 cleavage. This allowed the in silico identification of novel putative Taspase1 targets, including the FERM Domain-Containing Protein 4B, the Tyrosine-Protein Phosphatase Zeta, and DNA Polymerase Zeta. Cleavage site recognition and proteolytic processing of these substrates were verified in the context of the biosensor. Conclusions: The assay not only allows genetic probing of Taspase1 structure-function relationships in vivo, but is also applicable to high-content screening for Taspase1 inhibitors. Such tools will provide novel insights into Taspase1's function and its potential therapeutic relevance.
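The Z'-factor quoted in the abstract above is a standard measure of how well a screening assay separates its positive and negative controls. The abstract does not state how the value was computed, so the sketch below assumes the conventional definition, Z' = 1 - 3(σ_pos + σ_neg) / |μ_pos - μ_neg|, with sample standard deviations. The function name `z_prime` and the control readings are illustrative and are not data from the study.

```python
from statistics import mean, stdev


def z_prime(positive, negative):
    """Z'-factor from positive- and negative-control readings.

    Assumes the conventional definition:
        Z' = 1 - 3 * (sd_pos + sd_neg) / |mean_pos - mean_neg|
    """
    separation = abs(mean(positive) - mean(negative))
    if separation == 0:
        raise ValueError("control means are identical; Z' is undefined")
    return 1.0 - 3.0 * (stdev(positive) + stdev(negative)) / separation


if __name__ == "__main__":
    # Invented control readings, for illustration only.
    pos_controls = [10.0, 12.0, 14.0]
    neg_controls = [100.0, 102.0, 104.0]
    print(f"Z' = {z_prime(pos_controls, neg_controls):.3f}")
```

By common convention, values between 0.5 and 1 are taken to indicate an assay with enough separation for high-throughput use.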
We present a computational method for the reaction-based de novo design of drug-like molecules. The software DOGS (Design of Genuine Structures) features a ligand-based strategy for automated ‘in silico’ assembly of potentially novel bioactive compounds. The quality of the designed compounds is assessed by a graph kernel method measuring their similarity to known bioactive reference ligands in terms of structural and pharmacophoric features. We implemented a deterministic compound construction procedure that explicitly considers compound synthesizability, based on a compilation of 25'144 readily available synthetic building blocks and 58 established reaction principles. This enables the software to suggest a synthesis route for each designed compound. Two prospective case studies are presented together with details on the algorithm and its implementation. De novo designed ligand candidates for the human histamine H4 receptor and γ-secretase were synthesized as suggested by the software. The computational approach proved to be suitable for scaffold-hopping from known ligands to novel chemotypes, and for generating bioactive molecules with drug-like properties.
Bacterial autotransporters represent a diverse family of proteins that autonomously translocate across the inner membrane of Gram-negative bacteria via the Sec complex and across the outer bacterial membrane. They often possess exceptionally long N-terminal signal sequences. We analyzed 90 long signal sequences of bacterial autotransporters and members of the two-partner secretion pathway in silico and describe a common domain organization found in 79 of these sequences. The domains are in agreement with previously published experimental data. Our algorithmic approach allows for the systematic identification of functionally different domains in long signal sequences. Keywords: bacterial autotransporter, sequence analysis, pattern, protein targeting, signal peptide, protein trafficking
Targeting signals direct proteins to their extra- or intracellular destination such as the plasma membrane or cellular organelles. Here we investigated the structure and function of exceptionally long signal peptides encompassing at least 40 amino acid residues. We discovered a two-domain organization ("NtraC model") in many long signals from vertebrate precursor proteins. Accordingly, long signal peptides may contain an N-terminal domain (N-domain) and a C-terminal domain (C-domain) with different signal or targeting capabilities, separable by a presumably turn-rich transition area (tra). Individual domain functions were probed by cellular targeting experiments with fusion proteins containing parts of the long signal peptide of human membrane protein shrew-1 and secreted alkaline phosphatase as a reporter protein. As predicted, the N-domain of the fusion protein alone was shown to act as a mitochondrial targeting signal, whereas the C-domain alone functions as an export signal. Selective disruption of the transition area in the signal peptide impairs the export efficiency of the reporter protein. Altogether, the results of cellular targeting studies provide a proof-of-principle for our NtraC model and highlight the particular functional importance of the predicted transition area, which critically affects the rate of protein export. In conclusion, the NtraC approach enables the systematic detection and prediction of cryptic targeting signals present in one coherent sequence, and provides a structurally motivated basis for decoding the functional complexity of long protein targeting signals.
A new method to bridge the gap between ligand-based and receptor-based methods in virtual screening (VS) is presented. We introduce a structure-derived virtual ligand (VL) model as an extension to a previously published pseudo-ligand technique [1]: LIQUID [2] fuzzy pharmacophore virtual screening is combined with grid-based protein binding site predictions of PocketPicker [3]. This approach might help reduce bias introduced by manual selection of binding site residues, and it adds pocket shape information to the VL. It allows for a combination of several protein structure models into a single "fuzzy" VL representation, which can be used to scan screening compound collections for ligand structures with a similar potential pharmacophore. PocketPicker employs an elaborate grid-based scanning procedure to determine buried cavities and depressions on the protein's surface. Potential binding sites are represented by clusters of grid probes characterizing the shape and accessibility of a cavity. A rule-based system is then applied to project reverse pharmacophore types onto the grid probes of a selected pocket. The pocket pharmacophore types are assigned depending on the properties and geometry of the protein residues surrounding the pocket and on their position relative to the grid probes. LIQUID is used to cluster representative pocket probes by their pharmacophore types, yielding a fuzzy VL model. The VL is encoded in a correlation vector, which can then be compared to a database of pre-calculated ligand models. A retrospective screening using the fuzzy VL and several protein structures was evaluated by ten-fold cross-validation with ROC-AUC and BEDROC metrics, obtaining a significant enrichment of actives. Future work will be devoted to prospective screening using a novel protein target of Helicobacter pylori and compounds from commercial providers.
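The retrospective evaluation above reports ROC-AUC, which can be read as the probability that a randomly chosen active is ranked above a randomly chosen inactive. The sketch below computes it directly from that pairwise definition. The function name `roc_auc` and the example scores are illustrative assumptions, not taken from the study, and the quadratic loop is meant for clarity and not for large screening libraries.

```python
def roc_auc(scores, labels):
    """ROC-AUC as the fraction of (active, inactive) pairs ranked correctly.

    `scores` are similarity scores (higher means more ligand-like) and
    `labels` are 1 for actives and 0 for inactives. Ties count as half.
    """
    actives = [s for s, y in zip(scores, labels) if y == 1]
    inactives = [s for s, y in zip(scores, labels) if y == 0]
    if not actives or not inactives:
        raise ValueError("need at least one active and one inactive")
    wins = sum(
        1.0 if a > i else 0.5 if a == i else 0.0
        for a in actives
        for i in inactives
    )
    return wins / (len(actives) * len(inactives))


if __name__ == "__main__":
    # Invented similarity scores for five compounds, two of them active.
    example_scores = [0.9, 0.8, 0.7, 0.4, 0.3]
    example_labels = [1, 0, 1, 0, 0]
    print(f"ROC-AUC = {roc_auc(example_scores, example_labels):.3f}")
```

A value of 0.5 corresponds to random ranking and 1.0 to perfect separation. BEDROC, the second metric named in the abstract, additionally weights early recognition and is not covered by this sketch.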
An endowed professorship enables focused research in a specialized field and creates the freedom needed to try out new approaches. In particular, it can serve to build bridges between disciplines. With this aim, the Beilstein Endowed Professorship for Cheminformatics was established five years ago at the Johann Wolfgang Goethe-Universität. Funded by the Beilstein-Institut zur Förderung der Chemischen Wissenschaften, which is based in Frankfurt am Main, it was conceived in close cooperation with the Institut für Organische Chemie und Chemische Biologie under the leadership of Prof. Dr. Michael Göbel. After the five-year funding period ended in March 2007, the endowed professorship was seamlessly integrated into regular university operations. This is an occasion to take stock.
Protein kinases are targets for drug development. Dysregulation of kinase activity leads to various diseases, such as cancer, inflammation, and diabetes. Human polo-like kinase 1 (Plk1), a serine/threonine kinase, is a cancer-relevant gene and a potential drug target which attracts increasing attention in the field of cancer therapy. Plk1 is a key player in mitosis and modulates entry into mitosis and the spindle checkpoint at the meta-/anaphase transition. Plk1 overexpression is observed in various human tumors, and it is a negative prognostic factor for cancer patients. Because kinases share the same catalytic mechanism and the same co-substrate (ATP), inhibitor selectivity is a problem. One strategy to solve this problem is to target the inactive conformation of kinases. Kinases undergo conformational changes between an active and an inactive conformation, and in the inactive conformation an additional hydrophobic pocket is created where the surrounding amino acids are less conserved. A homology model of the inactive conformation of Plk1 was constructed, as the crystal structure of its inactive conformation is unknown. A crystal structure of Aurora A kinase served as the template structure. With this homology model, a receptor-based pharmacophore search was performed using SYBYL 7.3 software. The raw hits were filtered using physico-chemical properties. The resulting hits were docked using GOLD 3.2 software, and 13 candidates for biological testing were manually selected. Three of the 13 tested compounds exhibited anti-proliferative effects in HeLa cancer cells. The most potent inhibitor, SBE13, was further tested in various other cancer cell lines of different origins and displayed EC50 values between 12 µM and 39 µM. Cancer cells incubated with SBE13 showed induction of apoptosis, detected by PARP (Poly-Adenosyl-Ribose-Polymerase) cleavage, caspase 9 activation and DAPI staining of apoptotic nuclei.