Doctoral Thesis
Refine
Year of publication
Document Type
- Doctoral Thesis (282) (remove)
Language
- English (282) (remove)
Has Fulltext
- yes (282)
Is part of the Bibliography
- no (282)
Keywords
- Membranproteine (6)
- NMR-Spektroskopie (5)
- RNA (4)
- ABC-Transporter (3)
- Biochemie (3)
- Crystallography (3)
- Kristallographie (3)
- Photosynthese (3)
- Proteinfaltung (3)
- Biochemistry (2)
Institute
- Biochemie und Chemie (282) (remove)
The retinoic acid related orphan receptor alpha (RORalpha) regulates the expression of various target genes by binding to specific response elements in their promoter region. RORalpha is an interesting pharmaceutical target since it positively affects several pathophysiological processes of clinical relevance. RORalpha enhances the expression of Apo-AI protein, the major constituent of HDL, which is responsible for the cholesterol transportation. RORalpha notably contributes to the bone mineralization and generation of the extracellular bone matrix, demonstrating its involvement in osteoporosis, and by up-regulating the gene for IKBalpha, RORalpha has anti-inflammatory effects. Moreover, RORalpha is necessary for cerebellar development and the maintenance of the mammalian day-night periodicity governed by the core-clock within the suprachiasmatic nuclei. RORalpha receptors have been reported to bind cholesterol, melatonin, or to function ligand-independent. By monomeric binding to the recognition motif AGGTCA preceded by an A/T-rich sequence (ROR response element, RORE), RORalpha constitutively activates gene transcription. However, RORalpha activity is passively suppressed by its opponents RevErbalpha and RevErbbeta, which both bind to the same target sequence. ...
The following thesis is concerned with the elucidation of structural changes of RNA molecules during the time course of dynamic processes that are commonly denoted as folding reactions. In contrast to the field of protein folding, the concept of RNA folding comprises not only folding reactions itself but also refolding- or conformational switching- and assembly processes (see chapter III). The method in this thesis to monitor these diverse processes is high resolution liquid-state NMR spectroscopy. To understand the reactions is of considerable interest, because most biological active RNA molecules function by changing their conformation. This can be either an intrinsic property of their respective sequence or may happen in response to a cellular signal such as small molecular ligand binding (like in the aptamer and riboswitch case), protein or metal binding. The first part of the thesis (chapters II & III) provides a general overview over the field of RNA structure and RNA folding. The two chapters aim at introducing the reader into the current status of research in the field. Chapters II is structured such that primary structure is first described then secondary and tertiary structure elements of RNA structure. A special emphasis is given to bistable RNA systems that are functionally important and represent models to understand fundamental questions of RNA conformational switching. RNA folding in vitro as well as in vivo situations is discussed in Chapter III. The following chapters IV and V also belong to the introduction part and review critically the NMR methods that were used to understand the nature and the dynamics of the conformational/structural transitions in RNA. A general overview of NMR methods quantifying dynamics of biomolecules is provided in chapter IV. A detailed discussion of solvent exchange rates and time-resolved NMR, as the two major techniques used, follows. In the final chapter V of the first part the NMR parameters used in structure calculation and structure calculation itself are conferred. The second part of the thesis, which is the cumulative part, encompasses the conducted original work. Chapter VI reviews the general NMR techniques applied and explains their applicability in the field of RNA structural and biochemical studies in several model cases. Chapter VII describes the achievement of a complete resonance assignment of an RNA model molecule (14mer cUUCGg tetral-loop RNA) and introduces a new technique to assign quaternary carbon resonances of the nucleobases. Furthermore, it reports on a conformational analysis of the sugar backbone in this RNA hairpin molecule in conjunction with a parameterization of 1J scalar couplings. Achievements: • Establishment of two new NMR pulse-sequences facilitating the assignment of quaternary carbons in RNA nucleobases • First complete (99.5%) NMR resonance assignment of an RNA molecule (14mer) including 1H, 13C, 15N, 31P resonances • Description of RNA backbone conformation by a complete set of NMR parameters • Description of the backbone conformational dependence in RNA of new NMR parameters (1J scalar couplings) Chapters VII & VIII summarize the real-NMR studies that were conducted to elucidate the conformational switching events of several RNA systems. Chapter VIII gives an overview on the experiments that were accomplished on three different bistable RNAs. These molecules where chosen to be good model systems for RNA refolding reactions and so consequently served as reporters of conformational switching events of RNA secondary structure elements. Achievements: • First kinetic studies of RNA refolding reactions with atomic resolution by NMR • Application of [new] RT-NMR techniques either regarding the photolytic initiation of the reaction or regarding the readout of the reaction • Discovery of different RNA refolding mechanisms for different RNA molecules Deciphering of a general rule for RNA refolding methodology to conformational switching processes of RNA tertiary structure elements. The models for these processes were a) the guanine-dependent riboswitch RNA and b) the minimal hammerhead ribozyme. Achievements: • NMR spectroscopic assignment of imino-resonances of the hypoxanthine bound guanine-dependent riboswitch RNA • Application of RT-NMR techniques to monitor the ligand induced conformational switch of the aptamer domain of the guanine-dependent riboswitch RNA at atomic resolution • Translation of kinetic information into structural information • Deciphering a folding mechanism for the guanine riboswitch aptamer domain • Application of RT-NMR techniques to monitor the reaction of the catalytically active mHHR RNA at atomic resolution In the appendices the new NMR pulse-sequences and the experimental parameters are described, which are not explicitly treated in the respective manuscripts.
Das genetische Material der Zellen besteht aus Molekülketten der Desoxyribonukleinsäure (DNA), die ein Träger der Erbinformation ist. In normalen Körperzellen wird die Erbinformation der DNA in eine andere Molekülkette, die sogenannte Ribonukleinsäure (RNA), übersetzt. Die RNA reguliert die Bildung von neuem Protein in der Zelle. Dass die RNA nicht bloß ein „Stempel“ ist, der die Informationen der DNA weitervermittelt, darin sind sich die Experten heute einig. RNA-Moleküle können Informationen speichern, katalytische Aktivitäten entfalten, sich perfekt tarnen, und sie regulieren auch als Produkt ihre eigene Synthese. Manche Viren enthalten ebenfalls RNA (oder DNA) und können so den Produktionsapparat der Zelle täuschen. Erkenntnisse über die Wechselwirkung dieser RNA mit natürlichen und synthetischen Liganden können zur Suche nach potentiellen Wirkstoffen beitragen. Nukleinsäuren sind lineare Biopolymere von grundlegenden Untereinheiten, die Nukleotide genannt werden und aus Adenin (A), Cytosin (C), Guanin (G), Urazil (U), und Thymin (T) zusammengesetzt sind. Sie sind jedoch in der Lage sich zu falten und so eine Doppel-Helixstruktur auszubilden. Diese besteht größtenteils aus den bekannten "Watson-Crick-Basenpaaren" (G-C und A-U oder A-T), die zur Stabilität der Struktur beitragen, sowie aus den weniger stabilen G-U-Paaren. Durch die Wechselwirkung zwischen verschiedenen Sekundärstrukturelementen entstehen Tertiärstrukturelemente, deren Struktur und Dynamik oft nur schwer experimentell zu bestimmen sind. Fortschritte in der RNA-Strukturanalyse wurden durch Röntgenkristallographie und Kernresonanzspektroskopie (NMR) möglich. Durch die Röntgenkristallographie wurden viele RNA-Eigenschaften festgestellt. Allerdings besteht keine Kristallstruktur für alle mögliche Einzelnfaser-RNA-Haarnadeln, weil diese immer dazu neigen, in eine linearen doppelte Faserform zu kristallisieren, die geringe biologische Bedeutung hat. Außerdem wurde mit Hilfe der NMR-Spektroskopie das dynamische Verhalten von RNA, z.B. Entfaltungsprozesse bei ansteigender Temperatur, beobachtet. Jedoch erlauben diese experimentellen Daten oft keine direkte mikroskopische Beschreibung der molekularen Prozesse. Molekulardynamik (MD)-Simulationen von biologischen Systemen ermöglichen es hingegen, diese Prozesse in atomischem Detail zu untersuchen. Die MD-Simulation beschreibt ein molekulares System auf atomarer Ebene mit Hilfe der klassischen Mechanik. Kräfte werden von empirischen Potentialen abgeleitet. Sie liefern zeitabhängige Trajektorien, die sich aus den Newton'schen Bewegungsgleichungen ergeben. Durch verbesserte Computerleistung, bessere Kraftfelder, und neu entwickelte genauere Methoden stimmen heutzutage MD-Simulationen von RNA mit experimentellen Daten immer besser überein. In meiner Doktorarbeit wurden MD-Simulationen durchgeführt um die Dynamik, die Struktur und insbesondere die Stabilität von RNA-Hairpins theoretisch zu beschreiben, um so ein erweitertes Verständnis für die dynamischen Vorgänge zu erhalten. Auch der SFB 579 der Universität Frankfurt beschäftigt sich mit RNA-Systemen. Erforscht wird unter anderem der D-Loop des Coxsackievirus B3 (CVB3), der Virenmyocarditis verursacht. Die Interpretation dieser experimentellen Daten wird durch MD-Simulation möglich. In dieser Arbeit wurden das GROMACS Software-Paket und das AMBER Kraftfeld verwendet, um das strukturelle, dynamische und thermische Verhalten der RNA-Hairpins mit Hilfe von MD-Simulationen auf atomarer Ebene zu untersuchen. Betrachtet wurden die 14-mer RNA-Hairpins, uCACGg und cUUCGg. Die verfügbaren NMR-Strukturen zeigen, dass das uCACGg-Tetraloop auffallend ähnlich in der gesamten Geometrie und den Wasserstoffbindungen zu der experimentellen Struktur des cUUCGg-Tetraloop ist, obwohl die schließende Basenpaarsequenz der beiden Tetraloops unterschiedlich sind. Trotz beachtlicher struktureller Ähnlichkeit unterscheiden sich allerdings die uCACGg und cUUCGg Tetraloops in Funktionalität und Thermostabilität. Zunächst orientiert sich unser erstes Bemühen an der Frage nach einem guten Modell für RNA-Hairpins und Simulationsbedingungen, um die zu untersuchenden RNA-Hairpins in Wasser möglichst realitätsnah zu simulieren. Erstens werden drei Versionen des biomolekularen AMBER-Kraftfelds geprüft, indem man die 60 ns Simulationen des 14-mer uCACGg-Hairpins durchführt. Die simulierten strukturellen Eigenschaften und Atomfluktuationen zeigen hohe Ähnlichkeiten in den drei Kraftfeldern. Darüber hinaus stimmen die von MD-Simulationen berechneten Atomkernabstände mit den experimentellen NMR-Daten gut überein. Die gute Übereinstimmung zwischen den Simulationen und den strukturellen NMR Daten belegt die Fähigkeit des AMBER-Kraftfelds zur Beschreibung der strukturellen Eigenschaft von kleinen RNA-Hairpins. Anschließend werden die Einflüsse der Methoden, welche die langreichweitigen, elektrostatischen Wechselwirkungen beschreiben, auf die strukturellen Eigenschaften untersucht. Insbesondere werden die Ergebnisse der Reaktionfeld-Methode mit denen der Particle Mesh Ewald (PME)-Methode verglichen. Es zeigt sich, dass die PME-Methode die elektrostatischen Wechselwirkungen am besten beschreibt, auch wenn die Simulationen der beiden Methoden Ähnlichkeit in der Struktur-Stabilität und der Atomfluktuation bei niedriger Natriumkonzentration aufweisen. Drittens wird der Kationseffekt auf die RNA-Stabilität untersucht. Betrachtet wurden zwei unterschiedliche Kationen (ein- und zweiwertig) und verschiedene Konzentrationen. Die Simulationen weisen darauf hin, dass sich die Metallionen in der Affinität zum RNA-Hairpin unterscheiden, wenn Na+ und/oder Mg2+ als Gegenionen verwendet werden. Weiterhin wird gezeigt, dass sich die bevorzugten Positionen der Na+-Ionen in der großen Furche (major groove) des RNA-Hairpins befinden. Insbesondere die Anlagerungsort der Na+-Ionen liegen in der Nähe des schließenden Basenpaar U5-G10. Im Vergleich zu Na+-Ionen lagern sich Mg2+-Ionen sowohl an die RNA-Basen U3, A4-U11, und die Phosphat-Gruppe, als auch an das schließenden Basenpaar U5-G10 an. Bestätigt werden die Modelle und Simulationsbedingungen durch den Vergleich von Parametern, die sowohl experimentell als auch durch Simulationen ermittelt werden können. Ferner erlauben MD-Simulationen Einblick in das System, indem sie detallierte Konformations- und andere Verteilungen liefern. In der vorliegenden Arbeit wurden die Einflüsse der Loopsequenz und des schließenden Basenpaares auf die Verteilung der Konformationen, der internen Bewegungen, und auf die Thermostabilität von zwei RNA-Hairpins mit Hilfe dieser Modelle untersucht. Zunächst wurden die strukturellen Eigenschaften bei Raumtemperatur ausgewertet. Die starken strukturellen Ähnlichkeiten und die gute Übereinstimmung mit NMR-Daten bestätigen die Hypothese, dass die zwei Tetraloops zur gleichen “erweiterten” RNA-Familie gehören. Diese zwei Hairpins haben ähnliche Lösemittelzugängliche Oberflächen (solvent accessible surface), wobei deren Lösemittel zugänglichen funktionellen Gruppen unterschiedlich sind. Weiterhin weist das uCACGg-Hairpin eine stärkere Tendenz auf Wasserstoffe abzugeben als das cUUCGg-Hairpin, was in den unterschiedlichen Bindungsaffinitäten zwischen diesen Hairpins und der viralen Protease begründet liegt. Darüber hinaus wurde der Faltungs- und Entfaltungsprozess mit Hilfe der Replica-Exchange-Molekulardynamik-Simulationen untersucht. Diese Untersuchung zielt auf das bessere Verständnis der unterschiedlichen Thermostabilität der Hairpins, indem sie die möglichen Zwischenprodukte im atomaren Detail liefern. Sowohl experimentell als auch von den MD-Simulationen ergibt sich eine Differenz in den Schmelztemperaturen der beiden Hairpins von ungefähr 20 K. Allerdings sind die von MD beobachteten Schmelztemperaturen 20 % höher als die von Experiment zu ansehende Wert. Die Ergebnisse machen deutlich, dass die Schmelztemperaturdifferenz nicht auf die Unterschiede in der Sequenz, in der Struktur, oder in der Dynamik der Loops zurückführen sind, sondern auf die Unterschiede der Basenpaaren in den Stämmen. Weiterhin wird gezeigt, dass sich das uCACGg-Hairpin einerseits kooperativ entfaltet, und die Entfaltung des cCACGg-Hairpins anderseits weniger kooperativ stattfindet. Um die schnelle interne Dynamik der uCACGg- und cUUCGg-Hairpins zu untersuchen, erlauben die Simulationen von 50 ns eine akurate Beschreibung der schnellen internen Bewegung der RNA-Hairpin, obwohl der den Hairpins zugängliche Konformationsraum nicht vollständig abgedeckt wird. Die NMR-Relaxationsparameter, die mit Hilfe der MD-Simulationen zurückgerechnet wurden, bestätigen das Modell und die Simulationsbedingungen der MD-Simulationen. Im Hinblick auf die Übereinstimmung kann man den besten Ansatz zur Berechnung der NMR-Ordnungsparameter bestimmen. In dieser Arbeit wurden drei verschiedene Ansätze angewandt, nämlich das Fitting von 100 ps auf modellfreiem Ansatz nach Lipari-Szabo, equilibrium average, und das Gaussian Axial Fluctuation (GAF)-Modell. Die zwei letzteren können nur qualitativ mit den experimentellen Daten übereinstimmen. Die NMR-Ordnungsparameter können mit Hilfe des Modells von Lipari-Szabo richtig ermittelt werden, wenn sich die interne Bewegung in kleineren Zeitskalen als zur Gesamtbewegung vollzieht. Vorausetzung für die Berechnung dieses Modells ist aber, dass das Fitting der internen Korrelationsfunktionen nur auf den ersten Teil von 100 ps der Korrelationsfunktionen eingesetzt wird. Die berechneten Ordnungsparameter deuten auf ein unterschiedliches Verhalten der beiden Hairpins besonders im Loop-Bereich hin. Die konformationelle Umordnung, die beim UUCG-Loop beobachtet wurde, tritt beim CACG-Loop nicht ein. Zusammenfassend lässt sich sagen, dass es durch den Einsatz von MD Simulationen ermöglicht wird, die strukturellen und dynamischen Eigenschaften der RNA-Systeme auf atomarer Ebene zu untersuchen. Als Schlussfolgerung zeigt diese Doktorarbeit, dass sich die Studie der konformationell Dynamik der RNA-Systeme durch die Kombination aus MD-Simulation und NMR-Spektroskopie sowie der Leistungsfähigkeit der MD-Simulationen, die die interne Bewegungen deutlich beschreiben können, untersuchen lässt.
Two types of proteins transport ions across the membrane – ion channels and ion pumps. Ion pumps transport ions against their electrochemical gradient by co-transporting another ion or a substrate molecule through a concentration gradient or by coupling this process to an energy source like ATP. Those that couple ATP hydrolysis to ion transport are called ion motive ATPases and can be classified as ‘V’, ‘F’ and ‘P’ types. In this thesis, two sub-classes of P-type ATPases, PIIIA and PIB were studied. Attempts were made to over-express and crystallize the plant proton pump AHA2 (a PIIIA-ATPase). Also, the two putative copper transporting ATPases, CtrA3 (CopB-like) and CtrA2 (CopA-like) from Aquifex aeolicus (both PIB pumps) were over-expressed in E. coli and characterized. PIIIA-type pumps transport protons across the membrane and are found exclusively in plants and fungi, and probably some archaea. One of the most characterized proton pump biochemically is the A. thaliana proton pump AHA2. An 8Å projection map of this enzyme is already available (Jahn 2001). PIBATPases, also called CPX type pumps transport heavy metal ions such as Cu+, Cu2+, Zn2+, Pb2+, Cd2+, Co2+ across biological membranes and play an important role in homeostasis and biotolerance of these metals. CopA and CopB are two such proteins that transport copper across cell membrane found in many prokaryotes. CopB-like proteins are found almost exclusively in bacteria, with CPH sequence motif, while CopA-like proteins have CPC sequence motif, also found in eukaryotic copper transporters including human ATP7A and ATP7B. CopB extrudes Cu2+ across the membrane. CopA is activated by and transports Cu+ but the direction of transport is debated. Attempts were made to over-express the plant proton pump AHA2 in yeast Pichia pastoris. However, the yeast expressed only a truncated protein, which could not be used for further studies. It can be concluded that P. pastoris strain SMD1163 is not a good host for expression of AHA2. Focus was then shifted to AHA2 that has been over-expressed and purified from S. cerevisiae strain RS72. Growth and purification protocols had to be changed from published methods because of laboratory constraints and this probably had an effect on the protein produced. The protein purified from S. cerevisiae could not be crystallized reproducibly for structural studies by electron microscopy. CtrA3 was expressed in E. coli and purified using Ni2+-NTA matrix. Like CopB of A. fulgidus (Mana Capelli 2003), it was active only in the presence of Cu2+ and to some extent in Ag+. The protein was maximally active at 75°C, at pH 7 and in presence of cysteine. Lipids were essential for the activity of CtrA3. However, when the protein was purified in Cymal-6, CtrA3 could not hydrolyze ATP, even when lipids were added to the reaction mixture. For reconstitution of CtrA3 into liposomes for 2D crystallization, several lipids were tested. To screen the lipids compatible for protein incorporation, CtrA3 was dialyzed with different lipids at a high lipid-to-protein ratio of 10:1 and centrifuged by sucrose density gradient. Protein incorporated in lipids localized with liposome fraction in the gradient. Most of the CtrA3 was incorporated into DPPC with no aggregation. This lipid was used for reconstitution of CtrA3 at low LPRs, and at an LPR of 0.3-0.5, the protein formed 2D crystals. A NaCl concentration of 50mM was necessary for the formation of crystals. However, salt removal by dialysis prior to harvesting was essential for obtaining wellordered lattices of CtrA3. Addition of preservatives like trehalose and tannin or direct plunging in liquid ethane for cryo-microscopy destroyed the crystal lattice. Similar to CtrA3, the gene responsible for expression of CtrA2 was amplified from genomic DNA of A. aeolicus and expressed in E. coli and purified by Ni2+-NTA. Functional characterization of CtrA2 was done by analyzing ATP hydrolysis activity of the enzyme. Similar to CopA of A. fulgidus (Mandal 2002), CtrA2 was activated in the presence of Ag+ and to some extent, Cu+. It is possible that both the copper ATPases of A. aeolicus have different ion selectivity- CtrA3, specific for Cu2+ and CtrA2, specific for Cu+. Maximal activity of CtrA2 was also at 75°C. Cysteine was essential for activity of CtrA2, but the protein was not dependent on addition of lipids for activation. Reconstitution of CtrA2 was done similar to CtrA3 for screening of lipids for 2D crystallization. Of the lipids tested, DOPC reconstituted the protein best. However, screening at low LPRs did not yield any crystals. Even though both CtrA3 and CtrA2 are similar heavy metal transporting Ptype ATPases from the same organism and have 36% identity, they behaved completely different in their expression levels in E. coli, purification profiles, activity and reconstitution in lipids.
Purification and characterization of heterologously produced cannabinoid receptor 1 and G proteins
(2007)
G protein coupled receptors form the largest group of transmembrane proteins, which are involved in signal transduction and are targeted directly or indirectly by 40-50% of the drugs in the market. Even though a lot of biochemical and pharmacological information was acquired for these receptors in the past decades, structural information is still insufficient. G protein coupled receptors are expressed in a very minute scale in the tissues. Purification of G protein coupled receptors, in amounts needed for structural studies, from native tissue is tedious and almost impossible. To overcome this first hurdle of insufficient protein, several heterologous protein expression systems are being used. Another difficulty in structural determination of a G protein coupled receptor is that it is a membrane protein. Membrane proteins are difficult targets for structural studies. One of the possible reasons is the little hydrophilic surface area on the membrane protein, reducing the chances of crystal contact between the molecules. The present work is an attempt to investigate possible ways to overcome these problems. Aim of the project was to use G proteins to increase the hydrophilic area of the G protein coupled receptor. G protein is a physiological partner to the G protein coupled receptor which makes the complex functionally relevant. In the present work five G alpha proteins were purified to homogeneity by a two step purification using metal affinity and ion-exchange chromatography. The G alpha subunits purified were tested for their detergent susceptibility. It was found that only some G proteins were active in the presence of detergent. Observation from contemporary reports also suggest that the G alpha proteins expressed in Escherichia coli, alone may not be sufficient to bind to the G protein coupled receptors in solution. So the project was extended towards expressing a G protein coupled receptor which was reported to exist in a complex with the G proteins, in the cells. Purifying such a functional complex could be more beneficial to use for crystallization. Cannabinoid receptors were chosen for heterologous expression and purification. Production of recombinant cannabinoid receptor 2 was investigated in Pichia pastoris. The protein obtained was highly heterogenous. There were several oligomeric forms as well as degradation products in the cell membranes. Most of the protein was lost in the purification steps leading to a poor yield. Several oligomeric forms and other impurities were still present in the protein sample after purification. Alternatively, a baculovirus mediated insect cell expression system was investigated, to produce the receptors. Cannabinoid receptor 1 was investigated in insect cell expression system because of its better biochemical understanding and pharmacological importance than cannabinoid receptor 2. Cannabinoid receptor 1 was produced in two forms, a full length and a distal carboxy terminal truncated version. All the several gene constructs made could be expressed in the Spodoptera frugiperda (Sf9) insect cells. Expression levels (Bmax) for the constructs with a decahistidine tag at the amino terminus and Strep-tagII at the carboxy terminus were 40 pmol/mg and 53 pmol/mg respectively, for full length and truncated versions. These expression levels are 2 fold higher than the levels reported till now in the literature. As was quite evident from previous experiences of other research groups, purification of this receptor was a challenge. Protein purified from immobilized metal affinity chromatography (Ni-nitrilo tri acetate)(Ni-NTA) was not even 50% pure. A second purification by immobilized monomeric avidin or Streptactin agarose, making use of Biotag and StreptagII respectively, drastically reduced the protein recovery. Later on, purification of receptor was investigated on different metal chelating resins. His-Select, a Ni-NTA based matrix from Sigma, with much lesser density than Ni-NTA from Qiagen, showed a better purification profile. Purification was optimized to get 80% homogeneity but with low yield (20%). Further efforts are needed to improve the yield and purity of the receptor, to use it for crystallization. Cannabinoid receptors are known to exist in a precoupled form to G proteins in the cells. The existence of such precoupled forms of the receptor was investigated using the fluorescence techniques. Guanosine-5-triphosphate binding assay on the cell membranes, in the absence of agonists confirmed the active precoupled form of the receptor. It was found that it is possible to co-immunoprecipitate the complex. These results show that the truncated cannabinoid receptor can be produced in functional form in insect cells in much higher yields than reported. This receptor exists as a complex with G proteins even in the absence of ligands. It was also shown that the receptor/G protein complex can be coimmunoprecipitated. Further work is required to investigate the possibility of purifying this complex to use it for co-crystallization.
G-protein coupled receptors (GPCRs) comprise the largest superfamily of cell surface receptors and possess a signature motif of seven transmembrane helices. The endothelin B (ETB) receptor is a member of rhodopsin like GPCR family. It plays an important role in vasodilation and is found in the membranes of the endothelial cells enveloping blood vessels. Knowledge of the three-dimensional structure of G-protein coupled receptors in general would significantly add to our understanding of their molecular mechanisms and would be useful in the search for new specific drugs. However, three-dimensional structural analysis will require milligram quantities of pure and homogeneous protein. This dissertation is a study of the production, biochemical characterization and preliminary structural studies of the human ETB G-protein coupled receptor. The present work aimed at elucidating the structure and mechanistic details of function of the receptor by using a combination of X-ray crystallographic and NMR methods for collecting structural data. To obtain homogenous and monodisperse receptor protein preparation for structural and functional studies, we implemented the baculovirus expression system for the production of ETB receptor for the present work. The two step affinity purification ensured capture of full-length receptor. Silver stained SDS-PAGE of the purified receptor-ligand complex indicated greater than 90% protein purity. Based on previous reports, we used the high affinity ligand (endothelin -1) binding to the receptor for co-crystallization of receptor-ligand complex by locking the receptor in the activated conformation. As a prerequisite for 3D crystallization trials, the stability of the detergent solubilized receptor-ligand complex was assessed with respect to pH, temperature and time. Receptor-ligand complex did not show any degradation and aggregation over 6 days at 4°C and 18°C. Interestingly, change of pH suggested that receptor-ligand complex is unstable at lower pH due to possible charge induced conformational changes. In our work, we introduced the idea of using fluorophore labeled ligand for simple visual recognition of the receptor-ligand complex during purification and crystallization. On the other hand, we alternatively used biotinylated endothelin-1 to produce an adequate amount of ligand bound receptor complex, thus ensuring homogeneity of the purified complex for use in structural studies. Thus far, preliminary crystals have been obtained for both the unlabelled ET-1 and fluorophore labeled ET-1 complexed with ETB receptor. Moreover, we performed the systematic investigation of the protein/peptide binding partner for the receptor-ligand complex with the chief aims of stabilizing structure and increasing the possibilities of 3D-crystal contacts. Thus subsequent to formation of receptor-ligand complex, the additional in vitro formation of a ternary arrestin-receptor-ligand complex was also attempted for use in structural studies. We successfully demonstrated that arrestin mutant (R169E) forms a tight complex with ETB receptor regardless of its phosphorylation state. A second approach to get insight into the ETB receptor ligand binding site relied on the use of spin isotope labeled ET-1 ligand peptide by employing solid state MAS NMR method. Preliminary data provided compelling evidence that the C-terminal region of the peptide is immobilized in an ordered environment and presumably bound to the receptor. This indicates that the approach is feasible, although there are difficulties in sample preparation for further spectral measurements and data collection which are currently being discussed in ongoing investigations. At this point of our research work, we initiated a collaborative effort to obtain high yields of pure, active receptor without post translational modifications, from an E. coli cell lysate based in vitro expression system. We successfully optimized the production of homogenous and monodisperse endothelin B receptor in mg amounts. Thus this could potentially provide an alternative source of high quality receptor production in large quantities for immediate crystallization trials. Thus we hope that the results from these investigations can be applied in a more general sense to the production and crystallization of other G protein-coupled receptors.
Metabotropic glutamate receptor subtype 7 (mGluR7) belongs to the family of G-protein coupled receptors. mGluR7 is widely distributed in the brain and primarily localized at presynaptic terminals, where it is thought to regulate neurotransmitter release and synaptic plasticity. Studies have shown that the intracellular C-terminal tail of mGluR7 binds a variety of proteins in addition to trimeric G-proteins. These newly identified protein interactions are believed to play a key role in the synaptic targeting and G-protein dependent signaling of mGluR7. Protein interacting with C kinase 1 (PICK1), a PDZ-domain protein, is a strong interaction partner of mGluR7a. In order to investigate the role of PICK1 in the synaptic trafficking and signaling of mGluR7a, a knock-in mouse line in which the interaction of mGluR7a and PICK1 is disrupted was generated. Analysis of the mutant mice by immunocytochemistry and immunoelectron microscopy showed that the synaptic targeting and clustering of mGluR7a was not altered, indicating that PICK1 is not required for mGluR7a receptor membrane trafficking and synaptic localization. However, when the spontaneous synaptic activity of cerebellar granule cell cultures prepared from both wild-type and knock-in mice was monitored, and L-AP4 (400μm) was found to decrease the frequency, but not the amplitude, of spontaneous excitatory currents in wild-type neurons, while no effect of L-AP4 on spontaneous synaptic activity was observed in knock-in neurons. This indicates that PICK1 binding to the C-terminal region of mGluR7a plays an essential role in mGluR7a mediated G-protein signaling. We examined the threshold sensitivity for the convulsant pentetrazole (PTZ) in knock-in mice. It was found that mGluR7a knock-in mice had a greater sensitivity to PTZ than wild-type mice. Moreover, the surface parietal cortex EEG recordings of the mutant mice revealed spontaneous synchronous oscillation, or "spike-and-wave discharges" (SWD), which displayed similar characteristics to absence-like seizures. It was also observed that the knock-in mice responded to pharmacology as human absence epilepsy. These data suggests that the knock-in mice displayed the phenotype of absencelike epilepsy. Furthermore, the behavioral analysis of the mGluR7a knock-in mice showed no deficits in motor coordination, pain sensation, anxiety as well as spatial learning and memory, thus the interaction of mGluR7a and PICK1 appears not to contribute to these physiological processes. Taken together, our data provides evidence for an important role of PICK1 in Gprotein dependent signaling of mGluR7a, whereas PICK1 is not required for synaptic targeting and clustering of mGluR7a. Our results also provide an animal model of absencelike epilepsy generated by disruption of a single mGluR7a-PDZ interaction, thus creating a novel therapeutic target against this neurological disease.
Safety concerns associated with the use of viral vectors in gene therapy applications have attracted considerable attention towards the development of nonviral vectors as alternatives for DNA delivery. While nonviral vectors are commonly not associated with safety problems, they are still very inefficient compared to viral vectors, and require significant improvements to approach the efficiency of their viral counterparts. Meanwhile ligands or single-chain antibody fragments that bind to cell surface receptors for increased and/or specific cellular uptake, endosome escape activities, and nuclear localization sequences (NLSs) to enhance transport of plasmid DNA into the nucleus, have become available that can be incorporated into nonviral vectors to improve their efficacy. However, as gene delivery is a multistep process, the challenge is to incorporate multiple of these functional elements into a single nonviral vector system, while retaining their specific activities. A promising method to attach such entities to plasmid DNA is the use of multifunctional fusion proteins that bind to DNA through a DNA-binding domain. In principle, two types of DNA-binding domains/proteins can be used to anchor additional functional domains or peptides to a plasmid, namely sequence-specific DNA-binding domains, described in the first part of this thesis, or those that bind DNA independent of its sequence, exemplified in the second part of this work by a derivative of the human HMGB2 protein. The first fusion protein constructed and analyzed contained the E. coli LexA repressor as a sequence-specific DNA-binding domain. In addition, this DNA-carrier protein, termed TEL, included a bacterial translocation domain as an integrated endosome escape activity, and human TGF-a for specific targeting to the EGF-receptor (EGFR). TEL was expressed in E. coli and purified under both native and denaturing conditions. Purified, denatured TEL was refolded and subsequently shown to bind specifically to EGFR-expressing cells. However, inclusion of TEL in complexes of plasmid DNA and poly-L-lysine (pL) did not lead to increased gene delivery into EGFR-expressing COS-1 cells. Most likely this was due to the absence of DNA-binding activity of the LexA moiety in TEL. In contrast, native TEL was able to interact specifically with DNA. Nevertheless, since this interaction was rather weak, and refolding of denatured TEL had not resulted in functional activity of all of its protein domains, it seemed unlikely that fusion proteins containing LexA would exhibit gene transfer capabilities superior to those of similar DNA-carrier proteins previously constructed in our group. Further work therefore focused on the use of the E2C-Sp1C protein as an alternative sequencespecific DNA-binding domain. This artificial zinc-finger protein was fused to the single-chain antibody fragment scFv(FRP5), directed against the human ErbB2 growth factor receptor. The resulting 5-E2C fusion protein was expressed in E. coli and purified under native and denaturing conditions. Refolded and native 5-E2C were found to bind specifically to ErbB2-expressing cells, indicating that scFv(FRP5) in 5-E2C was functional in both preparations. In contrast, whereas refolded 5-E2C bound DNA only weakly, significant DNA binding was observed for native 5-E2C. In addition, it could not only be shown that the interaction of native 5-E2C with DNA containing its recognition sequence was specific, but also that this protein was able to bind DNA and recombinant ErbB2 simultaneously, demonstrating the functionality of both domains in native 5-E2C. Despite these encouraging results, the inclusion of native 5-E2C in pL- or polyethyleneimine (PEI)-DNA complexes did not lead to an (5-E2C-specific) enhancement of gene transfer efficiency, irrespective of the presence of the endosome-disruptive reagent chloroquine during transfection. In the second part of this thesis an alternative approach for the development of DNA-carrier proteins for nonviral gene delivery is described, based on human HMGB2, a DNA-binding protein without sequence specificity. HMGB2 contains an acidic C-terminus that has been found to decrease the affinity of the protein for DNA. Therefore, this C-terminal tail was deleted, resulting in an HMGB2-variant consisting of amino acids 1-186. HMGB2186, purified under native conditions from E. coli lysates, was able to interact with DNA and bound to the surface of different cell lines. Importantly, after binding to plasmid DNA HMGB2186 mediated gene delivery into COS-7 cells with higher efficiency than pL. In addition, HMGB2186-mediated gene transfer was strongly enhanced in the presence of chloroquine, indicating that the endocytic pathway was involved in cellular uptake. To improve internalization and intracellular routing of HMGB2186 as a DNA-carrier, a derivative containing the TAT47-57 cell-penetrating peptide (CPP), reported to facilitate cell entry independent of endocytosis, was constructed. Since this peptide also contains an NLS, in addition an HGMB2186-variant containing the SV40-NLS was constructed to investigate the effect of a peptide that has only nuclear localizing properties. Interestingly, the resulting TAT-HMGB2186 and SV40-HMGB2186 fusion proteins displayed DNA-binding activities similar to HMGB2186, but mediated gene delivery into different cell lines clearly more efficiently than the parental molecule. Furthermore, the efficacy of both fusion proteins was enhanced markedly in the presence of chloroquine, an indication that endocytosis was involved in the transfection process mediated by these proteins. This suggests that the increased transfection efficiency observed for TAT-HMGB2186 was more likely due to the NLS function present in the TAT47-57 peptide, rather than to its ‘cell penetrating properties’. Finally, the incorporation of functional peptides derived from human proteins into HMGB2186 was investigated. An uncharged CPP originating from Kaposi-FGF, reported to facilitate efficient cellular uptake of fused protein domains in an endocytosis-independent manner, was fused to HMGB2186 together with the SV40-NLS. Interestingly, the resulting KSV40-HMGB2186 fusion protein bound DNA similarly as previously tested DNA-carrier proteins, but did not mediate enhanced transfection compared to HMGB2186. In addition, the importin-b-binding (IBB) domain derived from human importin-a2 was investigated as a component of a DNA-carrier protein. Since the IBB domain can function as an NLS, it was fused to HMGB2186 resulting in the DNA-carrier protein IBBHMGB2186. Although IBB-HMGB2186 bound DNA in a similar manner as the other HMGB2186-derivatives, gene delivery mediated by IBB-HMGB2186 was only as effective as HMGB2186 mediated transfection, suggesting no significant role of the IBB domain. However, addition of chloroquine resulted in a remarkable enhancement of IBB-HMGB2186-mediated gene transfer, which was now more efficient than with any other HMGB2186-variant tested, and not much lower than gene transfer mediated by PEI, one of the most efficient transfection reagents available to date. To enhance nonviral gene delivery even further, the HMGB2186-based DNA-carrier proteins described in this thesis might now serve as building blocks for novel fusion proteins that include additional complementing activities. In this respect it seems particularly promising that, under conditions of effective end some escape, IBB-HMGB2186, which consists entirely of protein domains of human origin, was the most efficient of all proteins tested in this work.
RcsB is a central transcriptional regulator in enteric bacteria involved in exopolysaccharide (EPS) biosynthesis, in cell division, in the expression of osmoregulated genes, and regulates at least 20 other genes and operons. It is a member of a phosphorelay system and signal transfer is mediated by phosphorylation through the RcsC/YojN phosphorelay. RcsB proteins modified with the phosphorylation mimic BeF3- as shown by its conformational changes and DNA binding properties and resulted phosphorylated RcsB derivatives with sufficient stability. Both, the wild type RcsB protein and the mutant RcsBD11A could be modified with BeF3-. Non-phosphorylated RcsB has been shown to bind as a heterodimer with the coinducer RcsA at the conserved RcsAB box in Rcs regulated promoters. In this study, it has been shown that the modification of RcsB by BeF3 - (I) has a negative effect on its homodimerization, (II) abolishes the complex formation of RcsAB with the RcsAB box as shown by the EMSA and SPR technique. All the effects were found to be reversible by increasing the NaF concentration in the assays presumably leading to the formation of the inactive BeF4 2- salt. This hypothesis of RcsB being modified by BeF3- was also supported by other phosphodonors like ATP and acetyl phosphate, both of them showed the same negative effect on DNA binding by RcsAB heterodimer giving evidence that BeF3- could be used as a phosphorylation mimic. In addition, the phosphorylation mimic BeF3- was found to be a better phosphorylating agent than ATP and acetyl phosphate. This is the first evidence that phosphorylation of RcsB might have a negative effect on the activation of RcsAB regulated operons. Autophosphorylation of RcsB proves that it has the ability to take up phosphoryl groups and the mutant protein also become autophosphorylated with less efficiency or stability than the wild type protein. RcsB probably takes up phosphoryl groups through RcsC -> YojN -> RcsB phosphorelay pathway. To study the interaction among the proteins in this pathway, fluorescence spectroscopy, NMR spectroscopy, and an in vivo ß galactosidase assay were performed by using two domains of RcsC (T-RcsC and R-RcsC), HPt domain of the protein YojN, and RcsB. The interactions between R-RcsC/YojN-HPt and YojN-HPt/RcsB supports the proposed pathway of phosphorylating RcsB. RcsB might also be phosphorylated by YojN-HPt that is phosphorylated by other sensor kinase other than RcsC in a cross-talk mechanism. The phosphorylation of RcsB by YojN-HPt probably has the same negative effect on cps induction as obtained with BeF3 - effect on DNA binding by RcsAB heterodimer.
P2X receptors are ligand (ATP)-gated ion channels that open an intrinsic cation permeable pathway in response to extracellular ATP released from both neuronal and non-neuronal cells. P2X receptors are abundantly distributed and mediate a wide variety of physiological functions, ranging from fast synaptic transmission in the central, peripheral, and enteric nervous system, to proinflammatory cytokine release from immune cells. The primary aim of this work was to elucidate the pathway that leads to the finally assembled trimeric P2X receptors, including the assessment of a possible role of ER chaperones and folding factors in this process. Additionally, the study was conducted to investigate the various ER quality control processes involved in the selection of “properly folded and assembled” P2X receptors that are suitable for the surface expression.
Summary and Outlook The aim of this work was the investigation of the Mn2+ binding sites in hammerhead and the Diels-Alder ribozymes. This project consists of three main topics. In the first part quantification and structural characterization of Mn2+ binding sites in the m- and the tsHHRz using Electron Paramagnetic Resonance (EPR) spectroscopy are described. The second part summarizes the newest results obtained for the cleavage activity of both mand tsHHRzs in the presence of different Mg2+ and Mn2+ and Na+ ion concentrations using the new method with fluorescent-labeled RNAs. Here the influence of neomycin B on the structure of Mn2+ binding pockets and on the catalytic activity of both HHRzs is discussed. In addition, a possible role of Mn2+ ions is suggested from correlation of the EPR data with the kinetic results. The last chapter is devoted to quantification and differentiation of Mn2+ binding sites of the Diels-Alder ribozyme using continuous wave (cw) EPR experiments in solution. In this work EPR spectroscopy was used to study the binding of Mn2+ ions to the cis tsHHRz and to compare it with the binding to the trans mHHRz and to the Diels-Alder ribozyme. Cw EPR measurements showed that the tsHHRz possesses a single highaffinity Mn2+ binding site with a KD of < 10 nM at a NaCl concentration of 0.1 M. This dissociation constant is three orders of magnitude smaller than the KD determined for the single high-affinity Mn2+ site in the mHHRz (KD = 4.4 μM). The measurements of catalytic activity have been performed using fluorescent-labeled RNAs. Compared to the mHHRz, the cis tsHHRz cleaves up to 20-fold faster in the presence of Mg2+/Mn2+ ions with no saturation of the cleavage rates at high metal(II) ion concentrations. This is in good agreement with the last investigations on the trans tsHHRz (Nelson et al. 2005). Thus, the much stronger Mn2+ binding and higher cleavage activity were attributed to the interaction between the two external loops of the tsHHRz which reduces the RNA dynamics and traps the Mn2+ in the tightly folded conformation. Intriguingly, according to the EPR studies the binding constants for Mn2+ ions are several orders higher than the concentration of Mn2+ ions required for the catalytic activity (mHHRz: KD = 4.4 ± 0.5 μM and the Mn2+ concentration required to achieve half of the maximum cleavage rate [Mn2+]1/2 = 4.1 ± 0.6 mM respectively). Therefore, strongly bound Mn2+ ions seem to be needed for the folding of the HHRz, whereas weakly bound metal(II) ions are required to achieve full catalytic activity, and may be directly involved in catalysis. A comparison between the Electron Spin Echo Envelope Modulation (ESEEM) and Hyperfine Sublevel Correlation (HYSCORE) spectra of m- and tsHHRz demonstrates that both binding sites in HHRzs are structurally very similar. This suggests that the Mn2+ is located in both ribozymes between the bases A9 and G10.1 of the sheared G•A tandem basepair, as shown previously and in detail for the mHHRz (Vogt and DeRose 1998, Schiemann et al. 2003). However, the hyperfine spectra of the tsHHRz with 15N labeled G10.1 revealed no difference in comparison with the ones with 14N. This leads to an interpretation that the Mn2+ binding sites in both ribozymes are not identical. In addition, aminoglycoside antibiotic neomycin B inhibits the cleavage activity of both despite of the fact that it displaces the high-affinity Mn2+ ion only from the mHHRz. Hence, binding of neomycin B to the m- and the tsHHRzs probably occurs at different sites and neomycin B displaces only loosely bound Me2+ ions from the tsHHRs, whereas in the mHHRz both the high-affinity ion and the weakly bound ions are replaced. Therefore, it cannot be excluded that weakly bound Mg2+/Mn2+ ions, together with looploop interactions, induce a structural rearrangement which brings the high-affinity ion closer to the cleavage site. In the case of the Diels-Alder ribozyme it possesses five Mn2+ binding sites with KD = 0.6 ± 0.2 μM in solution under conditions where it is catalytically active. The competition experiment with Cd2+ allows to distinguish three different types of Mn2+ binding sites in the Diels-Alder ribozyme including inner-sphere monomeric Mn2+, monomeric Mn2+ bound through water-mediated contacts and electronically coupled dimeric Mn2+. Three Mn2+ ions are more strongly bound to the ribozyme via inner-sphere contacts, whereas two other Mn2+ ions form water-mediated outer-sphere contacts with the nucleotides of the ribozyme. The inner-sphere Mn2+ with the highest affinity and the fourth Mn2+ ions added to the ribozyme form a dimer with a Mn2+-Mn2+ distance of ~6 Å (as arises from simulations). Moreover, an addition of the product analog inhibitor (AMDA) to the [Diels-Alder ribozymes/ Mn2+] complex shows no conformational changes in the Mn2+ binding pockets. This is in good agreement with the recent studies which suggest that the Diels-Alder ribozyme is preorganized (Keiper et al. 2004). Some considerations on the evolution of the project (Outlook) There may be several venues of continuation of this project, which exploit on unique combination of EPR experiments and biochemical studies on RNA. This combination may allow us to significantly contribute to understanding of metal role in HHRz catalysis. Since the tsHHRz possesses the high affinity Mn2+ binding site (Kd < 10 nM) it creates a possibility to find conditions where the structural site is occupied by Mn2+, while catalytic sites are occupied by Mg2+ ions. If these conditions will be established by EPR titration, a set of standard biochemical experiments may be designed to look at the kinetic of cleavage and differentiate the “structural” and catalytic effects. The other experiment would be to look at the Mn2+ binding site in the tsHHRz in comparison with P1 and P1/P2 complexes and compare the results with the ones for the mHHRz. No matter the answer, P1 can be used as a simpler model to study the effect of tertiary structure on Mn2+ binding. A set of the tsHHRz mutants can be created to observe the mutations affect on Mn2+ binding sites, Mn2+ affinity and correlate the data with the kinetic analysis. FRET-based kinetic assay with fluorophore pairs on P1 and P2 can be designed for the kinetic experiments. Having this system one will be able to perform kinetic measurements 100-fold faster comparing to standard gel procedures (everything will be done in 96-wells). By manipulating the lengths and the sequence of P2 we most likely will be able to use FRET assay for the chemical step analysis (provided Kd > k2), and measure it using stop-flow system with time resolution of microseconds. And finally, one will be able to quantitatively measure the effect of neomycin B on the tsHHRz. Another interesting possibility would be to look at the state of metal(II) in the tsHHRz – enzyme alone (dissociated product) and in the enzyme-product complex and compare with the full-length tsHHRz. It will provide the information about the local rearrangements upon catalysis and the role of metal(II) ions. Furthermore, additional pulse-EPR experiments using 15N labeling have to be performed in order to reveal the location of the high-affinity Mn2+ binding site in the tsHHRz. Additionally, paramagnetic Mn2+ ions can be localized within the global fold of HHRzs using PELDOR and site-directed spin labeling. Further characterization of the high-affinity binding site in the tsHHRz can be performed using high-field ENDOR measurements in order to obtain the 14N and 31P tensors.
Unlimited self-renewal is an absolute prerequisite for any malignancy, and is the ultimate arbiter of the continuous growth and metastasis of tumors. It has been suggested that the self-renewal properties of a tumor are exclusively contained within a small population, i.e., the so-called cancer stem cells. Enhanced self-renewal potential plays a pivotal role in the development of leukemia. My data have shown that APL associated translocation products PML/RARalpha and PLZF/RARalpha increased the replating efficiency of mouse lin-/Sca1+ hematopoietic stem cells (HSCs). This effect is partly mediated by induction of gamma–catenin which is an important mediator of the Wnt signaling pathway and has been shown to be up regulated by the AML associated translocation products(AATPs). Suppression of gamma–catenin by siRNA can abrogate the increased replating efficiency induced by AATPs. Transduction of gamma–catenin in lin-/Sca1+ HSCs led to increased replating efficiency and the expression of stem cell markers Sca1 and c-kit. Additionally it induced accelerated cell cycle progression of mouse bone marrow HSCs. Transduction/transplantation mouse models have shown that ectopic expression of gamma–catenin in HSCs led to acute myeloid leukemia without maturation. These data suggest important roles of Wnt signaling pathway in the leukemogenesis induced by PML/RARalpha, PLZF/RARalpha and AML1/ETO. In contrast to AATPs, CML and Ph+-ALL associated translocation products p185(BCR-ABL) and p210(BCR-ABL) did not affect the self-renewal potential of hematopoietic stem/progenitor cells. However my studies indicated that their reciprocal translocation products p40(ABL/BCR) and p96(ABL/BCR) actually increased the replating efficiency of hematopoietic stem/progenitor cells. The effect is stronger when induced by p96(ABL/BCR) than by p40(ABL/BCR). It is very intriguing that p96(ABL/BCR) can activate Wnt signaling and up regulate the expression of HoxB4. Transduction/transplantation mouse model has shown that p40(ABL/BCR) and p96(ABL/BCR) both have their own leukemogenic potential. Given the fact that leukemic stem cells maintain the growth of tumor and are the origin of relapse, the cure of leukemia is dependent on the eradication of the leukemic stem cell and abrogation of aberrantly regulated self-renewal capability. Both t-RA and As2O3 have been shown to induce complete remission in APL patients with PML/RARalpha translocation product. However, t-RA as a single agent achieves completeremission (CR) but not complete molecular remissions (CMR). Therefore, virtually all patients will experience a relapse within a few months. In contrast to t-RA, As2O3 as a single agent is able to induce CR as well as CMR followed by long-term relapse-free survival in about 50% of APL patients even if relapsed after treatment with t-RA-containing chemotherapy regimens. Nothing is known about the mechanisms leading to the complete different clinical outcomes by the two compounds although both have been shown to induce differentiation of blast cells, proliferation arrest, induction of apoptosis and degradation of PML/RARalpha. We investigated the effect of t-RA and arsenic on PML/RARalpha-expressing cell population with stem cell capacity derived from the APL cell line NB4 as well as Sca1+/lin- murine bone marrow cells. We found that t-RA did not reduce the replating efficiency in PML/RARalpha- and PLZF/RARalpha-infected Sca1+/lincells whereas it selected small compact colonies representing very early progenitor cells. T-RA was unable to reduce the capacity to form colony forming units-spleen (CFU-S) of Sca1+/lin-cells expressing PML/RARalpha, additionally t-RA did not impair the capability of engraftment of NB4 cells in NOD/SCID mouse. On the contrary to t-RA, As2O3 abolished the aberrant self-renewal potential of Sca1+/lin- cells expressing PML/RARalpha. As2O3 not only abolished the replating efficiency of PML/RARalpha positive cells but also completely abrogated the ability of PML/RARalpha-positive HSC to produce CFU-S in vivo. On the contrary to As2O3, t-RA increased the absolute cell number and the percentage of cells in the side population with respect to the whole cell population in NB4 cells. Taken together these data suggest that arsenic but not all-trans retinoic acid overcomes the aberrant stem cell capacity of PML/RARalpha positive leukemic stem cells. My data prove for the first time that there is a direct relationship between the capacity of compounds to effectively target the LSC and their capacity to eradicate the leukemia, and, thereby, to induce complete molecular remission and long-term relapse-free survival. Thus, in order to increase the curative potential of leukemia therapies, future studies need to include the effect of given compounds on the stem cell compartment to determine their ability to eradicate the LSC.
Membranes are essential for life, because a cell must separate itself from the environment to keep its molecules from dissipating away and also must keep out foreign molecules that disturb them or their cell components. However, the cell must communicate with the environment and adapt to the external conditions, needs to pump in nutrients and release toxic products of its metabolism. Membrane proteins present in the membranes of the cell and cell organelles, help the cell to gather information about the environment and perform various biological processes. Membrane proteins perform a wide range of biological functions including respiration, signal transduction and transport. Despite their high importance in biological function, only few structures have been determined because of the difficulties in producing high amounts of membrane proteins and obtaining good quality crystals. This Ph. D. thesis involves the study of different kinds of cytochrome oxidases and a membrane anchored cytochrome oxidase electron donor. Though structures of many cytochrome oxidases are known to date, there exist many different types of oxidases in different organisms, which help the organism to survive under unfavorable environmental conditions. The structural differences between these terminal oxidases which make the organism to survive in extreme environments are unclear. To investigate these, structures of different types of oxidases are necessary. Therefore, we are interested in revealing the structural details of different types of oxidases. The different types of oxidase I worked with were the caa3 HiPIP:oxygen oxidoreductase from Rhodothermus marinus, the aa3-type quinol oxidase from Acidianus ambivalens and bd-type quinol oxidase from three different organisms (Escherichia coli, Bacillus thermodenitrificans and Aquifex aeolicus). Besides the protein from E. coli all other proteins are from thermophilic organisms from which the proteins obtained are generally believed to be highly stable. The presence of a high content of charged amino acids that enhances the occurrence of salt bridges contributes to the stability of thermophilic proteins. ....
In this thesis the three dimensional solution strucutre of the RbfA protein from Thermotoga maritima was solved using multidimensional heteronuclear NMR spectroscopy. The RbfA protein binds to the helix I region of the 16S rRNA. To gain insights into the binding mode of RbfA to its target, a second RbfA construct from Helicobacter pylori was used. Comparison of the RbfA proteins with the published structure of RbfA from Escherichia coli, led to studies concerning the differences between proteins from thermophile and mesophile systems. In the second part of this thesis the native binding motive of the RbfA protein was identified. The RbfA protein binds to an alternate helix fold within the pre-sequence of the immature 16S rRNA.
Membrane proteins play vital role in a variety of cellular processes, such as signal transduction, transport and recognition. In turn they are involved in numerous human diseases and currently represent one of the most prevalent drug targets. A comprehensive understanding of the mechanisms mediated by membrane proteins requires information about their structures at near-atomic resolution, although structural studies of membrane proteins remain behind those of soluble proteins. A bottleneck in the study of membrane proteins resides in the difficulties that are encountered during their high-level production in cell based systems. However, many toxic effects attributed to the over production of membrane proteins are eliminated by cell-free expression, as viable host cells are no longer required. Therefore, the objective of this study was to obtain adequate amounts of selected membrane transport proteins for their structural studies using a cell-free expression system. For the establishment of the cell-free system for membrane proteins, the transporters YbgR and YiiP from Salmonella typhimurium LT2, PF0558 and PF1373 from Pyrococcus furiosus, from the cation diffusion family (CDF), BetP from Corynebacterium glutamicum from the betaine/carnitine/choline transporter (BCCT) family and Aq-2030 from Aquifex aeolicus VF5 from the monovalent cation/proton antiporter-2 (CPA2) family were selected. An Escherichia coli S-30 extract based cellfree system was established by generating the best expression constructs of the target proteins, preparing T7 RNA polymerase and an S-30 extract with high translation efficiency. The functionality of the S-30 extract was shown by the cell-free expression of correctly folded Green Fluorescent Protein (GFP). Essential factors of the cell-free system such as the Mg2+ concentration, the bacterial S-30 extract proportion in the reaction mixture and the time-course of cell-free reactions have been optimized. For the cell-free production of membrane proteins in soluble form, the possibility to supplement cell-free reactions with detergents was explored. A wide range of non-ionic or zwitterionic detergents, were found to be compatible with cell-free synthesis, while ionic detergents and non-ionic detergents at high concentrations had an inhibitory effect. Moreover, high concentrations of polyoxyethylene-alkyl-ethers (Brij) detergents were found to have enhancing effect on the production levels as well as on the solubility of cell-free produced proteins. As membrane proteins tend to misfold and aggregate in a membrane-free translation system, the possibility to supplement the cell-free reactions with inner membrane vesicles (IMVs) to obtain correctly folded target transport proteins was explored. All the target proteins were successfully produced in the batch cell-free reactions and were found to be incorporated in the IMVs. A continuous exchange cell-free (CECF) system was established, where consumable substrates (amino acids, nucleotides and energy regenerating compounds) were supplied to the cell-free reaction mixture through a dialysis membrane, which in consequence resulted in high-level production of target proteins compared to the batch system. The osmosensing and osmoregulated sodium-coupled symporter BetP from C. glutamicum was chosen for the large scale production in CECF set-up. The protein is easily produced in E. coli and is functional as assayed by its transport activity, after purification and reconstitution in liposomes. It is therefore possible to compare in-vivo and cell-free production. High-level cell-free production of BetP was achieved in CECF mode in different forms: (i) as precipitate, (ii) as soluble form in detergent, and (iii) incorporated in IMVs. Cell-free production of BetP resulted in the yield of about 0.5 mg of purified BetP from 1 ml of CECF reaction. The yield of purified BetP was increased to 1.6 fold by addition of 1% polyoxyethylene-(20)-cetyl-ether (Brij58) detergent in the reaction mixture. Moreover, the high level cell-free production of BetP (0.5 mg purified BetP/ml reaction mixture) incorporated in IMVs was shown for the first time in this work.However, it was observed that oligomerization of BetP was not efficient in the cell-free system. Factors that can promote the folding of membrane proteins such as lipids and chaperones were investigated. Addition of lipids and molecular chaperone GroE facilitated correct folding of BetP resulting in increased yield and stability of cell-free produced BetP. The results obtained indicate that most of the cell-free produced BetP exists in functional oligomeric form. The possibility of obtaining milligram amounts of BetP, a 12 trans-membrane protein from the cell-free reactions holds promise for structural and functional studies of other membrane proteins. In any case, the strategies adapted in this study should prove extremely valuable for the production of membrane proteins in the E. coli cell-free expression system.
First milestone of this Ph.D. thesis was the successful extension of conventional NTA/His-tag technique to self-assembling, multivalent chelator thiols for high-affinity recognition as well as stable and uniform immobilization of His-tagged proteins on chip surfaces. Bis-NTA was linked via an oligoethylene glycol to alkyl thiols by an efficient modular synthesis strategy yielding a novel, multivalent compound for formation of mixed SAMs with anti-adsorptive matrix thiols on gold. Multivalent chelator chips allow a specific, high-affinity, reversible, long-term immobilization of His-tagged proteins. In AFM studies reversibility of the specific protein immobilization process was visualized at single molecule level. The entire control over the orientation of the immobilized protein promotes this chip surface to an optimal platform for studies focusing on research targets at single molecule level and nanobiotechnology. Based on the constructed protein chip platform above and a novel AFM mode (contact oscillation mode, COM) – developed during the current Ph.D. work – protein nanolithography under physiological conditions enabling fabrication of active biomolecular patterns in countless variety has been established. Reversible COM-mediated nanostructuring is exceptionally suitable for multiplexed patterning of protein assemblies in situ. The first selfassembled protein layer acts as a biocompatible and ductile patterning material. Immobilized proteins can be replaced by the AFM tip applying COM, and the generated structures can be erased and refilled with different proteins, which are immobilized in a uniform and functional manner. Multi-protein arrays can be systematically fabricated by iterative erase-and-write processes, and employed for protein-protein interaction analysis. Fabrication of two-dimensionally arranged nanocatalytic centres with biological activity will establish a versatile tool for nanobiotechnology. As an alternative chip fabrication approach, the combined application of methodologies from surface chemistry, semiconductor technology, and chemical biology demonstrated successfully how pre-patterned templates for micro- and nanoarrays for protein chips are fabricated. The surface physical, as well the biophysical experiments, proved the functionality of this technology. The promises of such process technology are fast and economic fabrication of ready-to-use nanostructured biochips at industrial scale. Membrane proteins are complicated in handling and hence require sophisticated solutions for chip technological application. A silicon-on-insulator (SOI) chip substrate with microcavities and nanopores was employed for first technological investigation to construct a protein chip suitable for membrane proteins. The formation of an artificial lipid bilayer using vesicle fusion on oxidized SOI cavity substrates was verified by CLSM. Future AFM experiments will give further insights into the chip architecture and topography. This will provide last evidence of the sealing of the cavity by the lipid bilayer. Transmembrane proteins will be employed for reconstitution experiments on this membrane protein chip platform. Highly integrated microdevices will find application in basic biomedical and pharmaceutical research, whereas robust and portable point-of-care devices will be used in clinical settings.
Chalcogen-based species are common ligands in transition-metal chemistry and display a variety of coordination modes. Like alkyl- and arylchalcogenolates, silylchalcogenolates are able to stabilize transition-metal complexes. Metal chalcogenolates LnM-ESiR3 with small organic residues R can serve as precursors for larger metal–chalcogenide clusters, which can be accessed by cleaving the E-Si bond. Furthermore, large silyl residues at the chalcogen atom serve to kinetically stabilize reactive systems. To explore the diverse chemistry of this class of compounds, a number of different silyl chalcogenolates were synthesized, including the sodium siloxide Ph2MeSiONa and the chalcogen derivatives of the extremely sterically hindered silyl residues tBu2PhSi- und tBu3Si-. The anionic silyl species tBu2PhSiNa and tBu3SiNa nucleophilically degrade elemental chalcogens (S, Se, and Te), thus producing the silyl chalcogenolates tBu2PhSiENa and tBu3SiENa (E = S, Se, Te). The chemical and structural properties of these compounds were studied. Protonolysis produces the corresponding chalcogenols tBu2RSiEH, while oxidation leads to the dichalcogenides tBu2RSiE-ESiRtBu2 (R = tBu, Ph; E = S, Se, Te). Oxidative addition of the dichalcogenides to metal centers in low oxidation states offers one route to chalcogenolate complexes. To investigate the realm of this approach, three oligochalcogen compounds R3SiE-E′n-ESiR3 were synthesized. The tetrasulfane tBu3SiS-S2-SSitBu3 and the chalcogen(II)dithiolates (tBu3SiS)2Se and (tBu3SiS)2Te were produced, and their stability was investigated. The direct comparison of isoelectronic species allows for a deeper understanding of their similarities and differences. The silanides R3Si– can be considered as anionic phosphane analogues in which a phosphorus atom has been formally replaced with a Si– unit. Phosphanylborhydrides R2BH3P– also belong to this isoelectronic series. The same analogy holds true for the chalcogen derivatives related to the phosphane chalcogenides R3P=E. With this in mind, complexes of the CpFe(CO)2 fragment with the different isoelectronic ligands were synthesized and compared. The silyl-based ligands were found to be the strongest donors of the two isoelectronic series. The differences in donor strength were roughly twice as large for the nonchalcogen species as for the chalcogen-based ligands. To further investigate the chemistry of transition-metal silyl chalcogenolate complexes, the coordination behavior of the chalcogenolates tBu2RSiE– (R = tBu, Ph; E = S, Se, Te) was studied. Salt metathesis of silyl thiolates with appropriate metal halides leads to the multinuclear complexes [Cu(SSitBu2Ph)]4 and [ZnCl(SSitBu3)(THF)]2. Metathesis products were identified in the reactions of BrMn(CO)5 with one or two equivalents of tBu3SiSNa(THF)2. Diproporationation of these compounds leads to dimeric Mn(I)Mn(II) complexes. The crystal structure of the dinuclear disproportionation product [(CO)3Mn(mu-SSitBu3)3Mn(SSitBu3)]– displays a terminal tBu3SiS– ligand, which coordinates with a Mn-S-Si angle of 180°. This geometry indicates that the thiolate can be considered as a six-electron donor (2 sigma e–, 4 pie–), analogous to the cyclopentadienyl ligand. Photoinduced oxidative addition of the dichalcogenides to Fe(CO)5 leads to the dimeric complexes [(CO)3Fe(ESitBu3)]2 (E = S, Se, Te). The tellurolate complex forms quantitatively within 8 h. The thiolate complex, on the other hand, is formed slowly over a period of six months. IR-spectroscopic investigation of the CO vibrations of the three homologous complexes indicates that the tellurolate is the strongest donor of the series.
An application of EPR spectroscopy that is becoming increasingly important is the measurement of distances between electron spins. Several EPR methods have been developed for this purpose, all based on measuring the dipolar coupling between two spins. Due to the specific nature of the sample, we applied dipolar relaxation enhancement measurements to study the geometry of a protein-protein complex. The paramagnetic centers in question had EPR spectra that were too broad and had such short relaxation time that they could not be studied using the more straightforward PELDOR technique. EPR spectral resolution can be increased appreciably by measuring at a frequency higher than conventional X-band (9 GHz) frequency. The spectra of many paramagnetic species can only be resolved at frequencies higher than 90 GHz. For accurate measurement of the orientation of the vector between two dipolar coupled spins with respect to the g-tensors of the spins, high spectral resolution is required. We therefore performed our EPR measurements at G-band (180 GHz) frequency. Dipolar relaxation measurements were applied to study the complex that is formed by the two electron-transfer proteins cytochrome c and cytochrome c oxidase (CcO) from the soil bacterium Paracoccus denitrificans. We were able to detect dipolar relaxation enhancement due to complex formation of soluble subunit II of P.d. CcO (CcOII) with two substrate cytochromes, which was practically absent in a mixture of CcOII with the negative control protein cytochrome c1. This complex formation was characterized by a pronounced temperature dependence that could be simulated using a home-written computer program. The G-band EPR measurements could not be simulated with a single complex geometry. This provided evidence for the hypothesis that electron-transfer protein complexes are short-lived and highly dynamic; they do not seem to form one specific electron-transfer conformation, but rather move around on each other’s binding surfaces and transfer an electron as soon as the distance between donor and acceptor is short enough. As a test of our simulation program, we also applied dipolar relaxation measurements to specially synthesized organic molecules that contained a nitroxide radical and a metal center. The transverse relaxation of Cu2+-OEP-TPA was compared to the relaxation of Ni2+-OEP-TPA at temperatures between 20 and 120 K. In this temperature range, the nitroxide relaxation was enhanced due to the presence of Cu2+, but not by Ni2+. Similarly, relaxation enhancement was found in the nitroxide-Mn2+ pair in Mn2+-terpyridine-TPA with respect to the terpyridine-TPA ligand. Due to the fast T2 relaxation of the nitroxide radical at high temperatures, the measurements were all performed in the low-temperature regime where the T1 relaxation rate of the metal ion was smaller than the dipolar coupling frequency. In this region, no structural information about the molecule can be deduced, since the dipolar relaxation enhancement is only determined by the T1 of the metal ion. The dipolar relaxation measurements we performed at high field indicated a difference in relaxation times between X-band and G-band frequencies. Extensive T1 - measurements of different paramagnetic centers (CuA, Cu2+) confirmed a strong dependence of T1 on magnetic field in the temperature range where the direct process is the dominating T1 relaxation process. This dependence is very strong (factor of 103 with respect to X-band), but does not follow the B04 dependence predicted in literature. The T1 relaxation of low-spin iron in cytochrome c at high magnetic field, estimated from dipolar relaxation data, is also in agreement with a larger contribution by the direct process (factor of 104). Dipolar relaxation enhancement was found to be a technique that is useful for measuring distances between paramagnetic centers, but only for systems where several important conditions are met, such as: the system exists in one certain static geometry, and the relaxation rate of the fast-relaxing spin is faster than the dipolar coupling frequency within the accessible temperature range. Additionally, it is a great advantage for the analysis of dipolar relaxation data if the procedure of dividing the relaxation trace of the dipolar-coupled slow-relaxing spin by the relaxation trace of the slow-relaxing spin in absence of dipolar coupling can be applied. Another useful application of dipolar relaxation enhancement measurements is the measurement of T1 relaxation of extremely fast-relaxing spins, or spins that are otherwise difficult to detect.
Die vorliegende Arbeit behandelt die Entwicklung und Überprüfung von Modellen zur Berechnung von Schwingungspektren von Peptiden und Proteinen. Solche Modelle verbinden die Konformationsstruktur eines Moleküls mit seinen Schwingungseigenschaften und sind demzufolge wichtig für die Interpretation der Schwingungspektren. Die im Rahmen dieser Arbeit durchgeführte theoretische Erforschung dieses Gebietes beschränkt sich auf die Betrachtung der Amide-I-Moden, welche aufgrund ihrer physikalischen Eigenschaften sich zur Untersuchung der Peptidkonformationen eignen. Die Arbeit kann prinzipiell in zwei Teile separiert werden. In dem ersten Teil werden Fragen betrachtet, die mit der Entwicklung des Schwingungshamiltonian verbunden sind. Im zweiten Teil wurden die erhaltenen Hamiltonian für die Berechnung der Schwingungspektren verwendet. Bei der Berechnung der Schwingungspektren wurden verschiedene spektroskopische Näherungen verwendet und erforscht. Die Entwicklung des Schwingungshamiltonian beinhaltet zwei Aufgaben. Die ab initio Parametrisierung des Schwingungshamiltonian von Dipeptiden, sowie die Analyse der Entwicklungsmethoden für Schwingungshamiltonian von Polypeptiden. Die Entwicklungsmethoden stützen sich auf ab initio berecheten Schwingungseigenschaften von Dipeptiden und/oder elektrostatische Modelle. Die ab initio Parametrisierung basiert auf einer Geometrieoptimierung und anschließender Berechnung von Normalmoden. Hierbei wurde die Abhängigkeit der Ergebnisse vom theoretischen Niveau und dem verwendeten Basissatz untersucht. Die Transformation der errechneten Normalmoden lieferte die Schwingungseigenschaften der lokale Amide-I-Mode. Die Lokalisierung der Normalmode folgt diversen Kriterien. Sie ist von der Wahl der Lokalmoden und somit implizit auch von der Art der Geometrieoptimierung abhängig. Mit dieser Arbeit konnte die Abhängigkeit der Ergebnisse von der Parameterwahl weitgehend aufgeklärt und eine für das Amide-I-System geeignet Parametrisierung gefunden werden. Im nächsten Arbeitsschritt wurde die Abhängigkeit der Amide-I-Schwingungseigenschaften von den Peptidseitenketten und terminalen Gruppen untersucht. Desweiteren wurden Methoden zur Formulierung der Hamiltonian für Polypeptide konzeptionell entwickelt. Diese Untersuchung ist außerordentlich wichtig, da direkte quantenmechanische Berechnungen von Polypeptiden zu zeitaufwendig sind. Solche Methoden beruhen auf dem sogenannten “Building-Block”-Ansatz und verschiedenen elektrostatischen Modellen. In dieser Arbeit wurden sowohl die einzelnen Methoden als auch ihre Kombination für die Entwicklung des Hamiltonians verwendet. Zur Abschätzung der Genauigkeit der verwendeten Methoden wurden Vergleichsrechnungen durchgeführt. Im zweiten Teil dieser Arbeit wurden die erhaltenen Schwingungshamiltonian zur Berechnung von Schwingungsspektren diverser gelöster Peptide angewandt. In diesem Zusammenhang konnte die Genauigkeit unterschiedlicher spektroskopischer Approximationen überprüft werden. Auf Grundlage der erhaltenen Ergebnisse können wir sagen, dass eine angemessene Beschreibung der konformationellen Verteilung und eine korrekte Berechnung des dynamischen Absorptionsspektrum gewährleistet ist. Was noch fehlt, ist ein hinreichend genaues quantenchemisches Modell für die Schwingungsfrequenzen eines gelösten Peptids. Diese Aufgabe stellt zur Zeit ein aktives Forschungsgebiet dar. Zuletzt wurde das Schwingungsspektrum eines sogenanten “Photoschaltbaren”-Peptids simuliert. Mit Hilfe des dafür aufgestellten Hamiltonians ist man in der Lage spektroskopische Beobachtungen auf Konformationsänderungen direkt zu übertragen.
The aim of the thesis was to identify structure activity relationships (SAR) in the primary screening data of high-throughput screening (HTS) assays. The strategy was to perform a hierarchical clustering of the molecules, assign the primary screening data to the created clusters and derive models from the clusters. The models should serve to identify singletons, clusters enriched with actives, not confirmed hits and false-negatives. Two hierarchical clustering algorithms, NIPALSTREE and hierarchical k-means have been developed and adapted for this purpose, respectively. A graphical user interface (GUI) has been implemented to extract SAR from the clustering results. Retrospective and prospective applications of the clustering approach were performed. SAR models were created by combining the clustering results with different chemoinformatic methods. NIPALSTREE projects a data set onto one dimension using principle component analysis. The data set is sorted according to the scoring vector and split at the median position into two subsets. The algorithm is applied recursively onto the subsets. The hierarchical k-means recursively separates a data set into two clusters using the k-means algorithm. Both algorithms are capable of clustering large data sets with more than a million data points. They were validated and compared to each other on the basis of different structural classes. NIPALSTREE provided with the loading vectors first insights into SAR whereas the hierarchical k-means yielded superior results. A GUI was developed allowing the display of and the navigation in the clustering results. Functionalities were integrated to analyse the clusters in the dendrogram, molecules in a cluster, and physicochemical properties of a molecule. Measures were developed to identify clusters enriched with actives, to characterize singletons and to analyse selectivity and specificity. Different protease inhibitors of the COBRA database were examined using the hierarchical k-means algorithm. Supported by similarity searches and nearest neighbour analyses thrombin inhibitor singletons were quickly isolated and displayed in the dendrogram. By scaling enrichment factors to the logarithm of the dendrogram level, clusters enriched with different structural classes of factor Xa inhibitors were simultaneously identified. The observed co-clustering of other protease inhibitors provided a deeper insight into selectivity and specificity and shows the utility of the approach for constructing focussed screening libraries. Specificity was analyzed by extracting and clustering relative frequencies of the protease inhibitors from the clusters of dendrogram level 7. A unique ligand based point of view on the pocketome of the protease enzymes was obtained. To identify not confirmed hits and false-negatives in the primary screening data of HTS assays, three assays were retrospectively analysed with the hierarchical k-means algorithm. A rule catalogue was developed judging hits in terminal clusters based on the cluster size, the percent control values of the entries in a cluster, the overall hit rate, the hit rate in the cluster and the environment of a cluster in the dendrogram. It resulted in the identification of a high proportion of not confirmed hits and provided for each hit a rating in context of related non-hits. This allows prioritizing compounds for follow-up studies. Non-hits and hits were retrieved from terminal clusters containing hits. Molecules bearing false-negative scaffolds were co-extracted and enriched. To minimize the number of false-positives in the extracted lists, Bayesian regularized artificial neutral network classification models were trained with the data. Applying the models marked improvement of enrichment factors for the false-negatives was obtained. It proofs the scaffold-hopping potential of the approach. NIPALSTREE, the hierarchical k-means algorithm and self-organising maps were prospectively applied to identify novel lead candidates for dopamine D3 receptors. Compounds with novel scaffolds and low nanomolar binding affinity (65 nM, compound 42) were identified. To provide a deeper insight into the SAR of these molecules, different alternative computational methods were employed. Support vector-based regression and partial least squares were examined. Predictive models for dopamine D2 and D3 receptor binding affinity values were obtained. Important features explaining SAR were extracted from the models. The prospective application of the models to the diverse and novel virtual screening data was of limited success only. Docking studies were performed using a homology model of the dopamine D3 receptor. The visual inspection of the binding modes resulted in the hypothesis of two alternative binding pockets for the aryl moiety of dopamine D3 receptor antagonists. A pharmacophore model was created simultaneously requiring both aryl moieties. Virtual screening with the model identified a nanomolar hit (65 nM, compound 59) corroborating the hypothesis of the two binding pockets and providing a new lead structure for dopamine D3 receptors. The presented data shows that the combined approach of hierarchically clustering a data set in combination with the subsequent usage of the clusters for model generation is suited to extract SAR from screening data. The models are successful in identifying singletons, clusters enriched with actives, not confirmed hits and false-negative scaffolds.