Refine
Year of publication
Document Type
- Doctoral Thesis (97)
Has Fulltext
- yes (97)
Is part of the Bibliography
- no (97)
Keywords
- Heterologe Genexpression (3)
- Endothelin (2)
- G-Protein gekoppelte Rezeptoren (2)
- Gentherapie (2)
- HIV (2)
- Membrane Proteins (2)
- NMR-Spektroskopie (2)
- gene therapy (2)
- 5-Lipoxygenase (1)
- APOBEC3G (1)
Institute
- Biochemie und Chemie (63)
- Biochemie, Chemie und Pharmazie (30)
- Pharmazie (5)
- Biowissenschaften (1)
- Georg-Speyer-Haus (1)
The focus of this thesis is the integral membrane protein Escherichia coli diacylglycerol kinase (DGK). It is located within the inner membrane, where it catalyzes the ATP-dependent phosphorylation of diacylglycerol (DAG) to phosphatic acid (PA). DGK is a unique enzyme, which does not share any sequence homology with typical kinases. In spite of its small size, it exhibits a notable complexity in structure and function. The aim of this thesis is the investigation of DGK’s structure and function at an atomic level directly within the native-like lipid bilayer using MAS NMR. This way, a deeper understanding of DGK’s catalytic mechanism should be obtained.
First, the preparation of DGK was optimized, leading to a sample, which provides well-resolved MAS NMR spectra. The high quality MAS NMR spectra formed the foundation for the second step, the resonance assignment of DGK’s backbone and side chains. The assignment was performed at high magnetic field (1H frequency 850 MHz). The sequential assignment of immobile domains was carried out using dipolar coupling based 3D experiments, NCACX, NCOCX and CONCA. The measurement time could be reduced by paramagnetic doping with Gd3+-DOTA in combination with an E-free probehead. The sequential assignment was mainly performed using a uniformly labelled sample (U-13C,15N-DGK). Residual ambiguities could be resolved by reverse labelling (U-13C,15N-DGK-I,L,V). Resonances could be assigned for 82% of the residues, from which 74% were completely assigned. For validation, ssFLYA was applied, which is a generally applicable algorithm for the automatic assignment of protein solid state NMR spectra. Its principal applicability for demanding systems as membrane proteins could be proven for the first time. Overall, ~90% of the manually obtained assignments could be confirmed by ssFLYA. For the completion of DGK’s assignment, J-coupling based 2D experiments, 1H-13C/15N HETCOR and 13C-13C TOBSY, were carried out to detect highly mobile residues. This way, residues of the two termini and the cytosolic loop, which were not detectable by dipolar coupling based experiments, could be assigned tentatively. Whereupon, peaks for arginine and lysine were assigned unambiguously to Arg9 and Lys12. Overall, ~84% of the residues could be assigned by the applied NMR strategy. Furthermore, a secondary structure analysis was carried out. It showed substantial similarities between wild-type DGK, its thermostable mutant determined both by MAS NMR and the crystal structure of wtDGK. However, there are few differences around the flexible regions most likely caused by the high mobility of these regions. During the assignment procedure, no systematic peak doublets or triplets were detected, indicating that the DGK trimer adopts a symmetric conformation. This is in contrast to the X-ray structure, which shows asymmetries between the three subunits. Especially, crystal packing may be a potential source for these structural asymmetries.
On the basis of the nearly complete assignment of DGK, the apo state was compared with the substrate bound states. Perturbations in peak position and intensity of the substrate bound states were analysed for all assigned residues in 3D and 2D spectra. The nucleotide-bound state was emulated by adenylylmethylenediphosphonate (AMP-PCP), a non-hydrolysable ATP analogue, whereas the DAG-bound state was mimicked by 1,2-dioctanoyl-sn-glycerol (DOG, chain length n = 8). Upon nucleotide binding, extensive chemical shift perturbations could be observed. These data provide evidence for a symmetric DGK trimer with all of its three active sites concurrently occupied. Additionally, it could be demonstrated that the nucleotide substrate induces a substantial conformational change. This most likely supports the enzyme in binding of the lipid substrate, indicating positive heteroallostery. In contrast, the overall alterations caused by DOG are very minor. They involve mainly changes in peak intensities. For DGK bound with either AMP-PCP+DOG or only AMP-PCP, a similar spectral fingerprint was observed. This implies that binding of the nucleotide seems to set the enzyme into a catalytic active state, triggering the actual phosphoryl transfer reaction.
The investigation of DGK’s remarkable stability and the cross-talk between its subunits forms the last part of this thesis. This demands for the identification of key intra- and interprotomer contacts, which are of structural or functional importance. For this purpose, 13C-13C DARR and 2D NCOCX spectra with long mixing times were recorded using high field MAS NMR. Additionally, DNP-enhanced 13C−15N TEDOR experiments were conducted on mixed labelled DGK trimers to enable the visualization of interprotomer contacts. With the applied NMR strategy, intra- (Arg32 - Trp25/ Glu28/ Ala29 and Trp112 - Ser61) and interprotomer (ArgNn,e - AspCg/ GluCd/ AsnCg) long-range interactions could be identified.
The focus of this research was to understand the molecular mechanism that lies behind the insertion of tail-anchored membrane proteins into the ER membrane of yeast cells. State-of-art instruments such as LILBID, and Cryo-EM, combined with the introduction of direct electron detectors, were used to analyze the proteins that capture tail-anchored proteins near the ER membrane and help their releases from a chaperone, an ATPase named Get3. Get3 escorts TA proteins to the ER membrane, where both Get3 and the TA proteins interact sequentially to Get3 membrane bound receptors Get1 and Get2. Get1 and Get2 are homologs of mammalian WRB and CAML.
The native host was used to separately produce Get1, Get2, and the Get2/Get1 single chain constructs. The studies showed that when Get1 is expressed alone, Get1 does not seems to be located in the ER membrane but rather in microbodies like shape organelles (or peroxisome). Interestingly, Get1 seems to be located in the ER membrane when it is linked to Get2 as single chain construct.
The localization study of Get2/Get1 fused to GFP shows from the fluorescence intensity that Get2/Get1.GFP has a tube-like morphology or membrane-enclosed sacs (cisterna), implying that Get2/Get1 is actually targeted to the ER membrane and is likely functional. In other words, Get1 and Get2 stabilize each other in the ER membrane.
The expression of Get2/Get1 was found to be already optimum when expressed as single chain construct because the fluorescence counts did not improve when additives such as DMSO or histidine were added. However, when Get1 and Get2 are expressed separately, additives improve their protein production yield. In 1 liter culture, Get1 yield is increased by about 3 mg and Get2 by 1.8 mg. This can be explained by the space that Get1 and Get2 should occupy within the ER membrane as they must coexist with other membrane components to maintain the homeostasis of the cell. Hence, if there were no gain for single chain construct expression, it meant that Get2/Get1 was already well expressed on its own in ER membrane and has reached its optimum expression without the help of additives. The Get2/Get1 overexpression is more stable, tolerated and less toxic for the cells to express it at a high level.
DDM has proved to be the best detergent from the detergents tested to solubilize Get1, Get2, and Get2/Get1.
Thereafter, Get1, Get2 (data not shown), and Get2/Get1 were successfully purified in DDM micelles.
Furthermore, for the first time using LILBID, the actual study has shown that Get1 and Get2 are predominantly a heterotetramer (2xGet1 and 2xGet2) but higher oligomerization may exist as well.
Get3 binds to Get1 in a biphasic way with a specific strong binding of an affinity of 57 nM and the second of 740 nM nonspecific indicative of heterogeneity within the interaction between Get1 and Get3. This heterogeneity is caused by the presence of different conformation of either protein. However, in order to characterize a high-resolution structure model of a specific target one needs highly homogenous and identical molecules of the target protein or complex in solution. The homogeneity increases the chances of growing crystals during crystallography as the good homogeneity will likely generate a perfect packing of unit cells stack (also known as crystal lattice) in the three-dimensional spaces. The same truth goes for the single particles analysis Cryo-EM, especially for smaller complexes where having less or no conformation alterations of specific targets will enable the researcher to classify the particles in 2D and 3D, therefore improving the signal-to-noise-ratio that will ultimately lead to high-resolution structure determination.
Get1, Get2/Get1 and chimeric variants (tGet2/Get1, T4l.Get2/Get1, T4l.Get2.apocyte.Get1) were crystallized but none of the crystals could diffract due to heterogeneity.
This heterogeneity was not only occurring upon the binding of Get3 to its membrane receptors, but seems to be already present within the receptors themselves through possibly different conformation.
In this Ph.D. thesis, the heterogeneity of purified Get2 and Get1 as complex or individually in detergent is then, so far, the limiting factor for obtaining a high-resolution structure model of Get1 and Get2. As mentioned above, the heterogeneity observed was not due to the quality of the sample preparation but rather to the effect of different conformations that could have been native, or just because of the micelle used, as it was proven by the 3-D heterogeneity classification by Cryo-EM.
In general, crosslinking is one way to keep the integrity of protein complexes, however it appeared not to improve the sample quality when it was analyzed in micelles. Often the integrity of some membrane proteins is affected when they are solubilized and purified in detergents.
Finally, in this study, the structural map of Get2 and Get1 complex linked with chimeric protein T4 lysozyme and apocytochrome C b562RIL gene was obtained at 10 Å. However, this single chain construct has a density map corresponding to heterodimer species (one Get1 and Get2). Therefore, based on those data the tertiary structure of Get2/Get1 in micelle is poorly defined. It could be that the membrane extraction in DDM and the purification destabilizes the structure of the complex.
Die Tumorprotein-Familie des Proteins p53 besteht aus drei Familienmitgliedern p53, p63 und p73 mit diversen Funktionen als Transkriptionsfaktoren. p53 war das erste Mitglied dieser Familie, das im Jahre 1979 entdeckt wurde und wurde zunächst als krebsverursachendes Protein eingeordnet, weil es in vielen Tumorgeweben in erhöhter Menge vorgefunden wurde. Es wurde allerdings festgestellt, dass der Großteil dieser gefundenen p53-Proteine funktionsunfähig durch Mutationen in ihrer Aminosäuresequenz waren. Unmutiertes p53 hingegen führt zu einem Stopp von Zellteilung oder sogar Zelltod, sofern die Zellen genetischem Stress durch Strahlung oder mutagene Chemikalien ausgesetzt sind. Heute wird p53 als eines der wichtigsten Tumor-Unterdrückungsproteine betrachtet. Die beiden anderen Familienmitglieder p63 und p73 existieren in einer Vielzahl von Isoformen. Neben carboxyterminaler alternativer mRNA-Prozessierung (α, β, γ, usw. Isoformen) führen zwei unabhängige Promotoren auch zu zwei unterschiedlichen Aminotermini. Hier wird zwischen ΔN- und TA-Isoformen unterschieden. Im Falle von p63 treten zwei dominante Isoformen auf, ΔNp63α und TAp63α. Während ΔNp63α eine Rolle in der Differenzierung von Haut spielt, wurde TAp63α bisher ausschließlich in Eizellen gefunden. Dort hat es die Funktion eines Sensors, der die genetische Integrität der weiblichen Keimbahn sicherstellt. Es liegt in Eizellen in hoher Konzentration vor, allerdings in einer komplett inaktiven Form. Werden Schäden im der Erbgut der Eizelle festgestellt, so wird das Protein aktiviert und kann so den Prozess des Zelltods der Eizelle einleiten. Mutationen oder das Fehlen des p63-Genes führen zu Missbildungen während der Entwicklung und zu unvollständig ausgebildeter Haut. Im Falle von p73 gibt es ebenfalls mehrere Isoformen, wobei die Funktionen und Relevanzen der einzelnen Isoformen bisher nicht komplett geklärt werden konnten. Eine p73-negative Maus hat einen diffusen Phänotyp, der sich durch niedrige Intelligenz, fast sterile Männchen und chronische bronchiale Infektion auszeichnet. Generell sind alle Mitglieder der p53-Familie tetramere Proteine und sind nur in diesem Zustand auch aktiv. Die einzige Ausnahme stellt, wie oben beschrieben, TAp63α dar, das in einem inaktiven dimeren Zustand vorliegt und nur durch Modifikation durch zwei unabhängige Kinasen aktiviert werden kann. Dabei geht es in den tetrameren Zustand über und ist daraufhin aktiv.
Alle drei Proteine haben (anhand ihrer längsten Isoform beschrieben) eine konservierte Domänenstruktur. Am Aminoterminus befindet sich zunächst die transaktivierende-Domäne (TAD), die für Interaktionen mit transkriptionellen Koaktivatioren relevant ist. Danach folgt die stark konservierte Desoxyribonukleinsäure (DNA) bindende Domäne (DBD). Sie stellt sicher, dass der Transkriptionsfaktor sequenzspezifisch an der richtigen Stelle auf die DNA bindet. Weitergehend folgt die Tetramerisierungsdomäne (TD), welche den oligomeren Zustand des Proteins herstellt. Im Falle von p53 endet das Protein an dieser Stelle, bei p63 und p73 folgen noch das Sterile-Alpha-Motiv (SAM) und die Transkription-inhibierende Domäne (TID). Die SAM Domäne wird generell als Interaktionsdomäne beschrieben, es konnte allerdings bis dato kein Interaktionspartner gefunden werden. Die TID hat einen negativen Einfluss auf die transkriptionelle Aktivität der Proteine. Im Falle von TAp63α interagiert sie zusätzlich mit der TAD um den Dimeren Zustand zu stabilisieren.
Histon Acetylasen
Die Acetylierung von Histonen ist neben deren Methylierung die wichtigste Modifikation. Sie ist essenziell für die Transkription innerhalb aller eukaryontischen Lebewesen, da sie durch die Modifikation von Histonen die DNA für die DNA-Polymerase II zugänglich macht. Es gibt insgesamt fünf verschiedene, nicht näher miteinander verwandte Familien von Histonacetylasen. Diese Studie beschäftigt sich ausschließlich mit der KAT3 Familie, bestehend aus den Proteinen p300 und CBP. Beide sind hochgradig konserviert, in gefalteten Bereichen der Proteine erreicht die Sequenzidentität fast 100%. Beide Proteine scheinen sehr ähnliche Aufgaben zu erfüllen, die jedoch nicht komplett identisch sind. Die Fehlfunktion von einem Allel von CBP führt zum Krankheitsbild des Rubinstein-Taybi-Syndrom (RTS), während ein Mangel an p300 sich in Mäusen auf das Gedächtnis auswirkt. Der komplette Verlust beider Allele eines der Proteine ist immer tödlich, genauso wie auch Verlust jeweils eines Allels bei beiden Proteinen. Insgesamt vier unabhängige Domänen in p300/CBP sind in der Lange die transaktivierende Domänen der p53-Familie zu binden. Bei zwei der Domänen handelt es sich um Zinkfinger-Proteine (Taz1 und Taz2), die anderen beiden sind kleine, ausschließlich α-helikale Domänen (Kix und IBiD).
Diese Studie beschäftigt sich mit der Lösung von Strukturen von der transaktivierenden Domäne von p63 und p73 mit der p300-Domäne Taz2. Außerdem wurden die Auswirkungen von direkten Acetylierungen von TAp63α charakterisiert und der Effekt von einem potenten p300/CBP Inhibitor auf Oozyten unter genotoxischem Stress analysiert. Zusätzlich wurde die Phosphorylierungskinetiken von Tap63α wärend der Aktivierung durch Kinasen untersucht.
...
In this thesis the integral membrane protein diacylglycerol kinase (DAGK) from E.coli is investigated with solid-state NMR. The aim is to gain an insight into the enzyme’s mechanism through integration of kinetic, structural and dynamic data. The biological function of DAGK is the transfer of the γ-phosphate group from Mg*ATP to diacylglycerol (DAG) building phosphatidic acid (PA)[6] as port of the membrane-derived oligosaccharide cycle[31,34]. Surprisingly, DAGK does not share structural or sequential similarities with other kinases[12]. Typical sequence motives found in other kinases, which catalyze phosphoryl transfer reactions, are not found[13]. In its physiological form DAGK is a homo-trimer with nine transmembrane helices, three catalytic centers and a size of 39.6 kDa.
First, the set-up of a real-time 31P MAS NMR experiment is shown. This experiment allows measuring in real-time the simultaneous ATP hydrolysis in the aqueous phase and lipid substrate phos-phorylation in the membrane phase with atomic resolution under magic angle spinning[56]. After fast transfer of the sample into the NMR spectrometer the enzymatic reaction is started with a temperature jump. This approach of real-time MAS NMR in a dual-phase system was demonstrated for the lipid substrate analogs dioleoyl- (DOG) and dibutyrylglycerol (DBG), with a C8 and C4 aliphatic chain, respectively. The combination of 31P direct and cross polarization functions as a dynamic filter. In the 31P direct polarized experiment nuclei in both phases are detected, while in the 31P cross polar-ized experiment, only nuclei in the membrane phase are detected. Rates for substrate turnover, i.e. degradation of γP-, βP, αP-ATP and build-up of βP-, αP-ADP, free phosphate as side reaction, and PA are obtained, which reveal a Michaelis-Menten behavior with regard to Mg*ATP and DBG. Here Mg*ATP and DBG follow a random-equilibrium model, where every substrate can bind indepen-dently from the other substrate. Analyses of the peak integrals from educts and products of the enzymatic reaction, revealed the stoichiometry of the reaction: 1.5 ATP molecules are used to phos-phorylate one DBG molecule. The excess of ATP is attributed to the basal ATPase activity. Further-more, experiments with ATPγS, usually regarded as a non-hydrolysable ATP-analog, where carried out. Surprisingly, DAGK hydrolyzes ATPγS and also transfers the thio-phosphate group to the lipid acceptor DBG, which points to a certain degree of plasticity in the active center. A phosphorylated enzyme intermediate was not detected. These results suggest the building of a ternary complex of Mg*ATP, DBG and DAGK performing a direct-phosphoryl transfer reaction, without passing through a phosphorylated enzyme intermediate. Experiments with the transition state analog ortho-vanadate (Vi) showed a decoupling of the ATP hydrolysis activity from lipid substrate phosphorylation. This indicates a specific transfer site for the γ-phosphate group from ATP to DAG, which can be blocked by Vi.
A general disadvantage of NMR spectroscopy compared to other spectroscopic methods is its inherent low sensitivity. One possible starting point for the improvement of signal-to-noise per unit time is the reduction of the spin-lattice relaxation time of protons[209]. Usually 95 % of the experi-mental time is required for the relaxation of the 1H to equilibrium. The addition of paramagnetic species can be used to reduce the 1H T1[233]. In a comprehensive study four different paramagnetic agents were tested: Cu2+-EDTA, Cu2+-EDTA-tag, Gd3+-TTAHA and Gd3+-DOTA. The titration of these paramagnetic complexes showed the principle feasibility of this approach, but differences between the tested species exist. The most promising complex is Gd3+-DOTA which, at a concentration of 2 mM, causes a 10-time improvement of signal-to-noise ratio per unit time. This allowed measuring 2D 13C-13C correlation spectra of proteoliposomes in one tenth of the usual required experimental time (i.e. 10 hours vs. 4 days) with good signal-to-noise.
For the investigation of structural or dynamic changes in the protein upon substrate interaction with MAS NMR, the spectral properties CP efficiency and resolution of the DAGK in liposomes needed to be improved. The most critical step during sample preparation is the reconstitution of the membrane protein from detergent micelles into a membrane of synthetic lipids under detergent removal. For this procedure the important criteria are enzymatic activity, measured in a coupled ATPase assay[55], and homogeneity of the proteoliposomes, which was tested e.g. on a discontinuous sucrose step gradient. Therefore an extensive study was carried out, in which different detergents, lipids and lipid mixtures, techniques for detergent removal and different protein-to-lipid ratios were tested. A direct correlation between high ATPase activity and good resolution was not found. Moreover, active DAGK in a mixture of DMPC and cholesterol, which emulates the membrane features of a membrane containing DAG, showed the best CP efficiency and resolution.
The assignment of the protein backbone and amino acid side chains the first mandatory step towards the investigation of structural and dynamical features influencing and defining the enzymatic mechanism by MAS NMR. As the assignment procedure is very time consuming for a total protein, a special labeling scheme for DAGK was developed, which allows assigning most of the protein areas presumably involved in enzyme catalysis. The assignment of DAGK with solution NMR[132] was not transferable to the MAS NMR spectra. Most important for the assignment process were the unique pairs[335], two consecutive amino acids which only appear once in the amino acid sequence. These unique pairs served as anchor points. Five different multinuclear MAS NMR experiments (DARR, NCO, NCA, NCACX, NCOCX) were required for the sequential assignment. It was possible to assign 35 % of the total amino acid sequence with one sample and 8 experiments acquired at 850 MHz. The secondary structure analysis showed subtle differences to the DAGK assignment with solution NMR[132], which can be attributed to the different environment in lipid bilayers and detergent micelles.
Data about structural and dynamical changes under substrate interaction can reveal details about the enzymatic mechanism. Therefore changes in chemical shift in 2D heteronuclear correlation experiments in the apo-state and under substrate saturated conditions with the substrates Mg*AMP-PNP, a non-hydrolysable ATP-analog, DOG, a mixture of Mg*AMP-PNP and DOG as well as inhibited by Vi were recorded. The most significant peak changes were observed at the interface membrane-cytoplasm as well as the the N-terminal amphipathic helix. The residues revealing chemical shift perturbations correlate with conserved residues or such residues, for which importance for catalysis and/or folding could be shown in mutation studies[8]. Especially noticeable were the changes at the amino acids Asn 72, Lys 64, His 87, Tyr 86 and Asp 95.
Beside changes of the chemical shift, changes of line width or signal doubling were observable. These changes can point to a correlation with dynamic reorientations in the μs-ms time regime, which are most relevant for enzymatic processes. The protein backbone dynamics in the apo-state as well as saturated with the substrates or inhibited with Vi were investigated with a 15N-CODEX experiment, which is based on the reorientation of the CSA tensor upon dynamical changes[350]. Specific effects of the different substrates or analogs on the protein backbone dynamic were revealed complementing the structural data and the chemical shift perturbation experiments.
The formation and maintenance of a defined three-dimensional structure is a prerequisite for most proteins in order to fulfill their function in the native context. However, there are proteins, which are intrinsically unstructured and thus natively unfolded. In addition, the misfolding and aggregation of many proteins can lead to severe diseases. The investigation of non-native states of proteins significantly contributes to the understanding of protein folding and misfolding. Nuclear magnetic resonance (NMR) spectroscopy is the only known technique that can provide information on structure and dynamics of non-native states of proteins at atomic resolution. Unfolded and non-native states of proteins have to be treated as ensembles of rapidly interconverting conformers and their observed properties are ensemble and time averaged. In this thesis, hen egg white lysozyme (HEWL) and mutants thereof have been investigated by NMR spectroscopy. The reduction of its four disulfide bridges and the successive methylation of the cysteine residues renders HEWL permanently non-native (‘HEWL-SMe’). Alternatively, the exchange of the eight cysteines for alanines results in very similar states (‘all-Ala-HEWL’). Under these conditions, HEWL-SMe and all-Ala-HEWL do not resemble random coil conformations, but exhibit residual secondary and tertiary structure. The presence of hydrophobic clusters and long-range interactions around the proteins six tryptophan residues and the modulation of these properties by single-point mutants has been observed. For the NMR spectroscopic investigation, HEWL has been isotopically labelled in E. coli by expression into inclusion bodies. After purification, the 1HN, 15NH, 13Calpha, 13Cbeta, 13C’, 1Halpha and 1Hbeta resonances of HEWL-SMe and all-Ala-HEWL have been assigned almost completely using three-dimensional NMR experiments. The analysis of secondary chemical shifts revealed regions in the proteins sequence — particularly around the six tryptophan residues—with significantly populated alpha-helix like conformations. In order to further elucidate the influence of the tryptophan side chains, a set of two new pulse sequences has been developed that allowed for the successful assignment of the 13Cg, 15Ne and 1HNe resonances in these side chains. This knowledge was eventually exploited in the interpretation of two-dimensional 15N-1H photo-CIDNP spectra, which revealed a differential solvent accessibility of the tryptophan residues in all-Ala-HEWL but not in the single point mutant W62G-all-Ala-HEWL. In addition, heteronuclear R2 relaxation rates have been determined for the indole 15Ne nuclei of all-Ala-HEWL and W62G. While in the wild-type like all-Ala-HEWL, the rates are different among the six tryptophan residues, in W62G they are more uniform. Together with relaxation data from the amide backbone, these results indicate the significant destabilization of the hydrophobic clusters in the absence of W62. In contrast, in the W108G mutant the profile of the R2 relaxation rates was not found to be significantly altered. No evidence was found by R1rho relaxation rates and relaxation dispersion measurements for conformational exchange on slower (micro- to millisecond) timescales. Residual dipolar couplings have been determined for non-native HEWL in order to retrieve structural information of these states. The differences of the W62G and the wild-type like non-native HEWL is also picked up in NH-RDCs of these proteins aligned in polyacrylamide gels. Significant positive RDCs are observed in the regions of the hydrophobic clusters in all-Ala-HEWL, but to a much lesser degree in W62G. So far, all attempts to simulate RDCs from generated non-native ensembles failed even when including long-range contacts or specific phi/psi backbone angle propensities. However, the measured RDCs can be used to cross-validate structural ensembles of non-native HEWL generated by molecular dynamics simulations that are based on restraints from the other experimental data, such as the differential solvent accessibilities from the photo-CIDNP experiments and the data on the hydrophobic clustering gained from the combined mutational and relaxation studies. Finally, non-native HEWL has been investigated for the first time using two-dimensional NMR in organic solvents, which are able to induce secondary structures and ultimately lead to amyloid formation. Under these conditions severe line broadening was observed, which was attributed to exchange between different — mostly a-helical— conformations. In summary, in this thesis methods have been developed, optimized and successfully applied for the structural and dynamical characterization of non-native states of proteins and the effect of single-point mutants on the properties of such ensembles has been investigated. Data has been gained that can considerably contribute to the further elucidation of the nature of non-native states of HEWL by molecular dynamics simulations.
Integral membrane proteins (IMPs) account for 20-40% of all open reading frames in fully sequenced genomes and they are target of approximately 60% of all modern drugs. So far, cellular expression systems are often very insufficient for the high-level production of IMPs. Toxic effects, instability or formation of inclusion bodies are frequently observed effects that prevent the synthesis of sufficient amounts of functional protein. I have successfully established an individual cell-free (CF) expression system to overcome these IMP synthesis difficulties. The CF system was established in two different expression modes. If no hydrophobic compartment is provided, the IMPs precipitate in the reaction mixture. Interestingly, these insoluble proteins are found to differ from inclusion bodies as they readily solubilize in mild detergents and the bacterial small multi drug transporter EmrE, expressed in the insoluble mode was shown to reconstitute into liposomes in an active form. Alternatively, IMPs can be synthesized in a soluble way by supplementing the CF system with detergents. A comprehensive overview of 24 commonly used detergents was provided by analyzing their impact on the CF system as well as their ability to keep three structurally very different proteins in solution. The class of long chain polyoxyethylene-alkyl-ethers turned out to be most suitable for soluble expression of a-helical EmrE, the bacterial b-barrel type nucleoside transporter Tsx and the porcine vasopressin receptor type 2, resulting in several mg of protein per mL of reaction mixture. So far IMPs have almost completely been excluded from solution nuclear magnetic resonance (NMR) analyses. I could demonstrate that CF expression enables efficient isotopic labeling of IMPs for NMR analysis and further facilitates selective labeling strategies with combinations of 13C and 15N enriched amino acids that have not been feasible before. Four different G-protein coupled receptors (GPCRs) were successfully CF expressed in preparative scale and for the human endothelin B receptor (ETB), ligand binding ability was observed. A series of truncated ETB derivatives containing nested terminal deletions have been CF produced and functionally characterized. The core area essential for Endothelin-1 binding as well as a central region responsible for ETB oligomer formation was confined to a 39 amino acid fragment including the proposed transmembrane segment 1. The binding constant (KD) of ETB was determined to 6 nM for circular ET-1 by SPR and 29 nM for linear ET-1 by TIRFS. This data indicate a large potential of the established individual CF expression system for functional IMP synthesis.
Eine große Zahl natürlicher sekundärer Metabolite sind kleine und strukturell oft sehr verschiedene Polypeptide und Polyketide. Diese bioaktiven Substanzen haben im allgemeinen ein breit aufgestelltes therapeutisches Potential und werden von verschiedenen bakteriellen Stämmen und Pilzen biosynthetisiert. Sie sind sowohl biologisch, als auch therapeutisch wichtig als Cytostatika, Immunsuppressiva und Antibiotika mit einem sehr großen antibakteriellen und antiviralen Potential. Diese oft äußerst komplexen Polypeptide und Polyketide werden von modular aufgebauten Megaenzymen in mehrstufigen Mechanismen synthetisiert. Für die Synthese dieser Peptide sind sehr große Proteincluster verantwortlich, die meistens aus einer begrenzten Anzahl sehr großer, Multidomänen umfassenden, Superenzyme aufgebaut werden. Diese Proteincluster mit einem Molekulargewicht bis in den Bereich von MegaDalton werden als nicht-ribosomale Peptidsynthetasen (NRPS) und Polyketidsynthetasen (PKS) bezeichnet. Die NRPS Systeme zeichnen sich dadurch aus, daß für die biosynthetisierten Polypeptide keine Information in Form von Nukleinsäuren wie DNA oder RNA kodiert (Walsh, C.T., 2004; Sieber & Marahiel, 2005). Für die Synthese der Polypeptide ist eine Aktivierung der einzelnen Bausteine, der Aminosäuren, durch Amino-acyl-adenylierung notwendig. Im Anschluß an die Aktivierung, wird die aktivierte Aminosäure über einen Thioester gebunden weitertransportiert. Die Thioesterbildung erfolgt an Cysteaminthiolgruppen intrinsischer 4’-Phosphopantethein-kofaktoren. Eine Modul einer NRPS stellt eine geschlossene Einheit zum Einbau einer Aminosäure mit einer hohen Spezifität für das Substrat und die biosynthetische Reaktion dar. Diese Module sind aus Domänen aufgebaut, die definierte Funktionen haben und mittels flexibler Linker miteinander verbunden sind. Die Domänen werden nach ihrer Funktion unterschieden. Die Acyl-adenylierung oder Aktivierung eines Substrates, beispielsweise einer Aminosäure, erfolgt durch die A-Domänen. Die Peptidyl- oder Acyltransportfunktion der aktivierten Substrate wird durch Thioester-domänen (T-Domäne), auch PCP (peptidyl carrier domain) genannt, bewältigt. Die Biosynthese der Kopplungsreaktion, beispielsweise die Ausbildung der Peptidbindung in NRPS Systemen, erfolgt an den Kondensations-Domänen (C-Domäne). Für die Substratspezifität eines Synthesemoduls sind die A-Domänen verantwortlich, welche die Aktivierung eines Substrat durch ATP-Hydrolyse ermöglichen. In NRPS Systemen sind auch Zyklisierungsreaktionen, durchgeführt von Cyclase-Domänen (Cy-Domänen), L/D-Epimerase-funktionen (E-Domänen) und N-Methylierungen (M-Domänen) beschrieben. So wird in Tyrocidin A an zwei Positionen spezifisch Phenylalanin in die D-Form epimerisiert und anschließend in der Peptidbiosynthese verwendet. Die Interaktion und Erkennung zwischen den multi-modularen Superenzymen, zum korrekten Aufbau der kompletten Synthetase, wurden in letzter Zeit Kommunikations-Domänen (COM-Domänen) beschrieben. Wie die aufgebaute Synthetase die korrekte Sequenz der biosynthetischen Reaktionsschritte sicherstellt ist nicht bekannt. Die enorme Diversität biosynthetischer Reaktionen in NRPS Systemen und die hohe Substratvielfalt in den verschiedensten Synthetasen unterschiedlicher Stämme eröffnet ein weites Feld für mögliche Neukombinationen von Modulen und Modifikationen von Produkten, um neue bioaktive Polypeptide mit antibiotischen Eigenschaften durch die Gestaltung neuer biosynthetischer Reaktionswege zu erhalten. Die Biosyntheseprodukte der NRPS und PKS Systeme lassen sich Gruppen kategorisieren wie Peptidantibiotika, beispielsweise beta-Lactame und makrozyklischer Polypeptide. Weitere Gruppen sind die makrozyklischen Lactone, beispielsweise Polyene und Makrolide, aromatische Verbindungen, wie Chloramphenicol, und Chinone (Tetracyclin). Die näher diskutierten Beispiele sind die antibakteriellen Polypeptide Surfactin und Tyrocidin A. Surfactin ist ein antibakteriell wirkendes makrozyklisches Lipoheptapeptid, welches von Bacillus subtilis synthetisiert wird und ein enormes antivirales Potential besitzt. Tyrocidin A ist ein antibakteriell wirkendes makrozyklisches Decapeptid und wird von Bacillus brevis und Brevisbacillus parabrevis synthetisiert. Zusätzlich werden viele bakterielle Toxine ebenfalls durch solche Systeme multi-modularer Synthetasen erzeugt. Ein Beispiel ist das Polyketid Vibriobactin, das Toxin des humanpathogenen Bakterium Vibrio cholerae. Ein zunehmendes Problem der wachsenden Weltbevölkerung moderner Gesellschaften und in den Entwicklungsländern ist die wachsende Zahl multiresistenter Bakterienstämme. Die starke Progression in der Entwicklung von Resistenzen gegen Antibiotika ist auch Gegenstand des aktuellen WHO-Reports (2006). Alarmierend ist die beschleunigte Resistenzentwicklung gegen die sogenannten Reserveantibiotika Vancomycin und Ceftazidim. Ein umfangreicheres Verständnis der Interaktion zwischen Domänen in einem Modul und zwischen Modulen eines NRPS Systems ist Grundlage für die Neukombination unterschiedlicher Module zur erfolgreichen Gestaltung neuer Biosynthesen. Da die meisten dieser Biosynthesen oder die Synthese alternativer Substanzen nicht in der Organischen Chemie zu realisieren sind oder die Produkte zu teuer wären, um diese in großen Mengen zu erzeugen, muß das Ziel sein die NRPS und PKS Systeme in ihrem modularen Aufbau und ihre Interaktion zu verstehen, um alternative Antibiotika biosynthetisch herzustellen. Peptidyl Carrier Proteine (PCPs) sind kleine zentrale Transport-Domänen, integriert in den Modulen nicht-ribosomaler Peptidsynthetasen (NRPSs). PCPs tragen kovalent über eine Phosphoesterbindung einen aus dem Protein herausragenden 4’-phosphopantetheinyl (4’-PP) Kofaktor. Der 4’-PP Kofaktor ist an der Seitenkette eines hochkonservierten Serins gebunden, welche ein zentraler Bestandteil der Phosphopantethein-Erkennungs-Sequenz ist. Die Erkennungssequenz ist homolog in vielen Proteinen mit ähnlicher Funktion, inklusive Acyl Carrier Proteinen (ACPs) der Fettsäuresynthetasen (FAS) und der Polyketidsynthetasen (PKS). Die Thiolgruppe des 4’-PP Kofaktors dient zum aktiven Transport der Substrate und der Intermediate der NRPS Systeme. Die generelle Organisation und die Kontrolle der exakt aufeinander folgenden Reaktionsschritte in der Peptidsynthetase, ist die entscheidende Frage für die Funktion des Proteinclusters (assembly line mechanism). In Modulen der NRPS Systeme folgen die PCP-Domänen C-terminal auf die Adenylierungsdomänen (A-Domäne). Die Aufgabe der A-Domänen ist die Selektion and die Aktivierung einer spezifischen Aminosäure für die „assembly line“. Die eigentliche Bildung der Peptidbindung erfolgt an der Kondensations-Domäne (C-Domäne). Der Transfer der Peptidintermediate und der aktivierten Aminosäuren zwischen A-Domänen und C-Domänen ist Aufgabe der PCPs. Um diese Funktion erfüllen zu können, ist eine große Bewegung in PCPs, bzw. des 4’-PP Kofaktors notwendig, welche als „swinging arm model“ (Weber et al., 2001) beschrieben wurde. Die PCPs koordinieren damit die Peptidbiosynthese während sie mit diversen Domänen der Synthetasen spezifisch wechselwirken müssen. Die molekularen Mechanismen des Transportes wurden bisher allerdings nicht untersucht. Eine Dynamik der Transport-Domänen wurde bereits postuliert (Kim & Prestegard, 1989; Andrec et al., 1995), konnte bisher aber nicht gezeigt werden (Weber et al., 2001). Interessanterweise zeigt sowohl apo-PCP (ohne den kovalent gebundenen 4’-PP Kofaktor) also auch holo-PCP langsamen chemischen Austausch, der als jeweils zwei stabile Konformationen beschrieben werden konnte. Diese jeweils zwei stabilen Zustände, welche sich im Austausch befinden, wurden als A und A*, für apo-PCP, und entsprechend H und H* für holo-PCP bezeichnet. Während der A- und der H-Zustand sich sowohl voneinander als auch von den entsprechenden A* und H*-Zuständen unterscheiden und spezifisch für die apo- und die holo-Form von PCP sind, ist die kalkulierte Struktur vom A*-Zustand größten Teils identisch mit der des H*-Zustandes. Die erhaltenen NMR-Strukturen des A-Zustandes, des H-Zustandes und des gemeinsamen A/H-Zustandes beschreiben in ihrer Gesamtheit ein neues Modell für ein allosterie-kontrolliertes System dualer konformationeller Zwei-Zustands-Dynamik. Zu dem beobachteten konformationellen Austausch der PCP-Domäne, konnte die Bewegung des 4’-PP Kofaktors koordiniert werden. Die Bewegung des 4’-PP Kofaktors in Verbindung mit dem konformationellen Austausch der PCP-Domäne charakterisiert die Interaktion mit katalytischen Domänen eines NRPS Moduls. Des weiteren konnte mit Hilfe des Modells die Wechselwirkung mit externen Interaktionspartnern, wie der Thioesterase II und der 4’-PP Transferase, untersucht werden. Die externe Thioesterase II der Surfactin-Synthetase (SrfTEII) von Bacillus subtilis ist ein separat expremiertes 28 KDa Protein. Sie gehört zur Familie der alpha/beta-Hydrolasen und ist verantwortlich für die Regenerierung falsch beladener 4’- PP Kofaktoren der Peptidyl Carrier Domänen. Die SrfTEII wurde mittels Lösungs-NMR untersucht, die Resonanzen wurden zugeordnet, erste strukturelle Modelle konnte berechnet werden und das Interaktionsverhalten mit verschiedenen modifizierten Kofaktoren und PCPs wurde analysiert. Die Spezifität der Substraterkennung durch die SrfTEII kann beschrieben werden. Interessanterweise zeigt auch die SrfTEII Doppelpeaks für einzelne Aminosäuren, diese können als Indikator für eine spezifische Substraterkennung durch das Enzym verwendet werden und helfen den funktionellen Unterschied zwischen der SrfTEI-Domäne und SrfTEII zu verstehen.
According to the World Health Organization (WHO) bacterial resistance to antibiotic drug therapy is emerging as a major public health problem around the world. Infectious diseases seriously threaten the health and economy of all countries. Hence, the preservation of the effectiveness of antibiotics is a world wide priority. The key to preserving the power of antibiotics lies in maintaining their diversity. Many microorganisms are capable of producing these bioactive products, the so called antibiotics. Specifically in microorganisms, polyketide synthases (PKS) and non-ribosomal peptide synthases (NRPS) produce these natural bioactive compounds. Besides being used as antibiotics these non-ribosomal peptides and polyketides display an even broader spectrum of biological activities, e.g. as antivirals, immunosuppressants or in antitumor therapy. The wide functional spectrum of the peptides and ketides is due to their structural diversity. Mostly they are cyclic or branched cyclic compounds, containing non-proteinogenic amino acids, small heterocyclic rings and other unusual modifications such as epimerization, methylation, N‐formylation or heterocyclization. It is has been shown that these modifications are important for biological activity, but little is known about their biosynthetic origin.
PKS and NRPS are multidomain protein assembly lines which function by sequentially elongating a growing polyketide or peptide chain by incorporating acyl units or amino acids, respectively. The growing product is attached via a thioester linkage to the 4’-phosphopantetheine (4’-Ppant) arm of a holo acyl carrier protein (ACP) in PKSs or holo peptidyl carrier protein (PCP) in NRPSs and is passed from one module to another along the chain of reaction centers. The modular arrangement makes PKS and NRPS systems an interesting target for protein engineering. More than 200 novel polyketide compounds have already been created by module swapping, gene deletion or other specific manipulations. Unfortunately, however, engineered PKS often fail to produce significant amounts of the desired products. Structural studies may faciliate yield improvement from engineered systems by providing a more complete understanding of the interface between the different domains. While some information about domain-domain interactions, involving the most common enzymatic modules, ketosynthase and acyltransferase, is starting to emerge, little is known about the interaction of ACP domains with other modifying enzymes such as methyltransferases, epimerases or halogenases.
To further improve the understanding of domain-domain interactions this work focuses on the curacin A assembly line. Curacin A, which exhibits anti-mitotic activity, is from the marine cyanobacterium Lyngbya majuscula. This outstanding natural product contains a cyclopropane ring, a thiazoline ring, an internal cis double bond and a terminal alkene. The biosynthesis of curacin A is performed by a 2.2 Mega Dalton (MDa) hybrid PKS-NRPS cluster. A 10-enzyme assembly catalyzes the formation of the cyclopropane moiety as the first building block of the final product. Interestingly, for these enzymes the substrate is presented by an unusual cluster of three consecutive ACPs (ACPI,II,III). Little is known about the function of multiple ACPs which are supposed to increase the overall flux for enhanced production of secondary metabolites.
The first task in this work was to elucidate the structural effect of the triplet ACP repetition by nuclear magnetic resonance (NMR). The initial data show that the excised ACPI, ACPII or ACPIII proteins resulted in [15N, 1H]-TROSY spectra with strong chemical shift perturbations (CSPs), suggesting an effect on the structure. The triplet ACP domains display a high sequence identity (93- 100%) making structural investigation using usual NMR techniques due to high peak overlap impossible. To enable the investigation of the triplet ACP in its native composition we developed a powerful method, the three fragment ligation. Segmental labeling allows incorporating isotopes into one single domain in its multidomain context. As a result we could prepare the triplet ACP with only one domain isotopically labeled and therefore assign the full length protein. In this way our method paved the way to study the structural effects of the triplet ACP repetition. We could show unexpectedly, that, despite the fact that the triplet repeat of CurA ACPI,II,III has a synergistic effect in the biosynthesis of CurA, the domains are structurally independent.
In the second part of this work, we studied the structure of the isolated ACPI domain. Our results show that the CurA ACPI undergoes no major conformational changes upon activation via phosphopantetheinylation and therefore contradicts the conformational switching model which has been proposed for PCPs. Further we report the NMR solution structures of holo-ACPI and 3-hydroxyl-3-methylglutaryl (HMG)-ACPI. Data obtained from filtered nuclear overhauser effect (NOE) experiments indicate that the substrate HMG is not sequestered but presented on the ACP surface.
In the third part of this work we focussed on the protein-protein interactions of the isolated ACPI with its cognate interaction partners. We were especially interested in the interaction with the halogenase (Cur Hal), the first enzyme within the curacin A sub-cluster, acting on the initial hydroxyl-methyl-glutaryl (HMG) attached to ACPI. Primarily we studied the interaction using NMR titration and fluorescence anisotropy measurements. Surprisingly no complex between ACPI and Cur Hal could be detected. The combination of an activity assay using matrix-assisted laser desorption/ionization (MALDI) mass spectroscopy and mutational analysis revealed several amino acids of ACPI that strongly decrease the activity of CurA Hal. Mapping these mutations according to their effect on the Cur Hal activity onto the structure of HMG-ACPI displays that these amino acids surround the substrate and form a consecutive surface. These results suggest that this surface is important for Cur Hal recognition and selectivity. Our research presented herein is an excellent example for protein-protein interactions in PKS systems underlying a specific recognition process.
Der retinoid-related orphan receptor α (RORα) ist ein nukleärer Rezeptor, der nach Bindung an sein Responselement die Transkription zahlreicher Gene reguliert. Pharmazeutisches Interesse erlangt der Rezeptor vor allem durch seine Verwicklung in pathophysiologische Prozesse wie Osteoporose und Arteriosklerose sowie durch seine antiinflammatorische Wirkung, die auf der negativen Interferenz mit dem NF-κB-Signalweg beruht. Bisher konnten vier RORα-Isoformen isoliert werden, die durch alternatives Spleißen sowie durch die Regulation über unterschiedliche Promotorregionen entstehen. In verschiedenen Studien konnte eine isoformspezifische Regulation als Antwort auf pathophysiologische Veränderungen der Zellen festgestellt werden, wie beispielsweise die Induktion der RORα4-Transkription in Leberzellen infolge einer Sauerstoffunterversorgung. Um Einblicke in die Mechanismen zu gewinnen, die der spezifischen Regulation der RORα4-Expression zugrunde liegen, wurde in der vorliegenden Arbeit der RORα4-Promotor als erster Promotor einer RORα-Isoform identifiziert und analysiert.
Sechs Fragmente mit einer Länge von bis zu 5,1 kbp der aus Datenbanken entnommenen, putativen Promotorsequenz wurden in einen Reportergenvektor kloniert. Transiente Transfektionsexperimente und Reportergenanalysen deckten die Promotoraktivität der gewählten Sequenz auf.
In dem durch einen hohen Gehalt an den Nukleotiden G und C auffallenden Promotor wurden drei einzelne GC-Boxen (A, B und C) sowie eine Viererkette (Box D) und eine Tandem-GCBox (Box E) als mögliche Bindungsmotive für Sp-Transkriptionsfaktoren gefunden. Mithilfe von Kotransfektionen konnte eine Induktion der Promotoraktivität durch die Transkriptionsfaktoren Sp1 und Sp4 nachgewiesen werden, während Sp3 die Promotoraktivität in diesen Experimenten nicht beeinflusste.
Durch die gezielte Mutation oder Deletion, bzw. die Inkubation mit verschiedenen Substanzen konnten diesen GC-Boxen unterschiedliche Funktionen zugeordnet werden. Durch transiente Transfektionen stark verkürzter Promotorfragmente wurde ein für die Promotoraktivität nötiger Sequenzbereich von 170 Basenpaaren eingegrenzt. In Mutationsanalysen wurde demonstriert, dass die beiden proximalen GC-Boxen A und B für die basale Promotoraktivität essentiell sind.
Die RORα4-Promotoraktivität ließ sich zelltypabhängig durch den Phorbolester TPA induzieren. In Deletionsanalysen ließ sich dieser Effekt teilweise auf die GC-Boxen C und D zurückführen. Der distalen GC-Box E konnte ebenfalls eine Funktion zugeordnet werden. In Reportergenanalysen konnte demonstriert werden, dass sie die Induktion der Promotoraktivität durch den HDAC-Inhibitor Trichostatin A vermittelt.
Durch die Untersuchungen an den TK-luc-Konstrukten mit RORα-Responselementen konnte gezeigt werden, dass der virale Promotor aufgrund der einklonierten RORα-Responselemente sehr stark auf die Kotransfektion der RORα-Isoformen reagiert. Die Reportergenanalyse mit diesen Konstrukten stellt daher eine effiziente Methode dar, um die RORα-vermittelte Transaktivierung zu bestimmen.
Obwohl der RORα4-Promotor zahlreiche RORα-Responselemente trägt, konnte in den Kotransfektionen mit Expressionsplasmiden für die einzelnen Isoformen in keiner der drei Zelllinien eine Autoregulation gefunden werden. Ebensowenig zeigte sich ein Einfluss des putativen RORα-Liganden Melatonin auf die Promotoraktivität.
Des Weiteren wurde gezeigt, dass die RORα4-Promotoraktivität in HeLa und MCF-7-Zellen durch das cAMP-Analogon DbcAMP induzierbar ist, während in HEK 293 keine Beeinflussung der Promotoraktivität erzielt wurde. Neben der Steigerung der Promotoraktivität durch TPA, konnte mit der DbcAMP-Induktion folglich ein zweiter, zelltypabhängiger Effekt auf die RORα4-Promotoraktivität identifiziert werden.
5-LO is the key enzyme in the biosynthesis of proinflammatory leukotrienes, converting arachidonic acid to 5-HPETE, and in a second step 5-HPETE to leukotriene A4. Although the 5-LO promoter possesses characteristics of so called housekeeping genes, such as lack of TATA/CCAAT boxes and existence of several Sp1 binding sites, the 5 -LO gene is tissue specifically expressed in primarily immune competent cells of myeloid origin including granulocytes, monocytes, macrophages, mast cells and B-lymphocytes. 5-LO gene expression in MM6 and HL-60 cells is strongly induced after differentiation of the cells with TGF-beta and 1,25(OH)2D3. In some monocytic cancer cell lines, such as HL-60 TB and U937, TGF-beta and 1,25(OH)2D3 treatment are not able to activate 5-LO gene transcription. It was demonstrated, that in these cell lines the 5-LO core promoter is heavily methylated and that only demethylation by the DNA methyltransferase inhibitor 5-aza-2 deoxycytidine (Adc) upregulated the 5-LO mRNA levels. It was also shown that the histone deacetylase inhibitor TsA could induce 5-LO mRNA levels, but only in 1,25(OH)2D3/TGF-beta inducible MM6 cells. Interestingly the 1,25(OH)2D3/TGF-beta effect on 5-LO expression is reduced, when combined with TsA. Reporter gene assays revealed that 5-LO promoter activity is strongly induced after 24 h treatment with 330 nM TsA (construct N10 up to 35 fold in HeLa cells). The effect is dependent on the presence of the proximal Sp1 binding site GC4 (-53 bp to –48 bp in relation to the major TIS) in both HeLa and MM6 cells. In vitro binding of the transcription factor Sp1 to this site has been demonstrated in gel shift assays and DNase I footprints. Mutation of the binding site resulted in a loss of basal promoter activity in both 5-LO negative HeLa cells and in 5-LO positive MM6 cells, as well as in the loss of TsA inducibility. The mutational study of different Sp1 binding sites in a larger promoter context revealed the interaction or respectively the additive effect of the multiple Sp1 binding sites of the 5-LO promoter on basal as well as on TsA upregulated promoter activity. However, GC4 seems to be of special relevance for both the basal promoter activity, possibly recruiting the basal transcription machinery, as well as for the TsA induced upregulation of 5-LO promoter activity. TsA does not alter the protein expression levels of Sp1 and Sp3 as investigated in Western blot analysis, neither in HeLa nor in MM6 cells. DNA affinity purification assays revealed that TsA had no effect on the DNA affinity of Sp1 or Sp3. In vitro binding of both Sp1 and Sp3 to the 5-fold GC box, GC4 and GC5 was demonstrated by DAPA analysis, but histone deacetylase inhibition did not change the associated protein amounts. Finally, in vivo binding of Sp1 and Sp3 was investigated in chromatin immunoprecipitation assay (ChIP) in MM6 cells. TsA clearly induced the association of both proteins to the promoter area surrounding the TIS. Upon TsA treatment also RNA polymerase II binding to the area surrounding the TIS (-318 to +52 bp) was increased and even initiated in the more distal promoter parts –1049 to –292 bp, which are negatively regulated in reporter gene assays. Interestingly histone H4 is already highly acetylated without TsA treatment and the acetylation status of H4 remains unchanged after histone deacetylase inhibition, indicating an open chromatin structure of the 5-LO gene in MM6 cells. In a cotransfection study with Sp1 and Sp3, the transactivating potential of factors was investigated and in accordance with the ChIP data, Sp1 and Sp3 increased the promoter activity, but only after TsA treatment. In gel shift assays, the influence of DNA methylation on Sp1 binding was investigated. The results indicate different roles for the three proximal promoter sites. Whereas Sp1 binding to the 5-fold GC box and GC4 is impaired by DNA methylation, binding to GC5 is even increased. A cotransfection study with methylated 5-LO promoter constructs and the murine methyl-CpG binding proteins suggest MBD1 involvement in the regulation of the 5-LO promoter. Since in gel shifts Sp1 binding is inhibited by DNA methylation, at least to the 5-fold GC box and the activating element GC4, and similarly the mutation/deletion of the same sites strongly reduces or inhibits promoter activity, it is likely to assume, that the loss of promoter activity after in vitro methylation is in the first place due to impaired Sp1/Sp3 binding. Together the data underline the importance and complexity of Sp1/Sp3 binding to the GC rich sites in the regulation of 5-LO promoter activity in response to the histone deacetylase inhibitor TsA as well as in respect to DNA methylation.