Zentrum für Biomolekulare Magnetische Resonanz (BMRZ)
Refine
Year of publication
Document Type
- Article (111)
- Preprint (2)
- Part of a Book (1)
Has Fulltext
- yes (114)
Is part of the Bibliography
- no (114)
Keywords
- RNA (9)
- SARS-CoV-2 (8)
- Solution-state NMR (8)
- NMR spectroscopy (6)
- structural biology (6)
- rna (5)
- Biophysical chemistry (4)
- COVID19-NMR (4)
- NMR (4)
- rhodesain (4)
Institute
- Zentrum für Biomolekulare Magnetische Resonanz (BMRZ) (114)
- Biochemie und Chemie (63)
- Exzellenzcluster Makromolekulare Komplexe (39)
- Biochemie, Chemie und Pharmazie (30)
- Biowissenschaften (28)
- Sonderforschungsbereiche / Forschungskollegs (25)
- Medizin (4)
- Frankfurt Institute for Advanced Studies (FIAS) (3)
- Georg-Speyer-Haus (3)
- Pharmazie (3)
A plethora of modified nucleotides extends the chemical and conformational space for natural occurring RNAs. tRNAs constitute the class of RNAs with the highest modification rate. The extensive modification modulates their overall stability, the fidelity and efficiency of translation. However, the impact of nucleotide modifications on the local structural dynamics is not well characterized. Here we show that the incorporation of the modified nucleotides in tRNAfMet from Escherichia coli leads to an increase in the local conformational dynamics, ultimately resulting in the stabilization of the overall tertiary structure. Through analysis of the local dynamics by NMR spectroscopic methods we find that, although the overall thermal stability of the tRNA is higher for the modified molecule, the conformational fluctuations on the local level are increased in comparison to an unmodified tRNA. In consequence, the melting of individual base pairs in the unmodified tRNA is determined by high entropic penalties compared to the modified. Further, we find that the modifications lead to a stabilization of long-range interactions harmonizing the stability of the tRNA’s secondary and tertiary structure. Our results demonstrate that the increase in chemical space through introduction of modifications enables the population of otherwise inaccessible conformational substates.
Ubiquitin (Ub)-mediated regulation of plasmalemmal ion channel activity canonically occurs via stimulation of endocytosis. Whether ubiquitination can modulate channel activity by alternative mechanisms remains unknown. Here, we show that the transient receptor potential vanilloid 4 (TRPV4) cation channel is multiubiquitinated within its cytosolic N-terminal and C-terminal intrinsically disordered regions (IDRs). Mutagenizing select lysine residues to block ubiquitination of the N-terminal but not C-terminal IDR resulted in a marked elevation of TRPV4-mediated intracellular calcium influx, without increasing cell surface expression levels. Conversely, enhancing TRPV4 ubiquitination via expression of an E3 Ub ligase reduced TRPV4 channel activity but did not decrease plasma membrane abundance. These results demonstrate Ub-dependent regulation of TRPV4 channel function independent of effects on plasma membrane localization. Consistent with ubiquitination playing a key negative modulatory role of the channel, gain-of-function neuropathy-causing mutations in the TRPV4 gene led to reduced channel ubiquitination in both cellular and Drosophila models of TRPV4 neuropathy, whereas increasing mutant TRPV4 ubiquitination partially suppressed channel overactivity. Together, these data reveal a novel mechanism via which ubiquitination of an intracellular flexible IDR domain modulates ion channel function independently of endocytic trafficking and identify a contributory role for this pathway in the dysregulation of TRPV4 channel activity by neuropathy-causing mutations.
We compiled an NMR data set consisting of exact nuclear Overhauser enhancement (eNOE) distance limits, residual dipolar couplings (RDCs) and scalar (J) couplings for GB3, which forms one of the largest and most diverse data set for structural characterization of a protein to date. All data have small experimental errors, which are carefully estimated. We use the data in the research article Vogeli et al., 2015, Complementarity and congruence between exact NOEs and traditional NMR probes for spatial decoding of protein dynamics, J. Struct. Biol., 191, 3, 306–317, doi:10.1016/j.jsb.2015.07.008 [1] for cross-validation in multiple-state structural ensemble calculation. We advocate this set to be an ideal test case for molecular dynamics simulations and structure calculations.
Folding of G-protein coupled receptors (GPCRs) according to the two-stage model (Popot, J. L., and Engelman, D. M. (1990) Biochemistry 29, 4031–4037) is postulated to proceed in 2 steps: partitioning of the polypeptide into the membrane followed by diffusion until native contacts are formed. Herein we investigate conformational preferences of fragments of the yeast Ste2p receptor using NMR. Constructs comprising the first, the first two, and the first three transmembrane (TM) segments, as well as a construct comprising TM1–TM2 covalently linked to TM7 were examined. We observed that the isolated TM1 does not form a stable helix nor does it integrate well into the micelle. TM1 is significantly stabilized upon interaction with TM2, forming a helical hairpin reported previously (Neumoin, A., Cohen, L. S., Arshava, B., Tantry, S., Becker, J. M., Zerbe, O., and Naider, F. (2009) Biophys. J. 96, 3187–3196), and in this case the protein integrates into the hydrophobic interior of the micelle. TM123 displays a strong tendency to oligomerize, but hydrogen exchange data reveal that the center of TM3 is solvent exposed. In all GPCRs so-far structurally characterized TM7 forms many contacts with TM1 and TM2. In our study TM127 integrates well into the hydrophobic environment, but TM7 does not stably pack against the remaining helices. Topology mapping in microsomal membranes also indicates that TM1 does not integrate in a membrane-spanning fashion, but that TM12, TM123, and TM127 adopt predominantly native-like topologies. The data from our study would be consistent with the retention of individual helices of incompletely synthesized GPCRs in the vicinity of the translocon until the complete receptor is released into the membrane interior.
Nuclear magnetic resonance (NMR) spectroscopy is a powerful and popular technique for probing the molecular structures, dynamics and chemical properties. However the conventional NMR spectroscopy is bottlenecked by its low sensitivity. Dynamic nuclear polarization (DNP) boosts NMR sensitivity by orders of magnitude and resolves this limitation. In liquid-state this revolutionizing technique has been restricted to a few specific non-biological model molecules in organic solvents. Here we show that the carbon polarization in small biological molecules, including carbohydrates and amino acids, can be enhanced sizably by in situ Overhauser DNP (ODNP) in water at room temperature and at high magnetic field. An observed connection between ODNP 13C enhancement factor and paramagnetic 13C NMR shift has led to the exploration of biologically relevant heterocyclic compound indole. The QM/MM MD simulation underscores the dynamics of intermolecular hydrogen bonds as the driving force for the scalar ODNP in a long-living radical-substrate complex. Our work reconciles results obtained by DNP spectroscopy, paramagnetic NMR and computational chemistry and provides new mechanistic insights into the high-field scalar ODNP.
Autophagy is an important survival mechanism that allows recycling of nutrients and removal of damaged organelles and has been shown to contribute to the proliferation of acute myeloid leukemia (AML) cells. However, little is known about the mechanism by which autophagy- dependent AML cells can overcome dysfunctional autophagy. In our study we identified autophagy related protein 3 (ATG3) as a crucial autophagy gene for AML cell proliferation by conducting a CRISPR/Cas9 dropout screen with a library targeting around 200 autophagy-related genes. shRNA-mediated loss of ATG3 impaired autophagy function in AML cells and increased their mitochondrial activity and energy metabolism, as shown by elevated mitochondrial ROS generation and mitochondrial respiration. Using tracer-based NMR metabolomics analysis we further demonstrate that the loss of ATG3 resulted in an upregulation of glycolysis, lactate production, and oxidative phosphorylation. Additionally, loss of ATG3 strongly sensitized AML cells to the inhibition of mitochondrial metabolism. These findings highlight the metabolic vulnerabilities that AML cells acquire from autophagy inhibition and support further exploration of combination therapies targeting autophagy and mitochondrial metabolism in AML.
Respiratory complex I catalyzes electron transfer from NADH to ubiquinone (Q) coupled to vectorial proton translocation across the inner mitochondrial membrane. Despite recent progress in structure determination of this very large membrane protein complex, the coupling mechanism is a matter of ongoing debate and the function of accessory subunits surrounding the canonical core subunits is essentially unknown. Concerted rearrangements within a cluster of conserved loops of central subunits NDUFS2 (β1-β2S2 loop), ND1 (TMH5-6ND1 loop) and ND3 (TMH1-2ND3 loop) were suggested to be critical for its proton pumping mechanism. Here, we show that stabilization of the TMH1-2ND3 loop by accessory subunit LYRM6 (NDUFA6) is pivotal for energy conversion by mitochondrial complex I. We determined the high-resolution structure of inactive mutant F89ALYRM6 of eukaryotic complex I from the yeast Yarrowia lipolytica and found long-range structural changes affecting the entire loop cluster. In atomistic molecular dynamics simulations of the mutant, we observed conformational transitions in the loop cluster that disrupted a putative pathway for delivery of substrate protons required in Q redox chemistry. Our results elucidate in detail the essential role of accessory subunit LYRM6 for the function of eukaryotic complex I and offer clues on its redox-linked proton pumping mechanism.
Translational riboswitches are cis-acting RNA regulators that modulate the expression of genes during translation initiation. Their mechanism is considered as an RNA-only gene-regulatory system inducing a ligand-dependent shift of the population of functional ON- and OFF-states. The interaction of riboswitches with the translation machinery remained unexplored. For the adenine-sensing riboswitch from Vibrio vulnificus we show that ligand binding alone is not sufficient for switching to a translational ON-state but the interaction of the riboswitch with the 30S ribosome is indispensable. Only the synergy of binding of adenine and of 30S ribosome, in particular protein rS1, induces complete opening of the translation initiation region. Our investigation thus unravels the intricate dynamic network involving RNA regulator, ligand inducer and ribosome protein modulator during translation initiation.
1H, 13C, and 15N backbone chemical shift assignments of coronavirus-2 non-structural protein Nsp10
(2020)
The international Covid19-NMR consortium aims at the comprehensive spectroscopic characterization of SARS-CoV-2 RNA elements and proteins and will provide NMR chemical shift assignments of the molecular components of this virus. The SARS-CoV-2 genome encodes approximately 30 different proteins. Four of these proteins are involved in forming the viral envelope or in the packaging of the RNA genome and are therefore called structural proteins. The other proteins fulfill a variety of functions during the viral life cycle and comprise the so-called non-structural proteins (nsps). Here, we report the near-complete NMR resonance assignment for the backbone chemical shifts of the non-structural protein 10 (nsp10). Nsp10 is part of the viral replication-transcription complex (RTC). It aids in synthesizing and modifying the genomic and subgenomic RNAs. Via its interaction with nsp14, it ensures transcriptional fidelity of the RNA-dependent RNA polymerase, and through its stimulation of the methyltransferase activity of nsp16, it aids in synthesizing the RNA cap structures which protect the viral RNAs from being recognized by the innate immune system. Both of these functions can be potentially targeted by drugs. Our data will aid in performing additional NMR-based characterizations, and provide a basis for the identification of possible small molecule ligands interfering with nsp10 exerting its essential role in viral replication.
The current outbreak of the highly infectious COVID-19 respiratory disease is caused by the novel coronavirus SARS-CoV-2 (Severe Acute Respiratory Syndrome Coronavirus 2). To fight the pandemic, the search for promising viral drug targets has become a cross-border common goal of the international biomedical research community. Within the international Covid19-NMR consortium, scientists support drug development against SARS-CoV-2 by providing publicly available NMR data on viral proteins and RNAs. The coronavirus nucleocapsid protein (N protein) is an RNA-binding protein involved in viral transcription and replication. Its primary function is the packaging of the viral RNA genome. The highly conserved architecture of the coronavirus N protein consists of an N-terminal RNA-binding domain (NTD), followed by an intrinsically disordered Serine/Arginine (SR)-rich linker and a C-terminal dimerization domain (CTD). Besides its involvement in oligomerization, the CTD of the N protein (N-CTD) is also able to bind to nucleic acids by itself, independent of the NTD. Here, we report the near-complete NMR backbone chemical shift assignments of the SARS-CoV-2 N-CTD to provide the basis for downstream applications, in particular site-resolved drug binding studies.
One current goal in native mass spectrometry is the assignment of binding affinities to noncovalent complexes. Here we introduce a novel implementation of the existing laser-induced liquid bead ion desorption (LILBID) mass spectrometry method: this new method, LILBID laser dissociation curves, assesses binding strengths quantitatively. In all LILBID applications, aqueous sample droplets are irradiated by 3 µm laser pulses. Variation of the laser energy transferred to the droplet during desorption affects the degree of complex dissociation. In LILBID laser dissociation curves, laser energy transfer is purposely varied, and a binding affinity is calculated from the resulting complex dissociation. A series of dsDNAs with different binding affinities was assessed using LILBID laser dissociation curves. The binding affinity results from the LILBID laser dissociation curves strongly correlated with the melting temperatures from UV melting curves and with dissociation constants from isothermal titration calorimetry, standard solution phase methods. LILBID laser dissociation curve data also showed good reproducibility and successfully predicted the melting temperatures and dissociation constants of three DNA sequences. LILBID laser dissociation curves are a promising native mass spectrometry binding affinity method, with reduced time and sample consumption compared to melting curves or titrations.
The Mycobacterium tuberculosis tyrosine-specific phosphatase MptpA and its cognate kinase PtkA are prospective targets for anti-tuberculosis drugs as they interact with the host defense response within the macrophages. Although both are structurally well-characterized, the functional mechanism regulating their activity remains poorly understood. Here, we investigate the effect of post-translational oxidation in regulating the function of MptpA. Treatment of MptpA with H2O2/NaHCO3, mimicking cellular oxidative stress conditions, leads to oxidation of the catalytic cysteine (C11) and to a conformational rearrangement of the phosphorylation loop (D-loop) by repositioning the conserved tyrosine 128 (Y128) and generating a temporarily inactive preclosed state of the phosphatase. Thus, the catalytic cysteine in the P-loop acts as a redox switch and regulates the phosphatase activity of MptpA.
The p53 protein family is the most studied protein family of all. Sequence analysis and structure determination have revealed a high
similarity of crucial domains between p53, p63 and p73. Functional studies, however, have shown a wide variety of different tasks in
tumor suppression, quality control and development. Here we review the structure and organization of the individual domains of
p63 and p73, the interaction of these domains in the context of full-length proteins and discuss the evolutionary origin of this
protein family.
FACTS:
● Distinct physiological roles/functions are performed by specific isoforms.
● The non-divided transactivation domain of p63 has a constitutively high activity while the transactivation domains of p53/p73
are divided into two subdomains that are regulated by phosphorylation.
● Mdm2 binds to all three family members but ubiquitinates only p53.
● TAp63α forms an autoinhibited dimeric state while all other vertebrate p53 family isoforms are constitutively tetrameric.
● The oligomerization domain of p63 and p73 contain an additional helix that is necessary for stabilizing the tetrameric states.
During evolution this helix got lost independently in different phylogenetic branches, while the DNA binding domain became
destabilized and the transactivation domain split into two subdomains.
OPEN QUESTIONS:
● Is the autoinhibitory mechanism of mammalian TAp63α conserved in p53 proteins of invertebrates that have the same function
of genomic quality control in germ cells?
● What is the physiological function of the p63/p73 SAM domains?
● Do the short isoforms of p63 and p73 have physiological functions?
● What are the roles of the N-terminal elongated TAp63 isoforms, TA* and GTA?
tRNAs are L-shaped RNA molecules of ~ 80 nucleotides that are responsible for decoding the mRNA and for the incorporation of the correct amino acid into the growing peptidyl-chain at the ribosome. They occur in all kingdoms of life and both their functions, and their structure are highly conserved. The L-shaped tertiary structure is based on a cloverleaf-like secondary structure that consists of four base paired stems connected by three to four loops. The anticodon base triplet, which is complementary to the sequence of the mRNA, resides in the anticodon loop whereas the amino acid is attached to the sequence CCA at the 3′-terminus of the molecule. tRNAs exhibit very stable secondary and tertiary structures and contain up to 10% modified nucleotides. However, their structure and function can also be maintained in the absence of nucleotide modifications. Here, we present the assignments of nucleobase resonances of the non-modified 77 nt tRNAIle from the gram-negative bacterium Escherichia coli. We obtained assignments for all imino resonances visible in the spectra of the tRNA as well as for additional exchangeable and non-exchangeable protons and for heteronuclei of the nucleobases. Based on these assignments we could determine the chemical shift differences between modified and non-modified tRNAIle as a first step towards the analysis of the effect of nucleotide modifications on tRNA’s structure and dynamics.
The family of scaffold attachment factor B (SAFB) proteins comprises three members and was first identified as binders of the nuclear matrix/scaffold. Over the past two decades, SAFBs were shown to act in DNA repair, mRNA/(l)ncRNA processing and as part of protein complexes with chromatin-modifying enzymes. SAFB proteins are approximately 100 kDa-sized dual nucleic acid-binding proteins with dedicated domains in an otherwise largely unstructured context, but whether and how they discriminate DNA and RNA binding has remained enigmatic. We here provide the SAFB2 DNA- and RNA-binding SAP and RRM domains in their functional boundaries and use solution NMR spectroscopy to ascribe DNA- and RNA-binding functions. We give insight into their target nucleic acid preferences and map the interfaces with respective nucleic acids on sparse data-derived SAP and RRM domain structures. Further, we provide evidence that the SAP domain exhibits intra-domain dynamics and a potential tendency to dimerize, which may expand its specifically targeted DNA sequence range. Our data provide a first molecular basis of and a starting point towards deciphering DNA- and RNA-binding functions of SAFB2 on the molecular level and serve a basis for understanding its localization to specific regions of chromatin and its involvement in the processing of specific RNA species.
During evolution of an RNA world, the development of enzymatic function was essential. Such enzymatic function was linked to RNA sequences capable of adopting specific RNA folds that possess catalytic pockets to promote catalysis. Within this primordial RNA world, initially evolved self-replicating ribozymes presumably mutated to ribozymes with new functions. Schultes and Bartel (Science 2000, 289, 448–452) investigated such conversion from one ribozyme to a new ribozyme with distinctly different catalytic functions. Within a neutral network that linked these two prototype ribozymes, a single RNA chain could be identified that exhibited both enzymatic functions. As commented by Schultes and Bartel, this system possessing one sequence with two enzymatic functions serves as a paradigm for an evolutionary system that allows neutral drifts by stepwise mutation from one ribozyme into a different ribozyme without loss of intermittent function. Here, we investigated this complex functional diversification of ancestral ribozymes by analyzing several RNA sequences within this neutral network between two ribozymes with class III ligase activity and with self-cleavage reactivity. We utilized rapid RNA sample preparation for NMR spectroscopic studies together with SHAPE analysis and in-line probing to characterize secondary structure changes within the neutral network. Our investigations allowed delineation of the secondary structure space and by comparison with the previously determined catalytic function allowed correlation of the structure-function relation of ribozyme function in this neutral network.
Polo-like kinase 1 (PLK1) is a crucial regulator of cell cycle progression. It is established that the activation of PLK1 depends on the coordinated action of Aurora-A and Bora. Nevertheless, very little is known about the spatiotemporal regulation of PLK1 during G2, specifically, the mechanisms that keep cytoplasmic PLK1 inactive until shortly before mitosis onset. Here, we describe PLK1 dimerization as a new mechanism that controls PLK1 activation. During the early G2 phase, Bora supports transient PLK1 dimerization, thus fine-tuning the timely regulated activation of PLK1 and modulating its nuclear entry. At late G2, the phosphorylation of T210 by Aurora-A triggers dimer dissociation and generates active PLK1 monomers that support entry into mitosis. Interfering with this critical PLK1 dimer/monomer switch prevents the association of PLK1 with importins, limiting its nuclear shuttling, and causes nuclear PLK1 mislocalization during the G2-M transition. Our results suggest a novel conformational space for the design of a new generation of PLK1 inhibitors.
Wir untersuchen eine neuartige Gruppe von Polarisationsmitteln – gemischtvalente Verbindungen – mittels theoretischer und experimenteller Methoden und demonstrieren ihre Leistungsfähigkeit in NMR-Experimenten mit Hochfeld-DNP (DNP=Dynamic Nuclear Polarization, dynamische Kernpolarisation) im festen Zustand. Diese gemischtvalenten Verbindungen stellen eine Gruppe von Molekülen dar, bei denen die molekulare Mobilität auch in Festkörpern erhalten bleibt. Folglich können solche Polarisationsmittel unter günstigen Bedingungen für die dynamische Kernpolarisationsbildung bei ultrahohen Magnetfeldern verwendet werden, um Overhauser-DNP-Experimente im Festkörper durchzuführen.
Riboswitches are regulatory RNA elements that undergo functionally important allosteric conformational switching upon binding of specific ligands. The here investigated guanidine-II riboswitch binds the small cation, guanidinium, and forms a kissing loop-loop interaction between its P1 and P2 hairpins. We investigated the structural changes to support previous studies regarding the binding mechanism. Using NMR spectroscopy, we confirmed the structure as observed in crystal structures and we characterized the kissing loop interaction upon addition of Mg2+ and ligand for the riboswitch aptamer from Escherichia coli. We further investigated closely related mutant constructs providing further insight into functional differences between the two (different) hairpins P1 and P2. Formation of intermolecular interactions were probed by small-angle X-ray scattering (SAXS) and NMR DOSY data. All data are consistent and show the formation of oligomeric states of the riboswitch induced by Mg2+ and ligand binding.
The β-barrel assembly machinery (BAM) consisting of the central β-barrel BamA and four other lipoproteins mediates the folding of the majority of the outer membrane proteins. BamA is placed in an asymmetric bilayer and its lateral gate is suggested to be the functional hotspot. Here we used in situ pulsed electron-electron double resonance spectroscopy to characterize BamA in the native outer membrane. In the detergent micelles, the data is consistent with mainly an inward-open conformation of BamA. The native membrane considerably enhanced the conformational heterogeneity. The lateral gate and the extracellular loop 3 exist in an equilibrium between different conformations. The outer membrane provides a favorable environment for occupying multiple conformational states independent of the lipoproteins. Our results reveal a highly dynamic behavior of the lateral gate and other key structural elements and provide direct evidence for the conformational modulation of a membrane protein in situ.
1H, 13C and 15N chemical shift assignment of the stem-loops 5b + c from the 5′-UTR of SARS-CoV-2
(2022)
The ongoing pandemic of the respiratory disease COVID-19 is caused by the SARS-CoV-2 (SCoV2) virus. SCoV2 is a member of the Betacoronavirus genus. The 30 kb positive sense, single stranded RNA genome of SCoV2 features 5′- and 3′-genomic ends that are highly conserved among Betacoronaviruses. These genomic ends contain structured cis-acting RNA elements, which are involved in the regulation of viral replication and translation. Structural information about these potential antiviral drug targets supports the development of novel classes of therapeutics against COVID-19. The highly conserved branched stem-loop 5 (SL5) found within the 5′-untranslated region (5′-UTR) consists of a basal stem and three stem-loops, namely SL5a, SL5b and SL5c. Both, SL5a and SL5b feature a 5′-UUUCGU-3′ hexaloop that is also found among Alphacoronaviruses. Here, we report the extensive 1H, 13C and 15N resonance assignment of the 37 nucleotides (nts) long sequence spanning SL5b and SL5c (SL5b + c), as basis for further in-depth structural studies by solution NMR spectroscopy.
Herein, we present a multi-cycle chemoenzymatic synthesis of modified RNA with simplified solid-phase handling to overcome size limitations of RNA synthesis. It combines the advantages of classical chemical solid-phase synthesis and enzymatic synthesis using magnetic streptavidin beads and biotinylated RNA. Successful introduction of light-controllable RNA nucleotides into the tRNAMet sequence was confirmed by gel electrophoresis and mass spectrometry. The methods tolerate modifications in the RNA phosphodiester backbone and allow introductions of photocaged and photoswitchable nucleotides as well as photocleavable strand breaks and fluorophores.
The family of scaffold attachment factor B (SAFB) proteins comprises three members and was first identified as binders of the nuclear matrix/scaffold. Over the past two decades, SAFBs were shown to act in DNA repair, mRNA/(l)ncRNA processing, and as part of protein complexes with chromatin-modifying enzymes. SAFB proteins are approximately-100-kDa-sized dual nucleic acid-binding proteins with dedicated domains in an otherwise largely unstructured context, but whether and how they discriminate DNA- and RNA-binding has remained enigmatic. We here provide the SAFB2 DNA- and RNA-binding SAP and RRM domains in their functional boundaries and use solution NMR spectroscopy to ascribe DNA- and RNA-binding functions. We give insight into their target nucleic acid preferences and map the interfaces with respective nucleic acids on sparse data-derived SAP and RRM domain structures. Further, we provide evidence that the SAP domain exhibits intra-domain dynamics and a potential tendency to dimerise, which may expand its specifically targeted DNA sequence range. Our data provide a first molecular basis of and a starting point towards deciphering DNA- and RNA-binding functions of SAFB2 on the molecular level and serve a basis for understanding its localization to specific regions of chromatin and its involvement in the processing of specific RNA species.
G-quadruplexes (G4), found in numerous places within the human genome, are involved in essential processes of cell regulation. Chromosomal DNA G4s are involved for example, in replication and transcription as first steps of gene expression. Hence, they influence a plethora of downstream processes. G4s possess an intricate structure that differs from canonical B-form DNA. Identical DNA G4 sequences can adopt multiple long-lived conformations, a phenomenon known as G4 polymorphism. A detailed understanding of the molecular mechanisms that drive G4 folding is essential to understand their ambivalent regulatory roles. Disentangling the inherent dynamic and polymorphic nature of G4 structures thus is key to unravel their biological functions and make them amenable as molecular targets in novel therapeutic approaches. We here review recent experimental approaches to monitor G4 folding and discuss structural aspects for possible folding pathways. Substantial progress in the understanding of G4 folding within the recent years now allows drawing comprehensive models of the complex folding energy landscape of G4s that we herein evaluate based on computational and experimental evidence.
Specialized surveillance mechanisms are essential to maintain the genetic integrity of germ cells, which are not only the source of all somatic cells but also of the germ cells of the next generation. DNA damage and chromosomal aberrations are, therefore, not only detrimental for the individual but affect the entire species. In oocytes, the surveillance of the structural integrity of the DNA is maintained by the p53 family member TAp63α. The TAp63α protein is highly expressed in a closed and inactive state and gets activated to the open conformation upon the detection of DNA damage, in particular DNA double-strand breaks. To understand the cellular response to DNA damage that leads to the TAp63α triggered oocyte death we have investigated the RNA transcriptome of oocytes following irradiation at different time points. The analysis shows enhanced expression of pro-apoptotic and typical p53 target genes such as CDKn1a or Mdm2, concomitant with the activation of TAp63α. While DNA repair genes are not upregulated, inflammation-related genes become transcribed when apoptosis is initiated by activation of STAT transcription factors. Furthermore, comparison with the transcriptional profile of the ΔNp63α isoform from other studies shows only a minimal overlap, suggesting distinct regulatory programs of different p63 isoforms.
Rhodesain is the lysosomal cathepsin L-like cysteine protease of Trypanosoma brucei rhodesiense, the causative agent of Human African Trypanosomiasis. The enzyme is essential for the proliferation and pathogenicity of the parasite as well as its ability to overcome the blood–brain barrier of the host. Lysosomal cathepsins are expressed as zymogens with an inactivating prodomain that is cleaved under acidic conditions. A structure of the uncleaved maturation intermediate from a trypanosomal cathepsin L-like protease is currently not available. We thus established the heterologous expression of T. brucei rhodesiense pro-rhodesain in Escherichia coli and determined its crystal structure. The trypanosomal prodomain differs from nonparasitic pro-cathepsins by a unique, extended α-helix that blocks the active site and whose side-chain interactions resemble those of the antiprotozoal inhibitor K11777. Interdomain dynamics between pro- and core protease domain as observed by photoinduced electron transfer fluorescence correlation spectroscopy increase at low pH, where pro-rhodesain also undergoes autocleavage. Using the crystal structure, molecular dynamics simulations, and mutagenesis, we identify a conserved interdomain salt bridge that prevents premature intramolecular cleavage at higher pH values and may thus present a control switch for the observed pH sensitivity of proenzyme cleavage in (trypanosomal) CathL-like proteases.
Rhodesain is the lysosomal cathepsin L-like cysteine protease of T. brucei rhodesiense, the causative agent of Human African Trypanosomiasis. The enzyme is essential for the proliferation and pathogenicity of the parasite as well as its ability to overcome the blood-brain barrier of the host. Lysosomal cathepsins are expressed as zymogens with an inactivating pro-domain that is cleaved under acidic conditions. A structure of the uncleaved maturation intermediate from a trypanosomal cathepsin L-like protease is currently not available. We thus established the heterologous expression of T. brucei rhodesiense pro-rhodesain in E. coli and determined its crystal structure. The trypanosomal pro-domain differs from non-parasitic pro-cathepsins by a unique, extended α-helix that blocks the active site and whose interactions resemble that of the antiprotozoal inhibitor K11777. Interdomain dynamics between pro- and core protease domain as observed by photoinduced electron transfer fluorescence correlation spectroscopy increase at low pH, where pro-rhodesain also undergoes autocleavage. Using the crystal structure, molecular dynamics simulations and mutagenesis, we identify a conserved interdomain salt bridge that prevents premature intramolecular cleavage at higher pH values and may thus present a control switch for the observed pH-sensitivity of pro-enzyme cleavage in (trypanosomal) CathL-like proteases.
The function of the p53 transcription factor family is dependent on several folded domains. In addition to a DNA-binding domain, members of this family contain an oligomerization domain. p63 and p73 also contain a C-terminal Sterile α-motif domain. Inhibition of most transcription factors is difficult as most of them lack deep pockets that can be targeted by small organic molecules. Genetic knock-out procedures are powerful in identifying the overall function of a protein, but they do not easily allow one to investigate roles of individual domains. Here we describe the characterization of Designed Ankyrin Repeat Proteins (DARPins) that were selected as tight binders against all folded domains of p63. We determine binding affinities as well as specificities within the p53 protein family and show that DARPins can be used as intracellular inhibitors for the modulation of transcriptional activity. By selectively inhibiting DNA binding of the ΔNp63α isoform that competes with p53 for the same promoter sites, we show that p53 can be reactivated. We further show that inhibiting the DNA binding activity stabilizes p63, thus providing evidence for a transcriptionally regulated negative feedback loop. Furthermore, the ability of DARPins to bind to the DNA-binding domain and the Sterile α-motif domain within the dimeric-only and DNA-binding incompetent conformation of TAp63α suggests a high structural plasticity within this special conformation. In addition, the developed DARPins can also be used to specifically detect p63 in cell culture and in primary tissue and thus constitute a very versatile research tool for studying the function of p63.
The SARS-CoV-2 nucleocapsid (N) protein is crucial for the highly organized packaging and transcription of the genomic RNA. Studying atomic details of the role of its intrinsically disordered regions (IDRs) in RNA recognition is challenging due to the absence of structure and to the repetitive nature of their primary sequence. IDRs are known to act in concert with the folded domains of N and here we use NMR spectroscopy to identify the priming events of N interacting with a regulatory SARS-CoV-2 RNA element. 13C-detected NMR experiments, acquired simultaneously to 1H detected ones, provide information on the two IDRs flanking the N-terminal RNA binding domain (NTD) within the N-terminal region of the protein (NTR, 1–248). We identify specific tracts of the IDRs that most rapidly sense and engage with RNA, and thus provide an atom-resolved picture of the interplay between the folded and disordered regions of N during RNA interaction.
Human African Trypanosomiasis (HAT) is an endemic protozoan disease widespread in the sub-Saharan region that is caused by T. b. gambiense and T. b. rhodesiense. The development of molecules targeting rhodesain, the main cysteine protease of T. b. rhodesiense, has led to a panel of inhibitors endowed with micro/sub-micromolar activity towards the protozoa. However, whilst impressive binding affinity against rhodesain has been observed, the limited selectivity towards the target still remains a hard challenge for the development of antitrypanosomal agents. In this paper, we report the synthesis, biological evaluation, as well as docking studies of a series of reduced peptide bond pseudopeptide Michael acceptors (SPR10–SPR19) as potential anti-HAT agents. The new molecules show Ki values in the low-micro/sub-micromolar range against rhodesain, coupled with k2nd values between 1314 and 6950 M−1 min−1. With a few exceptions, an appreciable selectivity over human cathepsin L was observed. In in vitro assays against T. b. brucei cultures, SPR16 and SPR18 exhibited single-digit micromolar activity against the protozoa, comparable to those reported for very potent rhodesain inhibitors, while no significant cytotoxicity up to 70 µM towards mammalian cells was observed. The discrepancy between rhodesain inhibition and the antitrypanosomal effect could suggest additional mechanisms of action. The biological characterization of peptide inhibitor SPR34 highlights the essential role played by the reduced bond for the antitrypanosomal effect. Overall, this series of molecules could represent the starting point for further investigations of reduced peptide bond-containing analogs as potential anti-HAT agents
NMR structure calculation using NOE-derived distance restraints requires a considerable number of assignments of both backbone and sidechains resonances, often difficult or impossible to get for large or complex proteins. Pseudocontact shifts (PCSs) also play a well-established role in NMR protein structure calculation, usually to augment existing structural, mostly NOE-derived, information. Existing refinement protocols using PCSs usually either require a sizeable number of sidechain assignments or are complemented by other experimental restraints. Here, we present an automated iterative procedure to perform backbone protein structure refinements requiring only a limited amount of backbone amide PCSs. Already known structural features from a starting homology model, in this case modules of repeat proteins, are framed into a scaffold that is subsequently refined by experimental PCSs. The method produces reliable indicators that can be monitored to judge about the performance. We applied it to a system in which sidechain assignments are hardly possible, designed Armadillo repeat proteins (dArmRPs), and we calculated the solution NMR structure of YM4A, a dArmRP containing four sequence-identical internal modules, obtaining high convergence to a single structure. We suggest that this approach is particularly useful when approximate folds are known from other techniques, such as X-ray crystallography, while avoiding inherent artefacts due to, for instance, crystal packing.
The mammalian Transient Receptor Potential Vanilloid (TRPV) channels are a family of six tetrameric ion channels localized at the plasma membrane. The group I members of the family, TRPV1 through TRPV4, are heat-activated and exhibit remarkable polymodality. The distal N-termini of group I TRPV channels contain large intrinsically disordered regions (IDRs), ranging from ~ 75 amino acids (TRPV2) to ~ 150 amino acids (TRPV4), the vast majority of which is invisible in the structural models published so far. These IDRs provide important binding sites for cytosolic partners, and their deletion is detrimental to channel activity and regulation. Recently, we reported the NMR backbone assignments of the distal TRPV4 N-terminus and noticed some discrepancies between the extent of disorder predicted solely based on protein sequence and from experimentally determined chemical shifts. Thus, for an analysis of the extent of disorder in the distal N-termini of all group I TRPV channels, we now report the NMR assignments for the human TRPV1, TRPV2 and TRPV3 IDRs.
The ribosomal S1 protein (rS1) is indispensable for translation initiation in Gram-negative bacteria. rS1 is a multidomain protein that acts as an RNA chaperone and ensures that mRNAs can bind the ribosome in a single-stranded conformation, which could be related to fast recognition. Although many ribosome structures were solved in recent years, a high-resolution structure of a two-domain mRNA-binding competent rS1 construct is not yet available. Here, we present the NMR solution structure of the minimal mRNA-binding fragment of Vibrio Vulnificus rS1 containing the domains D3 and D4. Both domains are homologues and adapt an oligonucleotide-binding fold (OB fold) motif. NMR titration experiments reveal that recognition of miscellaneous mRNAs occurs via a continuous interaction surface to one side of these structurally linked domains. Using a novel paramagnetic relaxation enhancement (PRE) approach and exploring different spin-labeling positions within RNA, we were able to track the location and determine the orientation of the RNA in the rS1–D34 bound form. Our investigations show that paramagnetically labeled RNAs, spiked into unmodified RNA, can be used as a molecular ruler to provide structural information on protein-RNA complexes. The dynamic interaction occurs on a defined binding groove spanning both domains with identical β2-β3-β5 interfaces. Evidently, the 3′-ends of the cis-acting RNAs are positioned in the direction of the N-terminus of the rS1 protein, thus towards the 30S binding site and adopt a conformation required for translation initiation.
The SARS-CoV-2 virus is the cause of the respiratory disease COVID-19. As of today, therapeutic interventions in severe COVID-19 cases are still not available as no effective therapeutics have been developed so far. Despite the ongoing development of a number of effective vaccines, therapeutics to fight the disease once it has been contracted will still be required. Promising targets for the development of antiviral agents against SARS-CoV-2 can be found in the viral RNA genome. The 5′- and 3′-genomic ends of the 30 kb SCoV-2 genome are highly conserved among Betacoronaviruses and contain structured RNA elements involved in the translation and replication of the viral genome. The 40 nucleotides (nt) long highly conserved stem-loop 4 (5_SL4) is located within the 5′-untranslated region (5′-UTR) important for viral replication. 5_SL4 features an extended stem structure disrupted by several pyrimidine mismatches and is capped by a pentaloop. Here, we report extensive 1H, 13C, 15N and 31P resonance assignments of 5_SL4 as the basis for in-depth structural and ligand screening studies by solution NMR spectroscopy.
The SARS-CoV-2 genome encodes for approximately 30 proteins. Within the international project COVID19-NMR, we distribute the spectroscopic analysis of the viral proteins and RNA. Here, we report NMR chemical shift assignments for the protein Nsp3b, a domain of Nsp3. The 217-kDa large Nsp3 protein contains multiple structurally independent, yet functionally related domains including the viral papain-like protease and Nsp3b, a macrodomain (MD). In general, the MDs of SARS-CoV and MERS-CoV were suggested to play a key role in viral replication by modulating the immune response of the host. The MDs are structurally conserved. They most likely remove ADP-ribose, a common posttranslational modification, from protein side chains. This de-ADP ribosylating function has potentially evolved to protect the virus from the anti-viral ADP-ribosylation catalyzed by poly-ADP-ribose polymerases (PARPs), which in turn are triggered by pathogen-associated sensing of the host immune system. This renders the SARS-CoV-2 Nsp3b a highly relevant drug target in the viral replication process. We here report the near-complete NMR backbone resonance assignment (1H, 13C, 15N) of the putative Nsp3b MD in its apo form and in complex with ADP-ribose. Furthermore, we derive the secondary structure of Nsp3b in solution. In addition, 15N-relaxation data suggest an ordered, rigid core of the MD structure. These data will provide a basis for NMR investigations targeted at obtaining small-molecule inhibitors interfering with the catalytic activity of Nsp3b.
Non-ribosomal peptide synthetases (NRPS) produce natural products from amino acid building blocks. They often consist of multiple polypeptide chains which assemble in a specific linear order via specialized N- and C-terminal docking domains (N/CDDs). Typically, docking domains function independently from other domains in NRPS assembly. Thus, docking domain replacements enable the assembly of “designer” NRPS from proteins that normally do not interact. The multiprotein “peptide-antimicrobial-Xenorhabdus” (PAX) peptide-producing PaxS NRPS is assembled from the three proteins PaxA, PaxB and PaxC. Herein, we show that the small CDD of PaxA cooperates with its preceding thiolation (T1) domain to bind the NDD of PaxB with very high affinity, establishing a structural and thermodynamical basis for this unprecedented docking interaction, and we test its functional importance in vivo in a truncated PaxS assembly line. Similar docking interactions are apparently present in other NRPS systems.
The desensitized channelrhodopsin-2 photointermediate contains 13 -cis, 15 -syn retinal Schiff base
(2021)
Channelrhodopsin-2 (ChR2) is a light-gated cation channel and was used to lay the foundations of optogenetics. Its dark state X-ray structure has been determined in 2017 for the wild-type, which is the prototype for all other ChR variants. However, the mechanistic understanding of the channel function is still incomplete in terms of structural changes after photon absorption by the retinal chromophore and in the framework of functional models. Hence, detailed information needs to be collected on the dark state as well as on the different photointermediates. For ChR2 detailed knowledge on the chromophore configuration in the different states is still missing and a consensus has not been achieved. Using DNP-enhanced solid-state MAS NMR spectroscopy on proteoliposome samples, we unambiguously determined the chromophore configuration in the desensitized state, and we show that this state occurs towards the end of the photocycle.
Polymorphic G-quadruplex (G4) secondary DNA structures have received increasing attention in medicinal chemistry owing to their key involvement in the regulation of the maintenance of genomic stability, telomere length homeostasis and transcription of important proto-oncogenes. Different classes of G4 ligands have been developed for the potential treatment of several human diseases. Among them, the carbazole scaffold with appropriate side chain appendages has attracted much interest for designing G4 ligands. Because of its large and rigid π-conjugation system and ease of functionalization at three different positions, a variety of carbazole derivatives have been synthesized from various natural or synthetic sources for potential applications in G4-based therapeutics and biosensors. Herein, we provide an updated close-up of the literatures on carbazole-based G4 ligands with particular focus given on their detailed binding insights studied by NMR spectroscopy. The structure-activity relationships and the opportunities and challenges of their potential applications as biosensors and therapeutics are also discussed. This review will provide an overall picture of carbazole ligands with remarkable G4 topological preference, fluorescence properties and significant bioactivity; portraying carbazole as a very promising scaffold for assembling G4 ligands with a range of novel functional applications.
SARS-CoV-2 contains a positive single-stranded RNA genome of approximately 30 000 nucleotides. Within this genome, 15 RNA elements were identified as conserved between SARS-CoV and SARS-CoV-2. By nuclear magnetic resonance (NMR) spectroscopy, we previously determined that these elements fold independently, in line with data from in vivo and ex-vivo structural probing experiments. These elements contain non-base-paired regions that potentially harbor ligand-binding pockets. Here, we performed an NMR-based screening of a poised fragment library of 768 compounds for binding to these RNAs, employing three different 1H-based 1D NMR binding assays. The screening identified common as well as RNA-element specific hits. The results allow selection of the most promising of the 15 RNA elements as putative drug targets. Based on the identified hits, we derive key functional units and groups in ligands for effective targeting of the RNA of SARS-CoV-2.
The bacteriophage ΦX174 causes large pore formation in Escherichia coli and related bacteria. Lysis is mediated by the small membrane-bound toxin ΦX174-E, which is composed of a transmembrane domain and a soluble domain. The toxin requires activation by the bacterial chaperone SlyD and inhibits the cell wall precursor forming enzyme MraY. Bacterial cell wall biosynthesis is an important target for antibiotics; therefore, knowledge of molecular details in the ΦX174-E lysis pathway could help to identify new mechanisms and sites of action. In this study, cell-free expression and nanoparticle technology were combined to avoid toxic effects upon ΦX174-E synthesis, resulting in the efficient production of a functional full-length toxin and engineered derivatives. Pre-assembled nanodiscs were used to study ΦX174-E function in defined lipid environments and to analyze its membrane insertion mechanisms. The conformation of the soluble domain of ΦX174-E was identified as a central trigger for membrane insertion, as well as for the oligomeric assembly of the toxin. Stable complex formation of the soluble domain with SlyD is essential to keep nascent ΦX174-E in a conformation competent for membrane insertion. Once inserted into the membrane, ΦX174-E assembles into high-order complexes via its transmembrane domain and oligomerization depends on the presence of an essential proline residue at position 21. The data presented here support a model where an initial contact of the nascent ΦX174-E transmembrane domain with the peptidyl-prolyl isomerase domain of SlyD is essential to allow a subsequent stable interaction of SlyD with the ΦX174-E soluble domain for the generation of a membrane insertion competent toxin.
We report here the in-cell NMR-spectroscopic observation of the binding of the cognate ligand 2′-deoxyguanosine to the aptamer domain of the bacterial 2′-deoxyguanosine-sensing riboswitch in eukaryotic cells, namely Xenopus laevis oocytes and in human HeLa cells. The riboswitch is sufficiently stable in both cell types to allow for detection of binding of the ligand to the riboswitch. Most importantly, we show that the binding mode established by in vitro characterization of this prokaryotic riboswitch is maintained in eukaryotic cellular environment. Our data also bring important methodological insights: Thus far, in-cell NMR studies on RNA in mammalian cells have been limited to investigations of short (<15 nt) RNA fragments that were extensively modified by protecting groups to limit their degradation in the intracellular space. Here, we show that the in-cell NMR setup can be adjusted for characterization of much larger (≈70 nt) functional and chemically non-modified RNA.
The stem-loop (SL1) is the 5'-terminal structural element within the single-stranded SARS-CoV-2 RNA genome. It is formed by nucleotides 7–33 and consists of two short helical segments interrupted by an asymmetric internal loop. This architecture is conserved among Betacoronaviruses. SL1 is present in genomic SARS-CoV-2 RNA as well as in all subgenomic mRNA species produced by the virus during replication, thus representing a ubiquitous cis-regulatory RNA with potential functions at all stages of the viral life cycle. We present here the 1H, 13C and 15N chemical shift assignment of the 29 nucleotides-RNA construct 5_SL1, which denotes the native 27mer SL1 stabilized by an additional terminal G-C base-pair.
The genome of the halophilic archaeon Haloferax volcanii encodes more than 40 one-domain zinc finger µ-proteins. Only one of these, HVO_2753, contains four C(P)XCG motifs, suggesting the presence of two zinc binding pockets (ZBPs). Homologs of HVO_2753 are widespread in many euryarchaeota. An in frame deletion mutant of HVO_2753 grew indistinguishably from the wild-type in several media, but had a severe defect in swarming and in biofilm formation. For further analyses, the protein was produced homologously as well as heterologously in Escherichia coli. HVO_2753 was stable and folded in low salt, in contrast to many other haloarchaeal proteins. Only haloarchaeal HVO_2753 homologs carry a very hydrophilic N terminus, and NMR analysis showed that this region is very flexible and not part of the core structure. Surprisingly, both NMR analysis and a fluorimetric assay revealed that HVO_2753 binds only one zinc ion, despite the presence of two ZBPs. Notably, the analysis of cysteine to alanine mutant proteins by NMR as well by in vivo complementation revealed that all four C(P)XCG motifs are essential for folding and function. The NMR solution structure of the major conformation of HVO_2753 was solved. Unexpectedly, it was revealed that ZBP1 was comprised of C(P)XCG motifs 1 and 3, and ZBP2 was comprised of C(P)XCG motifs 2 and 4. There are several indications that ZBP2 is occupied by zinc, in contrast to ZBP1. To our knowledge, this study represents the first in-depth analysis of a zinc finger µ-protein in all three domains of life.
We report here the nuclear magnetic resonance 19F screening of 14 RNA targets with different secondary and tertiary structure to systematically assess the druggability of RNAs. Our RNA targets include representative bacterial riboswitches that naturally bind with nanomolar affinity and high specificity to cellular metabolites of low molecular weight. Based on counter-screens against five DNAs and five proteins, we can show that RNA can be specifically targeted. To demonstrate the quality of the initial fragment library that has been designed for easy follow-up chemistry, we further show how to increase binding affinity from an initial fragment hit by chemistry that links the identified fragment to the intercalator acridine. Thus, we achieve low-micromolar binding affinity without losing binding specificity between two different terminator structures.
The structure and flexibility of RNA depends sensitively on the microenvironment. Using pulsed electron-electron double-resonance (PELDOR)/double electron-electron resonance (DEER) spectroscopy combined with advanced labeling techniques, we show that the structure of double-stranded RNA (dsRNA) changes upon internalization into Xenopus lævis oocytes. Compared to dilute solution, the dsRNA A-helix is more compact in cells. We recapitulate this compaction in a densely crowded protein solution. Atomic-resolution molecular dynamics simulations of dsRNA semi-quantitatively capture the compaction, and identify non-specific electrostatic interactions between proteins and dsRNA as a possible driver of this effect.
Proteins encoded by small open reading frames (sORFs) have a widespread occurrence in diverse microorganisms and can be of high functional importance. However, due to annotation biases and their technically challenging direct detection, these small proteins have been overlooked for a long time and were only recently rediscovered. The currently rapidly growing number of such proteins requires efficient methods to investigate their structure–function relationship. Herein, a method is presented for fast determination of the conformational properties of small proteins. Their small size makes them perfectly amenable for solution-state NMR spectroscopy. NMR spectroscopy can provide detailed information about their conformational states (folded, partially folded, and unstructured). In the context of the priority program on small proteins funded by the German research foundation (SPP2002), 27 small proteins from 9 different bacterial and archaeal organisms have been investigated. It is found that most of these small proteins are unstructured or partially folded. Bioinformatics tools predict that some of these unstructured proteins can potentially fold upon complex formation. A protocol for fast NMR spectroscopy structure elucidation is described for the small proteins that adopt a persistently folded structure by implementation of new NMR technologies, including automated resonance assignment and nonuniform sampling in combination with targeted acquisition.
Small ORF (sORF)-encoded small proteins have been overlooked for a long time due to challenges in prediction and distinguishing between coding- and noncoding-predicted sORFs and in their biochemical detection and characterization. We report on the first biochemical and functional characterization of a small protein (sP26) in the archaeal model organism Methanosarcina mazei, comprising 23 amino acids. The corresponding encoding leaderless mRNA (spRNA26) is highly conserved on nucleotide level as well as on the coded amino acids within numerous Methanosarcina strains strongly arguing for a cellular function of the small protein. spRNA26 level is significantly enhanced under nitrogen limitation, but also under oxygen and salt stress conditions. Using heterologously expressed and purified sP26 in independent biochemical approaches [pull-down by affinity chromatography followed by MS analysis, reverse pull-down, microscale thermophoresis, size-exclusion chromatography, and nuclear magnetic resonance spectroscopy (NMR) analysis], we observed that sP26 interacts and forms complexes with M. mazei glutamine synthetase (GlnA1) with high affinity (app. KD = 0.76 µm ± 0.29 µm). Moreover, seven amino acids were identified by NMR analysis to directly interact with GlnA1. Upon interaction with sP26, GlnA1 activity is significantly stimulated, independently and in addition to the known activation by the metabolite 2-oxoglutarate (2-OG). Besides, strong interaction of sP26 with the PII-like protein GlnK1 was demonstrated (app. KD = 2.9 µm ± 0.9 µm). On the basis of these findings, we propose that in addition to 2-OG, sP26 enhances GlnA1 activity under nitrogen limitation most likely by stabilizing the dodecameric structure of GlnA1.
Advanced colorectal carcinoma is currently incurable, and new therapies are urgently needed. We report that phosphotyrosine-dependent Eph receptor signaling sustains colorectal carcinoma cell survival, thereby uncovering a survival pathway active in colorectal carcinoma cells. We find that genetic and biochemical inhibition of Eph tyrosine kinase activity or depletion of the Eph ligand EphrinB2 reproducibly induces colorectal carcinoma cell death by autophagy. Spautin and 3-methyladenine, inhibitors of early steps in the autophagic pathway, significantly reduce autophagy-mediated cell death that follows inhibition of phosphotyrosine-dependent Eph signaling in colorectal cancer cells. A small-molecule inhibitor of the Eph kinase, NVP-BHG712 or its regioisomer NVP-Iso, reduces human colorectal cancer cell growth in vitro and tumor growth in mice. Colorectal cancers express the EphrinB ligand and its Eph receptors at significantly higher levels than numerous other cancer types, supporting Eph signaling inhibition as a potential new strategy for the broad treatment of colorectal carcinoma.
Mechanistic understanding of dynamic membrane proteins such as transporters, receptors, and channels requires accurate depictions of conformational ensembles, and the manner in which they interchange as a function of environmental factors including substrates, lipids, and inhibitors. Spectroscopic techniques such as electron spin resonance (ESR) pulsed electron–electron double resonance (PELDOR), also known as double electron–electron resonance (DEER), provide a complement to atomistic structures obtained from x-ray crystallography or cryo-EM, since spectroscopic data reflect an ensemble and can be measured in more native solvents, unperturbed by a crystal lattice. However, attempts to interpret DEER data are frequently stymied by discrepancies with the structural data, which may arise due to differences in conditions, the dynamics of the protein, or the flexibility of the attached paramagnetic spin labels. Recently, molecular simulation techniques such as EBMetaD have been developed that create a conformational ensemble matching an experimental distance distribution while applying the minimal possible bias. Moreover, it has been proposed that the work required during an EBMetaD simulation to match an experimentally determined distribution could be used as a metric with which to assign conformational states to a given measurement. Here, we demonstrate the application of this concept for a sodium-coupled transport protein, BetP. Because the probe, protein, and lipid bilayer are all represented in atomic detail, the different contributions to the work, such as the extent of protein backbone movements, can be separated. This work therefore illustrates how ranking simulations based on EBMetaD can help to bridge the gap between structural and biophysical data and thereby enhance our understanding of membrane protein conformational mechanisms.
Resonance assignments are challenging for membrane proteins due to the size of the lipid/detergent-protein complex and the presence of line-broadening from conformational exchange. As a consequence, many correlations are missing in the triple-resonance NMR experiments typically used for assignments. Herein, we present an approach in which correlations from these solution-state NMR experiments are supplemented by data from 13C unlabeling, single-amino acid type labeling, 4D NOESY data and proximity of moieties to lipids or water in combination with a structure of the protein. These additional data are used to edit the expected peaklists for the automated assignment protocol FLYA, a module of the program package CYANA. We demonstrate application of the protocol to the 262-residue proton pump from archaeal bacteriorhodopsin (bR) in lipid nanodiscs. The lipid-protein assembly is characterized by an overall correlation time of 44 ns. The protocol yielded assignments for 62% of all backbone (H, N, Cα, Cβ, C′) resonances of bR, corresponding to 74% of all observed backbone spin systems, and 60% of the Ala, Met, Ile (δ1), Leu and Val methyl groups, thus enabling to assign a large fraction of the protein without mutagenesis data. Most missing resonances stem from the extracellular half, likely due intermediate exchange line-broadening. Further analysis revealed that missing information of the amino acid type of the preceding residue is the largest problem, and that 4D NOESY experiments are particularly helpful to compensate for that information loss.
The facile synthesis and detailed investigation of a class of highly potent protease inhibitors based on 1,4-naphthoquinones with a dipeptidic recognition motif (HN-l-Phe-l-Leu-OR) in the 2-position and an electron-withdrawing group (EWG) in the 3-position is presented. One of the compound representatives, namely the acid with EWG = CN and with R = H proved to be a highly potent rhodesain inhibitor with nanomolar affinity. The respective benzyl ester (R = Bn) was found to be hydrolyzed by the target enzyme itself yielding the free acid. Detailed kinetic and mass spectrometry studies revealed a reversible covalent binding mode. Theoretical calculations with different density functionals (DFT) as well as wavefunction-based approaches were performed to elucidate the mode of action.
Current metabolomics approaches utilize cellular metabolite extracts, are destructive, and require high cell numbers. We introduce here an approach that enables the monitoring of cellular metabolism at lower cell numbers by observing the consumption/production of different metabolites over several kinetic data points of up to 48 hours. Our approach does not influence cellular viability, as we optimized the cellular matrix in comparison to other materials used in a variety of in‐cell NMR spectroscopy experiments. We are able to monitor real‐time metabolism of primary patient cells, which are extremely sensitive to external stress. Measurements are set up in an interleaved manner with short acquisition times (approximately 7 minutes per sample), which allows the monitoring of up to 15 patient samples simultaneously. Further, we implemented our approach for performing tracer‐based assays. Our approach will be important not only in the metabolomics fields, but also in individualized diagnostics.
Current metabolomics approaches utilize cellular metabolite extracts, are destructive, and require high cell numbers. We introduce here an approach that enables the monitoring of cellular metabolism at lower cell numbers by observing the consumption/production of different metabolites over several kinetic data points of up to 48 hours. Our approach does not influence cellular viability, as we optimized the cellular matrix in comparison to other materials used in a variety of in‐cell NMR spectroscopy experiments. We are able to monitor real‐time metabolism of primary patient cells, which are extremely sensitive to external stress. Measurements are set up in an interleaved manner with short acquisition times (approximately 7 minutes per sample), which allows the monitoring of up to 15 patient samples simultaneously. Further, we implemented our approach for performing tracer‐based assays. Our approach will be important not only in the metabolomics fields, but also in individualized diagnostics.
Amorphous formulation technologies to improve oral absorption of poorly soluble active pharmaceutical ingredients (APIs) have become increasingly prevalent. Currently, polymer-based amorphous formulations manufactured by spray drying, hot melt extrusion (HME), or co-precipitation are most common. However, these technologies have challenges in terms of the successful stabilization of poor glass former compounds in the amorphous form. An alternative approach is mesoporous silica, which stabilizes APIs in non-crystalline form via molecular adsorption inside nano-scale pores. In line with these considerations, two poor glass formers, haloperidol and carbamazepine, were formulated as polymer-based solid dispersion via HME and with mesoporous silica, and their stability was compared under accelerated conditions. Changes were monitored over three months with respect to solid-state form and dissolution. The results were supported by solid-state nuclear magnetic resonance spectroscopy (SS-NMR) and scanning electron microscopy (SEM). It was demonstrated that mesoporous silica was more successful than HME in the stabilization of the selected poor glass formers. While both drugs remained non-crystalline during the study using mesoporous silica, polymer-based HME formulations showed recrystallization after one week. Thus, mesoporous silica represents an attractive technology to extend the formulation toolbox to poorly soluble poor glass formers.
PELDOR (pulse electron-electron double resonance) is an established method to study intramolecular distances and can give evidence for conformational changes and flexibilities. However, it can also be used to study intermolecular interactions as for example oligerimization. Here, we used PELDOR to study the ‘end-to-end’ stacking of small double stranded (ds)RNAs. For this study, the dsRNA molecules were only singly labelled with the spin label TPA to avoid multi-spin effects and to measure only the intermolecular stacking interactions. It can be shown that small dsRNAs tend to assemble to rod-like structures due to π-π-interactions between the base pairs at the end of the strands. On the one hand, these interactions can influence or complicate measurements aimed at the determining of the structure and dynamics of the dsRNA molecule itself. On the other hand, it can be interesting to study such intermolecular stacking interactions in more detail, as for example their dependence on ion concentration. We quantitatively determined the stacking probability as a function of the monovalent NaCl salt and the dsRNA concentration. From this data the dissociation constant Kd was deduced and found to depend on the ratio between the NaCl salt and dsRNA concentrations. Additionally, the distances and distance distributions obtained predict a model for the stacking geometry of dsRNAs. Introducing a nucleotide overhangs at one end of the dsRNA molecule restricts the stacking to the other end, leading only to dimer formations. Introducing such an overhang at both ends of the dsRNA molecule fully suppresses stacking, as we could demonstrate by PELDOR experiments quantitatively.
In every established species, protein-protein interactions have evolved such that they are fit for purpose. However, the molecular details of the evolution of new protein-protein interactions are poorly understood. We have used nuclear magnetic resonance spectroscopy to investigate the changes in structure and dynamics during the evolution of a protein-protein interaction involving the intrinsically disordered CREBBP (CREB-binding protein) interaction domain (CID) and nuclear coactivator binding domain (NCBD) from the transcriptional coregulators NCOA (nuclear receptor coactivator) and CREBBP/p300, respectively. The most ancient low-affinity “Cambrian-like” [540 to 600 million years (Ma) ago] CID/NCBD complex contained less secondary structure and was more dynamic than the complexes from an evolutionarily younger “Ordovician-Silurian” fish ancestor (ca. 440 Ma ago) and extant human. The most ancient Cambrian-like CID/NCBD complex lacked one helix and several interdomain interactions, resulting in a larger solvent-accessible surface area. Furthermore, the most ancient complex had a high degree of millisecond-to-microsecond dynamics distributed along the entire sequences of both CID and NCBD. These motions were reduced in the Ordovician-Silurian CID/NCBD complex and further redistributed in the extant human CID/NCBD complex. Isothermal calorimetry experiments show that complex formation is enthalpically favorable and that affinity is modulated by a largely unfavorable entropic contribution to binding. Our data demonstrate how changes in structure and motion conspire to shape affinity during the evolution of a protein-protein complex and provide direct evidence for the role of structural, dynamic, and frustrational plasticity in the evolution of interactions between intrinsically disordered proteins.
1H-detected solid-state NMR experiments feasible at fast magic-angle spinning (MAS) frequencies allow accessing 1H chemical shifts of proteins in solids, which enables their interpretation in terms of secondary structure. Here we present 1H and 13C-detected NMR spectra of the RNA polymerase subunit Rpo7 in complex with unlabeled Rpo4 and use the 13C, 15N, and 1H chemical-shift values deduced from them to study the secondary structure of the protein in comparison to a known crystal structure. We applied the automated resonance assignment approach FLYA including 1H-detected solid-state NMR spectra and show its success in comparison to manual spectral assignment. Our results show that reasonably reliable secondary-structure information can be obtained from 1H secondary chemical shifts (SCS) alone by using the sum of 1Hα and 1HN SCS rather than by TALOS. The confidence, especially at the boundaries of the observed secondary structure elements, is found to increase when evaluating 13C chemical shifts, here either by using TALOS or in terms of 13C SCS.
The tetracycline-binding RNA aptamer (TC-aptamer) is a synthetic riboswitch that binds the antibiotic tetracycline (TC) with exceptionally high affinity. Although a crystal structure exists of the TC-bound state, little is known about the conformational dynamics and changes upon ligand binding. In this study, pulsed electron paramagnetic resonance techniques for measuring distances (PELDOR) in combination with rigid nitroxide spin labels (Çm spin label) were used to investigate the conformational flexibility of the TC-aptamer in the presence and absence of TC at different Mg2+ concentrations. TC was found to be the essential factor for stabilizing the tertiary structure at intermediate Mg2+ concentrations. At higher Mg2+ concentrations, Mg2+ alone is sufficient to stabilize the tertiary structure. In addition, the orientation of the two spin-labeled RNA helices with respect to each other was analyzed with orientation-selective PELDOR and compared to the crystal structure. These results demonstrate for the first time the unique value of the Çm spin label in combination with PELDOR to provide information about conformational flexibilities and orientations of secondary structure elements of biologically relevant RNAs.
Web spiders connect silk proteins, so-called spidroins, into fibers of extraordinary toughness. The spidroin N-terminal domain (NTD) plays a pivotal role in this process: it polymerizes spidroins through a complex mechanism of dimerization. Here we analyze sequences of spidroin NTDs and find an unusually high content of the amino acid methionine. We simultaneously mutate all methionines present in the hydrophobic core of a spidroin NTD from a nursery web spider’s dragline silk to leucine. The mutated NTD is strongly stabilized and folds at the theoretical speed limit. The structure of the mutant is preserved, yet its ability to dimerize is substantially impaired. We find that side chains of core methionines serve to mobilize the fold, which can thereby access various conformations and adapt the association interface for tight binding. Methionine in a hydrophobic core equips a protein with the capacity to dynamically change shape and thus to optimize its function.
As adapter molecules to convert the nucleic acid information into the amino acid sequence, tRNAs play a central role in protein synthesis. To fulfill this function in a reliable way, tRNAs exhibit highly conserved structural features common in all organisms and in all cellular compartments active in translation. However, in mitochondria of metazoans, certain dramatic deviations from the consensus tRNA structure are described, where some tRNAs lack the D- or T-arm without losing their function. In Enoplea, this miniaturization comes to an extreme, and functional mitochondrial tRNAs can lack both arms, leading to a considerable size reduction. Here, we investigate the secondary and tertiary structure of two such armless tRNAs from Romanomermis culicivorax. Despite their high AU content, the transcripts fold into a single and surprisingly stable hairpin structure, deviating from standard tRNAs. The three-dimensional form is boomerang-like and diverges from the standard L-shape. These results indicate that such unconventional miniaturized tRNAs can still fold into a tRNA-like shape, although their length and secondary structure are very unusual. They highlight the remarkable flexibility of the protein synthesis apparatus and suggest that the translational machinery of Enoplea mitochondria may show compensatory adaptations to accommodate these armless tRNAs for efficient translation.
The neomycin sensing riboswitch is the smallest biologically functional RNA riboswitch, forming a hairpin capped with a U-turn loop—a well-known RNA motif containing a conserved uracil. It was shown previously that a U→C substitution of the eponymous conserved uracil does not alter the riboswitch structure due to C protonation at N3. Furthermore, cytosine is evolutionary permitted to replace uracil in other U-turns. Here, we use molecular dynamics simulations to study the molecular basis of this substitution in the neomycin sensing riboswitch and show that a structure-stabilizing monovalent cation-binding site in the wild-type RNA is the main reason for its negligible structural effect. We then use NMR spectroscopy to confirm the existence of this cation-binding site and to demonstrate its effects on RNA stability. Lastly, using quantum chemical calculations, we show that the cation-binding site is altering the electronic environment of the wild-type U-turn so that it is more similar to the cytosine mutant. The study reveals an amazingly complex and delicate interplay between various energy contributions shaping up the 3D structure and evolution of nucleic acids.
LILBID and nESI : different native mass spectrometry techniques as tools in structural biology
(2018)
Native mass spectrometry is applied for the investigation of proteins and protein complexes worldwide. The challenge in native mass spectrometry is maintaining the features of the proteins of interest, such as oligomeric state, bound ligands, or the conformation of the protein complex, during transfer from solution to gas phase. This is an essential prerequisite to allow conclusions about the solution state protein complex, based on the gas phase measurements. Therefore, soft ionization techniques are required. Widely used for the analysis of protein complexes are nanoelectro spray ionization (nESI) mass spectrometers. A newer ionization method is laser induced liquid bead ion desorption (LILBID), which is based on the release of protein complexes from solution phase via infrared (IR) laser desorption. We use both methods in our lab, depending on the requirements of the biological system we are interested in. Here we benchmark the performance of our LILBID mass spectrometer in comparison to a nESI instrument, regarding sample conditions, buffer and additive tolerances, dissociation mechanism and applicability towards soluble and membrane protein complexes.
Global response of diacylglycerol kinase towards substrate binding observed by 2D and 3D MAS NMR
(2019)
Escherichia coli diacylglycerol kinase (DGK) is an integral membrane protein, which catalyses the ATP-dependent phosphorylation of diacylglycerol (DAG) to phosphatic acid (PA). It is a unique trimeric enzyme, which does not share sequence homology with typical kinases. It exhibits a notable complexity in structure and function despite of its small size. Here, chemical shift assignment of wild-type DGK within lipid bilayers was carried out based on 3D MAS NMR, utilizing manual and automatic analysis protocols. Upon nucleotide binding, extensive chemical shift perturbations could be observed. These data provide evidence for a symmetric DGK trimer with all of its three active sites concurrently occupied. Additionally, we could detect that the nucleotide substrate induces a substantial conformational change, most likely directing DGK into its catalytic active form. Furthermore, functionally relevant interprotomer interactions are identified by DNP-enhanced MAS NMR in combination with site-directed mutagenesis and functional assays.
HUWE1 E3 ligase promotes PINK1/PARKIN-independent mitophagy by regulating AMBRA1 activation via IKKα
(2018)
The selective removal of undesired or damaged mitochondria by autophagy, known as mitophagy, is crucial for cellular homoeostasis, and prevents tumour diffusion, neurodegeneration and ageing. The pro-autophagic molecule AMBRA1 (autophagy/beclin-1 regulator-1) has been defined as a novel regulator of mitophagy in both PINK1/PARKIN-dependent and -independent systems. Here, we identified the E3 ubiquitin ligase HUWE1 as a key inducing factor in AMBRA1-mediated mitophagy, a process that takes place independently of the main mitophagy receptors. Furthermore, we show that mitophagy function of AMBRA1 is post-translationally controlled, upon HUWE1 activity, by a positive phosphorylation on its serine 1014. This modification is mediated by the IKKα kinase and induces structural changes in AMBRA1, thus promoting its interaction with LC3/GABARAP (mATG8) proteins and its mitophagic activity. Altogether, these results demonstrate that AMBRA1 regulates mitophagy through a novel pathway, in which HUWE1 and IKKα are key factors, shedding new lights on the regulation of mitochondrial quality control and homoeostasis in mammalian cells.
Several peptides in clinical use are derived from non-ribosomal peptide synthetases (NRPS). In these systems multiple NRPS subunits interact with each other in a specific linear order mediated by specific docking domains (DDs), whose structures are not known yet, to synthesize well-defined peptide products. In contrast to classical NRPSs, single-module NRPS subunits responsible for the generation of rhabdopeptide/xenortide-like peptides (RXPs) can act in different order depending on subunit stoichiometry thereby producing peptide libraries. To define the basis for their unusual interaction patterns, we determine the structures of all N-terminal DDs (NDDs) as well as of an NDD-CDD complex and characterize all putative DD interactions thermodynamically for such a system. Key amino acid residues for DD interactions are identified that upon their exchange change the DD affinity and result in predictable changes in peptide production. Recognition rules for DD interactions are identified that also operate in other megasynthase complexes.
NMR and chromatography methods combined with mass spectrometry are the most important analytical techniques employed for plant metabolomics screening. Metabolomic analysis integrated to transcriptome screening add an important extra dimension to the information flow from DNA to RNA to protein. The most useful NMR experiment in metabolomics analysis is the proton spectra due the high receptivity of 1H and important structural information, through proton–proton scalar coupling. Routinely, databases have been used in identification of primary metabolites, however, there is currently no comparable data for identification of secondary metabolites, mainly, due to signal overlap in normal 1H NMR spectra and natural variation of plant. Related to spectra overlap, alternatively, better resolution can be find using 1H pure shift and 2D NMR pulse sequence in complex samples due to spreading the resonances in a second dimension. Thus, in data brief we provide a catalogue of metabolites and expression levels of genes identified in soy leaves and roots under flooding stress.
The ATP-binding cassette transporter TAPL translocates polypeptides from the cytosol into the lysosomal lumen. TAPL can be divided into two functional units: coreTAPL, active in ATP-dependent peptide translocation, and the N-terminal membrane spanning domain, TMD0, responsible for cellular localization and interaction with the lysosomal associated membrane proteins LAMP-1 and LAMP-2. Although the structure and function of ABC transporters were intensively analyzed in the past, the knowledge about accessory membrane embedded domains is limited. Therefore, we expressed the TMD0 of TAPL via a cell-free expression system and confirmed its correct folding by NMR and interaction studies. In cell as well as cell-free expressed TMD0 forms oligomers, which were assigned as dimers by PELDOR spectroscopy and static light scattering. By NMR spectroscopy of uniformly and selectively isotope labeled TMD0 we performed a complete backbone and partial side chain assignment. Accordingly, TMD0 has a four transmembrane helix topology with a short helical segment in a lysosomal loop. The topology of TMD0 was confirmed by paramagnetic relaxation enhancement with paramagnetic stearic acid as well as by nuclear Overhauser effects with c6-DHPC and cross-peaks with water.
The mitophagy receptor Nix interacts with LC3/GABARAP proteins, targeting mitochondria into autophagosomes for degradation. Here we present evidence for phosphorylation-driven regulation of the Nix:LC3B interaction. Isothermal titration calorimetry and NMR indicate a ~100 fold enhanced affinity of the serine 34/35-phosphorylated Nix LC3-interacting region (LIR) to LC3B and formation of a very rigid complex compared to the non-phosphorylated sequence. Moreover, the crystal structure of LC3B in complex with the Nix LIR peptide containing glutamic acids as phosphomimetic residues and NMR experiments revealed that LIR phosphorylation stabilizes the Nix:LC3B complex via formation of two additional hydrogen bonds between phosphorylated serines of Nix LIR and Arg11, Lys49 and Lys51 in LC3B. Substitution of Lys51 to Ala in LC3B abrogates binding of a phosphomimetic Nix mutant. Functionally, serine 34/35 phosphorylation enhances autophagosome recruitment to mitochondria in HeLa cells. Together, this study provides cellular, biochemical and biophysical evidence that phosphorylation of the LIR domain of Nix enhances mitophagy receptor engagement.
The field of dynamic nuclear polarization has undergone tremendous developments and diversification since its inception more than 6 decades ago. In this review we provide an in-depth overview of the relevant topics involved in DNP-enhanced MAS NMR spectroscopy. This includes the theoretical description of DNP mechanisms as well as of the polarization transfer pathways that can lead to a uniform or selective spreading of polarization between nuclear spins. Furthermore, we cover historical and state-of-the art aspects of dedicated instrumentation, polarizing agents, and optimization techniques for efficient MAS DNP. Finally, we present an extensive overview on applications in the fields of structural biology and materials science, which underlines that MAS DNP has moved far beyond the proof-of-concept stage and has become an important tool for research in these fields.
A key function of reversible protein phosphorylation is to regulate protein–protein interactions, many of which involve short linear motifs (3–12 amino acids). Motif‐based interactions are difficult to capture because of their often low‐to‐moderate affinities. Here, we describe phosphomimetic proteomic peptide‐phage display, a powerful method for simultaneously finding motif‐based interaction and pinpointing phosphorylation switches. We computationally designed an oligonucleotide library encoding human C‐terminal peptides containing known or predicted Ser/Thr phosphosites and phosphomimetic variants thereof. We incorporated these oligonucleotides into a phage library and screened the PDZ (PSD‐95/Dlg/ZO‐1) domains of Scribble and DLG1 for interactions potentially enabled or disabled by ligand phosphorylation. We identified known and novel binders and characterized selected interactions through microscale thermophoresis, isothermal titration calorimetry, and NMR. We uncover site‐specific phospho‐regulation of PDZ domain interactions, provide a structural framework for how PDZ domains accomplish phosphopeptide binding, and discuss ligand phosphorylation as a switching mechanism of PDZ domain interactions. The approach is readily scalable and can be used to explore the potential phospho‐regulation of motif‐based interactions on a large scale.
The p53 family of transcription factors (p53, p63 and p73) covers a wide range of functions critical for development, homeostasis and health of mammals across their lifespan. Beside the well-established tumor suppressor role, recent evidence has highlighted novel non-oncogenic functions exerted by p73. In particular, p73 is required for multiciliated cell (MCC) differentiation; MCCs have critical roles in brain and airways to move fluids across epithelial surfaces and to transport germ cells in the reproductive tract. This novel function of p73 provides a unifying cellular mechanism for the disparate inflammatory and immunological phenotypes of p73-deficient mice. Indeed, mice with Trp73 deficiency suffer from hydrocephalus, sterility and chronic respiratory tract infections due to profound defects in ciliogenesis and complete loss of mucociliary clearance since MCCs are essential for cleaning airways from inhaled pollutants, pathogens and allergens. Cross-species genomic analyses and functional rescue experiments identify TAp73 as the master transcriptional integrator of ciliogenesis, upstream of previously known central nodes. In addition, TAp73 shows a significant ability to regulate cellular metabolism and energy production through direct transcriptional regulation of several metabolic enzymes, such as glutaminase-2 and glucose-6 phosphate dehydrogenase. This recently uncovered role of TAp73 in the regulation of cellular metabolism strongly affects oxidative balance, thus potentially influencing all the biological aspects associated with p73 function, including development, homeostasis and cancer. Although through different mechanisms, p63 isoforms also contribute to regulation of cellular metabolism, thus indicating a common route used by all family members to control cell fate. At the structural level, the complexity of p73's function is further enhanced by its ability to form heterotetramers with some p63 isoforms, thus indicating the existence of an intrafamily crosstalk that determines the global outcome of p53 family function. In this review, we have tried to summarize all the recent evidence that have emerged on the novel non-oncogenic roles of p73, in an attempt to provide a unified view of the complex function of this gene within its family.
The bile acid activated transcription factor farnesoid X receptor (FXR) regulates numerous metabolic processes and is a rising target for the treatment of hepatic and metabolic disorders. FXR agonists have revealed efficacy in treating non-alcoholic steatohepatitis (NASH), diabetes and dyslipidemia. Here we characterize imatinib as first-in-class allosteric FXR modulator and report the development of an optimized descendant that markedly promotes agonist induced FXR activation in a reporter gene assay and FXR target gene expression in HepG2 cells. Differential effects of imatinib on agonist-induced bile salt export protein and small heterodimer partner expression suggest that allosteric FXR modulation could open a new avenue to gene-selective FXR modulators.
The identification of inhibitors of eukaryotic protein biosynthesis, which are targeting single translation factors, is highly demanded. Here we report on a small molecule inhibitor, gephyronic acid, isolated from the myxobacterium Archangium gephyra that inhibits growth of transformed mammalian cell lines in the nM range. In direct comparison, primary human fibroblasts were shown to be less sensitive to toxic effects of gephyronic acid than cancer-derived cells. Gephyronic acid is targeting the protein translation system. Experiments with IRES dual luciferase reporter assays identified it as an inhibitor of the translation initiation. DARTs approaches, co-localization studies and pull-down assays indicate that the binding partner could be the eukaryotic initiation factor 2 subunit alpha (eIF2α). Gephyronic acid seems to have a different mode of action than the structurally related polyketides tedanolide, myriaporone, and pederin and is a valuable tool for investigating the eukaryotic translation system. Because cancer derived cells were found to be especially sensitive, gephyronic acid could potentially find use as a drug candidate.
TEMPO spin labels protected with 2-nitrobenzyloxymethyl groups were attached to the amino residues of three different nucleosides: deoxycytidine, deoxyadenosine, and adenosine. The corresponding phosphoramidites could be incorporated by unmodified standard procedures into four different self-complementary DNA and two RNA oligonucleotides. After photochemical removal of the protective group, elimination of formic aldehyde and spontaneous air oxidation, the nitroxide radicals were regenerated in high yield. The resulting spin-labeled palindromic duplexes could be directly investigated by PELDOR spectroscopy without further purification steps. Spin–spin distances measured by PELDOR correspond well to the values obtained from molecular models.
Up to now, very small protein-coding genes have remained unrecognized in sequenced genomes. We identified an mRNA of 165 nucleotides (nt), which is conserved in Bradyrhizobiaceae and encodes a polypeptide with 14 amino acid residues (aa). The small mRNA harboring a unique Shine-Dalgarno sequence (SD) with a length of 17 nt was localized predominantly in the ribosome-containing P100 fraction of Bradyrhizobium japonicum USDA 110. Strong interaction between the mRNA and 30S ribosomal subunits was demonstrated by their co-sedimentation in sucrose density gradient. Using translational fusions with egfp, we detected weak translation and found that it is impeded by both the extended SD and the GTG start codon (instead of ATG). Biophysical characterization (CD- and NMR-spectroscopy) showed that synthesized polypeptide remained unstructured in physiological puffer. Replacement of the start codon by a stop codon increased the stability of the transcript, strongly suggesting additional posttranscriptional regulation at the ribosome. Therefore, the small gene was named rreB (ribosome-regulated expression in Bradyrhizobiaceae). Assuming that the unique ribosome binding site (RBS) is a hallmark of rreB homologs or similarly regulated genes, we looked for similar putative RBS in bacterial genomes and detected regions with at least 16 nt complementarity to the 3′-end of 16S rRNA upstream of sORFs in Caulobacterales, Rhizobiales, Rhodobacterales and Rhodospirillales. In the Rhodobacter/Roseobacter lineage of α-proteobacteria the corresponding gene (rreR) is conserved and encodes an 18 aa protein. This shows how specific RBS features can be used to identify new genes with presumably similar control of expression at the RNA level.
Investigating three-dimensional (3D) structures of proteins in living cells by in-cell nuclear magnetic resonance (NMR) spectroscopy opens an avenue towards understanding the structural basis of their functions and physical properties under physiological conditions inside cells. In-cell NMR provides data at atomic resolution non-invasively, and has been used to detect protein-protein interactions, thermodynamics of protein stability, the behavior of intrinsically disordered proteins, etc. in cells. However, so far only a single de novo 3D protein structure could be determined based on data derived only from in-cell NMR. Here we introduce methods that enable in-cell NMR protein structure determination for a larger number of proteins at concentrations that approach physiological ones. The new methods comprise (1) advances in the processing of non-uniformly sampled NMR data, which reduces the measurement time for the intrinsically short-lived in-cell NMR samples, (2) automatic chemical shift assignment for obtaining an optimal resonance assignment, and (3) structure refinement with Bayesian inference, which makes it possible to calculate accurate 3D protein structures from sparse data sets of conformational restraints. As an example application we determined the structure of the B1 domain of protein G at about 250 μM concentration in living E. coli cells.
Modification of SMN2 exon 7 (E7) splicing is a validated therapeutic strategy against spinal muscular atrophy (SMA). However, a target-based approach to identify small-molecule E7 splicing modifiers has not been attempted, which could reveal novel therapies with improved mechanistic insight. Here, we chose as a target the stem-loop RNA structure TSL2, which overlaps with the 5′ splicing site of E7. A small-molecule TSL2-binding compound, homocarbonyltopsentin (PK4C9), was identified that increases E7 splicing to therapeutic levels and rescues downstream molecular alterations in SMA cells. High-resolution NMR combined with molecular modelling revealed that PK4C9 binds to pentaloop conformations of TSL2 and promotes a shift to triloop conformations that display enhanced E7 splicing. Collectively, our study validates TSL2 as a target for small-molecule drug discovery in SMA, identifies a novel mechanism of action for an E7 splicing modifier, and sets a precedent for other splicing-mediated diseases where RNA structure could be similarly targeted.
Protein aggregation of the p63 transcription factor underlies severe skin fragility in AEC syndrome
(2018)
The p63 gene encodes a master regulator of epidermal commitment, development, and differentiation. Heterozygous mutations in the C-terminal domain of the p63 gene can cause ankyloblepharon-ectodermal defects-cleft lip/palate (AEC) syndrome, a life-threatening disorder characterized by skin fragility and severe, long-lasting skin erosions. Despite deep knowledge of p63 functions, little is known about mechanisms underlying disease pathology and possible treatments. Here, we show that multiple AEC-associated p63 mutations, but not those causative of other diseases, lead to thermodynamic protein destabilization, misfolding, and aggregation, similar to the known p53 gain-of-function mutants found in cancer. AEC mutant proteins exhibit impaired DNA binding and transcriptional activity, leading to dominant negative effects due to coaggregation with wild-type p63 and p73. Importantly, p63 aggregation occurs also in a conditional knock-in mouse model for the disorder, in which the misfolded p63 mutant protein leads to severe epidermal defects. Variants of p63 that abolish aggregation of the mutant proteins are able to rescue p63’s transcriptional function in reporter assays as well as in a human fibroblast-to-keratinocyte conversion assay. Our studies reveal that AEC syndrome is a protein aggregation disorder and opens avenues for therapeutic intervention.
Membrane proteins frequently assemble into higher order homo- or hetero-oligomers within their natural lipid environment. This complex formation can modulate their folding, activity as well as substrate selectivity. Non-disruptive methods avoiding critical steps, such as membrane disintegration, transfer into artificial environments or chemical modifications are therefore essential to analyze molecular mechanisms of native membrane protein assemblies. The combination of cell-free synthetic biology, nanodisc-technology and non-covalent mass spectrometry provides excellent synergies for the analysis of membrane protein oligomerization within defined membranes. We exemplify our strategy by oligomeric state characterization of various membrane proteins including ion channels, transporters and membrane-integrated enzymes assembling up to hexameric complexes. We further indicate a lipid-dependent dimer formation of MraY translocase correlating with the enzymatic activity. The detergent-free synthesis of membrane protein/nanodisc samples and the analysis by LILBID mass spectrometry provide a versatile platform for the analysis of membrane proteins in a native environment.
NMR spectroscopy is a powerful technique to study ribonucleic acids (RNAs) which are key players in a plethora of cellular processes. Although the NMR toolbox for structural studies of RNAs expanded during the last decades, they often remain challenging. Here, we show that solvent paramagnetic relaxation enhancements (sPRE) induced by the soluble, paramagnetic compound Gd(DTPA-BMA) provide a quantitative measure for RNA solvent accessibility and encode distance-to-surface information that correlates well with RNA structure and improves accuracy and convergence of RNA structure determination. Moreover, we show that sPRE data can be easily obtained for RNAs with any isotope labeling scheme and is advantageous regarding sample preparation, stability and recovery. sPRE data show a large dynamic range and reflect the global fold of the RNA suggesting that they are well suited to identify interaction surfaces, to score structural models and as restraints in RNA structure determination.
Structural and functional dissection of the DH and PH domains of oncogenic Bcr-Abl tyrosine kinase
(2017)
The two isoforms of the Bcr-Abl tyrosine kinase, p210 and p190, are associated with different leukemias and have a dramatically different signaling network, despite similar kinase activity. To provide a molecular rationale for these observations, we study the Dbl-homology (DH) and Pleckstrin-homology (PH) domains of Bcr-Abl p210, which constitute the only structural differences to p190. Here we report high-resolution structures of the DH and PH domains and characterize conformations of the DH–PH unit in solution. Our structural and functional analyses show no evidence that the DH domain acts as a guanine nucleotide exchange factor, whereas the PH domain binds to various phosphatidylinositol-phosphates. PH-domain mutants alter subcellular localization and result in decreased interactions with p210-selective interaction partners. Hence, the PH domain, but not the DH domain, plays an important role in the formation of the differential p210 and p190 Bcr-Abl signaling networks.
"Ästhetisch ist, was hilft"
(2017)
The full-length translation-regulating add adenine riboswitch (Asw) from Vibrio vulnificus has a more complex conformational space than its isolated aptamer domain. In addition to the predicted apo (apoA) and holo conformation that feature the conserved three-way junctional purine riboswitch aptamer, it adopts a second apo (apoB) conformation with a fundamentally different secondary structure. Here, we characterized the ligand-dependent conformational dynamics of the full-length add Asw by NMR and by single-molecule FRET (smFRET) spectroscopy. Both methods revealed an adenine-induced secondary structure switch from the apoB-form to the apoA-form that involves no tertiary structural interactions between aptamer and expression platform. This strongly suggests that the add Asw triggers translation by capturing the apoA-form secondary structure in the holo state. Intriguingly, NMR indicated a homogenous, docked aptamer kissing loop fold for apoA and holo, while smFRET showed persistent aptamer kissing loop docking dynamics between comparably stable, undocked and docked substates of the apoA and the holo conformation. Unraveling the folding of large junctional riboswitches thus requires the integration of complementary solution structural techniques such as NMR and smFRET.
ATP-binding cassette (ABC) transporters, a superfamily of integral membrane proteins, catalyse the translocation of substrates across the cellular membrane by ATP hydrolysis. Here we demonstrate by nucleotide turnover and binding studies based on 31P solid-state NMR spectroscopy that the ABC exporter and lipid A flippase MsbA can couple ATP hydrolysis to an adenylate kinase activity, where ADP is converted into AMP and ATP. Single-point mutations reveal that both ATPase and adenylate kinase mechanisms are associated with the same conserved motifs of the nucleotide-binding domain. Based on these results, we propose a model for the coupled ATPase-adenylate kinase mechanism, involving the canonical and an additional nucleotide-binding site. We extend these findings to other prokaryotic ABC exporters, namely LmrA and TmrAB, suggesting that the coupled activities are a general feature of ABC exporters.
Proton-pumping complex I of the mitochondrial respiratory chain is among the largest and most complex membrane protein complexes. The enzyme contributes substantially to oxidative energy-conversion in eukaryotic cells. Its malfunctions are implicated in many hereditary and degenerative disorders. Here, we report the X-ray structure of mitochondrial complex I at 3.6- 3.9 Å resolution describing in detail the central subunits that execute the bioenergetic function. A continuous axis of basic and acidic residues running centrally through the membrane arm connects the ubiquinone reduction site in the hydrophilic arm to four putative proton-pumping units. The binding position for a substrate analogous inhibitor and blockage of the predicted ubiquinone binding site provide a model for the ‘deactive’ form of the enzyme. The proposed transition into the active form is based on a concerted structural rearrangement at the ubiquinone reduction site rendering support for a two-state stabilization-change mechanism of protonpumping.
Mammalian oocytes are arrested in the dictyate stage of meiotic prophase I for long periods of time, during which the high concentration of the p53 family member TAp63α sensitizes them to DNA damage-induced apoptosis. TAp63α is kept in an inactive and exclusively dimeric state but undergoes rapid phosphorylation-induced tetramerization and concomitant activation upon detection of DNA damage. Here we show that the TAp63α dimer is a kinetically trapped state. Activation follows a spring-loaded mechanism not requiring further translation of other cellular factors in oocytes and is associated with unfolding of the inhibitory structure that blocks the tetramerization interface. Using a combination of biophysical methods as well as cell and ovary culture experiments we explain how TAp63α is kept inactive in the absence of DNA damage but causes rapid oocyte elimination in response to a few DNA double strand breaks thereby acting as the key quality control factor in maternal reproduction.
In bacteria, the regulation of gene expression by cis-acting transcriptional riboswitches located in the 5'-untranslated regions of messenger RNA requires the temporal synchronization of RNA synthesis and ligand binding-dependent conformational refolding. Ligand binding to the aptamer domain of the riboswitch induces premature termination of the mRNA synthesis of ligand-associated genes due to the coupled formation of 3'-structural elements acting as terminators. To date, there has been no high resolution structural description of the concerted process of synthesis and ligand-induced restructuring of the regulatory RNA element. Here, we show that for the guanine-sensing xpt-pbuX riboswitch from Bacillus subtilis, the conformation of the full-length transcripts is static: it exclusively populates the functional off-state but cannot switch to the on-state, regardless of the presence or absence of ligand. We show that only the combined matching of transcription rates and ligand binding enables transcription intermediates to undergo ligand-dependent conformational refolding.
Many naturally occurring or artificially created RNAs are capable of binding to guanine or guanine derivatives with high affinity and selectivity. They bind their ligands using very different recognition modes involving a diverse set of hydrogen bonding and stacking interactions. Apparently, the potential structural diversity for guanine, guanosine, and guanine nucleotide binding motifs is far from being fully explored. Szostak and coworkers have derived a large set of different GTP-binding aptamer families differing widely in sequence, secondary structure, and ligand specificity. The so-called class V–GTP aptamer from this set binds GTP with very high affinity and has a complex secondary structure. Here we use solution NMR spectroscopy to demonstrate that the class V aptamer binds GTP through the formation of an intermolecular two-layered G-quadruplex structure that directly incorporates the ligand and folds only upon ligand addition. Ligand binding and G-quadruplex formation depend strongly on the identity of monovalent cations present with a clear preference for potassium ions. GTP binding through direct insertion into an intermolecular G-quadruplex is a previously unobserved structural variation for ligand-binding RNA motifs and rationalizes the previously observed specificity pattern of the class V aptamer for GTP analogs.
The spliceosomal protein SF3b49, a component of the splicing factor 3b (SF3b) protein complex in the U2 small nuclear ribonucleoprotein, contains two RNA recognition motif (RRM) domains. In yeast, the first RRM domain (RRM1) of Hsh49 protein (yeast orthologue of human SF3b49) reportedly interacts with another component, Cus1 protein (orthologue of human SF3b145). Here, we solved the solution structure of the RRM1 of human SF3b49 and examined its mode of interaction with a fragment of human SF3b145 using NMR methods. Chemical shift mapping showed that the SF3b145 fragment spanning residues 598-631 interacts with SF3b49 RRM1, which adopts a canonical RRM fold with a topology of β1-α1-β2-β3-α2-β4. Furthermore, a docking model based on NOESY measurements suggests that residues 607-616 of the SF3b145 fragment adopt a helical structure that binds to RRM1 predominantly via α1, consequently exhibiting a helix-helix interaction in almost antiparallel. This mode of interaction was confirmed by a mutational analysis using GST pull-down assays. Comparison with structures of all RRM domains when complexed with a peptide found that this helix-helix interaction is unique to SF3b49 RRM1. Additionally, all amino acid residues involved in the interaction are well conserved among eukaryotes, suggesting evolutionary conservation of this interaction mode between SF3b49 RRM1 and SF3b145.
Mistakes in translation of messenger RNA into protein are clearly a detriment to the recombinant production of pure proteins for biophysical study or the biopharmaceutical market. However, they may also provide insight into mechanistic details of the translation process. Mistakes often involve the substitution of an amino acid having an abundant codon for one having a rare codon, differing by substitution of a G base by an A base, as in the case of substitution of a lysine (AAA) for arginine (AGA). In these cases one expects the substitution frequency to depend on the relative abundances of the respective tRNAs, and thus, one might expect frequencies to be similar for all sites having the same rare codon. Here we demonstrate that, for the ADP-ribosylation factor from yeast expressed in E. coli, lysine for arginine substitutions frequencies are not the same at the 9 sites containing a rare arginine codon; mis-incorporation frequencies instead vary from less than 1 to 16%. We suggest that the context in which the codons occur (clustering of rare sites) may be responsible for the variation. The method employed to determine the frequency of mis-incorporation involves a novel mass spectrometric analysis of the products from the parallel expression of wild type and codon-optimized genes in 15N and 14N enriched media, respectively. The high sensitivity and low material requirements of the method make this a promising technology for the collection of data relevant to other mis-incorporations. The additional data could be of value in refining models for the ribosomal translation elongation process.
Plant-released flavonoids induce the transcription of symbiotic genes in rhizobia and one of the first bacterial responses is the synthesis of so called Nod factors. They are responsible for the initial root hair curling during onset of root nodule development. This signal exchange is believed to be essential for initiating the plant symbiosis with rhizobia affiliated with the Alphaproteobacteria. Here, we provide evidence that in the broad host range strain Sinorhizobium fredii NGR234 the complete lack of quorum sensing molecules results in an elevated copy number of its symbiotic plasmid (pNGR234a). This in turn triggers the expression of symbiotic genes and the production of Nod factors in the absence of plant signals. Therefore, increasing the copy number of specific plasmids could be a widespread mechanism of specialized bacterial populations to bridge gaps in signaling cascades.
Relative orientation of POTRA domains from cyanobacterial Omp85 studied by pulsed EPR spectroscopy
(2016)
Many proteins of the outer membrane of Gram-negative bacteria and of the outer envelope of the endosymbiotically derived organelles mitochondria and plastids have a β-barrel fold. Their insertion is assisted by membrane proteins of the Omp85-TpsB superfamily. These proteins are composed of a C-terminal β-barrel and a different number of N-terminal POTRA domains, three in the case of cyanobacterial Omp85. Based on structural studies of Omp85 proteins, including the five POTRA-domain-containing BamA protein of Escherichia coli, it is predicted that anaP2 and anaP3 bear a fixed orientation, whereas anaP1 and anaP2 are connected via a flexible hinge. We challenged this proposal by investigating the conformational space of the N-terminal POTRA domains of Omp85 from the cyanobacterium Anabaena sp. PCC 7120 using pulsed electron-electron double resonance (PELDOR, or DEER) spectroscopy. The pronounced dipolar oscillations observed for most of the double spin-labeled positions indicate a rather rigid orientation of the POTRA domains in frozen liquid solution. Based on the PELDOR distance data, structure refinement of the POTRA domains was performed taking two different approaches: 1) treating the individual POTRA domains as rigid bodies; and 2) using an all-atom refinement of the structure. Both refinement approaches yielded ensembles of model structures that are more restricted compared to the conformational ensemble obtained by molecular dynamics simulations, with only a slightly different orientation of N-terminal POTRA domains anaP1 and anaP2 compared with the x-ray structure. The results are discussed in the context of the native environment of the POTRA domains in the periplasm.
We investigate complexes of two paramagnetic metal ions Gd3+ and Mn2+ to serve as polarizing agents for solid-state dynamic nuclear polarization (DNP) of 1H, 13C, and 15N at magnetic fields of 5, 9.4, and 14.1 T. Both ions are half-integer high-spin systems with a zero-field splitting and therefore exhibit a broadening of the mS = −1/2 ↔ +1/2 central transition which scales inversely with the external field strength. We investigate experimentally the influence of the chelator molecule, strong hyperfine coupling to the metal nucleus, and deuteration of the bulk matrix on DNP properties. At small Gd-DOTA concentrations the narrow central transition allows us to polarize nuclei with small gyromagnetic ratio such as 13C and even 15N via the solid effect. We demonstrate that enhancements observed are limited by the available microwave power and that large enhancement factors of >100 (for 1H) and on the order of 1000 (for 13C) can be achieved in the saturation limit even at 80 K. At larger Gd(III) concentrations (≥10 mM) where dipolar couplings between two neighboring Gd3+ complexes become substantial a transition towards cross effect as dominating DNP mechanism is observed. Furthermore, the slow spin-diffusion between 13C and 15N, respectively, allows for temporally resolved observation of enhanced polarization spreading from nuclei close to the paramagnetic ion towards nuclei further removed. Subsequently, we present preliminary DNP experiments on ubiquitin by site-directed spin-labeling with Gd3+ chelator tags. The results hold promise towards applications of such paramagnetically labeled proteins for DNP applications in biophysical chemistry and/or structural biology.
Structural biology and life sciences in general, and NMR in particular, have always been associated with advanced computing. The current challenges in the post-genomic era call for virtual research platforms that provide the worldwide research community with both user-friendly tools, platforms for data analysis and exchange, and an underlying e-Infrastructure. WeNMR, a three-year European Commission co-funded project started in November 2010, groups different research teams into a worldwide virtual research community. It builds on the established eNMR e-Infrastructure and its steadily growing virtual organisation, which is currently the second largest VO in the area of life sciences. WeNMR provides an e-Infrastructure platform and Science Gateway for structural biology. It involves researchers from around the world and will build bridges to other areas of structural biology.
Structured RNA regions are important gene control elements in prokaryotes and eukaryotes. Here, we show that the mRNA of a cyanobacterial heat shock gene contains a built-in thermosensor critical for photosynthetic activity under stress conditions. The exceptionally short 5´-untranslated region is comprised of a single hairpin with an internal asymmetric loop. It inhibits translation of the Synechocystis hsp17 transcript at normal growth conditions, permits translation initiation under stress conditions and shuts down Hsp17 production in the recovery phase. Point mutations that stabilized or destabilized the RNA structure deregulated reporter gene expression in vivo and ribosome binding in vitro. Introduction of such point mutations into the Synechocystis genome produced severe phenotypic defects. Reversible formation of the open and closed structure was beneficial for viability, integrity of the photosystem and oxygen evolution. Continuous production of Hsp17 was detrimental when the stress declined indicating that shutting-off heat shock protein production is an important, previously unrecognized function of RNA thermometers. We discovered a simple biosensor that strictly adjusts the cellular level of a molecular chaperone to the physiological need.
The mfl-riboswitch regulates expression of ribonucleotide reductase subunit in Mesoplasma florum by binding to 2´-deoxyguanosine and thereby promoting transcription termination. We characterized the structure of the ligand-bound aptamer domain by NMR spectroscopy and compared the mfl-aptamer to the aptamer domain of the closely related purine-sensing riboswitches. We show that the mfl-aptamer accommodates the extra 2´-deoxyribose unit of the ligand by forming a more relaxed binding pocket than these found in the purine-sensing riboswitches. Tertiary structures of the xpt-aptamer bound to guanine and of the mfl-aptamer bound to 2´-deoxyguanosine exhibit very similar features, although the sequence of the mfl-aptamer contains several alterations compared to the purine-aptamer consensus sequence. These alterations include the truncation of a hairpin loop which is crucial for complex formation in all purine-sensing riboswitches characterized to date. We further defined structural features and ligand binding requirements of the free mfl-aptamer and found that the presence of Mg2+ is not essential for complex formation, but facilitates ligand binding by promoting pre-organization of key structural motifs in the free aptamer.
Background: The automation of objectively selecting amino acid residue ranges for structure superpositions is important for meaningful and consistent protein structure analyses. So far there is no widely-used standard for choosing these residue ranges for experimentally determined protein structures, where the manual selection of residue ranges or the use of suboptimal criteria remain commonplace. Results: We present an automated and objective method for finding amino acid residue ranges for the superposition and analysis of protein structures, in particular for structure bundles resulting from NMR structure calculations. The method is implemented in an algorithm, CYRANGE, that yields, without protein-specific parameter adjustment, appropriate residue ranges in most commonly occurring situations, including low-precision structure bundles, multi-domain proteins, symmetric multimers, and protein complexes. Residue ranges are chosen to comprise as many residues of a protein domain that increasing their number would lead to a steep rise in the RMSD value. Residue ranges are determined by first clustering residues into domains based on the distance variance matrix, and then refining for each domain the initial choice of residues by excluding residues one by one until the relative decrease of the RMSD value becomes insignificant. A penalty for the opening of gaps favours contiguous residue ranges in order to obtain a result that is as simple as possible, but not simpler. Results are given for a set of 37 proteins and compared with those of commonly used protein structure validation packages. We also provide residue ranges for 6351 NMR structures in the Protein Data Bank. Conclusions: The CYRANGE method is capable of automatically determining residue ranges for the superposition of protein structure bundles for a large variety of protein structures. The method correctly identifies ordered regions. Global structure superpositions based on the CYRANGE residue ranges allow a clear presentation of the structure, and unnecessary small gaps within the selected ranges are absent. In the majority of cases, the residue ranges from CYRANGE contain fewer gaps and cover considerably larger parts of the sequence than those from other methods without significantly increasing the RMSD values. CYRANGE thus provides an objective and automatic method for standardizing the choice of residue ranges for the superposition of protein structures. Additional files Additional file 1: Dependence of Q on the order parameter rank. The quantity Qi is plotted against the order parameter rank i for 9 different protein structure bundles. Additional file 2: Dependence of P on the clustering stage. The quantity Pi is plotted against the clustering stage i for 9 different protein structure bundles. Additional file 3: Dependence of CYRANGE results on the minimal cluster size parameter my. The sequence coverage (red) and RMSD (blue) of the residue ranges determined by CYRANGE were plotted as a function of my for 9 different protein structure bundles. The dotted vertical line indicates the default value, my = 8. Where CYRANGE found two domains, the RMSD values of the individual domains are shown in light and dark blue. Additional file 4: Dependence of CYRANGE results on the domain boundary extension parameter m. See Additional File 3 for details. Additional file 5: Dependence of CYRANGE results on the minimal gap width g. See Additional File 3 for details. Additional file 6: Dependence of CYRANGE results on the relative RMSD decrease parameter delta. See Additional File 3 for details. Additional file 7: Dependence of CYRANGE results on the absolute RMSD decrease parameter delta abs. See Additional File 3 for details. Additional file 8: Dependence of CYRANGE results on the gap penalty parameter gamma. See Additional File 3 for details. Additional file 9: Correlation between the sequence coverage from CYRANGE, FindCore and PSVS, and the GDT total score, GDT_TS. Each data point represents a protein shown in Figures 3 and 4. The coverage is the percentage of amino acid residues included in the residue ranges found by the different methods. The GDT_TS value is defined by GDT_TS = (P1 + P2 + P4 + P8)/4, where Pd is the fraction of residues that can be superimposed under a distance cutoff of d Å. Additional file 10: Correlation between the RMSD value for the residue ranges from CYRANGE, FindCore and PSVS, and the GDT total score, GDT_TS. Each data point represents one protein domain. See Additional File 9 for details.
G-quadruplex topologies of telomeric repeat sequences from vertebrates were investigated in the presence of molecular crowding (MC) mimetics, namely polyethylene glycol 200 (PEG), Ficoll 70 as well as Xenopus laevis egg extract by CD and NMR spectroscopy and native PAGE. Here, we show that the conformational behavior of the telomeric repeats in X. laevis egg extract or in Ficoll is notably different from that observed in the presence of PEG. While the behavior of the telomeric repeat in X. laevis egg extract or in Ficoll resembles results obtained under dilute conditions, PEG promotes the formation of high-order parallel topologies. Our data suggest that PEG should not be used as a MC mimetic.