Biochemie und Chemie
Refine
Year of publication
Document Type
- Article (15)
Language
- English (15)
Has Fulltext
- yes (15)
Is part of the Bibliography
- no (15)
Keywords
- Solution-state NMR (3)
- Kinases (2)
- AEC syndrome (1)
- Biopolymers in vivo (1)
- Computational models (1)
- Data processing (1)
- G protein-coupled receptor (GPCR) (1)
- Membrane protein (1)
- Molecular conformation (1)
- Nanodisc (1)
Institute
- Zentrum für Biomolekulare Magnetische Resonanz (BMRZ) (15) (remove)
We compiled an NMR data set consisting of exact nuclear Overhauser enhancement (eNOE) distance limits, residual dipolar couplings (RDCs) and scalar (J) couplings for GB3, which forms one of the largest and most diverse data set for structural characterization of a protein to date. All data have small experimental errors, which are carefully estimated. We use the data in the research article Vogeli et al., 2015, Complementarity and congruence between exact NOEs and traditional NMR probes for spatial decoding of protein dynamics, J. Struct. Biol., 191, 3, 306–317, doi:10.1016/j.jsb.2015.07.008 [1] for cross-validation in multiple-state structural ensemble calculation. We advocate this set to be an ideal test case for molecular dynamics simulations and structure calculations.
Human Transformer2-beta (hTra2-beta) is an important member of the serine/arginine-rich protein family, and contains one RNA recognition motif (RRM). It controls the alternative splicing of several pre-mRNAs, including those of the calcitonin/calcitonin gene-related peptide (CGRP), the survival motor neuron 1 (SMN1) protein and the tau protein. Accordingly, the RRM of hTra2-beta specifically binds to two types of RNA sequences [the CAA and (GAA)2 sequences]. We determined the solution structure of the hTra2-beta RRM (spanning residues Asn110–Thr201), which not only has a canonical RRM fold, but also an unusual alignment of the aromatic amino acids on the beta-sheet surface. We then solved the complex structure of the hTra2-beta RRM with the (GAA)2 sequence, and found that the AGAA tetra-nucleotide was specifically recognized through hydrogen-bond formation with several amino acids on the N- and C-terminal extensions, as well as stacking interactions mediated by the unusually aligned aromatic rings on the beta-sheet surface. Further NMR experiments revealed that the hTra2-beta RRM recognizes the CAA sequence when it is integrated in the stem-loop structure. This study indicates that the hTra2-beta RRM recognizes two types of RNA sequences in different RNA binding modes.
The CUG-binding protein 1 (CUG-BP1) is a member of the CUG-BP1 and ETR-like factors (CELF) family or the Bruno-like family and is involved in the control of splicing, translation and mRNA degradation. Several target RNA sequences of CUG-BP1 have been predicted, such as the CUG triplet repeat, the GU-rich sequences and the AU-rich element of nuclear pre-mRNAs and/or cytoplasmic mRNA. CUG-BP1 has three RNA-recognition motifs (RRMs), among which the third RRM (RRM3) can bind to the target RNAs on its own. In this study, we solved the solution structure of the CUG-BP1 RRM3 by hetero-nuclear NMR spectroscopy. The CUG-BP1 RRM3 exhibited a noncanonical RRM fold, with the four-stranded b-sheet surface tightly associated with the N-terminal extension. Furthermore, we determined the solution structure of the CUG-BP1 RRM3 in the complex with (UG)3 RNA, and discovered that the UGU trinucleotide is specifically recognized through extensive stacking interactions and hydrogen bonds within the pocket formed by the b-sheet surface and the N-terminal extension. This study revealed the unique mechanism that enables the CUG-BP1 RRM3 to discriminate the short RNA segment from other sequences, thus providing the molecular basis for the comprehension of the role of the RRM3s in the CELF/Bruno-like family.
1H-detected solid-state NMR experiments feasible at fast magic-angle spinning (MAS) frequencies allow accessing 1H chemical shifts of proteins in solids, which enables their interpretation in terms of secondary structure. Here we present 1H and 13C-detected NMR spectra of the RNA polymerase subunit Rpo7 in complex with unlabeled Rpo4 and use the 13C, 15N, and 1H chemical-shift values deduced from them to study the secondary structure of the protein in comparison to a known crystal structure. We applied the automated resonance assignment approach FLYA including 1H-detected solid-state NMR spectra and show its success in comparison to manual spectral assignment. Our results show that reasonably reliable secondary-structure information can be obtained from 1H secondary chemical shifts (SCS) alone by using the sum of 1Hα and 1HN SCS rather than by TALOS. The confidence, especially at the boundaries of the observed secondary structure elements, is found to increase when evaluating 13C chemical shifts, here either by using TALOS or in terms of 13C SCS.
A key function of reversible protein phosphorylation is to regulate protein–protein interactions, many of which involve short linear motifs (3–12 amino acids). Motif‐based interactions are difficult to capture because of their often low‐to‐moderate affinities. Here, we describe phosphomimetic proteomic peptide‐phage display, a powerful method for simultaneously finding motif‐based interaction and pinpointing phosphorylation switches. We computationally designed an oligonucleotide library encoding human C‐terminal peptides containing known or predicted Ser/Thr phosphosites and phosphomimetic variants thereof. We incorporated these oligonucleotides into a phage library and screened the PDZ (PSD‐95/Dlg/ZO‐1) domains of Scribble and DLG1 for interactions potentially enabled or disabled by ligand phosphorylation. We identified known and novel binders and characterized selected interactions through microscale thermophoresis, isothermal titration calorimetry, and NMR. We uncover site‐specific phospho‐regulation of PDZ domain interactions, provide a structural framework for how PDZ domains accomplish phosphopeptide binding, and discuss ligand phosphorylation as a switching mechanism of PDZ domain interactions. The approach is readily scalable and can be used to explore the potential phospho‐regulation of motif‐based interactions on a large scale.
Protein aggregation of the p63 transcription factor underlies severe skin fragility in AEC syndrome
(2018)
The p63 gene encodes a master regulator of epidermal commitment, development, and differentiation. Heterozygous mutations in the C-terminal domain of the p63 gene can cause ankyloblepharon-ectodermal defects-cleft lip/palate (AEC) syndrome, a life-threatening disorder characterized by skin fragility and severe, long-lasting skin erosions. Despite deep knowledge of p63 functions, little is known about mechanisms underlying disease pathology and possible treatments. Here, we show that multiple AEC-associated p63 mutations, but not those causative of other diseases, lead to thermodynamic protein destabilization, misfolding, and aggregation, similar to the known p53 gain-of-function mutants found in cancer. AEC mutant proteins exhibit impaired DNA binding and transcriptional activity, leading to dominant negative effects due to coaggregation with wild-type p63 and p73. Importantly, p63 aggregation occurs also in a conditional knock-in mouse model for the disorder, in which the misfolded p63 mutant protein leads to severe epidermal defects. Variants of p63 that abolish aggregation of the mutant proteins are able to rescue p63’s transcriptional function in reporter assays as well as in a human fibroblast-to-keratinocyte conversion assay. Our studies reveal that AEC syndrome is a protein aggregation disorder and opens avenues for therapeutic intervention.
Structural and functional dissection of the DH and PH domains of oncogenic Bcr-Abl tyrosine kinase
(2017)
The two isoforms of the Bcr-Abl tyrosine kinase, p210 and p190, are associated with different leukemias and have a dramatically different signaling network, despite similar kinase activity. To provide a molecular rationale for these observations, we study the Dbl-homology (DH) and Pleckstrin-homology (PH) domains of Bcr-Abl p210, which constitute the only structural differences to p190. Here we report high-resolution structures of the DH and PH domains and characterize conformations of the DH–PH unit in solution. Our structural and functional analyses show no evidence that the DH domain acts as a guanine nucleotide exchange factor, whereas the PH domain binds to various phosphatidylinositol-phosphates. PH-domain mutants alter subcellular localization and result in decreased interactions with p210-selective interaction partners. Hence, the PH domain, but not the DH domain, plays an important role in the formation of the differential p210 and p190 Bcr-Abl signaling networks.
Folding of G-protein coupled receptors (GPCRs) according to the two-stage model (Popot, J. L., and Engelman, D. M. (1990) Biochemistry 29, 4031–4037) is postulated to proceed in 2 steps: partitioning of the polypeptide into the membrane followed by diffusion until native contacts are formed. Herein we investigate conformational preferences of fragments of the yeast Ste2p receptor using NMR. Constructs comprising the first, the first two, and the first three transmembrane (TM) segments, as well as a construct comprising TM1–TM2 covalently linked to TM7 were examined. We observed that the isolated TM1 does not form a stable helix nor does it integrate well into the micelle. TM1 is significantly stabilized upon interaction with TM2, forming a helical hairpin reported previously (Neumoin, A., Cohen, L. S., Arshava, B., Tantry, S., Becker, J. M., Zerbe, O., and Naider, F. (2009) Biophys. J. 96, 3187–3196), and in this case the protein integrates into the hydrophobic interior of the micelle. TM123 displays a strong tendency to oligomerize, but hydrogen exchange data reveal that the center of TM3 is solvent exposed. In all GPCRs so-far structurally characterized TM7 forms many contacts with TM1 and TM2. In our study TM127 integrates well into the hydrophobic environment, but TM7 does not stably pack against the remaining helices. Topology mapping in microsomal membranes also indicates that TM1 does not integrate in a membrane-spanning fashion, but that TM12, TM123, and TM127 adopt predominantly native-like topologies. The data from our study would be consistent with the retention of individual helices of incompletely synthesized GPCRs in the vicinity of the translocon until the complete receptor is released into the membrane interior.
The degradation of the poly(A) tail is crucial for posttranscriptional gene regulation and for quality control of mRNA. Poly(A)-specific ribonuclease (PARN) is one of the major mammalian 3’ specific exo-ribonucleases involved in the degradation of the mRNA poly(A) tail, and it is also involved in the regulation of translation in early embryonic development. The interaction between PARN and the m7GpppG cap of mRNA plays a key role in stimulating the rate of deadenylation. Here we report the solution structures of the cap-binding domain of mouse PARN with and without the m7GpppG cap analog. The structure of the cap-binding domain adopts the RNA recognition motif (RRM) with a characteristic a-helical extension at its C-terminus, which covers the b-sheet surface (hereafter referred to as PARN RRM). In the complex structure of PARN RRM with the cap analog, the base of the N7-methyl guanosine (m7G) of the cap analog stacks with the solvent-exposed aromatic side chain of the distinctive tryptophan residue 468, located at the C-terminal end of the second b-strand. These unique structural features in PARN RRM reveal a novel cap-binding mode, which is distinct from the nucleotide recognition mode of the canonical RRM domains.
Global response of diacylglycerol kinase towards substrate binding observed by 2D and 3D MAS NMR
(2019)
Escherichia coli diacylglycerol kinase (DGK) is an integral membrane protein, which catalyses the ATP-dependent phosphorylation of diacylglycerol (DAG) to phosphatic acid (PA). It is a unique trimeric enzyme, which does not share sequence homology with typical kinases. It exhibits a notable complexity in structure and function despite of its small size. Here, chemical shift assignment of wild-type DGK within lipid bilayers was carried out based on 3D MAS NMR, utilizing manual and automatic analysis protocols. Upon nucleotide binding, extensive chemical shift perturbations could be observed. These data provide evidence for a symmetric DGK trimer with all of its three active sites concurrently occupied. Additionally, we could detect that the nucleotide substrate induces a substantial conformational change, most likely directing DGK into its catalytic active form. Furthermore, functionally relevant interprotomer interactions are identified by DNP-enhanced MAS NMR in combination with site-directed mutagenesis and functional assays.
The spliceosomal protein SF3b49, a component of the splicing factor 3b (SF3b) protein complex in the U2 small nuclear ribonucleoprotein, contains two RNA recognition motif (RRM) domains. In yeast, the first RRM domain (RRM1) of Hsh49 protein (yeast orthologue of human SF3b49) reportedly interacts with another component, Cus1 protein (orthologue of human SF3b145). Here, we solved the solution structure of the RRM1 of human SF3b49 and examined its mode of interaction with a fragment of human SF3b145 using NMR methods. Chemical shift mapping showed that the SF3b145 fragment spanning residues 598-631 interacts with SF3b49 RRM1, which adopts a canonical RRM fold with a topology of β1-α1-β2-β3-α2-β4. Furthermore, a docking model based on NOESY measurements suggests that residues 607-616 of the SF3b145 fragment adopt a helical structure that binds to RRM1 predominantly via α1, consequently exhibiting a helix-helix interaction in almost antiparallel. This mode of interaction was confirmed by a mutational analysis using GST pull-down assays. Comparison with structures of all RRM domains when complexed with a peptide found that this helix-helix interaction is unique to SF3b49 RRM1. Additionally, all amino acid residues involved in the interaction are well conserved among eukaryotes, suggesting evolutionary conservation of this interaction mode between SF3b49 RRM1 and SF3b145.
Resonance assignments are challenging for membrane proteins due to the size of the lipid/detergent-protein complex and the presence of line-broadening from conformational exchange. As a consequence, many correlations are missing in the triple-resonance NMR experiments typically used for assignments. Herein, we present an approach in which correlations from these solution-state NMR experiments are supplemented by data from 13C unlabeling, single-amino acid type labeling, 4D NOESY data and proximity of moieties to lipids or water in combination with a structure of the protein. These additional data are used to edit the expected peaklists for the automated assignment protocol FLYA, a module of the program package CYANA. We demonstrate application of the protocol to the 262-residue proton pump from archaeal bacteriorhodopsin (bR) in lipid nanodiscs. The lipid-protein assembly is characterized by an overall correlation time of 44 ns. The protocol yielded assignments for 62% of all backbone (H, N, Cα, Cβ, C′) resonances of bR, corresponding to 74% of all observed backbone spin systems, and 60% of the Ala, Met, Ile (δ1), Leu and Val methyl groups, thus enabling to assign a large fraction of the protein without mutagenesis data. Most missing resonances stem from the extracellular half, likely due intermediate exchange line-broadening. Further analysis revealed that missing information of the amino acid type of the preceding residue is the largest problem, and that 4D NOESY experiments are particularly helpful to compensate for that information loss.
Background: The automation of objectively selecting amino acid residue ranges for structure superpositions is important for meaningful and consistent protein structure analyses. So far there is no widely-used standard for choosing these residue ranges for experimentally determined protein structures, where the manual selection of residue ranges or the use of suboptimal criteria remain commonplace. Results: We present an automated and objective method for finding amino acid residue ranges for the superposition and analysis of protein structures, in particular for structure bundles resulting from NMR structure calculations. The method is implemented in an algorithm, CYRANGE, that yields, without protein-specific parameter adjustment, appropriate residue ranges in most commonly occurring situations, including low-precision structure bundles, multi-domain proteins, symmetric multimers, and protein complexes. Residue ranges are chosen to comprise as many residues of a protein domain that increasing their number would lead to a steep rise in the RMSD value. Residue ranges are determined by first clustering residues into domains based on the distance variance matrix, and then refining for each domain the initial choice of residues by excluding residues one by one until the relative decrease of the RMSD value becomes insignificant. A penalty for the opening of gaps favours contiguous residue ranges in order to obtain a result that is as simple as possible, but not simpler. Results are given for a set of 37 proteins and compared with those of commonly used protein structure validation packages. We also provide residue ranges for 6351 NMR structures in the Protein Data Bank. Conclusions: The CYRANGE method is capable of automatically determining residue ranges for the superposition of protein structure bundles for a large variety of protein structures. The method correctly identifies ordered regions. Global structure superpositions based on the CYRANGE residue ranges allow a clear presentation of the structure, and unnecessary small gaps within the selected ranges are absent. In the majority of cases, the residue ranges from CYRANGE contain fewer gaps and cover considerably larger parts of the sequence than those from other methods without significantly increasing the RMSD values. CYRANGE thus provides an objective and automatic method for standardizing the choice of residue ranges for the superposition of protein structures. Additional files Additional file 1: Dependence of Q on the order parameter rank. The quantity Qi is plotted against the order parameter rank i for 9 different protein structure bundles. Additional file 2: Dependence of P on the clustering stage. The quantity Pi is plotted against the clustering stage i for 9 different protein structure bundles. Additional file 3: Dependence of CYRANGE results on the minimal cluster size parameter my. The sequence coverage (red) and RMSD (blue) of the residue ranges determined by CYRANGE were plotted as a function of my for 9 different protein structure bundles. The dotted vertical line indicates the default value, my = 8. Where CYRANGE found two domains, the RMSD values of the individual domains are shown in light and dark blue. Additional file 4: Dependence of CYRANGE results on the domain boundary extension parameter m. See Additional File 3 for details. Additional file 5: Dependence of CYRANGE results on the minimal gap width g. See Additional File 3 for details. Additional file 6: Dependence of CYRANGE results on the relative RMSD decrease parameter delta. See Additional File 3 for details. Additional file 7: Dependence of CYRANGE results on the absolute RMSD decrease parameter delta abs. See Additional File 3 for details. Additional file 8: Dependence of CYRANGE results on the gap penalty parameter gamma. See Additional File 3 for details. Additional file 9: Correlation between the sequence coverage from CYRANGE, FindCore and PSVS, and the GDT total score, GDT_TS. Each data point represents a protein shown in Figures 3 and 4. The coverage is the percentage of amino acid residues included in the residue ranges found by the different methods. The GDT_TS value is defined by GDT_TS = (P1 + P2 + P4 + P8)/4, where Pd is the fraction of residues that can be superimposed under a distance cutoff of d Å. Additional file 10: Correlation between the RMSD value for the residue ranges from CYRANGE, FindCore and PSVS, and the GDT total score, GDT_TS. Each data point represents one protein domain. See Additional File 9 for details.
In every established species, protein-protein interactions have evolved such that they are fit for purpose. However, the molecular details of the evolution of new protein-protein interactions are poorly understood. We have used nuclear magnetic resonance spectroscopy to investigate the changes in structure and dynamics during the evolution of a protein-protein interaction involving the intrinsically disordered CREBBP (CREB-binding protein) interaction domain (CID) and nuclear coactivator binding domain (NCBD) from the transcriptional coregulators NCOA (nuclear receptor coactivator) and CREBBP/p300, respectively. The most ancient low-affinity “Cambrian-like” [540 to 600 million years (Ma) ago] CID/NCBD complex contained less secondary structure and was more dynamic than the complexes from an evolutionarily younger “Ordovician-Silurian” fish ancestor (ca. 440 Ma ago) and extant human. The most ancient Cambrian-like CID/NCBD complex lacked one helix and several interdomain interactions, resulting in a larger solvent-accessible surface area. Furthermore, the most ancient complex had a high degree of millisecond-to-microsecond dynamics distributed along the entire sequences of both CID and NCBD. These motions were reduced in the Ordovician-Silurian CID/NCBD complex and further redistributed in the extant human CID/NCBD complex. Isothermal calorimetry experiments show that complex formation is enthalpically favorable and that affinity is modulated by a largely unfavorable entropic contribution to binding. Our data demonstrate how changes in structure and motion conspire to shape affinity during the evolution of a protein-protein complex and provide direct evidence for the role of structural, dynamic, and frustrational plasticity in the evolution of interactions between intrinsically disordered proteins.
Investigating three-dimensional (3D) structures of proteins in living cells by in-cell nuclear magnetic resonance (NMR) spectroscopy opens an avenue towards understanding the structural basis of their functions and physical properties under physiological conditions inside cells. In-cell NMR provides data at atomic resolution non-invasively, and has been used to detect protein-protein interactions, thermodynamics of protein stability, the behavior of intrinsically disordered proteins, etc. in cells. However, so far only a single de novo 3D protein structure could be determined based on data derived only from in-cell NMR. Here we introduce methods that enable in-cell NMR protein structure determination for a larger number of proteins at concentrations that approach physiological ones. The new methods comprise (1) advances in the processing of non-uniformly sampled NMR data, which reduces the measurement time for the intrinsically short-lived in-cell NMR samples, (2) automatic chemical shift assignment for obtaining an optimal resonance assignment, and (3) structure refinement with Bayesian inference, which makes it possible to calculate accurate 3D protein structures from sparse data sets of conformational restraints. As an example application we determined the structure of the B1 domain of protein G at about 250 μM concentration in living E. coli cells.