Biochemie und Chemie
LILBID and nESI : different native mass spectrometry techniques as tools in structural biology
(2018)
Native mass spectrometry is applied for the investigation of proteins and protein complexes worldwide. The challenge in native mass spectrometry is maintaining the features of the proteins of interest, such as oligomeric state, bound ligands, or the conformation of the protein complex, during transfer from solution to gas phase. This is an essential prerequisite to allow conclusions about the solution state protein complex, based on the gas phase measurements. Therefore, soft ionization techniques are required. Nanoelectrospray ionization (nESI) mass spectrometers are widely used for the analysis of protein complexes. A newer ionization method is laser induced liquid bead ion desorption (LILBID), which is based on the release of protein complexes from solution phase via infrared (IR) laser desorption. We use both methods in our lab, depending on the requirements of the biological system we are interested in. Here we benchmark the performance of our LILBID mass spectrometer in comparison to a nESI instrument, regarding sample conditions, buffer and additive tolerances, dissociation mechanism and applicability towards soluble and membrane protein complexes.
High-resolution NMR structure of an RNA model system : the 14-mer cUUCGg tetraloop hairpin RNA
(2009)
We present a high-resolution nuclear magnetic resonance (NMR) solution structure of a 14-mer RNA hairpin capped by a cUUCGg tetraloop. This short and very stable RNA presents an important model system for the study of RNA structure and dynamics using NMR spectroscopy, molecular dynamics (MD) simulations and RNA force-field development. The extraordinarily high precision of the structure (root mean square deviation of 0.3 Å) could be achieved by measuring and incorporating all currently accessible NMR parameters, including distances derived from nuclear Overhauser effect (NOE) intensities, torsion-angle dependent homonuclear and heteronuclear scalar coupling constants, projection-angle-dependent cross-correlated relaxation rates and residual dipolar couplings. The structure calculations were performed with the program CNS using the ARIA setup and protocols. The structure quality was further improved by a final refinement in explicit water using OPLS force field parameters for non-bonded interactions and charges. In addition, the 2'-hydroxyl groups have been assigned and their conformation has been analyzed based on NOE contacts. The structure currently defines a benchmark for the precision and accuracy amenable to RNA structure determination by NMR spectroscopy. Here, we discuss the impact of various NMR restraints on structure quality and discuss in detail the dynamics of this system as previously determined.
Metal-ion binding and metal-ion induced folding of the adenine-sensing riboswitch aptamer domain
(2007)
Divalent cations are important in the folding and stabilization of complex RNA structures. The adenine-sensing riboswitch controls the expression of mRNAs for proteins involved in purine metabolism by directly sensing intracellular adenine levels. Adenine binds with high affinity and specificity to the ligand binding or aptamer domain of the adenine-sensing riboswitch. The X-ray structure of this domain in complex with adenine revealed an intricate RNA-fold consisting of a three-helix junction stabilized by long-range base-pairing interactions and identified five binding sites for hexahydrated Mg2+-ions. Furthermore, a role for Mg2+-ions in the ligand-induced folding of this RNA was suggested. Here, we describe the interaction of divalent cations with the RNA–adenine complex in solution as studied by high-resolution NMR spectroscopy. Paramagnetic line broadening, chemical shift mapping and intermolecular nuclear Overhauser effects (NOEs) indicate the presence of at least three binding sites for divalent cations. Two of them are similar to those in the X-ray structure. The third site, which is important for the folding of this RNA, has not been observed previously. The ligand-free state of the RNA is conformationally heterogeneous and contains base-pairing patterns detrimental to ligand binding in the absence of Mg2+, but becomes partially pre-organized for ligand binding in the presence of Mg2+. Compared to the highly similar guanine-sensing riboswitch, the folding pathway for the adenine-sensing riboswitch aptamer domain is more complex and the influence of Mg2+ is more pronounced.
The p53 family of transcription factors (p53, p63 and p73) covers a wide range of functions critical for development, homeostasis and health of mammals across their lifespan. Besides the well-established tumor suppressor role, recent evidence has highlighted novel non-oncogenic functions exerted by p73. In particular, p73 is required for multiciliated cell (MCC) differentiation; MCCs have critical roles in brain and airways to move fluids across epithelial surfaces and to transport germ cells in the reproductive tract. This novel function of p73 provides a unifying cellular mechanism for the disparate inflammatory and immunological phenotypes of p73-deficient mice. Indeed, mice with Trp73 deficiency suffer from hydrocephalus, sterility and chronic respiratory tract infections due to profound defects in ciliogenesis and complete loss of mucociliary clearance since MCCs are essential for cleaning airways from inhaled pollutants, pathogens and allergens. Cross-species genomic analyses and functional rescue experiments identify TAp73 as the master transcriptional integrator of ciliogenesis, upstream of previously known central nodes. In addition, TAp73 shows a significant ability to regulate cellular metabolism and energy production through direct transcriptional regulation of several metabolic enzymes, such as glutaminase-2 and glucose-6 phosphate dehydrogenase. This recently uncovered role of TAp73 in the regulation of cellular metabolism strongly affects oxidative balance, thus potentially influencing all the biological aspects associated with p73 function, including development, homeostasis and cancer. Although through different mechanisms, p63 isoforms also contribute to regulation of cellular metabolism, thus indicating a common route used by all family members to control cell fate.
At the structural level, the complexity of p73's function is further enhanced by its ability to form heterotetramers with some p63 isoforms, thus indicating the existence of an intrafamily crosstalk that determines the global outcome of p53 family function. In this review, we have tried to summarize all the recent evidence that has emerged on the novel non-oncogenic roles of p73, in an attempt to provide a unified view of the complex function of this gene within its family.
The degradation of the poly(A) tail is crucial for posttranscriptional gene regulation and for quality control of mRNA. Poly(A)-specific ribonuclease (PARN) is one of the major mammalian 3’ specific exo-ribonucleases involved in the degradation of the mRNA poly(A) tail, and it is also involved in the regulation of translation in early embryonic development. The interaction between PARN and the m7GpppG cap of mRNA plays a key role in stimulating the rate of deadenylation. Here we report the solution structures of the cap-binding domain of mouse PARN with and without the m7GpppG cap analog. The structure of the cap-binding domain adopts the RNA recognition motif (RRM) with a characteristic α-helical extension at its C-terminus, which covers the β-sheet surface (hereafter referred to as PARN RRM). In the complex structure of PARN RRM with the cap analog, the base of the N7-methyl guanosine (m7G) of the cap analog stacks with the solvent-exposed aromatic side chain of the distinctive tryptophan residue 468, located at the C-terminal end of the second β-strand. These unique structural features in PARN RRM reveal a novel cap-binding mode, which is distinct from the nucleotide recognition mode of the canonical RRM domains.
Global response of diacylglycerol kinase towards substrate binding observed by 2D and 3D MAS NMR
(2019)
Escherichia coli diacylglycerol kinase (DGK) is an integral membrane protein, which catalyses the ATP-dependent phosphorylation of diacylglycerol (DAG) to phosphatidic acid (PA). It is a unique trimeric enzyme, which does not share sequence homology with typical kinases. It exhibits a notable complexity in structure and function despite its small size. Here, chemical shift assignment of wild-type DGK within lipid bilayers was carried out based on 3D MAS NMR, utilizing manual and automatic analysis protocols. Upon nucleotide binding, extensive chemical shift perturbations could be observed. These data provide evidence for a symmetric DGK trimer with all of its three active sites concurrently occupied. Additionally, we could detect that the nucleotide substrate induces a substantial conformational change, most likely directing DGK into its catalytically active form. Furthermore, functionally relevant interprotomer interactions are identified by DNP-enhanced MAS NMR in combination with site-directed mutagenesis and functional assays.
Inhibitors of eukaryotic protein biosynthesis that target single translation factors are in high demand. Here we report on a small molecule inhibitor, gephyronic acid, isolated from the myxobacterium Archangium gephyra that inhibits growth of transformed mammalian cell lines in the nM range. In direct comparison, primary human fibroblasts were shown to be less sensitive to toxic effects of gephyronic acid than cancer-derived cells. Gephyronic acid targets the protein translation system. Experiments with IRES dual luciferase reporter assays identified it as an inhibitor of translation initiation. DARTS approaches, co-localization studies and pull-down assays indicate that the binding partner could be the eukaryotic initiation factor 2 subunit alpha (eIF2α). Gephyronic acid seems to have a different mode of action than the structurally related polyketides tedanolide, myriaporone, and pederin and is a valuable tool for investigating the eukaryotic translation system. Because cancer-derived cells were found to be especially sensitive, gephyronic acid could potentially find use as a drug candidate.
Mistakes in translation of messenger RNA into protein are clearly a detriment to the recombinant production of pure proteins for biophysical study or the biopharmaceutical market. However, they may also provide insight into mechanistic details of the translation process. Mistakes often involve the substitution of an amino acid having an abundant codon for one having a rare codon, differing by substitution of a G base by an A base, as in the case of substitution of a lysine (AAA) for arginine (AGA). In these cases one expects the substitution frequency to depend on the relative abundances of the respective tRNAs, and thus, one might expect frequencies to be similar for all sites having the same rare codon. Here we demonstrate that, for the ADP-ribosylation factor from yeast expressed in E. coli, lysine-for-arginine substitution frequencies are not the same at the 9 sites containing a rare arginine codon; mis-incorporation frequencies instead vary from less than 1 to 16%. We suggest that the context in which the codons occur (clustering of rare sites) may be responsible for the variation. The method employed to determine the frequency of mis-incorporation involves a novel mass spectrometric analysis of the products from the parallel expression of wild type and codon-optimized genes in 15N and 14N enriched media, respectively. The high sensitivity and low material requirements of the method make this a promising technology for the collection of data relevant to other mis-incorporations. The additional data could be of value in refining models for the ribosomal translation elongation process.
Mechanistic understanding of dynamic membrane proteins such as transporters, receptors, and channels requires accurate depictions of conformational ensembles, and the manner in which they interchange as a function of environmental factors including substrates, lipids, and inhibitors. Spectroscopic techniques such as electron spin resonance (ESR) pulsed electron–electron double resonance (PELDOR), also known as double electron–electron resonance (DEER), provide a complement to atomistic structures obtained from x-ray crystallography or cryo-EM, since spectroscopic data reflect an ensemble and can be measured in more native solvents, unperturbed by a crystal lattice. However, attempts to interpret DEER data are frequently stymied by discrepancies with the structural data, which may arise due to differences in conditions, the dynamics of the protein, or the flexibility of the attached paramagnetic spin labels. Recently, molecular simulation techniques such as EBMetaD have been developed that create a conformational ensemble matching an experimental distance distribution while applying the minimal possible bias. Moreover, it has been proposed that the work required during an EBMetaD simulation to match an experimentally determined distribution could be used as a metric with which to assign conformational states to a given measurement. Here, we demonstrate the application of this concept for a sodium-coupled transport protein, BetP. Because the probe, protein, and lipid bilayer are all represented in atomic detail, the different contributions to the work, such as the extent of protein backbone movements, can be separated. This work therefore illustrates how ranking simulations based on EBMetaD can help to bridge the gap between structural and biophysical data and thereby enhance our understanding of membrane protein conformational mechanisms.
The spliceosomal protein SF3b49, a component of the splicing factor 3b (SF3b) protein complex in the U2 small nuclear ribonucleoprotein, contains two RNA recognition motif (RRM) domains. In yeast, the first RRM domain (RRM1) of Hsh49 protein (yeast orthologue of human SF3b49) reportedly interacts with another component, Cus1 protein (orthologue of human SF3b145). Here, we solved the solution structure of the RRM1 of human SF3b49 and examined its mode of interaction with a fragment of human SF3b145 using NMR methods. Chemical shift mapping showed that the SF3b145 fragment spanning residues 598-631 interacts with SF3b49 RRM1, which adopts a canonical RRM fold with a topology of β1-α1-β2-β3-α2-β4. Furthermore, a docking model based on NOESY measurements suggests that residues 607-616 of the SF3b145 fragment adopt a helical structure that binds to RRM1 predominantly via α1, consequently exhibiting a helix-helix interaction in an almost antiparallel orientation. This mode of interaction was confirmed by a mutational analysis using GST pull-down assays. Comparison with structures of all RRM domains when complexed with a peptide found that this helix-helix interaction is unique to SF3b49 RRM1. Additionally, all amino acid residues involved in the interaction are well conserved among eukaryotes, suggesting evolutionary conservation of this interaction mode between SF3b49 RRM1 and SF3b145.
Structured RNA regions are important gene control elements in prokaryotes and eukaryotes. Here, we show that the mRNA of a cyanobacterial heat shock gene contains a built-in thermosensor critical for photosynthetic activity under stress conditions. The exceptionally short 5´-untranslated region is comprised of a single hairpin with an internal asymmetric loop. It inhibits translation of the Synechocystis hsp17 transcript at normal growth conditions, permits translation initiation under stress conditions and shuts down Hsp17 production in the recovery phase. Point mutations that stabilized or destabilized the RNA structure deregulated reporter gene expression in vivo and ribosome binding in vitro. Introduction of such point mutations into the Synechocystis genome produced severe phenotypic defects. Reversible formation of the open and closed structure was beneficial for viability, integrity of the photosystem and oxygen evolution. Continuous production of Hsp17 was detrimental when the stress declined indicating that shutting-off heat shock protein production is an important, previously unrecognized function of RNA thermometers. We discovered a simple biosensor that strictly adjusts the cellular level of a molecular chaperone to the physiological need.
The family of scaffold attachment factor B (SAFB) proteins comprises three members and was first identified as binders of the nuclear matrix/scaffold. Over the past two decades, SAFBs were shown to act in DNA repair, mRNA/(l)ncRNA processing, and as part of protein complexes with chromatin-modifying enzymes. SAFB proteins are approximately 100 kDa-sized dual nucleic acid-binding proteins with dedicated domains in an otherwise largely unstructured context, but whether and how they discriminate DNA- and RNA-binding has remained enigmatic. We here provide the SAFB2 DNA- and RNA-binding SAP and RRM domains in their functional boundaries and use solution NMR spectroscopy to ascribe DNA- and RNA-binding functions. We give insight into their target nucleic acid preferences and map the interfaces with respective nucleic acids on sparse data-derived SAP and RRM domain structures. Further, we provide evidence that the SAP domain exhibits intra-domain dynamics and a potential tendency to dimerise, which may expand its specifically targeted DNA sequence range. Our data provide a first molecular basis of and a starting point towards deciphering DNA- and RNA-binding functions of SAFB2 on the molecular level and serve as a basis for understanding its localization to specific regions of chromatin and its involvement in the processing of specific RNA species.
The family of scaffold attachment factor B (SAFB) proteins comprises three members and was first identified as binders of the nuclear matrix/scaffold. Over the past two decades, SAFBs were shown to act in DNA repair, mRNA/(l)ncRNA processing and as part of protein complexes with chromatin-modifying enzymes. SAFB proteins are approximately 100 kDa-sized dual nucleic acid-binding proteins with dedicated domains in an otherwise largely unstructured context, but whether and how they discriminate DNA and RNA binding has remained enigmatic. We here provide the SAFB2 DNA- and RNA-binding SAP and RRM domains in their functional boundaries and use solution NMR spectroscopy to ascribe DNA- and RNA-binding functions. We give insight into their target nucleic acid preferences and map the interfaces with respective nucleic acids on sparse data-derived SAP and RRM domain structures. Further, we provide evidence that the SAP domain exhibits intra-domain dynamics and a potential tendency to dimerize, which may expand its specifically targeted DNA sequence range. Our data provide a first molecular basis of and a starting point towards deciphering DNA- and RNA-binding functions of SAFB2 on the molecular level and serve as a basis for understanding its localization to specific regions of chromatin and its involvement in the processing of specific RNA species.
Resonance assignments are challenging for membrane proteins due to the size of the lipid/detergent-protein complex and the presence of line-broadening from conformational exchange. As a consequence, many correlations are missing in the triple-resonance NMR experiments typically used for assignments. Herein, we present an approach in which correlations from these solution-state NMR experiments are supplemented by data from 13C unlabeling, single-amino acid type labeling, 4D NOESY data and proximity of moieties to lipids or water in combination with a structure of the protein. These additional data are used to edit the expected peaklists for the automated assignment protocol FLYA, a module of the program package CYANA. We demonstrate application of the protocol to the 262-residue proton pump from archaeal bacteriorhodopsin (bR) in lipid nanodiscs. The lipid-protein assembly is characterized by an overall correlation time of 44 ns. The protocol yielded assignments for 62% of all backbone (H, N, Cα, Cβ, C′) resonances of bR, corresponding to 74% of all observed backbone spin systems, and 60% of the Ala, Met, Ile (δ1), Leu and Val methyl groups, thus enabling assignment of a large fraction of the protein without mutagenesis data. Most missing resonances stem from the extracellular half, likely due to intermediate exchange line-broadening. Further analysis revealed that missing information of the amino acid type of the preceding residue is the largest problem, and that 4D NOESY experiments are particularly helpful to compensate for that information loss.
Background: The automation of objectively selecting amino acid residue ranges for structure superpositions is important for meaningful and consistent protein structure analyses. So far there is no widely-used standard for choosing these residue ranges for experimentally determined protein structures, where the manual selection of residue ranges or the use of suboptimal criteria remain commonplace. Results: We present an automated and objective method for finding amino acid residue ranges for the superposition and analysis of protein structures, in particular for structure bundles resulting from NMR structure calculations. The method is implemented in an algorithm, CYRANGE, that yields, without protein-specific parameter adjustment, appropriate residue ranges in most commonly occurring situations, including low-precision structure bundles, multi-domain proteins, symmetric multimers, and protein complexes. Residue ranges are chosen to comprise as many residues of a protein domain as possible, such that any further increase in their number would lead to a steep rise in the RMSD value. Residue ranges are determined by first clustering residues into domains based on the distance variance matrix, and then refining for each domain the initial choice of residues by excluding residues one by one until the relative decrease of the RMSD value becomes insignificant. A penalty for the opening of gaps favours contiguous residue ranges in order to obtain a result that is as simple as possible, but not simpler. Results are given for a set of 37 proteins and compared with those of commonly used protein structure validation packages. We also provide residue ranges for 6351 NMR structures in the Protein Data Bank. Conclusions: The CYRANGE method is capable of automatically determining residue ranges for the superposition of protein structure bundles for a large variety of protein structures. The method correctly identifies ordered regions.
Global structure superpositions based on the CYRANGE residue ranges allow a clear presentation of the structure, and unnecessary small gaps within the selected ranges are absent. In the majority of cases, the residue ranges from CYRANGE contain fewer gaps and cover considerably larger parts of the sequence than those from other methods without significantly increasing the RMSD values. CYRANGE thus provides an objective and automatic method for standardizing the choice of residue ranges for the superposition of protein structures.
Additional files
Additional file 1: Dependence of Q on the order parameter rank. The quantity Qi is plotted against the order parameter rank i for 9 different protein structure bundles.
Additional file 2: Dependence of P on the clustering stage. The quantity Pi is plotted against the clustering stage i for 9 different protein structure bundles.
Additional file 3: Dependence of CYRANGE results on the minimal cluster size parameter my. The sequence coverage (red) and RMSD (blue) of the residue ranges determined by CYRANGE were plotted as a function of my for 9 different protein structure bundles. The dotted vertical line indicates the default value, my = 8. Where CYRANGE found two domains, the RMSD values of the individual domains are shown in light and dark blue.
Additional file 4: Dependence of CYRANGE results on the domain boundary extension parameter m. See Additional file 3 for details.
Additional file 5: Dependence of CYRANGE results on the minimal gap width g. See Additional file 3 for details.
Additional file 6: Dependence of CYRANGE results on the relative RMSD decrease parameter delta. See Additional file 3 for details.
Additional file 7: Dependence of CYRANGE results on the absolute RMSD decrease parameter delta abs. See Additional file 3 for details.
Additional file 8: Dependence of CYRANGE results on the gap penalty parameter gamma. See Additional file 3 for details.
Additional file 9: Correlation between the sequence coverage from CYRANGE, FindCore and PSVS, and the GDT total score, GDT_TS. Each data point represents a protein shown in Figures 3 and 4. The coverage is the percentage of amino acid residues included in the residue ranges found by the different methods. The GDT_TS value is defined by GDT_TS = (P1 + P2 + P4 + P8)/4, where Pd is the fraction of residues that can be superimposed under a distance cutoff of d Å.
Additional file 10: Correlation between the RMSD value for the residue ranges from CYRANGE, FindCore and PSVS, and the GDT total score, GDT_TS. Each data point represents one protein domain. See Additional file 9 for details.
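The GDT_TS definition quoted under Additional file 9, GDT_TS = (P1 + P2 + P4 + P8)/4, can be illustrated with a minimal Python sketch. This is not the CYRANGE implementation: the function name `gdt_ts` and the input format are hypothetical, and the sketch assumes that per-residue distances (in Å) are already available from a single superposition, whereas a full GDT calculation searches for the best superposition separately for each cutoff. Treating a distance equal to the cutoff as "under" it is also an assumption.

```python
from typing import Sequence


def gdt_ts(distances: Sequence[float],
           cutoffs: Sequence[float] = (1.0, 2.0, 4.0, 8.0)) -> float:
    """Return (P1 + P2 + P4 + P8) / 4 as a fraction between 0 and 1.

    `distances` holds one distance in Angstrom per residue, taken after
    a single superposition (a simplifying assumption of this sketch).
    """
    if not distances:
        raise ValueError("need at least one residue distance")
    n = len(distances)
    # P_d: fraction of residues whose distance lies within cutoff d
    fractions = [sum(1 for x in distances if x <= d) / n for d in cutoffs]
    return sum(fractions) / len(fractions)


if __name__ == "__main__":
    # Hypothetical per-residue distances for a five-residue example
    example = [0.5, 1.5, 3.0, 6.0, 10.0]
    print(f"GDT_TS = {gdt_ts(example):.2f}")
```

In the five-residue example the fractions are 1/5, 2/5, 3/5 and 4/5 for the 1, 2, 4 and 8 Å cutoffs, so the score is their mean, 0.5. Multiply by 100 if a percentage scale is preferred.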
We investigate complexes of two paramagnetic metal ions Gd3+ and Mn2+ to serve as polarizing agents for solid-state dynamic nuclear polarization (DNP) of 1H, 13C, and 15N at magnetic fields of 5, 9.4, and 14.1 T. Both ions are half-integer high-spin systems with a zero-field splitting and therefore exhibit a broadening of the mS = −1/2 ↔ +1/2 central transition which scales inversely with the external field strength. We investigate experimentally the influence of the chelator molecule, strong hyperfine coupling to the metal nucleus, and deuteration of the bulk matrix on DNP properties. At small Gd-DOTA concentrations the narrow central transition allows us to polarize nuclei with small gyromagnetic ratio such as 13C and even 15N via the solid effect. We demonstrate that enhancements observed are limited by the available microwave power and that large enhancement factors of >100 (for 1H) and on the order of 1000 (for 13C) can be achieved in the saturation limit even at 80 K. At larger Gd(III) concentrations (≥10 mM) where dipolar couplings between two neighboring Gd3+ complexes become substantial a transition towards cross effect as dominating DNP mechanism is observed. Furthermore, the slow spin-diffusion between 13C and 15N, respectively, allows for temporally resolved observation of enhanced polarization spreading from nuclei close to the paramagnetic ion towards nuclei further removed. Subsequently, we present preliminary DNP experiments on ubiquitin by site-directed spin-labeling with Gd3+ chelator tags. The results hold promise towards applications of such paramagnetically labeled proteins for DNP applications in biophysical chemistry and/or structural biology.
ATP-binding cassette (ABC) transporters, a superfamily of integral membrane proteins, catalyse the translocation of substrates across the cellular membrane by ATP hydrolysis. Here we demonstrate by nucleotide turnover and binding studies based on 31P solid-state NMR spectroscopy that the ABC exporter and lipid A flippase MsbA can couple ATP hydrolysis to an adenylate kinase activity, where ADP is converted into AMP and ATP. Single-point mutations reveal that both ATPase and adenylate kinase mechanisms are associated with the same conserved motifs of the nucleotide-binding domain. Based on these results, we propose a model for the coupled ATPase-adenylate kinase mechanism, involving the canonical and an additional nucleotide-binding site. We extend these findings to other prokaryotic ABC exporters, namely LmrA and TmrAB, suggesting that the coupled activities are a general feature of ABC exporters.
Ribosomal proteins are assumed to stabilize specific RNA structures and promote compact folding of the large rRNA. The conformational dynamics of the protein between the bound and unbound state play an important role in the binding process. We have studied those dynamical changes in detail for the highly conserved complex between the ribosomal protein L11 and the GTPase region of 23S rRNA. The RNA domain is compactly folded into a well defined tertiary structure, which is further stabilized by the association with the C-terminal domain of the L11 protein (L11ctd). In addition, the N-terminal domain of L11 (L11ntd) is implicated in the binding of the natural thiazole antibiotic thiostrepton, which disrupts the elongation factor function. We have studied the conformation of the ribosomal protein and its dynamics by NMR in the unbound state, the RNA bound state and in the ternary complex with the RNA and thiostrepton. Our data reveal a rearrangement of the L11ntd, placing it closer to the RNA after binding of thiostrepton, which may prevent binding of elongation factors. We propose a model for the ternary L11–RNA–thiostrepton complex that is additionally based on interaction data and conformational information of the L11 protein. The model is consistent with earlier findings and provides an explanation for the role of L11ntd in elongation factor binding.
In every established species, protein-protein interactions have evolved such that they are fit for purpose. However, the molecular details of the evolution of new protein-protein interactions are poorly understood. We have used nuclear magnetic resonance spectroscopy to investigate the changes in structure and dynamics during the evolution of a protein-protein interaction involving the intrinsically disordered CREBBP (CREB-binding protein) interaction domain (CID) and nuclear coactivator binding domain (NCBD) from the transcriptional coregulators NCOA (nuclear receptor coactivator) and CREBBP/p300, respectively. The most ancient low-affinity “Cambrian-like” [540 to 600 million years (Ma) ago] CID/NCBD complex contained less secondary structure and was more dynamic than the complexes from an evolutionarily younger “Ordovician-Silurian” fish ancestor (ca. 440 Ma ago) and extant human. The most ancient Cambrian-like CID/NCBD complex lacked one helix and several interdomain interactions, resulting in a larger solvent-accessible surface area. Furthermore, the most ancient complex had a high degree of millisecond-to-microsecond dynamics distributed along the entire sequences of both CID and NCBD. These motions were reduced in the Ordovician-Silurian CID/NCBD complex and further redistributed in the extant human CID/NCBD complex. Isothermal calorimetry experiments show that complex formation is enthalpically favorable and that affinity is modulated by a largely unfavorable entropic contribution to binding. Our data demonstrate how changes in structure and motion conspire to shape affinity during the evolution of a protein-protein complex and provide direct evidence for the role of structural, dynamic, and frustrational plasticity in the evolution of interactions between intrinsically disordered proteins.
Investigating three-dimensional (3D) structures of proteins in living cells by in-cell nuclear magnetic resonance (NMR) spectroscopy opens an avenue towards understanding the structural basis of their functions and physical properties under physiological conditions inside cells. In-cell NMR provides data at atomic resolution non-invasively, and has been used to detect protein-protein interactions, thermodynamics of protein stability, the behavior of intrinsically disordered proteins, etc. in cells. However, so far only a single de novo 3D protein structure could be determined based on data derived only from in-cell NMR. Here we introduce methods that enable in-cell NMR protein structure determination for a larger number of proteins at concentrations that approach physiological ones. The new methods comprise (1) advances in the processing of non-uniformly sampled NMR data, which reduces the measurement time for the intrinsically short-lived in-cell NMR samples, (2) automatic chemical shift assignment for obtaining an optimal resonance assignment, and (3) structure refinement with Bayesian inference, which makes it possible to calculate accurate 3D protein structures from sparse data sets of conformational restraints. As an example application we determined the structure of the B1 domain of protein G at about 250 μM concentration in living E. coli cells.