Refine
Year of publication
Document Type
- Article (28)
Language
- English (28)
Has Fulltext
- yes (28)
Is part of the Bibliography
- no (28)
Keywords
- Solution-state NMR (4)
- NMR (3)
- Kinases (2)
- Molecular conformation (2)
- AEC syndrome (1)
- Antibiotic Resistance (1)
- Armadillo repeat protein (1)
- B-factor (1)
- BPTI (1)
- Bacillus (1)
Lantibiotics are peptide-derived antibiotics that inhibit the growth of Gram-positive bacteria via interactions with lipid II and lipid II-dependent pore formation in the bacterial membrane. Due to their general mode of action the Gram-positive producer strains need to express immunity proteins (LanI proteins) for protection against their own lantibiotics. Little is known about the immunity mechanism protecting the producer strain against its own lantibiotic on the molecular level. So far, no structures have been reported for any LanI protein. We solved the structure of SpaI, a LanI protein from the subtilin producing strain Bacillus subtilis ATCC 6633. SpaI is a 16.8-kDa lipoprotein that is attached to the outside of the cytoplasmic membrane via a covalent diacylglycerol anchor. SpaI together with the ABC transporter SpaFEG protects the B. subtilis membrane from subtilin insertion. The solution-NMR structure of a 15-kDa biologically active C-terminal fragment reveals a novel fold. We also demonstrate that the first 20 N-terminal amino acids not present in this C-terminal fragment are unstructured in solution and are required for interactions with lipid membranes. Additionally, growth tests reveal that these 20 N-terminal residues are important for the immunity mediated by SpaI but most likely are not part of a possible subtilin binding site. Our findings are the first step on the way of understanding the immunity mechanism of B. subtilis in particular and of other lantibiotic producing strains in general.
Protein aggregation of the p63 transcription factor underlies severe skin fragility in AEC syndrome
(2018)
The p63 gene encodes a master regulator of epidermal commitment, development, and differentiation. Heterozygous mutations in the C-terminal domain of the p63 gene can cause ankyloblepharon-ectodermal defects-cleft lip/palate (AEC) syndrome, a life-threatening disorder characterized by skin fragility and severe, long-lasting skin erosions. Despite deep knowledge of p63 functions, little is known about mechanisms underlying disease pathology and possible treatments. Here, we show that multiple AEC-associated p63 mutations, but not those causative of other diseases, lead to thermodynamic protein destabilization, misfolding, and aggregation, similar to the known p53 gain-of-function mutants found in cancer. AEC mutant proteins exhibit impaired DNA binding and transcriptional activity, leading to dominant negative effects due to coaggregation with wild-type p63 and p73. Importantly, p63 aggregation occurs also in a conditional knock-in mouse model for the disorder, in which the misfolded p63 mutant protein leads to severe epidermal defects. Variants of p63 that abolish aggregation of the mutant proteins are able to rescue p63’s transcriptional function in reporter assays as well as in a human fibroblast-to-keratinocyte conversion assay. Our studies reveal that AEC syndrome is a protein aggregation disorder and opens avenues for therapeutic intervention.
NMR structure calculation using NOE-derived distance restraints requires a considerable number of assignments of both backbone and sidechains resonances, often difficult or impossible to get for large or complex proteins. Pseudocontact shifts (PCSs) also play a well-established role in NMR protein structure calculation, usually to augment existing structural, mostly NOE-derived, information. Existing refinement protocols using PCSs usually either require a sizeable number of sidechain assignments or are complemented by other experimental restraints. Here, we present an automated iterative procedure to perform backbone protein structure refinements requiring only a limited amount of backbone amide PCSs. Already known structural features from a starting homology model, in this case modules of repeat proteins, are framed into a scaffold that is subsequently refined by experimental PCSs. The method produces reliable indicators that can be monitored to judge about the performance. We applied it to a system in which sidechain assignments are hardly possible, designed Armadillo repeat proteins (dArmRPs), and we calculated the solution NMR structure of YM4A, a dArmRP containing four sequence-identical internal modules, obtaining high convergence to a single structure. We suggest that this approach is particularly useful when approximate folds are known from other techniques, such as X-ray crystallography, while avoiding inherent artefacts due to, for instance, crystal packing.
Background: The automation of objectively selecting amino acid residue ranges for structure superpositions is important for meaningful and consistent protein structure analyses. So far there is no widely-used standard for choosing these residue ranges for experimentally determined protein structures, where the manual selection of residue ranges or the use of suboptimal criteria remain commonplace. Results: We present an automated and objective method for finding amino acid residue ranges for the superposition and analysis of protein structures, in particular for structure bundles resulting from NMR structure calculations. The method is implemented in an algorithm, CYRANGE, that yields, without protein-specific parameter adjustment, appropriate residue ranges in most commonly occurring situations, including low-precision structure bundles, multi-domain proteins, symmetric multimers, and protein complexes. Residue ranges are chosen to comprise as many residues of a protein domain that increasing their number would lead to a steep rise in the RMSD value. Residue ranges are determined by first clustering residues into domains based on the distance variance matrix, and then refining for each domain the initial choice of residues by excluding residues one by one until the relative decrease of the RMSD value becomes insignificant. A penalty for the opening of gaps favours contiguous residue ranges in order to obtain a result that is as simple as possible, but not simpler. Results are given for a set of 37 proteins and compared with those of commonly used protein structure validation packages. We also provide residue ranges for 6351 NMR structures in the Protein Data Bank. Conclusions: The CYRANGE method is capable of automatically determining residue ranges for the superposition of protein structure bundles for a large variety of protein structures. The method correctly identifies ordered regions. Global structure superpositions based on the CYRANGE residue ranges allow a clear presentation of the structure, and unnecessary small gaps within the selected ranges are absent. In the majority of cases, the residue ranges from CYRANGE contain fewer gaps and cover considerably larger parts of the sequence than those from other methods without significantly increasing the RMSD values. CYRANGE thus provides an objective and automatic method for standardizing the choice of residue ranges for the superposition of protein structures. Additional files Additional file 1: Dependence of Q on the order parameter rank. The quantity Qi is plotted against the order parameter rank i for 9 different protein structure bundles. Additional file 2: Dependence of P on the clustering stage. The quantity Pi is plotted against the clustering stage i for 9 different protein structure bundles. Additional file 3: Dependence of CYRANGE results on the minimal cluster size parameter my. The sequence coverage (red) and RMSD (blue) of the residue ranges determined by CYRANGE were plotted as a function of my for 9 different protein structure bundles. The dotted vertical line indicates the default value, my = 8. Where CYRANGE found two domains, the RMSD values of the individual domains are shown in light and dark blue. Additional file 4: Dependence of CYRANGE results on the domain boundary extension parameter m. See Additional File 3 for details. Additional file 5: Dependence of CYRANGE results on the minimal gap width g. See Additional File 3 for details. Additional file 6: Dependence of CYRANGE results on the relative RMSD decrease parameter delta. See Additional File 3 for details. Additional file 7: Dependence of CYRANGE results on the absolute RMSD decrease parameter delta abs. See Additional File 3 for details. Additional file 8: Dependence of CYRANGE results on the gap penalty parameter gamma. See Additional File 3 for details. Additional file 9: Correlation between the sequence coverage from CYRANGE, FindCore and PSVS, and the GDT total score, GDT_TS. Each data point represents a protein shown in Figures 3 and 4. The coverage is the percentage of amino acid residues included in the residue ranges found by the different methods. The GDT_TS value is defined by GDT_TS = (P1 + P2 + P4 + P8)/4, where Pd is the fraction of residues that can be superimposed under a distance cutoff of d Å. Additional file 10: Correlation between the RMSD value for the residue ranges from CYRANGE, FindCore and PSVS, and the GDT total score, GDT_TS. Each data point represents one protein domain. See Additional File 9 for details.
The YaeJ protein is a codon-independent release factor with peptidyl-tRNA hydrolysis (PTH) activity, and functions as a stalled-ribosome rescue factor in Escherichia coli. To identify residues required for YaeJ function, we performed mutational analysis for in vitro PTH activity towards rescue of ribosomes stalled on a non-stop mRNA, and for ribosome-binding efficiency. We focused on residues conserved among bacterial YaeJ proteins. Additionally, we determined the solution structure of the GGQ domain of YaeJ from E. coli using nuclear magnetic resonance spectroscopy. YaeJ and a human homolog, ICT1, had similar levels of PTH activity, despite various differences in sequence and structure. While no YaeJ-specific residues important for PTH activity occur in the structured GGQ domain, Arg118, Leu119, Lys122, Lys129 and Arg132 in the following C-terminal extension were required for PTH activity. All of these residues are completely conserved among bacteria. The equivalent residues were also found in the C-terminal extension of ICT1, allowing an appropriate sequence alignment between YaeJ and ICT1 proteins from various species. Single amino acid substitutions for each of these residues significantly decreased ribosome-binding efficiency. These biochemical findings provide clues to understanding how YaeJ enters the A-site of stalled ribosomes.
The spliceosomal protein SF3b49, a component of the splicing factor 3b (SF3b) protein complex in the U2 small nuclear ribonucleoprotein, contains two RNA recognition motif (RRM) domains. In yeast, the first RRM domain (RRM1) of Hsh49 protein (yeast orthologue of human SF3b49) reportedly interacts with another component, Cus1 protein (orthologue of human SF3b145). Here, we solved the solution structure of the RRM1 of human SF3b49 and examined its mode of interaction with a fragment of human SF3b145 using NMR methods. Chemical shift mapping showed that the SF3b145 fragment spanning residues 598-631 interacts with SF3b49 RRM1, which adopts a canonical RRM fold with a topology of β1-α1-β2-β3-α2-β4. Furthermore, a docking model based on NOESY measurements suggests that residues 607-616 of the SF3b145 fragment adopt a helical structure that binds to RRM1 predominantly via α1, consequently exhibiting a helix-helix interaction in almost antiparallel. This mode of interaction was confirmed by a mutational analysis using GST pull-down assays. Comparison with structures of all RRM domains when complexed with a peptide found that this helix-helix interaction is unique to SF3b49 RRM1. Additionally, all amino acid residues involved in the interaction are well conserved among eukaryotes, suggesting evolutionary conservation of this interaction mode between SF3b49 RRM1 and SF3b145.
The degradation of the poly(A) tail is crucial for posttranscriptional gene regulation and for quality control of mRNA. Poly(A)-specific ribonuclease (PARN) is one of the major mammalian 3’ specific exo-ribonucleases involved in the degradation of the mRNA poly(A) tail, and it is also involved in the regulation of translation in early embryonic development. The interaction between PARN and the m7GpppG cap of mRNA plays a key role in stimulating the rate of deadenylation. Here we report the solution structures of the cap-binding domain of mouse PARN with and without the m7GpppG cap analog. The structure of the cap-binding domain adopts the RNA recognition motif (RRM) with a characteristic a-helical extension at its C-terminus, which covers the b-sheet surface (hereafter referred to as PARN RRM). In the complex structure of PARN RRM with the cap analog, the base of the N7-methyl guanosine (m7G) of the cap analog stacks with the solvent-exposed aromatic side chain of the distinctive tryptophan residue 468, located at the C-terminal end of the second b-strand. These unique structural features in PARN RRM reveal a novel cap-binding mode, which is distinct from the nucleotide recognition mode of the canonical RRM domains.
The CUG-binding protein 1 (CUG-BP1) is a member of the CUG-BP1 and ETR-like factors (CELF) family or the Bruno-like family and is involved in the control of splicing, translation and mRNA degradation. Several target RNA sequences of CUG-BP1 have been predicted, such as the CUG triplet repeat, the GU-rich sequences and the AU-rich element of nuclear pre-mRNAs and/or cytoplasmic mRNA. CUG-BP1 has three RNA-recognition motifs (RRMs), among which the third RRM (RRM3) can bind to the target RNAs on its own. In this study, we solved the solution structure of the CUG-BP1 RRM3 by hetero-nuclear NMR spectroscopy. The CUG-BP1 RRM3 exhibited a noncanonical RRM fold, with the four-stranded b-sheet surface tightly associated with the N-terminal extension. Furthermore, we determined the solution structure of the CUG-BP1 RRM3 in the complex with (UG)3 RNA, and discovered that the UGU trinucleotide is specifically recognized through extensive stacking interactions and hydrogen bonds within the pocket formed by the b-sheet surface and the N-terminal extension. This study revealed the unique mechanism that enables the CUG-BP1 RRM3 to discriminate the short RNA segment from other sequences, thus providing the molecular basis for the comprehension of the role of the RRM3s in the CELF/Bruno-like family.
To date, in-cell NMR has elucidated various aspects of protein behaviour by associating structures in physiological conditions. Meanwhile, current studies of this method mostly have deduced protein states in cells exclusively based on ‘indirect’ structural information from peak patterns and chemical shift changes but not ‘direct’ data explicitly including interatomic distances and angles. To fully understand the functions and physical properties of proteins inside cells, it is indispensable to obtain explicit structural data or determine three-dimensional (3D) structures of proteins in cells. Whilst the short lifetime of cells in a sample tube, low sample concentrations, and massive background signals make it difficult to observe NMR signals from proteins inside cells, several methodological advances help to overcome the problems. Paramagnetic effects have an outstanding potential for in-cell structural analysis. The combination of a limited amount of experimental in-cell data with software for ab initio protein structure prediction opens an avenue to visualise 3D protein structures inside cells. Conventional nuclear Overhauser effect spectroscopy (NOESY)-based structure determination is advantageous to elucidate the conformations of side-chain atoms of proteins as well as global structures. In this article, we review current progress for the structure analysis of proteins in living systems and discuss the feasibility of its future works.
An automated NMR chemical shift assignment algorithm was developed using multi-objective optimization techniques. The problem is modeled as a combinatorial optimization problem and its objective parameters are defined separately in different score functions. Some of the heuristic approaches of evolutionary optimization are employed in this problem model. Both, a conventional genetic algorithm and multi-objective methods, i.e., the non-dominated sorting genetic algorithms II and III (NSGA2 and NSGA3), are applied to the problem. The multi-objective approaches consider each objective parameter separately, whereas the genetic algorithm followed a conventional way, where all objectives are combined in one score function. Several improvement steps and repetitions on these algorithms are performed and their combinations are also created as a hyper-heuristic approach to the problem. Additionally, a hill-climbing algorithm is also applied after the evolutionary algorithm steps. The algorithms are tested on several different datasets with a set of 11 commonly used spectra. The test results showed that our algorithm could assign both sidechain and backbone atoms fully automatically without any manual interactions. Our approaches could provide around a 65% success rate and could assign some of the atoms that could not be assigned by other methods.