Refine
Year of publication
Language
- English (29)
Has Fulltext
- yes (29)
Is part of the Bibliography
- no (29)
Keywords
- orthology (2)
- Adhesion (1)
- Arabidopsis (1)
- Biodiversity (1)
- Brachiopoda (1)
- Brachiozoa (1)
- Bryozoa (1)
- Cell biology (1)
- Cetraria aculeata (1)
- Collembola (1)
Institute
- Biowissenschaften (24)
- Senckenbergische Naturforschende Gesellschaft (12)
- Biodiversität und Klima Forschungszentrum (BiK-F) (10)
- Medizin (6)
- Exzellenzcluster Makromolekulare Komplexe (3)
- Institut für Ökologie, Evolution und Diversität (3)
- Biochemie und Chemie (1)
- Buchmann Institut für Molekulare Lebenswissenschaften (BMLS) (1)
- Frankfurt Institute for Advanced Studies (FIAS) (1)
Bacterial and fungal toll-like receptor activation elicits type I IFN responses in mast cells
(2021)
Next to their role in IgE-mediated allergic diseases and in promoting inflammation, mast cells also have antiinflammatory functions. They release pro- as well as antiinflammatory mediators, depending on the biological setting. Here we aimed to better understand the role of mast cells during the resolution phase of a local inflammation induced with the Toll-like receptor (TLR)-2 agonist zymosan. Multiple sequential immunohistology combined with a statistical neighborhood analysis showed that mast cells are located in a predominantly antiinflammatory microenvironment during resolution of inflammation and that mast cell-deficiency causes decreased efferocytosis in the resolution phase. Accordingly, FACS analysis showed decreased phagocytosis of zymosan and neutrophils by macrophages in mast cell-deficient mice. mRNA sequencing using zymosan-induced bone marrow-derived mast cells (BMMC) revealed a strong type I interferon (IFN) response, which is known to enhance phagocytosis by macrophages. Both, zymosan and lipopolysaccharides (LPS) induced IFN-β synthesis in BMMCs in similar amounts as in bone marrow derived macrophages. IFN-β was expressed by mast cells in paws from naïve mice and during zymosan-induced inflammation. As described for macrophages the release of type I IFNs from mast cells depended on TLR internalization and endosome acidification. In conclusion, mast cells are able to produce several mediators including IFN-β, which are alone or in combination with each other able to regulate the phagocytotic activity of macrophages during resolution of inflammation.
Heat stress transcription factors (HSFs) regulate transcriptional response to a large number of environmental influences, such as temperature fluctuations and chemical compound applications. Plant HSFs represent a large and diverse gene family. The HSF members vary substantially both in gene expression patterns and molecular functions. HEATSTER is a web resource for mining, annotating, and analyzing members of the different classes of HSFs in plants. A web-interface allows the identification and class assignment of HSFs, intuitive searches in the database and visualization of conserved motifs, and domains to classify novel HSFs.
Background: Within the complex metazoan phylogeny, the relationships of the three lophophorate lineages, ectoprocts, brachiopods and phoronids, are particularly elusive. To shed further light on this issue, we present phylogenomic analyses of 196 genes from 58 bilaterian taxa, paying particular attention to the influence of compositional heterogeneity.
Results: The phylogenetic analyses strongly support the monophyly of Lophophorata and a sister-group relationship between Ectoprocta and Phoronida. Our results contrast previous findings based on rDNA sequences and phylogenomic datasets which supported monophyletic Polyzoa (= Bryozoa sensu lato) including Ectoprocta, Entoprocta and Cycliophora, Brachiozoa including Brachiopoda and Phoronida as well as Kryptrochozoa including Brachiopoda, Phoronida and Nemertea, thus rendering Lophophorata polyphyletic. Our attempts to identify the causes for the conflicting results revealed that Polyzoa, Brachiozoa and Kryptrochozoa are supported by character subsets with deviating amino acid compositions, whereas there is no indication for compositional heterogeneity in the character subsets supporting the monophyly of Lophophorata.
Conclusion: Our results indicate that the support for Polyzoa, Brachiozoa and Kryptrochozoa gathered so far is likely an artifact caused by compositional bias. The monophyly of Lophophorata implies that the horseshoe-shaped mesosomal lophophore, the tentacular feeding apparatus of ectoprocts, phoronids and brachiopods is, indeed, a synapomorphy of the lophophorate lineages. The same may apply to radial cleavage. However, among phoronids also spiral cleavage is known. This suggests that the cleavage pattern is highly plastic and has changed several times within lophophorates. The sister group relationship of ectoprocts and phoronids is in accordance with the interpretation of the eversion of a ventral invagination at the beginning of metamorphosis as a common derived feature of these taxa.
The taxon Syndermata comprises the biologically interesting wheel animals (“Rotifera”: Bdelloidea + Monogononta + Seisonidea) and thorny-headed worms (Acanthocephala), and is central for testing superordinate phylogenetic hypotheses (Platyzoa, Gnathifera) in the metazoan tree of life. Recent analyses of syndermatan phylogeny suggested paraphyly of Eurotatoria (free-living bdelloids and monogononts) with respect to endoparasitic acanthocephalans. Data of epizoic seisonids, however, were absent, which may have affected the branching order within the syndermatan clade. Moreover, the position of Seisonidea within Syndermata should help in understanding the evolution of acanthocephalan endoparasitism. Here, we report the first phylogenomic analysis that includes all four higher-ranked groups of Syndermata. The analyzed data sets comprise new transcriptome data for Seison spec. (Seisonidea), Brachionus manjavacas (Monogononta), Adineta vaga (Bdelloidea), and Paratenuisentis ambiguus (Acanthocephala). Maximum likelihood and Bayesian trees for a total of 19 metazoan species were reconstructed from up to 410 functionally diverse proteins. The results unanimously place Monogononta basally within Syndermata, and Bdelloidea appear as the sister group to a clade comprising epizoic Seisonidea and endoparasitic Acanthocephala. Our results support monophyly of Syndermata, Hemirotifera (Bdelloidea + Seisonidea + Acanthocephala), and Pararotatoria (Seisonidea + Acanthocephala), rejecting monophyly of traditional Rotifera and Eurotatoria. This serves as an indication that early acanthocephalans lived epizoically or as ectoparasites on arthropods, before their complex lifecycle with arthropod intermediate and vertebrate definite hosts evolved.
Translation fidelity and efficiency require multiple ribosomal (r)RNA modifications that are mostly mediated by small nucleolar (sno)RNPs during ribosome production. Overlapping basepairing of snoRNAs with pre-rRNAs often necessitates sequential and efficient association and dissociation of the snoRNPs, however, how such hierarchy is established has remained unknown so far. Here, we identify several late-acting snoRNAs that bind pre-40S particles in human cells and show that their association and function in pre-40S complexes is regulated by the RNA helicase DDX21. We map DDX21 crosslinking sites on pre-rRNAs and show their overlap with the basepairing sites of the affected snoRNAs. While DDX21 activity is required for recruitment of the late-acting snoRNAs SNORD56 and SNORD68, earlier snoRNAs are not affected by DDX21 depletion. Together, these observations provide an understanding of the timing and ordered hierarchy of snoRNP action in pre-40S maturation and reveal a novel mode of regulation of snoRNP function by an RNA helicase in human cells.
Calmodulins (CaMs) are important mediators of Ca2+ signals that are found ubiquitously in all eukaryotic organisms. Plants contain a unique family of calmodulin-like proteins (CMLs) that exhibit greater sequence variance compared to canonical CaMs. The Arabidopsis thaliana proteins AtCML4 and AtCML5 are members of CML subfamily VII and possess a CaM domain comprising the characteristic double pair of EF-hands, but they are distinguished from other members of this subfamily and from canonical CaMs by an N-terminal extension of their amino acid sequence. Transient expression of yellow fluorescent protein-tagged AtCML4 and AtCML5 under a 35S-promoter in Nicotiana benthamiana leaf cells revealed a spherical fluorescence pattern. This pattern was confirmed by transient expression in Arabidopsis protoplasts under the native promoter. Co-localization analyses with various endomembrane marker proteins suggest that AtCML4 and AtCML5 are localized to vesicular structures in the interphase between Golgi and the endosomal system. Further studies revealed AtCML5 to be a single-pass membrane protein that is targeted into the endomembrane system by an N-terminal signal anchor sequence. Self-assembly green fluorescent protein and protease protection assays support a topology with the CaM domain exposed to the cytosolic surface and not the lumen of the vesicles, indicating that AtCML5 could sense Ca 2+ signals in the cytosol. Phylogenetic analysis suggests that AtCML4 and AtCML5 are closely related paralogues originating from a duplication event within the Brassicaceae family. CML4/5-like proteins seem to be universally present in eudicots but are absent in some monocots. Together these results show that CML4/5-like proteins represent a flowering plant-specific subfamily of CMLs with a potential function in vesicle transport within the plant endomembrane system.
Orthologs document the evolution of genes and metabolic capacities encoded in extant and ancient genomes. However, the similarity between orthologs decays with time, and ultimately it becomes insufficient to infer common ancestry. This leaves ancient gene set reconstructions incomplete and distorted to an unknown extent. Here we introduce the "evolutionary traceability" as a measure that quantifies, for each protein, the evolutionary distance beyond which the sensitivity of the ortholog search becomes limiting. Using yeast, we show that genes that were thought to date back to the last universal common ancestor are of high traceability. Their functions mostly involve catalysis, ion transport, and ribonucleoprotein complex assembly. In turn, the fraction of yeast genes whose traceability is not sufficient to infer their presence in last universal common ancestor is enriched for regulatory functions. Computing the traceabilities of genes that have been experimentally characterized as being essential for a self-replicating cell reveals that many of the genes that lack orthologs outside bacteria have low traceability. This leaves open whether their orthologs in the eukaryotic and archaeal domains have been overlooked. Looking at the example of REC8, a protein essential for chromosome cohesion, we demonstrate how a traceability-informed adjustment of the search sensitivity identifies hitherto missed orthologs in the fast-evolving microsporidia. Taken together, the evolutionary traceability helps to differentiate between true absence and nondetection of orthologs, and thus improves our understanding about the evolutionary conservation of functional protein networks. "protTrace," a software tool for computing evolutionary traceability, is freely available at https://github.com/BIONF/protTrace.git; last accessed February 10, 2019.
Phylogenetic relationships of the primarily wingless insects are still considered unresolved. Even the most comprehensive phylogenomic studies that addressed this question did not yield congruent results. In order to get a grip on these problems, we here analyzed the sources of incongruence in these phylogenomic studies using an extended transcriptome dataset.Our analyses showed that unevenly distributed missing data can be severely misleading by inflating node support despite the absence of phylogenetic signal. In consequence, only decisive datasets should be used which exclusively comprise data blocks containing all taxa whose relationships are addressed. Additionally, we employed Four-cluster Likelihood-Mapping (FcLM) to measure the degree of congruence among genes of a dataset, as a measure of support alternative to bootstrap. FcLM showed incongruent signal among genes, which in our case is correlated with neither functional class assignment of these genes, nor with model misspecification due to unpartitioned analyses. The herein analyzed dataset is the currently largest dataset covering primarily wingless insects, but failed to elucidate their interordinal phylogenetic relationships. While this is unsatisfying from a phylogenetic perspective, we try to show that the analyses of structure and signal within phylogenomic data can protect us from biased phylogenetic inferences due to analytical artefacts.
Gene families evolve by the processes of speciation (creating orthologs), gene duplication (paralogs), and horizontal gene transfer (xenologs), in addition to sequence divergence and gene loss. Orthologs in particular play an essential role in comparative genomics and phylogenomic analyses. With the continued sequencing of organisms across the tree of life, the data are available to reconstruct the unique evolutionary histories of tens of thousands of gene families. Accurate reconstruction of these histories, however, is a challenging computational problem, and the focus of the Quest for Orthologs Consortium. We review the recent advances and outstanding challenges in this field, as revealed at a symposium and meeting held at the University of Southern California in 2017. Key advances have been made both at the level of orthology algorithm development and with respect to coordination across the community of algorithm developers and orthology end-users. Applications spanned a broad range, including gene function prediction, phylostratigraphy, genome evolution, and phylogenomics. The meetings highlighted the increasing use of meta-analyses integrating results from multiple different algorithms, and discussed ongoing challenges in orthology inference as well as the next steps toward improvement and integration of orthology resources.
Ribosome assembly is an essential and carefully choreographed cellular process. In eukaryotes, several 100 proteins, distributed across the nucleolus, nucleus, and cytoplasm, co-ordinate the step-wise assembly of four ribosomal RNAs (rRNAs) and approximately 80 ribosomal proteins (RPs) into the mature ribosomal subunits. Due to the inherent complexity of the assembly process, functional studies identifying ribosome biogenesis factors and, more importantly, their precise functions and interplay are confined to a few and very well-established model organisms. Although best characterized in yeast (Saccharomyces cerevisiae), emerging links to disease and the discovery of additional layers of regulation have recently encouraged deeper analysis of the pathway in human cells. In archaea, ribosome biogenesis is less well-understood. However, their simpler sub-cellular structure should allow a less elaborated assembly procedure, potentially providing insights into the functional essentials of ribosome biogenesis that evolved long before the diversification of archaea and eukaryotes. Here, we use a comprehensive phylogenetic profiling setup, integrating targeted ortholog searches with automated scoring of protein domain architecture similarities and an assessment of when search sensitivity becomes limiting, to trace 301 curated eukaryotic ribosome biogenesis factors across 982 taxa spanning the tree of life and including 727 archaea. We show that both factor loss and lineage-specific modifications of factor function modulate ribosome biogenesis, and we highlight that limited sensitivity of the ortholog search can confound evolutionary conclusions. Projecting into the archaeal domain, we find that only few factors are consistently present across the analyzed taxa, and lineage-specific loss is common. While members of the Asgard group are not special with respect to their inventory of ribosome biogenesis factors (RBFs), they unite the highest number of orthologs to eukaryotic RBFs in one taxon. Using large ribosomal subunit maturation as an example, we demonstrate that archaea pursue a simplified version of the corresponding steps in eukaryotes. Much of the complexity of this process evolved on the eukaryotic lineage by the duplication of ribosomal proteins and their subsequent functional diversification into ribosome biogenesis factors. This highlights that studying ribosome biogenesis in archaea provides fundamental information also for understanding the process in eukaryotes.
Accurate determination of the evolutionary relationships between genes is a foundational challenge in biology. Homology—evolutionary relatedness—is in many cases readily determined based on sequence similarity analysis. By contrast, whether or not two genes directly descended from a common ancestor by a speciation event (orthologs) or duplication event (paralogs) is more challenging, yet provides critical information on the history of a gene. Since 2009, this task has been the focus of the Quest for Orthologs (QFO) Consortium. The sixth QFO meeting took place in Okazaki, Japan in conjunction with the 67th National Institute for Basic Biology conference. Here, we report recent advances, applications, and oncoming challenges that were discussed during the conference. Steady progress has been made toward standardization and scalability of new and existing tools. A feature of the conference was the presentation of a panel of accessible tools for phylogenetic profiling and several developments to bring orthology beyond the gene unit—from domains to networks. This meeting brought into light several challenges to come: leveraging orthology computations to get the most of the incoming avalanche of genomic data, integrating orthology from domain to biological network levels, building better gene models, and adapting orthology approaches to the broad evolutionary and genomic diversity recognized in different forms of life and viruses.
Acinetobacter baumannii is a Gram-negative pathogen that causes a multitude of nosocomial infections. The Acinetobacter trimeric autotransporter adhesin (Ata) belongs to the superfamily of trimeric autotransporter adhesins which are important virulence factors in many Gram-negative species. Phylogenetic profiling revealed that ata is present in 78% of all sequenced A. baumannii isolates but only in 2% of the closely related species A. calcoaceticus and A. pittii. Employing a markerless ata deletion mutant of A. baumannii ATCC 19606 we show that adhesion to and invasion into human endothelial and epithelial cells depend on Ata. Infection of primary human umbilical cord vein endothelial cells (HUVECs) with A. baumannii led to the secretion of interleukin (IL)-6 and IL-8 in a time- and Ata-dependent manner. Furthermore, infection of HUVECs by WT A. baumannii was associated with higher rates of apoptosis via activation of caspases-3 and caspase-7, but not necrosis, in comparison to ∆ata. Ata deletion mutants were furthermore attenuated in their ability to kill larvae of Galleria mellonella and to survive in larvae when injected at sublethal doses. This indicates that Ata is an important multifunctional virulence factor in A. baumannii that mediates adhesion and invasion, induces apoptosis and contributes to pathogenicity in vivo.
In solid tumors, tumor‐associated macrophages (TAMs) commonly accumulate within hypoxic areas. Adaptations to such environments evoke transcriptional changes by the hypoxia‐inducible factors (HIFs). While HIF‐1α is ubiquitously expressed, HIF‐2α appears tissue‐specific with consequences of HIF‐2α expression in TAMs only being poorly characterized. An E0771 allograft breast tumor model revealed faster tumor growth in myeloid HIF‐2α knockout (HIF‐2αLysM−/−) compared with wildtype (wt) mice. In an RNA‐sequencing approach of FACS sorted wt and HIF‐2α LysM−/− TAMs, serine protease inhibitor, Kunitz type‐1 ( Spint1) emerged as a promising candidate for HIF‐2α‐dependent regulation. We validated reduced Spint1 messenger RNA expression and concomitant Spint1 protein secretion under hypoxia in HIF‐2α‐deficient bone marrow–derived macrophages (BMDMs) compared with wt BMDMs. In line with the physiological function of Spint1 as an inhibitor of hepatocyte growth factor (HGF) activation, supernatants of hypoxic HIF‐2α knockout BMDMs, not containing Spint1, were able to release proliferative properties of inactive pro‐HGF on breast tumor cells. In contrast, hypoxic wt BMDM supernatants containing abundant Spint1 amounts failed to do so. We propose that Spint1 contributes to the tumor‐suppressive function of HIF‐2α in TAMs in breast tumor development.
Ribosome biogenesis is fundamental for cellular life, but surprisingly little is known about the underlying pathway. In eukaryotes a comprehensive collection of experimentally verified ribosome biogenesis factors (RBFs) exists only for Saccharomyces cerevisiae. Far less is known for other fungi, animals or plants, and insights are even more limited for archaea. Starting from 255 yeast RBFs, we integrated ortholog searches, domain architecture comparisons and, in part, manual curation to investigate the inventories of RBF candidates in 261 eukaryotes, 26 archaea and 57 bacteria. The resulting phylogenetic profiles reveal the evolutionary ancestry of the yeast pathway. The oldest core comprising 20 RBF lineages dates back to the last universal common ancestor, while the youngest 20 factors are confined to the Saccharomycotina. On this basis, we outline similarities and differences of ribosome biogenesis across contemporary species. Archaea, so far a rather uncharted domain, possess 38 well-supported RBF candidates of which some are known to form functional sub-complexes in yeast. This provides initial evidence that ribosome biogenesis in eukaryotes and archaea follows similar principles. Within eukaryotes, RBF repertoires vary considerably. A comparison of yeast and human reveals that lineage-specific adaptation via RBF exclusion and addition characterizes the evolution of this ancient pathway.
What is in Umbilicaria pustulata? A metagenomic approach to reconstruct the holo-genome of a lichen
(2020)
Lichens are valuable models in symbiosis research and promising sources of biosynthetic genes for biotechnological applications. Most lichenized fungi grow slowly, resist aposymbiotic cultivation, and are poor candidates for experimentation. Obtaining contiguous, high-quality genomes for such symbiotic communities is technically challenging. Here, we present the first assembly of a lichen holo-genome from metagenomic whole-genome shotgun data comprising both PacBio long reads and Illumina short reads. The nuclear genomes of the two primary components of the lichen symbiosis—the fungus Umbilicaria pustulata (33 Mb) and the green alga Trebouxia sp. (53 Mb)—were assembled at contiguities comparable to single-species assemblies. The analysis of the read coverage pattern revealed a relative abundance of fungal to algal nuclei of ∼20:1. Gap-free, circular sequences for all organellar genomes were obtained. The bacterial community is dominated by Acidobacteriaceae and encompasses strains closely related to bacteria isolated from other lichens. Gene set analyses showed no evidence of horizontal gene transfer from algae or bacteria into the fungal genome. Our data suggest a lineage-specific loss of a putative gibberellin-20-oxidase in the fungus, a gene fusion in the fungal mitochondrion, and a relocation of an algal chloroplast gene to the algal nucleus. Major technical obstacles during reconstruction of the holo-genome were coverage differences among individual genomes surpassing three orders of magnitude. Moreover, we show that GC-rich inverted repeats paired with nonrandom sequencing error in PacBio data can result in missing gene predictions. This likely poses a general problem for genome assemblies based on long reads.
Protein disulfide isomerases (PDIs) support endoplasmic reticulum redox protein folding and cell-surface thiol-redox control of thrombosis and vascular remodeling. The family prototype PDIA1 regulates NADPH oxidase signaling and cytoskeleton organization, however the related underlying mechanisms are unclear. Here we show that genes encoding human PDIA1 and its two paralogs PDIA8 and PDIA2 are each flanked by genes encoding Rho guanine-dissociation inhibitors (GDI), known regulators of RhoGTPases/cytoskeleton. Evolutionary histories of these three microsyntenic regions reveal their emergence by two successive duplication events of a primordial gene pair in the last common vertebrate ancestor. The arrangement, however, is substantially older, detectable in echinoderms, nematodes, and cnidarians. Thus, PDI/RhoGDI pairing in the same transcription orientation emerged early in animal evolution and has been largely maintained. PDI/RhoGDI pairs are embedded into conserved genomic regions displaying common cis-regulatory elements. Analysis of gene expression datasets supports evidence for PDI/RhoGDI coexpression in developmental/inflammatory contexts. PDIA1/RhoGDIα were co-induced in endothelial cells upon CRISP-R-promoted transcription activation of each pair component, and also in mouse arterial intima during flow-induced remodeling. We provide evidence for physical interaction between both proteins. These data support strong functional links between PDI and RhoGDI families, which likely maintained PDI/RhoGDI microsynteny along > 800-million years of evolution.
Acinetobacter baumannii virulence is mediated by the concerted action of three phospholipases D
(2015)
Acinetobacter baumannii causes a broad range of opportunistic infections in humans. Its success as an emerging pathogen is due to a combination of increasing antibiotic resistance, environmental persistence and adaptation to the human host. To date very little is known about the molecular basis of the latter. Here we demonstrate that A. baumannii can use phosphatidylcholine, an integral part of human cell membranes, as sole carbon and energy source. We report on the identification of three phospholipases belonging to the PLD superfamily. PLD1 and PLD2 appear restricted to the bacteria and display the general features of bacterial phospholipases D. They possess two PLDc_2 PFAM domains each encompassing the HxKx4Dx6GS/GGxN (HKD) motif necessary for forming the catalytic core. The third candidate, PLD3, is found in bacteria as well as in eukaryotes and harbours only one PLDc_2 PFAM domain and one conserved HKD motif, which however do not overlap. Employing a markerless mutagenesis system for A. baumannii ATCC 19606T, we generated a full set of PLD knock-out mutants. Galleria mellonella infection studies as well as invasion experiments using A549 human lung epithelial cells revealed that the three PLDs act in a concerted manner as virulence factors and are playing an important role in host cell invasion.
Premise of the study: Polymorphic microsatellite markers were developed for the lichen species Cetraria aculeata (Parmeliaceae) to study fine-scale population diversity and phylogeographic structure.
Methods and Results: Using Illumina HiSeq and MiSeq, 15 fungus-specific microsatellite markers were developed and tested on 81 specimens from four populations from Spain. The number of alleles ranged from four to 13 alleles per locus with a mean of 7.9, and average gene diversities varied from 0.40 to 0.73 over four populations. The amplification rates of 10 markers (CA01– CA10) in populations of C. aculeata exceeded 85%. The markers also amplified across a range of closely related species, except for locus CA05, which did not amplify in C. australiensis and C. "panamericana," and locus CA10 which did not amplify in C. australiensis.
Conclusions: The identified microsatellite markers will be used to study the genetic diversity and phylogeographic structure in populations of C. aculeata in western Eurasia.
Orthologs document the evolution of genes and metabolic capacities encoded in extant and ancient genomes. Orthologous genes that are detected across the full diversity of contemporary life allow reconstructing the gene set of LUCA, the last universal common ancestor. These genes presumably represent the functional repertoire common to – and necessary for – all living organisms. Design of artificial life has the potential to test this. Recently, a minimal gene (MG) set for a self-replicating cell was determined experimentally, and a surprisingly high number of genes have unknown functions and are not represented in LUCA. However, as similarity between orthologs decays with time, it becomes insufficient to infer common ancestry, leaving ancient gene set reconstructions incomplete and distorted to an unknown extent. Here we introduce the evolutionary traceability, together with the software protTrace, that quantifies, for each protein, the evolutionary distance beyond which the sensitivity of the ortholog search becomes limiting. We show that the LUCA set comprises only high-traceable proteins most of which have catalytic functions. We further show that proteins in the MG set lacking orthologs outside bacteria mostly have low traceability, leaving open whether their eukaryotic orthologs have just been overlooked. On the example of REC8, a protein essential for chromosome cohesion, we demonstrate how a traceability-informed adjustment of the search sensitivity identifies hitherto missed orthologs in the fast-evolving microsporidia. Taken together, the evolutionary traceability helps to differentiate between true absence and non-detection of orthologs, and thus improves our understanding about the evolutionary conservation of functional protein networks.
Combinatorial CRISPR-Cas screens have advanced the mapping of genetic interactions, but their experimental scale limits the number of targetable gene combinations. Here, we describe 3Cs multiplexing, a rapid and scalable method to generate highly diverse and uniformly distributed combinatorial CRISPR libraries. We demonstrate that the library distribution skew is the critical determinant of its required screening coverage. By circumventing iterative cloning of PCR-amplified oligonucleotides, 3Cs multiplexing facilitates the generation of combinatorial CRISPR libraries with low distribution skews. We show that combinatorial 3Cs libraries can be screened with minimal coverages, reducing associated efforts and costs at least 10-fold. We apply a 3Cs multiplexing library targeting 12,736 autophagy gene combinations with 247,032 paired gRNAs in viability and reporter-based enrichment screens. In the viability screen, we identify, among others, the synthetic lethal WDR45B-PIK3R4 and the proliferation-enhancing ATG7-KEAP1 genetic interactions. In the reporter-based screen, we identify over 1,570 essential genetic interactions for autophagy flux, including interactions among paralogous genes, namely ATG2A-ATG2B, GABARAP-MAP1LC3B and GABARAP-GABARAPL2. However, we only observe few genetic interactions within paralogous gene families of more than two members, indicating functional compensation between them. This work establishes 3Cs multiplexing as a platform for genetic interaction screens at scale.
Motivation Expert curation to differentiate between functionally diverged homologs and those that may still share a similar function routinely relies on the visual interpretation of domain architecture changes. However, the size of contemporary data sets integrating homologs from hundreds to thousands of species calls for alternate solutions. Scoring schemes to evaluate domain architecture similarities can help to automatize this procedure, in principle. But existing schemes are often too simplistic in the similarity assessment, many require an a-priori resolution of overlapping domain annotations, and those that allow overlaps to extend the set of annotations sources cannot account for redundant annotations. As a consequence, the gap between the automated similarity scoring and the similarity assessment based on visual architecture comparison is still too wide to make the integration of both approaches meaningful.
Results Here, we present FAS, a scoring system for the comparison of multi-layered feature architectures integrating information from a broad spectrum of annotation sources. Feature architectures are represented as directed acyclic graphs, and redundancies are resolved in the course of comparison using a score maximization algorithm. A benchmark using more than 10,000 human-yeast ortholog pairs reveals that FAS consistently outperforms existing scoring schemes. Using three examples, we show how automated architecture similarity assessments can be routinely applied in the benchmarking of orthology assignment software, in the identification of functionally diverged orthologs, and in the identification of entries in protein collections that most likely stem from a faulty gene prediction.
Molluscs are the second most species-rich phylum in the animal kingdom, yet only eleven genomes of this group have been published so far. Here, we present the draft genome sequence of the pulmonate freshwater snail Radix auricularia. Six whole genome shotgun libraries with different layouts were sequenced. The resulting assembly comprises 4,823 scaffolds with a cumulative length of 910 Mb and an overall read coverage of 72x. The assembly contains 94.6 % of a metazoan core gene collection, indicating an almost complete coverage of the coding fraction. The discrepancy of ~690 Mb compared to the estimated genome size of R. auricularia (1.6 Gb) results from a high repeat content of 70 % mainly comprising DNA transposons. The annotation of 17,338 protein coding genes was supported by the use of publicly-available transcriptome data. This draft will serve as starting point for further genomic and population genetic research in this scientifically important phylum.
Background: Alternative splicing is a key regulatory mechanism in eukaryotic cells and increases the effective number of functionally distinct gene products. Using bulk RNA sequencing, splicing variation has been studied across human tissues and in genetically diverse populations. This has identified disease-relevant splicing events, as well as associations between splicing and genomic variations, including sequence composition and conservation. However, variability in splicing between single cells from the same tissue or cell type and its determinants remain poorly understood.
Results: We applied parallel DNA methylation and transcriptome sequencing to differentiating human induced pluripotent stem cells to characterize splicing variation (exon skipping) and its determinants. Our results shows that variation in single-cell splicing can be accurately predicted based on local sequence composition and genomic features. We observe moderate but consistent contributions from local DNA methylation profiles to splicing variation across cells. A combined model that is built based on sequence as well as DNA methylation information accurately predicts different splicing modes of individual cassette exons (AUC=0.85). These categories include the conventional inclusion and exclusion patterns, but also more subtle modes of cell-to-cell variation in splicing. Finally, we identified and characterized associations between DNA methylation and splicing changes during cell differentiation.
Conclusions: Our study yields new insights into alternative splicing at the single-cell level and reveals a previously underappreciated link between DNA methylation variation and splicing.
Molluscs are the second most species-rich phylum in the animal kingdom, yet only 11 genomes of this group have been published so far. Here, we present the draft genome sequence of the pulmonate freshwater snail Radix auricularia. Six whole genome shotgun libraries with different layouts were sequenced. The resulting assembly comprises 4,823 scaffolds with a cumulative length of 910 Mb and an overall read coverage of 72×. The assembly contains 94.6% of a metazoan core gene collection, indicating an almost complete coverage of the coding fraction. The discrepancy of ∼690 Mb compared with the estimated genome size of R. auricularia (1.6 Gb) results from a high repeat content of 70% mainly comprising DNA transposons. The annotation of 17,338 protein coding genes was supported by the use of publicly available transcriptome data. This draft will serve as starting point for further genomic and population genetic research in this scientifically important phylum.
Functional genomics studies in model organisms and human cell lines provided important insights into gene functions and their context-dependent role in genetic circuits. However, our functional understanding of many of these genes and how they combinatorically regulate key biological processes, remains limited. To enable the SpCas9-dependent mapping of gene-gene interactions in human cells, we established 3Cs multiplexing for the generation of combinatorial gRNA libraries in a distribution-unbiased manner and demonstrate its robust performance. The optimal number for combinatorial hit calling was 16 gRNA pairs and the skew of a library’s distribution was identified as a critical parameter dictating experimental scale and data quality. Our approach enabled us to investigate 247,032 gRNA-pairs targeting 12,736 gene-interactions in human autophagy. We identified novel genes essential for autophagy and provide experimental evidence that gene-associated categories of phenotypic strengths exist in autophagy. Furthermore, circuits of autophagy gene interactions reveal redundant nodes driven by paralog genes. Our combinatorial 3Cs approach is broadly suitable to investigate unexpected gene-interaction phenotypes in unperturbed and diseased cell contexts.
Background: Alternative splicing is a key mechanism in eukaryotic cells to increase the effective number of functionally distinct gene products. Using bulk RNA sequencing, splicing variation has been studied both across human tissues and in genetically diverse individuals. This has identified disease-relevant splicing events, as well as associations between splicing and genomic variations, including sequence composition and conservation. However, variability in splicing between single cells from the same tissue and its determinants remain poorly understood.
Results: We applied parallel DNA methylation and transcriptome sequencing to differentiating human induced pluripotent stem cells to characterize splicing variation (exon skipping) and its determinants. Our results shows that splicing rates in single cells can be accurately predicted based on sequence composition and other genomic features. We also identified a moderate but significant contribution from DNA methylation to splicing variation across cells. By combining sequence information and DNA methylation, we derived an accurate model (AUC=0.85) for predicting different splicing modes of individual cassette exons. These explain conventional inclusion and exclusion patterns, but also more subtle modes of cell-to-cell variation in splicing. Finally, we identified and characterized associations between DNA methylation and splicing changes during cell differentiation.
Conclusions: Our study yields new insights into alternative splicing at the single-cell level and reveals a previously underappreciated component of DNA methylation variation on splicing.
Background: Alternative splicing is a key regulatory mechanism in eukaryotic cells and increases the effective number of functionally distinct gene products. Using bulk RNA sequencing, splicing variation has been studied across human tissues and in genetically diverse populations. This has identified disease-relevant splicing events, as well as associations between splicing and genomic features, including sequence composition and conservation. However, variability in splicing between single cells from the same tissue or cell type and its determinants remains poorly understood.
Results: We applied parallel DNA methylation and transcriptome sequencing to differentiating human induced pluripotent stem cells to characterize splicing variation (exon skipping) and its determinants. Our results show that variation in single-cell splicing can be accurately predicted based on local sequence composition and genomic features. We observe moderate but consistent contributions from local DNA methylation profiles to splicing variation across cells. A combined model that is built based on genomic features as well as DNA methylation information accurately predicts different splicing modes of individual cassette exons. These categories include the conventional inclusion and exclusion patterns, but also more subtle modes of cell-to-cell variation in splicing. Finally, we identified and characterized associations between DNA methylation and splicing changes during cell differentiation.
Conclusions: Our study yields new insights into alternative splicing at the single-cell level and reveals a previously underappreciated link between DNA methylation variation and splicing.