Institut für Ökologie, Evolution und Diversität
Refine
Year of publication
Document Type
- Preprint (22) (remove)
Language
- English (22)
Has Fulltext
- yes (22)
Is part of the Bibliography
- no (22)
Keywords
- Chiton (1)
- Chocó rainforest (1)
- East Africa (1)
- Ecuador (1)
- Giraffa (1)
- Mollusca (1)
- Phylogenomics (1)
- Polyplacophora (1)
- aboveground biomass (1)
- biodiversity (1)
Institute
- Senckenbergische Naturforschende Gesellschaft (22) (remove)
The snake pipefish, Entelurus aequoreus (Linnaeus, 1758), is a slender, up to 60 cm long, northern Atlantic fish that dwells in open seagrass habitats and has recently expanded its distribution range. The snake pipefish is part of the family Syngnathidae (seahorses and pipefish) that has undergone several characteristic morphological changes, such as loss of pelvic fins and elongated snout. Here, we present a highly contiguous, near chromosome-scale genome of the snake pipefish assembled as part of a university master’s course. The final assembly has a length of 1.6 Gbp in 7,391 scaffolds, a scaffold and contig N50 of 62.3 Mbp and 45.0 Mbp and L50 of 12 and 14, respectively. The largest 28 scaffolds (>21 Mbp) span 89.7% of the assembly length. A BUSCO completeness score of 94.1% and a mapping rate above 98% suggest a high assembly completeness. Repetitive elements cover 74.93% of the genome, one of the highest proportions so far identified in vertebrate genomes. Demographic modeling using the PSMC framework indicates a peak in effective population size (50 – 100 kya) during the last interglacial period and suggests that the species might largely benefit from warmer water conditions, as seen today. Our updated snake pipefish assembly forms an important foundation for further analysis of the morphological and molecular changes unique to the family Syngnathidae.
All giraffe (Giraffa) were previously assigned to a single species (G. Camelopardalis) and nine subspecies. However, multi-locus analyses of all subspecies have shown that there are four genetically distinct clades and suggest four giraffe species. This conclusion might not be fully accepted due to limited data and lack of explicit gene flow analyses. Here we present an extended study based on 21 independent nuclear loci from 137 individuals. Explicit gene flow analyses identify less than one migrant per generation, including between the closely related northern and reticulated giraffe. Thus, gene flow analyses and population genetics of the extended dataset confirm four genetically distinct giraffe clades and support four independent giraffe species. The new findings call for a revision of the IUCN classification of giraffe taxonomy. Three of the four species are threatened with extinction, mostly occurring in politically unstable regions, and as such, require the highest conservation support possible.
The gradual heterogeneity of climatic factors pose varying selection pressures across geographic distances that leave signatures of clinal variation in the genome. Separating signatures of clinal adaptation from signatures of other evolutionary forces, such as demographic processes, genetic drift, and adaptation to non-clinal conditions of the immediate local environment is a major challenge. Here, we examine climate adaptation in five natural populations of the harlequin fly Chironomus riparius sampled along a climatic gradient across Europe. Our study integrates experimental data, individual genome resequencing, Pool-Seq data, and population genetic modelling. Common-garden experiments revealed a positive correlation of population growth rates corresponding to the population origin along the climate gradient, suggesting thermal adaptation on the phenotypic level. Based on a population genomic analysis, we derived empirical estimates of historical demography and migration. We used an FST outlier approach to infer positive selection across the climate gradient, in combination with an environmental association analysis. In total we identified 162 candidate genes as genomic basis of climate adaptation. Enriched functions among these candidate genes involved the apoptotic process and molecular response to heat, as well as functions identified in other studies of climate adaptation in other insects. Our results show that local climate conditions impose strong selection pressures and lead to genomic adaptation despite strong gene flow. Moreover, these results imply that selection to different climatic conditions seems to converge on a functional level, at least between different insect species.
Ongoing climate change is a major threat to biodiversity and impacts on species distributions and abundances are already evident. Heterogenous responses of species due to varying abiotic tolerances and dispersal abilities have the potential to further amplify or ameliorate these impacts through changes in species assemblages. Here we investigate the impacts of climate change on terrestrial bird distributions and, subsequently, on species richness as well as on different aspects of phylogenetic diversity of species assemblages across the globe. We go beyond previous work by disentangling the potential impacts on assemblage phylogenetic diversity of species gains vs. losses under climate change and compare the projected impacts to randomized assemblage changes.
We show that climate change might not only affect species numbers and composition of global species assemblages but could also have profound impacts on assemblage phylogenetic diversity, which, across extensive areas, differ significantly from random changes. Both the projected impacts on phylogenetic diversity and on phylogenetic structure vary greatly across the globe. Projected increases in the evolutionary history contained within species assemblages, associated with either increasing phylogenetic diversification or clustering, are most frequent at high northern latitudes. By contrast, projected declines in evolutionary history, associated with increasing phylogenetic over-dispersion or homogenisation, are projected across all continents.
The projected widespread changes in the phylogenetic structure of species assemblages show that changes in species richness do not fully reflect the potential threat from climate change to ecosystems. Our results indicate that the most severe changes to the phylogenetic diversity and structure of species assemblages are likely to be caused by species range shifts rather than range reductions and extinctions. Our findings highlight the importance of considering diverse measures in climate impact assessments and the value of integrating species-specific responses into assessments of entire community changes.
Genome mining as a biotechnological tool for the discovery of novel biosynthetic genes in lichens
(2022)
The ever-increasing demand for novel drugs highlights the need for bioprospecting unexplored taxa for their biosynthetic potential. Lichen-forming fungi (LFF) are a rich source of natural products but their implementation in pharmaceutical industry is limited, mostly because the genes corresponding to a majority of their natural products is unknown. Furthermore, it is not known to what extent these genes encode structurally novel molecules. Advance in next-generation sequencing technologies has expanded the range of organisms that could be exploited for their biosynthetic potential. In this study, we mine the genomes of nine lichen-forming fungal species of the genus Umbilicaria for biosynthetic genes, and categorize the BGCs as “associated product structurally known”, and “associated product putatively novel”. We found that about 25-30% of the biosynthetic genes are divergent when compared to the global database of BGCs comprising of 1,200,000 characterized biosynthetic genes from planta, bacteria and fungi. Out of 217 total BGCs, 43 were only distantly related to known BGCs, suggesting they encode structurally and functionally unknown natural products. Clusters encoding the putatively novel metabolic diversity comprise PKSs (30), NRPSs (12) and terpenes (1). Our study emphasizes the utility of genomic data in bioprospecting microorganisms for their biosynthetic potential and in advancing the industrial application of unexplored taxa. We highlight the untapped structural metabolic diversity encoded in the lichenized fungal genomes. To the best of our knowledge, this is the first investigation identifying genes coding for NPs with potentially novel therapeutic properties in LFF.
A candidate gene cluster for the bioactive natural product gyrophoric acid in lichen-forming fungi
(2022)
Natural products of lichen-forming fungi are structurally diverse and have a variety of medicinal properties. Despite this, they a have limited implementation in industry, because the corresponding genes remain unknown for most of the natural products. Here we implement a long-read sequencing and bioinformatic approach to identify the biosynthetic gene cluster of the bioactive natural product gyrophoric acid (GA). Using 15 high-quality genomes representing nine GA-producing species of the lichen-forming fungal genus Umbilicaria, we identify the most likely GA cluster and investigate cluster gene organization and composition across the nine species. Our results show that GA clusters are promiscuous within Umbilicaria, with only three genes that are conserved across species, including the PKS gene. In addition, our results suggest that the same cluster codes for different but structurally similar NPs, i.e., GA, umbilicaric acid and hiascic acid, bringing new evidence that lichen metabolite diversity is also generated through regulatory mechanisms at the molecular level. Ours is the first study to identify the most likely GA cluster, and thus provides essential information to open new avenues for biotechnological approaches to producing and modifying GA and similar lichen-derived compounds. We show that bioinformatics approaches are useful in linking genes and potentially associated natural products. Genome analyses help unlocking the pharmaceutical potential of organisms such as lichens, which are biosynthetically diverse but slow growing, and difficult to cultivate due to their symbiotic nature.
The basidiomycete smut fungi are predominantly plant parasitic, causing severe losses in some crops. Most species feature a saprotrophic haploid yeast stage, and several smut fungi are only known from this stage, with some isolated from habitats without suitable hosts, e.g. from Antarctica. Thus, these species are generally believed to be apathogenic, but recent findings that some of these might have a plant pathogenic sexual counterpart, casts doubts on the validity of this hypothesis. Here, four Pseudozyma genomes were re-annotated and compared to published smut pathogens and the well-characterised effector gene Pep1 from these species was checked for its ability to complement a Pep1 deletion strain of Ustilago maydis. It was found that 113 high-confidence putative effector proteins were conserved among smut and Pseudozyma genomes. Among these were several validated effector proteins, including Pep1. By genetic complementation we show that Pep1 homologs from the supposedly apathogenic yeasts restore virulence in Pep1-deficient mutants Ustilago maydis. Thus, it is concluded that Pseudozyma species have retained a suite of effectors. This hints at the possibility that Pseudozyma species have kept an unknown plant pathogenic stage for sexual recombination or that these effectors have positive effects when colonising plant surfaces.
Molluscs are the second most species-rich phylum in the animal kingdom, yet only eleven genomes of this group have been published so far. Here, we present the draft genome sequence of the pulmonate freshwater snail Radix auricularia. Six whole genome shotgun libraries with different layouts were sequenced. The resulting assembly comprises 4,823 scaffolds with a cumulative length of 910 Mb and an overall read coverage of 72x. The assembly contains 94.6 % of a metazoan core gene collection, indicating an almost complete coverage of the coding fraction. The discrepancy of ~690 Mb compared to the estimated genome size of R. auricularia (1.6 Gb) results from a high repeat content of 70 % mainly comprising DNA transposons. The annotation of 17,338 protein coding genes was supported by the use of publicly-available transcriptome data. This draft will serve as starting point for further genomic and population genetic research in this scientifically important phylum.
Precise estimates of genome sizes are important parameters for both theoretical and practical biodiversity genomics. We present here a fast, easy-to-implement and precise method to estimate genome size from the number of bases sequenced and the mean sequence coverage. To estimate the latter, we take advantage of the fact that a precise estimation of the Poisson distribution parameter lambda is possible from truncated data, restricted to the part of the coverage distribution representing the true underlying distribution. With simulations we could show that reasonable genome size estimates can be gained even from low-coverage (10X), highly discontinuous genome drafts. Comparison of estimates from a wide range of taxa and sequencing strategies with flow-cytometry estimates of the same individuals showed a very good fit and suggested that both methods yield comparable, interchangeable results.
In the course of global climate change, central Europe is experiencing more frequent and prolonged periods of drought. The drought years 2018 and 2019 affected European beeches (Fagus sylvatica L.) differently: even in the same stand, drought damaged trees neighboured healthy trees, suggesting that the genotype rather than the environment was responsible for this conspicuous pattern. We used this natural experiment to study the genomic basis of drought resistance with Pool-GWAS. Contrasting the extreme phenotypes identified 106 significantly associated SNPs throughout the genome. Most annotated genes with associated SNPs (>70%) were previously implicated in the drought reaction of plants. Non-synonymous substitutions led either to a functional amino acid exchange or premature termination. A SNP-assay with 70 loci allowed predicting drought phenotype in 98.6% of a validation sample of 92 trees. Drought resistance in European beech is a moderately polygenic trait that should respond well to natural selection, selective management, and breeding.
Active transposable elements (TEs) may result in divergent genomic insertion and abundance patterns among conspecific populations. Upon secondary contact, such divergent genetic backgrounds can theoretically give rise to classical Dobzhansky-Muller incompatibilities (DMI), a way how TEs can contribute to the evolution of endogenous genetic barriers and eventually population divergence. We investigated whether differential TE activity created endogenous selection pressures among conspecific populations of the non-biting midge Chironomus riparius, focussing on a Chironomus-specific TE, the minisatellite-like Cla-element, whose activity is associated with speciation in the genus. Using an improved and annotated draft genome for a genomic study with five natural C. riparius populations, we found highly population-specific TE insertion patterns with many private insertions. A highly significant correlation of pairwise population FST from genome-wide SNPs with the FST estimated from TEs suggests drift as the major force driving TE population differentiation. However, the significantly higher Cla-element FST level due to a high proportion of differentially fixed Cla-element insertions indicates that segregating, i.e. heterozygous insertions are selected against. With reciprocal crossing experiments and fluorescent in-situ hybridisation of Cla-elements to polytene chromosomes, we documented phenotypic effects on female fertility and chromosomal mispairings that might be linked to DMI in hybrids. We propose that the inferred negative selection on heterozygous Cla-element insertions causes endogenous genetic barriers and therefore acts as DMI among C. riparius populations. The intrinsic genomic turnover exerted by TEs, thus, may have a direct impact on population divergence that is operationally different from drift and local adaptation.
Plant communities provide floral resource-landscapes for pollinators. Yet, it is insufficiently understood how these landscapes shape pollinator-mediated interactions among multiple plant species. Here, we study how pollinators and the seed set of plants respond to the distribution of a floral resource (nectar sugar) in space and across plant species, inflorescences and flowering phenologies. In a global biodiversity hotspot, we quantified floral resource-landscapes on 27 sites of 4 ha comprising 127,993 shrubs of 19 species. Visitation rates of key bird pollinators strongly depended on the phenology of site-scale resource amounts. Seed set of focal plants increased with resources of conspecific neighbours and with site-scale resources, notably with heterospecific resources of lower quality (less sugar per inflorescence). Floral resources are thus a common currency determining how multiple plant species interact via pollinators. These interactions may alter conditions for species coexistence in plant communities and cause community-level Allee effects that promote extinction cascades.
The European Beech is the dominant climax tree in most regions of Central Europe and valued for its ecological versatility and hardwood timber. Even though a draft genome has been published recently, higher resolution is required for studying aspects of genome architecture and recombination. Here we present a chromosome-level assembly of the more than 300 year-old reference individual, Bhaga, from the Kellerwald-Edersee National Park (Germany). Its nuclear genome of 541 Mb was resolved into 12 chromosomes varying in length between 28 Mb and 73 Mb. Multiple nuclear insertions of parts of the chloroplast genome were observed, with one region on chromosome 11 spanning more than 2 Mb of the genome in which fragments up to 54,784 bp long and covering the whole chloroplast genome were inserted randomly. Unlike in Arabidopsis thaliana, ribosomal cistrons are present in Fagus sylvatica only in four major regions, in line with FISH studies. On most assembled chromosomes, telomeric repeats were found at both ends, while centromeric repeats were found to be scattered throughout the genome apart from their main occurrence per chromosome. The genome- wide distribution of SNPs was evaluated using a second individual from Jamy Nature Reserve (Poland). SNPs, repeat elements and duplicated genes were unevenly distributed in the genomes, with one major anomaly on chromosome 4. The genome presented here adds to the available highly resolved plant genomes and we hope it will serve as a valuable basis for future research on genome architecture and for understanding the past and future of European Beech populations in a changing climate.
Intraspecific genomic variability affects a species’ adaptive potential towards climatic conditions. Variation in gene content across populations and environments may point at genomic adaptations to specific environments. The lichen symbiosis, a stable association of fungal and photobiont partners, offers an excellent system to study environmentally driven gene content variation. Many species have remarkable environmental tolerances, and often form populations in different climate zones. Here we combine comparative and population genomics to assess the presence and absence of genes in high elevation and low elevation genomes of two lichenized fungi of the genus Umbilicaria. The two species have non-overlapping ranges, but occupy similar climatic niches in North America (U. phaea) and Europe (U. pustulata): high elevation populations are located in the cold temperate zone and low elevation populations in the Mediterranean zone. We assessed gene content variation along replicated elevation gradients in each of the two species, based on a total of 2050 individuals across 26 populations. Specifically, we assessed shared orthologs across species within the same climate zone, and tracked which genes increase or decrease in abundance within populations along elevation. In total, we found 16 orthogroups with shared orthologous genes in genomes at low elevation and 13 at high elevation. Coverage analysis revealed one ortholog that is exclusive to genomes at low elevation. Conserved domain search revealed domains common to the protein kinases (PKs) superfamily. We traced the discovered ortholog in populations along five replicated elevation gradients on both continents. The protein kinase gene linearly declined in abundance with increasing elevation, and was absent in the highest populations. We consider the parallel loss of an ortholog in two species and in two geographic settings a rare find, and a step forward in understanding the genomic underpinnings of climatic tolerances in lichenized fungi. In addition, the tracking of gene content variation provides a widely applicable framework for retrieving biogeographical determinants of gene presence/absence patterns. Our work provides insights into gene content variation of lichenized fungi in relation to climatic gradients, suggesting a new research direction with implications for understanding evolutionary trajectories of complex symbioses in relation to climatic change.
The plant family Brassicaceae includes some of the most studied hosts of plant microbiomes, targeting microbial diversity, community assembly rules, and effects on host performance. Compared to bacteria, eukaryotes in the brassicaceous microbiome remain understudied, especially under natural settings. Here, we assessed the impact of host identity and age on the assembly of fungal and oomycete root communities, using DNA metabarcoding of roots and associated soil of three annual co-habiting Brassicaceae collected at two time points. Our results showed that fungal communities are more diverse and structured than those of oomycetes. In both cases, plant identity and sampling time had little influence on community variation, whereas root/soil compartment had a strong effect by exerting control on the entry of soil microorganisms into the roots. The enrichment in roots of specific fungi suggests a specialization towards the asymptomatic colonization of plant tissues, which could be relevant to host’s fitness and health.
Mollusca is the second-largest animal phylum with over 100,000 species among eight distinct taxonomic classes. Across 1000 living species in the class Polyplacophora, chitons have a relatively constrained morphology but with some notable deviations. Several genera possess “shell eyes”, true eyes with a lens and retina that are embedded within the dorsal shells, which represent the most recent evolution of animal eyes. The phylogeny of major chiton clades is mostly well established, in a set of superfamily and higher-level taxa supported by various approaches including multiple gene markers, mitogenome-phylogeny and phylotranscritomic approaches as well as morphological studies. However, one critical lineage has remained unclear: Schizochiton was controversially suggested as a potential independent origin of chiton shell eyes. Here, with the draft genome sequencing of Schizochiton incisus (superfamily Schizochitonoidea) plus assembly of transcriptome data from other polyplacophorans, we present phylogenetic reconstructions using both mitochondrial genomes and phylogenomic approaches with multiple methods. Phylogenetic trees from mitogenomic data are inconsistent, reflecting larger scale confounding factors in molluscan mitogenomes. A consistent robust topology was generated with protein coding genes using different models and methods. Our results support Schizochitonoidea is a sister group to other Chitonoidea in Chitonina, in agreement with established classification. This suggests that the earliest origin of shell eyes is in Schizochitonoidea, which were also gained secondarily in other genera in Chitonoidea. Our results have generated a holistic review of the internal relationship within Polyplacophora, and a better understanding on the evolution of Polyplacophora.
Compared to sequence analyses, phylogenetic reconstruction from transposable elements (TEs) offers an additional perspective to study evolutionary processes. However, detecting phylogenetically informative TE insertions requires tedious experimental work, limiting the power of phylogenetic inference. Here, we analyzed the genomes of seven bear species using high throughput sequencing data to detect thousands of TE insertions. The newly developed pipeline for TE detection called TeddyPi (TE detection and discovery for Phylogenetic Inference) obtained 150,513 high-quality TE insertions in the genomes of ursine and tremarctine bears. By integrating different TE insertion callers and using a stringent filtering approach, the TeddyPi pipeline produced highly reliable TE insertion calls, which were confirmed by extensive in vitro validation experiments. Screening for single nucleotide substitutions in the flanking regions of the TEs show that these substitutions correlate with the phylogenetic signal from the TE insertions. Our phylogenomic analyses show that TEs are a major driver of genomic variation in bears and enabled phylogenetic reconstruction of a well-resolved species tree, even with strong signals for incomplete lineage sorting and introgression. The analyses show that the Asiatic black, sun and sloth bear form a monophyletic clade. TeddyPi is open source and can be adapted to various TE and structural variation callers. The pipeline makes it easy to confidently extract thousands of TE insertions even from low coverage genomes of non-model organisms, opening new possibilities for biologists to study phylogenies, evolutionary processes as well as rates and patterns of (retro-)transposition and structural variation.
Bears are iconic mammals with a complex evolutionary history. Natural bear hybrids and studies of few nuclear genes indicate that gene flow among bears may be more common than expected and not limited to the closely related polar and brown bears. Here we present a genome analysis of the bear family with representatives of all living species. Phylogenomic analyses of 869 mega base pairs divided into 18,621 genome fragments yielded a well-resolved coalescent species tree despite signals for extensive gene flow across species. However, genome analyses using three different statistical methods show that gene flow is not limited to closely related species pairs. Strong ancestral gene flow between the Asiatic black bear and the ancestor to polar, brown and American black bear explains numerous uncertainties in reconstructing the bear phylogeny. Gene flow across the bear clade may be mediated by intermediate species such as the geographically wide-spread brown bears leading to massive amounts of phylogenetic conflict. Genome-scale analyses lead to a more complete understanding of complex evolutionary processes. The increasing evidence for extensive inter-specific gene flow, found also in other animal species, necessitates shifting the attention from speciation processes achieving genome-wide reproductive isolation to the selective processes that maintain species divergence in the face of gene flow.
From hunting and foraging to clearing land for agriculture, humans modify forest biodiversity, landscapes, and climate. Forests constantly undergo disturbance–recovery dynamics and understanding them is a major objective of ecologists and conservationists. Chronosequences are a useful tool for understanding global restoration efforts. They represent a space-for-time substitution approach suited for the quantification of the resistance of ecosystem properties to withstand disturbance and the resilience of these properties until reaching pre-disturbance levels. Here we introduce a newly established chronosequence with 62 plots (50 ⍰ 50 m) in active cacao plantations and pastures, early and late regeneration, and mature old-growth forests, across a 200 km2 area in the extremely wet Chocó rainforest. Our chronosequence covers by far the largest total area of plots compared to others in the Neotropics. Plots ranged from 159–615 masl in a forested landscape with 74 ± 2.8 % forest cover within a 1-km radius including substantial old-growth forest cover. Land-use legacy and regeneration time were not confounded by elevation. We tested how six forest structure variables (maximum tree height and DBH, basal area, number of stems, vertical vegetation heterogeneity, and light availability), aboveground biomass (AGB), and rarefied tree species richness change along our chronosequence. Forest structure variables, AGB, and tree species richness increased with regeneration time and are predicted to reach similar levels to those in old-growth forests after ca. 30–116, 202, and 108 yrs, respectively. Compared to previous work in the Neotropics, old-growth forests in Canandé accumulate high AGB that takes one of the largest time spans reported until total recovery. Our chronosequence comprises one of the largest tree species pools, covers the largest total area of regenerating and old-growth forests, and has higher forest cover than other Neotropical chronosequences. Hence, our chronosequence can be used to determine the time for recovery and stability (resistance and resilience) of different taxa and ecosystem functions, including species interaction networks. This integrative effort will ultimately help to understand how one of the most diverse forests on the planet recovers from large-scale disturbances.
Tree bark constitutes ideal habitat for microbial communities, because it is a stable substrate, rich in micro-niches. Bacteria, fungi, and terrestrial microalgae together form microbial communities, which in turn support more bark-associated organisms, such as mosses, lichens, and invertebrates, thus contributing to forest biodiversity. We have a limited understanding of the diversity and biotic interactions of the bark-associated microbiome, as investigations have mainly focussed on agriculturally relevant systems and on single taxonomic groups. Here we implemented a multi-kingdom metabarcoding approach to analyse diversity and community structure of the green algal, bacterial, and fungal components of the bark-associated microbial communities of beech, the most common broadleaved tree of Central European forests. We identified the most abundant taxa, hub taxa, and co-occurring taxa. We found that tree size (as a proxy for age) is an important driver of community assembly, suggesting that environmental filtering leads to less diverse fungal and algal communities over time. Conversely, forest management intensity had negligible effects on microbial communities on bark. Our study suggests the presence of undescribed, yet ecologically meaningful taxa, especially in the fungi, and highlights the importance of bark surfaces as a reservoir of microbial diversity. Our results constitute a first, essential step towards an integrated framework for understanding microbial community assembly processes on bark surfaces, an understudied habitat and neglected component of terrestrial biodiversity. Finally, we propose a cost-effective sampling strategy to study bark-associated microbial communities across large spatial or environmental scales.