Endocannabinoids are important lipid-signaling mediators. Both protective and deleterious effects of endocannabinoids in the cardiovascular system have been reported but the mechanistic basis for these contradicting observations is unclear. We set out to identify anti-inflammatory mechanisms of endocannabinoids in the murine aorta and in human vascular smooth muscle cells (hVSMC). In response to combined stimulation with cytokines, IL-1β and TNFα, the murine aorta released several endocannabinoids, with anandamide (AEA) levels being the most significantly increased. AEA pretreatment had profound effects on cytokine-induced gene expression in hVSMC and murine aorta. As revealed by RNA-Seq analysis, the induction of a subset of 21 inflammatory target genes, including the important cytokine CCL2 was blocked by AEA. This effect was not mediated through AEA-dependent interference of the AP-1 or NF-κB pathways but rather through an epigenetic mechanism. In the presence of AEA, ATAC-Seq analysis and chromatin-immunoprecipitations revealed that CCL2 induction was blocked due to increased levels of H3K27me3 and a decrease of H3K27ac leading to compacted chromatin structure in the CCL2 promoter. These effects were mediated by recruitment of HDAC4 and the nuclear corepressor NCoR1 to the CCL2 promoter. This study therefore establishes a novel anti-inflammatory mechanism for the endogenous endocannabinoid AEA in vascular smooth muscle cells. Furthermore, this work provides a link between endogenous endocannabinoid signaling and epigenetic regulation.
KDEL receptors (KDELRs) represent transmembrane proteins of the secretory pathway which regulate the retention of soluble ER-residents as well as retrograde and anterograde vesicle trafficking. In addition, KDELRs are involved in the regulation of cellular stress response and ECM degradation. For a deeper insight into KDELR1 specific functions, we characterised a KDELR1-KO cell line (HAP1) through whole transcriptome analysis by comparing KDELR1-KO cells with its respective HAP1 wild-type. Our data indicate more than 300 significantly and differentially expressed genes whose gene products are mainly involved in developmental processes such as cell adhesion and ECM composition, pointing out to severe cellular disorders due to a loss of KDELR1. Impaired adhesion capacity of KDELR1-KO cells was further demonstrated through in vitro adhesion assays, while collagen- and/or laminin-coating nearly doubled the adhesion property of KDELR1-KO cells compared to wild-type, confirming a transcriptional adaptation to improve or restore the cellular adhesion capability. Perturbations within the secretory pathway were verified by an increased secretion of ER-resident PDI and decreased cell viability under ER stress conditions, suggesting KDELR1-KO cells to be severely impaired in maintaining cellular homeostasis.
Background: Bidirectional promoters (BPs) are prevalent in eukaryotic genomes. However, it is poorly understood how the cell integrates different epigenomic information, such as transcription factor (TF) binding and chromatin marks, to drive gene expression at BPs. Single-cell sequencing technologies are revolutionizing the field of genome biology. Therefore, this study focuses on the integration of single-cell RNA-seq data with bulk ChIP-seq and other epigenetics data, for which single-cell technologies are not yet established, in the context of BPs.
Results: We performed integrative analyses of novel human single-cell RNA-seq (scRNA-seq) data with bulk ChIP-seq and other epigenetics data. scRNA-seq data revealed distinct transcription states of BPs that were previously not recognized. We find associations between these transcription states to distinct patterns in structural gene features, DNA accessibility, histone modification, DNA methylation and TF binding profiles.
Conclusions: Our results suggest that a complex interplay of all of these elements is required to achieve BP-specific transcriptional output in this specialized promoter configuration. Further, our study implies that novel statistical methods can be developed to deconvolute masked subpopulations of cells measured with different bulk epigenomic assays using scRNA-seq data.
An ontology-based method for assessing batch effect adjustment approaches in heterogeneous datasets
(2018)
Motivation: International consortia such as the Genotype-Tissue Expression (GTEx) project, The Cancer Genome Atlas (TCGA) or the International Human Epigenetics Consortium (IHEC) have produced a wealth of genomic datasets with the goal of advancing our understanding of cell differentiation and disease mechanisms. However, utilizing all of these data effectively through integrative analysis is hampered by batch effects, large cell type heterogeneity and low replicate numbers. To study if batch effects across datasets can be observed and adjusted for, we analyze RNA-seq data of 215 samples from ENCODE, Roadmap, BLUEPRINT and DEEP as well as 1336 samples from GTEx and TCGA. While batch effects are a considerable issue, it is non-trivial to determine if batch adjustment leads to an improvement in data quality, especially in cases of low replicate numbers.
Results: We present a novel method for assessing the performance of batch effect adjustment methods on heterogeneous data. Our method borrows information from the Cell Ontology to establish if batch adjustment leads to a better agreement between observed pairwise similarity and similarity of cell types inferred from the ontology. A comparison of state-of-the art batch effect adjustment methods suggests that batch effects in heterogeneous datasets with low replicate numbers cannot be adequately adjusted. Better methods need to be developed, which can be assessed objectively in the framework presented here.
Cancer-associated fibroblasts (CAFs) in the tumor microenvironment contribute to all stages of tumorigenesis and are usually considered to be tumor-promoting cells. CAFs show a remarkable degree of heterogeneity, which is attributed to developmental origin or to local environmental niches, resulting in distinct CAF subsets within individual tumors. While CAF heterogeneity is frequently investigated in late-stage tumors, data on longitudinal CAF development in tumors are lacking. To this end, we used the transgenic polyoma middle T oncogene-induced mouse mammary carcinoma model and performed whole transcriptome analysis in FACS-sorted fibroblasts from early- and late-stage tumors. We observed a shift in fibroblast populations over time towards a subset previously shown to negatively correlate with patient survival, which was confirmed by multispectral immunofluorescence analysis. Moreover, we identified a transcriptomic signature distinguishing CAFs from early- and late-stage tumors. Importantly, the signature of early-stage CAFs correlated well with tumor stage and survival in human mammary carcinoma patients. A random forest analysis suggested predictive value of the complete set of differentially expressed genes between early- and late-stage CAFs on bulk tumor patient samples, supporting the clinical relevance of our findings. In conclusion, our data show transcriptome alterations in CAFs during tumorigenesis in the mammary gland, which suggest that CAFs are educated by the tumor over time to promote tumor development. Moreover, we show that murine CAF gene signatures can harbor predictive value for human cancer.
Electrocardiograms (ECG) record the heart activity and are the most common and reliable method to detect cardiac arrhythmias, such as atrial fibrillation (AFib). Lately, many commercially available devices such as smartwatches are offering ECG monitoring. Therefore, there is increasing demand for designing deep learning models with the perspective to be physically implemented on these small portable devices with limited energy supply. In this paper, a workflow for the design of small, energy-efficient recurrent convolutional neural network (RCNN) architecture for AFib detection is proposed. However, the approach can be well generalized to every type of long time series. In contrast to previous studies, that demand thousands of additional network neurons and millions of extra model parameters, the logical steps for the generation of a CNN with only 114 trainable parameters are described. The model consists of a small segmented CNN in combination with an optimal energy classifier. The architectural decisions are made by using the energy consumption as a metric in an equally important way as the accuracy. The optimization steps are focused on the software which can be embedded afterwards on a physical chip. Finally, a comparison with some previous relevant studies suggests that the widely used huge CNNs for similar tasks are mostly redundant and unessentially computationally expensive.
For medicine to fulfill its promise of personalized treatments based on a better understanding of disease biology, computational and statistical tools must exist to analyze the increasing amount of patient data that becomes available. A particular challenge is that several types of data are being measured to cope with the complexity of the underlying systems, enhance predictive modeling and enrich molecular understanding.
Here we review a number of recent approaches that specialize in the analysis of multimodal data in the context of predictive biomedicine. We focus on methods that combine different OMIC measurements with image or genome variation data. Our overview shows the diversity of methods that address analysis challenges and reveals new avenues for novel developments.
Highlights
• Single nucleotide variants (SNVs) may affect transcription factor (TF) binding
• Fast statistical approach to assess significance of differential TF binding for SNVs
• Validate new approach on in vitro and in vivo TF binding assays
• Applications on GWAS SNVs and large eQTL studies illustrate utility
Summary
Non-coding variants located within regulatory elements may alter gene expression by modifying transcription factor (TF) binding sites, thereby leading to functional consequences. Different TF models are being used to assess the effect of DNA sequence variants, such as single nucleotide variants (SNVs). Often existing methods are slow and do not assess statistical significance of results. We investigated the distribution of absolute maximal differential TF binding scores for general computational models that affect TF binding. We find that a modified Laplace distribution can adequately approximate the empirical distributions. A benchmark on in vitro and in vivo datasets showed that our approach improves upon an existing method in terms of performance and speed. Applications on eQTLs and on a genome-wide association study illustrate the usefulness of our statistics by highlighting cell type-specific regulators and target genes. An implementation of our approach is freely available on GitHub and as bioconda package.
The transcription factor vitamin D receptor (VDR) is the high affinity nuclear target of the biologically active form of vitamin D3 (1,25(OH)2D3). In order to identify pure genomic transcriptional effects of 1,25(OH)2D3, we used VDR cistrome, transcriptome and open chromatin data, obtained from the human monocytic cell line THP-1, for a novel hierarchical analysis applying three bioinformatics approaches. We predicted 75.6% of all early 1,25(OH)2D3-responding (2.5 or 4 h) and 57.4% of the late differentially expressed genes (24 h) to be primary VDR target genes. VDR knockout led to a complete loss of 1,25(OH)2D3–induced genome-wide gene regulation. Thus, there was no indication of any VDR-independent non-genomic actions of 1,25(OH)2D3 modulating its transcriptional response. Among the predicted primary VDR target genes, 47 were coding for transcription factors and thus may mediate secondary 1,25(OH)2D3 responses. CEBPA and ETS1 ChIP-seq data and RNA-seq following CEBPA knockdown were used to validate the predicted regulation of secondary vitamin D target genes by both transcription factors. In conclusion, a directional network containing 47 partly novel primary VDR target transcription factors describes secondary responses in a highly complex vitamin D signaling cascade. The central transcription factor VDR is indispensable for all transcriptome-wide effects of the nuclear hormone.
Endothelial cells play a critical role in the adaptation of tissues to injury. Tissue ischemia induced by infarction leads to profound changes in endothelial cell functions and can induce transition to a mesenchymal state. Here we explore the kinetics and individual cellular responses of endothelial cells after myocardial infarction by using single cell RNA sequencing. This study demonstrates a time dependent switch in endothelial cell proliferation and inflammation associated with transient changes in metabolic gene signatures. Trajectory analysis reveals that the majority of endothelial cells 3 to 7 days after myocardial infarction acquire a transient state, characterized by mesenchymal gene expression, which returns to baseline 14 days after injury. Lineage tracing, using the Cdh5-CreERT2;mT/mG mice followed by single cell RNA sequencing, confirms the transient mesenchymal transition and reveals additional hypoxic and inflammatory signatures of endothelial cells during early and late states after injury. These data suggest that endothelial cells undergo a transient mes-enchymal activation concomitant with a metabolic adaptation within the first days after myocardial infarction but do not acquire a long-term mesenchymal fate. This mesenchymal activation may facilitate endothelial cell migration and clonal expansion to regenerate the vascular network.