Refine
Document Type
- Article (22) (remove)
Language
- English (22) (remove)
Has Fulltext
- yes (22)
Is part of the Bibliography
- no (22)
Keywords
Institute
- Medizin (22) (remove)
Background: Understanding the location and cell-type specific binding of Transcription Factors (TFs) is important in the study of gene regulation. Computational prediction of TF binding sites is challenging, because TFs often bind only to short DNA motifs and cell-type specific co-factors may work together with the same TF to determine binding. Here, we consider the problem of learning a general model for the prediction of TF binding using DNase1-seq data and TF motif description in form of position specific energy matrices (PSEMs).
Methods: We use TF ChIP-seq data as a gold-standard for model training and evaluation. Our contribution is a novel ensemble learning approach using random forest classifiers. In the context of the ENCODE-DREAM in vivo TF binding site prediction challenge we consider different learning setups.
Results: Our results indicate that the ensemble learning approach is able to better generalize across tissues and cell-types compared to individual tissue-specific classifiers or a classifier built based upon data aggregated across tissues. Furthermore, we show that incorporating DNase1-seq peaks is essential to reduce the false positive rate of TF binding predictions compared to considering the raw DNase1 signal.
Conclusions: Analysis of important features reveals that the models preferentially select motifs of other TFs that are close interaction partners in existing protein protein-interaction networks. Code generated in the scope of this project is available on GitHub: https://github.com/SchulzLab/TFAnalysis (DOI: 10.5281/zenodo.1409697).
Background: Enhancers play a fundamental role in orchestrating cell state and development. Although several methods have been developed to identify enhancers, linking them to their target genes is still an open problem. Several theories have been proposed on the functional mechanisms of enhancers, which triggered the development of various methods to infer promoter–enhancer interactions (PEIs). The advancement of high-throughput techniques describing the three-dimensional organization of the chromatin, paved the way to pinpoint long-range PEIs. Here we investigated whether including PEIs in computational models for the prediction of gene expression improves performance and interpretability.
Results: We have extended our TEPIC framework to include DNA contacts deduced from chromatin conformation capture experiments and compared various methods to determine PEIs using predictive modelling of gene expression from chromatin accessibility data and predicted transcription factor (TF) motif data. We designed a novel machine learning approach that allows the prioritization of TFs binding to distal loop and promoter regions with respect to their importance for gene expression regulation. Our analysis revealed a set of core TFs that are part of enhancer–promoter loops involving YY1 in different cell lines.
Conclusion: We present a novel approach that can be used to prioritize TFs involved in distal and promoter-proximal regulatory events by integrating chromatin accessibility, conformation, and gene expression data. We show that the integration of chromatin conformation data can improve gene expression prediction and aids model interpretability.
Background: Understanding the location and cell-type specific binding of Transcription Factors (TFs) is important in the study of gene regulation. Computational prediction of TF binding sites is challenging, because TFs often bind only to short DNA motifs and cell-type specific co-factors may work together with the same TF to determine binding. Here, we consider the problem of learning a general model for the prediction of TF binding using DNase1-seq data and TF motif description in form of position specific energy matrices (PSEMs).
Methods: We use TF ChIP-seq data as a gold-standard for model training and evaluation. Our contribution is a novel ensemble learning approach using random forest classifiers. In the context of the ENCODE-DREAM in vivo TF binding site prediction challenge we consider different learning setups.
Results: Our results indicate that the ensemble learning approach is able to better generalize across tissues and cell-types compared to individual tissue-specific classifiers or a classifier applied to the data aggregated across tissues. Furthermore, we show that incorporating DNase1-seq peaks is essential to reduce the false positive rate of TF binding predictions compared to considering the raw DNase1 signal.
Conclusions: Analysis of important features reveals that the models preferentially select motifs of other TFs that are close interaction partners in existing protein protein-interaction networks. Code generated in the scope of this project is available on GitHub: https://github.com/SchulzLab/TFAnalysis (DOI: 10.5281/zenodo.1409697)
Specialized de novo assemblers for diverse datatypes have been developed and are in widespread use for the analyses of single-cell genomics, metagenomics and RNA-seq data. However, assembly of large sequencing datasets produced by modern technologies is challenging and computationally intensive. In-silico read normalization has been suggested as a computational strategy to reduce redundancy in read datasets, which leads to significant speedups and memory savings of assembly pipelines. Previously, we presented a set multi-cover optimization based approach, ORNA, where reads are reduced without losing important k-mer connectivity information, as used in assembly graphs. Here we propose extensions to ORNA, named ORNA-Q and ORNA-K, which consider a weighted set multi-cover optimization formulation for the in-silico read normalization problem. These novel formulations make use of the base quality scores obtained from sequencers (ORNA-Q) or k-mer abundances of reads (ORNA-K) to improve normalization further. We devise efficient heuristic algorithms for solving both formulations. In applications to human RNA-seq data, ORNA-Q and ORNA-K are shown to assemble more or equally many full length transcripts compared to other normalization methods at similar or higher read reduction values. The algorithm is implemented under the latest version of ORNA (v2.0, https://github.com/SchulzLab/ORNA).
Background: With the rise of single-cell RNA sequencing new bioinformatic tools have been developed to handle specific demands, such as quantifying unique molecular identifiers and correcting cell barcodes. Here, we benchmarked several datasets with the most common alignment tools for single-cell RNA sequencing data. We evaluated differences in the whitelisting, gene quantification, overall performance, and potential variations in clustering or detection of differentially expressed genes. We compared the tools Cell Ranger version 6, STARsolo, Kallisto, Alevin, and Alevin-fry on 3 published datasets for human and mouse, sequenced with different versions of the 10X sequencing protocol.
Results: Striking differences were observed in the overall runtime of the mappers. Besides that, Kallisto and Alevin showed variances in the number of valid cells and detected genes per cell. Kallisto reported the highest number of cells; however, we observed an overrepresentation of cells with low gene content and unknown cell type. Conversely, Alevin rarely reported such low-content cells. Further variations were detected in the set of expressed genes. While STARsolo, Cell Ranger 6, Alevin-fry, and Alevin produced similar gene sets, Kallisto detected additional genes from the Vmn and Olfr gene family, which are likely mapping artefacts. We also observed differences in the mitochondrial content of the resulting cells when comparing a prefiltered annotation set to the full annotation set that includes pseudogenes and other biotypes.
Conclusion: Overall, this study provides a detailed comparison of common single-cell RNA sequencing mappers and shows their specific properties on 10X Genomics data.
The aging process is characterized by a chronic, low‐grade inflammatory state, termed “inflammaging.” It has been suggested that macrophage activation plays a key role in the induction and maintenance of this state. In the present study, we aimed to elucidate the mechanisms responsible for aging‐associated changes in the myeloid compartment of mice. The aging phenotype, characterized by elevated cytokine production, was associated with a dysfunction of the hypothalamic–pituitary–adrenal (HPA) axis and diminished serum corticosteroid levels. In particular, the concentration of corticosterone, the major active glucocorticoid in rodents, was decreased. This could be explained by an impaired expression and activity of 11β‐hydroxysteroid dehydrogenase type 1 (11β‐HSD1), an enzyme that determines the extent of cellular glucocorticoid responses by reducing the corticosteroids cortisone/11‐dehydrocorticosterone to their active forms cortisol/corticosterone, in aged macrophages and peripheral leukocytes. These changes were accompanied by a downregulation of the glucocorticoid receptor target gene glucocorticoid‐induced leucine zipper (GILZ) in vitro and in vivo. Since GILZ plays a central role in macrophage activation, we hypothesized that the loss of GILZ contributed to the process of macroph‐aging. The phenotype of macrophages from aged mice was indeed mimicked in young GILZ knockout mice. In summary, the current study provides insight into the role of glucocorticoid metabolism and GILZ regulation during aging.
Hepatic lipid deposition and inflammation represent risk factors for hepatocellular carcinoma (HCC). The mRNA-binding protein tristetraprolin (TTP, gene name ZFP36) has been suggested as a tumor suppressor in several malignancies, but it increases insulin resistance. The aim of this study was to elucidate the role of TTP in hepatocarcinogenesis and HCC progression. Employing liver-specific TTP-knockout (lsTtp-KO) mice in the diethylnitrosamine (DEN) hepatocarcinogenesis model, we observed a significantly reduced tumor burden compared to wild-type animals. Upon short-term DEN treatment, modelling early inflammatory processes in hepatocarcinogenesis, lsTtp-KO mice exhibited a reduced monocyte/macrophage ratio as compared to wild-type mice. While short-term DEN strongly induced an abundance of saturated and poly-unsaturated hepatic fatty acids, lsTtp-KO mice did not show these changes. These findings suggested anti-carcinogenic actions of TTP deletion due to effects on inflammation and metabolism. Interestingly, though, investigating effects of TTP on different hallmarks of cancer suggested tumor-suppressing actions: TTP inhibited proliferation, attenuated migration, and slightly increased chemosensitivity. In line with a tumor-suppressing activity, we observed a reduced expression of several oncogenes in TTP-overexpressing cells. Accordingly, ZFP36 expression was downregulated in tumor tissues in three large human data sets. Taken together, this study suggests that hepatocytic TTP promotes hepatocarcinogenesis, while it shows tumor-suppressive actions during hepatic tumor progression.
Background/Aims: Hepatocellular carcinoma (HCC) represents the second most common cause of cancer-related deaths worldwide, not least due to its high chemoresistance. The long non-coding RNA nuclear paraspeckle assembly transcript 1 (NEAT1), localised in nuclear paraspeckles, has been shown to enhance chemoresistance in several cancer types. Since data on NEAT1 in HCC chemosensitivity are completely lacking and chemoresistance is linked to poor prognosis, we aimed to study NEAT1 expression in HCC chemoresistance and its link to HCC prognosis.
Methods: NEAT1 expression was determined in either sensitive, or sorafenib, or doxorubicin resistant HepG2, PLC/PRF/5, and Huh7 cells by qPCR. Paraspeckles were detected by immunostaining of paraspeckle component 1 (PSPC1) in cell culture and in a cohort of HCC patients. PSPC1 expression was correlated with clinical data. The expression of transcript variants of NEAT1 and transcripts encoding the paraspeckle-associated proteins was analysed in the TCGA liver cancer data set.
Results: NEAT1 was overexpressed in all three sorafenib and doxorubicin resistant cell lines. Paraspeckles were present in all chemoresistant cells, whereas no signal was detected in the sensitive cells. Expression of NEAT1 transcripts as well as transcripts encoding PSPC1, NONO, and RBM14 was increased in tumour tissue. Expression of PSPC1, NONO, and RBM14 transcripts was significantly associated with poor survival, whereas NEAT1 expression was not. Immunohistochemical analysis revealed that nuclear and cytoplasmic PSPC1-positivity was significantly associated with shorter overall survival of HCC patients.
Conclusion: Our data show an induction of NEAT1 in HCC chemoresistance and a high correlation of transcripts encoding paraspeckle-associated proteins with poor survival in HCC. Therefore, NEAT1, PSPC1, NONO, and RBM14 might be promising targets for novel HCC therapies, and the paraspeckle-associated proteins might be clinical markers and predictors for poor survival in HCC.
Endocannabinoids are important lipid-signaling mediators. Both protective and deleterious effects of endocannabinoids in the cardiovascular system have been reported but the mechanistic basis for these contradicting observations is unclear. We set out to identify anti-inflammatory mechanisms of endocannabinoids in the murine aorta and in human vascular smooth muscle cells (hVSMC). In response to combined stimulation with cytokines, IL-1β and TNFα, the murine aorta released several endocannabinoids, with anandamide (AEA) levels being the most significantly increased. AEA pretreatment had profound effects on cytokine-induced gene expression in hVSMC and murine aorta. As revealed by RNA-Seq analysis, the induction of a subset of 21 inflammatory target genes, including the important cytokine CCL2 was blocked by AEA. This effect was not mediated through AEA-dependent interference of the AP-1 or NF-κB pathways but rather through an epigenetic mechanism. In the presence of AEA, ATAC-Seq analysis and chromatin-immunoprecipitations revealed that CCL2 induction was blocked due to increased levels of H3K27me3 and a decrease of H3K27ac leading to compacted chromatin structure in the CCL2 promoter. These effects were mediated by recruitment of HDAC4 and the nuclear corepressor NCoR1 to the CCL2 promoter. This study therefore establishes a novel anti-inflammatory mechanism for the endogenous endocannabinoid AEA in vascular smooth muscle cells. Furthermore, this work provides a link between endogenous endocannabinoid signaling and epigenetic regulation.
KDEL receptors (KDELRs) represent transmembrane proteins of the secretory pathway which regulate the retention of soluble ER-residents as well as retrograde and anterograde vesicle trafficking. In addition, KDELRs are involved in the regulation of cellular stress response and ECM degradation. For a deeper insight into KDELR1 specific functions, we characterised a KDELR1-KO cell line (HAP1) through whole transcriptome analysis by comparing KDELR1-KO cells with its respective HAP1 wild-type. Our data indicate more than 300 significantly and differentially expressed genes whose gene products are mainly involved in developmental processes such as cell adhesion and ECM composition, pointing out to severe cellular disorders due to a loss of KDELR1. Impaired adhesion capacity of KDELR1-KO cells was further demonstrated through in vitro adhesion assays, while collagen- and/or laminin-coating nearly doubled the adhesion property of KDELR1-KO cells compared to wild-type, confirming a transcriptional adaptation to improve or restore the cellular adhesion capability. Perturbations within the secretory pathway were verified by an increased secretion of ER-resident PDI and decreased cell viability under ER stress conditions, suggesting KDELR1-KO cells to be severely impaired in maintaining cellular homeostasis.