Refine
Language
- English (37)
Has Fulltext
- yes (37)
Is part of the Bibliography
- no (37)
Keywords
- Chromatin accessibility (3)
- DNase1-seq (3)
- ENCODE-DREAM in vivo Transcription Factor binding site prediction challenge (2)
- Ensemble learning (2)
- Epigenetics (2)
- Gene regulation (2)
- Indirect-binding (2)
- TF-complexes (2)
- Transcription Factors (2)
- Actinomycin D (1)
- Anandamide (1)
- Atrial fibrillation classification (1)
- Automated sRNA analysis (1)
- BCL2 (1)
- Bidirectional genes (1)
- CCL2 (1)
- Chemosensitivity (1)
- Chemotherapy (1)
- Chromatin conformation (1)
- Comparative analysis (1)
- Computational sRNA analysis (1)
- Convolutional Neural Networks (1)
- Data mining (1)
- Data processing (1)
- Endocrinology (1)
- Energy-efficiency (1)
- Eukaryotic sRNA (1)
- Gene expression (1)
- Gene expression prediction (1)
- HDAC4 (1)
- HepG2 (1)
- HiC (1)
- HiChIP (1)
- Huh7 (1)
- Inflammation (1)
- Liver cancer (1)
- MALAT1 (1)
- MYC (1)
- Machine Learning (1)
- Machine learning (1)
- Multi-Omics (1)
- Multimodal Modeling (1)
- NASH (1)
- NCoR1 (1)
- NEAT1 (1)
- NEAT2 (1)
- Personalized Medicine (1)
- Predictive Modeling (1)
- Sequencing (1)
- Single-cell RNA-seq (1)
- Small RNA analysis (1)
- TSC22D3 (1)
- Therapy response (1)
- VEGFA (1)
- aligners (1)
- benchmarking (1)
- cancer (1)
- cancer-associated fibroblasts (1)
- cellular immunology (1)
- chemoresistance (1)
- cytokines (1)
- flow cytometry (1)
- gene signature (1)
- inflammation (1)
- liver cancer (1)
- lncRNA (1)
- mRNA decay (1)
- mRNA stability (1)
- mammary carcinoma (1)
- mappers (1)
- mapping-algorithms (1)
- mononuclear cell (1)
- mouse models (1)
- reactive oxygen species (1)
- sRNA (1)
- sRNA tool (1)
- siRNA analysis (1)
- siRNA quantification (1)
- single-cell RNA sequencing (1)
- steroid control of aging (1)
- transcriptional profiling (1)
- transcriptomics (1)
Hepatic lipid deposition and inflammation represent risk factors for hepatocellular carcinoma (HCC). The mRNA-binding protein tristetraprolin (TTP, gene name ZFP36) has been suggested as a tumor suppressor in several malignancies, but it increases insulin resistance. The aim of this study was to elucidate the role of TTP in hepatocarcinogenesis and HCC progression. Employing liver-specific TTP-knockout (lsTtp-KO) mice in the diethylnitrosamine (DEN) hepatocarcinogenesis model, we observed a significantly reduced tumor burden compared to wild-type animals. Upon short-term DEN treatment, modelling early inflammatory processes in hepatocarcinogenesis, lsTtp-KO mice exhibited a reduced monocyte/macrophage ratio as compared to wild-type mice. While short-term DEN strongly induced an abundance of saturated and poly-unsaturated hepatic fatty acids, lsTtp-KO mice did not show these changes. These findings suggested anti-carcinogenic actions of TTP deletion due to effects on inflammation and metabolism. Interestingly, though, investigating effects of TTP on different hallmarks of cancer suggested tumor-suppressing actions: TTP inhibited proliferation, attenuated migration, and slightly increased chemosensitivity. In line with a tumor-suppressing activity, we observed a reduced expression of several oncogenes in TTP-overexpressing cells. Accordingly, ZFP36 expression was downregulated in tumor tissues in three large human data sets. Taken together, this study suggests that hepatocytic TTP promotes hepatocarcinogenesis, while it shows tumor-suppressive actions during hepatic tumor progression.
Background/Aims: Hepatocellular carcinoma (HCC) represents the second most common cause of cancer-related deaths worldwide, not least due to its high chemoresistance. The long non-coding RNA nuclear paraspeckle assembly transcript 1 (NEAT1), localised in nuclear paraspeckles, has been shown to enhance chemoresistance in several cancer types. Since data on NEAT1 in HCC chemosensitivity are completely lacking and chemoresistance is linked to poor prognosis, we aimed to study NEAT1 expression in HCC chemoresistance and its link to HCC prognosis.
Methods: NEAT1 expression was determined in either sensitive, or sorafenib, or doxorubicin resistant HepG2, PLC/PRF/5, and Huh7 cells by qPCR. Paraspeckles were detected by immunostaining of paraspeckle component 1 (PSPC1) in cell culture and in a cohort of HCC patients. PSPC1 expression was correlated with clinical data. The expression of transcript variants of NEAT1 and transcripts encoding the paraspeckle-associated proteins was analysed in the TCGA liver cancer data set.
Results: NEAT1 was overexpressed in all three sorafenib and doxorubicin resistant cell lines. Paraspeckles were present in all chemoresistant cells, whereas no signal was detected in the sensitive cells. Expression of NEAT1 transcripts as well as transcripts encoding PSPC1, NONO, and RBM14 was increased in tumour tissue. Expression of PSPC1, NONO, and RBM14 transcripts was significantly associated with poor survival, whereas NEAT1 expression was not. Immunohistochemical analysis revealed that nuclear and cytoplasmic PSPC1-positivity was significantly associated with shorter overall survival of HCC patients.
Conclusion: Our data show an induction of NEAT1 in HCC chemoresistance and a high correlation of transcripts encoding paraspeckle-associated proteins with poor survival in HCC. Therefore, NEAT1, PSPC1, NONO, and RBM14 might be promising targets for novel HCC therapies, and the paraspeckle-associated proteins might be clinical markers and predictors for poor survival in HCC.
Endocannabinoids are important lipid-signaling mediators. Both protective and deleterious effects of endocannabinoids in the cardiovascular system have been reported but the mechanistic basis for these contradicting observations is unclear. We set out to identify anti-inflammatory mechanisms of endocannabinoids in the murine aorta and in human vascular smooth muscle cells (hVSMC). In response to combined stimulation with cytokines, IL-1β and TNFα, the murine aorta released several endocannabinoids, with anandamide (AEA) levels being the most significantly increased. AEA pretreatment had profound effects on cytokine-induced gene expression in hVSMC and murine aorta. As revealed by RNA-Seq analysis, the induction of a subset of 21 inflammatory target genes, including the important cytokine CCL2 was blocked by AEA. This effect was not mediated through AEA-dependent interference of the AP-1 or NF-κB pathways but rather through an epigenetic mechanism. In the presence of AEA, ATAC-Seq analysis and chromatin-immunoprecipitations revealed that CCL2 induction was blocked due to increased levels of H3K27me3 and a decrease of H3K27ac leading to compacted chromatin structure in the CCL2 promoter. These effects were mediated by recruitment of HDAC4 and the nuclear corepressor NCoR1 to the CCL2 promoter. This study therefore establishes a novel anti-inflammatory mechanism for the endogenous endocannabinoid AEA in vascular smooth muscle cells. Furthermore, this work provides a link between endogenous endocannabinoid signaling and epigenetic regulation.
Mechanisms by which specific histone modifications regulate distinct gene regulatory networks remain little understood. We investigated how H3K79me2, a modification catalyzed by DOT1L and previously considered a general transcriptional activation mark, regulates gene expression in mammalian cardiogenesis. Early embryonic cardiomyocyte ablation of Dot1l revealed that H3K79me2 does not act as a general transcriptional activator, but rather regulates highly specific gene regulatory networks at two critical cardiogenic junctures: left ventricle patterning and postnatal cardiomyocyte cell cycle withdrawal. Mechanistic analyses revealed that H3K79me2 in two distinct domains, gene bodies and regulatory elements, synergized to promote expression of genes activated by DOT1L. Surprisingly, these analyses also revealed that H3K79me2 in specific regulatory elements contributed to silencing genes usually not expressed in cardiomyocytes. As DOT1L mutants had increased numbers of postnatal mononuclear cardiomyocytes and prolonged cardiomyocyte cell cycle activity, controlled inhibition of DOT1L might be a strategy to promote cardiac regeneration post-injury.
Non-coding variations located within regulatory elements may alter gene expression by modifying Transcription Factor (TF) binding sites and thereby lead to functional consequences like various traits or diseases. To understand these molecular mechanisms, different TF models are being used to assess the effect of DNA sequence variations, such as Single Nucleotide Polymorphisms (SNPs). However, few statistical approaches exist to compute statistical significance of results but they often are slow for large sets of SNPs, such as data obtained from a genome-wide association study (GWAS) or allele-specific analysis of chromatin data.
Results We investigate the distribution of maximal differential TF binding scores for general computational models that assess TF binding. We find that a modified Laplace distribution can adequately approximate the empirical distributions. A benchmark on in vitro and in vivo data sets showed that our new approach improves on an existing method in terms of performance and speed. In applications on large sets of eQTL and GWAS SNPs we could illustrate the usefulness of the novel statistic to highlight cell type specific regulators and TF target genes.
Conclusions Our approach allows the evaluation of DNA changes that induce differential TF binding in a fast and accurate manner, permitting computations on large mutation data sets. An implementation of the novel approach is freely available at https://github.com/SchulzLab/SNEEP.
Motivation DNA CpG methylation (CpGm) has proven to be a crucial epigenetic factor in the gene regulatory system. Assessment of DNA CpG methylation values via whole-genome bisulfite sequencing (WGBS) is, however, computationally extremely demanding.
Results We present FAst MEthylation calling (FAME), the first approach to quantify CpGm values directly from bulk or single-cell WGBS reads without intermediate output files. FAME is very fast but as accurate as standard methods, which first produce BS alignment files before computing CpGm values. We present experiments on bulk and single-cell bisulfite datasets in which we show that data analysis can be significantly sped-up and help addressing the current WGBS analysis bottleneck for large-scale datasets without compromising accuracy.
Availability An implementation of FAME is open source and licensed under GPL-3.0 at https://github.com/FischerJo/FAME.
Improved integration of single cell transcriptome data demonstrated on heart failure in mice and men
(2023)
Biomedical research frequently uses murine models to study disease mechanisms. However, the translation of these findings to human disease remains a significant challenge. In order to improve the comparability of mouse and human data, we present a cross-species integration pipeline for single-cell transcriptomic assays.
The pipeline merges expression matrices and assigns clear orthologous relationships. Starting from Ensembl ortholog assignments, we allocated 82% of mouse genes to unique orthologs by using additional publicly available resources such as Uniprot, and NCBI databases. For genes with multiple matches, we employed the Needleman-Wunsch global alignment based on either amino acid or nucleotide sequence to identify the ortholog with the highest degree of similarity.
The workflow was tested for its functionality and efficiency by integrating scRNA-seq datasets from heart failure patients with the corresponding mouse model. We were able to assign unique human orthologs to up to 80% of the mouse genes, utilizing the known 17,492 orthologous pairs. Curiously, the integration process enabled the identification of both common and unique regulatory pathways between species in heart failure.
In conclusion, our pipeline streamlines the integration process, enhances gene nomenclature alignment and simplifies the translation of mouse models to human disease. We have made the OrthoIntegrate R-package accessible on GitHub (https://github.com/MarianoRuzJurado/OrthoIntegrate), which includes the assignment of ortholog definitions for human and mouse, as well as the pipeline for integrating single cells.
Background Enhancers play a fundamental role in orchestrating cell state and development. Although several methods have been developed to identify enhancers, linking them to their target genes is still an open problem. Several theories have been proposed on the functional mechanisms of enhancers, which triggered the development of various methods to infer promoter enhancer interactions (PEIs). The advancement of high-throughput techniques describing the three-dimensional organisation of the chromatin, paved the way to pinpoint long-range PEIs. Here we investigated whether including PEIs in computational models for the prediction of gene expression improves performance and interpretability.
Results We have extended our Tepic framework to include DNA contacts deduced from chromatin conformation capture experiments and compared various methods to determine PEIs using predictive modelling of gene expression from chromatin accessibility data and predicted transcription factor (TF) motif data. We found that including long-range PEIs deduced from both HiC and HiChIP data indeed improves model performance. We designed a novel machine learning approach that allows to prioritize TFs in distal loop and promoter regions with respect to their importance for gene expression regulation. Our analysis revealed a set of core TFs that are part of enhancer-promoter loops involving YY1 in different cell lines.
Conclusion: We show that the integration of chromatin conformation data improves gene expression prediction, underlining the importance of enhancer looping for gene expression regulation. Our general approach can be used to prioritize TFs that are involved in distal and promoter-proximal regulation using accessibility, conformation and expression data.
Summary: Understanding the role of short-interfering RNA (siRNA) in diverse biological processes is of current interest and often approached through small RNA sequencing. However, analysis of these datasets is difficult due to the complexity of biological RNA processing pathways, which differ between species. Several properties like strand specificity, length distribution, and distribution of soft-clipped bases are few parameters known to guide researchers in understanding the role of siRNAs. We present RAPID, a generic eukaryotic siRNA analysis pipeline, which captures information inherent in the datasets and automatically produces numerous visualizations as user-friendly HTML reports, covering multiple categories required for siRNA analysis. RAPID also facilitates an automated comparison of multiple datasets, with one of the normalization techniques dedicated for siRNA knockdown analysis, and integrates differential expression analysis using DESeq2. RAPID is available under MIT license at https://github.com/SchulzLab/RAPID. We recommend using it as a conda environment available from https://anaconda.org/bioconda/rapid.
The unicellular ciliate Paramecium contains a large vegetative macronucleus with several unusual characteristics including an extremely high coding density and high polyploidy. As macronculear chromatin is devoid of heterochromatin our study characterizes the functional epigenomic organisation necessary for gene regulation and proper PolII activity. Histone marks (H3K4me3, H3K9ac, H3K27me3) revealed no narrow peaks but broad domains along gene bodies, whereas intergenic regions were devoid of nucleosomes. Our data implicates H3K4me3 levels inside ORFs to be the main factor to associate with gene expression and H3K27me3 appears to occur as a bistable domain with H3K4me3 in plastic genes. Surprisingly, silent and lowly expressed genes show low nucleosome occupancy suggesting that gene inactivation does not involve increased nucleosome occupancy and chromatin condensation. Due to a high occupancy of Pol II along highly expressed ORFs, transcriptional elongation appears to be quite different to other species. This is supported by missing heptameric repeats in the C-terminal domain of Pol II and a divergent elongation system. Our data implies that unoccupied DNA is the default state, whereas gene activation requires nucleosome recruitment together with broad domains of H3K4me3. This could represent a buffer for paused Pol II along ORFs in absence of elongation factors of higher eukaryotes.
Electrocardiograms (ECG) record the heart activity and are the most common and reliable method to detect cardiac arrhythmias, such as atrial fibrillation (AFib). Lately, many commercially available devices such as smartwatches are offering ECG monitoring. Therefore, there is increasing demand for designing deep learning models with the perspective to be physically implemented on these small portable devices with limited energy supply. In this paper, a workflow for the design of small, energy-efficient recurrent convolutional neural network (RCNN) architecture for AFib detection is proposed. However, the approach can be well generalized to every type of long time series. In contrast to previous studies, that demand thousands of additional network neurons and millions of extra model parameters, the logical steps for the generation of a CNN with only 114 trainable parameters are described. The model consists of a small segmented CNN in combination with an optimal energy classifier. The architectural decisions are made by using the energy consumption as a metric in an equally important way as the accuracy. The optimisation steps are focused on the software which can be embedded afterwards on a physical chip. Finally, a comparison with some previous relevant studies suggests that the widely used huge CNNs for similar tasks are mostly redundant and unessentially computationally expensive.
KDEL receptors (KDELRs) represent transmembrane proteins of the secretory pathway which regulate the retention of soluble ER-residents as well as retrograde and anterograde vesicle trafficking. In addition, KDELRs are involved in the regulation of cellular stress response and ECM degradation. For a deeper insight into KDELR1 specific functions, we characterised a KDELR1-KO cell line (HAP1) through whole transcriptome analysis by comparing KDELR1-KO cells with its respective HAP1 wild-type. Our data indicate more than 300 significantly and differentially expressed genes whose gene products are mainly involved in developmental processes such as cell adhesion and ECM composition, pointing out to severe cellular disorders due to a loss of KDELR1. Impaired adhesion capacity of KDELR1-KO cells was further demonstrated through in vitro adhesion assays, while collagen- and/or laminin-coating nearly doubled the adhesion property of KDELR1-KO cells compared to wild-type, confirming a transcriptional adaptation to improve or restore the cellular adhesion capability. Perturbations within the secretory pathway were verified by an increased secretion of ER-resident PDI and decreased cell viability under ER stress conditions, suggesting KDELR1-KO cells to be severely impaired in maintaining cellular homeostasis.
Background: Bidirectional promoters (BPs) are prevalent in eukaryotic genomes. However, it is poorly understood how the cell integrates different epigenomic information, such as transcription factor (TF) binding and chromatin marks, to drive gene expression at BPs. Single-cell sequencing technologies are revolutionizing the field of genome biology. Therefore, this study focuses on the integration of single-cell RNA-seq data with bulk ChIP-seq and other epigenetics data, for which single-cell technologies are not yet established, in the context of BPs.
Results: We performed integrative analyses of novel human single-cell RNA-seq (scRNA-seq) data with bulk ChIP-seq and other epigenetics data. scRNA-seq data revealed distinct transcription states of BPs that were previously not recognized. We find associations between these transcription states to distinct patterns in structural gene features, DNA accessibility, histone modification, DNA methylation and TF binding profiles.
Conclusions: Our results suggest that a complex interplay of all of these elements is required to achieve BP-specific transcriptional output in this specialized promoter configuration. Further, our study implies that novel statistical methods can be developed to deconvolute masked subpopulations of cells measured with different bulk epigenomic assays using scRNA-seq data.
An ontology-based method for assessing batch effect adjustment approaches in heterogeneous datasets
(2018)
Motivation: International consortia such as the Genotype-Tissue Expression (GTEx) project, The Cancer Genome Atlas (TCGA) or the International Human Epigenetics Consortium (IHEC) have produced a wealth of genomic datasets with the goal of advancing our understanding of cell differentiation and disease mechanisms. However, utilizing all of these data effectively through integrative analysis is hampered by batch effects, large cell type heterogeneity and low replicate numbers. To study if batch effects across datasets can be observed and adjusted for, we analyze RNA-seq data of 215 samples from ENCODE, Roadmap, BLUEPRINT and DEEP as well as 1336 samples from GTEx and TCGA. While batch effects are a considerable issue, it is non-trivial to determine if batch adjustment leads to an improvement in data quality, especially in cases of low replicate numbers.
Results: We present a novel method for assessing the performance of batch effect adjustment methods on heterogeneous data. Our method borrows information from the Cell Ontology to establish if batch adjustment leads to a better agreement between observed pairwise similarity and similarity of cell types inferred from the ontology. A comparison of state-of-the art batch effect adjustment methods suggests that batch effects in heterogeneous datasets with low replicate numbers cannot be adequately adjusted. Better methods need to be developed, which can be assessed objectively in the framework presented here.
Cancer-associated fibroblasts (CAFs) in the tumor microenvironment contribute to all stages of tumorigenesis and are usually considered to be tumor-promoting cells. CAFs show a remarkable degree of heterogeneity, which is attributed to developmental origin or to local environmental niches, resulting in distinct CAF subsets within individual tumors. While CAF heterogeneity is frequently investigated in late-stage tumors, data on longitudinal CAF development in tumors are lacking. To this end, we used the transgenic polyoma middle T oncogene-induced mouse mammary carcinoma model and performed whole transcriptome analysis in FACS-sorted fibroblasts from early- and late-stage tumors. We observed a shift in fibroblast populations over time towards a subset previously shown to negatively correlate with patient survival, which was confirmed by multispectral immunofluorescence analysis. Moreover, we identified a transcriptomic signature distinguishing CAFs from early- and late-stage tumors. Importantly, the signature of early-stage CAFs correlated well with tumor stage and survival in human mammary carcinoma patients. A random forest analysis suggested predictive value of the complete set of differentially expressed genes between early- and late-stage CAFs on bulk tumor patient samples, supporting the clinical relevance of our findings. In conclusion, our data show transcriptome alterations in CAFs during tumorigenesis in the mammary gland, which suggest that CAFs are educated by the tumor over time to promote tumor development. Moreover, we show that murine CAF gene signatures can harbor predictive value for human cancer.
Several studies suggested that transcription factor (TF) binding to DNA may be impaired or enhanced by DNA methylation. We present MeDeMo, a toolbox for TF motif analysis that combines information about DNA methylation with models capturing intra-motif dependencies. In a large-scale study using ChIP-seq data for 335 TFs, we identify novel TFs that are affected by DNA methylation. Overall, we find that CpG methylation decreases the likelihood of binding for the majority of TFs. For a considerable subset of TFs, we show that intra-motif dependencies are pivotal for accurately modelling the impact of DNA methylation on TF binding.
Long non-coding RNAs (lncRNAs) can act as regulatory RNAs which, by altering the expression of target genes, impact on the cellular phenotype and cardiovascular disease development. Endothelial lncRNAs and their vascular functions are largely undefined. Deep RNA-Seq and FANTOM5 CAGE analysis revealed the lncRNA LINC00607 to be highly enriched in human endothelial cells. LINC00607 was induced in response to hypoxia, arteriosclerosis regression in non-human primates and also in response to propranolol used to induce regression of human arteriovenous malformations. siRNA knockdown or CRISPR/Cas9 knockout of LINC00607 attenuated VEGF-A-induced angiogenic sprouting. LINC00607 knockout in endothelial cells also integrated less into newly formed vascular networks in an in vivo assay in SCID mice. Overexpression of LINC00607 in CRISPR knockout cells restored normal endothelial function. RNA- and ATAC-Seq after LINC00607 knockout revealed changes in the transcription of endothelial gene sets linked to the endothelial phenotype and in chromatin accessibility around ERG-binding sites. Mechanistically, LINC00607 interacted with the SWI/SNF chromatin remodeling protein BRG1. CRISPR/Cas9-mediated knockout of BRG1 in HUVEC followed by CUT&RUN revealed that BRG1 is required to secure a stable chromatin state, mainly on ERG-binding sites. In conclusion, LINC00607 is an endothelial-enriched lncRNA that maintains ERG target gene transcription by interacting with the chromatin remodeler BRG1.
Understanding the role of short-interfering RNA (siRNA) in diverse biological processes is of current interest and often approached through small RNA sequencing. However, analysis of these datasets is difficult due to the complexity of biological RNA processing pathways, which differ between species. Several properties like strand specificity, length distribution, and distribution of soft-clipped bases are few parameters known to guide researchers in understanding the role of siRNAs. We present RAPID, a generic eukaryotic siRNA analysis pipeline, which captures information inherent in the datasets and automatically produces numerous visualizations as user-friendly HTML reports, covering multiple categories required for siRNA analysis. RAPID also facilitates an automated comparison of multiple datasets, with one of the normalization techniques dedicated for siRNA knockdown analysis, and integrates differential expression analysis using DESeq2.
Endothelial cells play a critical role in the adaptation of tissues to injury. Tissue ischemia induced by infarction leads to profound changes in endothelial cell functions and can induce transition to a mesenchymal state. Here we explore the kinetics and individual cellular responses of endothelial cells after myocardial infarction by using single cell RNA sequencing. This study demonstrates a time dependent switch in endothelial cell proliferation and inflammation associated with transient changes in metabolic gene signatures. Trajectory analysis reveals that the majority of endothelial cells 3 to 7 days after myocardial infarction acquire a transient state, characterized by mesenchymal gene expression, which returns to baseline 14 days after injury. Lineage tracing, using the Cdh5-CreERT2;mT/mG mice followed by single cell RNA sequencing, confirms the transient mesenchymal transition and reveals additional hypoxic and inflammatory signatures of endothelial cells during early and late states after injury. These data suggest that endothelial cells undergo a transient mes-enchymal activation concomitant with a metabolic adaptation within the first days after myocardial infarction but do not acquire a long-term mesenchymal fate. This mesenchymal activation may facilitate endothelial cell migration and clonal expansion to regenerate the vascular network.
The unicellular ciliate Paramecium contains a large vegetative macronucleus with several unusual characteristics, including an extremely high coding density and high polyploidy. As macronculear chromatin is devoid of heterochromatin, our study characterizes the functional epigenomic organization necessary for gene regulation and proper Pol II activity. Histone marks (H3K4me3, H3K9ac, H3K27me3) reveal no narrow peaks but broad domains along gene bodies, whereas intergenic regions are devoid of nucleosomes. Our data implicate H3K4me3 levels inside ORFs to be the main factor associated with gene expression, and H3K27me3 appears in association with H3K4me3 in plastic genes. Silent and lowly expressed genes show low nucleosome occupancy, suggesting that gene inactivation does not involve increased nucleosome occupancy and chromatin condensation. Because of a high occupancy of Pol II along highly expressed ORFs, transcriptional elongation appears to be quite different from that of other species. This is supported by missing heptameric repeats in the C-terminal domain of Pol II and a divergent elongation system. Our data imply that unoccupied DNA is the default state, whereas gene activation requires nucleosome recruitment together with broad domains of H3K4me3. In summary, gene activation and silencing in Paramecium run counter to the current understanding of chromatin biology.