• search hit 7 of 7
Back to Result List

A statistical approach to identify regulatory DNA variations

  • Non-coding variations located within regulatory elements may alter gene expression by modifying Transcription Factor (TF) binding sites and thereby lead to functional consequences like various traits or diseases. To understand these molecular mechanisms, different TF models are being used to assess the effect of DNA sequence variations, such as Single Nucleotide Polymorphisms (SNPs). However, few statistical approaches exist to compute statistical significance of results but they often are slow for large sets of SNPs, such as data obtained from a genome-wide association study (GWAS) or allele-specific analysis of chromatin data. Results We investigate the distribution of maximal differential TF binding scores for general computational models that assess TF binding. We find that a modified Laplace distribution can adequately approximate the empirical distributions. A benchmark on in vitro and in vivo data sets showed that our new approach improves on an existing method in terms of performance and speed. In applications on large sets of eQTL and GWAS SNPs we could illustrate the usefulness of the novel statistic to highlight cell type specific regulators and TF target genes. Conclusions Our approach allows the evaluation of DNA changes that induce differential TF binding in a fast and accurate manner, permitting computations on large mutation data sets. An implementation of the novel approach is freely available at https://github.com/SchulzLab/SNEEP.

Download full text files

Export metadata

Additional Services

Share in Twitter Search Google Scholar
Metadaten
Author:Nina BaumgartenORCiDGND, Laura RumpfORCiD, Thorsten KeßlerORCiDGND, Marcel Holger SchulzORCiDGND
URN:urn:nbn:de:hebis:30:3-731671
DOI:https://doi.org/10.1101/2023.01.31.526404
Parent Title (English):bioRxiv
Document Type:Preprint
Language:English
Date of Publication (online):2023/02/03
Date of first Publication:2023/02/03
Publishing Institution:Universitätsbibliothek Johann Christian Senckenberg
Release Date:2023/07/18
Issue:2023.01.31.526404
Page Number:15
HeBIS-PPN:510553273
Institutes:Medizin
Informatik und Mathematik / Informatik
Dewey Decimal Classification:0 Informatik, Informationswissenschaft, allgemeine Werke / 00 Informatik, Wissen, Systeme / 004 Datenverarbeitung; Informatik
5 Naturwissenschaften und Mathematik / 57 Biowissenschaften; Biologie / 570 Biowissenschaften; Biologie
6 Technik, Medizin, angewandte Wissenschaften / 61 Medizin und Gesundheit / 610 Medizin und Gesundheit
Sammlungen:Universitätspublikationen
Licence (German):License LogoCreative Commons - CC BY-ND - Namensnennung - Keine Bearbeitungen 4.0 International