- The salt-responsive transcriptome of chickpea roots and nodules via deepSuperSAGE (2011)
- Background: The combination of high-throughput transcript profiling and next-generation sequencing technologies is a prerequisite for genome-wide comprehensive transcriptome analysis. Our recent innovation of deepSuperSAGE is based on an advanced SuperSAGE protocol and its combination with massively parallel pyrosequencing on Roche's 454 sequencing platform. As a demonstration of the power of this combination, we have chosen the salt stress transcriptomes of roots and nodules of the third most important legume crop chickpea (Cicer arietinum L.). While our report is more technology-oriented, it nevertheless addresses a major world-wide problem for crops generally: high salinity. Together with low temperatures and water stress, high salinity is responsible for crop losses of millions of tons of various legume (and other) crops. Continuously deteriorating environmental conditions will combine with salinity stress to further compromise crop yields. As a good example for such stress-exposed crop plants, we started to characterize salt stress responses of chickpeas on the transcriptome level. Results: We used deepSuperSAGE to detect early global transcriptome changes in salt-stressed chickpea. The salt stress responses of 86,919 transcripts representing 17,918 unique 26bp deepSuperSAGE tags (UniTags) from roots of the salt-tolerant variety INRAT-93 two hours after treatment with 25 mM NaCl were characterized. Additionally, the expression of 57,281 transcripts representing 13,115 UniTags was monitored in nodules of the same plants. From a total of 144,200 analyzed 26bp tags in roots and nodules together, 21,401 unique transcripts were identified. Of these, only 363 and 106 specific transcripts, respectively, were commonly up- or down-regulated (>3.0-fold) under salt stress in both organs, witnessing a differential organ-specific response to stress. Profiting from recent pioneer works on massive cDNA sequencing in chickpea, more than 9,400 UniTags were able to be linked to UniProt entries. Additionally, gene ontology (GO) categories over-representation analysis enabled to filter out enriched biological processes among the differentially expressed UniTags. Subsequently, the gathered information was further cross-checked with stress-related pathways. From several filtered pathways, here we focus exemplarily on transcripts associated with the generation and scavenging of reactive oxygen species (ROS), as well as on transcripts involved in Na+ homeostasis. Although both processes are already very well characterized in other plants, the information generated in the present work is of high value. Information on expression profiles and sequence similarity for several hundreds of transcripts of potential interest is now available. Conclusions: This report demonstrates, that the combination of the high-throughput transcriptome profiling technology SuperSAGE with one of the next-generation sequencing platforms allows deep insights into the first molecular reactions of a plant exposed to salinity. Cross validation with recent reports enriched the information about the salt stress dynamics of more than 9,000 chickpea ESTs, and enlarged their pool of alternative transcripts isoforms. As an example for the high resolution of the employed technology that we coin deepSuperSAGE, we demonstrate that ROS-scavenging and -generating pathways undergo strong global transcriptome changes in chickpea roots and nodules already 2 hours after onset of moderate salt stress (25mM NaCl). Additionally, a set of more than 15 candidate transcripts are proposed to be potential components of the salt overly sensitive (SOS) pathway in chickpea. Newly identified transcript isoforms are potential targets for breeding novel cultivars with high salinity tolerance. We demonstrate that these targets can be integrated into breeding schemes by micro-arrays and RT-PCR assays downstream of the generation of 26bp tags by SuperSAGE.
- SymGRASS: a database of sugarcane orthologous genes involved in arbuscular mycorrhiza and root nodule symbiosis : from Seventh International Meeting on Computational Intelligence Methods for Bioinformatics and Biostatistics, (CIBB 2010), Palermo, Italy, 16 - 18 September 2010 (2013)
- Background: The rationale for gathering information from plants procuring nitrogen through symbiotic interactions controlled by a common genetic program for a sustainable biofuel production is the high energy demanding application of synthetic nitrogen fertilizers. We curated sequence information publicly available for the biofuel plant sugarcane, performed an analysis of the common SYM pathway known to control symbiosis in other plants, and provide results, sequences and literature links as an online database. Methods: Sugarcane sequences and informations were downloaded from the nucEST database, cleaned and trimmed with seqclean, assembled with TGICL plus translating mapping method, and annotated. The annotation is based on BLAST searches against a local formatted plant Uniprot90 generated with CD-HIT for functional assignment, rpsBLAST to CDD database for conserved domain analysis, and BLAST search to sorghum's for Gene Ontology (GO) assignment. Gene expression was normalized according the Unigene standard, presented as ESTs/100 kb. Protein sequences known in the SYM pathway were used as queries to search the SymGRASS sequence database. Additionally, antimicrobial peptides described in the PhytAMP database served as queries to retrieve and generate expression profiles of these defense genes in the libraries compared to the libraries obtained under symbiotic interactions. Results: We describe the SymGRASS, a database of sugarcane orthologous genes involved in arbuscular mycorrhiza (AM) and root nodule (RN) symbiosis. The database aggregates knowledge about sequences, tissues, organ, developmental stages and experimental conditions, and provides annotation and level of gene expression for sugarcane transcripts and SYM orthologous genes in sugarcane through a web interface. Several candidate genes were found for all nodes in the pathway, and interestingly a set of symbiosis specific genes was found. Conclusions: The knowledge integrated in SymGRASS may guide studies on molecular, cellular and physiological mechanisms by which sugarcane controls the establishment and efficiency of endophytic associations. We believe that the candidate sequences for the SYM pathway together with the pool of exclusively expressed tentative consensus (TC) sequences are crucial for the design of molecular studies to unravel the mechanisms controlling the establishment of symbioses in sugarcane, ultimately serving as a basis for the improvement of grass crops.
- SuperSAGE : the drought stress-responsive transcriptome of chickpea roots (2008)
- Background Drought is the major constraint to increase yield in chickpea (Cicer arietinum). Improving drought tolerance is therefore of outmost importance for breeding. However, the complexity of the trait allowed only marginal progress. A solution to the current stagnation is expected from innovative molecular tools such as transcriptome analyses providing insight into stress-related gene activity, which combined with molecular markers and expression (e)QTL mapping, may accelerate knowledge-based breeding. SuperSAGE, an improved version of the serial analysis of gene expression (SAGE) technique, generating genome-wide, high-quality transcription profiles from any eukaryote, has been employed in the present study. The method produces 26 bp long fragments (26 bp tags) from defined positions in cDNAs, providing sufficient sequence information to unambiguously characterize the mRNAs. Further, SuperSAGE tags may be immediately used to produce microarrays and probes for real-time-PCR, thereby overcoming the lack of genomic tools in non-model organisms. Results We applied SuperSAGE to the analysis of gene expression in chickpea roots in response to drought. To this end, we sequenced 80,238 26 bp tags representing 17,493 unique transcripts (UniTags) from drought-stressed and non-stressed control roots. A total of 7,532 (43%) UniTags were more than 2.7-fold differentially expressed, and 880 (5.0%) were regulated more than 8-fold upon stress. Their large size enabled the unambiguous annotation of 3,858 (22%) UniTags to genes or proteins in public data bases and thus to stress-response processes. We designed a microarray carrying 3,000 of these 26 bp tags. The chip data confirmed 79% of the tag-based results, whereas RT-PCR confirmed the SuperSAGE data in all cases. Conclusion This study represents the most comprehensive analysis of the drought-response transcriptome of chickpea available to date. It demonstrates that – inter alias – signal transduction, transcription regulation, osmolyte accumulation, and ROS scavenging undergo strong transcriptional remodelling in chickpea roots already 6 h after drought stress. Certain transcript isoforms characterizing these processes are potential targets for breeding for drought tolerance. We demonstrate that these can be easily accessed by micro-arrays and RT-PCR assays readily produced downstream of SuperSAGE. Our study proves that SuperSAGE owns potential for molecular breeding also in non-model crops.
- Novel SSR markers from BAC-end sequences, DArT arrays and a comprehensive genetic map with 1,291 marker loci for chickpea (Cicer arietinum L.) (2011)
- Chickpea (Cicer arietinum L.) is the third most important cool season food legume, cultivated in arid and semi-arid regions of the world. The goal of this study was to develop novel molecular markers such as microsatellite or simple sequence repeat (SSR) markers from bacterial artificial chromosome (BAC)-end sequences (BESs) and diversity arrays technology (DArT) markers, and to construct a high-density genetic map based on recombinant inbred line (RIL) population ICC 4958 (C. arietinum)×PI 489777 (C. reticulatum). A BAC-library comprising 55,680 clones was constructed and 46,270 BESs were generated. Mining of these BESs provided 6,845 SSRs, and primer pairs were designed for 1,344 SSRs. In parallel, DArT arrays with ca. 15,000 clones were developed, and 5,397 clones were found polymorphic among 94 genotypes tested. Screening of newly developed BES-SSR markers and DArT arrays on the parental genotypes of the RIL mapping population showed polymorphism with 253 BES-SSR markers and 675 DArT markers. Segregation data obtained for these polymorphic markers and 494 markers data compiled from published reports or collaborators were used for constructing the genetic map. As a result, a comprehensive genetic map comprising 1,291 markers on eight linkage groups (LGs) spanning a total of 845.56 cM distance was developed (http://cmap.icrisat.ac.in/cmap/sm/cp/thudi/). The number of markers per linkage group ranged from 68 (LG 8) to 218 (LG 3) with an average inter-marker distance of 0.65 cM. While the developed resource of molecular markers will be useful for genetic diversity, genetic mapping and molecular breeding applications, the comprehensive genetic map with integrated BES-SSR markers will facilitate its anchoring to the physical map (under construction) to accelerate map-based cloning of genes in chickpea and comparative genome evolution studies in legumes.
- Massive analysis of cDNA Ends (MACE) and miRNA expression profiling identifies proatherogenic pathways in chronic kidney disease (2013)
- Epigenetic dysregulation contributes to the high cardiovascular disease burden in chronic kidney disease (CKD) patients. Although microRNAs (miRNAs) are central epigenetic regulators, which substantially affect the development and progression of cardiovascular disease (CVD), no data on miRNA dysregulation in CKD-associated CVD are available until now. We now performed high-throughput miRNA sequencing of peripheral blood mononuclear cells from ten clinically stable hemodialysis (HD) patients and ten healthy controls, which allowed us to identify 182 differentially expressed miRNAs (e.g., miR-21, miR-26b, miR-146b, miR-155). To test biological relevance, we aimed to connect miRNA dysregulation to differential gene expression. Genome-wide gene expression profiling by MACE (Massive Analysis of cDNA Ends) identified 80 genes to be differentially expressed between HD patients and controls, which could be linked to cardiovascular disease (e.g., KLF6, DUSP6, KLF4), to infection / immune disease (e.g., ZFP36, SOCS3, JUND), and to distinct proatherogenic pathways such as the Toll-like receptor signaling pathway (e.g., IL1B, MYD88, TICAM2), the MAPK signaling pathway (e.g., DUSP1, FOS, HSPA1A), and the chemokine signaling pathway (e.g., RHOA, PAK1, CXCL5). Formal interaction network analysis proved biological relevance of miRNA dysregulation, as 68 differentially expressed miRNAs could be connected to 47 reciprocally expressed target genes. Our study is the first comprehensive miRNA analysis in CKD that links dysregulated miRNA expression with differential expression of genes connected to inflammation and CVD. After recent animal data suggested that targeting miRNAs is beneficial in experimental CVD, our data may now spur further research in the field of CKD-associated human CVD.
- Integration of novel SSR and gene-based SNP marker loci in the chickpea genetic map and establishment of new anchor points with Medicago truncatula genome (2010)
- This study presents the development and mapping of simple sequence repeat (SSR) and single nucleotide polymorphism (SNP) markers in chickpea. The mapping population is based on an inter-specific cross between domesticated and non-domesticated genotypes of chickpea (Cicer arietinum ICC 4958 × C. reticulatum PI 489777). This same population has been the focus of previous studies, permitting integration of new and legacy genetic markers into a single genetic map. We report a set of 311 novel SSR markers (designated ICCM—ICRISAT chickpea microsatellite), obtained from an SSR-enriched genomic library of ICC 4958. Screening of these SSR markers on a diverse panel of 48 chickpea accessions provided 147 polymorphic markers with 2–21 alleles and polymorphic information content value 0.04–0.92. Fifty-two of these markers were polymorphic between parental genotypes of the inter-specific population. We also analyzed 233 previously published (H-series) SSR markers that provided another set of 52 polymorphic markers. An additional 71 gene-based SNP markers were developed from transcript sequences that are highly conserved between chickpea and its near relative Medicago truncatula. By using these three approaches, 175 new marker loci along with 407 previously reported marker loci were integrated to yield an improved genetic map of chickpea. The integrated map contains 521 loci organized into eight linkage groups that span 2,602 cM, with an average inter-marker distance of 4.99 cM. Gene-based markers provide anchor points for comparing the genomes of Medicago and chickpea, and reveal extended synteny between these two species. The combined set of genetic markers and their integration into an improved genetic map should facilitate chickpea genetics and breeding, as well as translational studies between chickpea and Medicago.
- APADB: a database for alternative polyadenylation and microRNA regulation events (2014)
- Alternative polyadenylation (APA) is a widespread mechanism that contributes to the sophisticated dynamics of gene regulation. Approximately 50% of all protein-coding human genes harbor multiple polyadenylation (PA) sites; their selective and combinatorial use gives rise to transcript variants with differing length of their 3' untranslated region (3'UTR). Shortened variants escape UTR-mediated regulation by microRNAs (miRNAs), especially in cancer, where global 3'UTR shortening accelerates disease progression, dedifferentiation and proliferation. Here we present APADB, a database of vertebrate PA sites determined by 3' end sequencing, using massive analysis of complementary DNA ends. APADB provides (A)PA sites for coding and non-coding transcripts of human, mouse and chicken genes. For human and mouse, several tissue types, including different cancer specimens, are available. APADB records the loss of predicted miRNA binding sites and visualizes next-generation sequencing reads that support each PA site in a genome browser. The database tables can either be browsed according to organism and tissue or alternatively searched for a gene of interest. APADB is the largest database of APA in human, chicken and mouse. The stored information provides experimental evidence for thousands of PA sites and APA events. APADB combines 3' end sequencing data with prediction algorithms of miRNA binding sites, allowing to further improve prediction algorithms. Current databases lack correct information about 3'UTR lengths, especially for chicken, and APADB provides necessary information to close this gap. Database URL: http://tools.genxpro.net/apadb/