This article has Open Peer Review reports available.
‘MicroRNA Targets’, a new AthaMap web-tool for genome-wide identification of miRNA targets in Arabidopsis thaliana
© Bülow et al.; licensee BioMed Central Ltd. 2012
Received: 13 February 2012
Accepted: 16 July 2012
Published: 16 July 2012
The AthaMap database generates a genome-wide map for putative transcription factor binding sites for A. thaliana. When analyzing transcriptional regulation using AthaMap it may be important to learn which genes are also post-transcriptionally regulated by inhibitory RNAs. Therefore, a unified database for transcriptional and post-transcriptional regulation will be highly useful for the analysis of gene expression regulation.
To identify putative microRNA target sites in the genome of A. thaliana, processed mature miRNAs from 243 annotated miRNA genes were used for screening with the psRNATarget web server. Positional information, target genes and the psRNATarget score for each target site were annotated to the AthaMap database. Furthermore, putative target sites for small RNAs from seven small RNA transcriptome datasets were used to determine small RNA target sites within the A. thaliana genome.
Putative 41,965 genome wide miRNA target sites and 10,442 miRNA target genes were identified in the A. thaliana genome. Taken together with genes targeted by small RNAs from small RNA transcriptome datasets, a total of 16,600 A. thaliana genes are putatively regulated by inhibitory RNAs. A novel web-tool, ‘MicroRNA Targets’, was integrated into AthaMap which permits the identification of genes predicted to be regulated by selected miRNAs. The predicted target genes are displayed with positional information and the psRNATarget score of the target site. Furthermore, putative target sites of small RNAs from selected tissue datasets can be identified with the new ‘Small RNA Targets’ web-tool.
The integration of predicted miRNA and small RNA target sites with transcription factor binding sites will be useful for AthaMap-assisted gene expression analysis. URL: http://www.athamap.de/
KeywordsArabidopsis thaliana AthaMap MicroRNAs Small RNAs Post-transcriptional regulation
Small RNAs play an important role in post-transcriptional gene expression regulation. Currently, five classes of small RNAs are described in plants. Heterochromatin associated siRNAs, trans-acting siRNAs, repeat associated siRNAs, naturally occurring antisense siRNAs, and miRNAs . These small RNAs function as inhibitory RNAs (RNAi) by causing degradation of target mRNAs, inhibition of mRNA translation, or by eliciting epigenetic effects such as DNA methylation. The different classes of small RNAs are generated by distinct, sometimes converging pathways involving, among others, several DICER-LIKE proteins for processing double stranded RNA and several ARGONAUTE proteins for incorporating small RNAs into RNA induced silencing complexes (RISC) or RNA induced transcriptional silencing (RITS) complexes .
The fact that small RNAs have a high degree of complementarity to their target sites led to the establishment of bioinformatic resources for determining target sites and target genes. Towards these ends the miRBase database was established . Initially, miRBase served as a species independent repository for miRNAs containing mainly structural data. Later on, predicted gene targets and expression data derived from RNA deep sequencing experiments were incorporated [4, 5]. Currently, miRBase contains data for 243 different A. thaliana miRNA genes (miRBase, Release 16). In addition to miRBase, several other A. thaliana or plant specific databases with information on siRNAs and miRNAs are available such as ASRP, CSRDB, PMRD, and SoMART [6–10].
A major challenge for the bioinformatic prediction of miRNA regulated genes is the lack of identity between the miRNAs and their target sequences . Several tools were developed for computer-assisted plant miRNA target prediction such as miRU and Tapir [12, 13]. Based on experimentally verified target genes, Target-align and psRNATarget were developed for plant miRNA target detection [14, 15]. These tools take into account the sequence identity as well as the positional conservation of mismatches between miRNAs and their target genes [16, 17]. Furthermore, they consider recent findings on non-perfectly binding miRNAs and those leading to translational inhibition of the target mRNA [18, 19].
The identification of genes targeted by inhibitory RNA is useful when studying gene expression regulation using databases on cis-regulatory elements or transcription factor binding sites such as AGRIS or AthaMap [20, 21]. With these databases it is possible to predict regulatory sequences in gene promoters. Knowledge about the putative post-transcriptional regulation of the genes of interest will be important when studying gene expression regulation.
The work presented here describes the genome wide identification of miRNA target sites with the psRNATarget web server in A. thaliana. Based on genomic positions and orientation, putative target genes were identified. The genomic positions of miRNA target sites were integrated into AthaMap, a database for predicted transcription factor binding sites in the A. thaliana genome . New web-tools were developed for the online identification of putative miRNA and small RNA target genes.
Annotation of miRNA target sites in AthaMap
For genome-wide miRNA target identification in A. thaliana, the psRNATarget web server at http://plantgrn.noble.org/psRNATarget/ was used online . The psRNATarget web server contains 243 published miRNAs from the mirRBase database (Release 16, September 2010, ). Since transcripts from different miRNA genes can be processed into the same mature miRNA, target sites for closely related miRNA genes were identified with the same screening sequence. A total of 190 different screening sequences represent the 243 microRNA genes. To obtain positional information on putative miRNA target sites, the genomic sequence of A. thaliana (TAIR8) was fragmented into overlapping pieces to accommodate the size limitation of the web server. The following parameters were chosen online. The maximum expectation was set to 5.0 and flanking length around target site for target accessibility analysis was set to zero. All other parameters were left unchanged. After obtaining positional information of putative binding sites with corresponding psRNATarget score (range 0.0-5.0), the data was downloaded, and the fragmented Arabidopsis chromosomes were reassembled to obtain absolute positional information. This information was then integrated into the AthaMap database .
AthaMap update with small RNA transcriptome datasets
Small RNA datasets used in the analysis
Number of sequences
Number of genomic hits
Results and Discussion
New web-tools for genome-wide identification of miRNA and small RNA targets
The genomic screens with the processed mature miRNAs determined between 121 and 314 putative target sites for each sequence. A total of 41,965 target sites were determined for all screening sequences in the genome of A. thaliana. Next, the number of genes putatively regulated by miRNAs was predicted. Therefore, target sites in all annotated genes were determined. A target site is defined as the reverse complement of the small RNAs in the annotated transcript. A total of 15,390 miRNA target sites were detected in A. thaliana genes within the transcript of the genes (Additional file 1, S1.xls). Because the same gene can have target sites for more than one miRNA, the total number of different genes was determined to be 10,442 (Additional file 2, S2.txt). For a further resolution of specific miRNA target sites, additional file 1 provides the gene ID of the transcript, the miRNA that may target this gene, the absolute chromosomal position of the miRNA target site (the chromosome is identified by the gene ID), and the maximum expectation (score) which was obtained for this target site with the psRNATarget web server.
In addition to the ‘MicroRNA Targets’ web-tool, putative small RNA targets can be predicted using a second novel web-tool called ‘Small RNA Targets’ accessible at http://www.athamap.de/smallRNA_targets.php. It permits identification of small RNA target genes for selected small RNA transcriptome datasets similar to the ‘MicroRNA Targets’ tool.
Update of other AthaMap web-tools to identify putative post-transcriptionally regulated genes
Genes targeted by small RNAs and miRNAs are now identified in AthaMap at http://www.athamap.de when the web-tools ‘Colocalization’, ‘Gene-Analysis’, and ‘Gene Identification’ are used. With these web-tools one can determine, for example, putative combinatorial transcription factor binding sites (TFBS), common TFBS in a set of user submitted genes, or genome wide TFBS for user selected TFs [20, 32, 33]. On the result pages the number of putatively post-transcriptionally regulated genes targeted by small RNAs and/or miRNAs are identified and those genes are tagged with a gene ID in italics (small RNA), in bold (miRNA), or in italics and bold in the result table. Furthermore, putatively post-transcriptionally regulated genes can be omitted from the analysis by checking a box designated ‘exclude genes putatively regulated by small RNA’ and/or ‘exclude genes putatively regulated by miRNAs’. Furthermore, the ‘Gene Analysis’ web-tool which permits the graphical display of selected TFBS and small RNA target sites was complemented with an option to select miRNA target sites (MIR). When this option is selected, a graphic display of all submitted genes showing target sites of miRNAs within the selected gene region will be shown.
Database assisted analysis of transcriptionally and post-transcriptionally regulated genes
The information on putative post-transcriptionally regulated genes may be valuable when determining target genes of transcription factors or transcription factor binding sites within the A. thaliana genome. This will add another level of specificity to the analysis because it permits the identification of genes potentially regulated by miRNAs and this regulation may complement the regulation by TFs. The expression of microRNA genes themselves is also tightly regulated [34, 35]. Therefore, the tissue-specific or stress-specific induction or repression of a miRNA gene may downregulate or activate a target gene independently of TFs that target the same gene. Therefore, the integration of genes possibly targeted by small RNAs and/or miRNAs in a database for transcriptional regulation may contribute significantly to the functional analysis of gene expression regulation.
There are several A. thaliana specific databases that identify cis-regulatory sequences. For example AtcisDB which is part of the Arabidopsis Gene Regulatory Information Server (AGRIS) harbours experimentally verified and predicted cis-regulatory sequences in the upstream region of approximately 33,000 A. thaliana genes [21, 36–38]. Another database, Athena, contains 30,067 predicted Arabidopsis promoter sequences and consensus sequences for 105 previously characterized transcription factor (TF) binding sites . Also ATTED-II and PlantPan identify cis-regulatory sequences in A. thaliana genes [38, 40–42]. In contrast to these databases, AthaMap, a database for transcription factor bindings sites for the whole A. thaliana genome, integrates transcriptional and post-transcriptional data . The present work has extended the data content from 5,772 putatively post-transcriptionally regulated genes targeted by small RNAs to a total of 9,166 genes identified with small RNA transcriptome datasets. Most importantly, 10,442 putative target genes of mature miRNAs corresponding to 243 different microRNA genes were also identified. By tagging these genes in the AthaMap ‘Colocalization’, ‘Gene-Analysis’, and ‘Gene Identification’ web-tool results, the user can identify all putatively post-transcriptionally regulated genes and can also omit those genes in the analysis. Furthermore, the new web-tool ‘MicroRNA Targets’ reported here permits the identification of miRNA target genes in AthaMap. The identification and annotation of small RNA target sites also for intergenic regions may contribute to the functional analysis of small RNAs in epigenetic regulation of gene expression.
The identification of genes targeted by inhibitory RNAs is useful when studying gene expression regulation using databases. With AthaMap it is possible to predict regulatory sequences in gene promoters and to identify those genes that are potentially targeted by inhibitory RNAs. With the annotation of putative miRNA target sites from processed mature miRNAs from 243 miRNA genes and small RNA target sites from nine small RNA transcriptome datasets a total of 16,600 A. thaliana genes are predicted to be potentially regulated by inhibitory RNAs.
This work was supported by the German Federal Ministry for Education and Research within the GABI ADVANCIS and PLANT-KBBE STREG networks (BMBF Grants No. 0315037B and No. 0315459A). Part of the results have been achieved within the framework of the Transnational (Germany, France, Spain) Cooperation within the PLANT-KBBE Initiative, with funding from Ministerio de Ciencia e Innovación, Agence Nationale de la Recherche (ANR) and BMBF.
- Allen E, Howell MD: miRNAs in the biogenesis of trans-acting siRNAs in higher plants. Semin Cell Dev Biol. 2010, 21: 798-804. 10.1016/j.semcdb.2010.03.008.View ArticlePubMedGoogle Scholar
- Jones-Rhoades MW, Bartel DP, Bartel B: MicroRNAS and their regulatory roles in plants. Annu Rev Plant Biol. 2006, 57: 19-53. 10.1146/annurev.arplant.57.032905.105218.View ArticlePubMedGoogle Scholar
- Griffiths-Jones S: The microRNA Registry. Nucleic Acids Res. 2004, 32: D109-111. 10.1093/nar/gkh023.View ArticlePubMedPubMed CentralGoogle Scholar
- Griffiths-Jones S, Grocock RJ, van Dongen S, Bateman A, Enright AJ: miRBase: microRNA sequences, targets and gene nomenclature. Nucleic Acids Res. 2006, 34: D140-144. 10.1093/nar/gkj112.View ArticlePubMedGoogle Scholar
- Kozomara A, Griffiths-Jones S: miRBase: integrating microRNA annotation and deep-sequencing data. Nucleic Acids Res. 2011, 39: D152-157. 10.1093/nar/gkq1027.View ArticlePubMedGoogle Scholar
- Gustafson AM, Allen E, Givan S, Smith D, Carrington JC, Kasschau KD: ASRP: the Arabidopsis Small RNA Project Database. Nucleic Acids Res. 2005, 33: D637-640.View ArticlePubMedGoogle Scholar
- Backman TW, Sullivan CM, Cumbie JS, Miller ZA, Chapman EJ, Fahlgren N, Givan SA, Carrington JC, Kasschau KD: Update of ASRP: the Arabidopsis Small RNA Project database. Nucleic Acids Res. 2008, 36: D982-985.View ArticlePubMedGoogle Scholar
- Johnson C, Bowman L, Adai AT, Vance V, Sundaresan V: CSRDB: a small RNA integrated database and browser resource for cereals. Nucleic Acids Res. 2007, 35: D829-833. 10.1093/nar/gkl991.View ArticlePubMedGoogle Scholar
- Zhang Z, Yu J, Li D, Zhang Z, Liu F, Zhou X, Wang T, Ling Y, Su Z: PMRD: plant microRNA database. Nucleic Acids Res. 2009, 38: D806-813.View ArticlePubMedPubMed CentralGoogle Scholar
- Li F, Orban R, Baker B: SoMART, a web server for plant miRNA, tasiRNA and target gene analysis. Plant J,70:891–901.Google Scholar
- Dai X, Zhuang Z, Zhao PX: Computational analysis of miRNA targets in plants: current status and challenges. Brief Bioinform. 2011, 12: 115-121. 10.1093/bib/bbq065.View ArticlePubMedGoogle Scholar
- Zhang Y: miRU: an automated plant miRNA target prediction server. Nucleic Acids Res. 2005, 33: W701-704. 10.1093/nar/gki383.View ArticlePubMedPubMed CentralGoogle Scholar
- Bonnet E, He Y, Billiau K, Van de Peer Y: TAPIR, a web server for the prediction of plant microRNA targets, including target mimics. Bioinformatics. 2010, 26: 1566-1568. 10.1093/bioinformatics/btq233.View ArticlePubMedGoogle Scholar
- Xie F, Zhang B: Target-align: a tool for plant microRNA target identification. Bioinformatics. 2010, 26: 3002-3003. 10.1093/bioinformatics/btq568.View ArticlePubMedGoogle Scholar
- Dai X, Zhao PX: psRNATarget: a plant small RNA target analysis server. Nucleic Acids Res. 2011, 39: W155-159. 10.1093/nar/gkr319.View ArticlePubMedPubMed CentralGoogle Scholar
- Rhoades MW, Reinhart BJ, Lim LP, Burge CB, Bartel B, Bartel DP: Prediction of plant microRNA targets. Cell. 2002, 110: 513-520. 10.1016/S0092-8674(02)00863-2.View ArticlePubMedGoogle Scholar
- Schwab R, Palatnik JF, Riester M, Schommer C, Schmid M, Weigel D: Specific effects of microRNAs on the plant transcriptome. Dev Cell. 2005, 8: 517-527. 10.1016/j.devcel.2005.01.018.View ArticlePubMedGoogle Scholar
- Brodersen P, Sakvarelidze-Achard L, Bruun-Rasmussen M, Dunoyer P, Yamamoto YY, Sieburth L, Voinnet O: Widespread translational inhibition by plant miRNAs and siRNAs. Science. 2008, 320: 1185-1190. 10.1126/science.1159151.View ArticlePubMedGoogle Scholar
- Montgomery TA, Howell MD, Cuperus JT, Li D, Hansen JE, Alexander AL, Chapman EJ, Fahlgren N, Allen E, Carrington JC: Specificity of ARGONAUTE7-miR390 interaction and dual functionality in TAS3 trans-acting siRNA formation. Cell. 2008, 133: 128-141. 10.1016/j.cell.2008.02.033.View ArticlePubMedGoogle Scholar
- Bülow L, Brill Y, Hehl R: AthaMap-assisted transcription factor target gene identification in Arabidopsis thaliana. Database (Oxford). 2010, 2010: baq034-10.1093/database/baq034.View ArticleGoogle Scholar
- Yilmaz A, Mejia-Guerra MK, Kurz K, Liang X, Welch L, Grotewold E: AGRIS: the Arabidopsis Gene Regulatory Information Server, an update. Nucleic Acids Res. 2011, 39: D1118-1122. 10.1093/nar/gkq1120.View ArticlePubMedGoogle Scholar
- Steffens NO, Galuschka C, Schindler M, Bülow L, Hehl R: AthaMap: an online resource for in silico transcription factor binding sites in the Arabidopsis thaliana genome. Nucleic Acids Res. 2004, 32: D368-372. 10.1093/nar/gkh017.View ArticlePubMedPubMed CentralGoogle Scholar
- Bülow L, Engelmann S, Schindler M, Hehl R: AthaMap, integrating transcriptional and post-transcriptional data. Nucleic Acids Res. 2009, 37: D983-986. 10.1093/nar/gkn709.View ArticlePubMedGoogle Scholar
- Lu C, Tej SS, Luo S, Haudenschild CD, Meyers BC, Green PJ: Elucidation of the small RNA component of the transcriptome. Science. 2005, 309: 1567-1569. 10.1126/science.1114112.View ArticlePubMedGoogle Scholar
- Swarbreck D, Wilks C, Lamesch P, Berardini TZ, Garcia-Hernandez M, Foerster H, Li D, Meyer T, Muller R, Ploetz L: The Arabidopsis Information Resource (TAIR): gene structure and function annotation. Nucleic Acids Res. 2008, 36: D1009-1014.View ArticlePubMedGoogle Scholar
- Axtell MJ, Jan C, Rajagopalan R, Bartel DP: A two-hit trigger for siRNA biogenesis in plants. Cell. 2006, 127: 565-577. 10.1016/j.cell.2006.09.032.View ArticlePubMedGoogle Scholar
- Rajagopalan R, Vaucheret H, Trejo J, Bartel DP: A diverse and evolutionarily fluid set of microRNAs in Arabidopsis thaliana. Genes Dev. 2006, 20: 3407-3425. 10.1101/gad.1476406.View ArticlePubMedPubMed CentralGoogle Scholar
- Kasschau KD, Fahlgren N, Chapman EJ, Sullivan CM, Cumbie JS, Givan SA, Carrington JC: Genome-wide profiling and analysis of Arabidopsis siRNAs. PLoS Biol. 2007, 5: e57-10.1371/journal.pbio.0050057.View ArticlePubMedPubMed CentralGoogle Scholar
- Fahlgren N, Howell MD, Kasschau KD, Chapman EJ, Sullivan CM, Cumbie JS, Givan SA, Law TF, Grant SR, Dangl JL, Carrington JC: High-throughput sequencing of Arabidopsis microRNAs: evidence for frequent birth and death of MIRNA genes. PLoS One. 2007, 2: e219-10.1371/journal.pone.0000219.View ArticlePubMedPubMed CentralGoogle Scholar
- Howell MD, Fahlgren N, Chapman EJ, Cumbie JS, Sullivan CM, Givan SA, Kasschau KD, Carrington JC: Genome-wide analysis of the RNA-DEPENDENT RNA POLYMERASE6/DICER-LIKE4 pathway in Arabidopsis reveals dependency on miRNA- and tasiRNA-directed targeting. Plant Cell. 2007, 19: 926-942. 10.1105/tpc.107.050062.View ArticlePubMedPubMed CentralGoogle Scholar
- Barrett T, Troup DB, Wilhite SE, Ledoux P, Rudnev D, Evangelista C, Kim IF, Soboleva A, Tomashevsky M, Edgar R: NCBI GEO: mining tens of millions of expression profiles–database and tools update. Nucleic Acids Res. 2007, 35: D760-765. 10.1093/nar/gkl887.View ArticlePubMedGoogle Scholar
- Steffens NO, Galuschka C, Schindler M, Bülow L, Hehl R: AthaMap web tools for database-assisted identification of combinatorial cis-regulatory elements and the display of highly conserved transcription factor binding sites in Arabidopsis thaliana. Nucleic Acids Res. 2005, 33: W397-402. 10.1093/nar/gki395.View ArticlePubMedPubMed CentralGoogle Scholar
- Galuschka C, Schindler M, Bülow L, Hehl R: AthaMap web-tools for the analysis and identification of co-regulated genes. Nucleic Acids Res. 2007, 35: D857-D862. 10.1093/nar/gkl1006.View ArticlePubMedGoogle Scholar
- Megraw M, Baev V, Rusinov V, Jensen ST, Kalantidis K, Hatzigeorgiou AG: MicroRNA promoter element discovery in Arabidopsis. RNA. 2006, 12: 1612-1619. 10.1261/rna.130506.View ArticlePubMedPubMed CentralGoogle Scholar
- Liu HH, Tian X, Li YJ, Wu CA, Zheng CC: Microarray-based analysis of stress-regulated microRNAs in Arabidopsis thaliana. RNA. 2008, 14: 836-843. 10.1261/rna.895308.View ArticlePubMedPubMed CentralGoogle Scholar
- Davuluri RV, Sun H, Palaniswamy SK, Matthews N, Molina C, Kurtz M, Grotewold E: AGRIS: Arabidopsis Gene Regulatory Information Server, an information resource of Arabidopsis cis-regulatory elements and transcription factors. BMC Bioinforma. 2003, 4: 25-10.1186/1471-2105-4-25.View ArticleGoogle Scholar
- Palaniswamy SK, James S, Sun H, Lamb RS, Davuluri RV, Grotewold E: AGRIS and AtRegNet. a platform to link cis-regulatory elements and transcription factors into regulatory networks. Plant Physiol. 2006, 140: 818-829. 10.1104/pp.105.072280.View ArticlePubMedPubMed CentralGoogle Scholar
- Chang WC, Lee TY, Huang HD, Huang HY, Pan RL: PlantPAN: Plant promoter analysis navigator, for identifying combinatorial cis-regulatory elements with distance constraint in plant gene groups. BMC Genomics. 2008, 9: 561-10.1186/1471-2164-9-561.View ArticlePubMedPubMed CentralGoogle Scholar
- O'Connor TR, Dyreson C, Wyrick JJ: Athena: a resource for rapid visualization and systematic analysis of Arabidopsis promoter sequences. Bioinformatics. 2005, 21: 4411-4413. 10.1093/bioinformatics/bti714.View ArticlePubMedGoogle Scholar
- Obayashi T, Kinoshita K, Nakai K, Shibaoka M, Hayashi S, Saeki M, Shibata D, Saito K, Ohta H: ATTED-II: a database of co-expressed genes and cis elements for identifying co-regulated gene groups in Arabidopsis. Nucleic Acids Res. 2007, 35: D863-869. 10.1093/nar/gkl783.View ArticlePubMedGoogle Scholar
- Obayashi T, Hayashi S, Saeki M, Ohta H, Kinoshita K: ATTED-II provides coexpressed gene networks for Arabidopsis. Nucleic Acids Res. 2009, 37: D987-991. 10.1093/nar/gkn807.View ArticlePubMedGoogle Scholar
- Obayashi T, Nishida K, Kasahara K, Kinoshita K: ATTED-II updates: condition-specific gene coexpression to extend coexpression analyses and applications to a broad range of flowering plants. Plant Cell Physiol. 2011, 52: 213-219. 10.1093/pcp/pcq203.View ArticlePubMedPubMed CentralGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.