Journal of Computer Science and Technology  2010, 25(1) 53-70 DOI:     ISSN: 1000-9000 CN: CN 11-2296/TP

本期目录 | 过刊浏览 | 高级检索                                                            [打印本页]   [关闭]
扩展功能
本文信息
Supporting info
PDF(648KB)
服务与反馈
把本文推荐给朋友
加入我的书架
加入引用管理器
引用本文
Email Alert
文章反馈
浏览反馈信息
本文关键词相关文章
bioinformatics
microbial genomics
genome structure
comparative genome analysis
biological pathways
本文作者相关文章
Ying Xu
PubMed
Article by Ying Xu
中文题目: 探索细菌基因组结构中面临的计算挑战
中文导读

在进化的过程当中,细菌的细胞机器的功能是如何决定其基因组中基因的排列的?在这篇文章中,作者就如何用计算的方法研究这个问题展开了探讨,日益庞大的基因组数据库能够帮助我们对这一问题有更加深入的理解。作者讨论了在研究各个层次的基因组结构中遇到的一些挑战性的计算问题,试图得到对于细菌基因组如何组织的内在机理的认识。
细菌是地球上最简单的能够独立生存的生物。细菌虽小,却能够完成一些令人惊叹的事情,并且,他们在保持生态稳定、人体健康等方面也发挥着巨大的作用。所有这一切,都被编码在了细菌的基因组中。那么,对这个神秘的基因组,我们知道了多少呢?哪些还不知道呢?哪些能够知道呢?
A,关于细菌的基因组我们已知的信息。
细菌的基因组是有结构的。基因先被组织成了操纵子(operons),然后再被组织成更高层次的调控子(regulons)。操纵子是基本的转录单元,而调控子是基本的细胞反应系统单元。所有的基因可以被分成三类:1)编码蛋白质的基因,2)编码蛋白质功能的基因,3)非编码RNA基因。需要指出的是第二类基因,一个蛋白质的功能并没有直接编码在基因组里,而最终是由其氨基酸序列决定的。作者对这三类基因,以及操纵子、启动子、顺势调控元件、可移动元件等做了相应介绍。
B,关于细菌的基因组我们哪些还不知道。
我们把细菌看成一个复杂的机器,那么这台机器由三个相互作用的系统组成:1)代谢系统,2)调控系统,3)信号系统。同时,我们应该把细菌的基因组看成一本说明书,它指导这三个系统协调工作。但是:1)我们对书写这部说明书的语言知之甚少,2)我们对这部机器的设计原理知之甚少,3)我们对编码生物通路的基因在基因组中的组织形式有了一定认识,但是对他们为什么会以这种方式组织知之甚少,4)我们对外来基因如何适应新的环境以及如何行使新的功能知之甚少……总而言之,对于这部机器和其说明书,我们所知道的只是冰山一角。
C,显然,细菌基因组中蕴含的信息和我们已经知道的信息之间有一个巨大的鸿沟。好的消息是,很多新知识是能够从已知知识推导出来,来填补这个鸿沟的。在这篇文章剩下的部分里,作者讨论了在可预见的将来,一些可以只从基因组序列推导得到的新知识。
首先,很大一部分新信息可以通过基因组的比较得到(比较基因组分析,Comparative Genome Analyses)。比较基因组的目的是寻找同源基因(orthologous genes),但是由于对于同源基因没有一个可操作的定义(operational definition),使得到目前为止还没有一个普遍认可的算法来解决这个问题,现有的方法大多是基于序列相似性比较。
在发现新知识的过程中,基因组的可视化会给予我们很大的帮助。可视化不仅给予生物学家巨大的便利,而且,伴随着基因组数据库的膨胀,计算学家也可以独立的发现新知识。其中值得一提的是超级操纵子(uber-operons)和基因组条形码(barcode of genome)的发现。
目前,对于操纵子和顺势调控主体的预测和识别已经有了较大进展。很多预测软件已经经过了实践的检验,预测精度也已经达到了90%。另外,最近的研究表明操纵子还具有子结构,子结构中的基因独立于操纵子中的其他基因进行转录。
除此之外,寻找功能上有关联的基因,预测调控子,探究操纵子在代谢通路中的分布规律的内在机理,识别可以动的元件,发展新的基因组可视化工具……这些问题都在卓有成效的进展着,这些进展也使我们相信能够从已知知识得到很多新知识,并且最终揭示细菌基因组的整体特性和内在机理。“It is the right time to do this.”

Computational Challenges in Deciphering Genomic Structures of Bacteria

Ying Xu (徐鹰)

Computational Systems Biology Laboratory, Department of Biochemistry and Molecular Biology and Institute of Bioinformatics, The University of Georgia, Athens, GA 30602, U.S.A.
BESC BioEnergy Science Center, U.S.A.
College of Computer Science and Technology, Jilin University, Changchun 130012, China

Abstract:

This article addresses how the functionalities of the cellular machinery of a bacterium might have constrained the genomic arrangement of its genes during evolution and how we can study such problems using computational approaches, taking full advantage of the rapidly increasing pool of the sequenced bacterial genomes, potentially leading to a much improved understanding of why a bacterial genome is organized in the way it is. This article discusses a number of challenging computational problems in elucidating the genomic structures at multiple levels and the information that is encoded through these genomic structures, gearing towards the ultimate understanding of the governing rules of bacterial genome organization.

Keywords: bioinformatics    microbial genomics    genome structure    comparative genome analysis    biological pathways  
收稿日期 2009-10-01 修回日期 2009-11-16 出版日期  
DOI:
基金项目:

The work is supported in part by the NSF of USA (Grant Nos. DBI-0354771, ITR-IIS-0407204, DBI-0542119, CCF0621700), NIH of USA (Grant Nos. 1R01GM075331 and 1R01GM081682) and the grant for the BioEnergy Science Center.

作者简介:
Ying Xu is the Regents-Georgia Research Alliance Eminent Scholar Chair and Professor in the Department of Biochemistry and Molecular Biology and the Institute of Bioinformatics, the University of Georgia. He received his Ph.D. degree in theoretical computer science from the University of Colorado at Boulder in 1991. He was a visiting assistant worked for Oak Ridge National Laboratory from 1993 to 2003, where he was a senior staff scientist and a group leader. His current research interests include (a) computational and systems biology relevant to human cancer and early detection, (b) microbial genomes and encoded pathways, and (c) plant genomes and plant cell walls. He has published over 200 research articles and four books covering different areas of bioinformatics and systems biology.

参考文献:

[1] Karatan E, Watnick P. Signals, regulatory networks, and materials that build and break bacterial biofilms. Microbiol. Mol. Biol. Rev., 2009, 73(2): 310-347.
[2] An D, Parsek M R. The promise and peril of transcriptional profiling in biofilm communities. Curr. Opin. Microbiol., 2007, 10(3): 292-296.
[3] Hoffman L R, D’Argenio D A, MacCoss M J, Zhang Z, Jones R A, Miller S I. Aminoglycoside antibiotics induce bacterial biofilm formation. Nature, 2005, 436(7054): 1171-1175.
[4] Hall-Stoodley L, Costerton J W, Stoodley P. Bacterial biofilms: From the natural environment to infectious diseases. Nat. Rev. Microbiol., 2004, 2(2): 95-108.
[5] How Deep is the Gene Pool? Astrobiology Magazine European Edition, 2008, http://www.astrobio.net/amee/summer 2008/ Interviews/AnthonyPooleInterview.php.
[6] Ben-Jacob E. Bacterial know how: From physics to cybernetics. PhysicaPlus, 2006, 7, http://physicaplus.org.il/zope/ home/en/1124811264/1145390912 eshel en.
[7] Fleischmann R D, Adams M D, White O, Clayton R A, Kirkness E F, Kerlavage A R, Bult C J, Tomb J F, Dougherty B A, Merrick J M et al. Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. Science, 1995, 269(5223): 496-512.
[8] Pruitt K D, Tatusova T, Klimke W, Maglott D R. NCBI Reference sequences: Current status, policy and new initiatives. Nucleic Acids Res., 2009, 37(Database Issue): D32-D36.
[9] Rocha E P. The replication-related organization of bacterial genomes. Microbiology, 2004, 150(Pt 6): 1609-1627.
[10] Mackiewicz D, Mackiewicz P, Kowalczuk M, Dudkiewicz M, Dudek M R, Cebrat S. Rearrangements between differently replicating DNA strands in asymmetric bacterial genomes. Acta Microbiol. Pol., 2003, 52(3): 245-260.
[11] Reznikoff W S. The operon revisited. Annu. Rev. Genet., 1972, 6: 133-156.
[12] Ames B N, Martin R G. Biochemical aspects of genetics: The operon. Annu. Rev. Biochem., 1964, 33: 235-258.
[13] Mao F, Dam P, Chou J, Olman V, Xu Y. DOOR: A database for prokaryotic operons. Nucleic Acids Res., 2009, 37(Database Issue): D459-D463.
[14] Dam P, Olman V, Harris K, Su Z, Xu Y. Operon prediction using both genome-specific and general genomic information. Nucleic Acids Res., 2007, 35(1): 288-298.
[15] Su Z, Olman V, Xu Y. Computational prediction of Pho regulons in cyanobacteria. BMC Genomics, 2007, 8: 156.
[16] Claverys J P, Prudhomme M, Martin B. Induction of competence regulons as a general response to stress in gram-positive bacteria. Annu. Rev. Microbiol., 2006, 60(1): 451-475.
[17] Yasbin R E, Cheo D L, Bayles K W. Inducible DNA repair and differentiation in Bacillus subtilis: Interactions between global regulons. Mol. Microbiol., 1992, 6(10): 1263-1270.
[18] Zhou F, Xu Y. RepPop: A database for repetitive elements in Populus trichocarpa. BMC Genomics, 2009, 10: 14.
[19] Zhou F, Olman V, Xu Y. Insertion sequences show diverse recent activities in Cyanobacteria and Archaea. BMC Genomics, 2008, 9: 36.
[20] Zhou F, Tran T, Xu Y. Nezha, a novel active miniature inverted-repeat transposable element in cyanobacteria. Biochem. Biophys. Res. Commun., 2008, 365(4): 790-794.
[21] Hayes F. Transposon-based strategies for microbial functional genomics and proteomics. Annu. Rev. Genet., 2003, 37: 3- 29.
[22] Hamer L, DeZwaan T M, Montenegro-Chamorro M V, Frank S A, Hamer J E. Recent advances in large-scale transposon mutagenesis. Curr. Opin. Chem. Biol., 2001, 5(1): 67-73.
[23] Izawa T, Ohnishi T, Nakano T et al. Transposon tagging in rice. Plant Mol. Biol., 1997, 35(1/2): 219-229.
[24] Noguchi H, Park J, Takagi T. MetaGene: Prokaryotic gene finding from environmental genome shotgun sequences. Nucleic Acids Res., 2006, 34(19): 5623-5630.
[25] Nielsen P, Krogh A. Large-scale prokaryotic gene prediction and comparison to genome annotation. Bioinformatics, 2005, 21(24): 4322-4329.
[26] Hannenhalli S S, Hayes W S, Hatzigeorgiou A G, Fickett J W. Bacterial start site prediction. Nucleic Acids Res., 1999, 27(17): 3577-3582.
[27] Solovyev V, Kosarev P, Seledsov I, Vorobyev D. Automatic annotation of eukaryotic genes, pseudogenes and promoters. Genome Biol., 2006, 7(Suppl. 1): S10.1-12.
[28] Ellrott K, Guo J T, Olman V, Xu Y. Improving the performance of protein threading using insertion/deletion frequency arrays. J. Bioinform. Comput. Biol., 2008, 6(3): 585-602.
[29] Guo J T, Xu Y. Towards modeling of amyloid fibril structures. Front. Biosci., 2008, 13: 4039-4050.
[30] Marnef A, Sommerville J, Ladomery M R. RAP55: Insights into an evolutionarily conserved protein family. Int. J. Biochem. Cell Biol., 2009, 41(5): 977-981.
[31] Finn R D, Tate J, Mistry J, Coggill P C, Sammut S J, Hotz H R, Ceric G, Forslund K, Eddy S R, Sonnhammer E L et al. The Pfam protein families database. Nucleic Acids Res., 2008, 36(Database Issue): D281-D288.
[32] Hulo N, Bairoch A, Bulliard V, Cerutti L, Cuche B A, de Castro E, Lachaize C, Langendijk-Genevaux P S, Sigrist C J. The 20 years of PROSITE. Nucleic Acids Res., 2008, 36(Database Issue): D245-D249.
[33] Bork P. Powers and pitfalls in sequence analysis: The 70% hurdle. Genome Res., 2000, 10(4): 398-400.
[34] Aravin A A, Hannon G J. Small RNA silencing pathways in germ and stem cells. Cold Spring Harb. Symp. Quant. Biol., 2008, 73: 283-290.
[35] Mattick J S, Amaral P P, Dinger M E, Mercer T R, Mehler M F. RNA regulation of epigenetic processes. Bioessays, 2009, 31(1): 51-59.
[36] Stricklin S L, Griffiths-Jones S, Eddy S R. C. elegans noncoding RNA genes. WormBook, 2005, 1-7.
[37] Goodrich J A, Kugel J F. From bacteria to humans, chromatin to elongation, and activation to repression: The expanding roles of noncoding RNAs in regulating transcription. Crit. Rev. Biochem. Mol. Biol., 2009, 44(1): 3-15.
[38] Bradley R K, Uzilov A V, Skinner M E, Bendana Y R, Barquist L, Holmes I. Evolutionary modeling and prediction of non-coding RNAs in Drosophila. PLoS One, 2009, 4(8): e6478.
[39] Childs L, Nikoloski Z, May P, Walther D. Identification and classification of ncRNA molecules using graph properties. Nucleic Acids Res., 2009, 37(9): e66.
[40] Voss B, Georg J, Schon V, Ude S, Hess W R. Biocomputational prediction of non-coding RNAs in model cyanobacteria. BMC Genomics, 2009, 10: 123.
[41] Song D, Yang Y, Yu B, Zheng B, Deng Z, Lu B L, Chen X, Jiang T. Computational prediction of novel non-coding RNAs in Arabidopsis thaliana. BMC Bioinformatics, 2009, 10(Suppl 1): S36.
[42] Wang S, Wang Y, Du W, Sun F, Wang X, Zhou C, Liang Y. A multi-approaches-guided genetic algorithm with application to operon prediction. Artif. Intell. Med., 2007, 41(2): 151-159.
[43] Tran T T, Dam P, Su Z, Poole F L, 2nd, Adams M W, Zhou G T, Xu Y. Operon prediction in Pyrococcus furiosus. Nucleic Acids Res., 2007, 35(1): 11-20.
[44] Zhang G Q, Cao Z W, Luo Q M, Cai Y D, Li Y X. Operon prediction based on SVM. Comput. Biol. Chem., 2006, 30(3): 233-240.
[45] Price M N, Arkin A P, Alm E J. OpWise: Operons aid the identification of differentially expressed genes in bacterial microarray experiments. BMC Bioinformatics, 2006, 7: 19.
[46] Alm E J, Huang K H, Price M N, Koche R P, Keller K, Dubchak I L, Arkin A P. The MicrobesOnline Web site for comparative genomics. Genome Res., 2005, 15(7): 1015-1022.
[47] Loewen P C, Hengge-Aronis R. The role of the sigma factor sigma S (KatF) in bacterial global regulation. Annu. Rev. Microbiol., 1994, 48: 53-80.
[48] Errington J. Bacillus subtilis sporulation: Regulation of gene expression and control of morphogenesis. Microbiol. Rev., 1993, 57(1): 1-33.
[49] Stragier P, Losick R. Cascades of sigma factors revisited. Mol. Microbiol., 1990, 4(11): 1801-1806.
[50] Prakash A, Tompa M. Discovery of regulatory elements in vertebrates through comparative genomics. Nat. Biotechnol, 2005, 23(10): 1249-1256.
[51] Tompa M, Li N, Bailey T L, Church G M, De Moor B, Eskin E, Favorov A V, Frith M C, Fu Y, Kent W J et al. Assessing computational tools for the discovery of transcription factor binding sites. Nat. Biotechnol., 2005, 23(1): 137-144.
[52] Chen Y, Zhou F, Li G, Xu Y. A recently active miniature inverted-repeat transposable element, Chunjie, inserted into an operon without disturbing the operon structure in Geobacter uraniireducens Rf4. Genetics, 2008, 179(4): 2291-2297.
[53] Xu Z, Wang H. LTR FINDER: An efficient tool for the prediction of full-length LTR retrotransposons. Nucleic Acids Res., 2007, 35(Web Server Issue): W265-W268.
[54] Feschotte C, Keswani U, Ranganathan N, Guibotsy M L, Levine D. Exploring repetitive DNA landscapes using REPCLASS, a tool that automates the classification of transposable elements in eukaryotic genomes. Genome Biol. Evol., 2009, pp.205-220.
[55] Zhou F, Olman V, Xu Y. Barcodes for genomes and applications. BMC Bioinformatics, 2008, 9(1): 546.
[56] Whitaker J W, McConkey G A, Westhead D R. Prediction of horizontal gene transfers in eukaryotes: Approaches and challenges. Biochem. Soc. Trans., 2009, 37(Pt 4): 792-795.
[57] Fournier G P, Huang J, Gogarten J P. Horizontal gene transfer from extinct and extant lineages: Biological innovation and the coral of life. Philos. Trans. R. Soc. Lond. B. Biol. Sci., 2009, 364(1527): 2229-2239.
[58] Huang J, Gogarten J P. Ancient gene transfer as a tool in phylogenetic reconstruction. Methods Mol. Biol., 2009, 532: 127-139.
[59] Taylor R, Singhal M. Biological network inference and analysis using SEBINI and CABIN. Methods Mol. Biol., 2009, 541: 551-576.
[60] Schadt E E, Zhang B, Zhu J. Advances in systems biology are enhancing our understanding of disease and moving us closer to novel disease treatments. Genetica, 2009, 136(2): 259-269.
[61] Kreutz C, Timmer J. Systems biology: Experimental design. FEBS J., 2009, 276(4): 923-942.
[62] Iyengar R. Computational biochemistry: Systems biology minireview series. J. Biol. Chem., 2009, 284(9): 5425-5426.
[63] van Gend C, Snoep J L. Systems biology model databases and resources. Essays Biochem., 2008, 45: 223-236.
[64] Sauro H M, Bergmann F T. Standards and ontologies in computational systems biology. Essays Biochem., 2008, 45: 211- 222.
[65] Brul S, Mensonides F I, Hellingwerf K J, Teixeira de Mattos M J. Microbial systems biology: New frontiers open to predictive microbiology. Int. J. Food Microbiol., 2008, 128(1): 16-21.
[66] Davidov E, Holland J, Marple E, Naylor S. Advancing drug discovery through systems biology. Drug Discov. Today, 2003, 8(4): 175-183.
[67] Ideker T, Galitski T, Hood L. A new approach to decoding life: Systems biology. Annu. Rev. Genomics. Hum. Genet, 2001, 2: 343-372.
[68] Griswold A. Genome packaging in prokaryotes: The circular chromosome of E. coli. Nature Education, 2008, 1(1).
[69] Mason D J, Powelson D M. Nuclear division as observed in live bacteria by a new technique. J. Bacteriol., 1956, 71(4): 474-479.
[70] Gogarten J P, Townsend J P. Horizontal gene transfer, genome innovation and evolution. Nat. Rev. Microbiol., 2005, 3(9): 679-687.
[71] Koonin E V, Makarova K S, Aravind L. Horizontal gene transfer in prokaryotes: Quantification and classification. Annu. Rev. Microbiol., 2001, 55: 709-742.
[72] Lawrence J G, Hendrickson H. Genome evolution in bacteria: Order beneath chaos. Curr. Opin. Microbiol., 2005, 8(5): 572-578.
[73] Preidis G A, Versalovic J. Targeting the human microbiome with antibiotics, probiotics, and prebiotics: Gastroenterology enters the metagenomics era. Gastroenterology, 2009, 136(6): 2015-2031.
[74] Petrosino J F, Highlander S, Luna R A, Gibbs R A, Versalovic J. Metagenomic pyrosequencing and microbial identification. Clin. Chem., 2009, 55(5): 856-866.
[75] Hattori M, Taylor T D. The human intestinal microbiome: A new frontier of human biology. DNA Res., 2009, 16(1): 1-12.
[76] Sivachenko A Y, Yuryev A, Daraselia N, Mazo I. Molecular networks in microarray analysis. J. Bioinform. Comput. Biol., 2007, 5(2B): 429-456.
[77] Wade J T, Struhl K, Busby S J, Grainger D C. Genomic analysis of protein-DNA interactions in bacteria: Insights into transcription and chromosome organization. Mol. Microbiol., 2007, 65(1): 21-26.
[78] Tian F, Shah P K, Liu X, Negre N, Chen J, Karpenko O, White K P, Grossman R L. Flynet: A genomic resource for Drosophila melanogaster transcriptional regulatory networks. Bioinformatics, 2009, 25(22): 3001-3004.
[79] Kaufmann K,Muino JM, Jauregui R, Airoldi C A, Smaczniak C, Krajewski P, Angenent G C. Target genes of the MADS transcription factor SEPALLATA3: Integration of developmental and hormonal pathways in the Arabidopsis flower. PLoS Biol., 2009, 7(4): e1000090.
[80] Gilchrist D A, Fargo D C, Adelman K. Using ChIP-chip and ChIP-seq to study the regulation of gene expression: Genomewide localization studies reveal widespread regulation of transcription elongation. Methods, 2009, 48(4): 398-408.
[81] Lau K W, Jones A R, Swainston N, Siepen J A, Hubbard S J. Capture and analysis of quantitative proteomic data. Proteomics, 2007, 7(16): 2787-2799.
[82] Budzikiewicz H, Grigsby R D. Mass spectrometry and isotopes: A century of research and discussion. Mass Spectrom Rev., 2006, 25(1): 146-157.
[83] Walker G S, O’Connell T N. Comparison of LC-NMR and conventional NMR for structure elucidation in drug metabolism studies. Expert Opin. Drug Metab. Toxicol., 2008, 4(10): 1295-1305.
[84] Mesnard F, Ratcliffe R G. NMR analysis of plant nitrogen metabolism. Photosynth. Res., 2005, 83(2): 163-180.
[85] Bruckner A, Polge C, Lentze N, Auerbach D, Schlattner U. Yeast two-hybrid, a powerful tool for systems biology. Int. J. Mol. Sci., 2009, 10(6): 2763-2788.
[86] Lee E J, Hyun S, Chun J, Shin S H, Kang S S. Ubiquitylation of Fe65 adaptor protein by neuronal precursor cell expressed developmentally down regulated 4-2 (Nedd4-2) via the WW domain interaction with Fe65. Exp. Mol. Med., 2009, 41(8): 555-568.
[87] Chun J, Kwon T, Lee E J, Hyun S, Hong S K, Kang S S. The subcellular localization of 3-phosphoinositide-dependent protein kinase is controlled by caveolin-1 binding. Biochem. Biophys. Res. Commun., 2005, 326(1): 136-146.
[88] Warren E M, Huang H, Fanning E, Chazin W J, Eichman B F. Physical interactions between MCM10, DNA, AND DNA polymerase α. J. Biol. Chem., 2009, 284(36): 24662-24672.
[89] Hrmova M, Fincher G B. Functional genomics and structural biology in the definition of gene function. Methods Mol. Biol., 2009, 513: 199-227.
[90] Li H, Thanassi D G. Use of a combined cryo-EM and X-ray crystallography approach to reveal molecular details of bacterial pilus assembly by the chaperone/usher pathway. Curr. Opin. Microbiol., 2009, 12(3): 326-332.
[91] Ritchie D W. Recent progress and future directions in proteinprotein docking. Curr. Protein Pept. Sci., 2008, 9(1): 1-15.
[92] Xie G, Keyhani N O, Bonner C A, Jensen R A. Ancient origin of the tryptophan operon and the dynamics of evolutionary change. Microbiol. Mol. Biol. Rev., 2003, 67(3): 303-342.
[93] Mao F, Su Z, Olman V, Dam P, Liu Z, Xu Y. Mapping of orthologous genes in the context of biological pathways: An application of integer programming. Proc. Natl. Acad. Sci. USA, 2006, 103(1): 129-134.
[94] Chen X, Su Z, Xu Y, Jiang T. Computational prediction of operons in Synechococcus sp. WH8102. Genome Inform., 2004, 15(2): 211-222.
[95] Fulton D L, Li Y Y, Laird M R, Horsman B G, Roche F M, Brinkman F S. Improving the specificity of high-throughput ortholog prediction. BMC Bioinformatics, 2006, 7: 270.
[96] Li G, Che D, Xu Y. A universal operon predictor for prokaryotic genomes. J. Bioinform. Comput. Biol., 2009, 7(1): 19- 38.
[97] Che D, Li G, Mao F, Wu H, Xu Y. Detecting uber-operons in prokaryotic genomes. Nucleic Acids Res., 2006, 34(8): 2418- 2427.
[98] Walker A K, See R, Batchelder C, Kophengnavong T, Gronniger J T, Shi Y, Blackwell T K. A conserved transcription motif suggesting functional parallels between Caenorhabditis elegans SKN-1 and Cap’n’Collar-related basic leucine zipper proteins. J. Biol. Chem., 2000, 275(29): 22166-22171.
[99] Musso G, Zhang Z, Emili A. Retention of protein complex membership by ancient duplicated gene products in budding yeast. Trends Genet., 2007, 23(6): 266-269.
[100] Wang T, Furey T S, Connelly J J, Ji S, Nelson S, Heber S, Gregory S G, Hauser E R. A general integrative genomic feature transcription factor binding site prediction method applied to analysis of USF1 binding in cardiovascular disease. Hum. Genomics, 2009, 3(3): 221-235.
[101] Conesa A, Gotz S. Blast2GO: A comprehensive suite for functional analysis in plant genomics. Int. J. Plant Genomics., 2008, 2008: 619832.
[102] Yan B, Methe B A, Lovley D R, Krushkal J. Computational prediction of conserved operons and phylogenetic footprinting of transcription regulatory elements in the metal-reducing bacterial family Geobacteraceae. J. Theor. Biol., 2004, 230(1): 133-144.
[103] Sharon I, Davis J V, Yona G. Prediction of protein-protein interactions: A study of the co-evolution model. Methods Mol. Biol., 2009, 541: 61-88.
[104] Ventura M, Turroni F, Canchaya C, Vaughan EE, O’Toole PW, van Sinderen D. Microbial diversity in the human intestine and novel insights from metagenomics. Front. Biosci., 2009, 14: 3214-3221.
[105] Jaeger C, Hatziagelaki E, Petzoldt R, Bretzel R G. Comparative analysis of organ-specific autoantibodies and celiac disease—Associated antibodies in type 1 diabetic patients, their first-degree relatives, and healthy control subjects. Diabetes Care, 2001, 24(1): 27-32.
[106] Morita M, Shibuya M, Kushiro T, Masuda K, Ebizuka Y. Molecular cloning and functional expression of triterpene synthases from pea (Pisum sativum) new alpha-amyrinproducing enzyme is a multifunctional triterpene synthase. Eur. J Biochem., 2000, 267(12): 3453-3460.
[107] Bader M, Abouelhoda M I, Ohlebusch E. A fast algorithm for the multiple genome rearrangement problem with weighted reversals and transpositions. BMC Bioinformatics, 2008, 9: 516.
[108] Jiang X F, Yang J. A novel approach to predict proteinprotein interactions related to alzheimer’s disease based on complex network. Protein Pept. Lett., Sept. 2009.
[109] Moriya Y, Itoh M, Okuda S, Yoshizawa A C, Kanehisa M. KAAS: An automatic genome annotation and pathway reconstruction server. Nucleic Acids Res., 2007, 35(Web Server Issue): W182-W185.
[110] Berglund A C, Sjolund E, Ostlund G, Sonnhammer E L. In- Paranoid 6: Eukaryotic ortholog clusters with inparalogs. Nucleic Acids Res., 2008, 36(Database Issue): D263-D266.
[111] Tatusov R L, Fedorova N D, Jackson J D, Jacobs A R, Kiryutin B, Koonin E V, Krylov D M, Mazumder R, Mekhedov S L, Nikolskaya A N et al. The COG database: An updated version includes eukaryotes. BMC Bioinformatics, 2003, 4: 41.
[112] Lathe W C, 3rd, Snel B, Bork P. Gene context conservation of a higher order than operons. Trends Biochem. Sci., 2000, 25(10): 474-479.
[113] Karlin S, Mrazek J, Ma J, Brocchieri L. Predicted highly expressed genes in archaeal genomes. Proc. Natl. Acad. Sci. USA, 2005, 102(20): 7303-7308.
[114] Cormen T H, Leiserson C E, Rivest R L, Stein C. Introduction to Algorithms, Second Edition. Cambridge, MA: The MIT Press, 2001.
[115] Fani R, Brilli M, Lio P. The origin and evolution of operons: The piecewise building of the proteobacterial histidine operon. J. Mol. Evol., 2005, 60(3): 378-390.
[116] Su Z, Mao F,DamP, WuH, OlmanV, Paulsen IT, Palenik B, Xu Y. Computational inference and experimental validation of the nitrogen assimilation regulatory network in cyanobacterium Synechococcus sp. WH 8102. Nucleic Acids Res., 2006, 34(3): 1050-1065.
[117] Salgado H, Gama-Castro S, Martinez-Antonio A, Diaz- Peredo E, Sanchez-Solano F, Peralta-Gil M, Garcia-Alonso D, Jimenez-Jacinto V, Santos-Zavaleta A, Bonavides-Martinez C et al. RegulonDB (version 4.0): Transcriptional regulation, operon organization and growth conditions in Escherichia coli K-12. Nucleic Acids Res., 2004, 32(Database Issue): D303- D306.
[118] De Hoon M J, Imoto S, Kobayashi K, Ogasawara N, Miyano S. Predicting the operon structure of Bacillus subtilis using operon length, intergene distance, and gene expression information. Pac. Symp. Biocomput., 2004, 9: 276-287.
[119] Lin H Y, Bledsoe P J, Stewart V. Activation of yeaR-yoaG operon transcription by the nitrate-responsive regulator NarL is independent of oxygen-responsive regulator Fnr in Escherichia coli K-12. J. Bacteriol., 2007, 189(21): 7539-7548.
[120] Barthelmebs L, Lecomte B, Divies C, Cavin J F. Inducible metabolism of phenolic acids in Pediococcus pentosaceus is encoded by an autoregulated operon which involves a new class of negative transcriptional regulator. J. Bacteriol., 2000, 182(23): 6724-6731.
[121] Dale C J, Moses E K, Ong C C, Morrow C J, Reed M B, Hasse D, Strugnell R A. Identification and sequencing of the groE operon and flanking genes of Lawsonia intracellularis: Use in phylogeny. Microbiology, 1998, 144(Pt 8): 2073-2084.
[122] Bockhorst J, Craven M, Page D, Shavlik J, Glasner J. A Bayesian network approach to operon prediction. Bioinformatics, 2003, 19(10): 1227-1235.
[123] Kowarz L, Robbe-Saule V, Norel F. Identification of cis-acting DNA sequences involved in the transcription of the virulence regulatory gene spvR in Salmonella typhimurium. Mol. Gen. Genet., 1996, 251(2): 225-235.
[124] Mrazek J, Karlin S. Strand compositional asymmetry in bacterial and large viral genomes. Proc. Natl. Acad. Sci. USA, 1998, 95(7): 3720-3725.
[125] Yachie N, Arakawa K, Tomita M. On the interplay of gene positioning and the role of rho-independent terminators in Escherichia coli. FEBS Lett., 2006, 580(30): 6909-6914.
[126] Bockhorst J, Qiu Y, Glasner J, Liu M, Blattner F, Craven M. Predicting bacterial transcription units using sequence and expression data. Bioinformatics, 2003, 19(Suppl 1): i34-i43.
[127] Stormo G D, Hartzell G W, 3rd. Identifying protein-binding sites from unaligned DNA fragments. Proc. Natl. Acad .Sci. USA, 1989, 86(4): 1183-1187.
[128] Bailey T L, Boden M, Buske F A, Frith M, Grant C E, Clementi L, Ren J, Li W W, Noble W S. MEME SUITE: Tools for motif discovery and searching. Nucleic Acids Res., 2009, 37(Web Server Issue): W202-W208.
[129] Liu X, Brutlag D L, Liu J S. BioProspector: Discovering conserved DNA motifs in upstream regulatory regions of coexpressed genes. Pac. Symp. Biocomput., 2001, 6: 127-138.
[130] Cliften P, Sudarsanam P, Desikan A, Fulton L, Fulton B, Majors J, Waterston R, Cohen B A, Johnston M. Finding functional features in Saccharomyces genomes by phylogenetic footprinting. Science, 2003, 301(5629): 71-76.
[131] Blanchette M, Tompa M. Discovery of regulatory elements by a computational method for phylogenetic footprinting. Genome Res., 2002, 12(5): 739-748.
[132] Wu H, Mao F, Olman V, Xu Y. On application of directons to functional classification of genes in prokaryotes. Comput. Biol. Chem., 2008, 32(3): 176-184.
[133] Wu H, Mao F, Olman V, Xu Y. Hierarchical classification of functionally equivalent genes in prokaryotes. Nucleic Acids Res., 2007, 35(7): 2125-2140.
[134] Bowers P M, Cokus S J, Eisenberg D, Yeates T O. Use of logic relationships to decipher protein network organization. Science, 2004, 306(5705): 2246-2249.
[135] Jiang T, Keating A E. AVID: An integrative framework for discovering functional relationships among proteins. BMC Bioinformatics, 2005, 6: 136.
[136] Yu C, Zavaljevski N, Desai V, Johnson S, Stevens F J, Reifman J. The development of PIPA: An integrated and automated pipeline for genome-wide protein function annotation. BMC Bioinformatics, 2008, 9: 52.
[137] Aoki-Kinoshita K F, Kanehisa M. Gene annotation and pathway mapping in KEGG. Methods Mol. Biol., 2007, 396: 71- 91.
[138] Caspi R, Foerster H, Fulcher C A, Hopkinson R, Ingraham J, Kaipa P, Krummenacker M, Paley S, Pick J, Rhee S Yet al. MetaCyc: A multiorganism database of metabolic pathways and enzymes. Nucleic Acids Res., 2006, 34(Database Issue): D511-D516.
[139] Buckler D R, Zhou Y, Stock A M. Evidence of intradomain and interdomain flexibility in an OmpR/PhoB homolog from Thermotoga maritima. Structure, 2002, 10(2): 153-164.
[140] Perez E, Samper S, Bordas Y, Guilhot C, Gicquel B, Martin C. An essential role for phoP in Mycobacterium tuberculosis virulence. Mol. Microbiol., 2001, 41(1): 179-187.
[141] Hengge R. The two-component network and the general stress sigma factor RpoS (sigma S) in Escherichia coli. Adv. Exp. Med. Biol., 2008, 631: 40-53.
[142] Campbell E A, Westblade L F, Darst S A. Regulation of bacterial RNA polymerase sigma factor activity: A structural perspective. Curr. Opin. Microbiol., 2008, 11(2): 121-127.
[143] Germer J, Becker G, Metzner M, Hengge-Aronis R. Role of activator site position and a distal UP-element half-site for sigma factor selectivity at a CRP/H-NS-activated sigma(s)- dependent promoter in Escherichia coli. Mol. Microbiol., 2001, 41(3): 705-716.
[144] Colland F, Barth M, Hengge-Aronis R, Kolb A. Sigma factor selectivity of Escherichia coli RNA polymerase: Role for CRP, IHF and lrp transcription factors. EMBO J., 2000, 19(12): 3028-3037.
[145] Kivistik P A, Kivi R, Kivisaar M, Horak R. Identification of ColR binding consensus and prediction of regulon of ColRS two-component system. BMC Mol. Biol., 2009, 10: 46.
[146] Munch R, Hiller K, Grote A, Scheer M, Klein J, Schobert M, Jahn D. Virtual footprint and PRODORIC: An integrative framework for regulon prediction in prokaryotes. Bioinformatics, 2005, 21(22): 4187-4189.
[147] Yellaboina S, Ranjan S, Chakhaiyar P, Hasnain S E, Ranjan A. Prediction of DtxR regulon: Identification of binding sites and operons controlled by Diphtheria toxin repressor in Corynebacterium diphtheriae. BMC Microbiol., 2004, 4: 38.
[148] Dombrecht B, Marchal K, Vanderleyden J, Michiels J. Prediction and overview of the RpoN-regulon in closely related species of the Rhizobiales. Genome Biol., 2002, 3(12): RESEARCH0076.
[149] Smith A D, Sumazin P, Xuan Z, Zhang M Q. DNA motifs in human and mouse proximal promoters predict tissue-specific expression. Proc. Natl. Acad. Sci. USA, 2006, 103(16): 6275-6280.
[150] Jacob F, Monod J. On the regulation of gene activity. Cold Spring Harbor Symposia on Quantitative Biology, 1961, 26: 193-211.
[151] Okuda S, Yamada T, Hamajima M, Itoh M, Katayama T, Bork P, Goto S, Kanehisa M. KEGG Atlas mapping for global analysis of metabolic pathways. Nucleic Acids Res., 2008, 36(Web Server Issue): W423-W426.
[152] Yin Y, Zhang H, Xu Y. A governing rule for gene arrangement at a global scale in bacterial genomes. submitted, 2009.
[153] Faith J J, Driscoll M E, Fusaro V A, Cosgrove E J, Hayete B, Juhn F S, Schneider S J, Gardner T S. Many microbe microarrays database: Uniformly normalized Affymetrix compendia with structured experimental metadata. Nucleic Acids Res., 2008, 36(Database Issue): D866-D870.
[154] Langille M G I, Zhou F, Fedynak A, Hsiao W W L, Xu Y, Brinkman F S L. Mobile Genetic Elements and Their Prediction. Computational Methods for Understanding Bacterial and Archaeal Genomes, Xu Y, Gogarten J P (eds.), London: Imperial College Press, 2008, pp.113-136.
[155] Gogarten J P, Zhaxybayeva O. Horizontal Gene Transfer: Its Detection and Role in Microbial Evolution. Computational Methods for Understanding Bacterial and Archaeal Genomes, Xu Y, Gogarten J P (eds.), London: Imperial College Press, 2008, pp.137-152.
[156] Vitte C, Panaud O. LTR retrotransposons and flowering plant genome size: Emergence of the increase/decrease model. Cytogenet Genome Res., 2005, 110(1-4): 91-107.
[157] Craig N L, Craigie R, Gellert M, Lambowitz A M. Mobile DNA II. Washington DC: American Society for Microbiology, 2002.
[158] Bestor T H. Transposons reanimated in mice. Cell, 2005, 122(3): 322-325.
[159] Siguier P, Perochon J, Lestrade L, Mahillon J, Chandler M. ISfinder: The reference centre for bacterial insertion sequences. Nucleic Acids Res., 2006, 34(Database Issue): D32- D36.
[160] Chandler M, Mahillon J. Insertion Sequences Revisited. 2nd Ed, Washington DC: American Society of Microbiology, 2002.

文章评论

Copyright 2008 by Journal of Computer Science and Technology