חיפוש מתקדם
BMC Genomics
Kamburov, A., Computational Genomics Group, The European Bioinformatics Institute, EMBL, Cambridge Outstation, Cambridge CB10 1SD, United Kingdom, Max-Planck Institute for Molecular Genetics, Ihnestrasse 63-73, D-14195 Berlin, Germany
Goldovsky, L., Computational Genomics Group, The European Bioinformatics Institute, EMBL, Cambridge Outstation, Cambridge CB10 1SD, United Kingdom, Computational Genomics Unit, Center for Research and Technology Hellas, GR-57001 Thessalonica, Greece, Institute of Agrobiotechnology, Center for Research and Technology Hellas, GR-57001 Thessalonica, Greece
Freilich, S., Computational Genomics Group, The European Bioinformatics Institute, EMBL, Cambridge Outstation, Cambridge CB10 1SD, United Kingdom, Computational Genomics Unit, Center for Research and Technology Hellas, GR-57001 Thessalonica, Greece, Institute of Agrobiotechnology, Center for Research and Technology Hellas, GR-57001 Thessalonica, Greece
Kapazoglou, A., Institute of Agrobiotechnology, Center for Research and Technology Hellas, GR-57001 Thessalonica, Greece
Kunin, V., Computational Genomics Group, The European Bioinformatics Institute, EMBL, Cambridge Outstation, Cambridge CB10 1SD, United Kingdom, DOE Joint Genome Institute, 2800 Mitchel Drive, Walnut Creek, CA 94598, United States
Enright, A.J., The Sanger Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SA, United Kingdom
Tsaftaris, A., Institute of Agrobiotechnology, Center for Research and Technology Hellas, GR-57001 Thessalonica, Greece
Ouzounis, C.A., Computational Genomics Group, The European Bioinformatics Institute, EMBL, Cambridge Outstation, Cambridge CB10 1SD, United Kingdom, Computational Genomics Unit, Center for Research and Technology Hellas, GR-57001 Thessalonica, Greece, Institute of Agrobiotechnology, Center for Research and Technology Hellas, GR-57001 Thessalonica, Greece, Centre for Bioinformatics, School of Physical Sciences and Engineering, King's College London, London WC2R 2LS, United Kingdom
Background: Gene fusion detection - also known as the 'Rosetta Stone' method - involves the identification of fused composite genes in a set of reference genomes, which indicates potential interactions between its un-fused counterpart genes in query genomes. The precision of this method typically improves with an ever-increasing number of reference genomes. Results: In order to explore the usefulness and scope of this approach for protein interaction prediction and generate a high-quality, non-redundant set of interacting pairs of proteins across a wide taxonomic range, we have exhaustively performed gene fusion analysis for 184 genomes using an efficient variant of a previously developed protocol. By analyzing interaction graphs and applying a threshold that limits the maximum number of possible interactions within the largest graph components, we show that we can reduce the number of implausible interactions due to the detection of promiscuous domains. With this generally applicable approach, we generate a robust set of over 2 million distinct and testable interactions encompassing 696,894 proteins in 184 species or strains, most of which have never been the subject of high-throughput experimental proteomics. We investigate the cumulative effect of increasing numbers of genomes on the fidelity and quantity of predictions, and show that, for large numbers of genomes, predictions do not become saturated but continue to grow linearly, for the majority of the species. We also examine the percentage of component (and composite) proteins with relation to the number of genes and further validate the functional categories that are highly represented in this robust set of detected genome-wide interactions. Conclusion: We illustrate the phylogenetic and functional diversity of gene fusion events across genomes, and their usefulness for accurate prediction of protein interaction and function. © 2007 Kamburov et al; licensee BioMed Central Ltd.
פותח על ידי קלירמאש פתרונות בע"מ -
הספר "אוצר וולקני"
אודות
תנאי שימוש
Denoising inferred functional association networks obtained by gene fusion analysis
8
Kamburov, A., Computational Genomics Group, The European Bioinformatics Institute, EMBL, Cambridge Outstation, Cambridge CB10 1SD, United Kingdom, Max-Planck Institute for Molecular Genetics, Ihnestrasse 63-73, D-14195 Berlin, Germany
Goldovsky, L., Computational Genomics Group, The European Bioinformatics Institute, EMBL, Cambridge Outstation, Cambridge CB10 1SD, United Kingdom, Computational Genomics Unit, Center for Research and Technology Hellas, GR-57001 Thessalonica, Greece, Institute of Agrobiotechnology, Center for Research and Technology Hellas, GR-57001 Thessalonica, Greece
Freilich, S., Computational Genomics Group, The European Bioinformatics Institute, EMBL, Cambridge Outstation, Cambridge CB10 1SD, United Kingdom, Computational Genomics Unit, Center for Research and Technology Hellas, GR-57001 Thessalonica, Greece, Institute of Agrobiotechnology, Center for Research and Technology Hellas, GR-57001 Thessalonica, Greece
Kapazoglou, A., Institute of Agrobiotechnology, Center for Research and Technology Hellas, GR-57001 Thessalonica, Greece
Kunin, V., Computational Genomics Group, The European Bioinformatics Institute, EMBL, Cambridge Outstation, Cambridge CB10 1SD, United Kingdom, DOE Joint Genome Institute, 2800 Mitchel Drive, Walnut Creek, CA 94598, United States
Enright, A.J., The Sanger Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SA, United Kingdom
Tsaftaris, A., Institute of Agrobiotechnology, Center for Research and Technology Hellas, GR-57001 Thessalonica, Greece
Ouzounis, C.A., Computational Genomics Group, The European Bioinformatics Institute, EMBL, Cambridge Outstation, Cambridge CB10 1SD, United Kingdom, Computational Genomics Unit, Center for Research and Technology Hellas, GR-57001 Thessalonica, Greece, Institute of Agrobiotechnology, Center for Research and Technology Hellas, GR-57001 Thessalonica, Greece, Centre for Bioinformatics, School of Physical Sciences and Engineering, King's College London, London WC2R 2LS, United Kingdom
Denoising inferred functional association networks obtained by gene fusion analysis
Background: Gene fusion detection - also known as the 'Rosetta Stone' method - involves the identification of fused composite genes in a set of reference genomes, which indicates potential interactions between its un-fused counterpart genes in query genomes. The precision of this method typically improves with an ever-increasing number of reference genomes. Results: In order to explore the usefulness and scope of this approach for protein interaction prediction and generate a high-quality, non-redundant set of interacting pairs of proteins across a wide taxonomic range, we have exhaustively performed gene fusion analysis for 184 genomes using an efficient variant of a previously developed protocol. By analyzing interaction graphs and applying a threshold that limits the maximum number of possible interactions within the largest graph components, we show that we can reduce the number of implausible interactions due to the detection of promiscuous domains. With this generally applicable approach, we generate a robust set of over 2 million distinct and testable interactions encompassing 696,894 proteins in 184 species or strains, most of which have never been the subject of high-throughput experimental proteomics. We investigate the cumulative effect of increasing numbers of genomes on the fidelity and quantity of predictions, and show that, for large numbers of genomes, predictions do not become saturated but continue to grow linearly, for the majority of the species. We also examine the percentage of component (and composite) proteins with relation to the number of genes and further validate the functional categories that are highly represented in this robust set of detected genome-wide interactions. Conclusion: We illustrate the phylogenetic and functional diversity of gene fusion events across genomes, and their usefulness for accurate prediction of protein interaction and function. © 2007 Kamburov et al; licensee BioMed Central Ltd.
Scientific Publication
You may also be interested in