Advanced Search
Journal of Computational Biology

Toubiana, D., Department of Plant Sciences, University of California, 1 Shields Avenue, Davis, CA  95616, United States; Puzis, R., Department of Software and Information Systems Engineering, Ben-Gurion University of the Negev, Beer Sheva, Israel; Blumwald, E., Department of Plant Sciences, University of California, 1 Shields Avenue, Davis, CA  95616, United States.

Weighted gene co-expression network analysis (WGCNA) is a widely used software tool that is used to establish relationships between phenotypic traits and gene expression data. It generates gene modules and then correlates their first principal component to phenotypic traits, proposing a functional relationship expressed by the correlation coefficient. However, gene modules often contain thousands of genes of different functional backgrounds. Here, we developed a stochastic optimization algorithm, known as genetic algorithm (GA), optimizing the trait to gene module relationship by gradually increasing the correlation between the trait and a subset of genes of the gene module. We exemplified the GA on a Japanese plum hormone profile and an RNA-seq dataset. The correlation between the subset of module genes and the trait increased, whereas the number of correlated genes became sufficiently small, allowing for their individual assessment. Gene ontology (GO) term enrichment analysis of the gene sets identified by the GA showed an increase in specificity of the GO terms associated with fruit hormone balance as compared with the GO enrichment analysis of the gene modules generated by WGCNA and other methods.

Powered by ClearMash Solutions Ltd -
Volcani treasures
About
Terms of use
A Genetic Algorithm to Optimize Weighted Gene Co-Expression Network Analysis
26

Toubiana, D., Department of Plant Sciences, University of California, 1 Shields Avenue, Davis, CA  95616, United States; Puzis, R., Department of Software and Information Systems Engineering, Ben-Gurion University of the Negev, Beer Sheva, Israel; Blumwald, E., Department of Plant Sciences, University of California, 1 Shields Avenue, Davis, CA  95616, United States.

A Genetic Algorithm to Optimize Weighted Gene Co-Expression Network Analysis

Weighted gene co-expression network analysis (WGCNA) is a widely used software tool that is used to establish relationships between phenotypic traits and gene expression data. It generates gene modules and then correlates their first principal component to phenotypic traits, proposing a functional relationship expressed by the correlation coefficient. However, gene modules often contain thousands of genes of different functional backgrounds. Here, we developed a stochastic optimization algorithm, known as genetic algorithm (GA), optimizing the trait to gene module relationship by gradually increasing the correlation between the trait and a subset of genes of the gene module. We exemplified the GA on a Japanese plum hormone profile and an RNA-seq dataset. The correlation between the subset of module genes and the trait increased, whereas the number of correlated genes became sufficiently small, allowing for their individual assessment. Gene ontology (GO) term enrichment analysis of the gene sets identified by the GA showed an increase in specificity of the GO terms associated with fruit hormone balance as compared with the GO enrichment analysis of the gene modules generated by WGCNA and other methods.

Scientific Publication
You may also be interested in