Skip to main content


We’d like to understand how you use our websites in order to improve them. Register your interest.

Variation in the Correlation of G + C Composition with Synonymous Codon Usage Bias among Bacteria


G + C composition at the third codon position (GC3) is widely reported to be correlated with synonymous codon usage bias. However, no quantitative attempt has been made to compare the extent of this correlation among different genomes. Here, we applied Shannon entropy from information theory to measure the degree of GC3 bias and that of synonymous codon usage bias of each gene. The strength of the correlation of GC3 with synonymous codon usage bias, quantified by a correlation coefficient, varied widely among bacterial genomes, ranging from 0.07 to 0.95. Previous analyses suggesting that the relationship between GC3 and synonymous codon usage bias is independent of species are thus inconsistent with the more detailed analyses obtained here for individual species.



  1. 1.

    Ermolaeva MD: Synonymous codon usage in bacteria. Current Issues in Molecular Biology 2001, 3(4):91-97.

  2. 2.

    Carbone A, Képès F, Zinovyev A: Codon bias signatures, organization of microorganisms in codon space, and lifestyle. Molecular Biology and Evolution 2005, 22(3):547-561.

  3. 3.

    Carbone A, Zinovyev A, Képès F: Codon adaptation index as a measure of dominating codon bias. Bioinformatics 2003, 19(16):2005-2015. 10.1093/bioinformatics/btg272

  4. 4.

    Knight RD, Freeland SJ, Landweber LF: A simple model based on mutation and selection explains trends in codon and amino-acid usage and GC composition within and across genomes. Genome Biology 2001, 2(4):research0010.1-research0010.13. 10.1186/gb-2001-2-4-research0010

  5. 5.

    Lobry JR, Necşulea A: Synonymous codon usage and its potential link with optimal growth temperature in prokaryotes. Gene 2006, 385: 128-136.

  6. 6.

    Lynn DJ, Singer GAC, Hickey DA: Synonymous codon usage is subject to selection in thermophilic bacteria. Nucleic Acids Research 2002, 30(19):4272-4277. 10.1093/nar/gkf546

  7. 7.

    Singer GAC, Hickey DA: Thermophilic prokaryotes have characteristic patterns of codon usage, amino acid composition and nucleotide content. Gene 2003, 317(1-2):39-47.

  8. 8.

    Suzuki H, Saito R, Tomita M: A problem in multivariate analysis of codon usage data and a possible solution. FEBS Letters 2005, 579(28):6499-6504. 10.1016/j.febslet.2005.10.032

  9. 9.

    Wan X-F, Xu D, Kleinhofs A, Zhou J: Quantitative relationship between synonymous codon usage bias and GC composition across unicellular genomes. BMC Evolutionary Biology 2004, 4: 19. 10.1186/1471-2148-4-19

  10. 10.

    Wright F: The 'effective number of codons' used in a gene. Gene 1990, 87(1):23-29. 10.1016/0378-1119(90)90491-9

  11. 11.

    Zeeberg B: Shannon information theoretic computation of synonymous codon usage biases in coding regions of human and mouse genomes. Genome Research 2002, 12(6):944-955. 10.1101/gr.213402

  12. 12.

    Suzuki H, Saito R, Tomita M: The 'weighted sum of relative entropy': a new index for synonymous codon usage bias. Gene 2004, 335(1-2):19-23.

  13. 13.

    Arakawa K, Mori K, Ikeda K, Matsuzaki T, Kobayashi Y, Tomita M: G-language genome analysis environment: a workbench for nucleotide sequence data mining. Bioinformatics 2003, 19(2):305-306. 10.1093/bioinformatics/19.2.305

  14. 14.

    R Development Core Team : R: a language and environment for statistical computing. Vienna, Austria; 2006.

  15. 15.

    Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, Wheeler DL: GenBank. Nucleic Acids Research 2007, 35(1):D21-D25.

  16. 16.

    Shannon CE: A mathematical theory of communication. Bell System Technical Journal 1948, 27: 379-423.

  17. 17.

    Muto A, Osawa S: The guanine and cytosine content of genomic DNA and bacterial evolution. Proceedings of the National Academy of Sciences of the United States of America 1987, 84(1):166-169. 10.1073/pnas.84.1.166

  18. 18.

    Sueoka N: On the genetic basis of variation and heterogeneity of DNA base composition. Proceedings of the National Academy of Sciences of the United States of America 1962, 48(4):582-592. 10.1073/pnas.48.4.582

  19. 19.

    Garcia-Vallve S, Romeu A, Palau J: Horizontal gene transfer in bacterial and archaeal complete genomes. Genome Research 2000, 10(11):1719-1725. 10.1101/gr.130000

  20. 20.

    Grocock RJ, Sharp PM: Synonymous codon usage in Pseudomonas aeruginosa PA01. Gene 2002, 289(1-2):131-139. 10.1016/S0378-1119(02)00503-6

  21. 21.

    McInerney JO: Replicational and transcriptional selection on codon usage in Borrelia burgdorferi . Proceedings of the National Academy of Sciences of the United States of America 1998, 95(18):10698-10703. 10.1073/pnas.95.18.10698

  22. 22.

    Sharp PM, Bailes E, Grocock RJ, Peden JF, Sockett RE: Variation in the strength of selected codon usage bias among bacteria. Nucleic Acids Research 2005, 33(4):1141-1153. 10.1093/nar/gki242

  23. 23.

    Sharp PM, Stenico M, Peden JF, Lloyd AT: Codon usage: mutational bias, translational selection, or both? Biochemical Society Transactions 1993, 21(4):835-841.

Download references

Author information



Corresponding author

Correspondence to Rintaro Saito.

Rights and permissions

Reprints and Permissions

About this article

Cite this article

Suzuki, H., Saito, R. & Tomita, M. Variation in the Correlation of G + C Composition with Synonymous Codon Usage Bias among Bacteria. J Bioinform Sys Biology 2007, 61374 (2007).

Download citation


  • Codon
  • System Biology
  • Codon Usage
  • Synonymous Codon
  • Synonymous Codon Usage