Identification of genomic islands in the genome of Bacillus cereus by comparative analysis with Bacillus anthracis1

Ren Zhang1 and Chun-Ting Zhang2

1 Department of Epidemiology and Biostatistics, Tianjin Cancer Institute and Hospital, Tianjin 300060
2 Department of Physics, Tianjin University, Tianjin 300072, China


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 REFERENCES
 
Horizontal gene transfer has been recognized as a universal event throughout bacterial evolution. The availability of both complete genome sequences of Bacillus cereus and B. anthracis provides the possibility to perform comparative analysis based on their genomes. By using a windowless method to display the distribution of the genomic GC content of B. cereus and B. anthracis, we have found three genomic islands in the genome of B. cereus, i.e., BCGI-1, BCGI-2, and BCGI-3, respectively, which are absent in the genome of B. anthracis. All the genomic islands have abrupt changes in GC content compared with that of surrounding regions. BCGI-1 has many conserved features of genomic islands, e.g., a Val-tRNA gene is utilized as the integration site, and a site-specific recombinase gene is located at the 3' end. BCGI-2 has a large percentage of phage protein, suggesting a phage-related recombination is involved. BCGI-3 contains a ferric anguibactin transport system, which is likely to be involved in the iron transport that enables the bacterium to overcome the iron limitation in the host. In addition, BCGI-3 also contains a cluster of genes related to lantibiotics, which may play a role during the evolution of the genome. Furthermore, the integrations of the genomic islands, BCGI-1 and BCGI-3, result in deletions of DNA sequence fragments; therefore, such integrations lead to both gene gain and gene loss simultaneously.

genomic island; cumulative GC profile


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 REFERENCES
 
BACILLUS CEREUS is a spore-forming, gram-positive, ubiquitous soil bacterium, which is an opportunistic pathogen causing both gastrointestinal and nongastrointestinal infections (11). One of the closest relatives of B. cereus is B. anthracis, and both of them belong to the B. cereus group of bacteria (8). B. anthracis has become notorious as a biological weapon because of its ability to cause inhalation anthrax (2). The toxicity of B. anthracis is believed to be due to the presence of the plasmids that contain the virulence genes (16, 17). Recently, both the genomes of B. cereus and B. anthracis were sequenced (10, 19). The availability of the complete genome sequences of these two bacteria provides the possibility to perform comparative analysis based on their genome sequences (18).

Horizontal gene transfer has been recognized as a universal event throughout bacterial evolution (9, 14, 15). Genomic islands contain clusters of horizontally transferred genes. Obtaining foreign genes is an effective way to alter the genotype of a bacterium, which may lead to the creation of new traits or even new species (3, 4, 7, 12, 13).

The identification of genomic islands has received intense interest during the past few years. Among the methods to detect the horizontal gene transfer events in bacteria, assessing the change in GC content remains an established and effective way. Usually, as a routine procedure, the distribution of the genomic GC content is calculated by counting the frequency of G and C bases within the sliding windows that move along genomes. However, in this method the window size is difficult to adjust, i.e., large window size leads to low resolution, whereas small window size leads to large statistical fluctuations. Recently, a windowless method to calculate GC content, the cumulative GC profile, was proposed (22). The resolution of the cumulative GC profile in displaying the genomic GC content is high since no sliding window is used. This method has been used to identify genomic islands in the genomes of Corynebacterium glutamicum and Vibrio vulnificus (24). In this brief communication, the cumulative GC profile was used to detect genomic islands in B. cereus, based on comparison with B. anthracis. Consequently, three genomic islands have been identified. One genomic island, BCGI-3, contains a cluster of genes that encode the ferric anguibactin transport system, which may play a role in enabling the bacteria to overcome iron limitation in the host. In addition, BCGI-3 also contains a cluster of genes related to lantibiotics, which may have an impact on the evolution of the genome.


    MATERIALS AND METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 REFERENCES
 
The complete genome sequences of the bacteria B. cereus ATCC 14579 (AE016877) and B. anthracis (AE016879) were downloaded from GenBank (http://www.ncbi.nlm.nih.gov/).

Using the cumulative GC profile to display the GC content distribution.
The Z-curve is a three-dimensional space curve constituting the unique representation of a given DNA sequence in the sense that each can be uniquely reconstructed given the other (23, 25). Based on the Z-curve, any DNA sequence can be uniquely described by three independent distributions, i.e., those of the bases of purine/pyrimidine (xn), amino/keto (yn), and weak/strong hydrogen bonds (zn), respectively. In particular, zn displays the distribution of bases of GC/AT types along the sequence, which is calculated as follows (23, 25)

(1)
where An, Cn, Gn, and Tn are the cumulative numbers of the bases A, C, G and T, respectively, occurring in the subsequence from the first base to the n-th base in the DNA sequence inspected.

Based on zn, GC content can be calculated using a windowless technique (22). Usually, for an AT-rich genome, zn is approximately a monotonously increasing linear function of n, whereas for a GC-rich genome, zn is approximately a monotonously decreasing linear function of n. To amplify the deviations, the curve of zn ~ n is fitted by a straight line using the least square technique

(2)
where (z, n) is the coordinate of a point on the straight line fitted, and k is its slope. Instead of using the curve of zn ~ n, we will use the z' curve, or cumulative GC profile, hereafter, where

(3)
Therefore, the deviations of zn ~ n curve from the straight line, which corresponds to a constant GC content (see Eq. 4, below), are protruded by the z' curve. A program to draw the z' curve online is accessible from http://tubic.tju.edu.cn/zcurve. The z' curve and the cumulative GC profile are used interchangeably in this paper.

Let denote the average GC content within a region {Delta}n in a sequence, then we find from Eqs. 1–3

(4)
where k' = {Delta}z'n/{Delta}n is the average slope of the z' curve within the region {Delta}n. Both quantities of {Delta}z'n and {Delta}n can be calculated by using the z' curve. The region {Delta}n is usually chosen to be a fragment of a natural DNA sequence, e.g., a genomic island. The above method is called the windowless technique for the GC content computation (22).


    RESULTS AND DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 REFERENCES
 
Three genomic islands in the genome of B. cereus.
Some basic characteristics of the cumulative GC profile (z' curve) are: 1) an up jump (a drop) in the z' curve indicates a decrease (increase) of GC content; and 2) any sharp maximum (minimum) point in the z' curve indicates a turning point, where the GC content undergoes an abrupt change from a relatively GC-poor (GC-rich) region to a relatively GC-rich (GC-poor) region.

The horizontal transferred elements, such as genomic islands, are usually absent in the genomes of close relatives of the host genome. By comparing the cumulative GC profiles of B. cereus and B. anthracis, it is obvious that most parts of the genomes overlap. However, there are three regions in the genome of B. cereus that have a sharp change in GC content, reflected by the fact that the z' curves associated with these regions have sharp jumps. In addition, these three regions are absent in the genome of B. anthracis, suggesting a possibility that these three regions are genomic islands, which are designated the names BCGI-1, BCGI-2, and BCGI-3, respectively (Fig. 1).



View larger version (22K):
[in this window]
[in a new window]
 
Fig. 1. The cumulative GC profiles for Bacillus cereus, Bacillus anthracis, and genomic islands. The plot shows that the cumulative GC profiles for B. cereus (green) and B. anthracis (blue) largely overlap. However, in the plot for B. cereus, there are three regions that show abrupt change in GC content (red), which are absent in the plot for B. anthracis. If the corresponding sequences for these three regions are removed, then the cumulative GC profiles for the two bacteria can almost perfectly overlap (blue and pink). This fact suggests that the three regions of unusual GC content are horizontally transferred. The positions corresponding to the integration sites are marked by arrows in the cumulative GC profile for the B. anthracis genome. For a detailed analysis, refer to the text.

 
BCGI-1, a 15.9-kb genomic island, has a GC content of 0.30, much lower than 0.35, the GC content of the surrounding regions. Although the length of BCGI-1 is relatively short, it has many conserved features of genomic islands. The tRNA genes have been frequently found to be the integration sites of genomic islands. Indeed, BCGI-1 utilizes a Val-tRNA gene (BC1273) as the integration site. In addition, it is also frequently found that a gene coding for an integration protein is close to the site of integration. At the 3' end of BCGI-1 is located a gene coding for a DNA integration protein (BC1272).

BCGI-2, a 62.2-kb genomic island, has a GC content of 0.38, much higher than 0.34, the GC content of the surrounding regions. At the 3' end, there is also a gene coding for site-specific recombinase (BC1921). There are totally 77 genes in this genomic island. Among these genes, 52 code for phage proteins (67.5%). There are totally 81 phage proteins in the genome. This high percentage of phage proteins also indicates that a phage-related recombination event is involved in this genomic island.

BCGI-3, a 50.3-kb genomic island, has a GC content of 0.30, much lower than 0.36, the GC content of the surrounding regions. Among the 54 genes in this genomic island, 6 are transposase genes. BCG-3 contains an open-reading frame (ORF) (BC5092) coding for a bleomycin resistance protein, suggesting that this genomic island may play a role in its antibiotic resistance.

BCGI-3 contains a cluster of genes for a ferric anguibactin transport system. Four genes related to ferric anguibactin were found, which are ferric anguibactin transport ATP-binding protein (BC5103), ferric anguibactin transport system permease protein fatC (BC5104), ferric anguibactin transport system permease protein fatD (BC5105), and ferric anguibactin-binding protein (BC5106).

In the vertebrate host, iron is not freely available, and it is mostly found in red cells. In addition, iron in the vertebrate host is bound by the host protein transferring in blood and lactoferrin in secretions. Consequently, bacteria need to overcome the iron limitation to survive in the host and establish an infection (1). B. cereus is an opportunistic pathogen that causes food poisoning. Therefore, B. cereus should also have its own mechanism to transport the iron across the cytoplasmic membrane.

The system that transports the ferric anguibactin complex usually has an outer membrane receptor FatA, which binds the ferric anguibactin and shuttles it to the periplasm (1, 21). Among this cluster of genes, FatA gene is absent; however, indeed, there is a gene coding for ferric anguibactin-binding protein (BC5106). Although we did not detect high homology of this protein with FatA, there is still a possibility that this protein may function in the place of FatA. The ferric anguibactin transport system permease protein FatC and FatD are inner membrane proteins that catalyze the transport of ferric anguibactin from the periplasm to the cytosol where the ferric ion is released. The ferric anguibactin transport ATP-binding protein may be involved in the energy supply in this process.

The ferric anguibactin transport system in BCGI-3 is the only ferric anguibactin transport system in the genome of B. cereus. No other genes, including ferric anguibactin transport ATP-binding protein and ferric anguibactin transport system permease proteins fatA, fatB, fatC, and fatD, were found in the genome. Therefore, the ferric anguibactin transport system in BCGI-3 is very likely to be involved in the iron transport for B. cereus that enables the bacterium to overcome the iron limitation in the host.

BCGI-3 also contains a cluster of genes related to lantibiotics. Lantibiotics are a class of bactericidal peptides that are produced by and mainly act against gram-positive bacteria. Lantibiotic peptides are characterized by the presence of thioether bridges termed lanthionines, and hence the name lantibiotics (lanthionines-containing antibiotics). The thioether bridges are generated by dehydration of serine and threonine followed by addition of cysteine residues. In recent years, the interest in these lantibiotics has continuously increased, mainly because of their potential to serve as natural food preservatives that might replace harmful chemical agents (5, 20).

The ORFs BC5083 and BC5084 encode a lantibiotic biosynthesis protein and lanthionine biosynthesis protein, respectively. We then searched the deduced protein sequences of these two ORFs against the Conserved Domain Database (http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi). Indeed, the ORF BC5083 has a domain of the COOH terminus of lantibiotic dehydratase, whereas the ORF BC5084 has the domains of both the NH2 terminus and COOH terminus of lantibiotic dehydratase. In addition, the ORF BC5086 encodes a putative lantibiotic biosynthesis protein, although no conserved domain was found. Furthermore, from the ORF BC5087 to BC5090, there are four consecutive ORFs encoding putative lantibiotic precursor peptides.

The presence of these lantibiotics in the genome of B. cereus poses many questions. A natural question is: what mechanisms does B. cereus use to protect itself form the toxicity of these bactericidal peptides? Generally, proteins conferring immunity to the producer strains antagonize specifically the lantibiotics (5). For B. cereus, now it is not clear which proteins have the above functions. Another question is: what advantages does B. cereus have by possessing these lantibiotics over other bacteria during evolution? All these questions remain to be answered.

We have also found that genes and gene orders are highly conserved between the regions around genomic island integration sites of B. cereus and the corresponding regions in the genome of B. anthracis. At the 5' junction of BCGI-1, for instance, the ORFs of B. cereus, BC1254, BC1255, BC1256, and BC1257, are homologs of the ORFs of B. anthracis, BA1272, BA1273, BA1274, and BA1275, respectively. At the 3' junction of BCGI-1, the ORFs BC1274, BC1275, BC1276, and BC1277 are homologs of the ORFs BA1281, BA1282, BA1284, and BA1286, respectively (Fig. 2A). The ORF BA1283 encodes a short polypeptide (34 residues) that does not have a homolog in public databases, based on the BLAST search. In addition, ZCURVE, a new system for protein-coding gene prediction, which has been shown to have low false-positive predication rate (6), does not predict this segment as a protein-coding gene. Therefore, it is likely that the annotation of BA1283 is due to the false-positive prediction. In the GenBank file for B. anthracis, there is no record of the ORF BA1285. Therefore, the ORFs BA1283 and BA1285 are skipped. It is interesting to point out that the segment of DNA sequence (from ORF BA1276 to BA1280) that is between the conserved regions of the B. anthracis genome is absent in the genome of B. cereus. Therefore, it is likely that the integration of BCGI-1 causes a deletion of a segment of DNA sequence. Similar gene-loss process applies to BCGI-3, in which the segment, ORFs BA5324–BA5331, is deleted in the genome of B. cereus. This segment is between the conserved regions, i.e., at the 5' end, BC5069 and BC5070 are homologs of BA5321 and BA5322; at the 3' end, BC5128 and BC5129 are homologs of BA5332 and BA5334, respectively. The process containing both gene gain and gene loss apparently has a more profound impact on the genome evolution than the process of gene gain only.



View larger version (19K):
[in this window]
[in a new window]
 
Fig. 2. Schematic diagram showing the conservation between the regions around genomic island integration sites of B. cereus and the corresponding regions in the genome of B. anthracis, for BCGI-1 (A) and BCGI-2 (B). The same colors indicate that two ORFs are homologous to each other. The color yellow denotes genomic islands. BCGI-1 has many conserved features of genomic islands, such as using a tRNA gene as the integration site and having a DNA integration protein gene at the junction. Note that in BCGI-1, the integration causes a deletion of a segment (BA1276–BA1280). In the GenBank file, there is no record of BA1285. The annotation of BA1283 is likely to be due to the false-positive prediction; therefore, BA1283 is not shown. For details, refer to the text. The ID for the tRNA gene of B. cereus is BC1273, whereas the tRNA gene of B. anthracis (denoted by blue square) does not have an ID in the GenBank file. Figure is not drawn to scale.

 
Likewise, the regions around BCGI-2 are also highly conserved between the two genomes. At the 5' junction of BCGI-2, the ORFs of B. cereus, BC1841, BC1842, BC1843, and BC1844, are homologs of the ORFs of B. anthracis, BA1916, BA1917, BA1918, and BA1919, respectively. At the 3' junction, the ORFs BC1922, BC1923, BC1924, and BC1925 are homologs of the ORFs BA1921, BA1922, BA1923, and BA1924, respectively (Fig. 2B). However, there is almost no gene loss for the integration of BCGI-2.

Comparison between the GC content distributions obtained based on windowless and window method.
As a routine procedure in analyzing genome sequencing results, the distribution of GC content is displayed by the GC content within the windows that move along genomes. Although this method is intuitive, i.e., it directly shows the GC content in each particular window, a drawback is that it only displays the local GC content along genomes. On the contrary, the GC content computed without windows is a cumulative GC content; therefore, it displays a global distribution of GC content. For instance, the cumulative GC profile shown in Fig. 1 clearly shows that the genome can be roughly divided into three domains, i.e., from 1.8 to 3.5 Mb is a GC-low region; from 3.5 to 0.8 Mb is a GC-rich region; and from 0.8 to 1.8 Mb has a GC content in between. This is consistent with the result reported by the authors of the published sequence (10). By using the windowless method, it is easily detected (compare with Fig. 3, which is based on the window method).



View larger version (36K):
[in this window]
[in a new window]
 
Fig. 3. The GC content distribution computed based on 20-kb windows sliding along the genomes of Bacillus cereus and Bacillus anthracis. Note that, because of the low resolution, the change in GC content and the precise position of the change cannot be detected. Refer to Fig. 1 for a comparison.

 
Another drawback of the window method is that the resolution is low. The size of window is hard to adjust, i.e., large window size leads to low resolution, whereas small window size leads to large statistical fluctuations. On the contrary, the resolution of the windowless method is high, e.g., in an extreme case, the GC content can be computed at a point (one single base), which does not have definition at all based on the window method. Therefore, by using the cumulative GC profile, the precise boundaries of the regions that have a change in GC content can be determined (Fig. 1); such boundaries, however, are hard to determine based on the window method (Fig. 3). In addition, the plots based on the window method are different when the window size is changed, but those based on the windowless method are unique. Furthermore, due to the special subtraction procedure, i.e., Eq. 3, which amplifies the variation of GC content, the cumulative GC profile has high sensitivity in detecting the changes in GC content, which is useful when the difference between the GC content of horizontally transferred elements and that of the host genome is small.

In summary, by using the cumulative GC profile to display the distribution of genomic GC content of B. cereus, based on comparison with that of B. anthracis, we have found three genomic islands in the genome of B. cereus, BCGI-1, BCGI-2 and BCGI-3, respectively. All the genomic islands have abrupt changes in GC content compared with that of surrounding regions. BCGI-1 has a typical structure of genomic islands, i.e., a Val-tRNA gene is utilized as the integration site, and a site-specific recombinase gene is located at the 3' end. BCGI-2 has a large percentage of phage protein, suggesting a phage-related recombination is involved. BCGI-3 contains a ferric anguibactin transport system, which is very likely to be involved in the iron transport that enables the bacterium to overcome the iron limitation in the host. In addition, BCGI-3 also contains a cluster of genes related to lantibiotics, which may play a role during the evolution of the genome. Furthermore, the integrations of the genomic islands, BCGI-1 and BCGI-3, result in deletions of DNA sequence fragments; therefore, such integrations lead to both gene gain and gene loss simultaneously.


    ACKNOWLEDGMENTS
 
The present study was supported in part by the 973 Project Grant G1999075606 of China.


    FOOTNOTES
 
1 This article was submitted for review in response to a Call for Papers on "Comparative Genomics." Back

Article published online before print. See web site for date of publication (http://physiolgenomics.physiology.org).

Address for reprint requests and other correspondence: C.-T. Zhang, Dept. of Physics, Tianjin Univ., Tianjin 300072, China (E-mail: ctzhang{at}tju.edu.cn).

10.1152/physiolgenomics.00170.2003.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 REFERENCES
 

  1. Crosa JH. Signal transduction and transcriptional and posttranscriptional control of iron-regulated genes in bacteria. Microbiol Mol Biol Rev 61: 319–336, 1997.[Abstract]
  2. Dixon TC, Meselson M, Guillemin J, and Hanna PC. Anthrax. N Engl J Med 341: 815–826, 1999.[Free Full Text]
  3. Gogarten JP, Doolittle WF, and Lawrence JG. Prokaryotic evolution in light of gene transfer. Mol Biol Evol 19: 2226–2238, 2002.[Abstract/Free Full Text]
  4. Groisman EA and Ochman H. Pathogenicity islands: bacterial evolution in quantum leaps. Cell 87: 791–794, 1996.[ISI][Medline]
  5. Guder A, Wiedemann I, and Sahl HG. Posttranslationally modified bacteriocins: the lantibiotics. Biopolymers 55: 62–73, 2000.[CrossRef][ISI][Medline]
  6. Guo FB, Ou HY, and Zhang CT. ZCURVE: a new system for recognizing protein-coding genes in bacterial and archaeal genomes. Nucleic Acids Res 31: 1780–1789, 2003.[Abstract/Free Full Text]
  7. Hacker J and Kaper JB. Pathogenicity islands and the evolution of microbes. Annu Rev Microbiol 54: 641–679, 2000.[CrossRef][ISI][Medline]
  8. Helgason E, Okstad OA, Caugant DA, Johansen HA, Fouet A, Mock M, Hegna I, and Kolsto. Bacillus anthracis, Bacillus cereus, and Bacillus thuringiensis: one species on the basis of genetic evidence. Appl Environ Microbiol 66: 2627–2630, 2000.[Abstract/Free Full Text]
  9. Hentschel U and Hacker J. Pathogenicity islands: the tip of the iceberg. Microbes Infect 3: 545–548, 2001.[CrossRef][ISI][Medline]
  10. Ivanova N, Sorokin A, Anderson I, Galleron N, Candelon B, Kapatral V, Bhattacharyya A, Reznik G, Mikhailova N, Lapidus A, Chu L, Mazur M, Goltsman E, Larsen N, D’Souza M, Walunas T, Grechkin Y, Pusch G, Haselkorn R, Fonstein M, Ehrlich SD, Overbeek R, and Kyrpides N. Genome sequence of Bacillus cereus and comparative analysis with Bacillus anthracis. Nature 423: 87–91, 2003.[CrossRef][ISI][Medline]
  11. Kotiranta A, Lounatmaa K, and Haapasalo M. Epidemiology and pathogenesis of Bacillus cereus infections. Microbes Infect 2: 189–198, 2000.[CrossRef][ISI][Medline]
  12. Lan R and Reeves PR. Gene transfer is a major factor in bacterial evolution. Mol Biol Evol 13: 47–55, 1996.[Abstract]
  13. Lawrence JG. Gene transfer, speciation, and the evolution of bacterial genomes. Curr Opin Microbiol 2: 519–523, 1999.[CrossRef][ISI][Medline]
  14. Ochman H. Lateral and oblique gene transfer. Curr Opin Genet Dev 11: 616–619, 2001.[CrossRef][ISI][Medline]
  15. Ochman H, Lawrence JG, and Groisman EA. Lateral gene transfer and the nature of bacterial innovation. Nature 405: 299–304, 2000.[CrossRef][ISI][Medline]
  16. Okinaka R, Cloud K, Hampton O, Hoffmaster A, Hill K, Keim P, Koehler T, Lamke G, Kumano S, Manter D, Martinez Y, Ricke D, Svensson R, and Jackson P. Sequence, assembly and analysis of pX01 and pX02. J Appl Microbiol 87: 261–262, 1999.[CrossRef][ISI][Medline]
  17. Okinaka RT, Cloud K, Hampton O, Hoffmaster AR, Hill KK, Keim P, Koehler TM, Lamke G, Kumano S, Mahillon J, Manter D, Martinez Y, Ricke D, Svensson R, and Jackson PJ. Sequence and organization of pXO1, the large Bacillus anthracis plasmid harboring the anthrax toxin genes. J Bacteriol 181: 6509–6515, 1999.[Abstract/Free Full Text]
  18. Parkhill J and Berry C. Genomics: relative pathogenic values. Nature 423: 23–25, 2003.[ISI][Medline]
  19. Read TD, Peterson SN, Tourasse N, Baillie LW, Paulsen IT, Nelson KE, Tettelin H, Fouts DE, Eisen JA, Gill SR, Holtzapple EK, Okstad OA, Helgason E, Rilstone J, Wu M, Kolonay JF, Beanan MJ, Dodson RJ, Brinkac LM, Gwinn M, DeBoy RT, Madpu R, Daugherty SC, Durkin AS, Haft DH, Nelson WC, Peterson JD, Pop M, Khouri HM, Radune D, Benton JL, Mahamoud Y, Jiang L, Hance IR, Weidman JF, Berry KJ, Plaut RD, Wolf AM, Watkins KL, Nierman WC, Hazen A, Cline R, Redmond C, Thwaite JE, White O, Salzberg SL, Thomason B, Friedlander AM, Koehler TM, Hanna PC, Kolsto AB, and Fraser CM. The genome sequence of Bacillus anthracis Ames and comparison to closely related bacteria. Nature 423: 81–86, 2003.[CrossRef][ISI][Medline]
  20. Sahl HG and Bierbaum G. Lantibiotics: biosynthesis and biological activities of uniquely modified peptides from gram-positive bacteria. Annu Rev Microbiol 52: 41–79, 1998.[CrossRef][ISI][Medline]
  21. Stork M, Di Lorenzo M, Welch TJ, Crosa LM, and Crosa JH. Plasmid-mediated iron uptake and virulence in Vibrio anguillarum. Plasmid 48: 222–228, 2002.[CrossRef][ISI][Medline]
  22. Zhang CT, Wang J, and Zhang R. A novel method to calculate the G+C content of genomic DNA sequences. J Biomol Struct Dyn 19: 333–341, 2001.[ISI][Medline]
  23. Zhang CT and Zhang R. Analysis of distribution of bases in the coding sequences by a diagrammatic technique. Nucleic Acids Res 19: 6313–6317, 1991.[Abstract]
  24. Zhang R and Zhang CT. A systematic method to identify genomic islands and its applications in analyzing the genomes of Corynebacterium glutamicum and Vibrio vulnificus CMCP6 chromosome I. Bioinformatics In press.
  25. Zhang R and Zhang CT. Z curves, an intuitive tool for visualizing and analyzing the DNA sequences. J Biomol Struct Dyn 11: 767–782, 1994.[ISI][Medline]