The sequence of the Helicoverpa armigera single nucleocapsid nucleopolyhedrovirus genome

Xinwen Chen1,2, Wilfred F. J. IJkel2, Renato Tarchini3, Xiulian Sun1, Hans Sandbrink3, Hualin Wang1, Sander Peters3, Douwe Zuidema2, René Klein Lankhorst3, Just M. Vlak2 and Zhihong Hu1

Joint-Laboratory of Invertebrate Virology, Institute of Virology, Chinese Academy of Sciences, Wuhan, Hubei 430071, People’s Republic of China1
Laboratory of Virology, Wageningen University, Binnenhaven 11, 6709 PD Wageningen, The Netherlands2
Greenomics, Plant Research International, PO Box 16, 6700 AA Wageningen, The Netherlands3

Author for correspondence: Zhihong Hu. Fax +86 27 87641072. e-mail huzh{at}pentium.whiov.ac.cn


   Abstract
Top
Abstract
Introduction
Methods
Results and Discussion
References
 
The nucleotide sequence of the Helicoverpa armigera single-nucleocapsid nucleopolyhedrovirus (HaSNPV) DNA genome was determined and analysed. The circular genome encompasses 131403 bp, has a G+C content of 39·1 mol% and contains five homologous regions with a unique pattern of repeats. Computer-assisted analysis revealed 135 putative ORFs of 150 nt or larger; 100 ORFs have homologues in Autographa californica multicapsid NPV (AcMNPV) and a further 15 ORFs have homologues in other baculoviruses such as Lymantria dispar MNPV (LdMNPV), Spodoptera exigua MNPV (SeMNPV) and Xestia c-nigrum granulovirus (XcGV). Twenty ORFs are unique to HaSNPV without homologues in GenBank. Among the six previously sequenced baculoviruses, AcMNPV, Bombyx mori NPV (BmNPV), Orgyia pseudotsugata MNPV (OpMNPV), SeMNPV, LdMNPV and XcGV, 65 ORFs are conserved and hence are considered as core baculovirus genes. The mean overall amino acid identity of HaSNPV ORFs was the highest with SeMNPV and LdMNPV homologues. Other than three ‘baculovirus repeat ORFs’ (bro) and two ‘inhibitor of apoptosis’ (iap) genes, no duplicated ORFs were found. A putative ORF showing similarity to poly(ADP-ribose) glycohydrolases (parg) was newly identified. The HaSNPV genome lacks a homologue of the major budded virus (BV) glycoprotein gene, gp64, of AcMNPV, BmNPV and OpMNPV. Instead, a homologue of SeMNPV ORF8, encoding the major BV envelope protein, has been identified. GeneParityPlot analysis suggests that HaSNPV, SeMNPV and LdMNPV (group II) have structural genomic features in common and are distinct from the group I NPVs and from the granuloviruses. Cluster alignment between group I and group II baculoviruses suggests that they have a common ancestor.


   Introduction
Top
Abstract
Introduction
Methods
Results and Discussion
References
 
Members of the family Baculoviridae are rod-shaped viruses with circular, covalently closed, double-stranded DNA genomes ranging from 100 to 180 kb. The virions are occluded into large proteinaceous capsules or occlusion bodies. Two genera, nucleopolyhedrovirus (NPV) and granulovirus (GV), have been recognized. Each genus is distinguished by a particular occlusion body morphology with single (GV) and multiple (NPV) virions occluded in granules and polyhedra, respectively. The NPVs are designated as single (S) or multiple (M) depending on the potential number of nucleocapsids packaged in a virion, but this appears to have no taxonomic value (Murphy et al., 1995 ).

Baculoviruses are frequently used as bio-insecticides of phytophagous insects, mainly belonging to the orders Lepidoptera, Hymenoptera and Diptera (Moscardi, 1999 ; Federici, 1999 ). The SNPV of the bollworm Helicoverpa armigera (HaSNPV) has been extensively used to control this insect in cotton and vegetable crops in China (Zhang, 1994 ). In 1999, about 100000 hectares of cotton had been treated with a commercial virus preparation based on HaSNPV. Recently, recombinant HaSNPVs with improved insecticidal properties have been engineered (Chen et al., 2000b ) and field tested (S. Sun, X. Chen, Z. Zhang, H. Wang, F. J. J. A. Bianchi, H. Peng, J. M. Vlak & Z. H. Hu, unpublished). However, the genetics of HaSNPV have only been partly described.

The nucleotide sequences of five MNPVs, Autographa californica (Ac) MNPV (Ayres et al., 1994 ), Bombyx mori (Bm) NPV (Gomi et al., 1999 ), Orgyia pseudotsugata (Op) MNPV (Ahrens et al., 1997 ), Lymantria dispar (Ld) MNPV (Kuzio et al., 1999 ) and Spodoptera exigua (Se) MNPV (IJkel et al., 1999 ), and one granulovirus, Xestia c-nigrum (Xc) GV (Hayakawa et al., 1999 ), have been determined. The size of these genomes ranges from 128413 bp for BmNPV to 178733 bp for XcGV. This size difference is predominantly due to the presence of gene duplications including the so-called ‘baculovirus repeat ORF’ or bro genes (Gomi et al., 1999 ). However, no SNPV genome has been sequenced to date and it is therefore of interest to see whether the sequence of HaSNPV would reveal some unique features contributing to, among others, the SNPV phenotype and to the specificity of this virus for heliothine insects.

A physical map of HaSNPV has been previously constructed and the size was estimated to be about 130 kb (Chen et al., 2000a ). Analysis of approximately 45 kb of random sequence from the HaSNPV genome resulted in the identification of 53 ORFs with homologies to ORFs of other baculoviruses. Partial alignment of the HaSNPV genome with other baculovirus genomes using GeneParityPlot (Hu et al., 1998 ) revealed a close relationship of HaSNPV and SeMNPV in terms of genomic organization (Chen et al., 2000a ). A few genes, notably polyhedrin (Chen et al., 1997b ), ecdysteroid UDP-glucosyltransferase (egt) (Chen et al., 1997a ), DNA polymerase (Bulach et al., 1999 ) and ‘late expression factor 2' (lef-2) (Chen et al., 1999 ), have been characterized in some detail. Phylogenetic analysis of these genes also revealed a close ancestral relationship between HaSNPV, SeMNPV and LdMNPV at the gene level.

In this paper we describe the complete nucleotide sequence and organization of the HaSNPV genome. This baculovirus is characterized by the absence of extensive gene duplications and by the presence of a limited number of homologous repeat (hr) regions, the structure of which is distinctly different from the hr sequences of other baculoviruses. Finally, a genomic comparison is made with the complete sequences of AcMNPV, SeMNPV, LdMNPV and XcGV using GeneParityPlot (Hu et al., 1998 ).


   Methods
Top
Abstract
Introduction
Methods
Results and Discussion
References
 
{blacksquare} Insect and virus.
The bollworm H. armigera was cultured as a laboratory colony and reared on artificial diet as described by Zhang et al. (1981) . The wild-type virus was originally isolated from diseased H. armigera larvae in the Hubei province of the People’s Republic of China in 1981. By in vivo cloning, eight HaSNPV genotypes were isolated (G1–G8) (Sun et al., 1998 ), of which the G4 strain was selected for sequencing. Polyhedra of the G4 strain were propagated in fourth instar H. armigera larvae.

{blacksquare} HaSNPV DNA isolation, cloning and sequence determination.
The HaSNPV G4 strain (Sun et al., 1998 ) was sequenced to a sixfold genomic coverage using a shotgun approach. The viral DNA was caesium chloride-purified (King & Possee, 1992 ) and sheared by nebulization into fragments with an average size of 1200 bp. Blunt repair of the ends was performed with Pfu DNA polymerase (Stratagene), according to the manufacturer’s directions. DNA fragments were size-fractionated by gel electrophoresis and cloned into the EcoRV site of pBluescriptSK (Stratagene). After transformation into E. coli XL2-Blue competent cells (Stratagene), 1000 recombinant colonies were picked randomly. DNA templates for sequencing were isolated using QIAprep Turbo kits (Qiagen) on a QIAGEN BioRobot 9600. Sequencing was performed using the ABI PRISM Big Dye Terminator Cycle Sequencing Ready reaction kit with FS AmpliTaq DNA polymerase (Perkin Elmer) and analysed on an ABI 3700 DNA Analyser.

Shotgun sequences were base-called by the PHRED basecaller and assembled with the PHRAP assembler (Ewing & Green, 1998 ; Ewing et al., 1998 ). Using the PREGAP4 interface, PHRAP-assembled data were stored in the GAP4 assembly database (Bonfield et al., 1995 ). The GAP4 interface and its features were then used for editing and sequence finishing. Consensus calculations with a quality cut-off value of 40 were performed from within GAP4 using a probabilistic consensus algorithm based on expected error rates output by PHRED. Sequencing the PCR products bridging the ends of existing contiguous fragments filled the remaining gaps in the sequence.

{blacksquare} DNA sequence analysis.
Genomic DNA composition, structure, repeats and restriction enzyme pattern were analysed with the University of Wisconsin Genetics Computer Group programs (Devereux et al., 1984 ) and DNASTAR (Lasergene). ORFs encoding more than 50 amino acids (150 bp) were considered to be protein-encoding and hence designated putative genes. The maximal alignment of 115 ORFs (out of 135) was checked with known baculovirus gene homologues extracted from GenBank; ORFs with an overlap of hr region were excluded from the alignment analysis. The overlap between any two ORFs with known baculovirus homologues was set to a maximum of 25 amino acids; otherwise the largest ORF was selected.

DNA and protein comparisons with entries in the sequence databases were performed with FASTA and BLAST programs (Pearson, 1990 ; Altschul et al., 1990 ). Multiple sequence alignments were performed with the GCG PileUp and Gap computer programs version 10.0 (Genetics Computer Group, Madison, WI, USA) with gap creation and extension penalties set to 9 and 2, respectively (Devereux et al., 1984 ). Percentage identity indicates the percentage of identical residues between two complete sequences. The GENESCAN program was used for gene predictions (http://ccr-081.mit.edu/GENESCAN.html). The DOTTER program (http://www.cgr.ki.se/cgr/groups/sonnhammer/Dotter.html) was used to identify and classify repeat families and miniature inverted repeat transposable elements (MITEs). GeneParityPlot analysis was performed on the HaSNPV genome versus the genomes of AcMNPV, SeMNPV, LdMNPV and XcGV as described previously (Hu et al., 1998 ).


   Results and Discussion
Top
Abstract
Introduction
Methods
Results and Discussion
References
 
Nucleotide sequence analysis of the HaSNPV genome
The HaSNPV genome was assembled into a contiguous sequence of 131403 bp (Table 1). This size is in good agreement with a previous estimate of 130·1 kb for HaSNPV DNA based on restriction enzyme analysis and physical mapping (Chen et al., 2000a ). AcMNPV, BmNPV, OpMNPV and SeMNPV have similar size genomes, which are much smaller than the genomes of LdMNPV and XcGV with 161 kb and 178 kb, respectively (Table 1). With a G+C content of 39·1 mol%, HaSNPV has the lowest G+C content among baculoviruses to date, which is close to that of AcMNPV (41 mol%) (Ayres et al., 1994 ), BmNPV (Gomi et al., 1999 ) and XcGV (Hayakawa et al., 1999 ). The G+C contents of OpMNPV (Ahrens et al., 1997 ) and LdMNPV (Kuzio et al., 1999 ) are much higher with 55 and 58 mol%, respectively. According to a recently adopted convention (IJkel et al., 1999 ; Hayakawa et al., 1999 ), the adenine residue at the translational initiation codon of the polyhedrin gene was designated as the zero point of the physical map of HaSNPV DNA (Fig. 1). Taking polyhedrin as the first gene determines the orientation of the physical map. This map is now reversed as compared with the original map presented by Chen et al. (2000a ) and positions the p10 gene at map unit 10.


View this table:
[in this window]
[in a new window]
 
Table 1. Characteristics of baculovirus genomes

 


View larger version (44K):
[in this window]
[in a new window]
 
Fig. 1. Circular map and genomic organization of the HaSNPV DNA genome. The sites for restriction enzyme HindIII are presented; the fragments are indicated A to M according to size from the largest to the smallest restriction fragment (Chen et al., 2000a ). The positions of the 135 identified ORFs are indicated with arrows that also represent the direction of transcription. Shaded arrows indicate that the ORF has a homologue in other baculoviruses in the protein sequence databases. Open arrows represent ORFs unique to HaSNPV. The corresponding number along the ORF represents the HaSNPV ORF number. The positions of the hr sequences are indicated by black boxes. The scale on the inner circle is in map units.

 
Using computer-assisted analysis, 326 ORFs defined as methionine-initiated ORFs larger than 50 amino acids were found. From these, 135 ORFs with fewer than 25 amino acids or no overlap with other ORFs have been identified on the HaSNPV genome (Fig. 1; Tables 1 and 2) and were further analysed. This number of 135 ORFs is roughly proportional to the size of the HaSNPV genome as compared with the other six completely sequenced baculovirus genomes AcMNPV, BmNPV, OpMNPV, SeMNPV, LdMNPV and XcGV. The HaSNPV ORFs are in general tightly packed with minimal intergenic distances; their orientation is almost evenly distributed along the genome (52% clockwise, 48% anticlockwise; Fig. 1). The locations, orientations and sizes of the predicted ORFs are shown in detail in Table 2. The 135 predicted ORFs account for 87% of the genome versus 8% for intergenic sequences and 6% for the hr region. The HaSNPV ORFs have an average length of 844 nt with Ha84 (helicase) being the largest (3758 nt) and Ha40, without a homologue in other baculoviruses, being the smallest (150 nt). Of the 135 HaSNPV ORFs, 115 (86%) have an assigned function or have homologues with other putative baculovirus genes (Table 2). So far it appears that 20 ORFs are unique to HaSNPV. These ORFs accounted for 6% (7·3 kb) of the genome in total.


View this table:
[in this window]
[in a new window]
 
Table 2. Listing of potentially expressed ORFs in HaSNPV

 

View this table:
[in this window]
[in a new window]
 
Table 3. Number of ORFs with homologues in baculoviruses and percentage amino acid identity

 
The HaSNPV nucleotide sequence was determined from an isolate cloned in vivo (Sun et al., 1998 ). Based on restriction enzyme analysis and Southern hybridization, no fragments in a less than molar ratio were observed in this isolate. However, sequence analysis showed that at approximately 100 nucleotide locations (0·07% of the genome) along the genome a polymorphism was observed in the nucleotide usage. None of these affected the ORFs. This polymorphism may be partly the result of the sequencing (error 10-5), but also the consequence of the intrinsic genetic variation that exists either in natural HaSNPV isolates (Gettig & McCarthy, 1982 ; Figueiredo et al., 1999 ) or in in vivo cloned isolates of HaSNPV (Sun et al., 1998 ), GV (Smith & Crook, 1988 ) and MNPVs (Muñoz et al., 1998 ). Despite the in vivo cloning and the apparent lack of genetic heterogeneity as evidenced from restriction enzyme analysis (Sun et al., 1998 ), microvariation may thus exist. This suggests that the quasispecies concept for RNA viruses, i.e. a virus species is defined not as a single nucleotide sequence but as a mixture of genotypes (Domingo et al., 1995 ), may also apply to DNA viruses including baculoviruses.

Homologous repeat (hr) regions
Regions with homologous repeats were first found in AcMNPV (Cochran & Faulkner, 1983 ) and appear to be present in all baculoviruses. They occur at multiple locations along the genome and may serve as origins of DNA replication (Kool et al., 1995 ) and as enhancers of transcription (Guarino & Summers, 1986 ; Guarino et al., 1986 ). Hr regions are characterized by the presence of multiple, often imperfect, tandemly repeated palindromic sequences (AcMNPV). Five hr regions were previously identified on the genome of HaSNPV by direct sequencing and Southern blot hybridization (Chen et al., 2000a ). No further hr regions were detected in the complete sequence (Fig. 1; Table 1). These five hr regions were found dispersed along the HaSNPV genome around map positions 17.5 (hr1), 37.7 (hr2), 40.2 (hr3), 70.8 (hr4) and 83.6 (hr5) and are located in AT-rich intergenic regions. Their sizes vary from 750 (hr3) to 2800 nt (hr5). It is interesting to note that hr2 and hr3 are separated by two bro-related genes (Fig. 1). This configuration might have been the result of an insertion of two bro genes into what originally may have been a single hr. Assuming that hr2 and hr3 have been a single hr, the hr regions of HaSNPV are remarkably similar is size (2100–2800 nt).

Using a dot matrix analysis, the HaSNPV sequence was compared to itself and its complementary strand. Two types of repeats were identified, type A and type B, with imperfect 40 and 107 bp long repeats, respectively, or truncated versions thereof (Fig. 2). The type A and type B repeats are found in each of the hr regions. There is no sequence homology with other known baculovirus hr regions. The type B repeats contain short internal stretches of palindromic and direct repeats. Not only is the sequence of the HaSNPV hr regions different from those of other baculovirus hr regions, but their structure is also rather unique. The function of the type A and type B repeats remains to be determined.



View larger version (112K):
[in this window]
[in a new window]
 
Fig. 2. Alignment of HaSNPV repeated sequences. The nucleotide sequences of the type A repeats (A) and type B repeats (B) are aligned to obtain maximum similarity. The repeats are denoted according to their presence in a homologous region (hr1–hr5), their type (a or b), their order number in the hr and whether they occur in the reverse orientation (r) or not. Shading is used to indicate the relevant occurrence of similar nucleotides in the repeats: black indicates >59%, grey 53% and white <47% representation. Short palindromes (by arrows), direct repeats (lines above) and XbaI sites (boxed) are indicated.

 
Comparison of the gene content of HaSNPV and other baculoviruses
The sequence of the HaSNPV genome was compared with those of AcMNPV (Ayres et al., 1994 ), BmNPV (Gomi et al., 1999 ), OpMNPV (Ahrens et al., 1997 ), SeMNPV (IJkel et al., 1999 ), LdMNPV (Kuzio et al., 1999 ) and XcGV (Hayakawa et al., 1999 ) for the presence or absence of putative ORFs (Table 2). These seven baculovirus genomes have a cumulative total of 354 different ORFs, of which 183 are unique to individual baculovirus genomes. Among the seven baculoviruses, including HaSNPV 65 ORFs are conserved. Among all NPVs, 84 ORFs are conserved (data not shown). This suggests that about 70 ORFs are the minimal requirement for basic baculovirus features, such as virus structure, transcription, DNA replication, auxiliary functions on the cellular or organism level and occlusion body morphogenesis (Table 2). Putative functions have been assigned to approximately 61% of these common baculovirus genes. Twenty ORFs larger than 50 amino acids were unique to HaSNPV. Nine of these, 50 to 100 amino acids long, have no consensus baculovirus promoter (Table 2). Most likely, these small ORFs in HaSNPV are not functional, but this has to be tested experimentally.

Of the 135 HaSNPV ORFs identified, 100 have homologues in AcMNPV and a further 15 have homologues in other baculoviruses (Tables 1 and 2). HaSNPV shares the largest number of homologues (103) with SeMNPV, underscoring the close relationship between these two viruses as evidenced from gene phylogeny analyses involving polyhedrin, egt, lef-2 (Chen et al., 1997a , b , 1999 ) and DNA polymerase (Bulach et al., 1999 ). Polyhedrin is the most conserved ORF of the six NPVs, with a mean amino acid identity of 83% to other NPV polyhedrin genes; the identity to GV granulin is much less (51% for XcGV). Ubiquitin (ubi), which is involved in the targetting of proteins for degradation, is the next most conserved gene among the seven sequenced baculoviruses, with 75% amino acid identity, followed by superoxide dismutase (sod) with 70% amino acid identity. The mean ORF amino acid identity between HaSNPV and the group II baculoviruses SeMNPV and LdMNPV is similar (46%) and higher than to group I baculoviruses (41%). This is in support of the distinct phylogenetic relationship between group I and group II NPVs (Zanotto et al., 1993 ; Bulach et al., 1999 ).

Structural virion genes
The HaSNPV genome contains the known genes encoding the common virion structural proteins of NPVs (Table 2). In contrast to SeMNPV, where odv-e66 is duplicated (IJkel et al., 1999 ), HaSNPV does not contain duplicate genes for virion structural proteins. However, HaSNPV apparently lacks a homologue of the BV envelope surface glycoprotein gene gp64 (Ac128). The product of this gene, GP64, is acquired by virions during budding through the plasma membrane and is involved in the association with cell receptors upon invasion and fusion in endosomes (Oomens & Blissard, 1999 ). However, an ORF has been identified in HaSNPV (Ha133) with an average amino acid identity with Ld130 (43%) and Se8 (40%). The latter viruses also lack a gp64 homologue and it has been suggested that Ld130 and Se8 are the functional homologues of AcMNPV gp64 (Kuzio et al., 1999 ; IJkel et al., 1999 ). Recently, direct evidence was obtained that the products of Ld130 and Se8 are the major constituents of the BV envelope and are responsible for the fusogenic activity of SeMNPV (Pearson et al., 2000 ; IJkel et al., 2000 ).

DNA replication and late gene expression
There are 19 lef genes in AcMNPV that have been implied in DNA replication and late gene expression (Kool et al., 1995 ; Lu & Miller, 1995 ). They were all required for late and very late gene expression. Of these, six (lef-1, lef-2, lef-3, dnapol, helicase, ie-1) are essential for DNA replication, whereas others are involved in transcription (ie-2, lef-4, lef-5, lef-8, lef-9) (Guarino et al., 1998 ) or in inhibition of apoptosis (such as p35 and iap genes) (Clem & Miller, 1994 ). In silico analysis indicated that the genome of HaSNPV contains homologues of 16 of the above AcMNPV lef genes and lacks ie-2, p35 and lef-12 (Table 4). The latter genes are also absent in LdMNPV, SeMNPV and XcGV, suggesting that they occur only in the group I NPVs. HaSNPV also has a homologue (Ha8) to the first exon of a spliced transcript from Ac141 (ie-0). This transcript also includes Ac147 located 4 kb downstream of ie-0 (Chisholm & Henner, 1988 ). In contrast, this exon encoded by Se138 is not functional in SeMNPV (Van Strien et al., 2000 ).


View this table:
[in this window]
[in a new window]
 
Table 4. Baculovirus ORFs without homologues in HaSNPV

 
The percentage amino acid identity of HaSNPV lef-8 (Ha38) and lef-9 (Ha55) with AcMNPV lef-8 and lef-9, encoding subunits of the RNA polymerase complex (Guarino et al., 1998 ), was the highest among the lefs at about 65%, whereas HaSNPV lef-3 (Ha65) and AcMNPV lef-3 shared only 27% of their amino acids. HaSNPV LEF3 has a low degree of homology with other NPVs as well (Table 2) and a lef-3 gene is not assigned in XcGV (Hayakawa et al., 1999 ). It has been suggested that LEF3 is chaperoning other replication factors, such as helicase and LEF2, across the nuclear membrane in infected cells (Wu & Carstens, 1998 ). Since this membrane is almost eliminated upon infection of cells with GV (Federici, 1999 ), LEF3 may not be required for GVs to replicate. However, there is a very low degree of homology of lef-3 to Xc134 and this ORF is also of roughly the same size and has a conserved location in the genome compared with the other baculovirus lef genes. Further experimentation is required to clarify this assumption. Ha25 shows approximately 36% amino acid identity to Ac25 and Bm16, which encode a putative DNA-binding protein (DBP) (Okano et al., 1999 ; Mikhailov et al., 1998 ). An AcMNPV gene involved in the modulation of very late gene expression (vlf-1) (Todd et al., 1996 ) has also been found in HaSNPV (Ha71).

Similar to SeMNPV, LdMNPV and XcGV, HaSNPV also lacks a p35 homologue (Table 4). Instead, two members of the iap (Crook et al., 1993 ) gene family were observed in HaSNPV, iap-2 (Ha62) and iap-3 (Ha103). Homologues of iap subclasses (1–4) have been found in AcMNPV (Ac27, iap-1 and Ac71, iap-2), OpMNPV (Op41, iap-1; Op74, iap-2; Op35, iap-3 and ORF106, iap-4), SeMNPV (Se88, iap-2 and Se110, iap-3), LdMNPV (Ld79, iap-2 and Ld139, iap-3) and XcGV (Xc137, iap-3). The HaSNPV iap-3 gene has high homology to the CpGV iap gene, which could functionally complement an AcMNPV p35 deletion mutant (Crook et al., 1993 ). OpMNPV iap-3 can also complement AcMNPV p35 null mutants (Birnbaum et al., 1994 ). The function of the iap-1, iap-2 and iap-4 genes is unknown. Through partial DNA sequence analysis, three iap gene homologues (iap-1, iap-2 and iap-3) were found in Buzura suppressaria SNPV (Hu et al., 1998 ).

HaSNPV lack genes for enzymatic functions in nucleotide metabolism, such as ribonucleotide reductase (rr) and deoxyuridyltriphosphatase (dUTPase). The products of rr and dUTPase allow the virus to convert rNTPs into dNTPs to the benefit of virus DNA replication. RR reduces NDPs into dNDPs and dUTPase converts dUTP into dUMP, thereby excluding dUTP from incorporation into DNA and providing dUMP as a precursor for dTTP. dUTPase and rr are present in SeMNPV (IJkel et al., 1999 ), OpMNPV (Ahrens et al., 1997 ) and LdMNPV (Kuzio et al., 1999 ) but are absent from AcMNPV and BmNPV and also from XcGV. The latter virus contained a DNA ligase (Xc141), which appeared to be absent from NPVs except LdMNPV.

Genes with auxiliary functions
Baculovirus auxiliary genes are not essential for virus replication per se but are important, for example, for interaction with the insect host (O’Reilly, 1997 ). HaSNPV has a very similar set of auxiliary genes as SeMNPV, encoding for example chitinase (chitA, Ha41), cathepsin (v-cath, Ha56) and egt (Ha126) (IJkel et al., 1999 ). These genes are quite well conserved, with 66, 47 and 49% amino acid identity, respectively, whereas the fibroblast growth factor (fgf, Ha113) is poorly conserved among baculoviruses with 28% amino acid identity.

HaSNPV lacks a gene for protein tyrosine/serine phosphatase (ptp) with dual-specificity (dsPTP) (Tilakaratne et al., 1991 ; Kim & Weaver, 1993 ). This protein specifically removes phosphates from both tyrosine and serine/threonine residues (Wishart et al., 1995 ). The absence of a ptp gene homologue in HaSNPV is striking, since such a gene is present in all NPV genomes sequenced to date and is thought to be involved in the regulation of the phosphorylation status of viral and host proteins during infection.

Duplicated bro genes
A common characteristic of baculovirus genomes is the presence of a group of related genes, the so-named bro genes. Five homologues of AcMNPV ORF2 (Ac2) are present in BmNPV (Gomi et al., 1999 ). In LdMNPV, SeMNPV and XcGV sixteen, one and five bro-related genes are found, respectively (Kuzio et al., 1999 ; IJkel et al., 1999 ; Hayakawa et al., 1999 ). In OpMNPV, a truncated version and two smaller bro-related ORFs are present (Ahrens et al., 1997 ). Three bro-related genes were identified in HaSNPV, named bro-a (Ha59), bro-b (Ha60) and bro-c (Ha105). Ha59 is most closely related to Ld150 (bro-m), belonging to the group II bro family (Kuzio et al., 1999 ), with 50% amino acid identity. Ha60 also belongs to the group II bro genes and shares the largest homology to Ld140 (bro-l) and Xc159 (bro-g), but has an N-terminal duplication of 183 amino acids. It thus seems unlikely that the Ha59 and Ha60 bro genes have a common recent ancestor and therefore might have been spliced in tandem into an hr sequence (hr3 and hr4). Ha105 and Xc60 are 66% identical and related to the group III bro genes (Kuzio et al., 1999 ).

HaSNPV ORFs with homologues in a few other baculoviruses
HaSNPV possesses 22 ORFs that have no homologues in AcMNPV, BmNPV, OpMNPV, SeMNPV, LdMNPV or XcGV (Table 2). Of these, Ha6 is identical to Hz480 from Helicoverpa zea SNPV (HzSNPV) (Le et al., 1997 ). In HaSNPV, a homologue of the Leucania separata NPV (LsNPV) p13 gene (Ha97) is found. This homologue is, in contrast to the SeMNPV homologue, not C-terminally extended (Wang et al., 1995 ; IJkel et al., 1999 ). The two leucine-zipper-like structures present in LsNPV P13 (Wang et al., 1995 ) are also conserved in Ha97. The function of this ORF in LsNPV as well as in SeMNPV (Se59) and XcGV (Xc48) is unknown.

Three HaSNPV ORFs have a homologue in only one other baculovirus. None of these genes has yet been assigned functions. Ha19 has a gene homologue in LdMNPV (Ld26) with an amino acid identity of 40%. This ORF, however, is rather small, encoding an 11 kDa protein. Ha57, encoding a putative 21 kDa product, has a homologue in XcGV (Xc83) with an amino acid identity of 33%. An Se68 homologue is identified in HaSNPV as Ha83 encoding a putative protein of 18·8 kDa but with a low amino acid identity of 26%. All ORFs, however, have baculovirus early and/or late transcription motifs and may therefore be functional.

HaSNPV ORF100 (Ha100) was found to encode a putative poly(ADP-ribose) glycohydrolase (parg). The homology with Drosophila melanogaster (24% identity) and Homo sapiens (23% identity) genes was found in the C-terminal portion of the putative protein. Homologues of Ha100 were also found in LdMNPV (Ld141) and SeMNPV (Se52), so that their presence appears to be limited to group II NPVs. In eukaryotes this enzyme is involved in the breakdown of polyribose and recruitment of this compound for nuclear functions such as DNA replication and repair (D’Amours et al., 1999 ). The function of this enzyme in baculovirus group II morphogenesis or pathology is not known, but it is possible that it is involved in similar capacity during the NPV infection process. The baculovirus parg gene is much longer than the eukaryotic counterpart and thus may have additional activities.

A few HaSNPV ORFs (Ha1–4, Ha6, Ha9–11, Ha13–15, Ha41, Ha69, Ha71 and Ha126; Table 2) have a high degree of amino acid identity (>90%) to sequences available from HzSNPV (Ma et al., 1993 ; Cowan et al., 1994 ; Le et al., 1997 ). This suggests that the overall homology between HaSNPV and HzSNPV is very high and that they are most likely variants of the same virus species. Sequencing of the HzSNPV genome would reveal whether this assumption is correct.

Unique HaSNPV ORFs
To date, 20 ORFs in the HaSNPV genome are unique to this virus and also do not exhibit significant homology to any other sequences in the GenBank. Most of these ORFs are either very small, encoding putative proteins of up to 100 amino acids (Ha5, Ha7, Ha17, Ha18, Ha40, Ha45, Ha54, Ha104 and Ha112), or contain no common baculovirus transcription initiation sites for early or late gene expression (Ha102, Ha108 and Ha109). Eight ORFs (Ha29, Ha34, Ha99, Ha107, Ha122, Ha125, Ha134 and Ha135) are larger than 100 amino acids and have early and late baculovirus promoter motifs. Ha34 and Ha107 are of interest as they encode putative proteins of 41·1 and 51·2 kDa, respectively. The possible functions of these ORFs are being investigated. For convenience, the ORFs present in the other baculovirus sequences, AcMNPV, BmNPV, OpMNPV, LdMNPV and SeMNPV, are listed in Table 4.

The HaSNPV genome organization
The genomic organization, i.e. the order of genes, of HaSNPV has been studied in a comparative manner using GeneParityPlot analysis (Hu et al., 1998 ). As the gene order between AcMNPV, BmNPV and OpMNPV is basically identical, except for a small number of rearrangements (Ahrens et al., 1997 ; Hu et al., 1998 ; Gomi et al., 1999 ), AcMNPV was taken as a representative example of this group in the analysis (Fig. 3A). A comparison was made between the recently sequenced MNPVs, SeMNPV (IJkel et al., 1999 ) and LdMNPV (Kuzio et al., 1999 ), and XcGV (Hayakawa et al., 1999 ) (Fig. 3B–D). To obtain maximum alignment in the GeneParityPlot analysis, the order of genes had to be reversed for the calculation. By convention, the orientation of a circular baculovirus genome is determined by the relative position of two genes, polyhedrin at map unit 0 and p10 approximately at map unit 90 (Vlak & Smith, 1982 ). In the initial GeneParityPlot analysis, the orientation of the HaSNPV genome appeared to be reversed for more than 50% of the ORFs compared with AcMNPV and LdMNPV in order to obtain maximum alignment compared with the physical map constructed previously (Chen et al., 2000a ). A similar situation exists for SeMNPV (IJkel et al., 1999 ). The gene organization of HaSNPV is most conserved in the ‘central region’ of the linearized baculovirus genomes and confirms the supposition of Heldens et al. (1998) . The left region of the linearized HaSNPV genome displays a considerable number of gene inversions and translocations as deduced from the GeneParityPlot analyses. The right region showed a high degree of gene scrambling (Fig. 3A–D). From these analyses it is concluded that the organization of HaSNPV is highly characteristic and distinct from those of AcMNPV, SeMNPV, LdMNPV and XcGV.



View larger version (27K):
[in this window]
[in a new window]
 
Fig. 3. GeneParityPlots of HaSNPV versus three other baculoviruses. Graphic representation of the collinearity of baculovirus genomes of AcMNPV (A), LdMNPV (B), SeMNPV (C) and XcGV (D) obtained by GeneParityPlot analysis (see Methods). Fourteen putative gene clusters of the HaSNPV genome, which are similar to those of AcMNPV (A), LdMNPV (B) and SeMNPV (C), were ordered alphabetically and underlined. The putative gene clusters are indicated in Table 2.

 
Comparison of the relative gene order between HaSNPV and AcMNPV, SeMNPV, LdMNPV and XcGV revealed the presence of certain gene clusters that are conserved in all baculovirus genomes (Fig. 3, Table 2). The juxtaposition of ORFs can be used as a phylogenetic marker to study the ancestral relationship of baculoviruses, independent of the evolution of individual genes. These clusters are numbered according to their sequential appearance in the GeneParityPlots. Fourteen clusters conserved in all five baculoviruses have been identified (Fig. 3, Table 2). In comparison with a previous analysis (IJkel et al., 1999 ), a small additional cluster, named 12a (Ac28/Ac29 and their homologues), has been identified. Cluster 12, which was conserved in AcMNPV, LdMNPV and SeMNPV, was interrupted in HaSNPV by a Lesel25 homologue. Furthermore, chitA (Ha41) has been inserted into cluster 11, whereas Ha40, Ha54 and Ha83 also intervened in this cluster. However, the latter three ORFs are very putative and relatively small genes and, in the cases of Ha40 and Ha54, without apparent transcription control motifs. One additional cluster has been identified in the GeneParityPlot of HaSNPV versus SeMNPV and LdMNPV encompassing Ha126, Ha128 and Ha129 (cluster n, Fig. 3).

Comparison of the cluster organization of HaSNPV with that of other baculoviruses (Fig. 4) suggests that the genomic organization of HaSNPV is more closely related to that of SeMNPV and LdMNPV than to that of group I NPVs (AcMNPV, BmNPV and OpMNPV) or XcGV. This is in agreement with the phylogenetic analysis of single genes such as egt, lef-2, dnapol and rr (Chen et al., 1997a , b , 1999 ; Bulach et al., 1999 ). When the order of gene clusters is taken to represent the baculovirus genome organization, the common structure of group II baculoviruses becomes apparent (Fig. 4A). Within each group, the structural difference is relatively small and predominantly determined by inversions of gene clusters as well as inversions of individual genes (e.g. polyhedrin). Comparison of the two groups showed extensive genomic translocations in addition to cluster inversions. When the inverted genes remained functional, they could be translocated to other genomic regions. These ‘jumping’ genes can be used as phylogenetic markers to follow baculovirus evolution in retrospect. A common genome structure for group I and group II viruses can be derived, showing a major inversion of a genomic segment containing the cluster c-b-a-m-n (Fig. 4B).



View larger version (19K):
[in this window]
[in a new window]
 
Fig. 4. Alignment of conserved genome clusters of AcMNPV, BmNPV, OpMNPV, SeMNPV, LdMNPV, HaSNPV and XcGV (A) and comparison between group I and group II baculoviruses (B). The arrows indicate the orientation of the cluster and the cluster inversions are underlined.

 
In conclusion, the sequence of the genome of HaSNPV is distinct from other baculoviruses both in gene content and in gene arrangement. Except for three bro-related genes and two iap-related genes, the HaSNPV genome contains 130 unique ORFs, many of which are shared with other NPVs. Based on the percentage identity of gene homologues, on the phylogeny of some particular genes and on the gene arrangement along the HaSNPV genome, we conclude that HaSNPV, SeMNPV and LdMNPV must have had a common ancestor. The HaSNPV sequence further confirmed the observation that the part of baculovirus genomes flanking DNA helicase is highly conserved, possibly as a result of transcriptional or regulatory constraints. By comparing gene clusters, a common structural genomic feature is revealed in group II baculoviruses. A study of the 11 unique putative ORFs (>100 amino acids) may provide insight in the determinants specifying the SNPV morphotype. From sequence analysis it is also clear that the SNPV and MNPV morphotype is the only taxonomic determinant and it is likely that SNPVs and MNPVs do not represent separate phylogenetic clades.


   Acknowledgments
 
This research was supported in part by the Royal Academy of Sciences of the Netherlands and the Chinese Academy of Sciences (98CDP008), the Dutch–Israeli Agricultural Research Program (DIARP) (93/20 and 97/29) and the Natural Science Foundation of China (NSFC). Marjo van Staveren, Marleen Abma-Henkens and Paul Mooijman are thanked for their skilful technical assistance. W.F.J.IJ. is supported by a fellowship from the Netherlands Foundation for Chemical Sciences (CW) with financial aid from the Netherlands Organization for Scientific Research (NWO). X.C. received a grant from the Royal Academy of Sciences of the Netherlands (KNAW) and a PhD sandwich fellowship from Wageningen University. Z.H. is a recipient of Hundreds of Talents Program award (STZ-3-01).


   Footnotes
 
The GenBank accession number of the sequence reported in this paper is AF271059.


   References
Top
Abstract
Introduction
Methods
Results and Discussion
References
 
Ahrens, C. H., Russell, R., Funk, C. J., Evans, J. T., Harwood, S. H. & Rohrmann, G. F.(1997). The sequence of the Orgyia pseudotsugata multinucleocapsid polyhedrosis virus genome. Virology 229, 381-399.[Medline]

Altschul, S. F., Gish, W., Miller, W., Meyers, E. W. & Lipman, D. J.(1990). Basic local alignment search tool. Journal of Molecular Biology 215, 403-410.[Medline]

Ayres, M. D., Howard, S. C., Kuzio, J., Lopez-Ferber, M. & Possee, R. D.(1994). The complete DNA sequence of Autographa californica nuclear polyhedrosis virus. Virology 202, 586-605.[Medline]

Birnbaum, M. J., Clem, R. J. & Miller, L. K.(1994). An apoptosis-inhibiting gene from a nuclear polyhedrosis virus encoding a polypeptide with Cys/His sequence motifs. Journal of Virology 68, 2521-2528.[Abstract]

Bonfield, J. K., Smith, K. F. & Staden, R.(1995). A new DNA sequence assembly program. Nucleic Acids Research 24, 4992-4999.[Abstract/Free Full Text]

Bulach, D. M., Kumar, C. A., Zaia, A., Liang, B. F. & Tribe, D. E.(1999). Group II nucleopolyhedrovirus subgroups revealed by phylogenetic analysis of polyhedrin and DNA polymerase gene sequences. Journal of Invertebrate Pathology 73, 59-73.[Medline]

Chen, X., Hu, Z., Jehle, J. A., Zhang, Y. & Vlak, J. M.(1997a). Analysis of the ecdysteroid UDP-glucosyltransferase gene of Heliothis armigera single-nucleocapsid baculovirus. Virus Genes 15, 219-225.[Medline]

Chen, X., Hu, Z. H. & Vlak, J. M.(1997b). Nucleotide sequence analysis of the polyhedrin gene of Heliothis armigera single nucleocapsid nuclear polyhedrosis virus. Virologica Sinica 12, 346-353.

Chen, X., IJkel, W. F. J., Dominy, C., Zanotto, P. de A., Hashimoto, Y., Faktor, O., Hayakawa, T., Wang, C. H., Krell, P. J., Hu, Z. & Vlak, J. M.(1999). Identification, sequence and phylogeny of the lef-2 gene of Helicoverpa armigera single-nucleocapsid baculovirus. Virus Research 65, 21-32.[Medline]

Chen, X., Li, M., Sun, X., Arif, B. M., Hu, Z. H. & Vlak, J. M. (2000a). Genomic organization of Helicoverpa armigera single-nucleocapsid nucleopolyhedrovirus. Archives of Virology 145, (in press).

Chen, X., Sun, X., Hu, Z. H., Li, M., O’Reilly, D. R., Zuidema, D. & Vlak, J. M.(2000b). Genetic engineering of Helicoverpa armigera single-nucleocapsid nucleopolyhedrovirus as an improved bioinsecticide. Journal of Invertebrate Pathology 76, 140-146.[Medline]

Chisholm, G. E. & Henner, D. J.(1988). Multiple early transcripts and splicing of the Autographa californica nuclear polyhedrosis virus IE-1 gene. Journal of Virology 62, 3193-3200.[Medline]

Clem, R. J. & Miller, L. K.(1994). Control of programmed cell death by the baculovirus genes p35 and iap. Molecular Cell Biology 14, 5212-5222.[Abstract]

Cochran, M. A. & Faulkner, P.(1983). Location of homologous DNA sequences interspersed at five regions in the baculovirus AcMNPV genome. Journal of Virology 45, 961-970.

Cowan, P., Bulach, D., Goodge, K., Robertson, A. & Tribe, D. E.(1994). Nucleotide sequence of the polyhedrin gene region of Helicoverpa zea single nucleocapsid nuclear polyhedrosis virus: placement of the virus in lepidopteran nuclear polyhedrosis group II. Journal of General Virology 75, 3211-3218.[Abstract]

Crook, N. E., Clem, R. J. & Miller, L. K.(1993). An apoptosis-inhibiting baculovirus gene with a zinc finger-like motif. Journal of Virology 67, 2168-2174.[Abstract]

D’Amours, D., Desnoyers, S., d’Silva, I. & Poirier, G. G.(1999). Poly (ADP-ribosyl)ation reactions in the regulation of nuclear functions. Biochemical Journal 342, 249-268.[Medline]

Devereux, J., Haeberli, P. & Smithies, O.(1984). A comprehensive set of sequence analysis programs for the VAX. Nucleic Acids Research 12, 387-395.[Abstract]

Domingo, E., Holland, J. J., Biebricher, C. & Eigen, M.(1995). Quasispecies: the concept and the world. In Molecular Basis of Evolution , pp. 171-180. Edited by A. Gibbs, C. Calisher& F. Garcia-Arenal. Cambridge:Cambridge University Press.

Ewing, B. & Green, P.(1998). Basecalling of automated sequencer traces using PHRED. II. Error probabilities. Genome Research 8, 186-194.[Abstract/Free Full Text]

Ewing, B., Hillier, L., Wendl, M. C. & Green, P.(1998). Basecalling of automated sequencer traces using PHRED. I. Accuracy assessment. Genome Research 8, 175-185.[Abstract/Free Full Text]

Federici, B. A.(1999). Naturally occurring baculoviruses for insect pest control. Methods in Biotechnology 5, 301-320.

Figueiredo, E., Muñoz, D., Escribano, A., Mexia, A., Vlak, J. M. & Caballero, P.(1999). Biochemical identification and comparative insecticidal activity of nucleopolyhedrovirus isolates pathogenic for Heliothis armigera (Lep., Noctuidae) larvae. Journal of Applied Entomology 123, 165-169.

Gettig, R. R. & McCarthy, W. J.(1982). Genotypic variation among wild isolates of Heliothis spp nuclear polyhedrosis viruses from different geographical regions. Virology 117, 245-252.

Gomi, S., Majima, K. & Maeda, S.(1999). Sequence analysis of the genome of Bombyx mori nucleopolyhedrovirus. Journal of General Virology 80, 1323-1337.[Abstract]

Guarino, L. A. & Summers, M. D.(1986). Interspersed homologous DNA of Autographa californica nuclear polyhedrosis virus enhances delayed-early gene expression. Journal of Virology 60, 215-223.

Guarino, L. A., Gonzales, M. A. & Summers, M. D.(1986). Complete sequence and enhancer function of the homologous DNA regions of Autographa californica nuclear polyhedrosis virus. Journal of Virology 60, 224-229.

Guarino, L. A., Xu, B., Jin, J. P. & Dong, W.(1998). A virus-encoded RNA polymerase purified from baculovirus-infected cells. Journal of Virology 72, 7985-7991.[Abstract/Free Full Text]

Hayakawa, T., Ko, R., Okano, K., Seong, S., Goto, C. & Maeda, S.(1999). Sequence analysis of the Xestia c-nigrum granulovirus genome. Virology 262, 277-297.[Medline]

Heldens, J. G. M., Yi, L., Zuidema, D., Goldbach, R. W. & Vlak, J. M.(1998). A highly conserved genomic region in baculoviruses: sequence and transcriptional analysis of an 11·3 kbp DNA fragment (46·5–55·1 mu) from the Spodoptera exigua multicapsid nucleopolyhedrovirus. Virus Research 55, 187-198.[Medline]

Hu, Z. H., Arif, B. M., Jin, F., Martens, J. W. M., Chen, X. W., Sun, J. S., Zuidema, D., Goldbach, R. W. & Vlak, J. M.(1998). Distinct gene arrangement in the Buzura suppressaria single-nucleocapsid nucleopolyhedrovirus genome. Journal of General Virology 79, 2841-2851.[Abstract]

IJkel, W. F. J., Van Strien, E. A., Heldens, J. G. M., Broer, R., Zuidema, D., Goldbach, R. W. & Vlak, J. M.(1999). Sequence and organization of the Spodoptera exigua multicapsid nucleopolyhedrovirus genome. Journal of General Virology 80, 3289-3304.[Abstract/Free Full Text]

IJkel, W. F. J., Westenberg, M., Goldbach, R. W., Blissard, G. W., Vlak, J. M. & Zuidema, D.(2000). A novel baculovirus envelope fusion protein with a proprotein convertase cleavage site. Virology 275, 30-41.[Medline]

Kim, D. & Weaver, R. F.(1993). Transcription mapping and functional analysis of the protein tyrosine/serine phosphatase (PTPase) gene of the Autographa californica nuclear polyhedrosis virus. Virology 195, 587-595.[Medline]

King, L. A. & Possee, R. D. (1992). The Baculovirus Expression System: A Laboratory Guide, pp. 180–194. London: Chapman and Hall.

Kool, M., Ahrens, C. H., Vlak, J. M. & Rohrmann, G. F.(1995). Replication of baculovirus DNA. Journal of General Virology 76, 2103-2118.[Medline]

Kuzio, J., Pearson, M. N., Harwood, S. H., Funk, C. J., Evans, J. T., Slavicek, J. M. & Rohrmann, G. F.(1999). Sequence and analysis of the genome of a baculovirus pathogenic for Lymantria dispar. Virology 253, 17-34.[Medline]

Le, T. H., Wu, T., Robertson, A., Bulach, D., Cowan, P., Goodge, K. & Tribe, D.(1997). Genetically variable triplet repeats in a RING-finger ORF of Helicoverpa species baculoviruses. Virus Research 49, 67-77.[Medline]

Lu, A. & Miller, L. K.(1995). The roles of eighteen baculovirus late expression factor genes in transcription and DNA replication. Journal of Virology 69, 975-982.[Abstract]

Ma, S.-W., Corsaro, B. G., Klebba, P. E. & Fraser, M. J.(1993). Cloning and sequence analysis of a p40 structural protein of Helicoverpa zea nuclear polyhedrosis virus. Virology 192, 224-233.[Medline]

Mikhailov, V. S., Mikhailova, A. L., Iwanaga, M., Gomi, S. & Maeda, S.(1998). Bombyx mori nucleopolyhedrovirus encodes a DNA-binding protein capable of destabilizing duplex DNA. Journal of Virology 72, 3107-3116.[Abstract/Free Full Text]

Moscardi, F.(1999). Assessment of the application of baculoviruses for control of Lepidoptera. Annual Reviews of Entomology 44, 257-289.

Muñoz, D., Castillejo, J. I. & Caballero, P.(1998). Naturally occurring deletion mutants are parasitic genotypes in a wild-type nucleopolyhedrovirus population of Spodoptera exigua. Applied and Environmental Microbiology 64, 4372-4377.[Abstract/Free Full Text]

Murphy, F. A., Fauquet, C. M., Bishop, D. H. L., Ghabrial, S. A., Jarvis, A. W., Martelli, G. P., Mayo, M. A. & Summers, M. D. (editors) (1995). Virus Taxonomy. Sixth Report of the International Committee on Taxonomy of Viruses. New York: Springer–Verlag.

Okano, K., Mikhailov, V. S. & Maeda, S.(1999). Colocalization of baculovirus IE-1 and two DNA-binding proteins, DBP and LEF-3, to viral replication factories. Journal of Virology 73, 110-119.[Abstract/Free Full Text]

Oomens, A. G. P. & Blissard, G. W.(1999). Requirement for gp64 to drive efficient budding of Autographa californica multicapsid nucleopolyhedrovirus. Virology 254, 297-314.[Medline]

O’Reilly, D. R.(1997). Auxiliary genes of baculoviruses. In The Baculoviruses , pp. 267-295. Edited by L. K. Miller. New York:Plenum.

Pearson, W. R.(1990). Rapid and sensitive sequence comparison with FASTP and FASTA. Methods in Enzymology 183, 63-98.[Medline]

Pearson, M. N., Groten, C. & Rohrmann, G. F.(2000). Identification of the Lymantria dispar nucleopolyhedrovirus envelope fusion protein provides evidence for a phylogenetic division of the Baculoviridae. Journal of Virology 74, 6126-6131.[Abstract/Free Full Text]

Smith, I. R. & Crook, N. E.(1988). In vivo isolation of baculovirus genotypes. Virology 166, 240-244.[Medline]

Sun, X. L., Zhang, G. Y., Zhang, Z. X., Hu, Z. H., Vlak, J. M. & Arif, B. M.(1998). In vivo cloning of Helicoverpa armigera single nucleocapsid nuclear polyhedrosis virus genotypes. Virologica Sinica 13, 83-88.

Tilakaratne, N., Hardin, S. E. & Weaver, R. F.(1991). Nucleotide sequence and transcript mapping of the HindIII F region of the Autographa californica nuclear polyhedrosis virus. Journal of General Virology 72, 285-291.[Abstract]

Todd, J. W., Passarelli, A. L. & Miller, L. K.(1996). Factors regulating baculovirus late and very late gene expression in transient-expression assays. Journal of Virology 70, 2307-2317.[Abstract]

Van Strien, E. A., IJkel, W. F. J., Gerrits, H., Vlak, J. M. & Zuidema, D.(2000). Characteristics of the transactivator gene ie-1 of Spodoptera exigua multiple-nucleocapsid nucleopolyhedrovirus. Archives of Virology 145, 2115-2133.[Medline]

Vlak, J. M. & Smith, G. E.(1982). Orientation of the genome of Autographa californica nuclear polyhedrosis virus: a proposal. Journal of Virology 41, 1118-1121.

Wang, J. W., Qi, Y. P., Huang, Y. X. & Li, S. D.(1995). Nucleotide sequence of a 1446 base pair SalI fragment and structure of a novel early gene of Leucania separata nuclear polyhedrosis virus. Archives of Virology 140, 2283-2291.[Medline]

Wishart, M. J., Denu, J. M., Williams, J. A. & Dixon, J. E.(1995). A single mutation converts a novel phosphotyrosine binding domain into a dual-specificity phosphatase. Journal of Biological Chemistry 270, 26782-26785.[Abstract/Free Full Text]

Wu, Y. & Carstens, E. H.(1998). A baculovirus single-stranded DNA binding protein, LEF-3, mediates the nuclear localization of the putative helicase P143. Virology 247, 32-40.[Medline]

Zanotto, P. M. de A., Kessing, B. D. & Maruniak, J. E.(1993). Phylogenetic interrelationships among baculoviruses: evolutionary rates and host associations. Journal of Invertebrate Pathology 62, 147-164.[Medline]

Zhang, G.(1994). Research, development and application of Heliothis viral pesticide in China. Resources and Environment in the Yangtze Valley 3, 1-6.

Zhang, G., Zhang, Y., Ge, L. & Shan, Z.(1981). The production and application of the nuclear polyhedrosis virus of Heliothis armigera (Hübner) in biological control. Acta Phytophylacica Sinica 8, 235-240.

Received 9 June 2000; accepted 15 September 2000.