©1996 by The American Society for Biochemistry and Molecular Biology, Inc.
Accelerated Evolution in Inhibitor Domains of Porcine Elafin Family Members (*)

(Received for publication, October 2, 1995; and in revised form, January 4, 1996)

Ichiro Tamechika Makoto Itakura Yoko Saruta Makoto Furukawa Akira Kato Shinro Tachibana (1) Shigehisa Hirose (§)

From the Department of Biological Sciences, Tokyo Institute of Technology, 4259 Nagatsuta-cho, Midori-ku, Yokohama 226 and the Tsukuba Research Laboratories, Eisai Co., Ltd., Tsukuba 300-26, Japan

ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
FOOTNOTES
ACKNOWLEDGEMENTS
REFERENCES

ABSTRACT

Through the analysis of the porcine gene encoding the elastase inhibitor elafin, we demonstrated that there are at least three closely related members of the elafin family, and their genes have arisen by accelerated evolution. A porcine genomic DNA library was screened with a previously cloned human elafin cDNA probe, and several positive clones were obtained that can be distinguished by a combination of restriction enzymes. Sequence analysis of these clones revealed the presence of three homologous members whose genes, all consisting of three exons and two introns, are almost identical except the exon 2 sequences encoding the inhibitor domain called ``WAP motif''; the intron sequences are related to each other with sequence similarities of 93-98%, whereas the exon 2 sequences exhibited only 60-77% similarities among the three members. The extreme divergence in the exon 2 sequences compared to the highly conserved intron sequences may be generated by accelerated mutations confined in a short stretch of the genes following recent duplication events of a single ancestral gene. An RNase protection assay indicated that the messages of the elafin family members are abundantly expressed in the trachea and intestine, suggesting that the most likely selective forces for the accelerated evolution are extrinsic proteinases produced by invasive microorganisms.


INTRODUCTION

Elafin is a unique elastase inhibitor (1) having a transglutaminase substrate domain that serves as an anchoring sequence to confine the inhibitor at its sites of action(2, 3, 4) . Elafin, also known as skin-derived antileukoproteinase (SKALP; (5) and (6) ) or elastase-specific inhibitor (ESI, (7) ), is synthesized as pre-elafin and secreted into the extracellular matrix space where it is cross-linked to certain structural proteins through its transglutaminase substrate domain by the action of transglutaminase (for review on transglutaminase, see (8, 9, 10) ). Although elafin was first isolated as a 57-amino acid protein with a characteristic disulfide-linked structure called four-disulfide core or WAP motif (11) , the 57-amino acid form is now considered to represent the inhibitor domain generated, by proteolytic cleavage, from the 95-amino acid native elafin which consists of the following two domains: 1) transglutaminase substrate domain that we termed cementoin moiety (3) and 2) inhibitor domain or WAP motif. Biochemical and molecular biological studies on elafin have so far been performed using human materials; however, for more detailed studies such as identification of the acceptor proteins to which elafin is anchored by covalent cross-linking through the transglutaminase substrate domain, we considered it more appropriate to use other mammalian species from which fresh materials are relatively easily obtainable and initiated cloning and characterization of porcine elafin gene, which resulted in the following unexpected findings that 1) there are at least three elafin-related genes that are very similar to each other over the entire length of about 4 kb (^1)including the intron sequences, suggesting that they have arisen by relatively recent gene duplications and 2) in contrast to the extremely high similarity (97-98%) in the intron and parts of the exon sequences corresponding to the noncoding regions, the gene sequences coding for the inhibitor domains or the WAP motif regions are surprisingly variable (77-81%) among the newly found elafin family members. This type of dramatic degree of non-homology within a short stretch of gene sequences that specifically affects the functionally important regions of protein sequences is called accelerated evolution.

Accelerated evolution, or an unusually high rate of mutation in a certain segment of the genes, is a process postulated to occur in genes following a duplication event (12, 13) and is considered to be an effective mechanism to provide the hosts with a defense system against unwanted outsiders such as pathogens and parasites and, inversely, to provide the intruders with an increased capacity of invasion. Currently, however, only a few cases of accelerated evolution have been reported. 1) The first case is a family of the serine proteinase inhibitors (serpins) in which an unusually high degree of polymorphism was found in a narrow region surrounding the reactive site. This original observation by Hill et al.(14, 15) was later extended to other members of the serpin family by Barriello and Krauter (16) and the divergence was confirmed at the cDNA level, but their genes, especially the introns, have not been analyzed. Extrinsic proteinases used by parasites to facilitate their spread throughout the host are considered as the most likely selective force. 2) An extremely high rate of nonsynonymous nucleotide substitutions that cause amino acid changes has been found in the active domain-coding region of the wheat thionin genes by cDNA cloning(17) . Thionins are a family of cysteine-rich proteins and active against plant pathogens. 3) Nakashima et al.(18, 19) have found that the introns and noncoding regions of the Trimeresurus flavoviridis (Habu snake) venom gland phospholipase A(2) isozyme genes are unusually conserved as compared to the protein-coding regions except for the presequence-coding regions, indicating that the genes have evolved so as to bring about accelerated amino acid substitutions in the mature protein-coding regions. An additional example of the accelerated evolution reported here for the elafin family genes will be helpful for both theoretical and experimental analyses of the unique events in the evolution of the genes. It should also be emphasized that the genes analyzed in the present study are the first mammalian family of genes that have unique introns whose sequences are exceptionally highly conserved (93-98% sequence identities) compared to their relatively divergent exons. The genes may therefore be one of the most recently duplicated gene families.


EXPERIMENTAL PROCEDURES

Materials

Fresh porcine tissues were obtained from the Shibaura abattoir sanitation inspection station, Tokyo, Japan; restriction enzymes, random primer DNA labeling kit version 2, and DNA ligation kit were from Takara, Kyoto, Japan; pBluescript II SK plasmid vector and RNA transcription kit were from Stratagene; porcine genomic library constructed in EMBL3 SP6/T7 using Sau3A partial digest was from Clontech; plasmid vector pBluescript II was from Stratagene; double-stranded nested deletion kit and AutoRead sequencing kit were from Pharmacia, Uppsala, Sweden; 17-mer oligonucleotide primers labeled with fluorescein isothiocyanate were from Biologica Co., Nagoya, Japan; P-labeled nucleotides were from DuPont NEN; RPA II ribonuclease protection assay kit was from Ambion Inc., Austin, TX; RNase-free DNase I was from Boehringer Mannheim; RNase inhibitor RNasin was from Toyobo, Osaka, Japan; X-Omat AR x-ray film was from Kodak, Rochester, NY; nitrocellulose filters (BA 85, 0.45 µm) were from Schleicher and Schuell, Dassel, Germany; human elafin and N-methoxysuccinyl-alanyl-alanyl-prolyl-valine 4-methylcoumaryl-7-amide (Suc(OMe)-Ala-Ala-Pro-Val-MCA) were from Peptide Institute Inc., Osaka, Japan; porcine pancreatic elastase was from Sigma. The WAP motif regions or inhibitor domains of the porcine WAP family members WAP-1, WAP-2, and WAP-3 were chemically synthesized according to the published method (20, 21) and named pWap-1, pWap-2, and pWap-3, respectively. Other chemicals were of reagent grade.

Screening of a Porcine Genomic DNA Library

A porcine liver genomic DNA library, constructed in EMBL3 SP6/T7 using Sau3A partial digest, was obtained from Clontech. Approximately 3 times 10^5 plaques were screened on nitrocellulose membranes by hybridization with the 3-kb XbaI/BamHI fragment of human elafin gene (22) labeled with [alpha-P]dCTP using a random primer DNA labeling kit. Hybridization was carried out for 16 h at 37 °C in 50% formamide plus 5 times SSPE (1 times SSPE: 0.15 M NaCl, 10 mM sodium phosphate, pH 7.4, and 1 mM EDTA), 5 times Denhardt's solution (1 times Denhardt's: 0.1% each of bovine serum albumin, polyvinylpyrrolidone, and Ficoll), and 0.1% SDS. The filters were rinsed twice at room temperature in 2 times SSC (1 times SSC: 0.1 M NaCl, 15 mM sodium citrate, pH 7.0) containing 0.05% SDS and twice at 55 °C in 1 times SSC containing 0.1% SDS for 1 h and exposed to Kodak X-Omat AR x-ray film with a Konica enhancer screen for 24 h at -70 °C.

DNA Sequencing

For DNA sequencing, 3-3.2-kb HindIII and 628-bp EcoRI/HindIII fragments of the porcine WAP family genes were subcloned into the pBluescript II SK plasmid vector (Stratagene). Overlapping deletion clones were obtained by exonuclease III digestion using a Pharmacia double-stranded nested deletion kit. Double-stranded DNA sequencing was performed by the dideoxy chain termination method (23) using a Pharmacia AutoRead sequencing kit with either T3, T7, or synthetic 17-mer primers labeled with fluorescein isothiocyanate.

Data Analysis of DNA and Amino Acid Sequences

DNA sequence data were organized and analyzed using the program GENETYX-MAC (Software Development Co., Ltd., Tokyo, Japan). Sequence comparison was done using the FASTA (24) and BLAST (25) electronic mail server on the GenomeNet and WAIS server in the DNA data bank of Japan. For the comparisons of nucleotide sequences among the human elafin gene and the genes of the porcine WAP family members, the numbers of nucleotide substitutions were estimated by the method of Jin and Nei(26) ; the numbers of nonsynonymous substitutions which cause amino acid changes and synonymous substitutions which cause no amino acid changes were computed according to the method of Nei and Gojobori (27) using the molecular evolution analysis system ODEN in the DNA data bank of Japan.

RNase Protection Analysis

Total RNA was prepared from porcine tissues by the acid guanidinium thiocyanate-phenol-chloroform method(28) . Antisense RNA probes were prepared from the subclones carrying the 249-bp BglII/BamHI fragment of pgWAP-1, the 417-bp BglII fragment of pgWAP-2, or the 359-bp BglII fragment of pgWAP-3 in the BamHI site of pBluescript II SK, which contains the T3 promoter for transcription of antisense RNA. These three fragments were designed to contain parts of exon 2 and intron 2. Riboprobes were transcribed from SalI-linearized templates using an RNA transcription kit in the presence of [alpha-P]UTP and RNasin, an RNase inhibitor. The template DNA was digested with RNase-free DNase I, and the labeled probe was purified by gel filtration using a Pharmacia NICK column. The following procedures were carried out using an RPA II ribonuclease protection assay kit. The radiolabeled probe (1-2 times10^5 cpm) was hybridized to 5 µg of total RNA from various porcine tissues, or to 5 µg of yeast tRNA as a negative control, and incubated overnight at 43 °C. Single-stranded RNA was removed by digestion with DNase-free RNase A and RNase T(1), and the protected fragments were separated on a 5% denaturing polyacrylamide gel containing 7 M urea and quantitated by fluorography.

Measurement of Elastase-inhibitory Activities of WAP Family Members

The inhibitory activities of human elafin and the structurally related porcine proteins WAP-1, WAP-2, and WAP-3 were assayed according to a modification of Wiedow et al. (1) by the use of the fluorogenic substrate Suc(OMe)-Ala-Ala-Pro-Val-MCA. Samples (100 µl) were preincubated with porcine pancreatic elastase (1 µg in 100 µl) in assay buffer (0.1 M Hepes, 0.5 M NaCl, 10% Me(2)SO, 0.01% lysozyme, pH 7.5) for 30 min at room temperature, and the remaining elastase activities were measured by adding 1.8 ml of assay buffer containing 0.1 mM substrate. The fluorescence intensities were monitored at an excitation wavelength of 383 nm and an emission wavelength of 455 nm using a Hitachi 850 fluorescence spectrophotometer.


RESULTS

Isolation and Sequences of the Genes Encoding Porcine Elafin and Its Family Members

Screening of a porcine genomic DNA library with a 3-kb XbaI/BamHI fragment of human elafin gene (22) led to the isolation of 13 positive clones. Restriction mapping with EcoRI, BamHI, BglII, HindIII, SacI, and SpeI revealed that they could be divided into three groups with very similar but slightly different restriction enzyme sites (Fig. 1); the three independent clones were designated pgWAP-1, pgWAP-2, and pgWAP-3. Southern blot and sequencing analyses of the restriction enzyme fragments of the three clones revealed that the common 3.7-kb EcoRI/HindIII fragment contained the entire gene sequence that is divided into three exons by two introns ( Fig. 1and 2). Fig. 2shows the nucleotide sequence of porcine elafin gene (pgWAP-1) and those of the 5`- and 3`-flanking regions. The nucleotide sequences of the other members pgWAP-2 and pgWAP-3 have been deposited in the DDBJ data bank with accession numbers D50320 and D50321, respectively. Interestingly, short interspersed repetitive elements (SINEs) were found in intron 2 of the three genes. Fig. 3shows the amino acid sequences of the three members of the porcine elafin family deduced from the open reading frames of their genes by reference to the exon-intron organization of the human elafin gene(22) . All three members contained, in common, a presequence of about 25 amino acids, a transglutaminase substrate domain composed of 10-17 semiconserved repeats of 6 amino acids rich in Lys and Gln, and an inhibitor domain where the 8 Cys residues responsible for forming the four-disulfide core structure or WAP motif (11) are conserved.


Figure 1: Schematic representation of the gene structures of porcine elafin (pgWAP-1), its family members (pgWAP-2 and pgWAP-3), and human elafin. Gray boxes represent exon 1. Repetitive sequence regions in exons 2 are indicated by hatched boxes. Filled boxes indicate the regions coding for the WAP motif region or inhibitor domain. Open boxes indicate the noncoding regions (exon 3). Restriction enzyme sites are also shown to illustrate a high degree of similarity of the three porcine genes even in the intron sequences. Gaps are introduced in regions a and b to produce optimal alignment; the gaps in region a are due to the difference in the number of repeats, and the gap in region b of the human elafin gene is due to the presence of the SINE sequence (PRE-1, small striped boxes) (29, 31) in the three porcine genes.




Figure 2: Nucleotide sequence of the porcine WAP-1 (elafin) gene and amino acid sequence deduced from its open reading frame. The nucleotide sequence is shown in upper case letters, and the deduced amino acid sequence is shown below the nucleotide sequence. Nucleotide and amino acid residue numbers are shown to the right of the sequence. Arrowhead indicates the putative cleavage site of the signal sequence; arrow marks the beginning of the WAP motif or the inhibitor domain. Intron sequences are bracketed [GT . . . AG]. A polyadenylation signal is indicated by closed box. Eight cysteine residues defining the WAP motif or four-disulfide core are shown white on black. Open box indicates the 3`-untranslated region (exon 3). Gray box represents SINE or PRE-1 flanked by a pair of direct repeats marked by double underlines.




Figure 3: Amino acid sequence comparison among elafin and its family members having the WAP motif. Amino acid sequences of human elafin (h-ela) and porcine WAP-1 (elafin), WAP-2 (also known as SPAI-2; (43, 44, 45) ), and WAP-3 are aligned, and the amino acid residues that are identical are shown white on black or gray boxes. Eight cysteine residues defining the WAP motif or four-disulfide core structure of the inhibitor domain (marked by horizontal lines) are indicated by asterisks. Amino acids are shown in the single-letter code. Residue numbers of the last residue in each line are noted on the right. The sequence of human elafin is from Saheki et al.(22) . Boxes 1 and 2 indicate highly variable regions and correspond to boxes 1 and 2 in Fig. 5, respectively. The box 2 region corresponds to the reactive site of the acid-stable proteinase inhibitor of human mucous secretions (HUSI-I or antileukoprotease) which consists of two WAP motifs(32, 46) .




Figure 5: High similarity in overall structures and extreme sequence divergence in a limited region of the genes of the elafin family members. The complete nucleotide sequences of the porcine WAP-1 (elafin), WAP-2, and WAP-3 genes were determined, deposited in the GSDB/DDBJ/EMBL/NCBI Data Bank, and compared to each other and to that of human elafin gene (h-ela)(22) . In this figure, only the exon 2 sequences and parts of the intron sequences flanking them are shown. Identical nucleotides are shown as dots. Gaps are designated by dashes. Stop codons are boxed. Note the high degree of sequence conservation in the intron sequences relative to the hypervariable regions 1 and 2 (open boxes) in the WAP motif-coding regions(- - -). The amino acid sequences encoded by the hypervariable regions are shown in Fig. 3(boxes 1 and 2).



Identification of WAP-1 as Porcine Elafin

Among the three members identified above, WAP-1 exhibited the highest similarity to human elafin in the amino acid sequence (Fig. 3), suggesting that WAP-1 is the porcine homolog of human elafin. To confirm this, we synthesized chemically the three polypeptides (pWap-1, pWap-2, and pWap-3) and examined their abilities to inhibit elastase. As expected, pWap-1 inhibited porcine pancreatic elastase with an IC of approximately 2 µg/ml (Fig. 4); however, pWap-2 and pWap-3 exerted no inhibitory effects on the elastase, suggesting that they display a relatively narrow and quite distinct spectrum of anti-proteinase activities.


Figure 4: Inhibition of pancreatic elastase by the WAP motif sequences of the porcine elafin family members. Three peptides (pWap-1, pWap-2, and pWap-3) corresponding to the WAP motif regions (indicated by horizontal line in Fig. 3) of the porcine WAP-1, WAP-2, and WAP-3 proteins were chemically synthesized and assayed for their inhibitory activities against elastase using a fluorogenic substrate. Assay mixtures contained 1 µg of elastase and 10 µg of the synthetic peptides or 0.3 µg of human (h) elafin in 2 ml.



Mutational Burst in a Short Segment of the Genes Encoding Inhibitor Domains of Elafin Family Members

Comparison of the nucleotide sequences of the elafin family genes revealed a surprisingly high similarity with each other (Table 1, Fig. 5). The highly conserved regions include the two introns, exon 1 encoding the presequence, and exon 3. Certain regions of exon 2, however, share only 60-77% nucleotide identity; the highest divergence is in the active center coding region. This degree of divergence is in striking contrast to the high conservation of the other regions that display at least 97-98% identity. It should be noted, however, that within the variable region, the structurally important 8 Cys residues are completely conserved among the members. To evaluate more quantitatively the nucleotide substitutions, we calculated the K(A) (number of substitutions per nonsynonymous site), K(S) (number of substitutions per synonymous site), and K(N) (number of substitutions per site in the noncoding regions) values for the 6 possible comparisons among the genes of the elafin family members (Table 2). The ratio K(A)/K(S) measures the rate of amino acid replacement in a selected region relative to the neutral rate; a ratio less than 1 indicates that the amino acid replacement rate is less than the neutral rate, and, generally, this is the case because of functional constraints placed on the proteins. In fact, in most regions of the elafin family genes, the K(A) values are lower than the K(S) values (Table 2). In the regions coding for the hypervariable regions 1 and 2 (Fig. 5), the K(A) values exceed significantly the K(S) values, suggesting that the hypervariable regions have undergone accelerated evolution.





The comparisons, especially the intraspecies and interspecies comparisons of the intron sequences, strongly suggest that the porcine elafin gene duplication and diversification occurred after the porcine had diverged from the human lineage. This conclusion is supported by the presence in the porcine genes but the absence in the human elafin gene of SINE (see below and ``Discussion'') (Fig. 1).

Size Variations of Repetitive Sequence in Exon 2

The 5` half of exon 2 of each elafin family gene encodes the transglutaminase substrate domain or cementoin-like moiety (3, 4) that consists of several semiconserved repeats of 6 amino acids. The corresponding gene sequences are also composed of semiconserved tandem repeats of 18 nucleotides. Although the repetitive sequences are highly conserved at both the nucleotide and amino acid levels among the family members, the numbers of repeats are variable (Fig. 3, Fig. 5, and Fig. 6). Such size polymorphism presumably arose from ``slippage'' of the DNA polymerases or gene conversions or both as often occurs in the regions containing repeated sequences.


Figure 6: Phylogenetic tree of the porcine WAP family members and generation of divergence in the repetitive sequences. A, phylogenetic tree constructed based on the amino acid and nucleotide sequences of the family members and their genes. B, alignment of repetitive sequences of pgWAP-1, -2, and -3. Only the nucleotides different from the consensus sequence are shown. The nucleotides boxed in black represent those different from the consensus but conserved among the family members, which helped to define a patchwork pattern of resemblance (blocks A, B, and C) in the repetitive sequence of the three members.



Fig. 6compares the repetitive sequences of pgWAP-1, -2, and -3 by highlighting the nucleotide positions that are different from the consensus sequence AAAGGTCAAGATCCAGTC, which revealed an interesting pattern of similarity among the family members; for example, the repetitive sequences can be subdivided into 3 blocks, each of which consists of several 18-nucleotide repeating units (blocks A, B, and C in Fig. 6). pgWAP-1 and pgWAP-2 have block A in common; pgWAP-1 and pgWAP-3, a part of block A; and pgWAP-2 and pgWAP-3, block B and a part of block C. To obtain some insight into how these blocks of repeating sequences were generated, we constructed a phylogenetic tree (dendrogram) of the three porcine WAP family genes based on their nucleotide sequence similarities and tried to locate, on the dendrogram, the points of insertion or deletion of the blocks (Fig. 6A). The following series of events may have occurred. 1) Block A was inserted before the first duplication of the ancestor gene; 2) block B was inserted after the first but before the second duplication that produced pgWAP-2 and pgWAP-3; 3) block C was inserted following the second duplication; and 4) block D was deleted after the second duplication from the pgWAP-3 trait. The insertion of block C is considered to be a relatively recent event since the repeating unit in the block is most highly conserved.

SINE in Intron 2

An approximately 250-bp porcine SINE, also known as PRE-1(29, 30, 31, 32, 33) , was found in the second intron of each elafin family gene ( Fig. 1and Fig. 2). The three SINE sequences, inserted in the opposite direction, are highly homologous and flanked by a 9-bp direct repeat, GAGTGTTTG. They have an internal RNA polymerase III promoter with a typical A and B box consensus, a property common to SINEs (34) .

Tissue-specific Expression

To define the localization of the elafin family members, we performed an RNase protection assay using P-labeled antisense RNA probes representing the variable regions of exon 2 and total RNA preparations from the following porcine tissues: cerebrum, cerebellum, trachea, lung, atrium, stomach, duodenum, small intestine, and large intestine. After hybridization of the RNA preparations with the antisense probes, the samples were digested with RNases A and T(1), and the protected fragments were analyzed by gel electrophoresis. The elafin family members exhibited quite distinct tissue distributions (Fig. 7); for example, the pWAP-1 (elafin) message is abundantly expressed in the trachea and large intestine, whereas that of pWAP-2 appears to be confined in the small intestine. pWAP-3 was expressed at relatively low levels in the large intestine.


Figure 7: Tissue distribution of porcine elafin family members determined by RNase protection analysis. Total RNA (5 µg) from the indicated porcine tissues were annealed with P-labeled cRNA probes specific for the porcine WAP-1, WAP-2, and WAP-3 messages and digested with RNases A and T(1). The protected fragments were then analyzed by electrophoresis and fluorography. The autoradiogram represents an exposure of 5 days.




DISCUSSION

In the present paper, we reported that the porcine elafin family genes show an exceptionally high rate of nonsynonymous nucleotide substitutions in a narrow region encoding the mature protein sequence; the overall sequence similarity outside this narrow region is very high including the intron sequences. A similar case has been reported for the genes of snake venom phospholipase A(2) isozymes(18, 19) . Among the reports describing accelerated evolution, the above two reports are unique in demonstrating an unexpectedly high conservation of introns since the evolution rates of introns are generally considered to be much greater than those of exons(35) ; the other studies concerning 1) alpha(1)-proteinase inhibitor or alpha(1)-antitrypsin, a major plasma proteinase inhibitor (15, 16, 36) and 2) plant thionins (17) have demonstrated greater substitution rates in the reactive center region based on the protein or cDNA sequences. For example, in the case of the plasma proteinase inhibitor alpha(1)-antitrypsin, certain species such as mouse(16) , guinea pig(37, 38) , and rabbit(36) , unlike humans who have a single gene(39) , have been suggested, by cDNA cloning, to have multiple genes that are very similar in the overall exon sequences (97-98% identity) but show a strikingly high level of sequence divergence at a short segment coding for a 9-amino acid stretch encompassing the reactive center. Such an unusual kind of divergence has been interpreted as a result of positive Darwinian evolution, thereby creating new reactive site sequences with varying specificities to cope with an increasing number of attacking proteinases. The localization of the porcine elafin family members in the trachea and intestine (Fig. 7) is consistent with this view since such regions are exposed to a variety of infectious agents such as bacteria and parasites. Another explanation for the divergence in the narrow region is the neutralist theory which suggests that whenever an exceptionally high rate of substitutions is encountered in molecular evolution, we should suspect loss of constraint that allows previously harmful mutants to become selectively neutral(13) . The variable region is indeed located in a surface loop that is not crucial for maintaining the tertiary structure of the protein, so that mutations in this region are not selected against, as opposed to other parts of the protein(40, 41) . In the case of the neutralist concept, however, the question remains as to why the intron sequences of the snake venom phospholipase A(2) isozyme (18, 19) and porcine elafin gene families are conserved much more highly than those of the exons coding for the hypervariable regions. The first identification, reported here, in the mammalian genomes of a family of genes that are almost identical including the intron sequences except for a short stretch of exons where mutational burst is observed should contribute to a better insight into the molecular mechanism of evolutionary manifold. Furthermore, the presence of short interspersed repetitive elements (SINEs) in the conserved introns may make the genes suitable for such analyses of diversification of duplicated genes as discussed below.

Eukaryotic genomes contain repeated sequence elements dispersed to thousands of locations. SINEs are the most abundant of such elements and constitute 1-5% of the total mass of DNA. Each species has its own SINEs; for example, the human SINE, Alu family, and the porcine SINE, PRE-1, are quite different. This fact and their common properties such as the presence of an internal RNA polymerase III promoter, an A-rich tract at the 3` side, and a direct terminal repeat at the 5` and 3` ends strongly suggest that SINEs are retroposons dispersed through an RNA intermediate after speciation(34) . SINEs are therefore useful markers for estimating relative dates of gene duplications and for constructing gene phylogenies(42) . In the present study, SINEs were detected in the second introns of the porcine elafin gene family. In the human elafin gene, such a sequence is not present (Fig. 1)(22) . This fact suggests that duplications of the porcine elafin gene and evolution, by the following diversification, of the three genes cloned here occurred after the speciation of porcines and humans.


FOOTNOTES

*
This work was supported by a grant-in-aid for developmental scientific research from the Ministry of Education, Science, Sports and Culture of Japan. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore by hereby marked ``advertisement'' in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

The nucleotide sequence(s) reported in this paper has been submitted to the GenBank(TM)/EMBL Data Bank with accession number(s) D50319[GenBank]-D50323[GenBank].

§
To whom correspondence should be addressed. Tel: 81-45-924-5726, Fax: 81-45-924-5824 (or 5805).

(^1)
The abbreviations used are: kb, kilobase(s) or kilobase pair(s) bp, base pair(s); pgWAP, porcine gene encoding an elafin family member or a WAP motif protein; pWAP, porcine WAP motif protein; pWap, inhibitor domain of porcine WAP motif protein; SINE, short interspersed repetitive element.


ACKNOWLEDGEMENTS

We thank Norihiro Okada, Takeshi Katafuchi, and Hiromi Hagiwara for discussion and Setuko Satoh and Kazuko Tanaka for secretarial and technical assistance.


REFERENCES

  1. Wiedow, O., Schroder, J. M., Gregory, H., Young, J. A., and Christophers, E. (1990) J. Biol. Chem. 265, 14791-14795 [Abstract/Free Full Text]
  2. Molhuizen, H. O., Alkemade, H. A. C., Zeeuwen, P. L. J. M., de Jongh, G. J., Wieringa, B., and Schalkwijk, J. (1993) J. Biol. Chem. 268, 12028-12032 [Abstract/Free Full Text]
  3. Nara, K., Ito, S., Ito, T., Suzuki, Y., Ghoneim, M. A., Tachibana, S., and Hirose, S. (1994) J. Biochem. (Tokyo) 115, 441-448 [Abstract]
  4. Steinert, P. M., and Marekov, L. N. (1995) J. Biol. Chem. 270, 17702-17711 [Abstract/Free Full Text]
  5. Schalkwijk, J., de Roo, C., and de Jongh, G. J. (1991) Biochim. Biophys. Acta 1096, 148-154 [Medline] [Order article via Infotrieve]
  6. Molhuizen, H. O., and Schalkwijk, J. (1995) Biol. Chem. Hoppe Seyler 376, 1-7 [Medline] [Order article via Infotrieve]
  7. Sallenave, J. M., Marsden, M. D., and Ryle, A. P. (1992) Biol. Chem. Hoppe Seyler 373, 27-33 [Medline] [Order article via Infotrieve]
  8. Greenberg, C. S., Birckbichler, P. J., and Rice, R. H. (1991) FASEB J. 5, 3071-3077 [Abstract/Free Full Text]
  9. Ichinose, A., Bottenus, R. E., and Davie, E. W. (1990) J. Biol. Chem. 265, 13411-13414 [Free Full Text]
  10. Aeschlimann, D., and Paulsson, M. (1994) Thromb. Haemostasis 71, 402-415 [Medline] [Order article via Infotrieve]
  11. Bairoch, A., and Bucher, P. (1994) Nucleic Acids Res. 22, 3583-3589 [Abstract]
  12. Goodman, M. (1981) Prog. Biophys. Mol. Biol. 37, 105-164
  13. Kimura, M. (1981) J. Mol. Evol. 17, 110-113 [Medline] [Order article via Infotrieve]
  14. Hill, R. E., Shaw, P. H., Boyd, P. A., Baumann, H., and Hastie, N. D. (1984) Nature 311, 175-177 [CrossRef][Medline] [Order article via Infotrieve]
  15. Hill, R. E., and Hastie, N. D. (1987) Nature 326, 96-99 [CrossRef][Medline] [Order article via Infotrieve]
  16. Borriello, F., and Krauter, K. S. (1991) Proc. Natl. Acad. Sci. U. S. A. 88, 9417-9421 [Abstract]
  17. Castagnaro, A., Marana, C., Carbonero, P., and Garcia-Olmedo, F. (1992) J. Mol. Biol. 224, 1003-1009 [Medline] [Order article via Infotrieve]
  18. Nakashima, K., Ogawa, T., Oda, N., Hattori, M., Sakaki, Y., Kihara, H., and Ohno, M. (1993) Proc. Natl. Acad. Sci. U. S. A. 90, 5964-5968 [Abstract]
  19. Nakashima, K., Nobuhisa, I., Deshimaru, M., Nakai, M., Ogawa, T., Shimohigashi, Y., Fukumaki, Y., Hattori, M., Sakaki, Y., Hattori, S., and Ohono, M. (1995) Proc. Natl. Acad. Sci. U. S. A. 92, 5605-5609 [Abstract]
  20. Ito, O., Araki, K., Kuroki, J., Kuwada, M., and Tachibana, S. (1990) Peptide Chemistry 1989, 353-358
  21. Tsunemi, M., Kato, H., Nishiuchi, Y., Kumagaye, S., and Sakakibara, S. (1992) Biochem. Biophys. Res. Commun. 185, 967-973 [Medline] [Order article via Infotrieve]
  22. Saheki, T., Ito, F., Hagiwara, H., Saito, Y., Kuroki, J., Tachibana, S., and Hirose, S. (1992) Biochem. Biophys. Res. Commun. 185, 240-245 [Medline] [Order article via Infotrieve]
  23. Sanger, F., Nicklen, S., and Coulson, A. R. (1977) Proc. Natl. Acad. Sci. U. S. A. 74, 5463-5467 [Abstract]
  24. Pearson, W. R., and Lipman, D. J. (1988) Proc. Natl. Acad. Sci. U. S. A. 85, 2444-2448 [Abstract]
  25. Altschul, S. F., Gish, W., Miller, W., Myers, E. W., and Lipman, D. J. (1990) J. Mol. Biol. 215, 403-410 [CrossRef][Medline] [Order article via Infotrieve]
  26. Jin, L., and Nei, M. (1990) Mol. Biol. Evol. 7, 82-102 [Abstract]
  27. Nei, M., and Gojobori, T. (1986) Mol. Biol. Evol. 3, 418-426 [Abstract]
  28. Chomczynski, P., and Sacchi, N. (1987) Anal. Biochem. 162, 156-159 [CrossRef][Medline] [Order article via Infotrieve]
  29. Singer, D. S., Parent, L. J., and Ehrlich, R. (1987) Nucleic Acids Res. 15, 2780 [Medline] [Order article via Infotrieve]
  30. Singer, M. F. (1982) Cell 28, 433-434 [Medline] [Order article via Infotrieve]
  31. Takahashi, H., Awata, T., and Yasue, H. (1992) Anim. Genet. 23, 443-448 [Medline] [Order article via Infotrieve]
  32. Grütter, M. G., Fendrich, G., Huber, R., and Bode, W. (1988) EMBO J. 7, 345-351 [Abstract]
  33. Sugiura, N., Hagiwara, H., and Hirose, S. (1992) J. Biol. Chem. 267, 18067-18072 [Abstract/Free Full Text]
  34. Weiner, A. M., Deininger, P. L., and Efstratiadis, A. (1986) Annu. Rev. Biochem. 55, 631-661 [CrossRef][Medline] [Order article via Infotrieve]
  35. Kimura, M. (1983) The Neutral Theory of Molecular Evolution , Cambridge University Press, Cambridge
  36. Ray, B. K., Gao, X., and Ray, A. (1994) J. Biol. Chem. 269, 22080-22086 [Abstract/Free Full Text]
  37. Suzuki, Y., Yoshida, K., Honda, E., and Sinohara, H. (1991) J. Biol. Chem. 266, 928-932 [Abstract/Free Full Text]
  38. Suzuki, Y., Yoshida, K., Ichimiya, T., Yamamoto, T., and Sinohara, H. (1990) J. Biochem. (Tokyo) 107, 173-179 [Abstract]
  39. Long, G. L., Chandra, T., Woo, S. L. C., Davie, E. W., and Kurachi, K. (1984) Biochemistry 23, 4828-4837 [Medline] [Order article via Infotrieve]
  40. Braciak, T. A., Northemann, W., Hudson, G. O., Shiels, B. R., Gehring, M. R., and Fey, G. H. (1988) J. Biol. Chem. 263, 3999-4012 [Abstract/Free Full Text]
  41. Sottrup Jensen, L., Sand, O., Kristensen, L., and Fey, G. H. (1989) J. Biol. Chem. 264, 15781-15789 [Abstract/Free Full Text]
  42. Okada, N. (1991) Curr. Opin. Genet. Dev. 1, 498-504 [Medline] [Order article via Infotrieve]
  43. Araki, K., Kuwada, M., Ito, O., Kuroki, J., and Tachibana, S. (1990) Biochem. Biophys. Res. Commun. 172, 42-46 [Medline] [Order article via Infotrieve]
  44. Araki, K., Kuroki, J., Ito, O., Kuwada, M., and Tachibana, S. (1989) Biochem. Biophys. Res. Commun. 164, 496-502 [Medline] [Order article via Infotrieve]
  45. Kuroki, J., Hosoya, T., Itakura, M., Hirose, S., Tamechika, I., Yoshimoto, T., Ghoneim, M. A., Nara, K., Kato, A., Suzuki, Y., Furukawa, M., and Tachibana, S. (1995) J. Biol. Chem. 270, 22428-22433 [Abstract/Free Full Text]
  46. Seemüller, U., Arnhold, M., Fritz, H., Wiedenmann, K., Machleidt, W., Heinzel, R., Appelhans, H., Gassen, H., and Lottspeich, F. (1986) FEBS Lett. 199, 43-48 [CrossRef][Medline] [Order article via Infotrieve]

©1996 by The American Society for Biochemistry and Molecular Biology, Inc.