Copyright ©The Histochemical Society, Inc.


BRIEF REPORT

Proteome Analysis of Maternal Serum Samples for Trisomy 21 Pregnancies Using ProteinChip Arrays and Bioinformatics

Anne Busch, Susanne Michel, Constance Hoppe, Dominik Driesch, Uwe Claussen and Ferdinand von Eggeling

Core Unit Chip Application (CUCA), Institute of Human Genetics and Anthropology, Friedrich Schiller University (AB,SM,CH,UC,FvE), and BioControl (DD), Jena, Germany

Correspondence to: Ferdinand von Eggeling, Core Unit Chip Application (CUCA), Inst. of Human Genetics and Anthropology, Friedrich Schiller University, D-07740 Jena, Germany. E-mail: fegg{at}mti.uni-jena.de


    Summary
 Top
 Summary
 Literature Cited
 
A surface-enhanced laser desorption/ionization time of flight (SELDI-TOF)-based ProteinChip System was used as a tool for rapid discovery and identification of protein patterns in serum that discriminate between trisomy 21 and unaffected pregnancies. We analyzed 24 serum samples from women carrying a trisomy 21 pregnancy and 32 with an unaffected pregnancy, ranging from 10.0 to 14.0 weeks of gestation. The resulting protein profiles were submitted to a clustering algorithm, a rule extraction, a rating, and a rule base construction step. For the generated combined rule base, the specificity and sensitivity for the prediction of a trisomy 21 pregnancy reach 97% and 91%, respectively.

(J Histochem Cytochem 53:341343, 2005)

Key Words: trisomy 21 • first semester screening • ProteinChip arrays • SELDI • serum • prenatal diagnosis • bioinformatics

IN 1959 it was discovered that Down syndrome is caused by the presence of three copies of chromosome 21. Trisomy 21 is the most common aneuploidy, which affects up to 1 in 700 live births (Roizen and Patterson 2003Go). Prenatal detection of trisomy 21 fetuses became possible in the mid-1960s by amniocentesis (Steele and Breg 1966Go) and later by chorionic villous sampling (Simoni et al. 1983Go), but these procedures are associated with a 0.2–3% fetal loss rate. Therefore, prenatal diagnosis for the detection of chromosomal aberrations with these techniques is restricted to high-risk pregnancies (CEMAT Group 1998Go). The triple test combining the levels of {alpha}-fetoprotein, human chorionic gonadotropin, and unconjugated oestriol in maternal serum has a 69% detection rate and a 5% false-positive rate (Yaron and Mashiach 2001Go). The combined use of maternal serum screening (free human ß-chorionic gonadotropin and pregnancy-associated plasma protein A) and ultrasonographic measurement of fetal nuchal translucency has an approximately 85% detection rate, with a false-positive rate of 5% (Wapner et al. 2003Go). At present, the presence or absence of the fetal nasal bone is investigated as an additional ultrasound marker for more precise screening (Cicero et al. 2003Go).

In the work presented here, we tried to find differentially expressed protein profiles by ProteinChip technology in maternal serum samples, which constitute an easy way to handle noninvasive first-trimester screening for trisomy 21 pregnancies.

Our study population consisted of 56 maternal serum samples, taken between weeks 10.0 and 14.0 of gestation. Twenty-four of these serum samples were from women carrying a fetus with trisomy 21 and 32 were from women carrying a fetus with a normal karyotype. The chromosomal status of the fetus was confirmed by amniocentesis or chorionic villous sampling. Untreated whole blood was collected and allowed to clot at room temperature. Serum was purified from blood by centrifugation at 1500 x g for 10 min, shock-frozen and stored at –80C. Serum samples were not subjected to more than two freeze-thaw cycles before the assay.

For analysis, the proteomic technique SELDI-MS (surface-enhanced laser desorption/ionization-mass spectrometry)-based ProteinChip technology was used (Melle et al. 2003Go). This technology makes use of affinity surfaces to retain proteins on the basis of their physicochemical characteristics, followed by direct analysis by time-of-flight mass spectrometry (TOF-MS). Proteins retained on chromatographic surfaces can be easily purified from contaminants such as buffer salts or detergents, thus eliminating the need for pre-separation techniques as are required for other MS techniques. For higher disintegration we used ProteinChip array types with two different surface chemistries: SAX2 as a strong anion exchanger and H50 as a reverse-phase or hydrophobic interaction array. For SAX2 arrays, the binding buffer consisted of 100 mM Tris (pH 8.5) containing 0.02% Triton-100. For H50 arrays, it was 30% acetonitrile and 0.1% triflouracetic acid in water. After equilibration, the buffer was removed and 3.5 µl of fresh buffer and 1.5 µl serum were added to each spot. The array was incubated in a humidity chamber for 90 min at 20C. Then each target was washed three times with 5 µl binding buffer, washed two times with 5 µl water to remove the buffer salts, and subsequently air-dried. Next, 2 x 0.5 µl saturated sinapinic acid dissolved in 50% acetonitrile containing 0.5% trifluoroacetic acid (all from Sigma; St Louis, MO) was applied to each spot. Mass analysis was performed in the ProteinChip PBSII instrument according to an automated data collection protocol. Spectra were analyzed using the ProteinChip software (version 3.1).

The resulting protein profiles were subjected to a cluster- and rule-based data mining algorithm. The data analysis process consists of a clustering step, a rule extraction and rating step, and a rule base construction step. All these steps, except for the clustering step, are supervised with respect to the given sample classification (e.g., trisomy 21 vs unaffected). The data were log 2-transformed before the normalization procedure and were then normalized to the median. The preprocessed protein expressions were then clustered into two clusters, "low expressed" and "high expressed," for each peak using a modified fuzzy c-means algorithm (Bezdek 1981Go), in which samples with a protein expression below the lower cluster center were fully assigned to the cluster "low expressed" and samples with a protein expression above the higher cluster center were fully assigned to the cluster "high expressed." The main objective of the next step, the rule extraction and rating step, is to find those features (peaks) that have the most similar membership distribution compared with the given sample class distribution. For this purpose, so-called rules are generated and rated by a statistically based rule rating measure introduced by Kiendl and Krabs (1989)Go. Such rules can be written as follows: IF sample belongs to cluster "peak X high expressed," THEN sample belongs to class "trisomy 21 pregnancy." Rules with a rule rating measure greater than zero were then ranked in a rule list.

A small subset of rules out of the rule list can form a rule base that can be used for automatic classification of new patient samples. A rule base contains at least one rule for every possible classification outcome. To classify a new patient sample, the cluster memberships (condition part of the rules) of all rules out of the rule base that point to the same classification outcome (conclusion part of the rules) are added and the sample is assigned to the class with the highest sum. To construct a rule base that gives a good representation of the dataset investigated, all combinations of rules out of the rule lists were permuted and the rule base with the smallest classification error and the smallest number of rules was chosen. The rules contained in the chosen rule base can be considered to represent markers that can distinguish between the sample classes under investigation.

Analysis resulted in protein profiles for 56 patients for both ProteinChip Array surfaces. From these 56 profile sets, 24 were from women carrying a fetus with a trisomy 21 and 32 with a normal karyotype. Peaks were defined with a signal-to-noise ratio (S/N) greater than 5. Clusters were completed with estimated peaks added greater than 2 S/N, with a cluster mass window of 0.3% of mass. For the above-mentioned conditions in the range from 1.5 to 20 kD, 21 peaks for H50 arrays and 69 for SAX2 arrays could be found. In the range of 10 to 200 kD, 72 peaks for H50 arrays were found and 38 peaks for SAX2 arrays. The consideration of single peaks led to no sufficient discrimination between unaffected or trisomy 21 pregnancies. Moreover, we could not find a correlation of biomarkers within gestational age, although this is known for other serum markers (Wapner et al. 2003Go). The cluster- and rule-based data mining method described above revealed six peaks whose combination is relevant to distinguish between unaffected or trisomy 21 pregnancies (Table 1). These peaks are equally distributed between the two chip surfaces and between the criteria unaffected vs trisomy 21. They have a size ranging from 7.9 kD to 96.7 kD. The cluster borders are given in Table 2. One condition, for example, is if the expression at peak 34 is higher then 0.187 on H50 ProteinChip Array the sample belongs to a serum sample from a woman carrying a fetus with a normal karyotype. After combining all six generated rules (Table 1), it was possible to identify 24 samples of the trisomy 21 pregnancies, with a wrong prediction for two cases. Therefore, the calculated sensitivity is 91%. Of 32 unaffected pregnancies, one was predicted false-positive, resulting in a specificity of 97%. With this result, we reached an equal or higher sensitivity and specificity than ProteinChip System-based studies on cancer (Kozak et al. 2003Go; Won et al. 2003Go), also compared with other prenatal screening methods (Wapner et al. 2003Go) or to other testing based on fetal DNA in maternal serum (Lo 2000Go).


View this table:
[in this window]
[in a new window]
 
Table 1

Combined rule base for H50 and SAX2 chip surfaces to distinguish between a trisomy 21 and an unaffected pregnancya

 

View this table:
[in this window]
[in a new window]
 
Table 2

Cluster borders for rule base given in Table 1

 
In conclusion, we have established a promising procedure combining ProteinChip technology and rule-based analysis that allows a first-trimester screening for trisomy 21 with a high sensitivity and specificity. The specific proteomic signatures described here, even without the knowledge of the respective proteins, can serve as an additional parameter for a more reliable detection of trisomy 21 pregnancies.


    Acknowledgments
 
Supported by a grant from the German Federal Ministry of Education and Research (BMBF) and in part by the EU (ICA2-CT-2000-10012).

We thank Dr A. Kossakiewicz, Dr P. Kozlowski, Dr M. Pruggmayer, and Dr B. Eiben for their contributions of serum samples.


    Footnotes
 
Presented in part at the 14th Workshop on Fetal Cells and Fetal DNA: Recent Progress in Molecular Genetic and Cytogenetic Investigations for Early Prenatal and Postnatal Diagnosis, Friedrich Schiller University, Jena, Germany, April 17–18, 2004.

Received for publication May 7, 2004; accepted August 5, 2004


    Literature Cited
 Top
 Summary
 Literature Cited
 

Bezdek JC (1981) Pattern Recognition with Fuzzy Objective Function Algorithms. New York, Plenum Press

CEMAT Group (1998) Randomised trial to assess safety and fetal outcome of early and midtrimester amniocentesis. The Canadian Early and Mid-trimester Amniocentesis Trial (CEMAT) Group. Lancet 351:242–247[CrossRef][Medline]

Cicero S, Bindra R, Rembouskos G, Spencer K, Nicolaides KH (2003) Integrated ultrasound and biochemical screening for trisomy 21 using fetal nuchal translucency, absent fetal nasal bone, free beta-hCG and PAPP-A at 11 to 14 weeks. Prenat Diagn 23:306–310[CrossRef][Medline]

Kiendl H, Krabs M (1989) Ein Verfahren zur Generierung regelbasierter Modelle für dynamische Systeme. Automatisierungstechnik 37:423–430

Kozak KR, Amneus MW, Pusey SM, Su F, Luong MN, Luong SA, Reddy ST, et al. (2003) Identification of biomarkers for ovarian cancer using strong anion-exchange ProteinChips: potential use in diagnosis and prognosis. Proc Natl Acad Sci USA 100:12343–12348[Abstract/Free Full Text]

Lo YM (2000) Fetal DNA in maternal plasma: biology and diagnostic applications. Clin Chem 46:1903–1906[Abstract/Free Full Text]

Melle C, Ernst G, Schimmel B, Bleul A, Koscielny S, Wiesner A, Bogumil R, Moller U, et al. (2003) Biomarker discovery and identification in laser microdissected head and neck squamous cell carcinoma with ProteinChip(R) technology, Two-dimensional gel electrophoresis, tandem mass spectrometry, and immunohistochemistry. Mol Cell Proteomics 2:443–452[Abstract/Free Full Text]

Roizen NJ, Patterson D (2003) Down's syndrome. Lancet 361:1281–1289[CrossRef][Medline]

Simoni G, Brambati B, Danesino C, Rossella F, Terzoli GL, Ferrari M, Fraccaro M (1983) Efficient direct chromosome analyses and enzyme determinations from chorionic villi samples in the first trimester of pregnancy. Hum Genet 63:349–357[Medline]

Steele MW, Breg WR Jr (1966) Chromosome analysis of human amniotic-fluid cells. Lancet 1:383–385[CrossRef][Medline]

Wapner R, Thom E, Simpson JL, Pergament E, Silver R, Filkins K, Platt L, et al. (2003) First-trimester screening for trisomies 21 and 18. N Engl J Med 349:1405–1413[Abstract/Free Full Text]

Won Y, Song HJ, Kang TW, Kim JJ, Han BD, Lee SW (2003) Pattern analysis of serum proteome distinguishes renal cell carcinoma from other urologic diseases and healthy persons. Proteomics 3:2310–2316[CrossRef][Medline]

Yaron Y, Mashiach R (2001) First-trimester biochemical screening for Down syndrome. Clin Perinatol 28:321–331[Medline]