BRIEF REPORT |
Proteome Analysis of Maternal Serum Samples for Trisomy 21 Pregnancies Using ProteinChip Arrays and Bioinformatics
Core Unit Chip Application (CUCA), Institute of Human Genetics and Anthropology, Friedrich Schiller University (AB,SM,CH,UC,FvE), and BioControl (DD), Jena, Germany
Correspondence to: Ferdinand von Eggeling, Core Unit Chip Application (CUCA), Inst. of Human Genetics and Anthropology, Friedrich Schiller University, D-07740 Jena, Germany. E-mail: fegg{at}mti.uni-jena.de
![]() |
Summary |
---|
![]() ![]() ![]() |
---|
(J Histochem Cytochem 53:341343, 2005)
Key Words: trisomy 21 first semester screening ProteinChip arrays SELDI serum prenatal diagnosis bioinformatics
IN 1959 it was discovered that Down syndrome is caused by the presence of three copies of chromosome 21. Trisomy 21 is the most common aneuploidy, which affects up to 1 in 700 live births (Roizen and Patterson 2003). Prenatal detection of trisomy 21 fetuses became possible in the mid-1960s by amniocentesis (Steele and Breg 1966
) and later by chorionic villous sampling (Simoni et al. 1983
), but these procedures are associated with a 0.23% fetal loss rate. Therefore, prenatal diagnosis for the detection of chromosomal aberrations with these techniques is restricted to high-risk pregnancies (CEMAT Group 1998
). The triple test combining the levels of
-fetoprotein, human chorionic gonadotropin, and unconjugated oestriol in maternal serum has a 69% detection rate and a 5% false-positive rate (Yaron and Mashiach 2001
). The combined use of maternal serum screening (free human ß-chorionic gonadotropin and pregnancy-associated plasma protein A) and ultrasonographic measurement of fetal nuchal translucency has an approximately 85% detection rate, with a false-positive rate of 5% (Wapner et al. 2003
). At present, the presence or absence of the fetal nasal bone is investigated as an additional ultrasound marker for more precise screening (Cicero et al. 2003
).
In the work presented here, we tried to find differentially expressed protein profiles by ProteinChip technology in maternal serum samples, which constitute an easy way to handle noninvasive first-trimester screening for trisomy 21 pregnancies.
Our study population consisted of 56 maternal serum samples, taken between weeks 10.0 and 14.0 of gestation. Twenty-four of these serum samples were from women carrying a fetus with trisomy 21 and 32 were from women carrying a fetus with a normal karyotype. The chromosomal status of the fetus was confirmed by amniocentesis or chorionic villous sampling. Untreated whole blood was collected and allowed to clot at room temperature. Serum was purified from blood by centrifugation at 1500 x g for 10 min, shock-frozen and stored at 80C. Serum samples were not subjected to more than two freeze-thaw cycles before the assay.
For analysis, the proteomic technique SELDI-MS (surface-enhanced laser desorption/ionization-mass spectrometry)-based ProteinChip technology was used (Melle et al. 2003). This technology makes use of affinity surfaces to retain proteins on the basis of their physicochemical characteristics, followed by direct analysis by time-of-flight mass spectrometry (TOF-MS). Proteins retained on chromatographic surfaces can be easily purified from contaminants such as buffer salts or detergents, thus eliminating the need for pre-separation techniques as are required for other MS techniques. For higher disintegration we used ProteinChip array types with two different surface chemistries: SAX2 as a strong anion exchanger and H50 as a reverse-phase or hydrophobic interaction array. For SAX2 arrays, the binding buffer consisted of 100 mM Tris (pH 8.5) containing 0.02% Triton-100. For H50 arrays, it was 30% acetonitrile and 0.1% triflouracetic acid in water. After equilibration, the buffer was removed and 3.5 µl of fresh buffer and 1.5 µl serum were added to each spot. The array was incubated in a humidity chamber for 90 min at 20C. Then each target was washed three times with 5 µl binding buffer, washed two times with 5 µl water to remove the buffer salts, and subsequently air-dried. Next, 2 x 0.5 µl saturated sinapinic acid dissolved in 50% acetonitrile containing 0.5% trifluoroacetic acid (all from Sigma; St Louis, MO) was applied to each spot. Mass analysis was performed in the ProteinChip PBSII instrument according to an automated data collection protocol. Spectra were analyzed using the ProteinChip software (version 3.1).
The resulting protein profiles were subjected to a cluster- and rule-based data mining algorithm. The data analysis process consists of a clustering step, a rule extraction and rating step, and a rule base construction step. All these steps, except for the clustering step, are supervised with respect to the given sample classification (e.g., trisomy 21 vs unaffected). The data were log 2-transformed before the normalization procedure and were then normalized to the median. The preprocessed protein expressions were then clustered into two clusters, "low expressed" and "high expressed," for each peak using a modified fuzzy c-means algorithm (Bezdek 1981), in which samples with a protein expression below the lower cluster center were fully assigned to the cluster "low expressed" and samples with a protein expression above the higher cluster center were fully assigned to the cluster "high expressed." The main objective of the next step, the rule extraction and rating step, is to find those features (peaks) that have the most similar membership distribution compared with the given sample class distribution. For this purpose, so-called rules are generated and rated by a statistically based rule rating measure introduced by Kiendl and Krabs (1989)
. Such rules can be written as follows: IF sample belongs to cluster "peak X high expressed," THEN sample belongs to class "trisomy 21 pregnancy." Rules with a rule rating measure greater than zero were then ranked in a rule list.
A small subset of rules out of the rule list can form a rule base that can be used for automatic classification of new patient samples. A rule base contains at least one rule for every possible classification outcome. To classify a new patient sample, the cluster memberships (condition part of the rules) of all rules out of the rule base that point to the same classification outcome (conclusion part of the rules) are added and the sample is assigned to the class with the highest sum. To construct a rule base that gives a good representation of the dataset investigated, all combinations of rules out of the rule lists were permuted and the rule base with the smallest classification error and the smallest number of rules was chosen. The rules contained in the chosen rule base can be considered to represent markers that can distinguish between the sample classes under investigation.
Analysis resulted in protein profiles for 56 patients for both ProteinChip Array surfaces. From these 56 profile sets, 24 were from women carrying a fetus with a trisomy 21 and 32 with a normal karyotype. Peaks were defined with a signal-to-noise ratio (S/N) greater than 5. Clusters were completed with estimated peaks added greater than 2 S/N, with a cluster mass window of 0.3% of mass. For the above-mentioned conditions in the range from 1.5 to 20 kD, 21 peaks for H50 arrays and 69 for SAX2 arrays could be found. In the range of 10 to 200 kD, 72 peaks for H50 arrays were found and 38 peaks for SAX2 arrays. The consideration of single peaks led to no sufficient discrimination between unaffected or trisomy 21 pregnancies. Moreover, we could not find a correlation of biomarkers within gestational age, although this is known for other serum markers (Wapner et al. 2003). The cluster- and rule-based data mining method described above revealed six peaks whose combination is relevant to distinguish between unaffected or trisomy 21 pregnancies (Table 1). These peaks are equally distributed between the two chip surfaces and between the criteria unaffected vs trisomy 21. They have a size ranging from 7.9 kD to 96.7 kD. The cluster borders are given in Table 2. One condition, for example, is if the expression at peak 34 is higher then 0.187 on H50 ProteinChip Array the sample belongs to a serum sample from a woman carrying a fetus with a normal karyotype. After combining all six generated rules (Table 1), it was possible to identify 24 samples of the trisomy 21 pregnancies, with a wrong prediction for two cases. Therefore, the calculated sensitivity is 91%. Of 32 unaffected pregnancies, one was predicted false-positive, resulting in a specificity of 97%. With this result, we reached an equal or higher sensitivity and specificity than ProteinChip System-based studies on cancer (Kozak et al. 2003
; Won et al. 2003
), also compared with other prenatal screening methods (Wapner et al. 2003
) or to other testing based on fetal DNA in maternal serum (Lo 2000
).
|
|
![]() |
Acknowledgments |
---|
We thank Dr A. Kossakiewicz, Dr P. Kozlowski, Dr M. Pruggmayer, and Dr B. Eiben for their contributions of serum samples.
![]() |
Footnotes |
---|
Received for publication May 7, 2004; accepted August 5, 2004
![]() |
Literature Cited |
---|
![]() ![]() ![]() |
---|
Bezdek JC (1981) Pattern Recognition with Fuzzy Objective Function Algorithms. New York, Plenum Press
CEMAT Group (1998) Randomised trial to assess safety and fetal outcome of early and midtrimester amniocentesis. The Canadian Early and Mid-trimester Amniocentesis Trial (CEMAT) Group. Lancet 351:242247[CrossRef][Medline]
Cicero S, Bindra R, Rembouskos G, Spencer K, Nicolaides KH (2003) Integrated ultrasound and biochemical screening for trisomy 21 using fetal nuchal translucency, absent fetal nasal bone, free beta-hCG and PAPP-A at 11 to 14 weeks. Prenat Diagn 23:306310[CrossRef][Medline]
Kiendl H, Krabs M (1989) Ein Verfahren zur Generierung regelbasierter Modelle für dynamische Systeme. Automatisierungstechnik 37:423430
Kozak KR, Amneus MW, Pusey SM, Su F, Luong MN, Luong SA, Reddy ST, et al. (2003) Identification of biomarkers for ovarian cancer using strong anion-exchange ProteinChips: potential use in diagnosis and prognosis. Proc Natl Acad Sci USA 100:1234312348
Lo YM (2000) Fetal DNA in maternal plasma: biology and diagnostic applications. Clin Chem 46:19031906
Melle C, Ernst G, Schimmel B, Bleul A, Koscielny S, Wiesner A, Bogumil R, Moller U, et al. (2003) Biomarker discovery and identification in laser microdissected head and neck squamous cell carcinoma with ProteinChip(R) technology, Two-dimensional gel electrophoresis, tandem mass spectrometry, and immunohistochemistry. Mol Cell Proteomics 2:443452
Roizen NJ, Patterson D (2003) Down's syndrome. Lancet 361:12811289[CrossRef][Medline]
Simoni G, Brambati B, Danesino C, Rossella F, Terzoli GL, Ferrari M, Fraccaro M (1983) Efficient direct chromosome analyses and enzyme determinations from chorionic villi samples in the first trimester of pregnancy. Hum Genet 63:349357[Medline]
Steele MW, Breg WR Jr (1966) Chromosome analysis of human amniotic-fluid cells. Lancet 1:383385[CrossRef][Medline]
Wapner R, Thom E, Simpson JL, Pergament E, Silver R, Filkins K, Platt L, et al. (2003) First-trimester screening for trisomies 21 and 18. N Engl J Med 349:14051413
Won Y, Song HJ, Kang TW, Kim JJ, Han BD, Lee SW (2003) Pattern analysis of serum proteome distinguishes renal cell carcinoma from other urologic diseases and healthy persons. Proteomics 3:23102316[CrossRef][Medline]
Yaron Y, Mashiach R (2001) First-trimester biochemical screening for Down syndrome. Clin Perinatol 28:321331[Medline]