Inter-individual variability in the morphological assessment of human sperm: effect of the level of experience and the use of standard methods

F. Eustache and J. Auger1

Service de Biologie de la Reproduction-CECOS, Hôpital Cochin, 75014 Paris, France

1 To whom correspondence should be addressed at: Laboratoire d’Histologie, Biologie de la Reproduction, CECOS, and Groupe d’Etude de la Fertilité Humaine (GREFH), Université Paris V, Pavillon Cassini, Hôpital Cochin, 123 Bd de Port-Royal, 75014 Paris, France. e-mail: jacques.auger{at}cch.ap-hop-paris.fr


    Abstract
 Top
 Abstract
 Introduction
 Materials and methods
 Results
 Discussion
 References
 
BACKGROUND: The assessment of human sperm morphology is not devoid of methodological problems, making comparison among laboratories difficult. The main goal of this study was to assess how the experience and compliance to the recommended procedures can modulate it. METHODS: Sixty-two technicians and biologists of varying degrees of andrological experience classified the same 100 sperm in a blind study from images captured on a video microscope and projected onto a screen. David’s morphology classification which distinguishes normal and abnormal sperm and calculates a Multiple Anomalies Index (MAI) was used. These sperm were also classified by two trained and experienced biologists using videotaped images to provide reference values. RESULTS: The inter-participant coefficient of variation was 40% for normal sperm, lower for five defects and, 12% for MAI. Experienced participants had the closest results to the reference values. Moreover, participants not routinely using the recommended methodologies, regardless of their experience level, consistently showed marked differences compared with the reference values. CONCLUSIONS: The present data confirm the wide variability in the assessment of normal sperm and show a lower variability for several anomalies and MAI. They underscore the role that experience plays and the importance of following the recommended methodologies for relevant and comparable results.

Key words: multiple anomalies index/quality control/routine semen analysis


    Introduction
 Top
 Abstract
 Introduction
 Materials and methods
 Results
 Discussion
 References
 
During the past three decades, multiple techniques of sample staining and systems of classification to assess human sperm morphology have been proposed [see Mortimer and Menkveld, (2001)Go for review] without consensus on a method of a universal classification. Conscious of this problem, the editorial staff of the 4th edition of the World Health Organization (WHO) manual has recommended the use of a single method classifying essentially the morphologically normal sperm using strict criteria (Kruger et al., 1988Go; World Health Organization, 1999Go). Regardless of the morphological classification used, manual assessment of sperm cell morphology is visually subjective and creates intra- and inter- individual laboratory variability which can be assessed by internal and external quality control schemes (Neuwinger et al., 1990Go; Cooper et al., 1992Go; Matson, 1995Go; Auger et al., 2000Go; Franken et al., 2000Go). In the few external quality assessments (EQA) previously reported, samples of prepared semen were sent to the participating laboratories. In the present study, we invited technicians and biologists to participate in an EQA session to assess sperm morphology using high-resolution images of sperm projected directly from a video equipped microsocope. We could then evaluate their inter-individual variability in the recognition of the normal sperm and of the various sperm abnormalities according to the classification of David (David et al., 1975Go)—a classification used in the vast majority of French laboratories and proven useful from a clinical point of view (Jouannet et al., 1988Go; Auger et al., 2001Go; Slama et al., 2002Go)—and study the possible effect of their level of experience or of their compliance with the recommended technical procedures.


    Materials and methods
 Top
 Abstract
 Introduction
 Materials and methods
 Results
 Discussion
 References
 
Sixty-two participants attended a Quality Control (QC) workshop on sperm morphology assessment at the University of Nancy, in the east of France. The participants were laboratory technicians, biologists or physicians from 40 French laboratories and two from Luxembourg. The participants had various experience levels and used various methods for sperm morphology assessment (Table IAA and IBB).


View this table:
[in this window]
[in a new window]
 
Table IA. Level of experience of the participants
 

View this table:
[in this window]
[in a new window]
 
Table IB. Routine procedures used for sperm morphology assessment
 
A single semen sample with a high level of various morphological abnormalities was assessed. Briefly, the smear was prepared from a 10 µl drop, air-dried, fixed for 1 h with a mixture of absolute ethanol (2/3) and acetic acid (1/3), then stained by Shorr staining using an automatic stainer (Sakura DRS601, Bayer Diagnostics, Puteaux, France). The slide was put onto the stage of a BX50 microscope (Olympus, Rungis, France) equipped with a 100x objective and a XC-003P 3-CCD RGB colour camera (video output elements: 739 x 575; Sony, Clichy, France). The camera was connected to a VPL-FE110 video projector (resolution RGB: 1280 x 1024; Sony) giving a final high-resolution image. Before the workshop started, images were projected on the screen to check that the projected images of the sperm had adequate definition for their in-depth analysis and classification. It was found that a participant sitting in any of the first six rows of the hall and front of the screen had a proper angle of resolution and vision. The normal and abnormal sperm were classified according to the classification of David et al. (1975Go), modified after the report of Jouannet et al. (1988Go). The details of the classification have been recently described (Auger et al., 2001Go). Briefly, normal sperm and abnormal sperm categories are defined by strict and specific criteria and all abnormalities of each sperm are recorded which makes it possible to calculate the Multiple Anomalies Index, MAI (Jouannet et al., 1988Go; World Health Organization, 1999Go), the mean number of anomalies per abnormal sperm. Two hours before the session started, a 1 h lecture using numerous slides and figures summed up the standardized procedure for David classification. Each participant received a form with a grid to record all of the anomalies observed for each sperm cell. A total of 100 sperm was assessed, each spermatozoon chosen being tagged with the projected light arrow of the microscope. Each sperm cell selected was projected for 45 s (delay necessary for a novice participant to assess a spermatozoon with multiple defects as determined in previous internal QC) and concomitantly recorded on a video tape inserted in a Panasonic professional video recorder (AG-7350 Panasonic, Saint Denis la Plaine, France). After the workshop and before data analysis, two trained biologists (F.E. and J.A.) jointly classified the videotaped sperm. For the rare sperm for which there was a disagreement between both biologists (mainly those having size defects), measurements were made to correctly classify them. The overall variability in the assessment of normal and abnormal sperm was studied. We then estimated the overall variability for the assessment of the same characteristics in reference to the level of training of the participants in two subgroups defined according to the number of assessments made per year (<=520 per year, n = 45 versus >520 per year, n =17). We have also investigated if the compliance with the recommended methodologies (World Health Organization,1992Go) for making smears, staining and routinely assessing sperm morphology could have any influence on the results of the current assessment. Therefore, we have studied the differences between participants and reference values for the normal and abnormal sperm cells for three groups of participants. The first group was composed of participants assessing more than 520 slides per year for at least 2 years and routinely using the WHO recommended methodologies (World Health Organization, 1992Go). The second group was composed of participants with less experience (<2 years, whatever the number of assessments per year) but who routinely used the recommended methodologies as for the first group. The third group was composed of the other participants who do not use routinely the recommended methodologies, irrespective of their experience level. Inter-participant variability in the assessment of sperm morphology was expressed as the coefficient of variation: CV (%) = 100 x standard deviation/mean value. The equality of means values of the characteristics studied among the three subgroups selected on their experience level and compliance with the WHO guidelines was tested by a one-way analysis of variance. When rejecting the null hypothesis, the Bonferroni test was used for pair wise mean comparisons. Other paired comparisons were made using the Mann–Whitney-test. All statistics were run using the BMDP statistical software (Dixon, 1988Go).


    Results
 Top
 Abstract
 Introduction
 Materials and methods
 Results
 Discussion
 References
 
The mean and median values of the percentage of normal sperm, of each anomaly recorded and of the MAI, the corresponding overall inter-participant CV and the reference results are presented in Table II. The lowest CV were observed for the MAI (12%), the percentages of bent midpieces (23%), of absent tails (25%) and abnormal or absent acrosomes (26%). The overall CV for the percentage of normal sperm was found to be equal to 40% with a very broad range (6–39%), a marked difference with the low dispersion of values found for the MAI. The mean and median values of the percentage of normal sperm, MAI and each anomaly recorded and the corresponding overall inter-participant CV in two groups of participants selected according to their experience are presented in Table III. Experienced participants found significantly less normal sperm, and more multiple heads, abnormal acrosomes, thin midpieces and irregular calibres of the tail. Figure 1 summarizes the results of the three groups of participants studied according to their level of experience and compliance with the recommended methodologies. Participants who did not routinely use the recommended methodologies, regardless of their experience level, had the highest differences compared with the reference values. In addition, their results differed significantly from those obtained in the group of experienced participants following routinely the WHO guidelines.


View this table:
[in this window]
[in a new window]
 
Table II. Distributions and mean inter-individual coefficients of variation of the morphology characteristics studied by the 62 participants and reference results
 

View this table:
[in this window]
[in a new window]
 
Table III. Differences in the frequency of the sperm morphological characteristics and related inter individual coefficients of variation (CV) according to the number of analyses performed per year
 


View larger version (26K):
[in this window]
[in a new window]
 
Figure 1. Differences to the reference values according to the level of experience and the use of standardized methodologies.

 

    Discussion
 Top
 Abstract
 Introduction
 Materials and methods
 Results
 Discussion
 References
 
A noticeable amount of variability in the assessment of human sperm morphology was found in the present study. This variability is obviously prejudicial to the comparison of the results in the same laboratory or from one laboratory to another. Moreover, it can also strongly impair the management of infertile couples because sperm morphology is a major determinant of human fertility in vivo (Guzick et al., 2001Go; Slama et al., 2002Go) and in vitro (Kruger et al., 1988Go). The results of the EQA reported herein are based on a large group of biologists or technicians with different levels of experience and practice. In this respect, they can be considered representative of the baseline variability in the assessment of human sperm morphology in a developed country. The overall CV for the percentage of normal sperm was found to be equal to 40%, a result similar to what was previously found in several EQA (Matson et al., 1995Go; Ombelet et al. 1998Go; Keel et al., 2000Go). Interestingly, the variability for various anomaly categories and the MAI was markedly lower. The inter-participant variability for visual pattern defect recognition (for example, multiple head) was lower than for the defect categories requiring a visual assessment of the head or tail size (for example, tapered heads), see Table II, a result previously reported (Baker and Clarke, 1987Go). However, continuous quality controls in our laboratory have indicated that training with the help of an eyepiece equipped with a measuring reticle strongly improves the classification of size defects which in turn decreases markedly the intra- and inter-individual variability in assessment. The better concordance in assessment of several categories of abnormal sperm together with their clinical value (Jeulin et al., 1986Go; Jouannet et al., 1988Go; Menkveld et al., 1996Go; Slama et al., 2002Go), suggest the relevance of determining the amount of morphological defects or the MAI besides the percentage of normal sperm. There are obvious sources of variability in the assessment of sperm morphology, for example, the current use of several systems of classification [strict criteria, Düsseldorf, WHO 87 and 92; American Society for Clinical Pathology (ASCP), Adelman and Cahill, 1989Go] or David classifications, (Ombelet et al., 1997Go). Table IV, summarizing the schemes and results of various EQA, illustrates the wide variation in human sperm morphology assessment whatever the classification method used (inter-individual CVs ranging between 28 and 65%). These marked variations point to other putative sources of variability such as the observer level of experience or the procedures used (Dunphy et al., 1989Go; Menkveld et al., 1990Go; Neuwinger et al., 1990Go; Davis and Gravance, 1993Go; Meschede et al., 1993Go). It was found here that participants regularly assessing high numbers of samples for sperm morphology scoring had stricter evaluations of the normal (and the abnormal sperm as illustrated by the MAI value) than novice participants, the former having the mean values closest to the reference values although with some differences. Such differences could be that the trained biologists have assessed the sperm after the workshop, with the possibility to discuss together and to size the cells (which was not the case during the EQA since no means of measuring was offered to the participants). The role of repeated testing on reproducibility and convergence with reference values has been reported (Mortimer, 1994Go; Franken et al., 2000Go). It was shown that the level of experience has clearly an impact on the results of the sperm morphology assessment (Baker and Clarke, 1987Go). However, the level of experience, when taken into consideration alone, is not a guarantee of low inter-individual variability in the assessment of human sperm morphology, even if trained technicians do show lower inter-individual variability than among less trained technicians in some EQA [Jequier and Ukombe, (1983)Go, see Table IV; Baker and Clarke, 1987Go]. In the present EQA, it was interesting to see that participants who do not routinely use the recommended procedures were less strict and gave significantly different results in comparison with participants who follow the guidelines (irrespective of their level of experience) and to the biologists who set reference values.


View this table:
[in this window]
[in a new window]
 
Table IV. Design and results of external quality assessments (EQA) of human sperm morphology from various reports
 
In conclusion, the present study shows that the practical experience as well as the compliance with recommended methodologies have a serious impact on the results of the sperm morphology assessment. It demonstrates the imperative need for standardization, training and quality control for human sperm morphology assessment.


    Acknowledgements
 
We would like to thank B.Foliguet and A.Clavert for the organization of the meeting, all the participants in this EQA and R.Dolan for reviewing the English revision of the manuscript.


    References
 Top
 Abstract
 Introduction
 Materials and methods
 Results
 Discussion
 References
 
Adelman, M.M. and Cahill, E.M. (eds) (1989) Atlas of Sperm Morphology. ASCP Press, Chicago, IL.

Auger, J., Eustache, F., Ducot, B., Blandin, T., Daudin, M., Diaz, I., El Matribi, S., Gony, B., Keskes, L., Kolbezen, M. et al. (2000) Intra- and inter-individual variability in human sperm concentration, motility and vitality assessment during a workshop involving ten laboratories. Hum. Reprod., 15, 2360–2368.[Abstract/Free Full Text]

Auger, J., Eustache, F., Andersen, A.G., Irvine, D.S., Jørgensen, N., Skakkebæk, N.E., Suominen, J., Toppari, J. Vierula, M. and Jouannet, P. (2001) Sperm morphological defects related to environment, lifestyle and medical history of 1001 male partners of pregnant women from four European cities. Hum. Reprod., 16, 2710–2717.[Abstract/Free Full Text]

Baker, H.W.G. and Clarke, G.N. (1987) Sperm morphology: consistency of assessment of the same sperm by different observers. Clin. Reprod. Fertil., 5, 37–43.[Medline]

Cooper, T.G., Neuwinger, J., Bahrs, S. and Nieschlag, E. (1992) Internal quality control of semen analysis. Fertil. Steril., 58, 172–178.[ISI][Medline]

David, G., Bisson, J.P., Czyglik, F., Jouannet, P. and Gernigon, C. (1975) Anomalies morphologiques du spermatozoïde humain. 1) Propositions pour un système de classification. J. Gynecol. Obstet. Biol. Reprod., (Suppl.1) 17–36.

Davis, R.O. and Gravance, C.G. (1993) Standardization of specimen preparation, staining, and sampling methods improves automated sperm-head morphometry analysis. Fertil. Steril., 59, 412–417.[ISI][Medline]

Davis, R.O., Gravance, C.G. and Overstreet, J.W. (1995) A standardized test for visual analysis of human sperm morphology. Fertil. Steril., 63, 1058–1063.[ISI][Medline]

Dixon, W.J. (1988) BMDP statistical software manual. University of California Press, Berkeley.

Dunphy, B.C., Kay, R. Barratt, C.L.R. and Cooke, I.D. (1989) Quality control during the conventional analysis of semen, an essential exercise. J. Androl., 10, 378–385.[Abstract/Free Full Text]

Franken, D.R., Smith, M., Menkveld, R., Kruger, T.F., Sekadde-Kigondu, C., Mbizvo, M. and Akande, E.O. (2000) The development of a continuous quality control programme for strict sperm morphology among sub-Saharan African laboratories. Hum. Reprod., 15, 667–671.[Abstract/Free Full Text]

Guzick, D.S., Overstreet, J.W., Factor-Litvak, P., Brazil, C.K., Nakajima, S.T., Coutifaris, C., Carson S.A., Cisneros, P., Steinkampf, M.P., Hill, J.A. et al. (2001) Sperm morphology, motility, and concentration in fertile and infertile men. N. Engl. J. Med., 345, 1388–1393.[Abstract/Free Full Text]

Jequier, A.M. and Ukombe, E.B. (1983) Errors inherent in the performance of a routine semen analysis. Br. J. Urol., 55, 434–436.[ISI][Medline]

Jeulin, C., Feneux, D., Serres, C., Jouannet, P., Guillet-Rosso, F., Belaisch-Allart, J., Frydman, R. and Testart, J. (1986) Sperm factors related to failure of human in-vitro fertilization. J. Reprod. Fertil., 76, 735–744.[Abstract]

Jouannet, P., Ducot, B., Feneux, D. and Spira, A. (1988) Male factors and the likelihood of pregnancy in infertile couples. I. Study of sperm characteristics. Int. J. Androl., 11, 379–384.[ISI][Medline]

Keel, B.A., Quinn, P., Schmidt, C.F. Jr, Serafy, N.T. Sr and Schalue, T.K. (2000) Results of the American Association of Bioanalysts national proficiency programme in andrology. Hum. Reprod., 15, 680–686.[Abstract/Free Full Text]

Kruger, T.F., Acosta, A.A., Simmons, K.F., Swanson, R.J., Matta, J.F. and Oehninger, S. (1988) Predictive value of abnormal sperm morphology in in vitro fertilization. Fertil. Steril., 49, 112–117.[ISI][Medline]

Matson, P.L. (1995) External quality assessment for semen analysis and sperm antibody detection: results of a pilot scheme. Hum. Reprod., 10, 620–625.[Abstract]

Menkveld, R., Stander, F.S.H., Kotze, T.J., Kruger, T.F. and van Zyl, J.A. (1990) The evaluation of morphological characteristics of human spermatozoa according to stricter criteria. Hum. Reprod., 5, 586–592.[Abstract]

Menkveld, R., Rhemrev, J.P.T., Franken, D.R., Vermeiden, J.P. and Kruger, T.F. (1996) Acrosomal morphology as a novel criterion for male fertility diagnosis: relation with acrosine activity, morphology (strict criteria), and fertilization in vitro. Fertil. Steril., 65, 637–644.[ISI][Medline]

Meschede, D., Keck, C., Zander, M., Cooper, T.G., Yeung, C.H. and Nieschlag, E. (1993) Influence of three different preparation techniques on the results of human sperm morphology analysis. Int. J. Androl., 16, 362–369.[ISI][Medline]

Mortimer, D. (1994) Technician training and quality control aspects. Practical Laboratory Andrology, Oxford University Press, Oxford, pp. 337–347.

Mortimer, D. and Menkveld, R. (2001) Sperm morphology assessment: historical perspectives and current opinions. J. Androl., 22, 192–205.[Free Full Text]

Neuwinger, J., Behre, H.M. and Nieschlag, E. (1990) External quality control in the andrology laboratory: an experimental multicenter trial. Fertil. Steril., 54, 308–314.[ISI][Medline]

Ombelet, W., Pollet, H., Bosmans, E. and Vereecken, A. (1997) Results of a questionnaire on sperm morphology assessment. Hum. Reprod., 12, 1015–1020.[CrossRef][ISI][Medline]

Ombelet, W., Bosmans, E., Janssen, M., Cox, A., Maes, M., Punjabi, U., Blaton, V., Gunst, J., Haidl, G. and Wouters, E. (1998) Multicenter study on reproducibility of sperm morphology assessments. Arch. Androl., 41, 103–114.[ISI][Medline]

Slama, R., Eustache, F., Ducot, B., Jensen, T.K., Jørgensen, N., Horte, A., Irvine, S., Suominen, J., Andersen, A.G. and Auger, J., et al. (2002) Time to pregnancy and semen parameters: a cross-sectional study among fertile couples from four European cities. Hum. Reprod., 17, 503–515.[Abstract/Free Full Text]

World Health Organization (1992) WHO laboratory manual for the examination of human semen and sperm-cervical mucus interaction. 3rd edn. Cambridge University Press, Cambridge.

World Health Organization (1999) WHO laboratory manual for the examination of human semen and sperm-cervical mucus interaction. 4th edn. Cambridge University Press, Cambridge.

Submitted on February 25, 2002; resubmitted on December 6, 2002; accepted on January 14, 2003.





This Article
Abstract
FREE Full Text (PDF )
Alert me when this article is cited
Alert me if a correction is posted
Services
Email this article to a friend
Similar articles in this journal
Similar articles in ISI Web of Science
Similar articles in PubMed
Alert me to new issues of the journal
Add to My Personal Archive
Download to citation manager
Search for citing articles in:
ISI Web of Science (2)
Request Permissions
Google Scholar
Articles by Eustache, F.
Articles by Auger, J.
PubMed
PubMed Citation
Articles by Eustache, F.
Articles by Auger, J.