1 Department of Epidemiology, Netherlands Cancer Institute, Amsterdam, the Netherlands
2 International Health Foundation, Utrecht, the Netherlands
3 Department of Gynecology and Obstetrics, Erasmus Medical Center, Rotterdam, the Netherlands
Correspondence to Flora E. van Leeuwen, Department of Epidemiology, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, the Netherlands (e-mail: f.v.leeuwen{at}nki.nl).
Received for publication January 8, 2004. Accepted for publication January 5, 2005.
![]() |
ABSTRACT |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
cohort studies; fertility; fertilization in vitro; medical records; questionnaires; reproductive techniques, assisted; sensitivity and specificity; women
![]() |
INTRODUCTION |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
To our knowledge, no reports are available on the validity and reliability of self-reported data on the cause of subfertility. Some studies evaluated the recall of menstrual and reproductive factors and reported a satisfactory-to-good recall for age at menarche and age at menopause, pregnancy-related events, and birth characteristics (1115
). Accuracy of recall was less for menstrual cycle characteristics (11
, 14
, 16
).
In a large Dutch cohort of women who underwent in vitro fertilization, we assessed the validity of self-reported causes of subfertility in comparison with medical record information. Detailed information on the causes of subfertility was available from two sources, that is, mailed questionnaires from the study participants and medical records that were abstracted by trained research assistants. With the advent of assisted reproductive technology, subfertile couples undergo a variety of diagnostic procedures prior to the start of subfertility treatment. The results of these tests may point to a single cause of subfertility but may also indicate two or even more contributing factors. Both the women and the trained research assistants in our study could report more than one cause of subfertility. Consequently, we could also investigate differences in validity of recall between women with only one and those with two causes of subfertility. Furthermore, we were able to examine the role of several covariates on the accuracy of reporting.
![]() |
MATERIALS AND METHODS |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Between 1997 and 2000, all women received a mailed questionnaire on reproductive history, history of subfertility (treatment), the use of exogenous hormones (fertility drugs, oral contraceptives, and hormone replacement therapy), and various lifestyle factors; women also received an informed consent form for data abstraction from the medical records. A total of 13,698 women returned the questionnaire (response rate: 71.2 percent). The upper part of figure 1 displays a graphical presentation of the OMEGA study population.
|
Data collection and methods of analyses
In the questionnaire, women were asked to report their cause(s) of subfertility according to 12 categories, that is, tubal obstruction, factors related to the male partner, endometriosis, ovulation disorders, other hormonal disorders, premature ovarian failure, polycystic ovary syndrome, uterine abnormalities, cervical disorders, factors related to vaginal disorders, unexplained subfertility (despite full subfertility assessment, no particular cause of subfertility found), and other factors. Women could report more than one cause of subfertility (table 1). For the present study, the self-reported causes of subfertility were classified into eight categories: tubal, male, ovarian (including ovulation disorders, polycystic ovary syndrome, premature ovarian failure, and hormonal disorders related to ovarian function), cervical disorders, uterine abnormalities, endometriosis, unexplained, and other (including vaginal disorders).
|
We assessed the agreement between self-reports and medical records, adjusting for chance agreement by use of the Cohen kappa statistic, which was estimated for various subfertility categories. Values of kappa greater than 0.75 represent excellent agreement, values of 0.400.75 were considered to indicate "moderate" agreement, and a value of less than 0.40 represents poor agreement (19).
Furthermore, we calculated the overall proportion of observed agreement (positive and negative) between self-reports and medical records. In addition, we analyzed the overall kappa value of all types of subfertility within a subgroup of women whose subfertility was ("strongly") attributed to only one factor, as reported in both the questionnaire and medical record.
Using the medical record information as our "gold standard," we determined the validity measures sensitivity and specificity and positive and negative predictive values. Sensitivity is defined as the proportion of women with a specific cause of subfertility according to medical record who correctly classified themselves with this particular cause of subfertility in the questionnaire. Specificity is the proportion of women who truly do not have a specific cause of subfertility (according to the medical record) and who correctly classified themselves as such in the questionnaire.
Logistic regression analysis was used to determine which variables were independently associated with overall agreement regarding the subfertility diagnosis reported in the questionnaire and the medical records. The variables of interest were age at completion of the questionnaire (34, 3537, 3840, and >40 years), duration of subfertility before in vitro fertilization (
1, 23, 45, and
6 years), prein vitro fertilization treatment (none, fertility drug use only, and intrauterine insemination with or without fertility drug use), years since first in vitro fertilization treatment as a "proxy" for the period of time that had elapsed since the woman received information about the cause(s) of subfertility (
2, 35, 67, and
8 years), number of in vitro fertilization treatment cycles (one, two, three, and greater than three), educational level (low (primary school), middle (secondary school), high (college or university), and unknown), and whether or not women had at least one livebirth after in vitro fertilization treatment. All analyses were processed with SPSS, version 11.0, software (SPSS, Inc., Chicago, Illinois).
![]() |
RESULTS |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
Table 2 presents the comparison between self-reports and medical records for various subfertility diagnoses. The most important causes of subfertility as reported by the women were tubal (38 percent), male (38 percent), unexplained (23 percent), endometriosis (13 percent), and hormonal (17 percent). These percentages add up to more than 100 percent because more than one cause of subfertility could be reported by the women (and abstracted from the medical records). Of all the women who attributed their subfertility to tubal factors, this cause was confirmed in the medical record in 91 percent of all cases (positive predictive value). Conversely, of all the women whose subfertility was caused by tubal factors according to the medical record, 84 percent reported tubal subfertility in the mailed questionnaire (sensitivity). The overall agreement (positive and negative) for tubal subfertility was 90 percent, and the kappa value was 0.79, indicating excellent agreement. For male subfertility, the validity measures and agreement were just a bit lower and also indicative of accurate reporting. For both unexplained subfertility and endometriosis, the kappa values indicated "moderate" agreement, with rather low positive predictive values (65 percent and 43 percent, respectively). Despite the low kappa value and sensitivity for women whose subfertility was attributed to hormonal, cervical, or uterine factors, the observed specificity was high.
|
|
Table 4 presents the accuracy of self-reported cause of subfertility in comparison with the medical records according to several characteristics of the study population. For women aged 38 years or more at the time of filling out the questionnaire, the adjusted odds ratio for accurate recall of type of subfertility was 0.62 (95 percent confidence interval: 0.49, 0.80) compared with that for women aged less than or equal to 34 years. Furthermore, women with a high educational level were more likely to recall their type of subfertility accurately than were women with a low educational level. A long period of subfertility and prein vitro fertilization fertility treatment were associated with less accurate reporting of the cause of subfertility.
|
![]() |
DISCUSSION |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|
The highly accurate self-report for tubal and male subfertility may be explained by the fact that these are widely known causes of subfertility. In the first years after the introduction of in vitro fertilization treatment in the Netherlands, the major indication for treatment with in vitro fertilization was tubal obstruction, while male subfertility became a major indication for in vitro fertilization in the mid-1980s. Tubal and male subfertility have remained the most important indications for in vitro fertilization treatment. Women might remember these causes more accurately as compared with, for example, hormonal subfertility, which is less prevalent and more difficult to understand.
For endometriosis and hormonal factors, sensitivity was high to moderate, but the positive predictive values were low. A possible explanation is that, in the era of assisted reproductive technology, couples undergo extensive fertility assessment; in the course of diagnostic work-up (which may already have started before the first in vitro fertilization treatment), it is quite possible that several subtle abnormalities are detected and communicated to the subfertile couple. Although such test results may be considered minor contributors to overall subfertility by the gynecologist, the women in our study may have remembered them and reported them in the questionnaire. This phenomenon may also explain the rather low sensitivity for "unexplained" subfertility. The distribution of subfertility causes attributed to only one factor as reported by the women in our study is more or less in line with that in the literature, which reports that 35 percent of all subfertility in couples is due to tubal factors, 35 percent to male factors, 5 percent to unusual problems, and 15 percent to ovulatory dysfunction, with 10 percent remaining unexplained (20). The percentage of unexplained subfertility in our study is higher while hormonal subfertility was less prevalent, which may be explained by the differences in source populations, the definitions used, and the differences in fertility assessment over time. In daily clinical practice, many women with hormonal subfertility are successfully treated without the necessity of in vitro fertilization treatment, explaining the lower proportion of women with hormonal subfertility in our study. Our study population was treated with in vitro fertilization between 1983 and 1994. The relatively high proportion of women diagnosed with unexplained subfertility may be due to more thorough fertility assessment in recent years.
In 54 percent of all women who reported two or more causes of subfertility, the medical record revealed that subfertility was attributed to only one major factor. Diagnostic tests in reproductive medicine may reveal subtle abnormalities and, in addition, do not always yield unambiguous results. Although the gynecologist did not consider subtle abnormalities as a major cause of subfertility, a woman might consider such abnormalities as an important cause of subfertility. Conversely, 43 percent of all the women whose subfertility was due to two or more major factors according to the medical record reported only one cause of subfertility themselves. In these cases, full fertility assessment revealed more than one factor classified as either "strongly" or "probably" contributing to the subfertility, but the women failed to report it in our questionnaire. The lower sensitivity for the most frequent combinations of two causes of subfertility suggests that women with two causes of subfertility are less likely to recall their subfertility accurately as compared with those who had only one cause of subfertility.
When considering the factors impacting on the accuracy of self-reports (table 4), we observed that younger age (<37 years) at questionnaire completion, a high educational level, a short period of subfertility duration (less than 2 years), and no fertility treatment prior to in vitro fertilization favorably affected the recall of cause of subfertility. Less accurate recall among women with a long duration of subfertility and prein vitro fertilization treatment may be explained by the higher prevalence of unexplained subfertility among these women. In our study, the time since in vitro fertilization treatment did not affect the accuracy of recall. Women who had their first in vitro fertilization treatment 8 or more years ago were able to recall their subfertility cause as accurately as did women whose treatment started 2 years ago. As pointed out above, tubal subfertility was the major indication for in vitro fertilization in the early years after its introduction. This was confirmed in our data; that is, those with a long follow-up period were more likely to have been diagnosed with tubal subfertility (54 percent). These women may have had a wish for a child for many years before the in vitro fertilization era, and they probably consulted several fertility experts while trying to achieve a pregnancy. Although our data show that the physician-patient communication must have been relatively good in the early era of assisted reproductive technology treatment, it is possible that the women treated in the first years of in vitro fertilization were more often told their subfertility diagnosis, resulting in accurate recall despite the longer time interval.
Since hardly any reports are available on the validation of the self-reported cause of subfertility, we cannot compare our results with those of the literature. A recent case-control study of ovarian cancer reported that validation of women's self-reported difficulties in conceiving (as a measure of infertility) with medical record information resulted in great difficulties. Of the 179 women for whom medical records were available, only 41 (23 percent) had their self-report of a fertility problem confirmed (21). Validation of self-reported ovulatory infertility was attempted in a randomly selected sample of 100 women participating in the Nurses' Health Study II (22
). Of the 71 women who gave permission to review their medical records, only 40 medical records could be located, of which in 95 percent ovulatory infertility was confirmed by diagnostic test or specific treatment (22
). Sensitivity was not assessed in this small sample of the study, however, and recall of other causes of subfertility was not examined.
When our results are interpreted, the strengths and limitations of the study design need to be considered. The strengths of our study include the large study size and the availability of detailed information on all causes of subfertility from two sources. Furthermore, the women filling out the questionnaires and the research assistants abstracting the medical record were instructed that they could record more than one factor that contributed to the overall subfertility. In addition, we collected information from the medical record on the importance of all subfertility causes mentioned in the record.
A limitation of our study is, however, that in calculating the sensitivity and specificity we considered the medical record as the gold standard, thus assuming the medical records to be correct and complete. Of all the eligible women in the study population (n = 9,942), only 7 percent of the women had to be excluded because of incompleteness of the medical records for the cause of subfertility. However, some medical records that were considered by us to be complete may have been incomplete for one of the subfertility causes considered, resulting in an underestimation of overall agreement and positive predictive value. This might explain the reporting of two causes of subfertility in the mailed questionnaire with confirmation of only one cause in the medical record. It is even possible that women reported causes of subfertility that were successfully treated in the past (e.g., tubal surgery), after which they received in vitro fertilization treatment (because of male subfertility) in the clinic participating in our study, to which the previously diagnosed cause of subfertility was not communicated. However, it seems unlikely to us that this has had more than a minor influence on our assessment of the accuracy of recall.
Our study was restricted to a subfertile population that received in vitro fertilization treatment. Recall of the cause of subfertility might be different among in vitro fertilization-treated women as compared with other subfertile women, because undergoing in vitro fertilization can be considered an important life event. In particular, women treated in the early era of assisted reproductive technology may remember their cause of subfertility more accurately because they undertook more extensive action to be treated. In addition, women treated with in vitro fertilization, compared with those subfertile women not receiving this treatment, might be more aware of reproductive health problems and might have higher educational levels (although in the Netherlands three in vitro fertilization attempts are covered by health insurance). These characteristics limit the generalizability of our results to subfertile women in general.
In the initial OMEGA study cohort, a subgroup of subfertile women who did not receive any treatment with in vitro fertilization was included (n = 6,588). However, because of limited project funding, we were able to abstract the medical records of only a small sample of these women who had not received in vitro fertilization treatment, and information on the cause of subfertility was absent in many cases. For 475 women not treated with in vitro fertilization, we had information on the cause of subfertility from both the questionnaire and the medical records. In this group, the positive predictive value, sensitivity, and kappa value for subfertility attributed to tubal factors were 94 percent, 91 percent, and 0.75, respectively (data not shown). These percentages are in line with those presented in table 2 for the in vitro fertilization-treated women.
The response rate to our mailed risk factor questionnaire was 71.2 percent. A survey among nonresponders showed that nonresponders were less likely to have had a livebirth after in vitro fertilization. The major reasons for nonresponse were lack of time to fill out the questionnaire and negative emotions regarding in vitro fertilization treatment, often related to the fact that no ongoing pregnancy was achieved. We cannot exclude the possibility that these reasons for nonresponse might be related to the accuracy of self-reported subfertility diagnosis.
In conclusion, our results indicate that self-reports of tubal and male subfertility are highly accurate. However, for unexplained subfertility, the validity of recall was considered to be only moderate and, for subfertility causes related to hormonal and uterine factors, the accuracy was considered low. The accuracy of recall was unsatisfactory if two causes of subfertility played a role. When self-reports on subfertility diagnosis are used in epidemiologic studies, verification by medical records should be a consideration for specific causes and combinations of more than one cause.
![]() |
ACKNOWLEDGMENTS |
---|
The OMEGA Project Group includes the following persons: Dr. M. Kortman and Dr. E. R. Te Velde (University Medical Center, Utrecht); Dr. N. Macklon (Erasmus Medical Center, Rotterdam); Dr. C. A. M. Jansen (Diaconessenhuis, Voorburg); Dr. R. A. Leerentveld (Isala clinics, Zwolle); Dr. W. N. P. Willemsen (Academic Hospital, Nijmegen, St. Radboud); Dr. R. Schats (Academic Hospital, Free University, Amsterdam); Dr. N. Naaktgeboren and Dr. F. M. Helmerhorst (Leiden University Medical Center, Leiden); Dr. R. S. G. M. Bots (St. Elisabeth Hospital, Tilburg); Dr. A. H. M. Simons (Academic Hospital, Groningen); Dr. H. V. Hogerzeil (Academic Medical Center, Amsterdam); Dr. J. L. H. Evers (Academic Hospital, Maastricht); and Dr. P. A. van Dop (Catharina Hospital, Eindhoven), the Netherlands.
![]() |
References |
---|
![]() ![]() ![]() ![]() ![]() ![]() ![]() |
---|