a Department of Cardiology, Rikshospitalet, Sognsvannsveien 20, University of Oslo, 0027 Oslo, Norway
b Department of Epidemiology, Akershus University Hospital, 1474 Nordbyhagen, Norway
c Department of Informatics, University of Oslo, Norway
* Corresponding author. Tel.: +47-23-07-00-00; fax: +47-23-07-39-17
E-mail address: k-eriks{at}online.no
Received 20 November 2003; revised 20 November 2003; accepted 9 April 2004
Abstract
Aim It has recently been suggested that exercise testing may be more valuable prognostically than it is diagnostically in apparently healthy subjects. We wanted to compare the accuracy of CHD risk assessment based on classical risk factors with an assessment also based on multiple exercise test parameters.
Methods and results In 197275, 2014 apparently healthy men aged 4060 had a symptom limited exercise test during a cardiovascular survey. Three hundred died from CHD during 26 years of follow-up. Compared to Cox regression models solely including classical risk factors (CRF), models also including multiple exercise test parameters (CRF+X) were clearly superior (). Risk scores were computed based on the models. CRF and CRF+X risk scores often differed markedly; CRF+X scores were generally most reliable in both the high and low risk range. In smokers with cholesterol >6.5 mmol/l (
), the CRF and CRF+X models identified 67 vs. 110 men at the highest CHD risk level according to European guidelines (34.2% vs. 38.2% CHD mortality). Three in five CRF+X-identified smokers with cholesterol >6.5 mmol/l had CHD mortality similar to the mean of all 2014 men.
Conclusion Integration of multiple exercise test parameters and conventional risk factors improved CHD risk assessment substantially especially in smokers with high cholesterol.
Key Words: Exercise testing Coronary heart disease mortality Prediction Healthy men Follow-up studies
Introduction
Based on the worldwide epidemiological experience, the evaluation of cardiovascular risk, given in the recent European guidelines on cardiovascular disease prevention,1 is based on four time-honoured classical coronary and cardiovascular disease (CHD and CVD) risk factors: age, serum cholesterol, resting systolic blood pressure and smoking status. Exercise testing a valuable diagnostic tool in populations with established CHD is not recommended in asymptomatic subjects due to both lacking evidence of its value and, especially, because false positive exercise ECGs are common.2 However, this view is mainly based on the diagnostic value of exercise testing, and Ashley et al.3 recently suggested in their literature review that emphasis should be shifted from a diagnostic to a prognostic perspective. It is well known that several exercise test indices, besides ECG findings, such as exercise capacity,4 heart rate responses both during and after exercise5,6 and blood pressure responses,7 are strong predictors of cardiovascular events. However, few studies have explored the combined predictive abilities of such data, or tried to compare the accuracy of CHD risk stratification, provided by a classical prediction model, with a model also including multiple exercise test parameters in apparently healthy subjects.8 The present report aims at providing such evidence.
Methods
Between 28 August1972 and 30 March 1975, 2014 apparently healthy males aged 4060 years participated in a cardiovascular survey at the University Hospital of Oslo, Norway. These were recruited from five Governmental agencies as described previously.4 Subjects with known or suspected CHD, other heart diseases, diabetes mellitus, treated hypertension, cancer, miscellaneous other chronic diseases, those unable to conduct a bicycle exercise test or were taking drugs regularly were excluded. Those with intercurrent febrile illnesses had to wait for at least 14 days before being examined. All were asked to fast for 12 h and abstain from smoking for at least 8 h.
Examination program
The examination program included height/weight and blood pressure measurements, a panel of blood tests, a case history (including smoking data obtained from a comprehensive questionnaire), clinical examination, resting ECG and a symptom limited exercise ECG test using an electrically braked Elema bicycle (Elema, Stockholm, Sweden). The starting load was 100 W, and the load was increased by 50 W every 6 min until exhaustion unless stopped earlier for safety reasons.4 Resting systolic blood pressure was measured three times with a mercury sphygmomanometer to the nearest 2 mmHg, after 5 min supine rest. The second reading almost always gave the lowest value, and was therefore chosen.9 Exercise blood pressure was measured, to the nearest 5 mm Hg, at 1, 3 and 5 min on all loads and immediately prior to test termination.9 Maximal heart rate was determined from the exercise ECG immediately prior to termination of exercise. ECGs were recorded at rest prior to the test, at 2, 4 and 6 min on all loads, at test termination, and at 1, 2, 3 and 5 min post exercise in the supine position. During the baseline survey (Survey 1), 105 men, suspected to have latent CHD based mainly on exercise ECG findings, underwent coronary angiography; 70 (67%) had a pathological angiogram.10 All participants and their physicians were informed about all survey findings.
A similar survey (Survey 2) was conducted during 198082; 1756 men (91% of all alive) participated, and among these 1428 (81%) remained healthy according to the baseline inclusion criteria.11 HDL cholesterol was measured in 95% of these men (but only in 8% during Survey 1). Data from Survey 2 have been used to validate results derived from Survey 1.
We defined two sets of predictors for CHD death:
Mortality and morbidity data
The Norwegian Data Inspectorate and the Norwegian Board of Health gave permission to obtain morbidity and mortality data. Mortality data were retrieved from the database of Statistics Norway covering all deaths in Norway, for our purposes complete up to 31 December 1999. Morbidity data, including data on diabetes, were obtained from one questionnaire survey taken in 1987, two additional clinical surveys taken in 198990 and 199596 and a nationwide search of all patient records from all Norwegian hospitals in 199596. Deaths caused by ischaemic heart disease and sudden, unexpected deaths were classified as coronary deaths. Cardiovascular deaths also include deaths caused by stroke and ruptured aortic aneurysms.
Statistical methods
Proportional hazards (Cox) models were used to (a) study associations between the selected predictors and time to death, and (b) to compute individual age, standardised CHD risk scores (estimated linear predicted values).13 Associations between the continuous variables and CHD mortality were all approximately linear. Diagnostic plots of vs.
indicate that the proportional hazards assumptions are acceptably fulfilled for all parameters used. Associations (a) are presented as relative risks (RRs) related to an increase of 1 SD for continuous variables and between group ratios for discrete variables.
We mainly studied two Cox models: a CRF model, which included the classical risk factors, and a CRF+X model, which included both the CRF and the exercise predictors. The co-variates were age-standardised to obtain risk scores not dependent on age (b), with age set to 49.8 years (population mean) for all participants in analyses based on Survey 1, and to 56.6 years in analyses based on Survey 2. Resting systolic blood pressure, maximal heart rate, exercise blood pressure and physical fitness (after log transformation) were all an approximate linear function of age with the dispersions around the regression lines virtually independent of age. The age-standardised values were computed as the differences between the original values and the respective regression lines.
Proportional hazards models (Table 2) were compared using standard likelihood ratio statistics. All -values are two-tailed, and a model parameter is described as significant if the corresponding
-value is below 0.05. To avoid that a subject's risk score was affected by his own outcome, we used a cross validation procedure where each man's risk score (b) was computed using a Cox model based on all participants except himself.
|
|
|
Results
Table 1 gives the relevant Survey 1 data of all 2014 men, and Survey 2 data of the 1428 men who remained healthy.
|
Table 3 indicates steep and similar gradients in age-adjusted CHD mortality across risk quintiles based on the CRF and the X only model, but a steeper gradient for the CRF+X model. All models show a good correspondence between the predicted and the observed numbers of deaths. The CRF+X model was also able to better discriminate between high and low non-cardiovascular risk, compared to the CRF model (data not shown). Relative CRF vs. CRF+X differences were similar in the 4050 and 5060 years age range (data not shown).
In a substantial number of men there was a marked discrepancy between CRF and CRF+X risk classification. In these cases, mortality data mainly followed the CRF+X classification, clearly indicating that the CRF+X scores were the most accurate (data not shown).
Table 4 shows three examples of the CRF and the CRF+X model performance in a few selected subgroups:
Example 1 total population
The CRF+X model identified twice as many men in the deep red risk zone1 ("very high risk") compared with the CRF model (157 vs. 79). The CRF model also identified a higher mortality (38.2% vs. 34.2%). CHD mortality among these CRF+X-identified men was also higher than the mortality observed in 70 men with a pathological baseline coronary angiogram (38.2% vs. 35.2%, see Methods section).
Example 2 men with one defined CRF
Among smokers, substantial differences in favour of the CRF+X classification appear. The CRF and the CRF+X model identified 77 and 143 men (1/6 of all smokers) with very high risk, respectively, and 5.4% vs. 22.9% in the orange and yellow (low) risk zones. CRF vs. CRF+X differences for men with high cholesterol were also markedly in favour of the CRF+X model, however this was somewhat smaller than for smokers (70 vs. 123 men, ). There were 74 vs. 103 men with very high CRF vs. CRF+X risk among men with blood pressure
140 mmHg (
).
Example 3 men with combined CRFs
Among smokers with high cholesterol, the CRF model identified 67 men with very high risk, whereas the CRF+X model identified 110. Fig. 1(a) shows how the CRF and the CRF+X risk distributions differ. Fig. 1(b) demonstrates how these differences are reflected in terms of CHD mortality. Importantly, differences in mortality appear early during follow-up. The CRF model indicates that 4/5 have a clustering of mortality clearly above the mean of the total population of 2014 men. The CRF+X model, on the other hand, identifies one group with very high mortality and 3/5 with a mortality similar to the mean of the total population despite being smokers with high cholesterol.
|
Increasing CRF+X risk, but not CRF, was strongly associated with increasing incidence of diabetes (Fig. 2). The CRF+X also appeared to enhance CHD risk assessment in men with fasting blood glucose in the upper normal range (details not shown).
|
|
Previous attempts to integrate exercise testing in CHD risk assessment of asymptomatic subjects have been only modestly successful. Only one study has, to a limited extent, considered exercise predictors other than ECG interpretation21. Our present paper is the first long-term follow-up study to compare a CHD risk assessment model, based on classical risk factors, with a model including both classical risk factors and multiple exercise test parameters. We have found the latter model to substantially improve the accuracy of CHD risk estimates.
Improved predictive abilities
For all comparisons investigated, the CRF+X model had superior predictive abilities compared to the CRF model, and absolute differences in favour of CRF+X increased with increasing pre-test CRF risk. The CRF+X model identified twice as many very high risk subjects as the CRF model (7.8% vs. 3.9%), and these CRF+X-identified men also had higher mortality even exceeding that of men with a positive coronary angiogram at baseline. It is suggested that this finding is clinically relevant. Similarly, 90% of all CRF+X-identified subjects with very high risk were smokers, and almost one in four of smokers with high cholesterol had very high risk (Table 4). However, in our opinion it is also important to note that a large proportion of smokers with high cholesterol proved to have only average CHD mortality according to CRF+X, but not according to the CRF model (Fig. 1(b)). In other words, the CRF+X model identified a substantial number of men with high pre-test CRF risk with comparatively low long-term CHD mortality. Thus, healthy smokers with high cholesterol should represent one subgroup well-suited for exercise testing in which post-test data can be used as a guide to suggest primary preventive measures and/or indications for further investigations.
Information not provided by conventional CHD risk assessment
Numerous studies have explored the diagnostic value of exercise ECG testing in asymptomatic subjects. This issue was studied in the present cohort study during Survey 1.10 In the present paper, however, the prognostic significance of exercise testing in relation to CHD is the main issue, and our study also shows that:
Considering the increasing incidence of diabetes, the role of exercise testing as a prognostic tool in subjects with at least high-normal fasting blood glucose should be studied further (see ACC/AHA Guidelines).
Prediction models are rarely tested for robustness in the presence of changes in outcome incidence, but such robustness should not be taken for granted. In this study, differences in risk-discriminating abilities were largely maintained despite a marked sequential decline in CHD mortality during the follow-up period; similar mortality declines have been seen in most developed countries. Moreover, the choice of CHD death, the hardest CHD end-point, may be important since its definition does not change like the definition of myocardial infarction for example.20
In summary, our main findings should qualify as level 2b evidence according to the criteria of the NHS Research and Development Centre for Evidence Based Medicine.21
Other studies of asymptomatic populations
Bruce et al.22,23 made the first attempt to add several exercise test parameters in a prognostic model when studying future CHD events. However, comparison with our study is of moderate value since follow-up was short, data were somewhat incomplete, and the analyses were mainly based on categorical data. Moreover, the low number of deaths necessitated inclusion of soft CHD end points, further making comparison difficult.
In the AHA/ACC Guidelines on exercise testing,2 seven additional studies are listed in which exercise testing has been introduced for prognostic purposes in asymptomatic subjects.2430 However, besides having relatively short observation times, these studies cannot be compared to ours as they are based on exercise ECG interpretation only. As clearly shown by us, the inclusion of exercise ECG interpretation alone improved the predictive abilities only modestly.
Contribution of classical risk factors vs. exercise predictors to CHD risk estimates
It is unsettled how the exercise parameters add prognostic information to the classical risk factors. In an attempt to elucidate this important consideration we also compared risk scores based on the CRF model with scores based only on the exercise predictors (X-model; Table 3). Interestingly, both models had comparable total high/low risk discriminatory properties, but the models identified different subjects to a large extent. Although epidemiological data can never prove causation, these observations suggest that the CRF and the exercise predictors contribute to risk assessments through different mechanisms. The classical risk factors probably operate through their basic influence on atheromatosis development. The exercise predictors, on the other hand, possibly add most of their prognostic information by unveiling the presence of asymptomatic CHD. Silent ischaemia may thus be indicated by ST-depressions10 and/or poor fitness in some cases; CHD-related disturbances in autonomic regulation by the attenuation of maximal heart rate12,31 and impaired arterial compliance by an early, excessive rise in exercise blood pressure.32 Incidentally, in this study an exaggerated blood pressure response was a strong predictor of CHD mortality even among the most fit (details not shown).
If accepted as coherent, these considerations would suggest that CRF risk assessments might maintain their value in all age groups. CRF+X risk assessment should possibly only be advocated among middle aged and old subjects, among whom sub-clinical CHD is increasingly prevalent, and they might serve as an impetus for considering further examinations and/or initiating aggressive primary preventive measures when exercise test data indicate adverse prognosis.
Introduction of other predictors
We have tried to refine our CRF+X model further by including other exercise parameters, such as post-exercise heart rate recovery,6 but with only modest success. Combination of several inter-correlated variables however, such as heart rates at different stages of an exercise test should be considered with caution, especially from a statistical point of view. Substitution e.g., of maximal heart rate with heart rate increase during exercise12 principally gave the same results. Addition of other CHD risk factors like fasting blood glucose, 19 erythrocyte sedimentation rate,14 CHD inheritance (first degree relative developing CHD before age 60) and HDL data from Survey 211 improved predictive abilities minimally (data not shown).
Study limitations
Although we have used a statistical cross validation procedure and shown that the results based on Survey 1 hold true even when data from Survey 2 are introduced, our results have still to be fully validated. Although being apparently healthy at baseline, our study population recruited in the mid 1970s had a high CHD mortality compared to the risk levels at which primary preventive intervention is indicated according to recent recommendations.1 Besides, risk-factor levels and risk-factor interactions may vary from one population to another, and our analyses should be repeated by others.
Our data may have been biased by the participants receiving advice on how to change their lifestyle (e.g., exercise, quit smoking, etc.) when appropriate, and the study findings (including coronary angiograms of 105 men) also being reported to their physicians. Such bias probably led to an underestimation, rather than an overestimation, of the prognostic significance of exercise testing. The significance may also have been underestimated by our very long term follow-up, since CRF vs. CRF+X differences appeared to be even larger in analyses based on shorter follow-up (e.g., 15 years, data not shown). This is not surprising considering, after 26 years, a large proportion of the survivors were in their 80s.
The relation between pre-test CRF risk and the prognostic significance of exercise test data may be biased by our choice of CRF cut-off levels. However, when analysing the data by sliding these levels up or down similar general patterns consistently emerged (details not shown).
We have only examined healthy men aged 4060 years, and it is probably unwise to extrapolate to women, beyond these age limits, or to subjects with clinical symptoms or signs of heart disease.
Importantly, the 6 min duration of each exercise load may have been advantageous considering the time needed to reach a haemodynamically steady state during exercise.33 On the other hand, extrapolation to other test protocols e.g. with shorter duration on each load should be done with caution. Furthermore, the prognostic significance of combined exercise test data obtained e.g. from treadmill testing should be studied further.
Conclusions
Exercise test data in combination with conventional risk factors improved, in many instances substantially, CHD risk assessment in apparently healthy, middle-aged men, both in the high and low risk range. When conventional screening suggests high risk, exercise testing can probably be advocated as a means do decide further actions, particularly among smokers with high cholesterol.
References