RE: "A CHAIN MULTINOMIAL MODEL FOR ESTIMATING THE REAL-TIME FATALITY RATE OF A DISEASE, WITH AN APPLICATION TO SEVERE ACUTE RESPIRATORY SYNDROME"
Eric H. Y. Lau and
Paul S. F. Yip
Department of Statistics and Actuarial Science, University of Hong Kong, Hong Kong, People's Republic of China
In a recent article (1
), we and our colleagues proposed a method of estimating a fatality rate in real time based on a chain multinomial model. We demonstrated that the proposed estimator is more sensitive than that of the World Health Organization. We proposed a kernel smoothing method that we applied to the crude estimator of the daily death and recovery probabilities, p1t and p2t, under the chain multinomial model. However, the kernel function we us,ed is a two-sided one with a bandwidth b, implying that when we estimate the fatality rate at time t, data within the period [t b, t + b] are needed. In a retrospective study, this is obviously not a problem. However, when the method is applied during an epidemic, there is no observation for the period (t, t + b]. Here we suggest a one-sided kernel function method such that only information up to time t is needed to obtain a timely estimate of fatality up to time t during the course of the epidemic. More specifically, a one-sided Epanechnikov kernel function (2
)
 |
can be used in calculating the real-time fatality rate at time t. Let Kb(x) = b1K(x/b); then only data within the period [t b, t] are needed. The estimated real-time fatality rate
t is given by
where
i = 1, 2, are crude estimators of the death and recovery probabilities at time t. Let N1t and N2t be the numbers of deaths and recoveries on day t, while Ht1 is the number of inpatients at the start of day t. Then
Finally, the standard error of
is given by
and hence the variances and covariances and the confidence interval can be found as in our paper 1
. Since the bandwidth is chosen using the empirical bias bandwidth selection (EBBS) method, in order to minimize the empirical mean squared error, it will be larger when data are more stable or sparse. For a one-sided kernel, we can expect that the bandwidth will be comparably larger and hence more data preceding the point of estimation are needed. This could decrease the sensitivity of the estimator.
We applied the proposed one-sided kernel function to data on severe acute respiratory syndrome (SARS) from Hong Kong and Beijing, China. Results are shown in figure 1.

View larger version (35K):
[in this window]
[in a new window]
|
FIGURE 1. Top left: estimated real-time fatality rate during the 2003 severe acute respiratory syndrome (SARS) epidemic obtained using a one-sided kernel (with its 95% confidence limits (CLs)) and the traditional fatality ratio for SARS in Hong Kong. Top right: estimated real-time fatality rate obtained using one-sided and two-sided kernels and the traditional fatality ratio in Hong Kong. Bottom left: estimated real-time fatality rate during the 2003 SARS epidemic obtained using a one-sided kernel (with its 95% confidence limits) and the traditional fatality ratio for SARS in Beijing. Bottom right: estimated real-time fatality rate obtained using one-sided and two-sided kernels and the traditional fatality ratio in Beijing.
|
|
The top left panel in figure 1 shows the SARS fatality rate in Hong Kong estimated using a one-sided kernel, as well as the traditional fatality ratio for SARS. The top right panel compares the estimated fatality rates obtained using one-sided and two-sided kernels. The bottom panels show the corresponding estimates in Beijing. From the left panels, one can see that the confidence intervals are wide at the beginning of the epidemic, for approximately 2 weeks, and near the end of the epidemic in Hong Kong. Otherwise, the confidence intervals are generally narrow enough to provide sufficient information for estimating the fatality level in both regions. The right panels show that using a one-sided kernel causes instability of the estimated fatality rate, especially at the beginning of the epidemic. Nevertheless, both estimates converge to nearly the same trend thereafter. In other words, the fatality rate estimated using a one-sided kernel allowed us to practically provide an estimator in real time, while the loss of accuracy in comparison with the two-sided kernel is insignificant, especially in the midst of an epidemic.
It is also important to note that the performance of the kernel-type estimator is highly dependent on the bandwidth chosen. We used the EBBS method to choose an optimal bandwidth for the estimator at each time point. In such cases, if the optimal b over the course of the epidemic is too large, the ability to reflect real-time information will also be weakened.
 |
References
|
---|
- Yip PSF, Lau EHY, Lam KF, et al. A chain multinomial model for estimating the real-time fatality rate of a disease, with an application to severe acute respiratory syndrome. Am J Epidemiol 2005;161:7006.[Abstract/Free Full Text]
- Epanechnikov VA. Nonparametric estimation of a multidimensional probability density. Theory Probab Appl 1969;14:1538.[CrossRef]