Abstract

Background Vital registration of causes of death in China is incomplete with poor coverage of medical certification. Information on the leading causes of mortality will continue to rely on verbal autopsy (VA) methods. A new international VA form is being considered for data collection in China, but it first needs to be validated to determine its operating characteristics.

Methods Detailed medical records and clinical evidence for 3290 deaths (mostly adults) among residents of six cities representative of the urban Chinese population were reviewed by a panel of physicians and coded by experts to establish a reference underlying cause of death. Independently, families of the deceased were interviewed using a structured symptomatic questionnaire and a separate death certificate was prepared for each matching case (2102). Validity of the VA procedure was assessed using standard measurement criteria of sensitivity, specificity, and positive predictive value.

Results VA methods perform reasonably well in identifying deaths from several leading causes of adult deaths including stroke, several major cancer sites (lung, liver, stomach, oesophagus, and colorectal), and transport accidents. Sensitivity was less satisfactory in detecting deaths from several causes of major public health concern in China including ischaemic heart disease, chronic obstructive pulmonary disease, diabetes, and tuberculosis, and was particularly poor in diagnosing deaths from viral hepatitis, hypertension, and kidney diseases.

Conclusions VA is an imprecise tool for detecting leading causes of death among adults. However, much of the misclassification generally occurs within broad cause groups (e.g. CVD, respiratory diseases, and liver diseases). Moreover, compensating patterns of misclassification would appear to suggest that, in urban China at least, the method yields population-level cause-specific estimates that are reasonably reliable. These results suggest the possible utility of these methods in rural China, to back up the low coverage of medical certification of cause of death owing to poor access to health facilities there.

Introduction

Ascertaining the cause of a death in the absence of medical attention is a difficult task, especially in the absence of medical records for the deceased. The only recourse then lies in the use of ‘verbal autopsy’ (VA) interviews to gather information about symptoms and events during the period leading up to death, which is then reviewed to derive a probable cause. The VA methodology has been used in several developing countries to collect information on cause-specific mortality, given the basic necessity of such data for health policy and planning. These methods have been previously used to investigate epidemics1 and for specific intervention evaluation.2,3 They have also been implemented in national mortality surveillance systems, principally in India, China, and Tanzania.4

Recall bias leads to uncertainty in the diagnosis of cause of death from such retrospective interviews. Other issues that could influence the quality of VA data include questionnaire design, choice of interviewers and respondents, cause of death ascertainment mechanisms, and procedures for coding and tabulation of the data.5 Implementation of standardized practices could minimize bias that results from variations in these characteristics. However, there is a critical need to measure the validity of such methods in correctly ascertaining the leading causes of death and others of epidemiological significance in specific populations, if there is to be any confidence in the data collected through VAs.

By definition, validation involves a comparison of the underlying cause of death derived from the VA with a ‘true’ or reference underlying cause for the same death, derived from either a pathological autopsy (‘gold standard’) or clinical records (next best alternative). Non-availability of a reference diagnosis for the underlying cause is an elementary constraint in measuring the validity of VA procedures. This effectively excludes validation in rural areas of China where few deaths have appropriate clinical records. Causes of childhood death have been the focus of several VA validation studies in Africa610 and South Asia,11,12 where child mortality remains a major public health problem. Disease/condition-specific algorithms have been developed and validated in these studies,1315 providing a wealth of empirical evidence on the usefulness of VA for childhood mortality.

Little work, however, has been done on the validation of VA for adult deaths. What has been done is largely limited to the investigation of deaths due to HIV,16 and maternal causes,1720 although VA methods have also been used to correct for ill-defined causes in India21,22 and Thailand,23 and to evaluate the reliability of routine vital registration in Jordan.24 Populations in developing countries are faced with an increasing burden from early adult mortality, from infections (HIV and tuberculosis), non-communicable diseases (cancers, cardiovascular disease, and diabetes), and injuries.25,26 In contrast to the major causes of child mortality, some of these conditions are difficult to diagnose even in clinical settings, let alone using VA, owing to complex symptomatology. Nevertheless, the use of VA is unavoidable in many populations with poor access to health facilities and with inefficient cause of death certification and registration mechanisms.27

To establish the public health utility of VA derived data, its validity first needs to be assessed. In this study, we validate a set of VA procedures that have been proposed for use in the disease surveillance point (DSP) system in China. A detailed description of the DSP is available elsewhere.28 In this system, retrospective interviews are conducted among relatives to ascertain the cause of death for domiciliary deaths occurring in a nationally representative sample of population clusters distributed throughout China. The VA methods used to date have varied across provinces and counties. The specific aims of this study were to ascertain the validity of a standard set of VA procedures developed by an international collaboration to identify leading causes of adult deaths in China, and second, to identify patterns of misclassification error for different causes, and the reasons for such misclassification (Setel P, Ras C, Hemed Y et al., submitted for publication). Knowledge of such misclassification patterns, if broadly applicable, can be used to correct data from routine death reporting systems to more reliably estimate overall cause-specific mortality patterns in China. Not only will this better inform national disease prevention and control strategies, but it is also of obvious interest for assessing global mortality patterns and burden of disease.

Materials and methods

The study was conducted in six cities representative of the urban population residing in different geographical regions of China, namely Beijing (North), Ha'erbin (North-east), Shanghai (East), Guangzhou (South), Wuhan (Central), and Chengdu (South-west) (Figure 1).

Figure 1

Map showing location of six urban sites in China selected for the VA validation study

The VA validation study formed one component of a broader study to evaluate the quality of mortality statistics from the routine death registration system in China. Deaths were selected into the study based on the frequency of occurrence of causes reported in the routine system, but also included adequate numbers of certain less-frequent conditions likely to be of local epidemiological significance (e.g. hepatitis). In order to ensure that comparative data (from medical records and VA) were available for 20 leading causes or causes otherwise of interest, and assuming a failure rate of about one-third (based on pilot investigation), 3300 cases were selected, resulting in just over 2100 VAs. The list of causes chosen for validation can be seen from Table 2. Certain high frequency causes of death such as ischaemic heart disease (IHD), cerebrovascular disease, and chronic obstructive pulmonary disease (COPD) were under-sampled, in order to achieve adequate samples (≥25 deaths) of less-frequent causes of interest for validation, such as site-specific cancers, renal disorders, and infectious diseases.

The criteria for selection into the study were as follows: Altogether, ∼100 hospitals in the six cities provided the primary records for this study. An overview of the data collection, processing, and analysis protocol for validation of the VA procedures is shown in Figure 2. Roughly equal numbers of deaths (500–600) were selected in each of the six study sites, yielding a total of 3290 deaths. For each selected death, an international death certificate29 was derived, through expert review of the medical records, by a panel of three physicians blind to the cause of death certificate filed at registration. To assist this review, trained personnel abstracted relevant clinical information, diagnostic results, and treatment details from the medical record onto a hospital information sheet. For some cases, photocopies of specific documents from the medical record were attached as supporting evidence. The identity of the deceased was kept confidential. Reference (expert review) death certificates were coded by the World Health Organization Collaborating Centre for Classification of Diseases in Beijing to select and code the underlying cause of death acccording to the core three character code, as recommended by the International Classification of Diseases, Tenth Revision (ICD-10).30 This underlying cause, based on expert review of the medical records, was used as the reference or ‘gold standard’ diagnosis for assessing validity in this study.

  • The deceased should have been a resident of the city.

  • The death should have occurred between June 2002 and November 2002.

  • The death should have occurred in a tertiary care health facility with adequate diagnostic facilities.

  • The medical records for that death should be available for determining cause of death.

Figure 2

Flow chart showing study protocol and data collection

From the original set of 3290 deaths, 363 cases were excluded owing to insufficient evidence being available to derive an underlying cause of death. For the remaining 2927 deaths, the address of the decedent was obtained from the hospital record, and trained field investigators, blind to the cause of death from either the routine system or the expert review, conducted interviews using standard VA instruments.31 These included modules to record demographic characteristics, a narrative of the illness before death as provided by the respondent, a symptom duration checklist, and a section to note any mention of the illness prior to death from health services staff or records. The instruments were pilot tested on a sample of 200 deaths in Beijing, and revised to incorporate Chinese terminology and local perceptions of specific symptoms and signs.

Non-response to the VA interview was 6.5%, mostly in Beijing, Shanghai, and Guangzhou. A further 21% of cases could not be interviewed because of incomplete address or out-migration of the household of the deceased, with Shanghai accounting for about one-third of these cases. Owing to funding constraints, the study was brought to a close on reaching the preliminary target of 2100 deaths, with no further pursuit of losses to follow-up. A comparison of the proportionate distribution of deaths by cause from the original sample (2927 cases with expert diagnoses) and the excluded cases (825 cases not investigated by VA) suggests that the exclusion was non-differential by cause (Figure 3).

Figure 3

Correlation between proportions of deaths by cause in the overall sample and among the excluded cases

Completed VA instruments were submitted for review to an independent panel of two physicians in Beijing, who reached consensus as to the content and ordering of the causes on a standard death certificate for each death. Here too, adequate care was taken to maintain confidentiality. Reviewing physicians applied standard clinical judgment to ascertain causes of death, using diagnostic guidelines similar to those described for use in other settings.32 They also recorded a subjective opinion as to the strength of evidence for each cause. These certificates were also sent to the WHO Collaborating Centre for selection and coding of the underlying cause. Procedures were established by the Centre to ensure that the expert and VA death certificates for the same individual were coded independently.

All information from the routine death certificates, hospital information sheets, expert death certificates, VA questionnaires (separate for neonates, under-fives, and adults), and VA-derived death certificates was entered twice and cleaned in an SAS data management program. Underlying causes from the expert and VA derived death certificates were tabulated at ICD 3 character level, and according to the ICD Mortality Tabulation List 133, as recommended by the WHO. Validation characteristics of VA diagnoses were assessed against categories in this Tabulation List. The parameters measured were sensitivity, specificity, and positive predictive value (PPV), using standard methods, each with 95% confidence intervals. Finally, misclassification was assessed from a matrix of cause attribution of deaths as assigned by the expert review and by VA.

Sensitivity and PPV convey subtly different messages about the ability of VA to identify deaths owing to a particular cause. Sensitivity assesses, from an epidemiological perspective, the probability that the VA correctly diagnoses true deaths from the cause of interest, while PPV, from a clinical perspective, measures the true chance of death from this cause, if diagnosed as such by VA. PPV is dependent on the proportionate distribution of deaths and the observed sensitivity for each cause of interest in the study sample, and this should be borne in mind when extrapolating the findings to make inferences about community-based cause of death patterns.

As a rough guide, VA might be considered to have ‘good’ validity for diseases or conditions for which sensitivity is >75%; for those with sensitivity in the range 50–75%, ‘tolerable’ validity; and when <50%, ‘poor’ validity. These thresholds are admittedly arbitrary, but have some commonsense appeal. For most purposes, a process that correctly diagnoses the cause of death in <50% of cases is unlikely to be of much public health value.

Results

Table 1 provides an overview of key qualitative characteristics of the VA interviews conducted for this study. Several points are worth noting. First, the study sample essentially comprised adult deaths only, and hence we have excluded the 63 childhood deaths, mostly due to perinatal causes, from the subsequent analyses. About two-thirds of the VA interviews were conducted within a year of the event, which may have affected recall bias. Respondents were first-order relatives (parent, spouse, and offspring) in 85% of cases, and most (81%) were present in the household during the illness preceding death. The majority of households had some information as to the cause of death from contact with health services, and of them, about half could produce some medical documents at interview. Validation was, therefore, largely based on responses to the structured questions on symptoms and duration.

Table 1

Qualitative characteristics of VA interviews conducted in the study

Variable
Distribution
Percentage of deaths in sample by age<15 years, 3.2%; 15–44 years, 6.8%; >45 years, 90%
Interval between death and VA<6 months, 13%; 6–12 months, 52%; 12–18 months, 35%
Respondent's relationship with deceasedParent, 19%; spouse, 34%; offspring, 32%; others, 15%
Respondent's presence at deathYes, 81%; No, 19%
Respondent informed by health worker about cause of deathYes, 81%; No, 14%; No recall, 5%
Medical documents available at VAYes, 42%; No, 58%
Variable
Distribution
Percentage of deaths in sample by age<15 years, 3.2%; 15–44 years, 6.8%; >45 years, 90%
Interval between death and VA<6 months, 13%; 6–12 months, 52%; 12–18 months, 35%
Respondent's relationship with deceasedParent, 19%; spouse, 34%; offspring, 32%; others, 15%
Respondent's presence at deathYes, 81%; No, 19%
Respondent informed by health worker about cause of deathYes, 81%; No, 14%; No recall, 5%
Medical documents available at VAYes, 42%; No, 58%
Table 1

Qualitative characteristics of VA interviews conducted in the study

Variable
Distribution
Percentage of deaths in sample by age<15 years, 3.2%; 15–44 years, 6.8%; >45 years, 90%
Interval between death and VA<6 months, 13%; 6–12 months, 52%; 12–18 months, 35%
Respondent's relationship with deceasedParent, 19%; spouse, 34%; offspring, 32%; others, 15%
Respondent's presence at deathYes, 81%; No, 19%
Respondent informed by health worker about cause of deathYes, 81%; No, 14%; No recall, 5%
Medical documents available at VAYes, 42%; No, 58%
Variable
Distribution
Percentage of deaths in sample by age<15 years, 3.2%; 15–44 years, 6.8%; >45 years, 90%
Interval between death and VA<6 months, 13%; 6–12 months, 52%; 12–18 months, 35%
Respondent's relationship with deceasedParent, 19%; spouse, 34%; offspring, 32%; others, 15%
Respondent's presence at deathYes, 81%; No, 19%
Respondent informed by health worker about cause of deathYes, 81%; No, 14%; No recall, 5%
Medical documents available at VAYes, 42%; No, 58%

The operational characteristics of the VA procedure are shown in Table 2 for the 20 causes of death selected for this study. The VA procedures we have used appear to have good validity for transport accidents, stroke, several leading sites of cancer (including lung, liver, stomach, oesophagus, and colorectal cancer), and pneumonia. On the other hand, the VA was much less successful in adequately diagnosing several conditions of major public health importance in China such as IHD, COPD, diabetes, tuberculosis, and diseases of the liver, and performed relatively poorly in accurately diagnosing viral hepatitis, hypertension, and kidney diseases.

Table 2

Validation characteristics of VA procedures for 20 selected causes of adult death in urban China

Cause
ICD codes
MR deaths
VA deaths
Sensitivity
PPV
Specificity
Transport accidentsV01–V99302996.7 (90.2–100)100100
Oesophageal cancerC15263296.1 (88.7–100)78.1 (63.8–92.5)99.7
Stomach cancerC16687189.7 (82.5–96.9)85.9 (77.8–94.0)99.5
Lung cancerC33–C3421221489.1 (85.0–93.3)87.4 (82.9–91.8)98.6
Liver cancerC2210210585.3 (78.4–92.2)81.9 (73.7–88.6)99
Cerebrovascular diseasesl60–l6937839881.5 (77.5–85.4)75.5 (71.3–79.7)93.6
Colorectal cancerC18–C34685675 (64.7–85.3)91.1 (83.6–98.6)99.7
PneumoniaJ12–J18163375 (53.8–96.2)36.4 (19.9–52.8)98.8
Liver diseasesK70–K76568371.4 (59.6–83.4)48.2 (37.4–58.9)97.9
IHDI20–I2520616763.8 (57.0–70.5)74.9 (68.3–81.4)97.8
TuberculosisA15–A16453962.2 (48.0–76.9)71.8 (57.7–85.9)99.3
Other malignant neoplasmsa333860.6 (43.9–77.2)52.6 (36.8–68.5)99
COPDJ40–J4719113959.7 (52.7–66.6)82.0 (75.6–88.4)98.2
DiabetesE10–E14818656.8 (46–67.6)53.5 (42.9–64)97.6
Kidney diseasesN00–N98436553.5 (39.6–61.7)35.3 (19.7–52.9)99.4
Other digestive disordersK00–K22, K28–K66, K80–K92565551.8 (38.7–64.9)52.7 (39.5–65.9)98.7
Hypertensive diseasesI10–I14484541.7 (27.7–55.6)44.4 (29.9–58.9)98.5
Viral hepatitisB15–B19743135.6 (24.6–46.6)83.8 (70.9–96.8)99.7
Other external causesb17635.3 (12.6–58.0)100100
Other respiratory diseasesJ00–J06, J30–J39, J60–J98353431.4 (16.1–46.8)32.3 (16.6–48.1)98.9
Cause
ICD codes
MR deaths
VA deaths
Sensitivity
PPV
Specificity
Transport accidentsV01–V99302996.7 (90.2–100)100100
Oesophageal cancerC15263296.1 (88.7–100)78.1 (63.8–92.5)99.7
Stomach cancerC16687189.7 (82.5–96.9)85.9 (77.8–94.0)99.5
Lung cancerC33–C3421221489.1 (85.0–93.3)87.4 (82.9–91.8)98.6
Liver cancerC2210210585.3 (78.4–92.2)81.9 (73.7–88.6)99
Cerebrovascular diseasesl60–l6937839881.5 (77.5–85.4)75.5 (71.3–79.7)93.6
Colorectal cancerC18–C34685675 (64.7–85.3)91.1 (83.6–98.6)99.7
PneumoniaJ12–J18163375 (53.8–96.2)36.4 (19.9–52.8)98.8
Liver diseasesK70–K76568371.4 (59.6–83.4)48.2 (37.4–58.9)97.9
IHDI20–I2520616763.8 (57.0–70.5)74.9 (68.3–81.4)97.8
TuberculosisA15–A16453962.2 (48.0–76.9)71.8 (57.7–85.9)99.3
Other malignant neoplasmsa333860.6 (43.9–77.2)52.6 (36.8–68.5)99
COPDJ40–J4719113959.7 (52.7–66.6)82.0 (75.6–88.4)98.2
DiabetesE10–E14818656.8 (46–67.6)53.5 (42.9–64)97.6
Kidney diseasesN00–N98436553.5 (39.6–61.7)35.3 (19.7–52.9)99.4
Other digestive disordersK00–K22, K28–K66, K80–K92565551.8 (38.7–64.9)52.7 (39.5–65.9)98.7
Hypertensive diseasesI10–I14484541.7 (27.7–55.6)44.4 (29.9–58.9)98.5
Viral hepatitisB15–B19743135.6 (24.6–46.6)83.8 (70.9–96.8)99.7
Other external causesb17635.3 (12.6–58.0)100100
Other respiratory diseasesJ00–J06, J30–J39, J60–J98353431.4 (16.1–46.8)32.3 (16.6–48.1)98.9

a C17, C23–C24, C26–C31, C37–C41, C44–C49, C51–C52, C57–C60, C62–C66, C68–C69, C73–C81, C88, C96–C97.

b W20–W64, W75–W99, X10–X39, X50–X59, Y10–Y89.

Table 2

Validation characteristics of VA procedures for 20 selected causes of adult death in urban China

Cause
ICD codes
MR deaths
VA deaths
Sensitivity
PPV
Specificity
Transport accidentsV01–V99302996.7 (90.2–100)100100
Oesophageal cancerC15263296.1 (88.7–100)78.1 (63.8–92.5)99.7
Stomach cancerC16687189.7 (82.5–96.9)85.9 (77.8–94.0)99.5
Lung cancerC33–C3421221489.1 (85.0–93.3)87.4 (82.9–91.8)98.6
Liver cancerC2210210585.3 (78.4–92.2)81.9 (73.7–88.6)99
Cerebrovascular diseasesl60–l6937839881.5 (77.5–85.4)75.5 (71.3–79.7)93.6
Colorectal cancerC18–C34685675 (64.7–85.3)91.1 (83.6–98.6)99.7
PneumoniaJ12–J18163375 (53.8–96.2)36.4 (19.9–52.8)98.8
Liver diseasesK70–K76568371.4 (59.6–83.4)48.2 (37.4–58.9)97.9
IHDI20–I2520616763.8 (57.0–70.5)74.9 (68.3–81.4)97.8
TuberculosisA15–A16453962.2 (48.0–76.9)71.8 (57.7–85.9)99.3
Other malignant neoplasmsa333860.6 (43.9–77.2)52.6 (36.8–68.5)99
COPDJ40–J4719113959.7 (52.7–66.6)82.0 (75.6–88.4)98.2
DiabetesE10–E14818656.8 (46–67.6)53.5 (42.9–64)97.6
Kidney diseasesN00–N98436553.5 (39.6–61.7)35.3 (19.7–52.9)99.4
Other digestive disordersK00–K22, K28–K66, K80–K92565551.8 (38.7–64.9)52.7 (39.5–65.9)98.7
Hypertensive diseasesI10–I14484541.7 (27.7–55.6)44.4 (29.9–58.9)98.5
Viral hepatitisB15–B19743135.6 (24.6–46.6)83.8 (70.9–96.8)99.7
Other external causesb17635.3 (12.6–58.0)100100
Other respiratory diseasesJ00–J06, J30–J39, J60–J98353431.4 (16.1–46.8)32.3 (16.6–48.1)98.9
Cause
ICD codes
MR deaths
VA deaths
Sensitivity
PPV
Specificity
Transport accidentsV01–V99302996.7 (90.2–100)100100
Oesophageal cancerC15263296.1 (88.7–100)78.1 (63.8–92.5)99.7
Stomach cancerC16687189.7 (82.5–96.9)85.9 (77.8–94.0)99.5
Lung cancerC33–C3421221489.1 (85.0–93.3)87.4 (82.9–91.8)98.6
Liver cancerC2210210585.3 (78.4–92.2)81.9 (73.7–88.6)99
Cerebrovascular diseasesl60–l6937839881.5 (77.5–85.4)75.5 (71.3–79.7)93.6
Colorectal cancerC18–C34685675 (64.7–85.3)91.1 (83.6–98.6)99.7
PneumoniaJ12–J18163375 (53.8–96.2)36.4 (19.9–52.8)98.8
Liver diseasesK70–K76568371.4 (59.6–83.4)48.2 (37.4–58.9)97.9
IHDI20–I2520616763.8 (57.0–70.5)74.9 (68.3–81.4)97.8
TuberculosisA15–A16453962.2 (48.0–76.9)71.8 (57.7–85.9)99.3
Other malignant neoplasmsa333860.6 (43.9–77.2)52.6 (36.8–68.5)99
COPDJ40–J4719113959.7 (52.7–66.6)82.0 (75.6–88.4)98.2
DiabetesE10–E14818656.8 (46–67.6)53.5 (42.9–64)97.6
Kidney diseasesN00–N98436553.5 (39.6–61.7)35.3 (19.7–52.9)99.4
Other digestive disordersK00–K22, K28–K66, K80–K92565551.8 (38.7–64.9)52.7 (39.5–65.9)98.7
Hypertensive diseasesI10–I14484541.7 (27.7–55.6)44.4 (29.9–58.9)98.5
Viral hepatitisB15–B19743135.6 (24.6–46.6)83.8 (70.9–96.8)99.7
Other external causesb17635.3 (12.6–58.0)100100
Other respiratory diseasesJ00–J06, J30–J39, J60–J98353431.4 (16.1–46.8)32.3 (16.6–48.1)98.9

a C17, C23–C24, C26–C31, C37–C41, C44–C49, C51–C52, C57–C60, C62–C66, C68–C69, C73–C81, C88, C96–C97.

b W20–W64, W75–W99, X10–X39, X50–X59, Y10–Y89.

Table 3 shows the misclassification matrix for leading causes with ‘low’ or ‘tolerable’ sensitivity scores. For each disease, the rows of the matrix indicate the total number of deaths identified as being due to that cause from the VA (VA diagnoses), while the columns show to which cause each of these deaths was assigned from the review of medical records (MR diagnosis) (i.e. ‘truth’). Thus, for example, IHD caused 206 of the 2102 matched cases in the sample, but only 128 (64%) were correctly identified as such by the VA. Twenty others (10%) were identified by the VA as due to stroke, nine (5%) to hypertensive diseases, and a somewhat smaller fractions to diabetes, COPD, and a variety of other causes. At the population level, however, the misclassification appears largely compensatory, since an additional 39 deaths were coded to this cause by the VA, but were in reality owing to some other cause, i.e. stroke, hypertensive diseases, COPD, and diabetes.

Table 3

Misclassification matrix for major diseases, urban China 2002

VA diagnosesMR diagnoses

Cerebrovascular diseases
IHD
Hypertensive diseases
Diabetes
Kidney diseases
COPD
Pnuemonia
TB
Liver diseases
Viral hepatitis
Ill-defined causes
All other causes
Total VA deaths
Cerebrovascular diseases3082061121411134398
IHD81287515112167
Hypertensive diseases792043245
Diabetes12744611686
Kidney diseases335531421265
COPD571111438139
Pneumonia42161211633
Tuberculosis1528539
Liver diseases214032883
Viral hepatitis426131
Ill-defined causes15724341143677
All other causes1622242272118110834939
Total MR deaths3782064881431911645567409642102
VA diagnosesMR diagnoses

Cerebrovascular diseases
IHD
Hypertensive diseases
Diabetes
Kidney diseases
COPD
Pnuemonia
TB
Liver diseases
Viral hepatitis
Ill-defined causes
All other causes
Total VA deaths
Cerebrovascular diseases3082061121411134398
IHD81287515112167
Hypertensive diseases792043245
Diabetes12744611686
Kidney diseases335531421265
COPD571111438139
Pneumonia42161211633
Tuberculosis1528539
Liver diseases214032883
Viral hepatitis426131
Ill-defined causes15724341143677
All other causes1622242272118110834939
Total MR deaths3782064881431911645567409642102
Table 3

Misclassification matrix for major diseases, urban China 2002

VA diagnosesMR diagnoses

Cerebrovascular diseases
IHD
Hypertensive diseases
Diabetes
Kidney diseases
COPD
Pnuemonia
TB
Liver diseases
Viral hepatitis
Ill-defined causes
All other causes
Total VA deaths
Cerebrovascular diseases3082061121411134398
IHD81287515112167
Hypertensive diseases792043245
Diabetes12744611686
Kidney diseases335531421265
COPD571111438139
Pneumonia42161211633
Tuberculosis1528539
Liver diseases214032883
Viral hepatitis426131
Ill-defined causes15724341143677
All other causes1622242272118110834939
Total MR deaths3782064881431911645567409642102
VA diagnosesMR diagnoses

Cerebrovascular diseases
IHD
Hypertensive diseases
Diabetes
Kidney diseases
COPD
Pnuemonia
TB
Liver diseases
Viral hepatitis
Ill-defined causes
All other causes
Total VA deaths
Cerebrovascular diseases3082061121411134398
IHD81287515112167
Hypertensive diseases792043245
Diabetes12744611686
Kidney diseases335531421265
COPD571111438139
Pneumonia42161211633
Tuberculosis1528539
Liver diseases214032883
Viral hepatitis426131
Ill-defined causes15724341143677
All other causes1622242272118110834939
Total MR deaths3782064881431911645567409642102

COPD has been reported as a leading cause of death in China,34,35 yet only 59% of COPD deaths in this sample were correctly identified as such by the VA, which misclassified several true COPD deaths as deaths due to stroke (7%) and diabetes (6%). The PPV for COPD was higher (82%), with relatively few strokes and IHD deaths being identified as COPD by the VA. Conversely, despite good sensitivity for pneumonia (75%), the VA misclassified deaths due to COPD (six deaths), stroke, IHD, and several other causes in smaller numbers to this cause, resulting in a poor PPV of 36%. Thus any VA diagnosis of pneumonia among adults in China should prompt a more detailed investigation for a more specific underlying cause.

There is considerable overlap and misclassification of diagnoses between viral hepatitis and chronic liver diseases, which is understandable given the bluntness of VA in differentiating between these conditions. Conversely, cause attribution by the VA for injury deaths was generally very good, as might be expected. The exception was deaths due to falls, for which the sensitivity was 78%, with four cases being misclassified by the VA to cerebrovascular disease.

Discussion

The need for cause of death information to support policy and programme development, and to monitor disease and injury control strategies, is clear. However, given the costs of establishing and maintaining adequate vital registration systems, most developing countries will be dependent for some time on a system of VA to provide information on the main causes of death in their populations, and the way in which these are changing. To be truly useful for public health purposes, the operational characteristics and biases associated with the specific VA procedures in use need to be clearly understood.36 Validation studies of the type reported here for urban China represent the best and possibly only means to assess confidence in the results of registration systems, which are largely based on VA.

A key element in these studies is the process for determining the reference diagnoses for assessing validity. Diagnoses from medical records are subject to their own information biases, in terms of the nature and strength of clinical, laboratory, or imaging evidence supporting them. Further, there are several nuances in the translation of evidence from medical records into diagnostic labels on the various sections of the death certificate. In this study, we employed a detailed protocol to judge and record the quality of supporting evidence for each case, and applied the principles of cause of death certification prescribed by the ICD.

The chronicity of disease and complex symptomatology makes the identification of a unique underlying cause for each death a particular challenge for many adult deaths. In several instances, despite the presence of the same multiple causes (e.g. COPD and stroke) on the death certificate from both the MR and VA for an individual death, coding experts differed in selection of the underlying cause from each study arm, resulting in a discordant pair. About 10% of all unmatched pairs were owing to such differences in judgement. These problems in the ascertainment of the reference diagnosis could have important implications for measuring the validity of the VA.

For three leading causes of death in China, namely stroke, IHD, and COPD, only 60–70% of deaths identified by the VA in this study sample as owing to these causes actually are. The VA procedures incorrectly assign other diagnoses to these specific diseases, but seemingly equally incorrectly, assign actual deaths from these diseases to other causes. As a result, while the sensitivity of the VA procedure is generally only modest for these causes, the population level estimates of cause-specific mortality from them is much less distorted, owing to compensatory patterns of misclassification.

Significantly, our results suggest that VA does not perform adequately for several other important causes, including diabetes, hypertensive disease, and diseases of the kidney and liver. For these diseases, symptoms are either not well reported, misunderstood, or cannot be easily differentiated from those leading to other causes. If there is symptom overlap that makes it difficult to distinguish between diseases, the validity of the VA will be affected. Further research is required to identify specific reasons for the poor performance of VA for these diseases.

There is some evidence to suggest that although there might well be poor validity for individual causes, combining causes with common aetiological or risk factor profiles vastly improves sensitivity and PPVs. For instance, if the categories of diabetes, IHD, stroke and hypertensive heart, and renal diseases, having common risk factors (i.e. diet, smoking, hypertension, and obesity) are considered together, the sensitivity for this combined group is ∼85%, and PPV also increases. Similar improvements in sensitivity are also observed on combining viral hepatitis and liver diseases. Although, this does not necessarily inspire confidence in VA as a method to correctly identify the true cause of death, it also does not invalidate the method as an approach to estimating population level cause of death patterns, at least for broad causes, within which such compensatory classification typically occurs. These findings would tend to underscore the public health utility of VA, in terms of its efficiency in identifying deaths due to disease clusters, which are principal causes of adult mortality, and thus the target of common public health responses. Further research testing specific questions or algorithms could improve the accuracy of the VA in identifying specific causes, resulting in better individual level cause of death assessment, a matter of importance for specific epidemiological research.

Another factor affecting the validity of VA is the availability of information on the cause of death from the household. In almost 60% of deaths due to cancers, for example, the diagnosis was conveyed to the respondents by health professionals. This information led to an increase in sensitivity for cancers of ∼15% compared with households where this information was not known. Similar differences in disease-specific sensitivities were also observed for cerebrovascular diseases, but not for infectious diseases, respiratory disease, or injuries. These findings suggest that contact with health services can substantially affect the information content and hence the validity of VA, at least for some conditions.

Overall, the VA procedures classified only ∼4% of all cases in this sample to ill-defined categories, reflecting deaths where either the respondent was not familiar with the deceased's illness, where there was confusing symptomatology, or perhaps poor interviewing skills. This percentage is low, and is perfectly acceptable given the blunt nature of the VA process.

For the reasons listed earlier, the study was carried out in health facilities in urban China, and the results are, therefore, only applicable at best to the one-third of Chinese living in urban areas. Strictly speaking, however, this study was intended to determine the operational characteristics and validity of VA procedures and not the reliability of routine data collection in urban areas. The study sample was purposively selected to assess validity for the major causes of death of interest in urban China, which may not necessarily be representative of the disease prevalence mix in the community. This restricts the generalizability of the study findings in urban areas.

What is of greater interest is the extent to which the validation characteristics from this urban sample can be generalized to the vastly greater number of deaths occurring in rural areas where separate validation studies are not feasible owing to the fact that the vast majority of deaths occur at home without adequate medical records. Although it is difficult to draw conclusions about the validity of rural mortality data in China without first validating the VA instrument in rural populations, the VA methods reported here show some promise, given the broadly satisfactory performance in urban areas for several important causes, or at least disease clusters. Given that 70% or so of China's population still live in rural areas, innovative methods to assess the accuracy of rural cause of death data need to be rapidly developed as a matter of priority if we are to have more confidence in assessments of national mortality patterns in China, and by implication, the world.

This work has been sponsored by the National Institute of Aging Grant 1-PO1-AG17625.

References

1

Etard JF, Le Hesran JY, Diallo A, Diallo JP, Ndiaye JL, Delaunay V. Childhood mortality and probable causes of death using verbal autopsy in Niakhar, Senegal, 1989–2000.

Int J Epidemiol
2004
;
33
:
1286
–92.

2

Genton B, al-Yaman F, Beck HP et al. The epidemiology of malaria in the Wosera area, East Sepik Province, Papua New Guinea, in preparation for vaccine trials. II. Mortality and morbidity.

Ann Trop Med Parasitol
1995
;
89
:
377
–90.

3

Meddings DR, Ronald LA, Marion S, Pinera JF, Oppliger A. Cost effectiveness of a latrine revision programme in Kabul, Afghanistan.

Bull World Health Organ
2004
;
82
:
281
–89.

4

Setel P, Sankoh O, Rao C, Velkoff V, Mathers CD, Yang G et al. Sample Registration of vital events with verbal autopsy: a renewed commitment to measuring and monitoring vital statistics.

Bull World Health Organ
2005
;
83
:
611
–17.

5

Chandramohan D, Maude GH, Rodrigues LC, Hayes RJ. Verbal autopsies for adult deaths: issues in their development and validation.

Int J Epidemiol
1994
;
23
:
213
–22.

6

Iriya N, Manji KP, Mbise RL. Verbal autopsy in establishing cause of perinatal death.

East Afr Med J
2002
;
79
:
82
–84.

7

Kahn K, Tollman SM, Garenne M, Gear JS. Validation and application of verbal autopsies in a rural area of South Africa.

Trop Med Int Health
2000
;
5
:
824
–31.

8

Mobley CC, Boerma JT, Titus S, Lohrke B, Shangula K, Black RE. Validation study of a verbal autopsy method for causes of childhood mortality in Namibia.

J Trop Pediatr
1996
;
42
:
365
–69.

9

Snow RW, Armstrong JR, Forster D et al. Childhood deaths in Africa: uses and limitations of verbal autopsies.

Lancet
1992
;
340
:
351
–55.

10

Vitamin A supplementation in northern Ghana: effects on clinic attendances, hospital admissions, and child mortality. Ghana VAST Study Team.

Lancet
1993
;
342
:
7
–12.

11

Benara SK, Singh P. Validity of causes of infant death by verbal autopsy.

Indian J Pediatr
1999
;
66
:
647
–50.

12

Marsh DR, Sadruddin S, Fikree FF, Krishnan C, Darmstadt GL. Validation of verbal autopsy to determine the cause of 137 neonatal deaths in Karachi, Pakistan.

Paediatr Perinat Epidemiol
2003
;
17
:
132
–42.

13

Coldham C, Ross D, Quigley M, Segura Z, Chandramohan D. Prospective validation of a standardized questionnaire for estimating childhood mortality and morbidity due to pneumonia and diarrhoea.

Trop Med Int Health
2000
;
5
:
134
–44.

14

Quigley MA, Armstrong Schellenberg JR, Snow RW. Algorithms for verbal autopsies: a validation study in Kenyan children.

Bull World Health Organ
1996
;
74
:
147
–54.

15

Rodriguez L, Reyes H, Tome P, Ridaura C, Flores S, Guiscafre H. Validation of the verbal autopsy method to ascertain acute respiratory infection as cause of death.

Indian J Pediatr
1998
;
65
:
579
–84.

16

Kamali A, Wagner HU, Nakiyingi J, Sabiiti I, Kengeya-Kayondo JF, Mulder DW. Verbal autopsy as a tool for diagnosing HIV-related adult deaths in rural Uganda.

Int J Epidemiol
1996
;
25
:
679
–84.

17

Chandramohan D, Rodrigues LC, Maude GH, Hayes RJ. The validity of verbal autopsies for assessing the causes of institutional maternal death.

Stud Fam Plann
1998
;
29
:
414
–22.

18

Walker GJ, Ashley DE, McCaw AM, Bernard GW. Maternal mortality in Jamaica.

Lancet
1986
;
1
:
486
–88.

19

Fauveau V, Koenig MA, Chakraborty J, Chowdhury AI. Causes of maternal mortality in rural Bangladesh, 1976–85.

Bull World Health Organ
1988
;
66
:
643
–51.

20

Ronsmans C, Vanneste AM, Chakraborty J, Van Ginneken J. A comparison of three verbal autopsy methods to ascertain levels and causes of maternal deaths in Matlab, Bangladesh.

Int J Epidemiol
1998
;
27
:
660
–66.

21

Gajalakshmi V, Peto R, Kanaka S, Balasubramanian S. Verbal autopsy of 48 000 adult deaths attributable to medical causes in Chennai (formerly Madras), India.

BMC Public Health
2002
;
2
:
7
.

22

Mahapatra P, Rao C. Causes of death in rural areas of Andhra Pradesh, 1998. Working Paper 40/2000. Hyderabad, India: Institute of Health Systems,

2000
.

23

Chanpen C. Thailand's national death registration reform: verifying causes of death between July 1997 and December 1999.

Asia Pac J Public Health
2005
; (Accepted for publication).

24

Khoury SA, Massad D, Fardous T. Mortality and causes of death in Jordan 1995–96: assessment by verbal autopsy.

Bull World Health Organ
1999
;
77
:
641
–50.

25

Gakidou E, Hogan M, Lopez AD. Adult mortality: time for a reappraisal.

Int J Epidemiol
2004
;
33
:
710
–17.

26

Beaglehole R, Yach D. Globalisation and the prevention and control of non-communicable disease: the neglected chronic diseases of adults.

Lancet
2003
;
362
:
903
–8.

27

Mathers CD, Ma Fat D, Inoue M, Rao C, Lopez AD. Counting the dead and what they died of: an assessment of the global status of cause of death data.

Bull World Health Organ
2005
;
83
:
171
–77.

28

Yang G, Hu J, Rao KQ, Ma J, Rao C, Lopez AD. Mortality registration and surveillance in China: History, current situation and challenges.

Popul Health Metr
2005
;
3
:
3
.

29

World Health Organization. Mortality: guidelines for certification and rules for coding. In: International Statistical Classification of Diseases and Health Related Problems—Tenth Revision. Volume 2: Instruction Manual. Geneva: World Health Organization,

1993
, pp. 30–65.

30

World Health Organization. Statistical presentation. In: International Statistical Classification of Diseases and Health Related ProblemsTenth Revision. Volume 2: Instruction Manual. Geneva: World Health Organization,

1993
, pp. 124–38.

31

AMMP-Tanzania. Adult mortality and morbidity project. Available at: http://www.ncl.ac.uk/ammp/tools_methods/index.html (Accessed 2003).

32

SRS Academic Partners. Manual for Assigning Causes of Death from Verbal Autopsy. New Delhi: Registrar General of India,

2005
.

33

World Health Organization. Special tabulation lists for mortality and morbidity; Mortality tabulation list 1. In: International Statistical Classification of Diseases and Health Related Problems—Tenth Revision. Volume 1. Geneva: World Health Organization,

1993
, pp. 1207–10.

34

Murray CJL, Lopez AD. The Global Burden of Disease : a Comprehensive Assessment of Mortality and Disability from Diseases, Injuries, and Risk Factors in 1990 and Projected to 2020. Cambridge, MA: Published by the Harvard School of Public Health on behalf of the World Health Organization and the World Bank; Distributed by Harvard University Press,

1996
.

35

Lopez AD. Counting the dead in China. Measuring tobacco's impact in the developing world.

BMJ
1998
;
317
:
1399
–400.

36

Rao C, Lopez AD, Yang GH, Begg S, Ma J. Evaluating national cause of death statistics: Principles and application to the case of China.

Bull World Health Organ
2005
;
83
:
618
–25.