Validating ICD coding algorithms for diabetes mellitus from administrative data

https://doi.org/10.1016/j.diabres.2010.03.007Get rights and content

Abstract

Aim

To assess validity of diabetes International Classification of Disease (ICD) 9 and 10 coding algorithms from administrative data using physicians’ charts as the ‘gold standard’ across time periods and geographic regions.

Methods

From 48 urban and 16 rural general practitioners’ clinics in Alberta and British Columbia, Canada, we randomly selected 50 patient charts/clinic for those who visited the clinic in either 2001 or 2004. Reviewed chart data were linked with inpatient discharge abstract and physician claims administrative data. We identified patients with diabetes in the administrative databases using ICD-9 code 250.xx and ICD-10 codes E10.x–E14.x.

Results

The prevalence of diabetes was 8.1% among clinic charts. The coding algorithm of “2 physician claims within 2 years or 1 hospitalization with the relevant diabetes ICD codes” had higher validity than other 7 algorithms assessed (sensitivity 92.3%, specificity 96.9%, positive predictive value 77.2%, and negative predictive value 99.3%). After adjustment for age, sex, and comorbid conditions, sensitivity and positive predictive values were not significantly different between time periods and regions.

Conclusion

Diabetes could be accurately identified in administrative data using the following case definition “2 physician claims within 2 years or 1 hospital discharge abstract record with diagnosis codes 250.xx or E10.x–E14.x”.

Introduction

Diabetes prevalence has increased substantially in the past two decades, and is a major cause of morbidity and mortality [1], [2]. As a leading cause of blindness, end-stage renal disease and cardiovascular disease, diabetes poses a major challenge to the healthcare system, and weighs heavily on patients and their families [3], [4]. Therefore, diabetes has been widely studied for projecting population incidence, identifying high-risk groups and evaluating prevention and control initiatives for reducing the disease and its complications [5], [6].

In recent years administrative data have been widely used to conduct large-scale diabetes studies as the data are considerably less expensive and ready to access than conducting population based surveys. As the results derived from administrative data depend on the data quality, validity of administrative data in recording diabetes has been evaluated [7], [8], [9]. However, these validation studies have been mainly conducted in metropolitan areas, at a fixed time period, and from one diagnosis coding system, such as the International Classification of Disease (ICD) 9 database.

Completeness of administrative data depends on healthcare system utilization because the data records patients who have used the system. Health services utilization is determined by predisposing factors of demographic characteristics, enabling factors such as geographic location and health insurance and need factors such as the presence of chronic diseases [10]. Thus, the validity of diabetes ICD coding algorithms may vary by these factors. We conducted this study to investigate the validation of diabetes case definitions for ICD-9/ICD-10 administrative data across time periods and geographic areas after adjustment for patient demographic characteristics and chronic diseases.

Section snippets

Chart review data

Charts of general practitioners (GPs) were employed as the reference standard. GP's charts were selected through a two stage sampling process.

The first stage was GP recruitment. A list of GPs in the provinces of Alberta and British Columbia, Canada was obtained from the provincial licensing physician directories. GPs were randomly selected from urban (i.e. Calgary with approximately 1.1 million population and Vancouver with approximately 2.1 million population), and rural (defined as less than

Results

From the chart review, 36.3% patients were male, and the mean age was 52.8 years. A total of 1642 (48.8%) individuals had at least one comorbid condition other than diabetes. Prevalence of diabetes was 8.1% in the chart review data (see Table 1).

The 8 ICD algorithms for defining diabetes (see Table 2) had a high sensitivity (ranging from 91.2% to 95.6%), specificity (ranging from 92.8% to 97.6%), and NPV (ranging from 99.2% to 99.6%) when 3-year data were analyzed. The algorithm of “2 physician

Discussion

We validated diabetes recorded in administration data and found that the algorithm of “2 physician claims within 2 years or 1 hospitalization with ICD-9 code of 250.xx or ICD-10 code of E10.x–E14.x” had high validity. The validity remained similar across time periods and regions. To optimize the validity, at least 3-year data was needed.

Our findings were consistent with some previous studies’ reports. Saydah et al. [7] reviewed validation studies of administrative data in recording diabetes,

Conflict of interest

All authors of this paper have no conflicts of interest to declare.

Acknowledgements

The data collection was funded by Canadian Institutes of Health Research. Dr. Quan is supported by Alberta Innovate-Health Solution salary award. Dr. Khan is supported by a New Investigator Award from the Canadian Institutes of Health Research.

References (15)

There are more references available in the full text version of this article.

Cited by (0)

View full text