An approach to identifying incident breast cancer cases using Medicare claims data

Jean L. Freeman, Dong Zhang, Daniel H. Freeman, James Goodwin

Research output: Contribution to journalArticle

91 Citations (Scopus)

Abstract

This study developed and evaluated a method for ascertaining a newly diagnosed breast cancer case using multiple sources of data from the Medicare claims system. Predictors of an incident case were operationally defined as codes for breast cancer-related diagnoses and procedures from hospital inpatient, hospital outpatient, and physician claims. The optimal combination of predictors was then determined from a logistic regression model using 1992 data from the linked SEER registries-Medicare claims data base and a sample of noncancer controls drawn from the SEER areas. While the ROC curve demonstrates that the model can produce levels of sensitivity and specificity above 90%, the positive predictive value is comparatively low (67-70%). This low predictive value is largely the result of the model's limitation in distinguishing recurrent and secondary malignancies from incident cases and possibly from the model identifying true incident cases not identified by SEER. Nevertheless, the logistic regression approach is a useful method for ascertaining incident cases because it allows for greater flexibility in changing the performance characteristics by selecting different cut-points depending on the application (e.g., high sensitivity for registry validation, high specificity for outcomes research). It also allows us to make specific adjustments to population based estimates of breast cancer incidence with claims. Copyright (C) 2000 Elsevier Science Inc.

Original languageEnglish (US)
Pages (from-to)605-614
Number of pages10
JournalJournal of Clinical Epidemiology
Volume53
Issue number6
DOIs
StatePublished - Jun 2000

Fingerprint

Medicare
Logistic Models
Breast Neoplasms
Registries
Information Storage and Retrieval
ROC Curve
Inpatients
Outpatients
Outcome Assessment (Health Care)
Databases
Physicians
Sensitivity and Specificity
Incidence
Population
Neoplasms

Keywords

  • Breast neoplasms
  • Incidence
  • Medicare
  • Registries
  • Sensitivity and specificity

ASJC Scopus subject areas

  • Medicine(all)
  • Public Health, Environmental and Occupational Health
  • Epidemiology

Cite this

An approach to identifying incident breast cancer cases using Medicare claims data. / Freeman, Jean L.; Zhang, Dong; Freeman, Daniel H.; Goodwin, James.

In: Journal of Clinical Epidemiology, Vol. 53, No. 6, 06.2000, p. 605-614.

Research output: Contribution to journalArticle

Freeman, Jean L. ; Zhang, Dong ; Freeman, Daniel H. ; Goodwin, James. / An approach to identifying incident breast cancer cases using Medicare claims data. In: Journal of Clinical Epidemiology. 2000 ; Vol. 53, No. 6. pp. 605-614.
@article{b6ae2ec319f340b099a9d093ff5505a8,
title = "An approach to identifying incident breast cancer cases using Medicare claims data",
abstract = "This study developed and evaluated a method for ascertaining a newly diagnosed breast cancer case using multiple sources of data from the Medicare claims system. Predictors of an incident case were operationally defined as codes for breast cancer-related diagnoses and procedures from hospital inpatient, hospital outpatient, and physician claims. The optimal combination of predictors was then determined from a logistic regression model using 1992 data from the linked SEER registries-Medicare claims data base and a sample of noncancer controls drawn from the SEER areas. While the ROC curve demonstrates that the model can produce levels of sensitivity and specificity above 90{\%}, the positive predictive value is comparatively low (67-70{\%}). This low predictive value is largely the result of the model's limitation in distinguishing recurrent and secondary malignancies from incident cases and possibly from the model identifying true incident cases not identified by SEER. Nevertheless, the logistic regression approach is a useful method for ascertaining incident cases because it allows for greater flexibility in changing the performance characteristics by selecting different cut-points depending on the application (e.g., high sensitivity for registry validation, high specificity for outcomes research). It also allows us to make specific adjustments to population based estimates of breast cancer incidence with claims. Copyright (C) 2000 Elsevier Science Inc.",
keywords = "Breast neoplasms, Incidence, Medicare, Registries, Sensitivity and specificity",
author = "Freeman, {Jean L.} and Dong Zhang and Freeman, {Daniel H.} and James Goodwin",
year = "2000",
month = "6",
doi = "10.1016/S0895-4356(99)00173-0",
language = "English (US)",
volume = "53",
pages = "605--614",
journal = "Journal of Clinical Epidemiology",
issn = "0895-4356",
publisher = "Elsevier USA",
number = "6",

}

TY - JOUR

T1 - An approach to identifying incident breast cancer cases using Medicare claims data

AU - Freeman, Jean L.

AU - Zhang, Dong

AU - Freeman, Daniel H.

AU - Goodwin, James

PY - 2000/6

Y1 - 2000/6

N2 - This study developed and evaluated a method for ascertaining a newly diagnosed breast cancer case using multiple sources of data from the Medicare claims system. Predictors of an incident case were operationally defined as codes for breast cancer-related diagnoses and procedures from hospital inpatient, hospital outpatient, and physician claims. The optimal combination of predictors was then determined from a logistic regression model using 1992 data from the linked SEER registries-Medicare claims data base and a sample of noncancer controls drawn from the SEER areas. While the ROC curve demonstrates that the model can produce levels of sensitivity and specificity above 90%, the positive predictive value is comparatively low (67-70%). This low predictive value is largely the result of the model's limitation in distinguishing recurrent and secondary malignancies from incident cases and possibly from the model identifying true incident cases not identified by SEER. Nevertheless, the logistic regression approach is a useful method for ascertaining incident cases because it allows for greater flexibility in changing the performance characteristics by selecting different cut-points depending on the application (e.g., high sensitivity for registry validation, high specificity for outcomes research). It also allows us to make specific adjustments to population based estimates of breast cancer incidence with claims. Copyright (C) 2000 Elsevier Science Inc.

AB - This study developed and evaluated a method for ascertaining a newly diagnosed breast cancer case using multiple sources of data from the Medicare claims system. Predictors of an incident case were operationally defined as codes for breast cancer-related diagnoses and procedures from hospital inpatient, hospital outpatient, and physician claims. The optimal combination of predictors was then determined from a logistic regression model using 1992 data from the linked SEER registries-Medicare claims data base and a sample of noncancer controls drawn from the SEER areas. While the ROC curve demonstrates that the model can produce levels of sensitivity and specificity above 90%, the positive predictive value is comparatively low (67-70%). This low predictive value is largely the result of the model's limitation in distinguishing recurrent and secondary malignancies from incident cases and possibly from the model identifying true incident cases not identified by SEER. Nevertheless, the logistic regression approach is a useful method for ascertaining incident cases because it allows for greater flexibility in changing the performance characteristics by selecting different cut-points depending on the application (e.g., high sensitivity for registry validation, high specificity for outcomes research). It also allows us to make specific adjustments to population based estimates of breast cancer incidence with claims. Copyright (C) 2000 Elsevier Science Inc.

KW - Breast neoplasms

KW - Incidence

KW - Medicare

KW - Registries

KW - Sensitivity and specificity

UR - http://www.scopus.com/inward/record.url?scp=0034042848&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0034042848&partnerID=8YFLogxK

U2 - 10.1016/S0895-4356(99)00173-0

DO - 10.1016/S0895-4356(99)00173-0

M3 - Article

VL - 53

SP - 605

EP - 614

JO - Journal of Clinical Epidemiology

JF - Journal of Clinical Epidemiology

SN - 0895-4356

IS - 6

ER -