Evaluation of narrative text for case finding: the need for accuracy measurement

P A McCullough; G S Smith

doi:10.1002/(sici)1097-0274(199808)34:2<133::aid-ajim5>3.0.co;2-y

Evaluation of narrative text for case finding: the need for accuracy measurement

Am J Ind Med. 1998 Aug;34(2):133-6. doi: 10.1002/(sici)1097-0274(199808)34:2<133::aid-ajim5>3.0.co;2-y.

Authors

P A McCullough¹, G S Smith

Affiliation

¹ Center for Clinical Effectiveness, Henry Ford Health System, Detroit, MI 48202, USA. pmc@mich.com or pmccull1@smtpgw.is.hfh.edu

PMID: 9651622
DOI: 10.1002/(sici)1097-0274(199808)34:2<133::aid-ajim5>3.0.co;2-y

Abstract

This article reviews the analysis of a narrative text electronic search technique being used in the insurance industry. We reviewed a previously published study of motor vehicle crashes in roadway construction workzones as well as additional data supplied by the authors with respect to the methods of keyword selection. The narrative text search technique was evaluated with decision statistics and was found to have a sensitivity of 92.3%, 95% confidence interval 67.5%-99.6%. This range of sensitivity, at its most extreme value, led to a 32.5% underestimation of claims prevalence. Furthermore, because the electronic search developed two classification categories from a limited text field (approximately 20 words), only half of the cases had at least one classification. Systematic error estimates were used to obtain true population proportions for crash characteristics, revealing significant underestimations in costs. This analysis highlights the need for investigators to apply decision statistics to narrative text searching techniques when they are used essentially as diagnostic test procedures on insurance claims datasets.

MeSH terms

Accidents, Occupational / statistics & numerical data*
Accidents, Traffic / statistics & numerical data*
Confidence Intervals
Database Management Systems
Databases, Factual / statistics & numerical data*
Humans
Insurance, Liability
Population Surveillance / methods*
Predictive Value of Tests
Sensitivity and Specificity
United States