Background Numerous public health studies, especially in the area of violence, examine the effects of contextual or group-level factors on health outcomes. Often, these contextual factors exhibit strong pairwise correlations, which pose a challenge when these factors are included as covariates in a statistical model. Such models may be characterised by inflated standard errors and unstable parameter estimates that may fluctuate drastically from sample to sample, where the excessive estimation variability is reflected by inflated standard errors.
Methods We propose a three-stage approach for analysing correlated contextual factors that proceeds as follows: (1) a principal components analysis (PCA) is performed on the original set of correlated variables, (2) the primary generated principal components are included in a multilevel multivariable model and (3) the estimated parameters for these components are transformed into estimates for each of the original contextual factors. Using school violence data, we examined the associations between school crime and correlated contextual school factors (ie, English proficiency, academic performance, pupil to teacher ratio, average class size and children on free and reduced meals).
Results From models ignoring correlations, school crime was not reliably associated with any of the contextual school factors. When models were fit with principal components, school crime was found to be positively associated with a school’s student to teacher ratio, average classroom size and academic performance but negatively associated with the proportion of children who were on free and reduced meals.
Conclusion Our multistep approach is one way to address multicollinearity encountered in social epidemiological studies of violence.
Statistics from Altmetric.com
Contributors MR conceived the study and directed the data collection. MR, GC, JEF and JEC designed the study and analysed the data. MR, GC, CP-A, JEF and JEC interpreted findings, and wrote and revised the manuscript. All authors approved the final submitted manuscript.
Funding This work was supported by grants K01-CD000196 and R49CE003095 from the Centers for Disease Control and Prevention.
Competing interests None declared.
Patient and public involvement Patients and/or the public were not involved in the design, or conduct, or reporting, or dissemination plans of this research.
Patient consent for publication Not required.
Provenance and peer review Not commissioned; externally peer reviewed.
Data availability statement Data may be obtained from a third party and are not publicly available.
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.