Interpreting regression models in clinical outcome studies (2024)

Measuring the outcome of an intervention is central tothe practice of evidence based medicine, and most research papersevaluating patient outcomes now incorporate some form of patient-basedmetric, such as questionnaires or performance tests. Once an outcomehas been defined, researchers typically want to know if any otherfactors can influence the result. This is typically assessed with regressionanalysis.

Regression analysis1 determinesthe relationship of an independent variable (such as bone mineraldensity) on a dependent variable (such as ageing) with the statistical assumptionthat all other variables remain fixed. The calculation of the relationship resultsin a theoretical straight line, and the correlation co-efficient(r) measures how closely the observed data are to the theoreticalstraight line that we have calculated.

In such a linear model, we can judge how well the line fits thedata (‘goodness of fit’) by calculating the coefficient of determination (orsquare of the regression line, R2). R2 isa measure of the percentage of total variation in the dependantvariable that is accounted for by the independent variable. An R2 of1.0 indicates that the data perfectly fit the linear model. AnyR2 value less than 1.0 indicates that at least some variabilityin the data cannot be accounted for by the model (e.g., an R2 of0.5 indicates that 50% of the variability in the outcome data cannotbe explained by the model).

Given these statistical tools, we can use the regression equationto predict the value of the dependent variable based on the known valueof independent variable. Since many variables may contribute tothe outcome (dependent variable), further statistical analysis canbe achieved with multiple regression analysis. These models areessentially the same as simple regression analysis, except thatthe multiple regression analysis equation describes the interrelationshipof many variables and allows us to evaluate the joint effect ofthese variables on the outcome variable in question.

Poitras et al2 reportan interesting study this month that aims to predict length of stay andearly clinical function following joint arthroplasty. Multiple linearregression analyses produced an equation based on the timed-up-and-gotest, which was associated with length of stay. In addition, modelsbased on the pre-operative WOMAC function sub-score produced thebest model for describing early post-operative function (as calculated bythe Older American Resources and Services ALD score). As such theauthors were able to conclude that the outcomes assessments (timed-up-and-goand WOMAC) were predictive of outcome, and further modelling identifiedthresholds of the outcome assessment scores that related to betterand worse outcomes.

How should we interpret these findings? The authors quite correctlysuggest that models such as these could be of value in dischargeplanning and resource utilisation by targeting the patients thatmost need intervention and rehabilitation. The reported R2 forthe models, however, was 0.18. Bearing in mind that R2,the coefficient of determination, measures the percentage of thevariation in the dependent variable that is explained by variationin the independent variable,3 takingthe compliment (100 – R2) we see that 82% of the variationin the outcome parameter assessed is unexplained by the model. Theprincipal problem is that the variance in the population studiedcan strongly influence R2 magnitude. Therefore, thereis no guarantee that a high coefficient of determination is indicativeof ‘goodness of fit’. Similarly there is no guarantee that a smallR2 indicates a weak relationship, given that the statisticis largely influenced by variation in the independent variable.4

Therefore, there is no rule for interpreting the strength ofR2 in its application to clinical relevance. Useful high valuesof R2 can be obtained with clinical data sets,5 however, a low R2 canstill provide a useful clinical model with respect to data trends,but may be low in precision. In this study there is an associationbetween the performance tests and length of stay; and, using theequations, we can indeed predict one from the other. The accuracyof this prediction though, needs to be borne in mind when usingit as a clinical tool.

Furthermore, it is not rational to compare R2 acrossdifferent samples, which given clinical populations, are likelyto differ significantly in the variance of the independent and dependentvariables.6

In controlled environments, such as biomechanical tests on cadavericbones, the variance across predictive measurements is likely tobe low, and therefore R2 values can be expected to liein the 0.8 range.7 Inclinical studies, however, R2 values vary widely dependingon the nature of the analysis. For example, when comparing radiographicparameters or associating surgical technical factors, values ofR2 are reported in the 0.2 to 0.4 range.8,9 Whereas, comparing data between separate(but intrinsically similar) outcome assessment questionnaires can yieldhigher values in excess of 0.7.10

As such, further validation of the Poitras study2 using new datasetsand, ideally, confirmatory analysis of the findings using a muchlarger sample size, would be required before their regression modelcould be recommended for use clinically. This does not devalue the appropriateness– or indeed ‘worthiness’ – of reporting these findings in the literature,as the important clinical tools typically start as ideas in smalldatasets. As with all research papers, the reader requires a basicunderstanding of methodology to evaluate how relevant the results areto wider practice.

References

1. Draper NR, Smith HApplied regression analysis. Wiley-Interscience, 1998.

2. Poitras S, Wood KS, Savard J, Dervin GF, Beaule PE. Predicting early clinical function after hip or knee arthroplasty. Bone Joint Res2015;4:145–151. [PMC free article] [PubMed] [Google Scholar]

3. Schroeder LD, Sjoquist DL, Stephen PEUnderstanding regression analysis: an introductory guide. 1986, Sage Publications; Beverly Hills, California.

4. Filho DBF, Silva JA, Rocha E. What is R2 all about?Leviathan – Cadernos de Pesquisa Política2011;3:60–68. [Google Scholar]

5. Maempel JF, Clement ND, Brenkel IJ, Walmsley PJ. Validation of a prediction model that allows direct comparison of the Oxford Knee Score and American Knee Society clinical rating system. Bone Joint J2015;97-B:503–509. [PubMed] [Google Scholar]

6. Kennedy PA guide to econometrics. 2008, Wiley-Blackwell; San Francisco, California:27.

7. Eckstein F, Wunderer C, Boehm H, et al. Reproducibility and side differences of mechanical tests for determining the structural strength of the proximal femur. J Bone Miner Res2004;19:379–385. [PubMed] [Google Scholar]

8. Weber M, Lechler P, von Kunow F, et al. The validity of a novel radiological method for measuring femoral stem version on anteroposterior radiographs of the hip after total hip arthroplasty. Bone Joint J2015;97-B:306–311. [PubMed] [Google Scholar]

9. Kuwashima U, Okazaki K, Tashiro Y, et al. Correction of coronal alignment correlates with reconstruction of joint height in unicompartmental knee arthroplasty. Bone Joint Res2015;4:128–133. [PMC free article] [PubMed] [Google Scholar]

10. Parsons N, Griffin XL, Achten J, Costa ML. Outcome assessment after hip fracture: is EQ-5D the answer?Bone Joint Res2014;19;3:69–75. [PMC free article] [PubMed] [Google Scholar]

Interpreting regression models in clinical outcome
studies (2024)
Top Articles
Latest Posts
Article information

Author: Sen. Emmett Berge

Last Updated:

Views: 6307

Rating: 5 / 5 (80 voted)

Reviews: 87% of readers found this page helpful

Author information

Name: Sen. Emmett Berge

Birthday: 1993-06-17

Address: 787 Elvis Divide, Port Brice, OH 24507-6802

Phone: +9779049645255

Job: Senior Healthcare Specialist

Hobby: Cycling, Model building, Kitesurfing, Origami, Lapidary, Dance, Basketball

Introduction: My name is Sen. Emmett Berge, I am a funny, vast, charming, courageous, enthusiastic, jolly, famous person who loves writing and wants to share my knowledge and understanding with you.