Sensitivity Evaluation for Unobserved Confounding | by Ugur Yildirim

[ad_1]

Find out how to know the unknowable in observational research

Introduction
Drawback Setup
2.1. Causal Graph
2.2. Mannequin With and With out Z
2.3. Power of Z as a Confounder
Sensitivity Evaluation
3.1. Aim
3.2. Robustness Worth
PySensemakr
Conclusion
Acknowledgements
References

The specter of unobserved confounding (aka omitted variable bias) is a infamous downside in observational research. In most observational research, except we are able to moderately assume that therapy task is as-if random as in a pure experiment, we are able to by no means be actually sure that we managed for all attainable confounders in our mannequin. Consequently, our mannequin estimates could be severely biased if we fail to regulate for an essential confounder–and we wouldn’t even understand it for the reason that unobserved confounder is, effectively, unobserved!

Given this downside, you will need to assess how delicate our estimates are to attainable sources of unobserved confounding. In different phrases, it’s a useful train to ask ourselves: how a lot unobserved confounding would there need to be for our estimates to drastically change (e.g., therapy impact not statistically vital)? Sensitivity evaluation for unobserved confounding is an lively space of analysis, and there are a number of approaches to tackling this downside. On this publish, I’ll cowl a easy linear methodology [1] based mostly on the idea of partial R² that’s extensively relevant to a big spectrum of instances.

2.1. Causal Graph

Allow us to assume that we now have 4 variables:

Y: end result
D: therapy
X: noticed confounder(s)
Z: unobserved confounder(s)

This can be a widespread setting in lots of observational research the place the researcher is taken with understanding whether or not the therapy of curiosity has an impact on the end result after controlling for attainable treatment-outcome confounders.

In our hypothetical setting, the connection between these variables are such that X and Z each have an effect on D and Y, however D has no impact on Y. In different phrases, we’re describing a state of affairs the place the true therapy impact is null. As will change into clear within the subsequent part, the aim of sensitivity evaluation is having the ability to motive about this therapy impact when we now have no entry to Z, as we usually gained’t because it’s unobserved. Determine 1 visualizes our setup.

Determine 1: Drawback Setup

2.2. Mannequin With and With out Z

To show the issue that our unobserved Z could cause, I simulated some information in keeping with the issue setup described above. You’ll be able to consult with this pocket book for the main points of the simulation.

Since Z could be unobserved in actual life, the one mannequin we are able to usually match to information is Y~D+X. Allow us to see what outcomes we get if we run that regression.