Conditionalisation Strategies and Valid Causal Inference
Nathaniel Forde
Data Science @ Personio
and Open Source Contributor @ PyMC
1/11/25
The Pitch
Valid causal claims are of paramount importance in science. Structural causal models impose extra-statistical constraints on the derivation of causal claims from data.
These constraints can be understood concretely with reference to the SEM literature. We outline the development of structural causal models in the Bayesian setting highlighting the use of the do-operator on generative model graph structures.
“Every proposition has a content and a form. We get the picture of the pure form if we abstract from the meaning of the single words, or symbols (so far as they have independent meanings)… By syntax in this general sense of the word I mean the rules which tell us in which connections only a word [makes] sense, thus excluding nonsensical structures.” - Wittgenstein Some Remarks on Logical Form
\[ \psi | \neg \psi | \psi \rightarrow \phi \]
Sets of Admissable Graphs and Well formed valid Fomulae
Abstract Conception
Non-Parametric Structural Diagrams
Non-parametric Structural Causal Models highlight the aspects of the Data Generating processes that threaten the valid construction of a causal claim.
\[ Y \sim f_{Y}(Z, X, U_{Y}) \]\[ Z \sim f_{Z}(X, IV, U_{Z})\]\[ X \sim f_{X}(U_{X})\]\[ IV \sim f_{IV}(U_{IV})\]
Concrete Conception
Parametric Approximation via Regression
\[ y \sim 1 + Z + u \] Regression Approximation to estimating valid coefficients in systems of simultaneous equation.
\[ y \sim 1 + \widehat{Z} + u \]\[ \widehat{Z} \sim 1 + IV + u \]
The treatment effect \(\color{blue}\beta_{01}\) of is the primary quantity of interest
\[ \color{purple} \Sigma = \begin{bmatrix}
1 & \color{purple} \sigma \\
\color{purple} \sigma & 1
\end{bmatrix}
\] But the estimation depends on the multivariate realisation of the data
Even the “simple” IV design is a structural causal model.
The crucial component is the covariance structure of the joint-distribution and the instrument’s theoretical validity
The Bayesian Estimation strategy estimates the IV model by seeking a parameterisation where the potential outcomes are conditionally exchangeable.
SEM model achieves better performance on the local and global model checks than the CFA model
The Do-Calculus and Sensitivity Analysis
The graph algebra of the do-calculus sets rules on admissable structures required to warrant valid causal claims when accounting for different species of confounding.
They complement the development and analysis of SEM models, providing minimalist admissablility conditions for a causal interpretation of these structural relations.
The substantive justification of the causal claims implicit in a SEM model need to be made more explicitly by the researcher with reference to evaluation of model fit to data by comparing to competing models.
What-if Structures and the Do-Operator
“[T]ypically the causal assumptions are less established, though they should be defensible and consistent with the current state of knowledge. The analysis is done under the speculation of “what if these causal assumptions were true.” These latter analyses are useful because there are often ways of testing the model, or parts of it. These tests can be helpful in rejecting one or more of the causal assumptions, thereby revealing flaws in specification. Of course, passing these tests does not prove the validity of the causal assumptions, but it lends credibility to them.”
Bollen and Pearl in Eight myths about causality and structural equation models
The Do-Operator uses graph mutilation techniques to interrogate the impact of different data generating conditions including the analysis of causal claims.
Graphs Mutilation in PyMC
## Model Estimated on Dataidata_sem, model_sem = make_sem()pm.model_to_graphviz(model_sem)
## The Do-Operator in Actionmodel_beta0 = do(model_sem, {"betas": [0, 0 , 0]}, prune_vars=True)pm.model_to_graphviz(model_beta0)
The plots are scaled identically here between -1 and 1. Highlighting significantly worse model fit under implausibly zero’d out beta coefficients.
Conclusion
“SEM is an inference engine that takes in two inputs, qualitative causal assumptions and empirical data, and derives two logical consequences of these inputs: quantitative causal conclusions and statistical measures of fit for the testable implications of the assumptions. Failure to fit the data casts doubt on the strong causal assumptions of zero coefficients or zero covariances and guides the researcher to diagnose, or repair the structural misspecifications. Fitting the data does not “prove” the causal assumptions, but it makes them tentatively more plausible. Any such positive results need to be replicated and to withstand the criticisms of researchers who suggest other models for the same data. ”
Bollen and Pearl in Eight myths about causality and structural equation models