Catalogue Search | MBRL

GEOMETRY OF THE FAITHFULNESS ASSUMPTION IN CAUSAL INFERENCE

by Uhler, Caroline , Yu, Bin , Raskutti, Garvesh in (strong) faithfulness , 14Q10 , 62H05

2013

Many algorithms for inferring causality rely heavily on the faithfulness assumption. The main justification for imposing this assumption is that the set of unfaithful distributions has Lebesgue measure zero, since it can be seen as a collection of hypersurfaces in a hypercube. However, due to sampling error the faithfulness condition alone is not sufficient for statistical estimation, and strong-faithfulness has been proposed and assumed to achieve uniform or high-dimensional consistency. In contrast to the plain faithfulness assumption, the set of distributions that is not strong-faithful has nonzero Lebesgue measure and in fact, can be surprisingly large as we show in this paper. We study the strong-faithfulness condition from a geometric and combinatorial point of view and give upper and lower bounds on the Lebesgue measure of strong-faithful distributions for various classes of directed acyclic graphs. Our results imply fundamental limitations for the PC-algorithm and potentially also for other algorithms based on partial correlation testing in the Gaussian case.

Journal Article

Share this book

Add to My Shelf

ESTIMATING HIGH-DIMENSIONAL INTERVENTION EFFECTS FROM OBSERVATIONAL DATA

by Maathuis, Marloes H. , Kalisch, Markus , Bühlmann, Peter in 62-09 , 62H99 , Absolute value

2009

We assume that we have observational data generated from an unknown underlying directed acyclic graph (DAG) model. A DAG is typically not identifiable from observational data, but it is possible to consistently estimate the equivalence class of a DAG. Moreover, for any given DAG, causal effects can be estimated using intervention calculus. In this paper, we combine these two parts. For each DAG in the estimated equivalence class, we use intervention calculus to estimate the causal effects of the covariates on the response. This yields a collection of estimated causal effects for each covariate. We show that the distinct values in this set can be consistently estimated by an algorithm that uses only local information of the graph. This local approach is computationally fast and feasible in high-dimensional problems. We propose to use summary measures of the set of possible causal effects to determine variable importance. In particular, we use the minimum absolute value of this set, since that is a lower bound on the size of the causal effect. We demonstrate the merits of our methods in a simulation study and on a data set about riboflavin production.

Journal Article

Share this book

Add to My Shelf

Exploring a Potential Interaction Between the Effect of Specific Maternal Smoking Patterns and Comorbid Antenatal Depression in Causing Postpartum Depression

by Barkin, Jennifer , Attia, John , Akin, Marshall in antenatal depression , Cigarettes , Comorbidity

2024

To explore a potential interaction between the effect of specific maternal smoking patterns and the presence of antenatal depression, as independent exposures, in causing postpartum depression (PPD). This case-control study of participants with singleton term births (N = 51220) was based on data from the 2017-2018 Pregnancy Risk Assessment Monitoring System. Multivariable log-binomial regression models examined the main effects of smoking patterns and self-reported symptoms of antenatal depression on the risk of PPD on the adjusted risk ratio (aRR) scale and tested a two-way interaction adjusting for covariates selected in a directed acyclic graph (DAG). The interaction effects were measured on the additive scale using relative excess risk due to interaction (RERI), the attributable proportion of interaction (AP), and the synergy index (SI). Causal effects were defined in a counterfactual framework. The E-value quantified the potential impact of unobserved/unknown covariates, conditional on observed covariates. Among 6841 women in the sample who self-reported PPD, 35.7% also reported symptoms of antenatal depression. Out of 3921 (7.7%) women who reported smoking during pregnancy, 32.6% smoked at high intensity (≥10 cigarettes/day) in all three trimesters and 36.6% had symptoms of antenatal depression. The main effect of PPD was the strongest for women who smoked at high intensity throughout pregnancy (aRR 1.65; 95% CI: 1.63, 1.68). A synergistic interaction was detected, and the effect of all maternal smoking patterns was augmented, particularly in late pregnancy for and . Strong associations and interaction effects between maternal smoking patterns and co-occurring antenatal depression support smoking prevention and cessation interventions during pregnancy to lower the likelihood of PPD.

Journal Article

Share this book

Add to My Shelf

Penalized likelihood methods for estimation of sparse high-dimensional directed acyclic graphs

by Shojaie, Ali , Michailidis, George in Adaptive lasso , Adjacency matrix , Causality

2010

Directed acyclic graphs are commonly used to represent causal relationships among random variables in graphical models. Applications of these models arise in the study of physical and biological systems where directed edges between nodes represent the influence of components of the system on each other. Estimation of directed graphs from observational data is computationally NP-hard. In addition, directed graphs with the same structure may be indistinguishable based on observations alone. When the nodes exhibit a natural ordering, the problem of estimating directed graphs reduces to the problem of estimating the structure of the network. In this paper, we propose an efficient penalized likelihood method for estimation of the adjacency matrix of directed acyclic graphs, when variables inherit a natural ordering. We study variable selection consistency of lasso and adaptive lasso penalties in high-dimensional sparse settings, and propose an error-based choice for selecting the tuning parameter. We show that although the lasso is only variable selection consistent under stringent conditions, the adaptive lasso can consistently estimate the true graph under the usual regularity assumptions.

Journal Article

Share this book

Add to My Shelf

Compositional Causal Identification from Imperfect or Disturbing Observations

by Spekkens, Robert W. , Kissinger, Aleks , Friend, Isaac in acyclic directed mixed graphs , Analysis , causal Bayesian networks

2025

The usual inputs for a causal identification task are a graph representing qualitative causal hypotheses and a joint probability distribution for some of the causal model’s variables when they are observed rather than intervened on. Alternatively, the available probabilities sometimes come from a combination of passive observations and controlled experiments. It also makes sense, however, to consider causal identification with data collected via schemes more generic than (perfect) passive observation or perfect controlled experiments. For example, observation procedures may be noisy, may disturb the variables, or may yield only coarse-grained specification of the variables’ values. In this work, we investigate identification of causal quantities when the probabilities available for inference are the probabilities of outcomes of these more generic schemes. Using process theories (aka symmetric monoidal categories), we formulate graphical causal models as second-order processes that respond to such data collection instruments. We pose the causal identification problem relative to arbitrary sets of available instruments. Perfect passive observation instruments—those that produce the usual observational probabilities used in causal inference—satisfy an abstract process-theoretic property called marginal informational completeness. This property also holds for other (sets of) instruments. The main finding is that in the case of Markovian models, as long as the available instruments satisfy this property, the probabilities they produce suffice for identification of interventional quantities, just as those produced by perfect passive observations do. This finding sharpens the distinction between the Markovianity of a causal model and that of a probability distribution, suggesting a more extensive line of investigation of causal inference within a process-theoretic framework.

Journal Article

Share this book

Add to My Shelf

Ancestral Graph Markov Models

by Richardson, Thomas , Spirtes, Peter in 60K99 , 62M45 , 68R10

2002

This paper introduces a class of graphical independence models that is closed under marginalization and conditioning but that contains all DAG independence models. This class of graphs, called maximal ancestral graphs, has two attractive features: there is at most one edge between each pair of vertices; every missing edge corresponds to an independence relation. These features lead to a simple parameterization of the corresponding set of distributions in the Gaussian case.

Journal Article

Share this book

Add to My Shelf

Conflict Diagnostics in Directed Acyclic Graphs, with Applications in Bayesian Evidence Synthesis

by Presanis, Anne M. , Ohlssen, David , Spiegelhalter, David J. in Age groups , Bayesian analysis , Children

2013

Complex stochastic models represented by directed acyclic graphs (DAGs) are increasingly employed to synthesise multiple, imperfect and disparate sources of evidence, to estimate quantities that are difficult to measure directly. The various data sources are dependent on shared parameters and hence have the potential to conflict with each other, as well as with the model. In a Bayesian framework, the model consists of three components: the prior distribution, the assumed form of the likelihood and structural assumptions. Any of these components may be incompatible with the observed data. The detection and quantification of such conflict and of data sources that are inconsistent with each other is therefore a crucial component of the model criticism process. We first review Bayesian model criticism, with a focus on conflict detection, before describing a general diagnostic for detecting and quantifying conflict between the evidence in different partitions of a DAG. The diagnostic is a p-value based on splitting the information contributing to inference about a \"separator\" node or group of nodes into two independent groups and testing whether the two groups result in the same inference about the separator node(s). We illustrate the method with three comprehensive examples: an evidence synthesis to estimate HIV prevalence; an evidence synthesis to estimate influenza case-severity; and a hierarchical growth model for rat weights.

Journal Article

Share this book

Add to My Shelf

CSVO: Clustered Sparse Voxel Octrees—A Hierarchical Data Structure for Geometry Representation of Voxelized 3D Scenes

by Chovancová, Eva , Ádám, Norbert , Madoš, Branislav in Algorithms , Computer graphics , Data compression

2022

When representing the geometry of voxelized three-dimensional scenes (especially if they have been voxelized to high resolutions) in a naive—uncompressed—form, one may end up using vast amounts of data. These can easily attack the available memory capacity of the graphics card, the operating memory or even secondary storage of computer. A viable solution to this problem is to use domain-specific hierarchical data structures, based on octant trees or directed acyclic graphs, which, among other advantages, provide a compact binary representation that can thus be considered to be their compressed encoding. These data structures include—inter alia—sparse voxel octrees, sparse voxel directed acyclic graphs and symmetry-aware sparse voxel directed acyclic graphs. The paper deals with the proposal of a new domain-specific hierarchical data structure: the clustered sparse voxel octrees. It is designed to represent the geometry of voxelized three-dimensional scenes and can be constructed using the out-of-core algorithm proposed in the paper. The advantage of the presented data structure is in its compact binary representation, achieved by omitting a significant number of pointers to child nodes (82.55% in case of Angel Lucy model in 1283 voxels resolution) and by using a wider range of child node pointer lengths, including 8b, 16b and 32b. We achieved from 6.57 to 6.82 times more compact encoding, compared to sparse voxel octrees, whose all node components were 32b aligned, and from 4.11 to 4.27 times more compact encoding, when not all node components were 32b aligned.

Journal Article

Share this book

Add to My Shelf

MARKOVIAN ACYCLIC DIRECTED MIXED GRAPHS FOR DISCRETE DATA

by Evans, Robin J. , Richardson, Thomas S. in 62M45 , Acyclic directed mixed graph , conditional independence

2014

Acyclic directed mixed graphs (ADMGs) are graphs that contain directed (→) and bidirected (↔) edges, subject to the constraint that there are no cycles of directed edges. Such graphs may be used to represent the conditional independence structure induced by a DAG model containing hidden variables on its observed margin. The Markovian model associated with an ADMG is simply the set of distributions obeying the global Markov property, given via a simple path criterion (m-separation). We first present a factorization criterion characterizing the Markovian model that generalizes the well-known recursive factorization for DAGs. For the case of finite discrete random variables, we also provide a parameterization of the model in terms of simple conditional probabilities, and characterize its variation dependence. We show that the induced models are smooth. Consequently, Markovian ADMG models for discrete variables are curved exponential families of distributions.

Journal Article

Share this book

Add to My Shelf

Variable selection in high-dimensional linear models: partially faithful distributions and the pc-simple algorithm

by Bühlmann, P. , Maathuis, M. H. , Kalisch, M. in Algorithms , Applications , Biology, psychology, social sciences

2010

We consider variable selection in high-dimensional linear models where the number of covariates greatly exceeds the sample size. We introduce the new concept of partial faithfulness and use it to infer associations between the covariates and the response. Under partial faithfulness, we develop a simplified version of the pc algorithm (Spirtes et al., 2000), which is computationally feasible even with thousands of covariates and provides consistent variable selection under conditions on the random design matrix that are of a different nature than coherence conditions for penalty-based approaches like the lasso. Simulations and application to real data show that our method is competitive compared to penalty-based approaches. We provide an efficient implementation of the algorithm in the R-package pcalg.

Journal Article

Share this book

Add to My Shelf

Language Selector

MBRLGlobalSearch

Language Selector

Catalogue Search | MBRL

Search Results Heading

Explore the vast range of titles available.

MBRLSearchResults

MBRLHappinessMeter