About Us

We are CausalAI4Health (Causal Artificial Intelligence for Health), a research group pushing the frontiers of computational and statistical methods to uncover and quantify causal mechanisms in complex biomedical systems.

The group is led by Dr. Adèle Helena Ribeiro and is hosted by the Institute of Medical Informatics at the University of Münster, with support from the German Federal Ministry of Research, Technology and Space (BMFTR).

Our mission is to advance both the theoretical foundations and translational research of explainable AI and causal inference, enabling deeper scientific insight and more effective data-driven decisions in the life and health sciences. We aim to uncover why biological and health systems function as they do and how targeted interventions can influence their behavior.

Real-world biomedical datasets rarely satisfy standard methodological assumptions: they are often high-dimensional, heterogeneous, and multimodal, and can be affected by latent confounding, selection bias, privacy constraints, and limited sample sizes. If these challenges are not rigorously addressed, causal analyses risk producing invalid, non-reproducible, or non-generalizable results.

Our research tackles these challenges by developing methods that are both rigorous and effective in real-world settings, with a focus on:

Robust Causal Learning

Learn reliable causal relationships despite hidden confounders and limited samples.

Uncertainty Quantification

Provide statistically valid and trustworthy conclusions with uncertainty estimates.

Expert Knowledge Integration

Integrate expert knowledge while accounting for uncertainty and conflicting information.

Collaborative Analysis

Enable multi-institution research without sharing sensitive data.

High-Dimensional Scaling

Efficiently handle complex, multimodal datasets at scale.


Our ongoing applications include public health and clinical research in malaria, mental health, cardiovascular diseases, post acute infection syndromes including long COVID, and cancer. In each of these domains, causal insights have the potential to improve mechanistic understanding and inform targeted prevention, diagnosis, and treatment strategies.

By addressing core challenges in causal inference such as reliability, heterogeneity, privacy, knowledge integration, and scalability, our research contributes to the development of explainable, trustworthy, and actionable AI for biomedical discovery and precision health.

Our Team

Group Leader

Dr. Adèle Helena Ribeiro

Research focus: Advancing Causal and Explainable AI for Health and Life Sciences.

Ph.D. Students

Ph.D. Student

Maximilian Hahn, M.Sc.

Research focus: Advancing Privacy-Preserving, Collaborative Causal AI in Real-World Settings.
Co-advised with Prof. Dr. Dominik Heider, Institute of Medical Informatics, Universität Münster, Münster, Germany.

(External) Ph.D. Student

Azlaan Mustafa Samad, M.Sc.

Research focus: Advancing Causal Representation Learning and Causal Abstraction in the Health Sciences.
Co-advised with Prof. Dr. Wolfgang Nejdl, L3S Research Center, CAIMed (Lower Saxony Research Center for AI and Causal Methods in Medicine) & Leibniz Universität Hannover, Hanover, Germany.

(External) Ph.D. Student

Matheus Becali Rocha, M.Sc.

Research focus: Causal Discovery for More Interpretable and Generalizable Predictions.
Co-advised with Prof. Dr. Renato Kroling, Department of Informatics, Federal University of Espírito Santo, Brazil.

Master Students

(External) Master Student

Aaron Zumdick

Research focus: Enabling Causal Discovery from Complex, Heterogeneous, and Non-IID Data.
Co-advised with Prof. Dr. Peter Florian Stadler, Interdisciplinary Centre for Bioinformatics, Universität Leipzig, Leipzig, Germany.

(External) Master Student

Bárbara Alexandra Alves Sequeira

Research focus: Exploring Causal Discovery in Catalysis under Limited and Confounded Datasets.
Co-advised with Prof. Dr. Pedro Freitas Mendes, Department of Chemical Engineering, Instituto Superior Técnico, Lisbon, Portugal.

Selected Research Projects

Method Development

Uncertainty-Quantified Causal Discovery

Ensuring Robustness Under Limited Data and Potential Violations of Faithfulness

We develop robust causal discovery methods that address latent confounding, mixed variable types, and potential faithfulness violations, while quantifying structural uncertainty to support reliable downstream analyses such as effect estimation and intervention planning.

Data-Compatible Fast Causal Inference (dcFCI)

dcFCI (Ribeiro & Heider, forthcoming) is a robust causal discovery algorithm that combines FCI-guided search with score-based evaluation. It rigorously handles latent confounders and heterogeneous variable types, resolves conflicts in edge orientation, and guarantees that only valid PAGs are produced. Additionally, it ranks alternative causal structures to provide a measure of structural uncertainty. The dcFCI R package is publicly available on GitHub.

Expert-Guided Causal Discovery

Integrating Expert Priors and Feedback Into Data-Driven Learning

We develop methods that integrate expert knowledge into data-driven causal discovery, enhancing the reliability and validity of learned models. By incorporating expert insight directly into the learning process, our algorithms extract patterns from data while remaining anchored in domain expertise.

Anchor Fast Causal Inference (anchorFCI)

anchorFCI (Ribeiro et al., 2025) extends the conservative FCI algorithm by selecting reliable anchor variables (those known not to be caused by others) and encoding their non-ancestral relationships. This is particularly useful for uncovering causal relationships among demographic or clinical traits using genetic variants as anchors. The anchorFCI R package is available on GitHub.

Ancestral GFlowNet (AGFN)

AGFN (da Silva et al., ArXiv, forthcoming) is a a probabilistic, uncertainty-aware framework for causal discovery, equipped with an optimal elicitation strategy to guide expert interaction. It infers a data-driven distribution over ancestral graphs and iteratively refines it through (potentially noisy) expert feedback. The AGFN Python package is available on GitHub.

Abtraction-Aware Causal Inference

Enhancing Scalabiliy and Interpretability Through Variable Clustering

In high-dimensional and complex settings, it is often more informative to represent causal models at a higher level of abstraction than the individual variables themselves. By grouping related variables into clusters, we can simplify complex systems, highlight key causal relationships, and make inference more tractable.

Cluster Causal Diagrams (C-DAGs)

C-DAGs (Anand, et al. AAAI 2023) is a graphical framework for causal reasoning at a higher level of abstraction, enabling reliable causal inferences between clusters of variables without specifying the relationships within each cluster. Beyond formally defining C-DAGs, we have developed sound and complete methods for causal inference in this framework, supporting both interventional and counterfactual queries. Recently (Yvernes, et al., NeurIPS 2025), we extended the C-DAG framework to support arbitrary variable clusterings by relaxing the partition admissibility constraint, thereby allowing cyclic C-DAG representations.

Causal Learning Over Clusters (CLOC)

CLOC (Anand, et. al, NeurIPS 2025) is a causal discovery algorithm designed to uncover relationships between clusters of variables in high-dimensional systems. It leverages a novel graphical framework to encode and learn cluster-level dependencies and independencies in Markov causal systems. The algorithm is sound and complete, providing a reliable representation of learnable causal relationships between clusters. The CLOC implementation in R is available on GitHub.

Fully Data-Driven Causal Inference in Semi-Markovian Systems

From Causal Discovery to Effect Identification and Estimation under Latent Confounding

In general, a causal model cannot be uniquely determined from observational data alone. However, causal discovery algorithms can partially recover it, defining a set of plausible models known as the Markov Equivalence Class (MEC). In Semi-Markovian Systems, where hidden confounders may exist, such a class is typically represented as a Partial Ancestral Graph (PAG).

Conditional Causal Effect Identification from PAGs (CIDP)

We developed the first sound and complete calculus and algorithms for identifying (conditional) causal effects from PAGs (Jaber et al., NeurIPS 2022). Combined with causal discovery, this yields the first fully data-driven approach to causal inference, uncovering all identifiable effects directly from observational data. The CIDP algorithm is implemented in the PAGId R package on GitHub.

Biomedical Applications

Deciphering the Multifaceted Causes of Malaria Risk in Amazonian Communities

A Collaborative Approach Incorporating AI and Causality

Malaria remains a major health challenge, particularly in regions facing poverty, limited healthcare access, and harsh environments, such as the Amazon rainforest.

By combining AI and causal inference, we can significantly advance malaria research by identifying context-specific risk factors, uncovering underlying causal mechanisms, and guiding more effective, targeted interventions. (Ribeiro, et al. 2025 Front. Genet.).

Supported by BMFTR, this project leverages, in collaboration with the University of São Paulo, the Mâncio Lima cohort (Johansen, et al., 2021), which includes rich demographic, phenotypic, and genetic data from about 20% of households in Brazil’s main urban malaria hotspot, together with national surveillance data from SIVEP-Malaria, covering most symptomatic cases across the country.

Unraveling the Causal Role of the Gut Microbiome in Mental Health

Evidence for a Causal Contribution of Gut-Microbiota to Major Depressive Disorder in Humans

Understanding the causal influence of the gut microbiome on mental health is crucial for uncovering the biological pathways that connect the gut and the brain. By identifying potential causal taxa from observational data, researchers can better prioritize targets for experimental validation and develop more effective, personalized mental health interventions.

Using data-driven causal inference, we identified Eggerthella and Hungatella as causal contributors to major depressive disorder (MDD), acting through two distinct gut–brain pathways independent of body mass index (Fehse et al., forthcoming).

A follow-up study (Thanarajah, et al., 2025, JAMA Psychiatry), shows that soft drink consumption may contribute to MDD through gut microbiota alterations, notably involving Eggerthella.

Publications

Causal discovery is central to inferring causal relationships from observational data. In the presence of latent confounding, algorithms such as Fast Causal Inference (FCI) learn a Partial Ancestral Graph (PAG) representing the true model's Markov Equivalence Class. However, their correctness critically depends on empirical faithfulness, the assumption that observed (in)dependencies perfectly reflect those of the underlying causal model, which often fails in practice due to limited sample sizes. To address this, we introduce the first nonparametric score to assess a PAG's compatibility with observed data, even with mixed variable types. This score is both necessary and sufficient to characterize structural uncertainty and distinguish between distinct PAGs. We then propose data-compatible FCI (dcFCI), the first hybrid causal discovery algorithm to jointly address latent confounding, empirical unfaithfulness, and mixed data types. dcFCI integrates our score into an (Anytime)FCI-guided search that systematically explores, ranks, and validates candidate PAGs. Experiments on synthetic and real-world scenarios demonstrate that dcFCI significantly outperforms state-of-the-art methods, often recovering the true PAG even in small and heterogeneous datasets. Examining top-ranked PAGs further provides valuable insights into structural uncertainty, supporting more robust and informed causal reasoning and decision-making.
Causal discovery methods are powerful tools for uncovering the structure of relationships among variables, yet they face significant challenges in scalability and interpretability, especially in high-dimensional settings. In many domains, researchers are not only interested in causal links between individual variables, but also in relationships among sets or clusters of variables. Learning causal structure at the cluster level can both reveal higher-order relationships of interest and improve scalability. In this work, we introduce an approach for causal discovery over clusters in Markov causal systems. We propose a new graphical model that encodes knowledge of relationships between user-defined clusters while fully representing independencies and dependencies over clusters, faithful to a given distribution. We then define and characterize a graphical equivalence class of these models that share cluster-level independence information. Lastly, we present a sound and complete algorithm for causal discovery to represent learnable causal relationships between clusters of variables.
Cluster DAGs (C-DAGs) provide an abstraction of causal graphs in which nodes represent clusters of variables, and edges encode both cluster-level causal relationships and dependencies arisen from unobserved confounding. C-DAGs define an equivalence class of acyclic causal graphs that agree on cluster-level relationships, enabling causal reasoning at a higher level of abstraction. However, when the chosen clustering induces cycles in the resulting C-DAG, the partition is deemed inadmissible under conventional C-DAG semantics. In this work, we extend the C-DAG framework to support arbitrary variable clusterings by relaxing the partition admissibility constraint, thereby allowing cyclic C-DAG representations. We extend the notions of d-separation and causal calculus to this setting, significantly broadening the scope of causal reasoning across clusters and enabling the application of C-DAGs in previously intractable scenarios. Our calculus is both sound and atomically complete with respect to the do-calculus: all valid interventional queries at the cluster level can be derived using our rules, each corresponding to a primitive do-calculus step.
Soft drink consumption is linked to negative physical and mental health outcomes, but its association with major depressive disorder (MDD) and the underlying mechanisms remains unclear. To examine the association between soft drink consumption and MDD diagnosis and severity and whether this association is mediated by changes in the gut microbiota, particularly Eggerthella and Hungatella abundance. This multicenter cohort study was conducted in Germany using cross-sectional data from the Marburg-Münster Affective Cohort. Patients with MDD and healthy controls (aged 18-65 years) recruited from the general population and primary care between September 2014 and September 2018 were analyzed. Data analyses were conducted between May and December 2024. Primary analyses included multivariable regression and analysis of variance (ANOVA) models examining the association between soft drink consumption and MDD diagnosis and symptom severity, controlling for site and education, and Eggerthella and Hungatella abundance, controlling for site, education, and library size. Mediation analyses tested whether microbiota abundance mediated the soft drink–MDD link. It was found that soft drink consumption may contribute to MDD through gut microbiota alterations, notably involving Eggerthella. Public health strategies to reduce soft drink intake may help mitigate depression risk, especially among vulnerable populations; in addition, interventions for depression targeting the microbiome composition appear promising.
With an estimated 263 million cases recorded worldwide in 2023, malaria remains a major global health challenge, particularly in tropical regions with limited healthcare access. Beyond its health impact, malaria disrupts education, economic development, and social equality. While traditional research has focused on biological factors underlying human-mosquito interactions, growing evidence highlights the complex interplay of environmental, behavioral, and socioeconomic factors, alongside mobility and both human and parasite genetics, in shaping transmission dynamics, recurrence patterns, and control effectiveness. This work shows how integrating Artificial Intelligence (AI), Machine Learning (ML), and Causal Inference can advance malaria research by identifying context-specific risk factors, uncovering causal mechanisms, and informing more effective, targeted interventions. Drawing on the Mâncio Lima cohort, a longitudinal, multimodal study of malaria risk in Brazil’s main urban hotspot, and related studies in the Amazon, we highlight how rigorous, data-driven approaches can address the substantial variability in malaria risk across individuals and communities. AI-driven methods facilitate the integration of diverse high-dimensional datasets to uncover intricate patterns and improve individual risk stratification. Federated learning enables collaborative analysis across regions while preserving data privacy. Meanwhile, causal discovery and effect identification tools further strengthen these approaches by distinguishing genuine causal relationships from spurious associations. Together, these approaches offer a principled, scalable, and privacy-preserving framework that enables researchers to move beyond predictive modeling toward actionable causal insights. This shift supports precision public health strategies tailored to vulnerable populations, fostering more equitable and sustainable malaria control and contributing to the reduction of the global malaria burden.
Major Depressive Disorder (MDD) is a highly prevalent, severe mental health condition that constitutes one of the leading causes of disability worldwide. While recent animal studies suggest a causal role of the gut microbiome in the pathophysiology of MDD models, evidence in humans is still unclear due to small sample sizes, inconsistent clinical assessment of MDD diagnosis, and methodological limitations regarding causal inference in cross-sectional data. Here, we explicitly address these shortcomings to investigate the potential causal link between the gut microbiome and MDD: First, we replicate previous findings using one of the largest multicenter MDD cohorts for which microbiome data and in-depth diagnostic assessment are available (N=1,269 MDD patients and controls). We find a significant difference between healthy controls and MDD patients for the relative abundance of the four taxa Eggerthella, Hungatella, Coprobacillus, and Lachnospiraceae FCS020. Second, we employ state-of-the-art, fully data-driven causal inference tools within Judea Pearl’s framework, allowing us to derive model constraints from the data rather than relying on potentially strong, unrealistic assumptions. Using this approach, we found data-driven evidence for Eggerthella and Hungatella as causal contributors to MDD. Furthermore, we show that Eggerthella and Hungatella abundances are associated with MDD beyond the influence of body mass index, identifying two distinct pathways linking MDD to the gut microbiome. Finally, the difference in relative abundance of these taxa between healthy and MDD patients was independent of antidepressant medication. Our study provides the first evidence for a potential causal role of gut-microbiota in the pathophysiology of depression in humans.
Structure learning is the crux of causal inference. Notably, causal discovery (CD) algorithms are brittle when data is scarce, possibly inferring imprecise causal relations that contradict expert knowledge -- especially when considering latent confounders. To aggravate the issue, most CD methods do not provide uncertainty estimates, making it hard for users to interpret results and improve the inference process. Surprisingly, while CD is a human-centered affair, no works have focused on building methods that both 1) output uncertainty estimates that can be verified by experts and 2) interact with those experts to iteratively refine CD. To solve these issues, we start by proposing to sample (causal) ancestral graphs proportionally to a belief distribution based on a score function, such as the Bayesian information criterion (BIC), using generative flow networks. Then, we leverage the diversity in candidate graphs and introduce an optimal experimental design to iteratively probe the expert about the relations among variables, effectively reducing the uncertainty of our belief over ancestral graphs. Finally, we update our samples to incorporate human feedback via importance sampling. Importantly, our method does not require causal sufficiency (i.e., unobserved confounders may exist). Experiments with synthetic observational data show that our method can accurately sample from distributions over ancestral graphs and that we can greatly improve inference quality with human aid.
In data-scarce situations, causal discovery (CD) algorithms often produce unreliable causal relationships that may conflict with expert knowledge, especially in the presence of latent confounders. Additionally, most CD methods lack adequate uncertainty quantification, hindering users' ability to evaluate and refine results. To address these issues, we present a fully probabilistic CD method referred to as Ancestral GFlowNets (AGFNs). In a nutshell, AGFNs sample ancestral graphs (AGs) proportionally to a score-based belief distribution, allowing users to assess % and propagate the uncertainty of the discovered causal relationships. On top of that, we design an elicitation framework that enables the incorporation of human knowledge into the inference process via importance sampling. Notably, our approach naturally accommodates CD on data sets with latent confounding and potentially heterogeneous data types, a setting that has received little attention from the literature. Finally, experimental results with observational data show that our method effectively samples from distributions over AGs and significantly enhances inference quality with human aid.
Cardiometabolic diseases, a leading global health concern, arise from a complex interplay of lifestyle choices, genetic predispositions, and biochemical markers. Although extensive research has uncovered strong associations among various risk factors and these diseases, grasping their causal relationships is vital for gaining deeper mechanistic insights and designing effective prevention and intervention strategies. We address this gap by introducing anchorFCI, a novel adaptation of the conservative Really Fast Causal Inference (RFCI) algorithm designed to enhance the discovery of causal relationships by strategically selecting and integrating reliable anchor variables from an additional set known not to be caused by the variables of interest. This approach is particularly well-suited for learning causal networks involving phenotypic, clinical, and sociodemographic factors, leveraging genetic variables recognized as not being influenced by these factors. By integrating these anchor variables along with knowledge of their non-ancestral relationships, anchorFCI effectively handles latent confounding while enhancing both robustness and discovery power. We demonstrate its effectiveness using data from the 2015 ISA-Nutrition study in São Paulo, Brazil, and further estimate the effect sizes of the uncovered causal relationships with state-of-the-art tools from Judea Pearl's framework, presenting a fully data-driven causal inference pipeline. The results not only support many established causal relationships but also elucidate their interconnections within a complex network, enhancing our understanding of the broader dynamics and the multifaceted nature of cardiometabolic risk.
To investigate associations between Single Nucleotide Polymorphisms (SNPs) in the TAS1R and TAS2R taste receptors and diet quality, intake of alcohol, added sugar, and fat, using linear regression and machine learning techniques in a highly admixed population. In the ISA-Capital health survey, 901 individuals were interviewed and had socioeconomic, demographic, health characteristics, along with dietary information obtained through two 24-h recalls. Data on 12 components related to food groups, nutrients, and calories was combined into a diet quality score (BHEI-R). BHEI-R, SoFAAs (calories from added sugar, saturated fat, and alcohol) and Alcohol use were tested for associations with 255 TAS2R SNPs and 73 TAS1R SNPs for 637 individuals with regression analysis and Random Forest. Significant SNPs were combined into Genetic taste scores (GTSs). Among 23 SNPs significantly associated either by stepwise linear/logistic regression or random forest with any possible biological functionality, the missense variants rs149217752 in TAS2R40, for SoFAAs, and rs2233997 in TAS2R4, were associated with both BHEI-R (under 4% increase in Mean Squared Error) and SoFAAs. GTSs increased the variance explanation of quantitative phenotypes and there was a moderately high AUC for alcohol use. The study provides insights into the genetic basis of human taste perception through the identification of missense variants in the TAS2R gene family. These findings may contribute to future strategies in precision nutrition aimed at improving food quality by reducing added sugar, saturated fat, and alcohol intake.
The electrocardiogram (ECG) serves as a valuable diagnostic tool, providing crucial information about life-threatening cardiac conditions such as atrial fibrillation and myocardial infarction. A prompt and efficient assessment of ECG exams in environments such as Emergency Rooms (ERs) can significantly enhance the chances of survival for high-risk patients. Despite the presence of numerous works on ECG classification, most of these studies have concentrated on one-dimensional ECG signals, which are commonly found in publicly available ECG datasets. Nevertheless, the practical relevance of such methods is limited in hospital settings, where ECG exams are usually stored as images. In this study, we have developed an artificial intelligence-driven screening system specifically designed to analyze 12-lead ECG images. Our proposed method has been trained on an extensive dataset comprising 99,746 12-lead ECG exams collected from the ambulatory section of a tertiary hospital. The primary goal was to precisely classify the exams into three classes: Normal (N), Atrial Fibrillation (AFib), and Other (O). The evaluation of our approach yielded AUROC scores of 93.2%, 99.2%, and 93.1% for N, AFib, and O, respectively. To further validate our approach, we conducted evaluations using the 2018 China Physiological Signal Challenge (CPSC) database. In this evaluation, we achieved AUROC scores of 91.8%, 97.5%, and 70.4% for the classes N, AFib, and O, respectively. Additionally, we assessed our method using 1,074 exams acquired in the ER and obtained AUROC values of 98.3%, 98.0%, and 97.7% for the classes N, AFib, and O, respectively. Furthermore, we developed and deployed a system with a trained model within the ER of a tertiary hospital for research purposes. This system automatically retrieves newly captured ECG chart images from the Picture Archiving and Communication System (PACS) within the ER. These images undergo necessary preprocessing steps and serve as input for our proposed classification method. This comprehensive approach established an efficient and versatile end-to-end framework for ECG classification. The results of our study highlight the potential of leveraging artificial intelligence in the screening of ECG exams, offering a promising solution for the rapid assessment and prioritization of patients in the ER.
Artificial intelligence (AI) and data sharing go hand in hand. In order to develop powerful AI models for medical and health applications, data need to be collected and brought together over multiple centers. However, due to various reasons, including data privacy, not all data can be made publicly available or shared with other parties. Federated and swarm learning can help in these scenarios. However, in the private sector, such as between companies, the incentive is limited, as the resulting AI models would be available for all partners irrespective of their individual contribution, including the amount of data provided by each party. Here, we explore a potential solution to this challenge as a viewpoint, aiming to establish a fairer approach that encourages companies to engage in collaborative data analysis and AI modeling. Within the proposed approach, each individual participant could gain a model commensurate with their respective data contribution, ultimately leading to better diagnostic tools for all participants in a fair manner.
Reasoning about the effect of interventions and counterfactuals is a fundamental task found throughout the data sciences. A collection of principles, algorithms, and tools has been developed for performing such tasks in the last decades (Pearl, 2000). One of the pervasive requirements found throughout this literature is the articulation of assumptions, which commonly appear in the form of causal diagrams. Despite the power of this approach, there are significant settings where the knowledge necessary to specify a causal diagram over all variables is not available, particularly in complex, high-dimensional domains. In this paper, we introduce a new graphical modeling tool called cluster DAGs (for short, CDAGs) that allows for the partial specification of relationships among variables based on limited prior knowledge, alleviating the stringent requirement of specifying a full causal diagram. A C-DAG specifies relationships between clusters of variables, while the relationships between the variables within a cluster are left unspecified, and can be seen as a graphical representation of an equivalence class of causal diagrams that share the relationships among the clusters. We develop the foundations and machinery for valid inferences over C-DAGs about the clusters of variables at each layer of Pearl’s Causal Hierarchy (Pearl and Mackenzie 2018; Bareinboim et al. 2020) - L1 (probabilistic), L2 (interventional), and L3 (counterfactual). In particular, we prove the soundness and completeness of d-separation for probabilistic inference in C-DAGs. Further, we demonstrate the validity of Pearl’s do-calculus rules over C-DAGs and show that the standard ID identification algorithm is sound and complete to systematically compute causal effects from observational data given a C-DAG. Finally, we show that C-DAGs are valid for performing counterfactual inferences about clusters of variables.
Both of the fields of continual learning and causality investigate complementary aspects of human cognition and are fundamental components of artificial intelligence if it is to reason and generalize in complex environments. Despite the burgeoning interest in investigating the intersection of the two fields, it is currently unclear how causal models may describe continuous streams of data and vice versa, how continual learning may exploit learned causal structure. We proposed to bridge this gap through the inaugural AAAI-23 “Continual Causality” bridge program, where our aim was to take the initial steps towards a unified treatment of these fields by providing a space for learning, discussions, and to build a diverse community to connect researchers. The activities ranged from traditional tutorials and software labs, invited vision talks, and contributed talks based on submitted position papers, as well as a panel and breakout discussions. Whereas materials are publicly disseminated as a foundation for the community: https://www.continualcausality.org, respectively discussed ideas, challenges, and prospects beyond the inaugural bridge are summarized in this retrospective paper.
One common task in many data sciences applications is to answer questions about the effect of new interventions, like: what would happen to Y if we make X equal to x while observing covariates Z = z?. Formally, this is known as conditional effect identification, where the goal is to determine whether a post-interventional distribution is computable from the combination of an observational distribution and assumptions about the underlying domain represented by a causal diagram. A plethora of methods was developed for solving this problem, including the celebrated do-calculus [Pearl, 1995]. In practice, these results are not always applicable since they require a fully specified causal diagram as input, which is usually not available. In this paper, we assume as the input of the task a less informative structure known as a partial ancestral graph (PAG), which represents a Markov equivalence class of causal diagrams, learnable from observational data. We make the following contributions under this relaxed setting. First, we introduce a new causal calculus, which subsumes the current state-of-the-art, PAG-calculus. Second, we develop an algorithm for conditional effect identification given a PAG and prove it to be both sound and complete. In words, failure of the algorithm to identify a certain effect implies that this effect is not identifiable by any method. Third, we prove the proposed calculus to be complete for the same task.
Atrial fibrillation (AF) is a common arrhythmia (0.5% worldwide prevalence) associated with an increased risk of various cardiovascular disorders, including stroke. Automated routine AF detection by Electrocardiogram (ECG) is based on the analysis of one-dimensional ECG signals and requires dedicated software for each type of device, limiting its wide use, especially with the rapid incorporation of telemedicine into the healthcare system. Here, we implement a machine learning method for AF classification using the region of interest (ROI) corresponding to the long DII lead automatically extracted from DI-COM 12-lead ECG images. We observed 94.3%, 98.9%, 99.1%, and 92.2% for sensitivity, specificity, AUC, and F1 score, respectively. These results indicate that the proposed methodology performs similar to one-dimensional ECG signals as input, but does not require a dedicated software facilitating the integration into clinical practice, as ECGs are typically stored in PACS as 2D images.
Graphs/networks have become a powerful analytical approach for data modeling. Besides, with the advances in sensor technology, dynamic time-evolving data have become more common. In this context, one point of interest is a better understanding of the information flow within and between networks. Thus, we aim to infer Granger causality (G-causality) between networks' time series. In this case, the straightforward application of the well-established vector autoregressive model is not feasible. Consequently, we require a theoretical framework for modeling time-varying graphs. One possibility would be to consider a mathematical graph model with time-varying parameters (assumed to be random variables) that generates the network. Suppose we identify G-causality between the graph models' parameters. In that case, we could use it to define a G-causality between graphs. Here, we show that even if the model is unknown, the spectral radius is a reasonable estimate of some random graph model parameters. We illustrate our proposal's application to study the relationship between brain hemispheres of controls and children diagnosed with Autism Spectrum Disorder (ASD). We show that the G-causality intensity from the brain's right to the left hemisphere is different between ASD and controls.
Many challenging problems in biomedical research rely on understanding how variables are associated with each other and influenced by genetic and environmental factors. Probabilistic graphical models (PGMs) are widely acknowledged as a very natural and formal language to describe relationships among variables and have been extensively used for studying complex diseases and traits. In this work, we propose methods that leverage observational Gaussian family data for learning a decomposition of undirected and directed acyclic PGMs according to the influence of genetic and environmental factors. Many structure learning algorithms are strongly based on a conditional independence test. For independent measurements of normally distributed variables, conditional independence can be tested through standard tests for zero partial correlation. In family data, the assumption of independent measurements does not hold since related individuals are correlated due to mainly genetic factors. Based on univariate polygenic linear mixed models, we propose tests that account for the familial dependence structure and allow us to assess the significance of the partial correlation due to genetic (between-family) factors and due to other factors, denoted here as environmental (within-family) factors, separately. Then, we extend standard structure learning algorithms, including the IC/PC and the really fast causal inference (RFCI) algorithms, to Gaussian family data. The algorithms learn the most likely PGM and its decomposition into two components, one explained by genetic factors and the other by environmental factors. The proposed methods are evaluated by simulation studies and applied to the Genetic Analysis Workshop 13 simulated dataset, which captures significant features of the Framingham Heart Study.
Faced with the lack of reliability and reproducibility in omics studies, more careful and robust methods are needed to overcome the existing challenges in the multi-omics analysis. In conventional omics data analysis, signal intensity values (denoted by M and values) are estimated neglecting pixel-level uncertainties, which may reflect noise and systematic artifacts. For example, intensity values from two-color microarray data are estimated by taking the mean or median of the pixel intensities within the spot and then subjected to a within-slide normalization by LOWESS. Thus, focusing on estimation and normalization of gene expression profiles, we propose a spot quantification method that takes into account pixel-level variability. Also, to preserve relevant variation that may be removed in LOWESS normalization with poorly chosen parameters, we propose a parameter selection method that is parsimonious and considers intrinsic characteristics of microarray data, such as heteroskedasticity. The usefulness of the proposed methods is illustrated by an application to real intestinal metaplasia data. Compared with the conventional approaches, the analysis is more robust and conservative, identifying fewer but more reliable differentially expressed genes. Also, the variability preservation allowed the identification of new differentially expressed genes. Using the proposed approach, we have identified differentially expressed genes involved in pathways in cancer and confirmed some molecular markers already reported in the literature.
Causal inference may help us to understand the underlying mechanisms and the risk factors of diseases. In Genetics, it is crucial to understand how the connectivity among variables is influenced by genetic and environmental factors. Family data have proven to be useful in elucidating genetic and environmental influences, however, few existing approaches are able of addressing structure learning of probabilistic graphical models (PGMs) and family data analysis jointly. We propose methodologies for learning, from observational Gaussian family data, the most likely PGM and its decomposition into genetic and environmental components. They were evaluated by a simulation study and applied to the Genetic Analysis Workshop 13 simulated data, which mimic the real Framingham Heart Study data, and to the metabolic syndrome phenotypes from the Baependi Heart Study. In neuroscience, one challenge consists in identifying interactions between functional brain networks (FBNs) - graphs. We propose a method to identify Granger causality among FBNs. We show the statistical power of the proposed method by simulations and its usefulness by two applications: the identification of Granger causality between the FBNs of two musicians playing a violin duo, and the identification of a differential connectivity from the right to the left brain hemispheres of autistic subjects.
Blood pressure (BP) is associated with carotid intima-media thickness (CIMT), but few studies have explored the association between BP variability and CIMT. We aimed to investigate this association in the Brazilian Longitudinal Study of Adult Health (ELSA-Brasil) baseline. We found a small but significant association between SBP variability and CIMT values. This was additive to the association between SBP central tendency and CIMT values, supporting a role for high short-term SBP variability in atherosclerosis.
A major challenge in biomedical research is to identify causal relationships among genotypes, phenotypes, and clinical outcomes from high-dimensional measurements. Causal networks have been widely used in systems genetics for modeling gene regulatory systems and for identifying causes and risk factors of diseases. In this chapter, we describe fundamental concepts and algorithms for constructing causal networks from observational data. In biological context, causal inferences can be drawn from the natural experimental setting provided by Mendelian randomization, a term that refers to the random assignment of genotypes at meiosis. We show that genetic variants may serve as instrumental variables, improving estimation accuracy of the causal effects. In addition, identifiability issues that commonly arise when learning network structures may be overcome by using prior information on genotype–phenotype relations.
Any measurement, since it is made for a real instrument, has an uncertainty associated with it. In the present work, we address this issue of uncertainty in two-channel cDNA Microarray experiments, a technology that has been widely used in recent years and is still an important tool for gene expression studies. Tens of thousands of gene representatives are printed onto a glass slide and hybridized simultaneously with mRNA from two different cell samples. Different fluorescent dyes are used for labeling both samples. After hybridization, the glass slide is scanned yielding two images. Image processing and analysis programs are used for spot segmentation and pixel statistics computation, for instance, the mean, median and variance of pixel intensities for each spot. The same statistics are computed for the pixel intensities in the background region. Statistical estimators such as the variance gives us an estimate of the accuracy of a measurement. Based on the intensity estimates for each spot, some data transformations are applied in order to eliminate systematic variability so we can obtain the effective gene expression. This paper shows how to analyze gene expression measurements with an estimated error. We presented an estimate of this uncertainty and we studied, in terms of error propagation, the effects of some data transformations. An example of data transformation is the correction of the bias estimated by a robust local regression method, also known as lowess. With the propagated errors obtained, we also showed how to use them for detecting differentially expressed genes between different conditions. Finally, we compared the results with those obtained by classical analysis methods, in which the measurement errors are disregarded. We conclude that modeling the measurements uncertainties can improve the analysis, since the results obtained in a real gene expressions data base were consistent with the literature.

Open-Source Libraries

This package provides an implementation of the dcFCI algorithm. Technical details are provided in the paper by Ribeiro & Heider (2025), entitled "dcFCI: Robust Causal Discovery Under Latent Confounding, Unfaithfulness, and Mixed Data", available at arXiv preprint arXiv:2505.06542.
This package provides an implementation of the anchorFCI algorithm. Technical details are provided in the paper by Ribeiro et al. (2024), entitled "AnchorFCI: harnessing genetic anchors for enhanced causal discovery of cardiometabolic disease pathways", available at doi: 10.3389/fgene.2024.1436947. AnchorFCI is an extension of the FCI algorithm designed to improve robustness and discovery power in causal discovery by strategically selecting reliable anchors, while leveraging their known non-ancestral relationships. It operates on two sets of variables: the first set contains the variables of interest, while the second comprises variables that are not caused by any from the first. While this structure is beneficial for various applications, it is especially well-suited for datasets involving phenotypic, clinical, and sociodemographic variables (the first set), alongside genetic variables, such as SNPs (the second set), which are recognized as not being caused by the first set.
This package implements the CIDP and IDP algorithms for identifying (conditional) causal effects from a Partial Ancentral Graph (PAG). Technical details are provided in the NeurIPS 2022 paper by Jaber A., Ribeiro A. H., Zhang J., and Bareinboim E., (2022) entitled "Causal Identification under Markov equivalence: Calculus, Algorithm, and Completeness".
This package provides methods for learning, from observational Gaussian family data (i.e., Gaussian data clusterized in families), Gaussian undirected and directed acyclic PGMs describing linear relationships among multiple phenotypes and a decomposition of the learned PGM into unconfounded genetic and environmental PGMs. Methods are based on zero partial correlation tests derived in the work by Ribeiro and Soler (2020).
This package provides methods for estimating and normalizing the M (intensity log-ratio) and A (mean log intensity) values from two-channel (or two-color) microarrays. Unlike conventional estimation methods which take into account only measures of location (e.g., mean and median) of the pixel intensities of each channel, the provided estimation method takes into account pixel-level variability, which may reflects uncertainties due noise and systematic artifacts.