The learning algorithms of causal discovery mainly include score-based methods and genetic algorithms(GA).The score-based algorithms are prone to searching space explosion.Classical GA is slow to converge,and prone to...The learning algorithms of causal discovery mainly include score-based methods and genetic algorithms(GA).The score-based algorithms are prone to searching space explosion.Classical GA is slow to converge,and prone to falling into local optima.To address these issues,an improved GA with domain knowledge(IGADK)is proposed.Firstly,domain knowledge is incorporated into the learning process of causality to construct a new fitness function.Secondly,a dynamical mutation operator is introduced in the algorithm to accelerate the convergence rate.Finally,an experiment is conducted on simulation data,which compares the classical GA with IGADK with domain knowledge of varying accuracy.The IGADK can greatly reduce the number of iterations,populations,and samples required for learning,which illustrates the efficiency and effectiveness of the proposed algorithm.展开更多
Causality,the science of cause and effect,has made it possible to create a new family of models.Such models are often referred to as causal models.Unlike those of mathematical,numerical,empirical,or machine learning(M...Causality,the science of cause and effect,has made it possible to create a new family of models.Such models are often referred to as causal models.Unlike those of mathematical,numerical,empirical,or machine learning(ML)nature,causal models hope to tie the cause(s)to the effect(s)pertaining to a phenomenon(i.e.,data generating process)through causal principles.This paper presents one of the first works at creating causal models in the area of structural and construction engineering.To this end,this paper starts with a brief review of the principles of causality and then adopts four causal discovery algorithms,namely,PC(Peter-Clark),FCI(fast causal inference),GES(greedy equivalence search),and GRa SP(greedy relaxation of the sparsest permutation),have been used to examine four phenomena,including predicting the load-bearing capacity of axially loaded members,fire resistance of structural members,shear strength of beams,and resistance of walls against impulsive(blast)loading.Findings from this study reveal the possibility and merit of discovering complete and partial causal models.Finally,this study also proposes two simple metrics that can help assess the performance of causal discovery algorithms.展开更多
Causal inference is a powerful modeling tool for explanatory analysis,which might enable current machine learning to become explainable.How to marry causal inference with machine learning to develop explainable artifi...Causal inference is a powerful modeling tool for explanatory analysis,which might enable current machine learning to become explainable.How to marry causal inference with machine learning to develop explainable artificial intelligence(XAI)algorithms is one of key steps toward to the artificial intelligence 2.0.With the aim of bringing knowledge of causal inference to scholars of machine learning and artificial intelligence,we invited researchers working on causal inference to write this survey from different aspects of causal inference.This survey includes the following sections:“Estimating average treatment effect:A brief review and beyond”from Dr.Kun Kuang,“Attribution problems in counterfactual inference”from Prof.Lian Li,“The Yule–Simpson paradox and the surrogate paradox”from Prof.Zhi Geng,“Causal potential theory”from Prof.Lei Xu,“Discovering causal information from observational data”from Prof.Kun Zhang,“Formal argumentation in causal reasoning and explanation”from Profs.Beishui Liao and Huaxin Huang,“Causal inference with complex experiments”from Prof.Peng Ding,“Instrumental variables and negative controls for observational studies”from Prof.Wang Miao,and“Causal inference with interference”from Dr.Zhichao Jiang.展开更多
Fault detection and diagnosis(FDD)plays a significant role in ensuring the safety and stability of chemical processes.With the development of artificial intelligence(AI)and big data technologies,data-driven approaches...Fault detection and diagnosis(FDD)plays a significant role in ensuring the safety and stability of chemical processes.With the development of artificial intelligence(AI)and big data technologies,data-driven approaches with excellent performance are widely used for FDD in chemical processes.However,improved predictive accuracy has often been achieved through increased model complexity,which turns models into black-box methods and causes uncertainty regarding their decisions.In this study,a causal temporal graph attention network(CTGAN)is proposed for fault diagnosis of chemical processes.A chemical causal graph is built by causal inference to represent the propagation path of faults.The attention mechanism and chemical causal graph were combined to help us notice the key variables relating to fault fluctuations.Experiments in the Tennessee Eastman(TE)process and the green ammonia(GA)process showed that CTGAN achieved high performance and good explainability.展开更多
Microblogging platforms like X(formerly Twitter)and Sina Weibo have become key channels for spreading information online.Accurately predicting information spread,such as users’reposting activities,is essential for ap...Microblogging platforms like X(formerly Twitter)and Sina Weibo have become key channels for spreading information online.Accurately predicting information spread,such as users’reposting activities,is essential for applications including content recommendation and analyzing public sentiment.Current advanced models rely on deep representation learning to extract features from various inputs,such as users’social connections and repost history,to forecast reposting behavior.Nonetheless,these models frequently ignore intrinsic confounding factors,which may cause the models to capture spurious relationships,ultimately impacting prediction performance.To address this limitation,we propose a novel Debiased Reposting Prediction model(DRP).Our model mitigates the influence of confounding variables by incorporating intervention operations from causal inference,enabling it to learn the causal associations between features and user reposting behavior.Specifically,we introduce a memory network within DRP to enhance the model’s perception of confounder distributions.This network aggregates and learns confounding information dispersed across different training data batches by optimizing the reconstruction loss.Furthermore,recognizing the challenge of acquiring prior knowledge of causal graphs,which is crucial for causal inference,we develop a causal discovery module within DRP(CD-DRP).This module allows the model to autonomously uncover the causal graph of feature variables by analyzing microblogging data.Experimental results on multiple real-world datasets demonstrate that our proposed method effectively uncovers causal relationships between variables,exhibits strong time efficiency,and outperforms state-of-the-art models in prediction performance(improved by 2.54%)and overfitting reduction(by 7.44%).展开更多
Gene regulatory network (GRN) inference from gene expression data is asignificant approach to understanding aspects of the biological system.Compared with generalized correlation-based methods, causality-inspiredones ...Gene regulatory network (GRN) inference from gene expression data is asignificant approach to understanding aspects of the biological system.Compared with generalized correlation-based methods, causality-inspiredones seem more rational to infer regulatory relationships. We proposeGRINCD, a novel GRN inference framework empowered by graph representationlearning and causal asymmetric learning, considering both linearand non-linear regulatory relationships. First, high-quality representation ofeach gene is generated using graph neural network. Then, we apply theadditive noise model to predict the causal regulation of each regulator-targetpair. Additionally, we design two channels and finally assemble them forrobust prediction. Through comprehensive comparisons of our frameworkwith state-of-the-art methods based on different principles on numerousdatasets of diverse types and scales, the experimental results show that ourframework achieves superior or comparable performance under variousevaluation metrics. Our work provides a new clue for constructing GRNs,and our proposed framework GRINCD also shows potential in identifyingkey factors affecting cancerdevelopment.展开更多
Transformer top oil temperature prediction is a research focal point in online monitoring of transformer operational status.Existing methods lack interpretability in feature selection and do not consider the temporal ...Transformer top oil temperature prediction is a research focal point in online monitoring of transformer operational status.Existing methods lack interpretability in feature selection and do not consider the temporal correlation of features.To address these issues,we provide a top oil temperature of electric transformers prediction method by using causal discovery and the Graph Neural Network(GNN)-Long Short-Term Memory(LSTM)model in this paper.To conduct feature selection,we use causal discovery to reduce feature dimensionality.Then,construct causal graph data based on the causal relationship matrix.Finally,spatiotemporal features are extracted by the GNN-LSTM model for oil temperature prediction.Experimental results demonstrate that this method can scientifically carry out feature selection,ensuring prediction accuracy and result robustness.展开更多
Model-based methods have recently been shown promising for offline reinforcement learning(RL),which aims at learning good policies from historical data without interacting with the environment.Previous model-based off...Model-based methods have recently been shown promising for offline reinforcement learning(RL),which aims at learning good policies from historical data without interacting with the environment.Previous model-based offline RL methods employ a straightforward prediction method that maps the states and actions directly to the next-step states.However,such a prediction method tends to capture spurious relations caused by the sampling policy preference behind the offline data.It is sensible that the environment model should focus on causal influences,which can facilitate learning an effective policy that can generalize well to unseen states.In this paper,we first provide theoretical results that causal environment models can outperform plain environment models in offline RL by incorporating the causal structure into the generalization error bound.We also propose a practical algorithm,oFfline mOdel-based reinforcement learning with CaUsal Structured World Models(FOCUS),to illustrate the feasibility of learning and leveraging causal structure in offline RL.Experimental results on two benchmarks show that FOCUS reconstructs the underlying causal structure accurately and robustly,and,as a result,outperforms both model-based offline RL algorithms and causal model-based offline RL algorithms.展开更多
The utilization of big Earth data has provided insights into the planet we inhabit in unprecedented dimensions and scales.Unraveling the concealed causal connections within intricate data holds paramount importance fo...The utilization of big Earth data has provided insights into the planet we inhabit in unprecedented dimensions and scales.Unraveling the concealed causal connections within intricate data holds paramount importance for attaining a profound comprehension of the Earth system.Statistical methods founded on correlation have predominated in Earth system science(ESS)for a long time.Nevertheless,correlation does not imply causation,especially when confronted with spurious correlations resulting from big data.Consequently,traditional correlation and regression methods are inadequate for addressing causation related problems in the Earth system.In recent years,propelled by advancements in causal theory and inference methods,particularly the maturity of causal discovery and causal graphical models,causal inference has demonstrated vigorous vitality in various research directions in the Earth system,such as regularities revealing,processes understanding,hypothesis testing,and physical models improving.This paper commences by delving into the origins,connotations,and development of causality,subsequently outlining the principal frameworks of causal inference and the commonly used methods in ESS.Additionally,it reviews the applications of causal inference in the main branches of the Earth system and summarizes the challenges and development directions of causal inference in ESS.In the big Earth data era,as an important method of big data analysis,causal inference,along with physical model and machine learning,can assist the paradigm transformation of ESS from a model-driven paradigm to a paradigm of integration of both mechanism and data.Looking forward,the establishment of a meticulously structured and normalized causal theory can act as a foundational cornerstone for fostering causal cognition in ESS and propel the leap from fragmented research towards a comprehensive understanding of the Earth system.展开更多
基金supported by the National Social Science Fund of China(2022-SKJJ-B-084).
文摘The learning algorithms of causal discovery mainly include score-based methods and genetic algorithms(GA).The score-based algorithms are prone to searching space explosion.Classical GA is slow to converge,and prone to falling into local optima.To address these issues,an improved GA with domain knowledge(IGADK)is proposed.Firstly,domain knowledge is incorporated into the learning process of causality to construct a new fitness function.Secondly,a dynamical mutation operator is introduced in the algorithm to accelerate the convergence rate.Finally,an experiment is conducted on simulation data,which compares the classical GA with IGADK with domain knowledge of varying accuracy.The IGADK can greatly reduce the number of iterations,populations,and samples required for learning,which illustrates the efficiency and effectiveness of the proposed algorithm.
文摘Causality,the science of cause and effect,has made it possible to create a new family of models.Such models are often referred to as causal models.Unlike those of mathematical,numerical,empirical,or machine learning(ML)nature,causal models hope to tie the cause(s)to the effect(s)pertaining to a phenomenon(i.e.,data generating process)through causal principles.This paper presents one of the first works at creating causal models in the area of structural and construction engineering.To this end,this paper starts with a brief review of the principles of causality and then adopts four causal discovery algorithms,namely,PC(Peter-Clark),FCI(fast causal inference),GES(greedy equivalence search),and GRa SP(greedy relaxation of the sparsest permutation),have been used to examine four phenomena,including predicting the load-bearing capacity of axially loaded members,fire resistance of structural members,shear strength of beams,and resistance of walls against impulsive(blast)loading.Findings from this study reveal the possibility and merit of discovering complete and partial causal models.Finally,this study also proposes two simple metrics that can help assess the performance of causal discovery algorithms.
文摘Causal inference is a powerful modeling tool for explanatory analysis,which might enable current machine learning to become explainable.How to marry causal inference with machine learning to develop explainable artificial intelligence(XAI)algorithms is one of key steps toward to the artificial intelligence 2.0.With the aim of bringing knowledge of causal inference to scholars of machine learning and artificial intelligence,we invited researchers working on causal inference to write this survey from different aspects of causal inference.This survey includes the following sections:“Estimating average treatment effect:A brief review and beyond”from Dr.Kun Kuang,“Attribution problems in counterfactual inference”from Prof.Lian Li,“The Yule–Simpson paradox and the surrogate paradox”from Prof.Zhi Geng,“Causal potential theory”from Prof.Lei Xu,“Discovering causal information from observational data”from Prof.Kun Zhang,“Formal argumentation in causal reasoning and explanation”from Profs.Beishui Liao and Huaxin Huang,“Causal inference with complex experiments”from Prof.Peng Ding,“Instrumental variables and negative controls for observational studies”from Prof.Wang Miao,and“Causal inference with interference”from Dr.Zhichao Jiang.
基金support of the National Key Research and Development Program of China(2021YFB4000505).
文摘Fault detection and diagnosis(FDD)plays a significant role in ensuring the safety and stability of chemical processes.With the development of artificial intelligence(AI)and big data technologies,data-driven approaches with excellent performance are widely used for FDD in chemical processes.However,improved predictive accuracy has often been achieved through increased model complexity,which turns models into black-box methods and causes uncertainty regarding their decisions.In this study,a causal temporal graph attention network(CTGAN)is proposed for fault diagnosis of chemical processes.A chemical causal graph is built by causal inference to represent the propagation path of faults.The attention mechanism and chemical causal graph were combined to help us notice the key variables relating to fault fluctuations.Experiments in the Tennessee Eastman(TE)process and the green ammonia(GA)process showed that CTGAN achieved high performance and good explainability.
文摘Microblogging platforms like X(formerly Twitter)and Sina Weibo have become key channels for spreading information online.Accurately predicting information spread,such as users’reposting activities,is essential for applications including content recommendation and analyzing public sentiment.Current advanced models rely on deep representation learning to extract features from various inputs,such as users’social connections and repost history,to forecast reposting behavior.Nonetheless,these models frequently ignore intrinsic confounding factors,which may cause the models to capture spurious relationships,ultimately impacting prediction performance.To address this limitation,we propose a novel Debiased Reposting Prediction model(DRP).Our model mitigates the influence of confounding variables by incorporating intervention operations from causal inference,enabling it to learn the causal associations between features and user reposting behavior.Specifically,we introduce a memory network within DRP to enhance the model’s perception of confounder distributions.This network aggregates and learns confounding information dispersed across different training data batches by optimizing the reconstruction loss.Furthermore,recognizing the challenge of acquiring prior knowledge of causal graphs,which is crucial for causal inference,we develop a causal discovery module within DRP(CD-DRP).This module allows the model to autonomously uncover the causal graph of feature variables by analyzing microblogging data.Experimental results on multiple real-world datasets demonstrate that our proposed method effectively uncovers causal relationships between variables,exhibits strong time efficiency,and outperforms state-of-the-art models in prediction performance(improved by 2.54%)and overfitting reduction(by 7.44%).
文摘Gene regulatory network (GRN) inference from gene expression data is asignificant approach to understanding aspects of the biological system.Compared with generalized correlation-based methods, causality-inspiredones seem more rational to infer regulatory relationships. We proposeGRINCD, a novel GRN inference framework empowered by graph representationlearning and causal asymmetric learning, considering both linearand non-linear regulatory relationships. First, high-quality representation ofeach gene is generated using graph neural network. Then, we apply theadditive noise model to predict the causal regulation of each regulator-targetpair. Additionally, we design two channels and finally assemble them forrobust prediction. Through comprehensive comparisons of our frameworkwith state-of-the-art methods based on different principles on numerousdatasets of diverse types and scales, the experimental results show that ourframework achieves superior or comparable performance under variousevaluation metrics. Our work provides a new clue for constructing GRNs,and our proposed framework GRINCD also shows potential in identifyingkey factors affecting cancerdevelopment.
文摘Transformer top oil temperature prediction is a research focal point in online monitoring of transformer operational status.Existing methods lack interpretability in feature selection and do not consider the temporal correlation of features.To address these issues,we provide a top oil temperature of electric transformers prediction method by using causal discovery and the Graph Neural Network(GNN)-Long Short-Term Memory(LSTM)model in this paper.To conduct feature selection,we use causal discovery to reduce feature dimensionality.Then,construct causal graph data based on the causal relationship matrix.Finally,spatiotemporal features are extracted by the GNN-LSTM model for oil temperature prediction.Experimental results demonstrate that this method can scientifically carry out feature selection,ensuring prediction accuracy and result robustness.
文摘Model-based methods have recently been shown promising for offline reinforcement learning(RL),which aims at learning good policies from historical data without interacting with the environment.Previous model-based offline RL methods employ a straightforward prediction method that maps the states and actions directly to the next-step states.However,such a prediction method tends to capture spurious relations caused by the sampling policy preference behind the offline data.It is sensible that the environment model should focus on causal influences,which can facilitate learning an effective policy that can generalize well to unseen states.In this paper,we first provide theoretical results that causal environment models can outperform plain environment models in offline RL by incorporating the causal structure into the generalization error bound.We also propose a practical algorithm,oFfline mOdel-based reinforcement learning with CaUsal Structured World Models(FOCUS),to illustrate the feasibility of learning and leveraging causal structure in offline RL.Experimental results on two benchmarks show that FOCUS reconstructs the underlying causal structure accurately and robustly,and,as a result,outperforms both model-based offline RL algorithms and causal model-based offline RL algorithms.
基金supported by the Basic Science Center for Tibetan Plateau Earth System(BCTPES,NSFC project Grant Nos.41988101)the National Natural Science Foundation of China(Grant No.42101397)。
文摘The utilization of big Earth data has provided insights into the planet we inhabit in unprecedented dimensions and scales.Unraveling the concealed causal connections within intricate data holds paramount importance for attaining a profound comprehension of the Earth system.Statistical methods founded on correlation have predominated in Earth system science(ESS)for a long time.Nevertheless,correlation does not imply causation,especially when confronted with spurious correlations resulting from big data.Consequently,traditional correlation and regression methods are inadequate for addressing causation related problems in the Earth system.In recent years,propelled by advancements in causal theory and inference methods,particularly the maturity of causal discovery and causal graphical models,causal inference has demonstrated vigorous vitality in various research directions in the Earth system,such as regularities revealing,processes understanding,hypothesis testing,and physical models improving.This paper commences by delving into the origins,connotations,and development of causality,subsequently outlining the principal frameworks of causal inference and the commonly used methods in ESS.Additionally,it reviews the applications of causal inference in the main branches of the Earth system and summarizes the challenges and development directions of causal inference in ESS.In the big Earth data era,as an important method of big data analysis,causal inference,along with physical model and machine learning,can assist the paradigm transformation of ESS from a model-driven paradigm to a paradigm of integration of both mechanism and data.Looking forward,the establishment of a meticulously structured and normalized causal theory can act as a foundational cornerstone for fostering causal cognition in ESS and propel the leap from fragmented research towards a comprehensive understanding of the Earth system.