The use of data driven models has been shown to be useful for simulating complex engineering processes,when the only information available consists of the data of the process.In this study,four data-driven models,name...The use of data driven models has been shown to be useful for simulating complex engineering processes,when the only information available consists of the data of the process.In this study,four data-driven models,namely multiple linear regression,artificial neural network,adaptive neural fuzzy inference system,and K nearest neighbor models based on collection of 207 laboratory tests,are investigated for compressive strength prediction of concrete at high temperature.In addition for each model,two different sets of input variables are examined:a complete set and a parsimonious set of involved variables.The results obtained are compared with each other and also to the equations of NIST Technical Note standard and demonstrate the suitability of using the data driven models to predict the compressive strength at high temperature.In addition,the results show employing the parsimonious set of input variables is sufficient for the data driven models to make satisfactory results.展开更多
Hydrocarbon production from shale has attracted much attention in the recent years. When applied to this prolific and hydrocarbon rich resource plays, our understanding of the complexities of the flow mechanism(sorpt...Hydrocarbon production from shale has attracted much attention in the recent years. When applied to this prolific and hydrocarbon rich resource plays, our understanding of the complexities of the flow mechanism(sorption process and flow behavior in complex fracture systems- induced or natural) leaves much to be desired. In this paper, we present and discuss a novel approach to modeling, history matching of hydrocarbon production from a Marcellus shale asset in southwestern Pennsylvania using advanced data mining, pattern recognition and machine learning technologies. In this new approach instead of imposing our understanding of the flow mechanism, the impact of multi-stage hydraulic fractures, and the production process on the reservoir model, we allow the production history, well log, completion and hydraulic fracturing data to guide our model and determine its behavior. The uniqueness of this technology is that it incorporates the so-called "hard data" directly into the reservoir model, so that the model can be used to optimize the hydraulic fracture process. The "hard data" refers to field measurements during the hydraulic fracturing process such as fluid and proppant type and amount, injection pressure and rate as well as proppant concentration. This novel approach contrasts with the current industry focus on the use of "soft data"(non-measured, interpretive data such as frac length, width,height and conductivity) in the reservoir models. The study focuses on a Marcellus shale asset that includes 135 wells with multiple pads, different landing targets, well length and reservoir properties. The full field history matching process was successfully completed using this data driven approach thus capturing the production behavior with acceptable accuracy for individual wells and for the entire asset.展开更多
When designing large-sized complex machinery products, the design focus is always on the overall per- formance; however, there exist no design theory and method based on performance driven. In view of the defi- ciency...When designing large-sized complex machinery products, the design focus is always on the overall per- formance; however, there exist no design theory and method based on performance driven. In view of the defi- ciency of the existing design theory, according to the performance features of complex mechanical products, the performance indices are introduced into the traditional design theory of "Requirement-Function-Structure" to construct a new five-domain design theory of "Client Requirement-Function-Performance-Structure-Design Parameter". To support design practice based on this new theory, a product data model is established by using per- formance indices and the mapping relationship between them and the other four domains. When the product data model is applied to high-speed train design and combining the existing research result and relevant standards, the corresponding data model and its structure involving five domains of high-speed trains are established, which can provide technical support for studying the relationships between typical performance indices and design parame- ters and the fast achievement of a high-speed train scheme design. The five domains provide a reference for the design specification and evaluation criteria of high speed train and a new idea for the train's parameter design.展开更多
Nowadays, high-precision motion controls are needed in modern manufacturing industry. A data-driven nonparametric model adaptive control(NMAC) method is proposed in this paper to control the position of a linear servo...Nowadays, high-precision motion controls are needed in modern manufacturing industry. A data-driven nonparametric model adaptive control(NMAC) method is proposed in this paper to control the position of a linear servo system. The controller design requires no information about the structure of linear servo system, and it is based on the estimation and forecasting of the pseudo-partial derivatives(PPD) which are estimated according to the voltage input and position output of the linear motor. The characteristics and operational mechanism of the permanent magnet synchronous linear motor(PMSLM) are introduced, and the proposed nonparametric model control strategy has been compared with the classic proportional-integral-derivative(PID) control algorithm. Several real-time experiments on the motion control system incorporating a permanent magnet synchronous linear motor showed that the nonparametric model adaptive control method improved the system s response to disturbances and its position-tracking precision, even for a nonlinear system with incompletely known dynamic characteristics.展开更多
In the synthesis of the control algorithm for complex systems, we are often faced with imprecise or unknown mathematical models of the dynamical systems, or even with problems in finding a mathematical model of the sy...In the synthesis of the control algorithm for complex systems, we are often faced with imprecise or unknown mathematical models of the dynamical systems, or even with problems in finding a mathematical model of the system in the open loop. To tackle these difficulties, an approach of data-driven model identification and control algorithm design based on the maximum stability degree criterion is proposed in this paper. The data-driven model identification procedure supposes the finding of the mathematical model of the system based on the undamped transient response of the closed-loop system. The system is approximated with the inertial model, where the coefficients are calculated based on the values of the critical transfer coefficient, oscillation amplitude and period of the underdamped response of the closed-loop system. The data driven control design supposes that the tuning parameters of the controller are calculated based on the parameters obtained from the previous step of system identification and there are presented the expressions for the calculation of the tuning parameters. The obtained results of data-driven model identification and algorithm for synthesis the controller were verified by computer simulation.展开更多
Recently, the China haze becomes more and more serious, but it is very difficult to model and control it. Here, a data-driven model is introduced for the simulation and monitoring of China haze. First, a multi-dimensi...Recently, the China haze becomes more and more serious, but it is very difficult to model and control it. Here, a data-driven model is introduced for the simulation and monitoring of China haze. First, a multi-dimensional evaluation system is built to evaluate the government performance of China haze. Second, a data-driven model is employed to reveal the operation mechanism of China’s haze and is described as a multi input and multi output system. Third, a prototype system is set up to verify the proposed scheme, and the result provides us with a graphical tool to monitor different haze control strategies.展开更多
The car-following models are the research basis of traffic flow theory and microscopic traffic simulation. Among the previous work, the theory-driven models are dominant, while the data-driven ones are relatively rare...The car-following models are the research basis of traffic flow theory and microscopic traffic simulation. Among the previous work, the theory-driven models are dominant, while the data-driven ones are relatively rare. In recent years, the related technologies of Intelligent Transportation System (ITS) re</span><span style="font-family:Verdana;">- </span><span style="font-family:Verdana;">presented by the Vehicles to Everything (V2X) technology have been developing rapidly. Utilizing the related technologies of ITS, the large-scale vehicle microscopic trajectory data with high quality can be acquired, which provides the research foundation for modeling the car-following behavior based on the data-driven methods. According to this point, a data-driven car-following model based on the Random Forest (RF) method was constructed in this work, and the Next Generation Simulation (NGSIM) dataset was used to calibrate and train the constructed model. The Artificial Neural Network (ANN) model, GM model, and Full Velocity Difference (FVD) model are em</span><span style="font-family:Verdana;">- </span><span style="font-family:Verdana;">ployed to comparatively verify the proposed model. The research results suggest that the model proposed in this work can accurately describe the car-</span><span style="font-family:Verdana;"> </span><span style="font-family:Verdana;">following behavior with better performance under multiple performance indicators.展开更多
This paper presents a simple nonparametric regression approach to data-driven computing in elasticity. We apply the kernel regression to the material data set, and formulate a system of nonlinear equations solved to o...This paper presents a simple nonparametric regression approach to data-driven computing in elasticity. We apply the kernel regression to the material data set, and formulate a system of nonlinear equations solved to obtain a static equilibrium state of an elastic structure. Preliminary numerical experiments illustrate that, compared with existing methods, the proposed method finds a reasonable solution even if data points distribute coarsely in a given material data set.展开更多
We develop a data driven method(probability model) to construct a composite shape descriptor by combining a pair of scale-based shape descriptors. The selection of a pair of scale-based shape descriptors is modeled as...We develop a data driven method(probability model) to construct a composite shape descriptor by combining a pair of scale-based shape descriptors. The selection of a pair of scale-based shape descriptors is modeled as the computation of the union of two events, i.e.,retrieving similar shapes by using a single scale-based shape descriptor. The pair of scale-based shape descriptors with the highest probability forms the composite shape descriptor. Given a shape database, the composite shape descriptors for the shapes constitute a planar point set.A VoR-Tree of the planar point set is then used as an indexing structure for efficient query operation. Experiments and comparisons show the effectiveness and efficiency of the proposed composite shape descriptor.展开更多
In this study the medium-term response of beach profiles was investigated at two sites: a gently sloping sandy beach and a steeper mixed sand and gravel beach. The former is the Duck site in North Carolina, on the ea...In this study the medium-term response of beach profiles was investigated at two sites: a gently sloping sandy beach and a steeper mixed sand and gravel beach. The former is the Duck site in North Carolina, on the east coast of the USA, which is exposed to Atlantic Ocean swells and storm waves, and the latter is the Milford-on-Sea site at Christchurch Bay, on the south coast of England, which is partially sheltered from Atlantic swells but has a directionally bimodal wave exposure. The data sets comprise detailed bathymetric surveys of beach profiles covering a period of more than 25 years for the Duck site and over 18 years for the Milford-on-Sea site. The structure of the data sets and the data-driven methods are described. Canonical correlation analysis (CCA) was used to find linkages between the wave characteristics and beach profiles. The sensitivity of the linkages was investigated by deploying a wave height threshold to filter out the smaller waves incrementally. The results of the analysis indicate that, for the gently sloping sandy beach, waves of all heights are important to the morphological response. For the mixed sand and gravel beach, filtering the smaller waves improves the statistical fit and it suggests that low-height waves do not play a primary role in the medium-term morohological resoonse, which is primarily driven by the intermittent larger storm waves.展开更多
In this paper, a real-time online data-driven adaptive method is developed to deal with uncertainties such as high nonlinearity, strong coupling, parameter perturbation and external disturbances in attitude control of...In this paper, a real-time online data-driven adaptive method is developed to deal with uncertainties such as high nonlinearity, strong coupling, parameter perturbation and external disturbances in attitude control of fixed-wing unmanned aerial vehicles (UAVs). Firstly, a model-free adaptive control (MFAC) method requiring only input/output (I/O) data and no model information is adopted for control scheme design of angular velocity subsystem which contains all model information and up-mentioned uncertainties. Secondly, the internal model control (IMC) method featured with less tuning parameters and convenient tuning process is adopted for control scheme design of the certain Euler angle subsystem. Simulation results show that, the method developed is obviously superior to the cascade PID (CPID) method and the nonlinear dynamic inversion (NDI) method.展开更多
The capacitance-resistance model (CRM) is an alternative to conventional reservoir simulation. CRM, a simplification of complex numerical models, uses production and injection rates to infer a reservoir description....The capacitance-resistance model (CRM) is an alternative to conventional reservoir simulation. CRM, a simplification of complex numerical models, uses production and injection rates to infer a reservoir description. There is no prior geologic model. The principal output of CRM fitting is the fraction of injected fluid (usually water) that is produced at a producer at steady-state. These fractions are interwell connectivities. Interwell connectivities are fundamental information needed to manage waterfloods in oil reservoirs. The data-driven CRM is a fast tool to estimate these parameters in mature fields and allows one to make full use of the dynamic data available. This paper considers the problem of setting an upper bound on the uncertainty of interwell connectivities for linear-constrained models. Using analytical bounds and numerical simulations, we derive a consistent upper limit on the uncertainty of interwell connections that can be used to quantify the information content of a given dataset.展开更多
Recently, the heat and electricity integrated energy system (HE-IES) has become a hot topic in both industry and academia. In the HE- IES, the potential flexibility of the buildings' thermal loads can be exploited...Recently, the heat and electricity integrated energy system (HE-IES) has become a hot topic in both industry and academia. In the HE- IES, the potential flexibility of the buildings' thermal loads can be exploited to relax the heat power balance constraints and consequently allow a more flexible operation of the combined heat and power units. In this paper, model-driven and data-driven techniques are combined to quantify the demand flexibility of the buildings' thermal loads in a non-instructive way. First, the explicit analytical equivalent thermal parameter (ETP) model of the aggregated buildings is developed. The heat transfer coefficient (k) and thermal inertia coefficient (C) of the ETP model are designated to measure the potential demand flexibility. Second, the Particle Swarm Optimization optimized Radial Basis Function neural network (PSO-RBF) is used to identify the relationship between the values of k and C and the meteorological factors. To obtain the training data, an innovative two-stage regression method based on the adaptive temporal resolution is proposed to extract k and C values from the historical thermal load data. Finally, a flexible thermal load model is built based on the predictions of the meteorological factors, which can be conveniently incorporated into the online dispatch of the HE-IES. A comprehensive simulation environment is designed to verify the accuracy and availability of the proposed technique.展开更多
文摘The use of data driven models has been shown to be useful for simulating complex engineering processes,when the only information available consists of the data of the process.In this study,four data-driven models,namely multiple linear regression,artificial neural network,adaptive neural fuzzy inference system,and K nearest neighbor models based on collection of 207 laboratory tests,are investigated for compressive strength prediction of concrete at high temperature.In addition for each model,two different sets of input variables are examined:a complete set and a parsimonious set of involved variables.The results obtained are compared with each other and also to the equations of NIST Technical Note standard and demonstrate the suitability of using the data driven models to predict the compressive strength at high temperature.In addition,the results show employing the parsimonious set of input variables is sufficient for the data driven models to make satisfactory results.
基金RPSEA and U.S.Department of Energy for partially funding this study
文摘Hydrocarbon production from shale has attracted much attention in the recent years. When applied to this prolific and hydrocarbon rich resource plays, our understanding of the complexities of the flow mechanism(sorption process and flow behavior in complex fracture systems- induced or natural) leaves much to be desired. In this paper, we present and discuss a novel approach to modeling, history matching of hydrocarbon production from a Marcellus shale asset in southwestern Pennsylvania using advanced data mining, pattern recognition and machine learning technologies. In this new approach instead of imposing our understanding of the flow mechanism, the impact of multi-stage hydraulic fractures, and the production process on the reservoir model, we allow the production history, well log, completion and hydraulic fracturing data to guide our model and determine its behavior. The uniqueness of this technology is that it incorporates the so-called "hard data" directly into the reservoir model, so that the model can be used to optimize the hydraulic fracture process. The "hard data" refers to field measurements during the hydraulic fracturing process such as fluid and proppant type and amount, injection pressure and rate as well as proppant concentration. This novel approach contrasts with the current industry focus on the use of "soft data"(non-measured, interpretive data such as frac length, width,height and conductivity) in the reservoir models. The study focuses on a Marcellus shale asset that includes 135 wells with multiple pads, different landing targets, well length and reservoir properties. The full field history matching process was successfully completed using this data driven approach thus capturing the production behavior with acceptable accuracy for individual wells and for the entire asset.
基金Supported by National Natural Science Foundation of China(Grant Nos.51275432,51505390)Sichuan Application Foundation Projects(Grant No.2016JY0098)Independent Research Project of TPL(Grant No.TPL1501)
文摘When designing large-sized complex machinery products, the design focus is always on the overall per- formance; however, there exist no design theory and method based on performance driven. In view of the defi- ciency of the existing design theory, according to the performance features of complex mechanical products, the performance indices are introduced into the traditional design theory of "Requirement-Function-Structure" to construct a new five-domain design theory of "Client Requirement-Function-Performance-Structure-Design Parameter". To support design practice based on this new theory, a product data model is established by using per- formance indices and the mapping relationship between them and the other four domains. When the product data model is applied to high-speed train design and combining the existing research result and relevant standards, the corresponding data model and its structure involving five domains of high-speed trains are established, which can provide technical support for studying the relationships between typical performance indices and design parame- ters and the fast achievement of a high-speed train scheme design. The five domains provide a reference for the design specification and evaluation criteria of high speed train and a new idea for the train's parameter design.
基金supported by Beijing Natural Science Foundation(No.4142017)International Cooperation Project of National Natural Science Foundation of China(No.61120106009)Beijing Science and Technology Commission Precision Machinery Projects(No.Z121100001612007)
文摘Nowadays, high-precision motion controls are needed in modern manufacturing industry. A data-driven nonparametric model adaptive control(NMAC) method is proposed in this paper to control the position of a linear servo system. The controller design requires no information about the structure of linear servo system, and it is based on the estimation and forecasting of the pseudo-partial derivatives(PPD) which are estimated according to the voltage input and position output of the linear motor. The characteristics and operational mechanism of the permanent magnet synchronous linear motor(PMSLM) are introduced, and the proposed nonparametric model control strategy has been compared with the classic proportional-integral-derivative(PID) control algorithm. Several real-time experiments on the motion control system incorporating a permanent magnet synchronous linear motor showed that the nonparametric model adaptive control method improved the system s response to disturbances and its position-tracking precision, even for a nonlinear system with incompletely known dynamic characteristics.
文摘In the synthesis of the control algorithm for complex systems, we are often faced with imprecise or unknown mathematical models of the dynamical systems, or even with problems in finding a mathematical model of the system in the open loop. To tackle these difficulties, an approach of data-driven model identification and control algorithm design based on the maximum stability degree criterion is proposed in this paper. The data-driven model identification procedure supposes the finding of the mathematical model of the system based on the undamped transient response of the closed-loop system. The system is approximated with the inertial model, where the coefficients are calculated based on the values of the critical transfer coefficient, oscillation amplitude and period of the underdamped response of the closed-loop system. The data driven control design supposes that the tuning parameters of the controller are calculated based on the parameters obtained from the previous step of system identification and there are presented the expressions for the calculation of the tuning parameters. The obtained results of data-driven model identification and algorithm for synthesis the controller were verified by computer simulation.
文摘Recently, the China haze becomes more and more serious, but it is very difficult to model and control it. Here, a data-driven model is introduced for the simulation and monitoring of China haze. First, a multi-dimensional evaluation system is built to evaluate the government performance of China haze. Second, a data-driven model is employed to reveal the operation mechanism of China’s haze and is described as a multi input and multi output system. Third, a prototype system is set up to verify the proposed scheme, and the result provides us with a graphical tool to monitor different haze control strategies.
文摘The car-following models are the research basis of traffic flow theory and microscopic traffic simulation. Among the previous work, the theory-driven models are dominant, while the data-driven ones are relatively rare. In recent years, the related technologies of Intelligent Transportation System (ITS) re</span><span style="font-family:Verdana;">- </span><span style="font-family:Verdana;">presented by the Vehicles to Everything (V2X) technology have been developing rapidly. Utilizing the related technologies of ITS, the large-scale vehicle microscopic trajectory data with high quality can be acquired, which provides the research foundation for modeling the car-following behavior based on the data-driven methods. According to this point, a data-driven car-following model based on the Random Forest (RF) method was constructed in this work, and the Next Generation Simulation (NGSIM) dataset was used to calibrate and train the constructed model. The Artificial Neural Network (ANN) model, GM model, and Full Velocity Difference (FVD) model are em</span><span style="font-family:Verdana;">- </span><span style="font-family:Verdana;">ployed to comparatively verify the proposed model. The research results suggest that the model proposed in this work can accurately describe the car-</span><span style="font-family:Verdana;"> </span><span style="font-family:Verdana;">following behavior with better performance under multiple performance indicators.
基金Supported by National Basic Research Program of China(973 Program)(2013CB035500) National Natural Science Foundation of China(61233004,61221003,61074061)+1 种基金 International Cooperation Program of Shanghai Science and Technology Commission (12230709600) the Higher Education Research Fund for the Doctoral Program of China(20120073130006)
基金supported by JSPS KAKENHI (Grants 17K06633 and 18K18898)
文摘This paper presents a simple nonparametric regression approach to data-driven computing in elasticity. We apply the kernel regression to the material data set, and formulate a system of nonlinear equations solved to obtain a static equilibrium state of an elastic structure. Preliminary numerical experiments illustrate that, compared with existing methods, the proposed method finds a reasonable solution even if data points distribute coarsely in a given material data set.
基金supported by the National Key R&D Plan of China(2016YFB1001501)
文摘We develop a data driven method(probability model) to construct a composite shape descriptor by combining a pair of scale-based shape descriptors. The selection of a pair of scale-based shape descriptors is modeled as the computation of the union of two events, i.e.,retrieving similar shapes by using a single scale-based shape descriptor. The pair of scale-based shape descriptors with the highest probability forms the composite shape descriptor. Given a shape database, the composite shape descriptors for the shapes constitute a planar point set.A VoR-Tree of the planar point set is then used as an indexing structure for efficient query operation. Experiments and comparisons show the effectiveness and efficiency of the proposed composite shape descriptor.
基金supported by the UK Natural Environment Research Council(Grant No.NE/J005606/1)the UK Engineering and Physical Sciences Research Council(Grant No.EP/C005392/1)the Ensemble Estimation of Flood Risk in a Changing Climate(EFRa CC)project funded by the British Council under its Global Innovation Initiative
文摘In this study the medium-term response of beach profiles was investigated at two sites: a gently sloping sandy beach and a steeper mixed sand and gravel beach. The former is the Duck site in North Carolina, on the east coast of the USA, which is exposed to Atlantic Ocean swells and storm waves, and the latter is the Milford-on-Sea site at Christchurch Bay, on the south coast of England, which is partially sheltered from Atlantic swells but has a directionally bimodal wave exposure. The data sets comprise detailed bathymetric surveys of beach profiles covering a period of more than 25 years for the Duck site and over 18 years for the Milford-on-Sea site. The structure of the data sets and the data-driven methods are described. Canonical correlation analysis (CCA) was used to find linkages between the wave characteristics and beach profiles. The sensitivity of the linkages was investigated by deploying a wave height threshold to filter out the smaller waves incrementally. The results of the analysis indicate that, for the gently sloping sandy beach, waves of all heights are important to the morphological response. For the mixed sand and gravel beach, filtering the smaller waves improves the statistical fit and it suggests that low-height waves do not play a primary role in the medium-term morohological resoonse, which is primarily driven by the intermittent larger storm waves.
文摘In this paper, a real-time online data-driven adaptive method is developed to deal with uncertainties such as high nonlinearity, strong coupling, parameter perturbation and external disturbances in attitude control of fixed-wing unmanned aerial vehicles (UAVs). Firstly, a model-free adaptive control (MFAC) method requiring only input/output (I/O) data and no model information is adopted for control scheme design of angular velocity subsystem which contains all model information and up-mentioned uncertainties. Secondly, the internal model control (IMC) method featured with less tuning parameters and convenient tuning process is adopted for control scheme design of the certain Euler angle subsystem. Simulation results show that, the method developed is obviously superior to the cascade PID (CPID) method and the nonlinear dynamic inversion (NDI) method.
基金YPF for financial support and to the Center for Petroleum Asset Risk Management of the University of Texas at Austin for hospitality and an exciting research environment
文摘The capacitance-resistance model (CRM) is an alternative to conventional reservoir simulation. CRM, a simplification of complex numerical models, uses production and injection rates to infer a reservoir description. There is no prior geologic model. The principal output of CRM fitting is the fraction of injected fluid (usually water) that is produced at a producer at steady-state. These fractions are interwell connectivities. Interwell connectivities are fundamental information needed to manage waterfloods in oil reservoirs. The data-driven CRM is a fast tool to estimate these parameters in mature fields and allows one to make full use of the dynamic data available. This paper considers the problem of setting an upper bound on the uncertainty of interwell connectivities for linear-constrained models. Using analytical bounds and numerical simulations, we derive a consistent upper limit on the uncertainty of interwell connections that can be used to quantify the information content of a given dataset.
基金supported by the National Natural Science Foundation of China(52107072)International Cooperation and Exchange of NSFC(51861145406).
文摘Recently, the heat and electricity integrated energy system (HE-IES) has become a hot topic in both industry and academia. In the HE- IES, the potential flexibility of the buildings' thermal loads can be exploited to relax the heat power balance constraints and consequently allow a more flexible operation of the combined heat and power units. In this paper, model-driven and data-driven techniques are combined to quantify the demand flexibility of the buildings' thermal loads in a non-instructive way. First, the explicit analytical equivalent thermal parameter (ETP) model of the aggregated buildings is developed. The heat transfer coefficient (k) and thermal inertia coefficient (C) of the ETP model are designated to measure the potential demand flexibility. Second, the Particle Swarm Optimization optimized Radial Basis Function neural network (PSO-RBF) is used to identify the relationship between the values of k and C and the meteorological factors. To obtain the training data, an innovative two-stage regression method based on the adaptive temporal resolution is proposed to extract k and C values from the historical thermal load data. Finally, a flexible thermal load model is built based on the predictions of the meteorological factors, which can be conveniently incorporated into the online dispatch of the HE-IES. A comprehensive simulation environment is designed to verify the accuracy and availability of the proposed technique.