Medical Internet of Things(IoT)devices are becoming more and more common in healthcare.This has created a huge need for advanced predictive health modeling strategies that can make good use of the growing amount of mu...Medical Internet of Things(IoT)devices are becoming more and more common in healthcare.This has created a huge need for advanced predictive health modeling strategies that can make good use of the growing amount of multimodal data to find potential health risks early and help individuals in a personalized way.Existing methods,while useful,have limitations in predictive accuracy,delay,personalization,and user interpretability,requiring a more comprehensive and efficient approach to harness modern medical IoT devices.MAIPFE is a multimodal approach integrating pre-emptive analysis,personalized feature selection,and explainable AI for real-time health monitoring and disease detection.By using AI for early disease detection,personalized health recommendations,and transparency,healthcare will be transformed.The Multimodal Approach Integrating Pre-emptive Analysis,Personalized Feature Selection,and Explainable AI(MAIPFE)framework,which combines Firefly Optimizer,Recurrent Neural Network(RNN),Fuzzy C Means(FCM),and Explainable AI,improves disease detection precision over existing methods.Comprehensive metrics show the model’s superiority in real-time health analysis.The proposed framework outperformed existing models by 8.3%in disease detection classification precision,8.5%in accuracy,5.5%in recall,2.9%in specificity,4.5%in AUC(Area Under the Curve),and 4.9%in delay reduction.Disease prediction precision increased by 4.5%,accuracy by 3.9%,recall by 2.5%,specificity by 3.5%,AUC by 1.9%,and delay levels decreased by 9.4%.MAIPFE can revolutionize healthcare with preemptive analysis,personalized health insights,and actionable recommendations.The research shows that this innovative approach improves patient outcomes and healthcare efficiency in the real world.展开更多
In a competitive digital age where data volumes are increasing with time, the ability to extract meaningful knowledge from high-dimensional data using machine learning (ML) and data mining (DM) techniques and making d...In a competitive digital age where data volumes are increasing with time, the ability to extract meaningful knowledge from high-dimensional data using machine learning (ML) and data mining (DM) techniques and making decisions based on the extracted knowledge is becoming increasingly important in all business domains. Nevertheless, high-dimensional data remains a major challenge for classification algorithms due to its high computational cost and storage requirements. The 2016 Demographic and Health Survey of Ethiopia (EDHS 2016) used as the data source for this study which is publicly available contains several features that may not be relevant to the prediction task. In this paper, we developed a hybrid multidimensional metrics framework for predictive modeling for both model performance evaluation and feature selection to overcome the feature selection challenges and select the best model among the available models in DM and ML. The proposed hybrid metrics were used to measure the efficiency of the predictive models. Experimental results show that the decision tree algorithm is the most efficient model. The higher score of HMM (m, r) = 0.47 illustrates the overall significant model that encompasses almost all the user’s requirements, unlike the classical metrics that use a criterion to select the most appropriate model. On the other hand, the ANNs were found to be the most computationally intensive for our prediction task. Moreover, the type of data and the class size of the dataset (unbalanced data) have a significant impact on the efficiency of the model, especially on the computational cost, and the interpretability of the parameters of the model would be hampered. And the efficiency of the predictive model could be improved with other feature selection algorithms (especially hybrid metrics) considering the experts of the knowledge domain, as the understanding of the business domain has a significant impact.展开更多
Artificial neural network(ANN)has become an important method to model the nonlinear relationships between weather conditions,building characteristics and its heat demand.Due to the large amount of training data re-qui...Artificial neural network(ANN)has become an important method to model the nonlinear relationships between weather conditions,building characteristics and its heat demand.Due to the large amount of training data re-quired for ANN training,data reduction and feature selection are important to simplify the training.However,in building heat demand prediction,many weather-related input variables contain duplicated features.This paper develops a sensitivity analysis approach to analyse the correlation between input variables and to detect the variables that have high importance but contain duplicated features.The proposed approach is validated in a case study that predicts the heat demand of a district heating network containing tens of buildings at a university campus.The results show that the proposed approach detected and removed several unnecessary input variables and helped the ANN model to reduce approximately 20%training time compared with the traditional methods while maintaining the prediction accuracy.It indicates that the approach can be applied for analysing large num-ber of input variables to help improving the training efficiency of ANN in district heat demand prediction and other applications.展开更多
This paper proposes a model to analyze the massive data of electricity.Feature subset is determined by the correla-tion-based feature selection and the data-driven methods.The attribute season can be classified succes...This paper proposes a model to analyze the massive data of electricity.Feature subset is determined by the correla-tion-based feature selection and the data-driven methods.The attribute season can be classified successfully through five classi-fiers using the selected feature subset,and the best model can be determined further.The effects on analyzing electricity consump-tion of the other three attributes,including months,businesses,and meters,can be estimated using the chosen model.The data used for the project is provided by Beijing Power Supply Bureau.We use WEKA as the machine learning tool.The models we built are promising for electricity scheduling and power theft detection.展开更多
Electrical load forecasting is very crucial for electrical power systems’planning and operation.Both electrical buildings’load demand and meteorological datasets may contain hidden patterns that are required to be i...Electrical load forecasting is very crucial for electrical power systems’planning and operation.Both electrical buildings’load demand and meteorological datasets may contain hidden patterns that are required to be investigated and studied to show their potential impact on load forecasting.The meteorological data are analyzed in this study through different data mining techniques aiming to predict the electrical load demand of a factory located in Riyadh,Saudi Arabia.The factory load and meteorological data used in this study are recorded hourly between 2016 and 2017.These data are provided by King Abdullah City for Atomic and Renewable Energy and Saudi Electricity Company at a site located in Riyadh.After applying the data pre-processing techniques to prepare the data,different machine learning algorithms,namely Artificial Neural Network and Support Vector Regression(SVR),are applied and compared to predict the factory load.In addition,for the sake of selecting the optimal set of features,13 different combinations of features are investigated in this study.The outcomes of this study emphasize selecting the optimal set of features as more features may add complexity to the learning process.Finally,the SVR algorithm with six features provides the most accurate prediction values to predict the factory load.展开更多
文摘Medical Internet of Things(IoT)devices are becoming more and more common in healthcare.This has created a huge need for advanced predictive health modeling strategies that can make good use of the growing amount of multimodal data to find potential health risks early and help individuals in a personalized way.Existing methods,while useful,have limitations in predictive accuracy,delay,personalization,and user interpretability,requiring a more comprehensive and efficient approach to harness modern medical IoT devices.MAIPFE is a multimodal approach integrating pre-emptive analysis,personalized feature selection,and explainable AI for real-time health monitoring and disease detection.By using AI for early disease detection,personalized health recommendations,and transparency,healthcare will be transformed.The Multimodal Approach Integrating Pre-emptive Analysis,Personalized Feature Selection,and Explainable AI(MAIPFE)framework,which combines Firefly Optimizer,Recurrent Neural Network(RNN),Fuzzy C Means(FCM),and Explainable AI,improves disease detection precision over existing methods.Comprehensive metrics show the model’s superiority in real-time health analysis.The proposed framework outperformed existing models by 8.3%in disease detection classification precision,8.5%in accuracy,5.5%in recall,2.9%in specificity,4.5%in AUC(Area Under the Curve),and 4.9%in delay reduction.Disease prediction precision increased by 4.5%,accuracy by 3.9%,recall by 2.5%,specificity by 3.5%,AUC by 1.9%,and delay levels decreased by 9.4%.MAIPFE can revolutionize healthcare with preemptive analysis,personalized health insights,and actionable recommendations.The research shows that this innovative approach improves patient outcomes and healthcare efficiency in the real world.
文摘In a competitive digital age where data volumes are increasing with time, the ability to extract meaningful knowledge from high-dimensional data using machine learning (ML) and data mining (DM) techniques and making decisions based on the extracted knowledge is becoming increasingly important in all business domains. Nevertheless, high-dimensional data remains a major challenge for classification algorithms due to its high computational cost and storage requirements. The 2016 Demographic and Health Survey of Ethiopia (EDHS 2016) used as the data source for this study which is publicly available contains several features that may not be relevant to the prediction task. In this paper, we developed a hybrid multidimensional metrics framework for predictive modeling for both model performance evaluation and feature selection to overcome the feature selection challenges and select the best model among the available models in DM and ML. The proposed hybrid metrics were used to measure the efficiency of the predictive models. Experimental results show that the decision tree algorithm is the most efficient model. The higher score of HMM (m, r) = 0.47 illustrates the overall significant model that encompasses almost all the user’s requirements, unlike the classical metrics that use a criterion to select the most appropriate model. On the other hand, the ANNs were found to be the most computationally intensive for our prediction task. Moreover, the type of data and the class size of the dataset (unbalanced data) have a significant impact on the efficiency of the model, especially on the computational cost, and the interpretability of the parameters of the model would be hampered. And the efficiency of the predictive model could be improved with other feature selection algorithms (especially hybrid metrics) considering the experts of the knowledge domain, as the understanding of the business domain has a significant impact.
文摘Artificial neural network(ANN)has become an important method to model the nonlinear relationships between weather conditions,building characteristics and its heat demand.Due to the large amount of training data re-quired for ANN training,data reduction and feature selection are important to simplify the training.However,in building heat demand prediction,many weather-related input variables contain duplicated features.This paper develops a sensitivity analysis approach to analyse the correlation between input variables and to detect the variables that have high importance but contain duplicated features.The proposed approach is validated in a case study that predicts the heat demand of a district heating network containing tens of buildings at a university campus.The results show that the proposed approach detected and removed several unnecessary input variables and helped the ANN model to reduce approximately 20%training time compared with the traditional methods while maintaining the prediction accuracy.It indicates that the approach can be applied for analysing large num-ber of input variables to help improving the training efficiency of ANN in district heat demand prediction and other applications.
基金Supported by the National Earthquake Major Project of China (201008007)the Fundamental Research Funds for Central University of China (216275645)
文摘This paper proposes a model to analyze the massive data of electricity.Feature subset is determined by the correla-tion-based feature selection and the data-driven methods.The attribute season can be classified successfully through five classi-fiers using the selected feature subset,and the best model can be determined further.The effects on analyzing electricity consump-tion of the other three attributes,including months,businesses,and meters,can be estimated using the chosen model.The data used for the project is provided by Beijing Power Supply Bureau.We use WEKA as the machine learning tool.The models we built are promising for electricity scheduling and power theft detection.
基金Funding Statement:The researchers would like to thank the Deanship of Scientific Research,Qassim University for funding the publication of this project.
文摘Electrical load forecasting is very crucial for electrical power systems’planning and operation.Both electrical buildings’load demand and meteorological datasets may contain hidden patterns that are required to be investigated and studied to show their potential impact on load forecasting.The meteorological data are analyzed in this study through different data mining techniques aiming to predict the electrical load demand of a factory located in Riyadh,Saudi Arabia.The factory load and meteorological data used in this study are recorded hourly between 2016 and 2017.These data are provided by King Abdullah City for Atomic and Renewable Energy and Saudi Electricity Company at a site located in Riyadh.After applying the data pre-processing techniques to prepare the data,different machine learning algorithms,namely Artificial Neural Network and Support Vector Regression(SVR),are applied and compared to predict the factory load.In addition,for the sake of selecting the optimal set of features,13 different combinations of features are investigated in this study.The outcomes of this study emphasize selecting the optimal set of features as more features may add complexity to the learning process.Finally,the SVR algorithm with six features provides the most accurate prediction values to predict the factory load.