Manual investigation of chest radiography(CXR)images by physicians is crucial for effective decision-making in COVID-19 diagnosis.However,the high demand during the pandemic necessitates auxiliary help through image a...Manual investigation of chest radiography(CXR)images by physicians is crucial for effective decision-making in COVID-19 diagnosis.However,the high demand during the pandemic necessitates auxiliary help through image analysis and machine learning techniques.This study presents a multi-threshold-based segmentation technique to probe high pixel intensity regions in CXR images of various pathologies,including normal cases.Texture information is extracted using gray co-occurrence matrix(GLCM)-based features,while vessel-like features are obtained using Frangi,Sato,and Meijering filters.Machine learning models employing Decision Tree(DT)and RandomForest(RF)approaches are designed to categorize CXR images into common lung infections,lung opacity(LO),COVID-19,and viral pneumonia(VP).The results demonstrate that the fusion of texture and vesselbased features provides an effective ML model for aiding diagnosis.The ML model validation using performance measures,including an accuracy of approximately 91.8%with an RF-based classifier,supports the usefulness of the feature set and classifier model in categorizing the four different pathologies.Furthermore,the study investigates the importance of the devised features in identifying the underlying pathology and incorporates histogrambased analysis.This analysis reveals varying natural pixel distributions in CXR images belonging to the normal,COVID-19,LO,and VP groups,motivating the incorporation of additional features such as mean,standard deviation,skewness,and percentile based on the filtered images.Notably,the study achieves a considerable improvement in categorizing COVID-19 from LO,with a true positive rate of 97%,further substantiating the effectiveness of the methodology implemented.展开更多
This study presents an efficient method for automatically identifying the crystal structure and orientation of Y-doped HfO_(2)-based thin films using deep learning.This approach enables large-scale crystallographic an...This study presents an efficient method for automatically identifying the crystal structure and orientation of Y-doped HfO_(2)-based thin films using deep learning.This approach enables large-scale crystallographic analysis with sub-nanometer spatial resolution using only scanning transmission electron microscopy(STEM)atomic images,thereby reducing the reliance on manual expert interpretation.The Xception network-based model extracts detailed crystallographic information through structure and entropy maps,effectively identifying subtle pattern changes and local structural discontinuities.Entropy maps are utilized to analyze the atomic structure disorder and detect ambiguous boundaries and strained regions.Analysis of Y-doped HfO_(2)thin films reveals that the film thickness significantly affects the ferroelectric properties,with theOphase dominant in 5 nmfilms and the M phase proportion increasing as the thickness increases.This machine-learning-based STEM atomic image analysis method provides an automated solution to accelerate ferroelectric material research and promote the development of next-generation electronic devices,offering an accurate understanding and control of microstructural characteristics.展开更多
Smartphone devices particularly Android devices are in use by billions of people everywhere in the world.Similarly,this increasing rate attracts mobile botnet attacks which is a network of interconnected nodes operate...Smartphone devices particularly Android devices are in use by billions of people everywhere in the world.Similarly,this increasing rate attracts mobile botnet attacks which is a network of interconnected nodes operated through the command and control(C&C)method to expand malicious activities.At present,mobile botnet attacks launched the Distributed denial of services(DDoS)that causes to steal of sensitive data,remote access,and spam generation,etc.Consequently,various approaches are defined in the literature to detect mobile botnet attacks using static or dynamic analysis.In this paper,a novel hybrid model,the combination of static and dynamic methods that relies on machine learning to detect android botnet applications is proposed.Furthermore,results are evaluated using machine learning classifiers.The Random Forest(RF)classifier outperform as compared to other ML techniques i.e.,Naïve Bayes(NB),Support Vector Machine(SVM),and Simple Logistic(SL).Our proposed framework achieved 97.48%accuracy in the detection of botnet applications.Finally,some future research directions are highlighted regarding botnet attacks detection for the entire community.展开更多
Machinery condition monitoring is beneficial to equipment maintenance and has been receiving much attention from academia and industry.Machine learning,especially deep learning,has become popular for machinery conditi...Machinery condition monitoring is beneficial to equipment maintenance and has been receiving much attention from academia and industry.Machine learning,especially deep learning,has become popular for machinery condition monitoring because that can fully use available data and computational power.Since significant accidents might be caused if wrong fault alarms are given for machine condition monitoring,interpretable machine learning models,integrate signal processing knowledge to enhance trustworthiness of models,are gradually becoming a research hotspot.A previous spectrum-based and interpretable optimized weights method has been proposed to indicate faulty and fundamental frequencies when the analyzed data only contains a healthy type and a fault type.Considering that multiclass fault types are naturally met in practice,this work aims to explore the interpretable optimized weights method for multiclass fault type scenarios.Therefore,a new multiclass optimized weights spectrum(OWS)is proposed and further studied theoretically and numerically.It is found that the multiclass OWS is capable of capturing the characteristic components associated with different conditions and clearly indicating specific fault characteristic frequencies(FCFs)corresponding to each fault condition.This work can provide new insights into spectrum-based fault classification models,and the new multiclass OWS also shows great potential for practical applications.展开更多
In order to classify the minimal hepatic encephalopathy (MHE) patients from healthy controls, the independent component analysis (ICA) is used to generate the default mode network (DMN) from resting-state functi...In order to classify the minimal hepatic encephalopathy (MHE) patients from healthy controls, the independent component analysis (ICA) is used to generate the default mode network (DMN) from resting-state functional magnetic resonance imaging (fMRI). Then a Bayesian voxel- wised method, graphical-model-based multivariate analysis (GAMMA), is used to explore the associations between abnormal functional integration within DMN and clinical variable. Without any prior knowledge, five machine learning methods, namely, support vector machines (SVMs), classification and regression trees ( CART ), logistic regression, the Bayesian network, and C4.5, are applied to the classification. The functional integration patterns were alternative within DMN, which have the power to predict MHE with an accuracy of 98%. The GAMMA method generating functional integration patterns within DMN can become a simple, objective, and common imaging biomarker for detecting MIIE and can serve as a supplement to the existing diagnostic methods.展开更多
Web-blogging sites such as Twitter and Facebook are heavily influenced by emotions,sentiments,and data in the modern era.Twitter,a widely used microblogging site where individuals share their thoughts in the form of t...Web-blogging sites such as Twitter and Facebook are heavily influenced by emotions,sentiments,and data in the modern era.Twitter,a widely used microblogging site where individuals share their thoughts in the form of tweets,has become a major source for sentiment analysis.In recent years,there has been a significant increase in demand for sentiment analysis to identify and classify opinions or expressions in text or tweets.Opinions or expressions of people about a particular topic,situation,person,or product can be identified from sentences and divided into three categories:positive for good,negative for bad,and neutral for mixed or confusing opinions.The process of analyzing changes in sentiment and the combination of these categories is known as“sentiment analysis.”In this study,sentiment analysis was performed on a dataset of 90,000 tweets using both deep learning and machine learning methods.The deep learning-based model long-short-term memory(LSTM)performed better than machine learning approaches.Long short-term memory achieved 87%accuracy,and the support vector machine(SVM)classifier achieved slightly worse results than LSTM at 86%.The study also tested binary classes of positive and negative,where LSTM and SVM both achieved 90%accuracy.展开更多
Traffic encryption techniques facilitate cyberattackers to hide their presence and activities.Traffic classification is an important method to prevent network threats.However,due to the tremendous traffic volume and l...Traffic encryption techniques facilitate cyberattackers to hide their presence and activities.Traffic classification is an important method to prevent network threats.However,due to the tremendous traffic volume and limitations of computing,most existing traffic classification techniques are inapplicable to the high-speed network environment.In this paper,we propose a High-speed Encrypted Traffic Classification(HETC)method containing two stages.First,to efficiently detect whether traffic is encrypted,HETC focuses on randomly sampled short flows and extracts aggregation entropies with chi-square test features to measure the different patterns of the byte composition and distribution between encrypted and unencrypted flows.Second,HETC introduces binary features upon the previous features and performs fine-grained traffic classification by combining these payload features with a Random Forest model.The experimental results show that HETC can achieve a 94%F-measure in detecting encrypted flows and a 85%–93%F-measure in classifying fine-grained flows for a 1-KB flow-length dataset,outperforming the state-of-the-art comparison methods.Meanwhile,HETC does not need to wait for the end of the flow and can extract mass computing features.The average time for HETC to process each flow is only 2 or 16 ms,which is lower than the flow duration in most cases,making it a good candidate for high-speed traffic classification.展开更多
Predictive maintenance is essential for the implementation of an innovative and efficient structural health monitoring strategy.Models capable of accurately interpreting new data automatically collected by suitably pl...Predictive maintenance is essential for the implementation of an innovative and efficient structural health monitoring strategy.Models capable of accurately interpreting new data automatically collected by suitably placed sensors to assess the state of the infrastructure represent a fundamental step,particularly for the railway sector,whose safe and continuous operation plays a strategic role in the well-being and development of nations.In this scenario,the benefits of a digital twin of a bonded insu-lated rail joint(IRJ)with the predictive capabilities of advanced classification algorithms based on artificial intelligence have been explored.The digital model provides an accurate mechanical response of the infrastructure as a pair of wheels passes over the joint.As bolt preload conditions vary,four structural health classes were identified for the joint.Two parameters,i.e.gap value and vertical displacement,which are strongly correlated with bolt preload,are used in different combinations to train and test five predictive classifiers.Their classification effectiveness was assessed using several performance indica-tors.Finally,we compared the IRJ condition predictions of two trained classifiers with the available data,confirming their high accuracy.The approach presented provides an interesting solution for future predictive tools in SHM especially in the case of complex systems such as railways where the vehicle-infrastructure interaction is complex and always time varying.展开更多
文摘Manual investigation of chest radiography(CXR)images by physicians is crucial for effective decision-making in COVID-19 diagnosis.However,the high demand during the pandemic necessitates auxiliary help through image analysis and machine learning techniques.This study presents a multi-threshold-based segmentation technique to probe high pixel intensity regions in CXR images of various pathologies,including normal cases.Texture information is extracted using gray co-occurrence matrix(GLCM)-based features,while vessel-like features are obtained using Frangi,Sato,and Meijering filters.Machine learning models employing Decision Tree(DT)and RandomForest(RF)approaches are designed to categorize CXR images into common lung infections,lung opacity(LO),COVID-19,and viral pneumonia(VP).The results demonstrate that the fusion of texture and vesselbased features provides an effective ML model for aiding diagnosis.The ML model validation using performance measures,including an accuracy of approximately 91.8%with an RF-based classifier,supports the usefulness of the feature set and classifier model in categorizing the four different pathologies.Furthermore,the study investigates the importance of the devised features in identifying the underlying pathology and incorporates histogrambased analysis.This analysis reveals varying natural pixel distributions in CXR images belonging to the normal,COVID-19,LO,and VP groups,motivating the incorporation of additional features such as mean,standard deviation,skewness,and percentile based on the filtered images.Notably,the study achieves a considerable improvement in categorizing COVID-19 from LO,with a true positive rate of 97%,further substantiating the effectiveness of the methodology implemented.
基金supported by the Ministry of Trade,Industry,and Energy(MOTIE)of Korea(No.P0022331)supervised by the Korea Institute for Advancement of Technology(KIAT)+4 种基金the Technology Innovation Program(Alchemist Project,AI-based supercritical materials discovery)funded by MOTIE,Korea(No.20012196)supervised by the Korea Evaluation Institute of Industrial Technology(KEIT)the National R&D Program through the National Research Foundation of Korea(NRF),funded by the Ministry of Science and ICT(RS-2024-00450561)the institutional research program(2E33912)of the Korea Institute of Science and Technology(KIST)National Research Foundation of Korea(NRF)grant funded by the Ministry of Science and ICT(NRF-2020M3F3A2A01081572).
文摘This study presents an efficient method for automatically identifying the crystal structure and orientation of Y-doped HfO_(2)-based thin films using deep learning.This approach enables large-scale crystallographic analysis with sub-nanometer spatial resolution using only scanning transmission electron microscopy(STEM)atomic images,thereby reducing the reliance on manual expert interpretation.The Xception network-based model extracts detailed crystallographic information through structure and entropy maps,effectively identifying subtle pattern changes and local structural discontinuities.Entropy maps are utilized to analyze the atomic structure disorder and detect ambiguous boundaries and strained regions.Analysis of Y-doped HfO_(2)thin films reveals that the film thickness significantly affects the ferroelectric properties,with theOphase dominant in 5 nmfilms and the M phase proportion increasing as the thickness increases.This machine-learning-based STEM atomic image analysis method provides an automated solution to accelerate ferroelectric material research and promote the development of next-generation electronic devices,offering an accurate understanding and control of microstructural characteristics.
文摘Smartphone devices particularly Android devices are in use by billions of people everywhere in the world.Similarly,this increasing rate attracts mobile botnet attacks which is a network of interconnected nodes operated through the command and control(C&C)method to expand malicious activities.At present,mobile botnet attacks launched the Distributed denial of services(DDoS)that causes to steal of sensitive data,remote access,and spam generation,etc.Consequently,various approaches are defined in the literature to detect mobile botnet attacks using static or dynamic analysis.In this paper,a novel hybrid model,the combination of static and dynamic methods that relies on machine learning to detect android botnet applications is proposed.Furthermore,results are evaluated using machine learning classifiers.The Random Forest(RF)classifier outperform as compared to other ML techniques i.e.,Naïve Bayes(NB),Support Vector Machine(SVM),and Simple Logistic(SL).Our proposed framework achieved 97.48%accuracy in the detection of botnet applications.Finally,some future research directions are highlighted regarding botnet attacks detection for the entire community.
基金supported by the National Natural Science Foundation of China under Grant Nos.523B2043 and 52475112.
文摘Machinery condition monitoring is beneficial to equipment maintenance and has been receiving much attention from academia and industry.Machine learning,especially deep learning,has become popular for machinery condition monitoring because that can fully use available data and computational power.Since significant accidents might be caused if wrong fault alarms are given for machine condition monitoring,interpretable machine learning models,integrate signal processing knowledge to enhance trustworthiness of models,are gradually becoming a research hotspot.A previous spectrum-based and interpretable optimized weights method has been proposed to indicate faulty and fundamental frequencies when the analyzed data only contains a healthy type and a fault type.Considering that multiclass fault types are naturally met in practice,this work aims to explore the interpretable optimized weights method for multiclass fault type scenarios.Therefore,a new multiclass optimized weights spectrum(OWS)is proposed and further studied theoretically and numerically.It is found that the multiclass OWS is capable of capturing the characteristic components associated with different conditions and clearly indicating specific fault characteristic frequencies(FCFs)corresponding to each fault condition.This work can provide new insights into spectrum-based fault classification models,and the new multiclass OWS also shows great potential for practical applications.
基金The National Natural Science Foundation of China(No.8123003481271739+2 种基金81501453)the Special Program of Medical Science of Jiangsu Province(No.BL2013029)the Natural Science Foundation of Jiangsu Province(No.BK20141342)
文摘In order to classify the minimal hepatic encephalopathy (MHE) patients from healthy controls, the independent component analysis (ICA) is used to generate the default mode network (DMN) from resting-state functional magnetic resonance imaging (fMRI). Then a Bayesian voxel- wised method, graphical-model-based multivariate analysis (GAMMA), is used to explore the associations between abnormal functional integration within DMN and clinical variable. Without any prior knowledge, five machine learning methods, namely, support vector machines (SVMs), classification and regression trees ( CART ), logistic regression, the Bayesian network, and C4.5, are applied to the classification. The functional integration patterns were alternative within DMN, which have the power to predict MHE with an accuracy of 98%. The GAMMA method generating functional integration patterns within DMN can become a simple, objective, and common imaging biomarker for detecting MIIE and can serve as a supplement to the existing diagnostic methods.
基金The authors would like to thank the Deanship of Scientific Research at Umm Al-Qura University for supporting this work by Grant Code:(22UQU4400257DSR01).
文摘Web-blogging sites such as Twitter and Facebook are heavily influenced by emotions,sentiments,and data in the modern era.Twitter,a widely used microblogging site where individuals share their thoughts in the form of tweets,has become a major source for sentiment analysis.In recent years,there has been a significant increase in demand for sentiment analysis to identify and classify opinions or expressions in text or tweets.Opinions or expressions of people about a particular topic,situation,person,or product can be identified from sentences and divided into three categories:positive for good,negative for bad,and neutral for mixed or confusing opinions.The process of analyzing changes in sentiment and the combination of these categories is known as“sentiment analysis.”In this study,sentiment analysis was performed on a dataset of 90,000 tweets using both deep learning and machine learning methods.The deep learning-based model long-short-term memory(LSTM)performed better than machine learning approaches.Long short-term memory achieved 87%accuracy,and the support vector machine(SVM)classifier achieved slightly worse results than LSTM at 86%.The study also tested binary classes of positive and negative,where LSTM and SVM both achieved 90%accuracy.
基金supported by the National Natural Science Foundation of China under Grant No.U1736216。
文摘Traffic encryption techniques facilitate cyberattackers to hide their presence and activities.Traffic classification is an important method to prevent network threats.However,due to the tremendous traffic volume and limitations of computing,most existing traffic classification techniques are inapplicable to the high-speed network environment.In this paper,we propose a High-speed Encrypted Traffic Classification(HETC)method containing two stages.First,to efficiently detect whether traffic is encrypted,HETC focuses on randomly sampled short flows and extracts aggregation entropies with chi-square test features to measure the different patterns of the byte composition and distribution between encrypted and unencrypted flows.Second,HETC introduces binary features upon the previous features and performs fine-grained traffic classification by combining these payload features with a Random Forest model.The experimental results show that HETC can achieve a 94%F-measure in detecting encrypted flows and a 85%–93%F-measure in classifying fine-grained flows for a 1-KB flow-length dataset,outperforming the state-of-the-art comparison methods.Meanwhile,HETC does not need to wait for the end of the flow and can extract mass computing features.The average time for HETC to process each flow is only 2 or 16 ms,which is lower than the flow duration in most cases,making it a good candidate for high-speed traffic classification.
基金the National Recovery and Resilience Plan (NRRP), Mission 4 Component 2 Investment 1.4-Call for tender No. 3138 of 16/12/2021 of Italian Ministry of University and Research funded by the European Union-Next Generation EU. Award Number: Project code CN00000023Concession Decree No. 1033 of 17/06/2022 adopted by the Italian Ministry of University and Research, CUP D93C22000400001, “Sustainable Mobility Center” (CNMS). Spoke 4-Rail Transportation
文摘Predictive maintenance is essential for the implementation of an innovative and efficient structural health monitoring strategy.Models capable of accurately interpreting new data automatically collected by suitably placed sensors to assess the state of the infrastructure represent a fundamental step,particularly for the railway sector,whose safe and continuous operation plays a strategic role in the well-being and development of nations.In this scenario,the benefits of a digital twin of a bonded insu-lated rail joint(IRJ)with the predictive capabilities of advanced classification algorithms based on artificial intelligence have been explored.The digital model provides an accurate mechanical response of the infrastructure as a pair of wheels passes over the joint.As bolt preload conditions vary,four structural health classes were identified for the joint.Two parameters,i.e.gap value and vertical displacement,which are strongly correlated with bolt preload,are used in different combinations to train and test five predictive classifiers.Their classification effectiveness was assessed using several performance indica-tors.Finally,we compared the IRJ condition predictions of two trained classifiers with the available data,confirming their high accuracy.The approach presented provides an interesting solution for future predictive tools in SHM especially in the case of complex systems such as railways where the vehicle-infrastructure interaction is complex and always time varying.