Machine Learning(ML)algorithms play a pivotal role in Speech Emotion Recognition(SER),although they encounter a formidable obstacle in accurately discerning a speaker’s emotional state.The examination of the emotiona...Machine Learning(ML)algorithms play a pivotal role in Speech Emotion Recognition(SER),although they encounter a formidable obstacle in accurately discerning a speaker’s emotional state.The examination of the emotional states of speakers holds significant importance in a range of real-time applications,including but not limited to virtual reality,human-robot interaction,emergency centers,and human behavior assessment.Accurately identifying emotions in the SER process relies on extracting relevant information from audio inputs.Previous studies on SER have predominantly utilized short-time characteristics such as Mel Frequency Cepstral Coefficients(MFCCs)due to their ability to capture the periodic nature of audio signals effectively.Although these traits may improve their ability to perceive and interpret emotional depictions appropriately,MFCCS has some limitations.So this study aims to tackle the aforementioned issue by systematically picking multiple audio cues,enhancing the classifier model’s efficacy in accurately discerning human emotions.The utilized dataset is taken from the EMO-DB database,preprocessing input speech is done using a 2D Convolution Neural Network(CNN)involves applying convolutional operations to spectrograms as they afford a visual representation of the way the audio signal frequency content changes over time.The next step is the spectrogram data normalization which is crucial for Neural Network(NN)training as it aids in faster convergence.Then the five auditory features MFCCs,Chroma,Mel-Spectrogram,Contrast,and Tonnetz are extracted from the spectrogram sequentially.The attitude of feature selection is to retain only dominant features by excluding the irrelevant ones.In this paper,the Sequential Forward Selection(SFS)and Sequential Backward Selection(SBS)techniques were employed for multiple audio cues features selection.Finally,the feature sets composed from the hybrid feature extraction methods are fed into the deep Bidirectional Long Short Term Memory(Bi-LSTM)network to discern emotions.Since the deep Bi-LSTM can hierarchically learn complex features and increases model capacity by achieving more robust temporal modeling,it is more effective than a shallow Bi-LSTM in capturing the intricate tones of emotional content existent in speech signals.The effectiveness and resilience of the proposed SER model were evaluated by experiments,comparing it to state-of-the-art SER techniques.The results indicated that the model achieved accuracy rates of 90.92%,93%,and 92%over the Ryerson Audio-Visual Database of Emotional Speech and Song(RAVDESS),Berlin Database of Emotional Speech(EMO-DB),and The Interactive Emotional Dyadic Motion Capture(IEMOCAP)datasets,respectively.These findings signify a prominent enhancement in the ability to emotional depictions identification in speech,showcasing the potential of the proposed model in advancing the SER field.展开更多
Depression is a common mental health issue that affects a large percentage of people all around the world.Usually,people who suffer from this mood disorder have issues such as low concentration,dementia,mood swings,an...Depression is a common mental health issue that affects a large percentage of people all around the world.Usually,people who suffer from this mood disorder have issues such as low concentration,dementia,mood swings,and even suicide.A social media platform like Twitter allows people to communicate as well as share photos and videos that reflect their moods.Therefore,the analysis of social media content provides insight into individual moods,including depression.Several studies have been conducted on depression detection in English and less in Arabic.The detection of depression from Arabic social media lags behind due the complexity of Arabic language and the lack of resources and techniques available.In this study,we performed a depression analysis on Arabic social media content to understand the feelings of the users.A bidirectional long short-term memory(Bi-LSTM)with an attention mechanism is presented to learn important hidden features for depression detection successfully.The proposed deep learning model combines an attention mechanism with a Bi-LSTM to simultaneously focus on discriminative features and learn significant word weights that contribute highly to depression detection.In order to evaluate our model,we collected a Twitter dataset of approximately 6000 tweets.The data labelling was done by manually classifying tweets as depressed or not depressed.Experimental results showed that the proposed model outperformed state-of-the-art machine learning models in detecting depression.The attention-based BiLSTM model achieved 0.83%accuracy on the depression detection task.展开更多
Fresh products have the characteristics of perishable, small batch and high frequency. Therefore, for fresh food e-commerce enterprises, market demand forecasting is particularly important. This paper takes the sales ...Fresh products have the characteristics of perishable, small batch and high frequency. Therefore, for fresh food e-commerce enterprises, market demand forecasting is particularly important. This paper takes the sales data of a fresh food e-commerce enterprise as the logistics demand, analyzes the influence of time and meteorological factors on the demand, extracts the characteristic factors with greater influence, and proposes a logistics demand forecast scheme of fresh food e-commerce based on the Bi-LSTM model. The scheme is compared with other schemes based on the BP neural network and LSTM neural network models. The experimental results show that the Bi-LSTM model has good prediction performance on the problem of logistics demand prediction. This facilitates further research on some supply chain issues, such as business decision-making, inventory control, and logistics capacity planning.展开更多
短期预测在智能电网建设中扮演着重要角色,深刻影响电网发输变配用各个环节的智能化改造。短期预测一般基于系统实测数据,而传感器故障,数据传输错误等原因会导致数据质量下降,严重影响短期预测的精确性。为建立数据质量受损情况下的精...短期预测在智能电网建设中扮演着重要角色,深刻影响电网发输变配用各个环节的智能化改造。短期预测一般基于系统实测数据,而传感器故障,数据传输错误等原因会导致数据质量下降,严重影响短期预测的精确性。为建立数据质量受损情况下的精确短期预测模型,提出了结合数据预处理和双向长短期记忆(bi-directional long short-term memory,Bi-LSTM)的短期预测框架Bi-LSTM-DP(bi-directional long short-term memory data preprocessing)。在Bi-LSTM-DP中,采集的数据首先通过均值填补缺失值,进而基于Savitzky-Golay滤波器对数据降噪,最后采用Bi-LSTM提取时间序列的信息,实现短期预测。为了评估所提方法的性能,文中使用实测的公开数据集分别预测风电发电量和负荷需求,与其他参考方法对比表明了所述方法的有效性和鲁棒性。展开更多
BACKGROUND Rebleeding after recovery from esophagogastric variceal bleeding(EGVB)is a severe complication that is associated with high rates of both incidence and mortality.Despite its clinical importance,recognized p...BACKGROUND Rebleeding after recovery from esophagogastric variceal bleeding(EGVB)is a severe complication that is associated with high rates of both incidence and mortality.Despite its clinical importance,recognized prognostic models that can effectively predict esophagogastric variceal rebleeding in patients with liver cirrhosis are lacking.AIM To construct and externally validate a reliable prognostic model for predicting the occurrence of esophagogastric variceal rebleeding.METHODS This study included 477 EGVB patients across 2 cohorts:The derivation cohort(n=322)and the validation cohort(n=155).The primary outcome was rebleeding events within 1 year.The least absolute shrinkage and selection operator was applied for predictor selection,and multivariate Cox regression analysis was used to construct the prognostic model.Internal validation was performed with bootstrap resampling.We assessed the discrimination,calibration and accuracy of the model,and performed patient risk stratification.RESULTS Six predictors,including albumin and aspartate aminotransferase concentrations,white blood cell count,and the presence of ascites,portal vein thrombosis,and bleeding signs,were selected for the rebleeding event prediction following endoscopic treatment(REPET)model.In predicting rebleeding within 1 year,the REPET model ex-hibited a concordance index of 0.775 and a Brier score of 0.143 in the derivation cohort,alongside 0.862 and 0.127 in the validation cohort.Furthermore,the REPET model revealed a significant difference in rebleeding rates(P<0.01)between low-risk patients and intermediate-to high-risk patients in both cohorts.CONCLUSION We constructed and validated a new prognostic model for variceal rebleeding with excellent predictive per-formance,which will improve the clinical management of rebleeding in EGVB patients.展开更多
This study was aimed to prepare landslide susceptibility maps for the Pithoragarh district in Uttarakhand,India,using advanced ensemble models that combined Radial Basis Function Networks(RBFN)with three ensemble lear...This study was aimed to prepare landslide susceptibility maps for the Pithoragarh district in Uttarakhand,India,using advanced ensemble models that combined Radial Basis Function Networks(RBFN)with three ensemble learning techniques:DAGGING(DG),MULTIBOOST(MB),and ADABOOST(AB).This combination resulted in three distinct ensemble models:DG-RBFN,MB-RBFN,and AB-RBFN.Additionally,a traditional weighted method,Information Value(IV),and a benchmark machine learning(ML)model,Multilayer Perceptron Neural Network(MLP),were employed for comparison and validation.The models were developed using ten landslide conditioning factors,which included slope,aspect,elevation,curvature,land cover,geomorphology,overburden depth,lithology,distance to rivers and distance to roads.These factors were instrumental in predicting the output variable,which was the probability of landslide occurrence.Statistical analysis of the models’performance indicated that the DG-RBFN model,with an Area Under ROC Curve(AUC)of 0.931,outperformed the other models.The AB-RBFN model achieved an AUC of 0.929,the MB-RBFN model had an AUC of 0.913,and the MLP model recorded an AUC of 0.926.These results suggest that the advanced ensemble ML model DG-RBFN was more accurate than traditional statistical model,single MLP model,and other ensemble models in preparing trustworthy landslide susceptibility maps,thereby enhancing land use planning and decision-making.展开更多
文摘Machine Learning(ML)algorithms play a pivotal role in Speech Emotion Recognition(SER),although they encounter a formidable obstacle in accurately discerning a speaker’s emotional state.The examination of the emotional states of speakers holds significant importance in a range of real-time applications,including but not limited to virtual reality,human-robot interaction,emergency centers,and human behavior assessment.Accurately identifying emotions in the SER process relies on extracting relevant information from audio inputs.Previous studies on SER have predominantly utilized short-time characteristics such as Mel Frequency Cepstral Coefficients(MFCCs)due to their ability to capture the periodic nature of audio signals effectively.Although these traits may improve their ability to perceive and interpret emotional depictions appropriately,MFCCS has some limitations.So this study aims to tackle the aforementioned issue by systematically picking multiple audio cues,enhancing the classifier model’s efficacy in accurately discerning human emotions.The utilized dataset is taken from the EMO-DB database,preprocessing input speech is done using a 2D Convolution Neural Network(CNN)involves applying convolutional operations to spectrograms as they afford a visual representation of the way the audio signal frequency content changes over time.The next step is the spectrogram data normalization which is crucial for Neural Network(NN)training as it aids in faster convergence.Then the five auditory features MFCCs,Chroma,Mel-Spectrogram,Contrast,and Tonnetz are extracted from the spectrogram sequentially.The attitude of feature selection is to retain only dominant features by excluding the irrelevant ones.In this paper,the Sequential Forward Selection(SFS)and Sequential Backward Selection(SBS)techniques were employed for multiple audio cues features selection.Finally,the feature sets composed from the hybrid feature extraction methods are fed into the deep Bidirectional Long Short Term Memory(Bi-LSTM)network to discern emotions.Since the deep Bi-LSTM can hierarchically learn complex features and increases model capacity by achieving more robust temporal modeling,it is more effective than a shallow Bi-LSTM in capturing the intricate tones of emotional content existent in speech signals.The effectiveness and resilience of the proposed SER model were evaluated by experiments,comparing it to state-of-the-art SER techniques.The results indicated that the model achieved accuracy rates of 90.92%,93%,and 92%over the Ryerson Audio-Visual Database of Emotional Speech and Song(RAVDESS),Berlin Database of Emotional Speech(EMO-DB),and The Interactive Emotional Dyadic Motion Capture(IEMOCAP)datasets,respectively.These findings signify a prominent enhancement in the ability to emotional depictions identification in speech,showcasing the potential of the proposed model in advancing the SER field.
文摘Depression is a common mental health issue that affects a large percentage of people all around the world.Usually,people who suffer from this mood disorder have issues such as low concentration,dementia,mood swings,and even suicide.A social media platform like Twitter allows people to communicate as well as share photos and videos that reflect their moods.Therefore,the analysis of social media content provides insight into individual moods,including depression.Several studies have been conducted on depression detection in English and less in Arabic.The detection of depression from Arabic social media lags behind due the complexity of Arabic language and the lack of resources and techniques available.In this study,we performed a depression analysis on Arabic social media content to understand the feelings of the users.A bidirectional long short-term memory(Bi-LSTM)with an attention mechanism is presented to learn important hidden features for depression detection successfully.The proposed deep learning model combines an attention mechanism with a Bi-LSTM to simultaneously focus on discriminative features and learn significant word weights that contribute highly to depression detection.In order to evaluate our model,we collected a Twitter dataset of approximately 6000 tweets.The data labelling was done by manually classifying tweets as depressed or not depressed.Experimental results showed that the proposed model outperformed state-of-the-art machine learning models in detecting depression.The attention-based BiLSTM model achieved 0.83%accuracy on the depression detection task.
文摘Fresh products have the characteristics of perishable, small batch and high frequency. Therefore, for fresh food e-commerce enterprises, market demand forecasting is particularly important. This paper takes the sales data of a fresh food e-commerce enterprise as the logistics demand, analyzes the influence of time and meteorological factors on the demand, extracts the characteristic factors with greater influence, and proposes a logistics demand forecast scheme of fresh food e-commerce based on the Bi-LSTM model. The scheme is compared with other schemes based on the BP neural network and LSTM neural network models. The experimental results show that the Bi-LSTM model has good prediction performance on the problem of logistics demand prediction. This facilitates further research on some supply chain issues, such as business decision-making, inventory control, and logistics capacity planning.
文摘短期预测在智能电网建设中扮演着重要角色,深刻影响电网发输变配用各个环节的智能化改造。短期预测一般基于系统实测数据,而传感器故障,数据传输错误等原因会导致数据质量下降,严重影响短期预测的精确性。为建立数据质量受损情况下的精确短期预测模型,提出了结合数据预处理和双向长短期记忆(bi-directional long short-term memory,Bi-LSTM)的短期预测框架Bi-LSTM-DP(bi-directional long short-term memory data preprocessing)。在Bi-LSTM-DP中,采集的数据首先通过均值填补缺失值,进而基于Savitzky-Golay滤波器对数据降噪,最后采用Bi-LSTM提取时间序列的信息,实现短期预测。为了评估所提方法的性能,文中使用实测的公开数据集分别预测风电发电量和负荷需求,与其他参考方法对比表明了所述方法的有效性和鲁棒性。
基金Supported by National Natural Science Foundation of China,No.81874390 and No.81573948Shanghai Natural Science Foundation,No.21ZR1464100+1 种基金Science and Technology Innovation Action Plan of Shanghai Science and Technology Commission,No.22S11901700the Shanghai Key Specialty of Traditional Chinese Clinical Medicine,No.shslczdzk01201.
文摘BACKGROUND Rebleeding after recovery from esophagogastric variceal bleeding(EGVB)is a severe complication that is associated with high rates of both incidence and mortality.Despite its clinical importance,recognized prognostic models that can effectively predict esophagogastric variceal rebleeding in patients with liver cirrhosis are lacking.AIM To construct and externally validate a reliable prognostic model for predicting the occurrence of esophagogastric variceal rebleeding.METHODS This study included 477 EGVB patients across 2 cohorts:The derivation cohort(n=322)and the validation cohort(n=155).The primary outcome was rebleeding events within 1 year.The least absolute shrinkage and selection operator was applied for predictor selection,and multivariate Cox regression analysis was used to construct the prognostic model.Internal validation was performed with bootstrap resampling.We assessed the discrimination,calibration and accuracy of the model,and performed patient risk stratification.RESULTS Six predictors,including albumin and aspartate aminotransferase concentrations,white blood cell count,and the presence of ascites,portal vein thrombosis,and bleeding signs,were selected for the rebleeding event prediction following endoscopic treatment(REPET)model.In predicting rebleeding within 1 year,the REPET model ex-hibited a concordance index of 0.775 and a Brier score of 0.143 in the derivation cohort,alongside 0.862 and 0.127 in the validation cohort.Furthermore,the REPET model revealed a significant difference in rebleeding rates(P<0.01)between low-risk patients and intermediate-to high-risk patients in both cohorts.CONCLUSION We constructed and validated a new prognostic model for variceal rebleeding with excellent predictive per-formance,which will improve the clinical management of rebleeding in EGVB patients.
基金the University of Transport Technology under the project entitled“Application of Machine Learning Algorithms in Landslide Susceptibility Mapping in Mountainous Areas”with grant number DTTD2022-16.
文摘This study was aimed to prepare landslide susceptibility maps for the Pithoragarh district in Uttarakhand,India,using advanced ensemble models that combined Radial Basis Function Networks(RBFN)with three ensemble learning techniques:DAGGING(DG),MULTIBOOST(MB),and ADABOOST(AB).This combination resulted in three distinct ensemble models:DG-RBFN,MB-RBFN,and AB-RBFN.Additionally,a traditional weighted method,Information Value(IV),and a benchmark machine learning(ML)model,Multilayer Perceptron Neural Network(MLP),were employed for comparison and validation.The models were developed using ten landslide conditioning factors,which included slope,aspect,elevation,curvature,land cover,geomorphology,overburden depth,lithology,distance to rivers and distance to roads.These factors were instrumental in predicting the output variable,which was the probability of landslide occurrence.Statistical analysis of the models’performance indicated that the DG-RBFN model,with an Area Under ROC Curve(AUC)of 0.931,outperformed the other models.The AB-RBFN model achieved an AUC of 0.929,the MB-RBFN model had an AUC of 0.913,and the MLP model recorded an AUC of 0.926.These results suggest that the advanced ensemble ML model DG-RBFN was more accurate than traditional statistical model,single MLP model,and other ensemble models in preparing trustworthy landslide susceptibility maps,thereby enhancing land use planning and decision-making.