To address the low survival rates and limited data collection efficiency of current virtual probe deployments, which result from anomaly detection mechanisms in location-based service (LBS) applications, this paper proposes a novel virtual probe deployment method based on user behavioral feature analysis. The core idea is to circumvent LBS anomaly detection by mimicking real-user behavior patterns. First, we design an automated data extraction algorithm that recognizes graphical user interface (GUI) elements to collect spatio-temporal behavior data. Then, by analyzing the automatically collected user data, we identify normal users' spatio-temporal patterns and extract features such as high-activity time windows and spatial clustering characteristics. Subsequently, an anti-detection scheduling strategy is developed that integrates spatial clustering optimization, load-balanced allocation, and time window control to generate probe scheduling schemes. Additionally, a self-correction mechanism based on an exponential backoff strategy is implemented to rectify anomalous behaviors and maintain system stability. Experiments in real-world environments demonstrate that the proposed method significantly outperforms baseline methods in both probe ban rate and task completion rate, while maintaining high time efficiency. This study provides a more reliable and clandestine solution for geosocial data collection and lays the foundation for building more robust virtual probe systems.
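The exponential backoff idea mentioned in the abstract above can be illustrated with a minimal sketch. The abstract does not give the paper's parameters, so the function name `backoff_delays` and the values of `base`, `factor`, `cap`, and `jitter` are assumptions chosen for illustration only.

```python
import random


def backoff_delays(base=1.0, factor=2.0, cap=60.0, attempts=5, jitter=0.0, rng=None):
    """Return the wait (in seconds) before each retry: base * factor**i, capped at `cap`.

    All parameter values are hypothetical; the abstract only names the strategy.
    """
    rng = rng or random.Random(0)
    delays = []
    for i in range(attempts):
        delay = min(cap, base * factor ** i)
        # Optional jitter adds up to `jitter * delay` seconds so retries are less periodic.
        delay += rng.uniform(0, jitter * delay)
        delays.append(delay)
    return delays


print(backoff_delays(attempts=5))
```

With the default `jitter=0.0` the waits simply double after each anomalous event until they reach the cap, which is the usual reason for choosing this strategy: repeated failures slow the probe down quickly without stopping it permanently.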
Joint Multimodal Aspect-based Sentiment Analysis (JMASA) is a significant task in multimodal fine-grained sentiment analysis research. It combines two subtasks: Multimodal Aspect Term Extraction (MATE) and Multimodal Aspect-oriented Sentiment Classification (MASC). Most existing JMASA models encode text and image features only at a basic level and often neglect in-depth analysis of unimodal intrinsic features. The resulting insufficient learning of intra-modal features may lead to low accuracy in aspect term extraction and poor sentiment prediction. To address this problem, we propose a Text-Image Feature Fine-grained Learning (TIFFL) model for JMASA. First, we construct an enhanced adjacency matrix of word dependencies and adopt a graph convolutional network to learn the syntactic structure features of text, which addresses the context interference problem in identifying different aspect terms. Then, adjective-noun pairs extracted from images are introduced to make the semantic representation of visual features more intuitive, which addresses the ambiguous semantic extraction problem in image feature learning. The model's performance in aspect term extraction and sentiment polarity prediction is thereby further improved. Experiments on two Twitter benchmark datasets demonstrate that TIFFL achieves competitive results for JMASA, MATE, and MASC, validating the effectiveness of the proposed methods.
BACKGROUND Pancreatic cancer remains one of the most lethal malignancies worldwide, with a poor prognosis often attributed to late diagnosis. Understanding the correlation between pathological type and imaging features is crucial for early detection and appropriate treatment planning. AIM To retrospectively analyze the relationship between different pathological types of pancreatic cancer and their corresponding imaging features. METHODS We retrospectively analyzed the data of 500 patients diagnosed with pancreatic cancer between January 2010 and December 2020 at our institution. Pathological types were determined by histopathological examination of surgical specimens or biopsy samples. Imaging features were assessed using computed tomography, magnetic resonance imaging, and endoscopic ultrasound. Statistical analyses were performed to identify significant associations between pathological types and specific imaging characteristics. RESULTS There were 320 (64%) cases of pancreatic ductal adenocarcinoma, 75 (15%) of intraductal papillary mucinous neoplasms, 50 (10%) of neuroendocrine tumors, and 55 (11%) of other rare types. Distinct imaging features were identified for each pathological type. Pancreatic ductal adenocarcinoma typically presents as a hypodense mass with poorly defined borders on computed tomography, whereas intraductal papillary mucinous neoplasms present as characteristic cystic lesions with mural nodules. Neuroendocrine tumors often appear as hypervascular lesions on contrast-enhanced imaging. Statistical analysis revealed significant correlations between specific imaging features and pathological types (P < 0.001). CONCLUSION This study demonstrated a strong association between the pathological types of pancreatic cancer and imaging features. These findings can enhance the accuracy of noninvasive diagnosis and guide personalized treatment approaches.
Ransomware is malware that encrypts data without permission and demands payment for access. Detecting ransomware on Android platforms is challenging due to evolving malicious techniques and diverse application behaviors. Traditional methods, such as static and dynamic analysis, suffer from polymorphism, code obfuscation, and high resource demands. This paper introduces a multi-stage approach to enhance behavioral analysis for Android ransomware detection, focusing on a reduced set of distinguishing features. The approach comprises ransomware app collection, behavioral profile generation, dataset creation, feature identification, feature reduction, and classification. Experiments were conducted on approximately 3300 Android ransomware samples, despite the challenges posed by their evolving nature and complexity. The feature reduction strategy reduced the feature count by 80% with only a marginal loss of detection accuracy (0.59%). Several machine learning algorithms were employed for classification, achieving 96.71% detection accuracy. Additionally, 10-fold cross-validation demonstrated robustness, yielding an AUC-ROC of 99.3%. Importantly, latency and memory evaluations revealed that models using the reduced feature set achieved up to a 99% reduction in inference time and significant memory savings across classifiers. The proposed approach outperforms existing techniques by achieving high detection accuracy with a minimal feature set, which also makes it suitable for deployment in resource-constrained environments. Future work may extend the datasets and include iOS-based ransomware applications.
Urban air quality degradation from rising CO₂ is acute in rapidly developing tropical cities such as Makassar, Indonesia. We deploy a drone-based Internet of Things (IoT) platform for real-time CO₂ monitoring, integrating low-cost sensors (NDIR, MQ135, MG811) on a DJI Phantom 4 with cloud streaming to Firebase. Measurements were collected at five sites, namely Jl. AP. Pettarani, Jl. Ahmad Yani, Jl. Sultan Hasanuddin, Jl. Nusantara, and KIMA, at 08:00, 12:00, and 16:00 in September 2024, with vertical profiling from 1 to 20 m and three repeat flights per site and time. Descriptive statistics and one-way ANOVA with Tukey HSD assessed spatio-temporal differences; Pearson correlation quantified cross-sensor agreement. Results show marked spatial and diurnal variability: Jl. AP. Pettarani exhibits the highest mean concentration (442.5 ppm), likely due to flyover-induced trapping, whereas Jl. Ahmad Yani records the lowest (390.0 ppm). Vertical profiles reveal mid-altitude peaks in street-canyon and industrial settings and dilution with height in greener areas, indicating ventilation contrasts. Preprocessing removed outliers and applied temperature-humidity corrections to the low-cost sensors. Differences across locations and times are statistically significant (p < 0.05), and cross-sensor correlations are strong (r ≈ 0.88-0.96) after correction. Compared with fixed ground stations, the system provides fine-scale three-dimensional coverage and real-time visualization useful for field decisions. Limitations include payload-constrained endurance and intermittent data loss in obstructed areas. The findings support targeted interventions, such as improving canyon ventilation around flyovers and expanding urban greenery, that are relevant to Makassar and similar tropical cities.
Bocaparvovirus, a member of the genus Bocaparvovirus within the subfamily Parvovirinae and the family Parvoviridae, is a small, non-enveloped, single-stranded DNA virus. This pathogen poses health risks to both humans and animals. The Bocaparvovirus genome.
The authors regret that the original publication of this paper did not include Jawad Fayaz as a co-author. After further discussions and a thorough review of the research contributions, it was agreed that his significant contributions to the foundational aspects of the research warranted recognition, and he has now been added as a co-author.
Kernel-based slow feature analysis (SFA) methods have been successfully applied to industrial process fault detection. However, kernel-based SFA methods have high computational complexity when dealing with nonlinearity, leading to delays in detecting time-varying data features. Additionally, the uncertain kernel function and kernel parameters limit the ability of the extracted features to express process characteristics, resulting in poor fault detection performance. To alleviate these problems, a novel randomized auto-regressive dynamic slow feature analysis (RRDSFA) method is proposed to simultaneously monitor operating point deviations and process dynamic faults, enabling real-time monitoring of data features in industrial processes. First, the proposed random Fourier mapping-based method achieves a more effective nonlinear transformation, in contrast to the current kernel-based RDSFA algorithm, which may incur significant computational complexity. Second, a randomized RDSFA model is developed to extract nonlinear dynamic slow features. Furthermore, a Bayesian inference-based overall fault monitoring model that includes all RRDSFA sub-models is developed to overcome the randomness of random Fourier mapping. Finally, the superiority and effectiveness of the proposed monitoring method are demonstrated through a numerical case and a simulation of a continuous stirred tank reactor.
Data-driven process monitoring is an effective approach to assure the safe operation of modern manufacturing and energy systems, such as the thermal power plants studied in this work. Industrial processes are inherently dynamic and need to be monitored using dynamic algorithms. Mainstream dynamic algorithms rely on concatenating the current measurement with past data. This work proposes a new, alternative dynamic process monitoring algorithm using dot product feature analysis (DPFA). DPFA computes the dot product of consecutive samples, thus naturally capturing the process dynamics through temporal correlation. At the same time, DPFA's online computational complexity is lower than that of not just existing dynamic algorithms but also classical static algorithms (e.g., principal component analysis and slow feature analysis). The detectability of the new algorithm is analyzed for three types of faults typically seen in process systems: sensor bias, process fault, and gain change fault. Through experiments with a numerical example and real data from a thermal power plant, the DPFA algorithm is shown to be superior to state-of-the-art methods in terms of both monitoring performance (fault detection rate and false alarm rate) and computational complexity.
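The central operation named in the abstract above, the dot product of consecutive samples, can be sketched as follows. This is only the feature computation under simple assumptions (each sample is a list of floats); it is not the paper's full monitoring scheme, and the function name `consecutive_dot_products` and the toy data are hypothetical.

```python
def consecutive_dot_products(samples):
    """Return [x_t . x_(t-1)] for t = 1..n-1, where each sample is a list of floats."""
    return [
        sum(a * b for a, b in zip(prev, curr))
        for prev, curr in zip(samples, samples[1:])
    ]


# Hypothetical two-variable measurements under normal operation.
normal = [[1.0, 0.0], [0.9, 0.1], [1.0, 0.1], [0.9, 0.0]]
features = consecutive_dot_products(normal)
print(features)
```

Each feature costs one pass over the variables of two neighbouring samples, which gives an intuition for why the online cost can be low: no lagged data matrix has to be concatenated.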
Objective To determine the correlation between the traditional Chinese medicine (TCM) inspection of spirit classification and the severity grade of depression based on facial features, offering insights for intelligent integrated TCM and western medicine diagnosis of depression. Methods Using the Audio-Visual Emotion Challenge and Workshop (AVEC 2014) public dataset on depression, which includes 150 interview videos, the samples were classified according to the TCM inspection of spirit classification: Deshen (得神, presence of spirit), Shaoshen (少神, insufficiency of spirit), and Shenluan (神乱, confusion of spirit). Meanwhile, based on the Beck Depression Inventory-II (BDI-II) score for the severity grade of depression, the samples were divided into minimal (0-13, Q1), mild (14-19, Q2), moderate (20-28, Q3), and severe (29-63, Q4). Sixty-eight landmarks were extracted with a ResNet-50 network, and the feature extraction mode was standardized. Random forest and support vector machine (SVM) classifiers were used to predict the TCM inspection of spirit classification and the severity grade of depression, respectively. A Chi-square test and Apriori association rule mining were then applied to quantify and explore the relationships. Results The analysis revealed a statistically significant and moderately strong association between TCM spirit classification and the severity grade of depression, as confirmed by a Chi-square test (χ² = 14.04, P = 0.029) with a Cramer's V effect size of 0.243. Further exploration using association rule mining identified the most compelling rule: "moderate depression (Q3) → Shenluan". This rule had a support of 5%, indicating that this specific co-occurrence was present in 5% of the cohort. Crucially, it achieved a high confidence of 86%, meaning that among patients diagnosed with Q3, 86% exhibited the Shenluan pattern according to TCM assessment. The lift of 2.37 signifies that the observed likelihood of Shenluan manifesting in Q3 patients is 2.37 times higher than would be expected by chance if these states were independent, which is compelling evidence of a highly non-random association. Consequently, Shenluan emerges as a distinct and core TCM diagnostic manifestation strongly linked to Q3, forming a clinically significant phenotype within this patient subgroup.
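The three association rule measures quoted in the abstract above follow the standard definitions, sketched here for a rule A → B. The counts in the example are hypothetical round numbers chosen for easy arithmetic; they are not the study's data, and `rule_metrics` is an illustrative name.

```python
def rule_metrics(n_total, n_antecedent, n_consequent, n_both):
    """Support, confidence, and lift of the rule A -> B from raw counts.

    n_antecedent: samples with A; n_consequent: samples with B; n_both: samples with A and B.
    """
    support = n_both / n_total                    # P(A and B)
    confidence = n_both / n_antecedent            # P(B | A)
    lift = confidence / (n_consequent / n_total)  # P(B | A) / P(B)
    return support, confidence, lift


# Hypothetical counts, not taken from the study.
support, confidence, lift = rule_metrics(
    n_total=100, n_antecedent=10, n_consequent=40, n_both=8
)
print(support, confidence, lift)
```

Because lift is confidence divided by the overall rate of the consequent, the reported confidence (0.86) and lift (2.37), taken at face value, would imply an overall Shenluan rate of roughly 0.86 / 2.37 ≈ 0.36 in the cohort.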
To address the problem of on-line damage diagnosis in structural health monitoring (SHM), an algorithm for feature extraction and damage alarming based on auto-regressive moving-average (ARMA) time series analysis is presented. The monitoring data are first modeled as ARMA models, and a principal component matrix derived from the AR coefficients of these models is used to establish the Mahalanobis distance criterion functions. Then, a new damage-sensitive feature index, DDSF, is proposed. A hypothesis test based on the t-test is further applied to reach a damage alarming decision, since the mean value of DDSF changes significantly after damage. The numerical results for a three-span girder model show that the defined index is sensitive to subtle structural damage and that the proposed algorithm can be applied to on-line damage alarming in SHM.
This paper introduces the development stages and role of site analysis in landscape design. Taking Canal Park in Yichang City as an example, it discusses landscape design concepts and summarizes the design concepts of this park as: continuing the history and culture of the site; forming a rare urban wetland landscape; respecting the surrounding environment and integrating leisure and recreation with an ideal landscape layout; displaying regional customs through core scenic spots; and manifesting regional features through planting design. From an analysis of the features of Canal Park, four approaches to constructing urban park features are identified: guiding the design with new concepts centered on urban development requirements; reflecting regional customs; exploiting local historical resources; and fully expressing the character of the park. The paper argues that urban park feature construction should be based on site analysis. Overt natural landscape features and covert cultural and historical resources should be explored and refined, then decomposed, processed, and integrated into concrete concepts. Finally, the individual features of the site, and even of the city, can be embodied in concrete landscape elements.
This paper explores how Chinese college students' life is represented in graffiti collected on campus. The article analyzes and compares the topics of graffiti from different settings and the linguistic features they manifest. The findings show that fewer graffiti from the female toilets and classrooms of this university address political issues than graffiti abroad. Graffiti in the female toilets mainly focus on the theme of love and are more interactive in discourse, whereas graffiti on desks tend to cover mixed themes and be less interactive. There are more graphic graffiti and exam answers on undergraduate students' desks than on postgraduates'. Graffiti show linguistic features such as thematization, repetition, and salience.
Traffic flow prediction is a fundamental component of Intelligent Transportation Systems (ITS), playing a pivotal role in mitigating congestion, enhancing route optimization, and improving the utilization efficiency of roadway infrastructure. However, existing methods struggle in complex traffic scenarios due to static spatio-temporal embedding, restricted multi-scale temporal modeling, and weak representation of local spatial interactions. This study proposes Bi-STAT+, an enhanced bidirectional spatio-temporal attention framework that addresses these limitations through three principal contributions: (1) an adaptive spatio-temporal embedding module that dynamically adjusts embeddings to capture complex traffic variations; (2) frequency-domain analysis in the temporal dimension for simultaneous extraction of high-frequency details and low-frequency trends; and (3) an agent attention mechanism in the spatial dimension that enhances local feature extraction through dynamic weight allocation. Extensive experiments were performed on four distinct datasets, including two public benchmark datasets (PEMS04 and PEMS08) and two private datasets collected from Baotou and Chengdu, China. The results demonstrate that Bi-STAT+ consistently outperforms existing methods in terms of MAE, RMSE, and MAPE, while maintaining strong robustness against missing data and noise. Furthermore, the results show that prediction accuracy improves significantly with higher sampling rates, providing crucial insights for optimizing real-world deployment scenarios.
Unconfined Compressive Strength (UCS) is a key parameter for assessing the stability and performance of stabilized soils, yet traditional laboratory testing is both time and resource intensive. This study presents an interpretable machine learning approach to UCS prediction, pairing five models (Random Forest (RF), Gradient Boosting (GB), Extreme Gradient Boosting (XGB), CatBoost, and K-Nearest Neighbors (KNN)) with SHapley Additive exPlanations (SHAP) for enhanced interpretability and to guide feature removal. A complete dataset of 12 geotechnical and chemical parameters, including Atterberg limits, compaction properties, stabilizer chemistry, dosage, and curing time, was used to train and test the models. R², RMSE, MSE, and MAE were used to assess performance. Initial results with all 12 features indicated that the boosting-based models (GB, XGB, CatBoost) exhibited the highest predictive accuracy (R² = 0.93) with satisfactory generalization on test data, followed by RF and KNN. SHAP analysis consistently ranked CaO content, curing time, stabilizer dosage, and compaction parameters as the most important features, in line with established soil stabilization mechanisms. The models were then re-trained on the top 8 and top 5 SHAP-ranked features. Interestingly, GB, XGB, and CatBoost maintained comparable accuracy with the reduced input sets, while RF was moderately sensitive and KNN performed somewhat better owing to the reduced dimensionality. The findings confirm that SHAP-guided feature reduction enables cost-effective UCS prediction by reducing laboratory test requirements without significant accuracy loss. The suggested hybrid approach offers an explainable, interpretable, and cost-effective tool for geotechnical engineering practice.
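The four performance measures named in the abstract above have standard definitions, shown here as a small self-contained sketch. The sample values are made up for illustration and are unrelated to the study's UCS data; `regression_metrics` is an illustrative name.

```python
import math


def regression_metrics(y_true, y_pred):
    """Return R2, RMSE, MSE, and MAE for paired observed and predicted values."""
    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]
    mse = sum(e * e for e in errors) / n
    mae = sum(abs(e) for e in errors) / n
    mean_true = sum(y_true) / n
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)  # total sum of squares
    r2 = 1.0 - (mse * n) / ss_tot                       # 1 - SS_res / SS_tot
    return {"R2": r2, "RMSE": math.sqrt(mse), "MSE": mse, "MAE": mae}


# Made-up values for illustration only.
metrics = regression_metrics([1.0, 2.0, 3.0, 4.0], [1.0, 2.0, 3.0, 5.0])
print(metrics)
```

RMSE and MAE are in the units of the target, whereas R² is unitless, which is why studies of this kind usually report them together.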
Hard disk drives (HDDs) serve as the primary storage devices in modern data centers. A failure often leads to severe data loss, significantly degrading the reliability of storage systems. Numerous studies have proposed machine learning-based HDD failure prediction models. However, the Self-Monitoring, Analysis, and Reporting Technology (SMART) attributes differ across HDD manufacturers. We define hard drives of the same brand and model as homogeneous HDD groups, and those from different brands or models as heterogeneous HDD groups. In practical engineering scenarios, a data center is often composed of a heterogeneous population of HDDs spanning multiple vendors and models. Existing research predominantly focuses on homogeneous datasets, ignoring the model's generalization capability across heterogeneous HDDs. As a result, HDD models with limited samples often suffer from poor training effectiveness and prediction performance. To address this issue, we investigate generalizable SMART predictors across heterogeneous HDD groups. By extracting time-series features within a fixed sliding time window, we propose a Heterogeneous Disk Failure Prediction Method based on Time Series Features (HDFPM) framework. This method is adaptable to HDD models with limited sample sizes, thereby enhancing its applicability and robustness across diverse drive populations. Experimental results show that the proposed model achieves an F1-score of 0.9518 when applied to two different Seagate HDD models, while keeping the False Positive Rate (FPR) below 1%. After incorporating the Complexity-Ratio Dynamic Time Warping (CDTW) based feature enhancement method, the best prediction model achieves a True Positive Rate (TPR) of up to 0.93 between the two models. For next-day failure prediction across various Seagate models, the model achieves an F1-score of up to 0.8792. Moreover, the experimental results show that, within the same brand, the higher the proportion of shared SMART attributes across different models, the better the prediction performance. In addition, HDFPM demonstrates the best stability and the most significant performance in heterogeneous environments.
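Extracting time-series features within a fixed sliding window, as described in the abstract above, can be sketched in a few lines. The abstract does not list the features used, so the four statistics below (mean, minimum, maximum, and net change) and the name `window_features` are assumptions for illustration, as is the example attribute series.

```python
def window_features(series, window):
    """For each full window, return (mean, min, max, last-minus-first)."""
    feats = []
    for end in range(window, len(series) + 1):
        w = series[end - window:end]
        feats.append((sum(w) / window, min(w), max(w), w[-1] - w[0]))
    return feats


# Hypothetical daily readings of one SMART attribute (e.g. a reallocated-sector count).
daily_values = [0, 0, 1, 3, 6]
feats = window_features(daily_values, window=3)
print(feats)
```

Summarizing each window with statistics rather than raw values is one way to compare drives whose attribute sets only partly overlap, since only the shared attributes need to be summarized.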
Effective extraction of the vibration signal is an important precondition for machine fault diagnosis. Based on the characteristics of the noise mixed into the vibration signal during sampling, the independent component analysis (ICA) method is combined with wavelets for de-noising. First, the sampled signal is separated with ICA; then the frequency band selection capability of the multi-resolution wavelet transform is used to judge whether a stochastic disturbance singular signal is mixed in. In this way, the vibration signals can be extracted effectively, which provides favorable conditions for subsequent feature detection of the vibration signal and fault diagnosis.
The strength of cement-based materials, such as mortar, concrete, and cement paste backfill (CPB), depends on their microstructure (e.g., pore structure and the arrangement of particles and skeleton). Numerous studies have examined the relationship between strength and pore structure (e.g., pore size and its distribution), but micro-morphology characteristics have rarely been considered. Texture, which describes the surface properties of a sample, is a global feature and an effective way to quantify micro-morphological properties. In statistical analysis, GLCM features and Tamura texture are the most representative methods for characterizing texture features. The mechanical strength and section images of backfill samples prepared from pastes of three different solid concentrations were obtained by uniaxial compressive strength testing and scanning electron microscopy, respectively. The texture features of the different SEM images were calculated using image analysis, and the correlation between these parameters and the strength was then analyzed. The method proved effective for the quantitative analysis of the micro-morphology characteristics of CPB. There is a significant correlation between the texture features and the unconfined compressive strength, and predicting strength from texture parameters of the CPB microstructure is feasible.
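As an illustration of the kind of texture feature mentioned in the abstract above, the following sketch computes the contrast of a gray-level co-occurrence matrix (GLCM). It assumes a single horizontal offset (each pixel paired with its right-hand neighbour) and an image already quantized to a small number of gray levels; the paper's actual offsets, quantization, and feature set are not given in the abstract, and `glcm_contrast` is an illustrative name.

```python
def glcm_contrast(image, levels):
    """Contrast of the horizontal (offset (0, 1)) gray-level co-occurrence matrix.

    `image` is a list of rows of integer gray levels in range(levels);
    every row must be at least two pixels wide.
    """
    counts = [[0] * levels for _ in range(levels)]
    for row in image:
        for a, b in zip(row, row[1:]):  # pixel and its right-hand neighbour
            counts[a][b] += 1
    total = sum(map(sum, counts))
    # Contrast = sum over (i, j) of (i - j)^2 * p(i, j), with p the normalized counts.
    return sum(
        (i - j) ** 2 * counts[i][j] / total
        for i in range(levels)
        for j in range(levels)
    )


# Tiny hypothetical 3-level image.
contrast = glcm_contrast([[0, 0, 1], [1, 2, 2]], levels=3)
print(contrast)
```

Contrast is zero for a perfectly uniform image and grows as neighbouring pixels differ more, so it gives one number summarizing how rough a micrograph looks.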
In the spectral analysis of laser-induced breakdown spectroscopy, abundant characteristic spectral lines and severe interference coexist in the original spectral data. Here, a feature selection method for the original spectral data, recursive feature elimination based on ridge regression (Ridge-RFE), is proposed to make full use of the valid information in the spectra. In the Ridge-RFE method, the absolute value of the ridge regression coefficient is used as the criterion for screening spectral features: the feature with the smallest absolute weight in the input feature subset is removed by recursive feature elimination (RFE), and the selected features are used as inputs to a partial least squares regression (PLS) model. The Ridge-RFE based PLS model was used to measure Fe, Si, Mg, Cu, Zn, and Mn in 51 aluminum alloy samples, and the results showed that the root mean square error of prediction decreased greatly compared with the PLS model using the full spectrum as input. The overall results demonstrate that the Ridge-RFE method removes redundant features more efficiently, yields better quantitative analysis results from the PLS model, and improves model generalization ability.
Funding: supported by the National Natural Science Foundation of China (No. U23A20305), the National Key Research and Development Program of China (No. 2022YFB3102900), the Innovation Scientists and Technicians Troop Construction Projects of Henan Province, China (No. 254000510007), and the Key Research and Development Project of Henan Province (No. 221111321200).
Funding: supported by the Science and Technology Project of Henan Province (No. 222102210081).
文摘Joint Multimodal Aspect-based Sentiment Analysis(JMASA)is a significant task in the research of multimodal fine-grained sentiment analysis,which combines two subtasks:Multimodal Aspect Term Extraction(MATE)and Multimodal Aspect-oriented Sentiment Classification(MASC).Currently,most existing models for JMASA only perform text and image feature encoding from a basic level,but often neglect the in-depth analysis of unimodal intrinsic features,which may lead to the low accuracy of aspect term extraction and the poor ability of sentiment prediction due to the insufficient learning of intra-modal features.Given this problem,we propose a Text-Image Feature Fine-grained Learning(TIFFL)model for JMASA.First,we construct an enhanced adjacency matrix of word dependencies and adopt graph convolutional network to learn the syntactic structure features for text,which addresses the context interference problem of identifying different aspect terms.Then,the adjective-noun pairs extracted from image are introduced to enable the semantic representation of visual features more intuitive,which addresses the ambiguous semantic extraction problem during image feature learning.Thereby,the model performance of aspect term extraction and sentiment polarity prediction can be further optimized and enhanced.Experiments on two Twitter benchmark datasets demonstrate that TIFFL achieves competitive results for JMASA,MATE and MASC,thus validating the effectiveness of our proposed methods.
Abstract: BACKGROUND Pancreatic cancer remains one of the most lethal malignancies worldwide, with a poor prognosis often attributed to late diagnosis. Understanding the correlation between pathological type and imaging features is crucial for early detection and appropriate treatment planning. AIM To retrospectively analyze the relationship between different pathological types of pancreatic cancer and their corresponding imaging features. METHODS We retrospectively analyzed the data of 500 patients diagnosed with pancreatic cancer between January 2010 and December 2020 at our institution. Pathological types were determined by histopathological examination of the surgical specimens or biopsy samples. The imaging features were assessed using computed tomography, magnetic resonance imaging, and endoscopic ultrasound. Statistical analyses were performed to identify significant associations between pathological types and specific imaging characteristics. RESULTS There were 320 (64%) cases of pancreatic ductal adenocarcinoma, 75 (15%) of intraductal papillary mucinous neoplasms, 50 (10%) of neuroendocrine tumors, and 55 (11%) of other rare types. Distinct imaging features were identified in each pathological type. Pancreatic ductal adenocarcinoma typically presents as a hypodense mass with poorly defined borders on computed tomography, whereas intraductal papillary mucinous neoplasms present as characteristic cystic lesions with mural nodules. Neuroendocrine tumors often appear as hypervascular lesions in contrast-enhanced imaging. Statistical analysis revealed significant correlations between specific imaging features and pathological types (P<0.001). CONCLUSION This study demonstrated a strong association between the pathological types of pancreatic cancer and imaging features. These findings can enhance the accuracy of noninvasive diagnosis and guide personalized treatment approaches.
Funding: Supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (2021R1I1A3049788).
Abstract: Ransomware is malware that encrypts data without permission and demands payment for access. Detecting ransomware on Android platforms is challenging due to evolving malicious techniques and diverse application behaviors. Traditional methods, such as static and dynamic analysis, suffer from polymorphism, code obfuscation, and high resource demands. This paper introduces a multi-stage approach to enhance behavioral analysis for Android ransomware detection, focusing on a reduced set of distinguishing features. The approach includes ransomware app collection, behavioral profile generation, dataset creation, feature identification, reduction, and classification. Experiments were conducted on approximately 3300 Android-based ransomware samples, despite the challenges posed by their evolving nature and complexity. The feature reduction strategy reduced the number of features by 80%, with only a marginal loss of detection accuracy (0.59%). Different machine learning algorithms were employed for classification and achieved 96.71% detection accuracy. Additionally, 10-fold cross-validation demonstrated robustness, yielding an AUC-ROC of 99.3%. Importantly, latency and memory evaluations revealed that models using the reduced feature set achieved up to a 99% reduction in inference time and significant memory savings across classifiers. The proposed approach outperforms existing techniques by achieving high detection accuracy with a minimal feature set, and is also suitable for deployment in resource-constrained environments. Future work may extend the datasets and include iOS-based ransomware applications.
Funding: Supported by the Directorate of Research, Technology, and Community Service (DRTPM), Ministry of Education, Culture, Research, and Technology, grant number 2817/UN36.11/LP2M/2024.
Abstract: Urban air quality degradation from rising CO₂ is acute in rapidly developing tropical cities such as Makassar, Indonesia. We deploy a drone-based Internet of Things (IoT) platform for real-time CO₂ monitoring, integrating low-cost sensors (NDIR, MQ135, MG811) on a DJI Phantom 4 with cloud streaming to Firebase. Measurements were collected at five sites, namely Jl. AP. Pettarani, Jl. Ahmad Yani, Jl. Sultan Hasanuddin, Jl. Nusantara, and KIMA, at 08:00, 12:00, and 16:00 in September 2024, while vertically profiling 1-20 m with three repeat flights per site and time. Descriptive statistics and one-way ANOVA with Tukey HSD assessed spatio-temporal differences; Pearson correlation quantified cross-sensor agreement. Results show marked spatial and diurnal variability: Jl. AP. Pettarani exhibits the highest mean concentration (442.5 ppm), likely due to flyover-induced trapping, whereas Jl. Ahmad Yani records the lowest (390.0 ppm). Vertical profiles reveal mid-altitude peaks in street-canyon and industrial settings, and dilution with height in greener areas, indicating ventilation contrasts. Preprocessing removed outliers and applied temperature-humidity corrections to the low-cost sensors. Differences across locations and times are statistically significant (p<0.05), and cross-sensor correlations are strong (r≈0.88-0.96) after correction. Compared with fixed ground stations, the system provides fine-scale three-dimensional coverage and real-time visualization useful for field decisions. Limitations include payload-constrained endurance and intermittent data loss in obstructed areas. Findings support targeted interventions, such as improving canyon ventilation around flyovers and expanding urban greenery, relevant to Makassar and similar tropical cities.
Funding: Supported by the Natural Science Foundation of Sichuan Province, China (2024NSFSC1272); the Innovation Team Development Funds for Sichuan Mutton Goat & Sheep, China (SCCXTD-2024-14); and the Scientific and Technological Innovation Team for Qinghai-Tibetan Plateau Research in Southwest Minzu University, China (2024CXTD08).
Abstract: Bocaparvovirus, a member of the genus Bocaparvovirus within the subfamily Parvovirinae and the family Parvoviridae, is a small, non-enveloped, single-stranded DNA virus. This pathogen poses health risks to both humans and animals. The Bocaparvovirus genome.
Abstract: The authors regret that the original publication of this paper did not include Jawad Fayaz as a co-author. After further discussions and a thorough review of the research contributions, it was agreed that his significant contributions to the foundational aspects of the research warranted recognition, and he has now been added as a co-author.
Funding: Supported by the Program of National Natural Science Foundation of China (U23A20329, 62163036); the Youth Academic and Technical Leaders Reserve Talent Training project (202105AC160094); and the Industrial Innovation Talent Special Project of Xingdian Talent Support Program (XDYC-CYCX-2022-0010).
Abstract: Kernel-based slow feature analysis (SFA) methods have been successfully applied in industrial process fault detection. However, kernel-based SFA methods have high computational complexity when dealing with nonlinearity, leading to delays in detecting time-varying data features. Additionally, the uncertain kernel function and kernel parameters limit the ability of the extracted features to express process characteristics, resulting in poor fault detection performance. To alleviate these problems, a novel randomized auto-regressive dynamic slow feature analysis (RRDSFA) method is proposed to simultaneously monitor operating point deviations and process dynamic faults, enabling real-time monitoring of data features in industrial processes. Firstly, the proposed random Fourier mapping-based method achieves a more effective nonlinear transformation, in contrast to the current kernel-based RDSFA algorithm, which may incur significant computational complexity. Secondly, a randomized RDSFA model is developed to extract nonlinear dynamic slow features. Furthermore, a Bayesian inference-based overall fault monitoring model that includes all RRDSFA sub-models is developed to overcome the randomness of random Fourier mapping. Finally, the superiority and effectiveness of the proposed monitoring method are demonstrated through a numerical case and a simulation of a continuous stirred tank reactor.
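To make the random Fourier mapping step concrete, here is a minimal sketch of the standard random Fourier feature construction for an RBF kernel. It is not the RRDSFA algorithm itself: the choice of kernel, the `gamma` parameter, and the function names are assumptions used for illustration.

```python
import math
import random


def make_rff(input_dim, n_features, gamma=1.0, seed=0):
    """Build a random Fourier feature map z(x) such that
    z(x) . z(y) approximates exp(-gamma * ||x - y||^2).

    Frequencies are drawn from N(0, 2*gamma) and phases from U(0, 2*pi).
    """
    rng = random.Random(seed)
    sigma = math.sqrt(2.0 * gamma)
    weights = [[rng.gauss(0.0, sigma) for _ in range(input_dim)]
               for _ in range(n_features)]
    phases = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(n_features)]
    scale = math.sqrt(2.0 / n_features)

    def transform(x):
        # One cosine feature per random frequency/phase pair.
        return [scale * math.cos(sum(w_i * x_i for w_i, x_i in zip(w, x)) + b)
                for w, b in zip(weights, phases)]

    return transform


def dot(u, v):
    """Plain inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


if __name__ == "__main__":
    phi = make_rff(input_dim=2, n_features=2000, gamma=0.5)
    x, y = [0.1, 0.2], [0.3, -0.1]
    exact = math.exp(-0.5 * sum((a - b) ** 2 for a, b in zip(x, y)))
    print("approximate kernel:", dot(phi(x), phi(y)), "exact kernel:", exact)
```

The explicit map replaces an n-by-n kernel matrix with an n-by-D feature matrix, which is where the computational saving over kernel methods comes from; because the map is random, different seeds give slightly different features.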
Funding: Supported in part by the National Science Fund for Distinguished Young Scholars of China (62225303); the National Natural Science Foundation of China (62303039, 62433004); the China Postdoctoral Science Foundation (BX20230034, 2023M730190); the Fundamental Research Funds for the Central Universities (buctrc202201, QNTD2023-01); and the High Performance Computing Platform, College of Information Science and Technology, Beijing University of Chemical Technology.
Abstract: Data-driven process monitoring is an effective approach to ensuring safe operation of modern manufacturing and energy systems, such as the thermal power plants studied in this work. Industrial processes are inherently dynamic and need to be monitored using dynamic algorithms. Mainstream dynamic algorithms rely on concatenating the current measurement with past data. This work proposes a new, alternative dynamic process monitoring algorithm using dot product feature analysis (DPFA). DPFA computes the dot product of consecutive samples, thus naturally capturing the process dynamics through temporal correlation. At the same time, DPFA's online computational complexity is lower than that of not just existing dynamic algorithms, but also classical static algorithms (e.g., principal component analysis and slow feature analysis). The detectability of the new algorithm is analyzed for three types of faults typically seen in process systems: sensor bias, process fault, and gain change fault. Through experiments with a numerical example and real data from a thermal power plant, the DPFA algorithm is shown to be superior to state-of-the-art methods in terms of monitoring performance (fault detection rate and false alarm rate) and computational complexity.
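The only DPFA step the abstract spells out is taking the dot product of consecutive samples. The sketch below shows just that operation on a small multivariate series; how the paper scales the data or turns these values into a monitoring statistic is not stated, so nothing beyond the dot product is implemented, and the function name is hypothetical.

```python
def consecutive_dot_products(samples):
    """Return x[t-1] . x[t] for every pair of consecutive samples.

    `samples` is a list of equal-length measurement vectors ordered in time.
    """
    if any(len(s) != len(samples[0]) for s in samples):
        raise ValueError("all samples must have the same dimension")
    return [sum(a * b for a, b in zip(prev, curr))
            for prev, curr in zip(samples, samples[1:])]


if __name__ == "__main__":
    series = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    print(consecutive_dot_products(series))
```

Each output value needs only one pass over the variables, which is consistent with the low online cost the abstract claims.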
Funding: Research and Development Plan of Key Areas of Hunan Science and Technology Department (2022SK2044); Clinical Research Center for Depressive Disorder in Hunan Province (2021SK4022).
Abstract: Objective To determine the correlation between the traditional Chinese medicine (TCM) inspection of spirit classification and the severity grade of depression based on facial features, offering insights for intelligent integrated TCM and western medicine diagnosis of depression. Methods Using the Audio-Visual Emotion Challenge and Workshop (AVEC 2014) public dataset on depression, which includes 150 interview videos, the samples were classified according to the TCM inspection of spirit classification: Deshen (得神, presence of spirit), Shaoshen (少神, insufficiency of spirit), and Shenluan (神乱, confusion of spirit). Meanwhile, based on the Beck Depression Inventory-II (BDI-II) score for the severity grade of depression, the samples were divided into minimal (0-13, Q1), mild (14-19, Q2), moderate (20-28, Q3), and severe (29-63, Q4). Sixty-eight landmarks were extracted with a ResNet-50 network, and the feature extraction mode was standardized. Random forest and support vector machine (SVM) classifiers were used to predict the TCM inspection of spirit classification and the severity grade of depression, respectively. A Chi-square test and Apriori association rule mining were then applied to quantify and explore the relationships. Results The analysis revealed a statistically significant and moderately strong association between TCM spirit classification and the severity grade of depression, as confirmed by a Chi-square test (χ²=14.04, P=0.029) with a Cramer's V effect size of 0.243. Further exploration using association rule mining identified the most compelling rule: "moderate depression (Q3) → Shenluan". This rule had a support of 5%, indicating that this specific co-occurrence was present in 5% of the cohort. Crucially, it achieved a high confidence of 86%, meaning that among patients classified as Q3, 86% exhibited the Shenluan pattern according to TCM assessment. The lift of 2.37 signifies that the observed likelihood of Shenluan manifesting in Q3 patients is 2.37 times higher than would be expected by chance if these states were independent, which is compelling evidence of a highly non-random association. Consequently, Shenluan emerges as a distinct and core TCM diagnostic manifestation strongly linked to Q3, forming a clinically significant phenotype within this patient subgroup.
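The three rule metrics quoted above are related by simple ratios, which the following sketch computes from raw counts. The counts in the example are made up for illustration and are not the study's data; only the reported confidence (0.86) and lift (2.37) come from the abstract.

```python
def rule_metrics(n_total, n_antecedent, n_consequent, n_both):
    """Support, confidence and lift for a rule A -> B from co-occurrence counts.

    support    = P(A and B)
    confidence = P(B | A)
    lift       = P(B | A) / P(B)
    """
    if min(n_total, n_antecedent, n_consequent) <= 0:
        raise ValueError("counts must be positive")
    support = n_both / n_total
    confidence = n_both / n_antecedent
    lift = confidence / (n_consequent / n_total)
    return support, confidence, lift


if __name__ == "__main__":
    # Hypothetical counts, chosen only to show the arithmetic.
    print(rule_metrics(n_total=100, n_antecedent=10, n_consequent=40, n_both=8))

    # Because lift = confidence / P(B), the reported figures imply the
    # overall share of Shenluan in the cohort.
    print("implied P(Shenluan):", 0.86 / 2.37)
```

Read this way, a lift of 2.37 at 86% confidence implies that roughly 36% of all samples were labelled Shenluan, so the rule is informative but not because Shenluan is rare.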
Funding: The National High Technology Research and Development Program of China (863 Program) (No. 2006AA04Z416); the National Natural Science Foundation of China (No. 50538020).
Abstract: To address the problem of on-line damage diagnosis in structural health monitoring (SHM), an algorithm for feature extraction and damage alarming based on auto-regressive moving-average (ARMA) time series analysis is presented. The monitoring data are first modeled as ARMA models, and a principal component matrix derived from the AR coefficients of these models is used to establish Mahalanobis distance criterion functions. Then, a new damage-sensitive feature index DDSF is proposed. A hypothesis test using the t-test is further applied to reach a damage alarming decision, based on whether the mean value of DDSF has changed significantly after damage. The numerical results for a three-span-girder model show that the defined index is sensitive to subtle structural damage, and that the proposed algorithm can be applied to on-line damage alarming in SHM.
Funding: Supported by Research Results of Scientific Research Project of Guangxi Education Department (201010LX531).
Abstract: This paper introduces the development stage and role of site analysis in landscape design. Taking Canal Park in Yichang City as an example, it discusses landscape design concepts and summarizes the design concepts of this park as: continuing the history and culture of the site; forming a rare urban wetland landscape; respecting the surrounding environment and integrating leisure and recreation with an ideal landscape layout; displaying regional customs through core scenic spots; and manifesting regional features through plant planning. From the analysis of the features of Canal Park, four approaches to constructing urban park features are identified: guiding with new concepts centered on urban development requirements; reflecting regional customs; exploiting local historical resources; and fully expressing the character of the park. The paper argues that urban park feature construction should be based on site analysis. Overt natural landscape features and covert cultural and historical resources should be explored and refined, then decomposed, processed, and integrated into concrete concepts. Finally, the individual features of the site, and even of the city, can be embodied in concrete landscape elements.
Abstract: This paper explores how Chinese college students' lives are represented in graffiti collected on campus. The article analyzes and compares the topics of graffiti from different settings and the linguistic features they manifest. The findings show that, compared with graffiti abroad, less of the graffiti from the female toilets and classrooms of this university concerns political issues. Graffiti in the female toilets mainly focuses on the theme of love and is more interactive in discourse, whereas graffiti on desks tends to cover mixed themes and be less interactive. There are more graphic graffiti and exam answers on undergraduate students' desks than on postgraduates'. The graffiti show linguistic features such as thematization, repetition, and salience.
Funding: Partly supported by the Youth Foundation of the Inner Mongolia Natural Science Foundation [grant numbers 2024QN06017 and 2025MS06022]; the Basic Scientific Research Business Fee Project for Universities in Inner Mongolia [grant numbers 2023XKJX019 and 2023XKJX024]; and the Central Guidance on Local Science and Technology Development Fund [grant number 2024ZY0084].
Abstract: Traffic flow prediction is a fundamental component of Intelligent Transportation Systems (ITS), playing a pivotal role in mitigating congestion, enhancing route optimization, and improving the utilization efficiency of roadway infrastructure. However, existing methods struggle in complex traffic scenarios due to static spatio-temporal embedding, restricted multi-scale temporal modeling, and weak representation of local spatial interactions. This study proposes Bi-STAT+, an enhanced bidirectional spatio-temporal attention framework that addresses these limitations through three principal contributions: (1) an adaptive spatio-temporal embedding module that dynamically adjusts embeddings to capture complex traffic variations; (2) frequency-domain analysis in the temporal dimension for simultaneous extraction of high-frequency details and low-frequency trends; and (3) an agent attention mechanism in the spatial dimension that enhances local feature extraction through dynamic weight allocation. Extensive experiments were performed on four distinct datasets, including two public benchmark datasets (PEMS04 and PEMS08) and two private datasets collected from Baotou and Chengdu, China. The results demonstrate that Bi-STAT+ consistently outperforms existing methods in terms of MAE, RMSE, and MAPE, while maintaining strong robustness against missing data and noise. Furthermore, the results highlight that prediction accuracy improves significantly with higher sampling rates, providing crucial insights for optimizing real-world deployment scenarios.
Abstract: Unconfined Compressive Strength (UCS) is a key parameter for assessing the stability and performance of stabilized soils, yet traditional laboratory testing is both time and resource intensive. In this study, an interpretable machine learning approach to UCS prediction is presented, pairing five models (Random Forest (RF), Gradient Boosting (GB), Extreme Gradient Boosting (XGB), CatBoost, and K-Nearest Neighbors (KNN)) with SHapley Additive exPlanations (SHAP) to enhance interpretability and guide feature removal. A complete dataset of 12 geotechnical and chemical parameters, including Atterberg limits, compaction properties, stabilizer chemistry, dosage, and curing time, was used to train and test the models. R², RMSE, MSE, and MAE were used to assess performance. Initial results with all 12 features indicated that the boosting-based models (GB, XGB, CatBoost) exhibited the highest predictive accuracy (R²=0.93) with satisfactory generalization on test data, followed by RF and KNN. SHAP analysis consistently ranked CaO content, curing time, stabilizer dosage, and compaction parameters as the most important features, in line with established soil stabilization mechanisms. Models were then re-trained on the top 8 and top 5 SHAP-ranked features. Interestingly, GB, XGB, and CatBoost maintained comparable accuracy with the reduced input sets, while RF was moderately sensitive and KNN performed somewhat better owing to the reduced dimensionality. The findings confirm that SHAP-guided feature reduction enables cost-effective UCS prediction by reducing laboratory test requirements without significant accuracy loss. The suggested hybrid approach offers an explainable, interpretable, and cost-effective tool for geotechnical engineering practice.
Funding: Supported by the Tianjin Manufacturing High Quality Development Special Foundation (No. 20232185) and the Roycom Foundation (No. 70306901).
Abstract: Hard disk drives (HDDs) serve as the primary storage devices in modern data centers. A failure often leads to severe data loss, significantly degrading the reliability of storage systems. Numerous studies have proposed machine learning-based HDD failure prediction models. However, the Self-Monitoring, Analysis, and Reporting Technology (SMART) attributes differ across HDD manufacturers. We define hard drives of the same brand and model as homogeneous HDD groups, and those from different brands or models as heterogeneous HDD groups. In practical engineering scenarios, a data center is often composed of a heterogeneous population of HDDs, spanning multiple vendors and models. Existing research predominantly focuses on homogeneous datasets, ignoring the model's generalization capability across heterogeneous HDDs. As a result, HDD models with limited samples often suffer from poor training effectiveness and prediction performance. To address this issue, we investigate generalizable SMART predictors across heterogeneous HDD groups. By extracting time-series features within a fixed sliding time window, we propose a Heterogeneous Disk Failure Prediction Method based on Time Series Features (HDFPM) framework. This method is adaptable to HDD models with limited sample sizes, thereby enhancing its applicability and robustness across diverse drive populations. Experimental results show that the proposed model achieves an F1-score of 0.9518 when applied to two different Seagate HDD models, while maintaining the False Positive Rate (FPR) below 1%. After incorporating the Complexity-Ratio Dynamic Time Warping (CDTW) based feature enhancement method, the best prediction model achieves a True Positive Rate (TPR) of up to 0.93 between the two models. For next-day failure prediction across various Seagate models, the model achieves an F1-score of up to 0.8792. Moreover, the experimental results show that within the same brand, the higher the proportion of shared SMART attributes across different models, the better the prediction performance. In addition, HDFPM demonstrates the best stability and the most significant performance in heterogeneous environments.
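The abstract does not list which time-series features HDFPM extracts inside each window. As a sketch of the general idea, the code below slides a fixed-length window over one attribute's daily readings and summarizes each window; the three statistics (mean, standard deviation, first-to-last change) and all names are assumptions chosen for illustration.

```python
from statistics import mean, pstdev


def window_features(series, window):
    """Summarize every length-`window` slice of a daily attribute series.

    Returns one dict per window position, oldest window first.
    """
    if window < 1 or window > len(series):
        raise ValueError("window must be between 1 and len(series)")
    features = []
    for end in range(window, len(series) + 1):
        chunk = series[end - window:end]
        features.append({
            "mean": mean(chunk),            # level within the window
            "std": pstdev(chunk),           # variability within the window
            "delta": chunk[-1] - chunk[0],  # net change across the window
        })
    return features


if __name__ == "__main__":
    # Hypothetical daily readings of a single SMART attribute.
    readings = [1, 2, 3, 4]
    for row in window_features(readings, window=3):
        print(row)
```

Window-level summaries like these depend on how an attribute changes rather than on its raw scale, which is one plausible reason such features transfer better between drive models.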
Funding: This project is supported by the National Natural Science Foundation of China (No. 50275154) and the Municipal Natural Science Foundation of Chongqing, China (No. 8773).
Abstract: Effective extraction of the vibration signal is an important precondition for machine fault diagnosis. Based on the characteristics of the noise mixed in during vibration signal sampling, the independent component analysis (ICA) method is combined with wavelet analysis for de-noising. First, the sampled signal is separated with ICA; then the frequency-band selection capability of the multi-resolution wavelet transform is used to judge whether a stochastic disturbance singular signal is mixed in. In this way, the vibration signals can be extracted effectively, which provides favorable conditions for subsequent feature detection of the vibration signal and fault diagnosis.
Funding: Project (51722401) supported by the National Natural Science Foundation for Excellent Young Scholars of China; Project (FRF-TP-18-003C1) supported by the Fundamental Research Funds for the Central Universities, China; Project (51734001) supported by the Key Program of National Natural Science Foundation of China.
Abstract: The strength of cement-based materials, such as mortar, concrete, and cement paste backfill (CPB), depends on their microstructure (e.g., pore structure and the arrangement of particles and skeleton). Numerous studies have examined the relationship between strength and pore structure (e.g., pore size and its distribution), but micro-morphology characteristics have rarely been considered. Texture, which describes the surface properties of a sample, is a global feature and an effective way to quantify micro-morphological properties. In statistical analysis, GLCM features and Tamura texture are the most representative methods for characterizing texture features. The mechanical strength and section images of backfill samples prepared from paste at three different solid concentrations were obtained by uniaxial compressive strength tests and scanning electron microscopy, respectively. The texture features of the different SEM images were calculated using image analysis, and the correlation between these parameters and the strength was then analyzed. The method proved effective for quantitative analysis of the micro-morphology characteristics of CPB. There is a significant correlation between the texture features and the unconfined compressive strength, and predicting strength from texture parameters of the CPB microstructure is feasible.
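For readers unfamiliar with GLCM features, the following minimal sketch builds a gray-level co-occurrence matrix for one pixel offset and derives the contrast feature from it. The tiny image, the number of gray levels, and the offset are illustrative assumptions; the abstract does not state which GLCM settings or features the study used.

```python
def glcm(image, levels, dx=1, dy=0):
    """Normalized gray-level co-occurrence matrix for the offset (dx, dy).

    `image` is a list of rows of integer gray levels in range(levels).
    Entry [i][j] is the share of pixel pairs whose first pixel has level i
    and whose neighbor at (row + dy, col + dx) has level j.
    """
    counts = [[0] * levels for _ in range(levels)]
    rows, cols = len(image), len(image[0])
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dy, c + dx
            if 0 <= r2 < rows and 0 <= c2 < cols:
                counts[image[r][c]][image[r2][c2]] += 1
    total = sum(sum(row) for row in counts)
    if total == 0:
        raise ValueError("offset leaves no pixel pairs inside the image")
    return [[v / total for v in row] for row in counts]


def contrast(p):
    """GLCM contrast: sum over i, j of (i - j)^2 * p[i][j]."""
    n = len(p)
    return sum((i - j) ** 2 * p[i][j] for i in range(n) for j in range(n))


if __name__ == "__main__":
    tiny = [[0, 0, 1],
            [1, 1, 0]]
    matrix = glcm(tiny, levels=2)
    print(matrix, contrast(matrix))
```

Contrast grows when neighboring pixels differ strongly in gray level, so rougher, more heterogeneous surfaces score higher than smooth ones.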
Funding: Supported by the National Key Research and Development Program of China (No. 2016YFF0102502); the Key Research Program of Frontier Sciences, CAS (No. QYZDJ-SSW-JSC037); the Youth Innovation Promotion Association, CAS; and the Liaoning Revitalization Talents Program (No. XLYC1807110).
Abstract: In the spectral analysis of laser-induced breakdown spectroscopy, abundant characteristic spectral lines and severe interference coexist in the original spectral data. Here, a feature selection method for the original spectral data, recursive feature elimination based on ridge regression (Ridge-RFE), is proposed to make full use of the valid information in the spectra. In the Ridge-RFE method, the absolute value of the ridge regression coefficient is used as the criterion for screening spectral features: the feature with the smallest absolute weight in the current input subset is removed by recursive feature elimination (RFE), and the selected features are used as inputs to the partial least squares regression (PLS) model. The Ridge-RFE based PLS model was used to measure Fe, Si, Mg, Cu, Zn, and Mn in 51 aluminum alloy samples, and the results showed that the root mean square error of prediction decreased greatly compared with the PLS model using the full spectrum as input. The overall results demonstrate that the Ridge-RFE method is more efficient at eliminating redundant features, yields better quantitative analysis results from the PLS model, and improves model generalization ability.
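The elimination loop described above can be sketched in a few lines. This is a simplified illustration, not the authors' implementation: it fits ridge regression without an intercept (so it assumes centered, comparably scaled inputs), removes one feature per round, and uses made-up data; `alpha`, `n_keep`, and the function names are assumptions.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with pivoting."""
    n = len(a)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def ridge_coefficients(x, y, alpha):
    """Closed-form ridge weights: solve (X^T X + alpha I) w = X^T y."""
    p = len(x[0])
    gram = [[sum(row[i] * row[j] for row in x) + (alpha if i == j else 0.0)
             for j in range(p)] for i in range(p)]
    xty = [sum(row[i] * t for row, t in zip(x, y)) for i in range(p)]
    return solve(gram, xty)


def ridge_rfe(x, y, n_keep, alpha=1.0):
    """Repeatedly drop the feature with the smallest |ridge weight|.

    Returns the original column indices of the surviving features.
    """
    active = list(range(len(x[0])))
    while len(active) > n_keep:
        sub = [[row[j] for j in active] for row in x]
        w = ridge_coefficients(sub, y, alpha)
        weakest = min(range(len(active)), key=lambda k: abs(w[k]))
        del active[weakest]
    return active


if __name__ == "__main__":
    # Made-up data: the target depends on columns 0 and 2 only.
    data = [[1, 0.1, 2], [2, -0.1, 1], [3, 0.2, 4],
            [4, -0.2, 3], [5, 0.1, 6], [6, -0.1, 5]]
    target = [2 * r[0] + 3 * r[2] for r in data]
    print(ridge_rfe(data, target, n_keep=2, alpha=1e-6))
```

Refitting after each removal matters because ridge weights shift once a correlated feature is gone, so ranking all features from a single fit can discard the wrong ones. In the study the surviving features then feed a PLS model, which is not shown here.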