Traditional Chinese medicine(TCM)has played a significant role in the prevention and treatment of chronic heart failure(CHF).To study TCM diagnosis of CHF,a total of 278 Chinese clinical research articles on the study...Traditional Chinese medicine(TCM)has played a significant role in the prevention and treatment of chronic heart failure(CHF).To study TCM diagnosis of CHF,a total of 278 Chinese clinical research articles on the study of CHF syndromes in recent 40 years retrieved from Web of Science,Scopus,Pub Med,Embase,CNKI,Wanfang Data,Cq VIP,and Sino Med.According to cumulative frequency analysis,network analysis,and hierarchical cluster analysis,the study found the distribution of CHF syndromes was syndrome of qi deficiency with blood stasis,syndrome of qi and yin deficiency,syndrome of yang deficiency with water flooding,syndrome of heart blood stasis obstruction,syndrome of turbid phlegm,and syndrome of collapse due to primordial yang deficiency.The syndrome elements on location of illness were heart,kidney,lung,and spleen.The syndrome elements on nature of illness were qi deficiency,blood stasis,yang deficiency,yin deficiency,water retention,and turbid phlegm.These findings can provide reference to the research on diagnosis and treatment of CHF,and contribute to the study on syndrome standardization and objective research of TCM diagnosis.展开更多
Purpose:This study analyzes the profiles of elite Brazilian researchers,recognized through the prestigious CNPq productivity scholarships.By identifying distinct researcher clusters,the study sheds light on different ...Purpose:This study analyzes the profiles of elite Brazilian researchers,recognized through the prestigious CNPq productivity scholarships.By identifying distinct researcher clusters,the study sheds light on different academic strategies,levels of productivity,and academic contributions within the Brazilian higher education system.Design/methodology/approach:The research analyzes a comprehensive dataset of 14,003 researchers,employing principal component analysis(PCA)followed by cluster analysis to group researchers based on their academic attributes.The clusters reflect diverse aspects of research productivity,graduate supervisions,and publication patterns.Findings:The analysis reveals the existence of three distinct researcher profiles(the Advanced Supervisors,the Book Publishers/Organizers,and the Generalists).The study also highlights the characteristics of highcaliber scientists,representing the upper echelon of Brazilian researchers in terms of productivity and impact.Research limitations:Although the study provides a robust analysis of the Brazilian system,the results reflect specific characteristics of the Brazilian academic context.Furthermore,the analysis was restricted to normalized annual data,which may overlook temporal variations in researcher productivity.Pratical implications:The findings provide valuable insights for policy makers,funding agencies(such as CNPq),and university administrators who aim to develop tailored support programs for different researcher profiles.Originality/value:The cluster-based profiling offers a novel perspective on how different academic trajectories coexist within a national science system,offering lessons for other emerging economies.展开更多
With the continuous expansion of the power system scale and the increasing complexity of operational mode,the interaction between transmission and distribution systems is becoming more and more significant,placing hig...With the continuous expansion of the power system scale and the increasing complexity of operational mode,the interaction between transmission and distribution systems is becoming more and more significant,placing higher requirements on the accuracy and efficiency of the power system state estimation to address the challenge of balancing computational efficiency and estimation accuracy in traditional coupled transmission and distribution state estimation methods,this paper proposes a collaborative state estimation method based on distribution systems state clustering and load model parameter identification.To resolve the scalability issue of coupled transmission and distribution power systems,clustering is first carried out based on the distribution system states.As the data and models of the transmission system and distribution systems are not shared.For the transmission system,equating the power transmitted from the transmission system to the distribution system is the same as equating the distribution system.Further,the power transmitted from the transmission system to different types of distribution systems is equivalent to different polynomial equivalent load models.Then,a parameter identification method is proposed to obtain the parameters of the equivalent load model.Finally,a transmission and distribution collaborative state estimation model is constructed based on the equivalent load model.The results of the numerical analysis show that compared with the traditional master-slave splitting method,the proposed method significantly enhances computational efficiency while maintaining high estimation accuracy.展开更多
Cycle slip detection and repair is one of the key technologies for GNSS high-precision positioning.We introduce an enhanced methodology for detecting and repairing BDS four-frequency cycle slips,utilizing fuzzy cluste...Cycle slip detection and repair is one of the key technologies for GNSS high-precision positioning.We introduce an enhanced methodology for detecting and repairing BDS four-frequency cycle slips,utilizing fuzzy clustering analysis.Firstly,based on fuzzy clustering analysis,the optimal combinations for the BDS four-frequency,including extra-wide lane(EWL),wide lane(WL),and narrow lane(NL),were selected.Secondly,the feasibility of this method was verified using actual static and dynamic observation data,and different types of cycle slips were simulated for further validation.Meanwhile,the proposed method was compared with the classical Turbo-Edit method through experiments.Finally,cycle slips were repaired using the least squares method.According to the experimental results,the optimal geometry-free phase combinations(-2,2,1,-1),(1,-1,1,-1),(3,2,-2,-3),and the pseudo-range phase combination(-1,1,1,-1),selected based on fuzzy clustering analysis,were used for cycle slip detection.The proposed method accurately detected small,large,and specific cycle slips simulated in the actual data.Compared with the Turbo-Edit method,the proposed methodwas able to detect specific cycle slips that Turbo-Edit could not.It is worth noting that during the repair process,the coefficients of the combined observation values are integers,preserving the integer cycle characteristic of the observation values,which allows cycle slips to be fixed directly,eliminating the need for complex searching procedures.Consequently,by enhancing the precision and reliability of the detection of BDS four-frequency cycle slips,our proposed method provides the support for the high-precision localization of BDS multi-frequency observations.展开更多
Objectives To identify core symptoms and symptom clusters in patients with neuromyelitis optica spectrum disorder(NMOSD)by network analysis.Methods From October 10 to 30,2023,140 patients with NMOSD were selected to p...Objectives To identify core symptoms and symptom clusters in patients with neuromyelitis optica spectrum disorder(NMOSD)by network analysis.Methods From October 10 to 30,2023,140 patients with NMOSD were selected to participate in this online questionnaire survey.The survey tools included a general information questionnaire and a self-made NMOSD symptoms scale,which included the prevalence,severity,and distress of 29 symptoms.Cluster analysis was used to identify symptom clusters,and network analysis was used to analyze the symptom network and node characteristics and central indicators including strength centrality(r_(s)),closeness centrality(r_(c))and betweeness centrality(r_(b))were used to identify core symptoms and symptom clusters.Results The most common symptom was pain(65.7%),followed by paraesthesia(65.0%),fatigue(65.0%),easy awakening(63.6%).Regarding the burden level of symptoms,pain was the most burdensome symptom,followed by paraesthesia,easy awakening,fatigue,and difficulty falling asleep.Six clusters were identified:somatosensory,motor,visual,and memory symptom clusters,bladder and rectum symptom clusters,sleep symptoms clusters,and neuropsychological symptom clusters.Fatigue(r_(s)=12.39,r_(b)=68.00,r_(c)=0.02)was the most central and prominent bridge symptom,and motor symptom cluster(r_(s)=2.68,r_(c)=0.10)was the most central symptom cluster among the six clusters.Conclusions Our study demonstrated the necessity of symptom management targeting fatigue,pain,and motor symptom cluster in patients with NMOSD.展开更多
Objective:This study aims to investigate the patterns of symptom occurrence in patients experiencing acute exacerbations of chronic obstructive pulmonary disease(AECOPD).It will explore the composition of symptom clus...Objective:This study aims to investigate the patterns of symptom occurrence in patients experiencing acute exacerbations of chronic obstructive pulmonary disease(AECOPD).It will explore the composition of symptom clusters and analyze the correlation between these clusters and health-related quality of life(HRQoL).Methods:A total of 207 patients with AE-COPD were surveyed from a tertiary grade A hospital.Data collection was conducted using three validated instruments:the Basic Information Questionnaire(BIQ),Disease Symptom Survey Questionnaire(MSAS),and Quality of Life Questionnaire(CAT).Statistical software SPSS 22.0 was used to analyze the correlation between symptom clusters and quality of life.Results:Exploratory factor analysis showed that five major symptom clusters existed in the patients,including the psycho-emotional symptom cluster,the sleep-related symptom cluster,the other side effects symptom cluster,the energy deficiency symptom cluster and the cough-loss of appetite symptom cluster,and the severity of the symptom clusters showed a significant negative correlation with the quality of life of the patients(P<0.05).Conclusion:Strengthening the comprehensive management of symptom clusters in patients with AE-COPD can help to effectively reduce the symptom burden of patients,and then significantly improve their quality of life.展开更多
In recent years,with the rapid development of software systems,the continuous expansion of software scale and the increasing complexity of systems have led to the emergence of a growing number of software metrics.Defe...In recent years,with the rapid development of software systems,the continuous expansion of software scale and the increasing complexity of systems have led to the emergence of a growing number of software metrics.Defect prediction methods based on software metric elements highly rely on software metric data.However,redundant software metric data is not conducive to efficient defect prediction,posing severe challenges to current software defect prediction tasks.To address these issues,this paper focuses on the rational clustering of software metric data.Firstly,multiple software projects are evaluated to determine the preset number of clusters for software metrics,and various clustering methods are employed to cluster the metric elements.Subsequently,a co-occurrence matrix is designed to comprehensively quantify the number of times that metrics appear in the same category.Based on the comprehensive results,the software metric data are divided into two semantic views containing different metrics,thereby analyzing the semantic information behind the software metrics.On this basis,this paper also conducts an in-depth analysis of the impact of different semantic view of metrics on defect prediction results,as well as the performance of various classification models under these semantic views.Experiments show that the joint use of the two semantic views can significantly improve the performance of models in software defect prediction,providing a new understanding and approach at the semantic view level for defect prediction research based on software metrics.展开更多
Hepatitis B virus remains a major cause of cirrhosis and hepatocellular carcinoma,with genetic polymorphisms and mutations influencing immune responses and disease progression.Nguyen et al present novel findings on sp...Hepatitis B virus remains a major cause of cirrhosis and hepatocellular carcinoma,with genetic polymorphisms and mutations influencing immune responses and disease progression.Nguyen et al present novel findings on specific human leukocyte antigen(HLA)alleles,including rs2856718 of HLA-DQ and rs3077 and rs9277535 of HLA-DP,which may predispose individuals to cirrhosis and liver cancer,based on multi-clustering analysis.Here,we discuss the feasibility of this approach and identify key areas for further investigation,aiming to offer insights for advancing clinical practice and research in liver disease and related cancers.展开更多
In the effort to enhance cardiovascular diagnostics,deep learning-based heart sound classification presents a promising solution.This research introduces a novel preprocessing method:iterative k-means clustering combi...In the effort to enhance cardiovascular diagnostics,deep learning-based heart sound classification presents a promising solution.This research introduces a novel preprocessing method:iterative k-means clustering combined with silhouette score analysis,aimed at downsampling.This approach ensures optimal cluster formation and improves data quality for deep learning models.The process involves applying k-means clustering to the dataset,calculating the average silhouette score for each cluster,and selecting the clusterwith the highest score.We evaluated this method using 10-fold cross-validation across various transfer learningmodels fromdifferent families and architectures.The evaluation was conducted on four datasets:a binary dataset,an augmented binary dataset,amulticlass dataset,and an augmentedmulticlass dataset.All datasets were derived from the Heart Wave heart sounds dataset,a novelmulticlass dataset introduced by our research group.To increase dataset sizes and improve model training,data augmentation was performed using heartbeat cycle segmentation.Our findings highlight the significant impact of the proposed preprocessing approach on the HeartWave datasets.Across all datasets,model performance improved notably with the application of our method.In augmented multiclass classification,the MobileNetV2 model showed an average weighted F1-score improvement of 27.10%.In binary classification,ResNet50 demonstrated an average accuracy improvement of 8.70%,reaching 92.40%compared to its baseline performance.These results underscore the effectiveness of clustering with silhouette score analysis as a preprocessing step,significantly enhancing model accuracy and robustness.They also emphasize the critical role of preprocessing in addressing class imbalance and advancing precision medicine in cardiovascular diagnostics.展开更多
Remarkable progress has been made in infection prevention and control(IPC)in many countries,but some gaps emerged in the context of the coronavirus disease 2019(COVID-19)pandemic.Core capabilities such as standard cli...Remarkable progress has been made in infection prevention and control(IPC)in many countries,but some gaps emerged in the context of the coronavirus disease 2019(COVID-19)pandemic.Core capabilities such as standard clinical precautions and tracing the source of infection were the focus of IPC in medical institutions during the pandemic.Therefore,the core competences of IPC professionals during the pandemic,and how these contributed to successful prevention and control of the epidemic,should be studied.To investigate,using a systematic review and cluster analysis,fundamental improvements in the competences of infection control and prevention professionals that may be emphasized in light of the COVID-19 pandemic.We searched the PubMed,Embase,Cochrane Library,Web of Science,CNKI,WanFang Data,and CBM databases for original articles exploring core competencies of IPC professionals during the COVID-19 pandemic(from January 1,2020 to February 7,2023).Weiciyun software was used for data extraction and the Donohue formula was followed to distinguish high-frequency technical terms.Cluster analysis was performed using the within-group linkage method and squared Euclidean distance as the metric to determine the priority competencies for development.We identified 46 studies with 29 high-frequency technical terms.The most common term was“infection prevention and control training”(184 times,17.3%),followed by“hand hygiene”(172 times,16.2%).“Infection prevention and control in clinical practice”was the most-reported core competency(367 times,34.5%),followed by“microbiology and surveillance”(292 times,27.5%).Cluster analysis showed two key areas of competence:Category 1(program management and leadership,patient safety and occupational health,education and microbiology and surveillance)and Category 2(IPC in clinical practice).During the COVID-19 pandemic,IPC program management and leadership,microbiology and surveillance,education,patient safety,and occupational health were the most important focus of development and should be given due consideration by IPC professionals.展开更多
Analysis of cluster behaviors of the on-coming cluster is an essential measure for high value locations on the battlefield.Unlike target tracking and clustering analysis,cluster behaviors analysis under cluster securi...Analysis of cluster behaviors of the on-coming cluster is an essential measure for high value locations on the battlefield.Unlike target tracking and clustering analysis,cluster behaviors analysis under cluster security requirements remains an open issue,which is a joint analysis of clus-tering,intent reasoning and activity regions.To address this issue,a framework for cluster behav-iors analysis is proposed by incorporating expert knowledge and domain knowledge,and a knowledge-assisted score function with is designed to improve the accuracy of intent reasoning net-work,overcoming the effects of possible knowledge errors.The framework consists of three mod-ules for cluster analysis,intent reasoning and activity region analysis for typical tasks,in which an intent reasoning network is constructed to obtain cluster intents by using a hybrid knowledge and data driven approach.Furthermore,considering the complexity of the battlefield environment,dif-ferent tasks and corresponding activity region optimization functions are designed for cluster activ-ity region analysis,which are vital elements of cluster behaviors analysis.Simulations demonstrate the effectiveness of the proposed cluster behaviors analysis framework.展开更多
Efficient iterative unsupervised machine learning involving probabilistic clustering analysis with the expectation-maximization(EM)clustering algorithm is applied to categorize reservoir facies by exploiting latent an...Efficient iterative unsupervised machine learning involving probabilistic clustering analysis with the expectation-maximization(EM)clustering algorithm is applied to categorize reservoir facies by exploiting latent and observable well-log variables from a clastic reservoir in the Majnoon oilfield,southern Iraq.The observable well-log variables consist of conventional open-hole,well-log data and the computer-processed interpretation of gamma rays,bulk density,neutron porosity,compressional sonic,deep resistivity,shale volume,total porosity,and water saturation,from three wells located in the Nahr Umr reservoir.The latent variables include shale volume and water saturation.The EM algorithm efficiently characterizes electrofacies through iterative machine learning to identify the local maximum likelihood estimates(MLE)of the observable and latent variables in the studied dataset.The optimized EM model developed successfully predicts the core-derived facies classification in two of the studied wells.The EM model clusters the data into three distinctive reservoir electrofacies(F1,F2,and F3).F1 represents a gas-bearing electrofacies with low shale volume(Vsh)and water saturation(Sw)and high porosity and permeability values identifying it as an attractive reservoir target.The results of the EM model are validated using nuclear magnetic resonance(NMR)data from the third studied well for which no cores were recovered.The NMR results confirm the effectiveness and accuracy of the EM model in predicting electrofacies.The utilization of the EM algorithm for electrofacies classification/cluster analysis is innovative.Specifically,the clusters it establishes are less rigidly constrained than those derived from the more commonly used K-means clustering method.The EM methodology developed generates dependable electrofacies estimates in the studied reservoir intervals where core samples are not available.Therefore,once calibrated with core data in some wells,the model is suitable for application to other wells that lack core data.展开更多
Studying user electricity consumption behavior is crucial for understanding their power usage patterns.However,the traditional clustering methods fail to identify emerging types of electricity consumption behavior.To ...Studying user electricity consumption behavior is crucial for understanding their power usage patterns.However,the traditional clustering methods fail to identify emerging types of electricity consumption behavior.To address this issue,this paper introduces a statistical analysis of clusters and evaluates the set of indicators for power usage patterns.The fuzzy C-means clustering algorithm is then used to analyze 6 months of electricity consumption data in 2017 from energy storage equipment,agricultural drainage irrigation,port shore power,and electric vehicles.Finally,the proposed method is validated through experiments,where the Davies-Bouldin index and profile coefficient are calculated and compared.Experiments showed that the optimal number of clusters is 4.This study demonstrates the potential of using a fuzzy C-means clustering algorithmin identifying emerging types of electricity consumption behavior,which can help power system operators and policymakers to make informed decisions and improve energy efficiency.展开更多
A significant portion of Landslide Early Warning Systems (LEWS) relies on the definition of operational thresholds and the monitoring of cumulative rainfall for alert issuance. These thresholds can be obtained in vari...A significant portion of Landslide Early Warning Systems (LEWS) relies on the definition of operational thresholds and the monitoring of cumulative rainfall for alert issuance. These thresholds can be obtained in various ways, but most often they are based on previous landslide data. This approach introduces several limitations. For instance, there is a requirement for the location to have been previously monitored in some way to have this type of information recorded. Another significant limitation is the need for information regarding the location and timing of incidents. Despite the current ease of obtaining location information (GPS, drone images, etc.), the timing of the event remains challenging to ascertain for a considerable portion of landslide data. Concerning rainfall monitoring, there are multiple ways to consider it, for instance, examining accumulations over various intervals (1 h, 6 h, 24 h, 72 h), as well as in the calculation of effective rainfall, which represents the precipitation that actually infiltrates the soil. However, in the vast majority of cases, both the thresholds and the rain monitoring approach are defined manually and subjectively, relying on the operators’ experience. This makes the process labor-intensive and time-consuming, hindering the establishment of a truly standardized and rapidly scalable methodology on a large scale. In this work, we propose a Landslides Early Warning System (LEWS) based on the concept of rainfall half-life and the determination of thresholds using Cluster Analysis and data inversion. The system is designed to be applied in extensive monitoring networks, such as the one utilized by Cemaden, Brazil’s National Center for Monitoring and Early Warning of Natural Disasters.展开更多
To solve the problems of a few optical fibre line fault samples and the inefficiency of manual communication optical fibre fault diagnosis,this paper proposes a communication optical fibre fault diagnosis model based ...To solve the problems of a few optical fibre line fault samples and the inefficiency of manual communication optical fibre fault diagnosis,this paper proposes a communication optical fibre fault diagnosis model based on variational modal decomposition(VMD),fuzzy entropy(FE)and fuzzy clustering(FC).Firstly,based on the OTDR curve data collected in the field,VMD is used to extract the different modal components(IMF)of the original signal and calculate the fuzzy entropy(FE)values of different components to characterize the subtle differences between them.The fuzzy entropy of each curve is used as the feature vector,which in turn constructs the communication optical fibre feature vector matrix,and the fuzzy clustering algorithm is used to achieve fault diagnosis of faulty optical fibre.The VMD-FE combination can extract subtle differences in features,and the fuzzy clustering algorithm does not require sample training.The experimental results show that the model in this paper has high accuracy and is relevant to the maintenance of communication optical fibre when compared with existing feature extraction models and traditional machine learning models.展开更多
This paper investigates the design essence of Chinese classical private gardens,integrating their design elements and fundamental principles.It systematically analyzes the unique characteristics and differences among ...This paper investigates the design essence of Chinese classical private gardens,integrating their design elements and fundamental principles.It systematically analyzes the unique characteristics and differences among classical private gardens in the Northern,Jiangnan,and Lingnan regions.The study examines nine classical private gardens from Northern China,Jiangnan,and Lingnan by utilizing the advanced tool of principal component cluster analysis.Based on literature analysis and field research,273 variables were selected for principal component analysis,from which four components with higher contribution rates were chosen for further study.Subsequently,we employed clustering analysis techniques to compare the differences among the three types of gardens.The results reveal that the first principal component effectively highlights the differences between Jiangnan and Lingnan private gardens.The second principal component serves as the key to defining the types of Northern private gardens and distinguishing them from the other two types,and the third principal component indicates that Lingnan private gardens can be categorized into two distinct types as well.展开更多
With the development of information technology,radio communication technology has made rapid progress.Many radio signals that have appeared in space are difficult to classify without manually labeling.Unsupervised rad...With the development of information technology,radio communication technology has made rapid progress.Many radio signals that have appeared in space are difficult to classify without manually labeling.Unsupervised radio signal clustering methods have recently become an urgent need for this situation.Meanwhile,the high complexity of deep learning makes it difficult to understand the decision results of the clustering models,making it essential to conduct interpretable analysis.This paper proposed a combined loss function for unsupervised clustering based on autoencoder.The combined loss function includes reconstruction loss and deep clustering loss.Deep clustering loss is added based on reconstruction loss,which makes similar deep features converge more in feature space.In addition,a features visualization method for signal clustering was proposed to analyze the interpretability of autoencoder utilizing Saliency Map.Extensive experiments have been conducted on a modulated signal dataset,and the results indicate the superior performance of our proposed method over other clustering algorithms.In particular,for the simulated dataset containing six modulation modes,when the SNR is 20dB,the clustering accuracy of the proposed method is greater than 78%.The interpretability analysis of the clustering model was performed to visualize the significant features of different modulated signals and verified the high separability of the features extracted by clustering model.展开更多
Failure mode and effect analysis(FMEA)is a preven-tative risk evaluation method used to evaluate and eliminate fail-ure modes within a system.However,the traditional FMEA method exhibits many deficiencies that pose ch...Failure mode and effect analysis(FMEA)is a preven-tative risk evaluation method used to evaluate and eliminate fail-ure modes within a system.However,the traditional FMEA method exhibits many deficiencies that pose challenges in prac-tical applications.To improve the conventional FMEA,many modified FMEA models have been suggested.However,the majority of them inadequately address consensus issues and focus on achieving a complete ranking of failure modes.In this research,we propose a new FMEA approach that integrates a two-stage consensus reaching model and a density peak clus-tering algorithm for the assessment and clustering of failure modes.Firstly,we employ the interval 2-tuple linguistic vari-ables(I2TLVs)to express the uncertain risk evaluations provided by FMEA experts.Then,a two-stage consensus reaching model is adopted to enable FMEA experts to reach a consensus.Next,failure modes are categorized into several risk clusters using a density peak clustering algorithm.Finally,the proposed FMEA is illustrated by a case study of load-bearing guidance devices of subway systems.The results show that the proposed FMEA model can more easily to describe the uncertain risk information of failure modes by using the I2TLVs;the introduction of an endogenous feedback mechanism and an exogenous feedback mechanism can accelerate the process of consensus reaching;and the density peak clustering of failure modes successfully improves the practical applicability of FMEA.展开更多
基金financed by the grants from the National Natural Science Foundation of China(No.81803996)Shanghai Key Laboratory of Health Identification and Assessment(No.21DZ2271000)。
文摘Traditional Chinese medicine(TCM)has played a significant role in the prevention and treatment of chronic heart failure(CHF).To study TCM diagnosis of CHF,a total of 278 Chinese clinical research articles on the study of CHF syndromes in recent 40 years retrieved from Web of Science,Scopus,Pub Med,Embase,CNKI,Wanfang Data,Cq VIP,and Sino Med.According to cumulative frequency analysis,network analysis,and hierarchical cluster analysis,the study found the distribution of CHF syndromes was syndrome of qi deficiency with blood stasis,syndrome of qi and yin deficiency,syndrome of yang deficiency with water flooding,syndrome of heart blood stasis obstruction,syndrome of turbid phlegm,and syndrome of collapse due to primordial yang deficiency.The syndrome elements on location of illness were heart,kidney,lung,and spleen.The syndrome elements on nature of illness were qi deficiency,blood stasis,yang deficiency,yin deficiency,water retention,and turbid phlegm.These findings can provide reference to the research on diagnosis and treatment of CHF,and contribute to the study on syndrome standardization and objective research of TCM diagnosis.
文摘Purpose:This study analyzes the profiles of elite Brazilian researchers,recognized through the prestigious CNPq productivity scholarships.By identifying distinct researcher clusters,the study sheds light on different academic strategies,levels of productivity,and academic contributions within the Brazilian higher education system.Design/methodology/approach:The research analyzes a comprehensive dataset of 14,003 researchers,employing principal component analysis(PCA)followed by cluster analysis to group researchers based on their academic attributes.The clusters reflect diverse aspects of research productivity,graduate supervisions,and publication patterns.Findings:The analysis reveals the existence of three distinct researcher profiles(the Advanced Supervisors,the Book Publishers/Organizers,and the Generalists).The study also highlights the characteristics of highcaliber scientists,representing the upper echelon of Brazilian researchers in terms of productivity and impact.Research limitations:Although the study provides a robust analysis of the Brazilian system,the results reflect specific characteristics of the Brazilian academic context.Furthermore,the analysis was restricted to normalized annual data,which may overlook temporal variations in researcher productivity.Pratical implications:The findings provide valuable insights for policy makers,funding agencies(such as CNPq),and university administrators who aim to develop tailored support programs for different researcher profiles.Originality/value:The cluster-based profiling offers a novel perspective on how different academic trajectories coexist within a national science system,offering lessons for other emerging economies.
基金State Grid Jiangsu Electric Power Co.,Ltd.Technology Project(J2023121).
文摘With the continuous expansion of the power system scale and the increasing complexity of operational mode,the interaction between transmission and distribution systems is becoming more and more significant,placing higher requirements on the accuracy and efficiency of the power system state estimation to address the challenge of balancing computational efficiency and estimation accuracy in traditional coupled transmission and distribution state estimation methods,this paper proposes a collaborative state estimation method based on distribution systems state clustering and load model parameter identification.To resolve the scalability issue of coupled transmission and distribution power systems,clustering is first carried out based on the distribution system states.As the data and models of the transmission system and distribution systems are not shared.For the transmission system,equating the power transmitted from the transmission system to the distribution system is the same as equating the distribution system.Further,the power transmitted from the transmission system to different types of distribution systems is equivalent to different polynomial equivalent load models.Then,a parameter identification method is proposed to obtain the parameters of the equivalent load model.Finally,a transmission and distribution collaborative state estimation model is constructed based on the equivalent load model.The results of the numerical analysis show that compared with the traditional master-slave splitting method,the proposed method significantly enhances computational efficiency while maintaining high estimation accuracy.
基金supported by the National Natural Science Foundation of China(42174003)the Gansu Provincial Department of Education:Innovation Fund Project for College Teachers(2023A-035)+1 种基金Gansu Provincial Science and Technology Program(Joint Research Fund),24JRRA856the Lanzhou Talent Innovation Project,2023-RC-31.
文摘Cycle slip detection and repair is one of the key technologies for GNSS high-precision positioning.We introduce an enhanced methodology for detecting and repairing BDS four-frequency cycle slips,utilizing fuzzy clustering analysis.Firstly,based on fuzzy clustering analysis,the optimal combinations for the BDS four-frequency,including extra-wide lane(EWL),wide lane(WL),and narrow lane(NL),were selected.Secondly,the feasibility of this method was verified using actual static and dynamic observation data,and different types of cycle slips were simulated for further validation.Meanwhile,the proposed method was compared with the classical Turbo-Edit method through experiments.Finally,cycle slips were repaired using the least squares method.According to the experimental results,the optimal geometry-free phase combinations(-2,2,1,-1),(1,-1,1,-1),(3,2,-2,-3),and the pseudo-range phase combination(-1,1,1,-1),selected based on fuzzy clustering analysis,were used for cycle slip detection.The proposed method accurately detected small,large,and specific cycle slips simulated in the actual data.Compared with the Turbo-Edit method,the proposed methodwas able to detect specific cycle slips that Turbo-Edit could not.It is worth noting that during the repair process,the coefficients of the combined observation values are integers,preserving the integer cycle characteristic of the observation values,which allows cycle slips to be fixed directly,eliminating the need for complex searching procedures.Consequently,by enhancing the precision and reliability of the detection of BDS four-frequency cycle slips,our proposed method provides the support for the high-precision localization of BDS multi-frequency observations.
基金supported by the Specific Research Fund for Top-notch Talents of Guangdong Provincial Hospital of Chinese Medicine(No.2022KT1188).
文摘Objectives To identify core symptoms and symptom clusters in patients with neuromyelitis optica spectrum disorder(NMOSD)by network analysis.Methods From October 10 to 30,2023,140 patients with NMOSD were selected to participate in this online questionnaire survey.The survey tools included a general information questionnaire and a self-made NMOSD symptoms scale,which included the prevalence,severity,and distress of 29 symptoms.Cluster analysis was used to identify symptom clusters,and network analysis was used to analyze the symptom network and node characteristics and central indicators including strength centrality(r_(s)),closeness centrality(r_(c))and betweeness centrality(r_(b))were used to identify core symptoms and symptom clusters.Results The most common symptom was pain(65.7%),followed by paraesthesia(65.0%),fatigue(65.0%),easy awakening(63.6%).Regarding the burden level of symptoms,pain was the most burdensome symptom,followed by paraesthesia,easy awakening,fatigue,and difficulty falling asleep.Six clusters were identified:somatosensory,motor,visual,and memory symptom clusters,bladder and rectum symptom clusters,sleep symptoms clusters,and neuropsychological symptom clusters.Fatigue(r_(s)=12.39,r_(b)=68.00,r_(c)=0.02)was the most central and prominent bridge symptom,and motor symptom cluster(r_(s)=2.68,r_(c)=0.10)was the most central symptom cluster among the six clusters.Conclusions Our study demonstrated the necessity of symptom management targeting fatigue,pain,and motor symptom cluster in patients with NMOSD.
文摘Objective:This study aims to investigate the patterns of symptom occurrence in patients experiencing acute exacerbations of chronic obstructive pulmonary disease(AECOPD).It will explore the composition of symptom clusters and analyze the correlation between these clusters and health-related quality of life(HRQoL).Methods:A total of 207 patients with AE-COPD were surveyed from a tertiary grade A hospital.Data collection was conducted using three validated instruments:the Basic Information Questionnaire(BIQ),Disease Symptom Survey Questionnaire(MSAS),and Quality of Life Questionnaire(CAT).Statistical software SPSS 22.0 was used to analyze the correlation between symptom clusters and quality of life.Results:Exploratory factor analysis showed that five major symptom clusters existed in the patients,including the psycho-emotional symptom cluster,the sleep-related symptom cluster,the other side effects symptom cluster,the energy deficiency symptom cluster and the cough-loss of appetite symptom cluster,and the severity of the symptom clusters showed a significant negative correlation with the quality of life of the patients(P<0.05).Conclusion:Strengthening the comprehensive management of symptom clusters in patients with AE-COPD can help to effectively reduce the symptom burden of patients,and then significantly improve their quality of life.
基金supported by the CCF-NSFOCUS‘Kunpeng’Research Fund(CCF-NSFOCUS2024012).
文摘In recent years,with the rapid development of software systems,the continuous expansion of software scale and the increasing complexity of systems have led to the emergence of a growing number of software metrics.Defect prediction methods based on software metric elements highly rely on software metric data.However,redundant software metric data is not conducive to efficient defect prediction,posing severe challenges to current software defect prediction tasks.To address these issues,this paper focuses on the rational clustering of software metric data.Firstly,multiple software projects are evaluated to determine the preset number of clusters for software metrics,and various clustering methods are employed to cluster the metric elements.Subsequently,a co-occurrence matrix is designed to comprehensively quantify the number of times that metrics appear in the same category.Based on the comprehensive results,the software metric data are divided into two semantic views containing different metrics,thereby analyzing the semantic information behind the software metrics.On this basis,this paper also conducts an in-depth analysis of the impact of different semantic view of metrics on defect prediction results,as well as the performance of various classification models under these semantic views.Experiments show that the joint use of the two semantic views can significantly improve the performance of models in software defect prediction,providing a new understanding and approach at the semantic view level for defect prediction research based on software metrics.
基金Supported by National Natural Science Foundation of China,No.32270768,No.82273970,No.32070726,and No.82370715National Key R&D Program of China,No.2023YFC2507904the Innovation Group Project of Hubei Province,No.2023AFA026.
文摘Hepatitis B virus remains a major cause of cirrhosis and hepatocellular carcinoma,with genetic polymorphisms and mutations influencing immune responses and disease progression.Nguyen et al present novel findings on specific human leukocyte antigen(HLA)alleles,including rs2856718 of HLA-DQ and rs3077 and rs9277535 of HLA-DP,which may predispose individuals to cirrhosis and liver cancer,based on multi-clustering analysis.Here,we discuss the feasibility of this approach and identify key areas for further investigation,aiming to offer insights for advancing clinical practice and research in liver disease and related cancers.
基金supported by the Deanship of Scientific Research(DSR),King Abdulaziz University,Jeddah,under grant No.IPP:533-611-2025DSR technical and financial support.
文摘In the effort to enhance cardiovascular diagnostics,deep learning-based heart sound classification presents a promising solution.This research introduces a novel preprocessing method:iterative k-means clustering combined with silhouette score analysis,aimed at downsampling.This approach ensures optimal cluster formation and improves data quality for deep learning models.The process involves applying k-means clustering to the dataset,calculating the average silhouette score for each cluster,and selecting the clusterwith the highest score.We evaluated this method using 10-fold cross-validation across various transfer learningmodels fromdifferent families and architectures.The evaluation was conducted on four datasets:a binary dataset,an augmented binary dataset,amulticlass dataset,and an augmentedmulticlass dataset.All datasets were derived from the Heart Wave heart sounds dataset,a novelmulticlass dataset introduced by our research group.To increase dataset sizes and improve model training,data augmentation was performed using heartbeat cycle segmentation.Our findings highlight the significant impact of the proposed preprocessing approach on the HeartWave datasets.Across all datasets,model performance improved notably with the application of our method.In augmented multiclass classification,the MobileNetV2 model showed an average weighted F1-score improvement of 27.10%.In binary classification,ResNet50 demonstrated an average accuracy improvement of 8.70%,reaching 92.40%compared to its baseline performance.These results underscore the effectiveness of clustering with silhouette score analysis as a preprocessing step,significantly enhancing model accuracy and robustness.They also emphasize the critical role of preprocessing in addressing class imbalance and advancing precision medicine in cardiovascular diagnostics.
基金The National Natural Science Foundation of China,Grant/Award Number:52178080Major Research Project of the Hospital Management Research Institute of the National Health Commission,Grant/Award Number:GY2023011National Institute of Hospital Administration Management of China,Grant/Award Number:GY2023049。
文摘Remarkable progress has been made in infection prevention and control(IPC)in many countries,but some gaps emerged in the context of the coronavirus disease 2019(COVID-19)pandemic.Core capabilities such as standard clinical precautions and tracing the source of infection were the focus of IPC in medical institutions during the pandemic.Therefore,the core competences of IPC professionals during the pandemic,and how these contributed to successful prevention and control of the epidemic,should be studied.To investigate,using a systematic review and cluster analysis,fundamental improvements in the competences of infection control and prevention professionals that may be emphasized in light of the COVID-19 pandemic.We searched the PubMed,Embase,Cochrane Library,Web of Science,CNKI,WanFang Data,and CBM databases for original articles exploring core competencies of IPC professionals during the COVID-19 pandemic(from January 1,2020 to February 7,2023).Weiciyun software was used for data extraction and the Donohue formula was followed to distinguish high-frequency technical terms.Cluster analysis was performed using the within-group linkage method and squared Euclidean distance as the metric to determine the priority competencies for development.We identified 46 studies with 29 high-frequency technical terms.The most common term was“infection prevention and control training”(184 times,17.3%),followed by“hand hygiene”(172 times,16.2%).“Infection prevention and control in clinical practice”was the most-reported core competency(367 times,34.5%),followed by“microbiology and surveillance”(292 times,27.5%).Cluster analysis showed two key areas of competence:Category 1(program management and leadership,patient safety and occupational health,education and microbiology and surveillance)and Category 2(IPC in clinical practice).During the COVID-19 pandemic,IPC program management and leadership,microbiology and surveillance,education,patient safety,and occupational health were the most important focus of development and should be given due consideration by IPC professionals.
基金support from the Natural Science Foundation of China (No.61873205)。
文摘Analysis of cluster behaviors of the on-coming cluster is an essential measure for high value locations on the battlefield.Unlike target tracking and clustering analysis,cluster behaviors analysis under cluster security requirements remains an open issue,which is a joint analysis of clus-tering,intent reasoning and activity regions.To address this issue,a framework for cluster behav-iors analysis is proposed by incorporating expert knowledge and domain knowledge,and a knowledge-assisted score function with is designed to improve the accuracy of intent reasoning net-work,overcoming the effects of possible knowledge errors.The framework consists of three mod-ules for cluster analysis,intent reasoning and activity region analysis for typical tasks,in which an intent reasoning network is constructed to obtain cluster intents by using a hybrid knowledge and data driven approach.Furthermore,considering the complexity of the battlefield environment,dif-ferent tasks and corresponding activity region optimization functions are designed for cluster activ-ity region analysis,which are vital elements of cluster behaviors analysis.Simulations demonstrate the effectiveness of the proposed cluster behaviors analysis framework.
文摘Efficient iterative unsupervised machine learning involving probabilistic clustering analysis with the expectation-maximization(EM)clustering algorithm is applied to categorize reservoir facies by exploiting latent and observable well-log variables from a clastic reservoir in the Majnoon oilfield,southern Iraq.The observable well-log variables consist of conventional open-hole,well-log data and the computer-processed interpretation of gamma rays,bulk density,neutron porosity,compressional sonic,deep resistivity,shale volume,total porosity,and water saturation,from three wells located in the Nahr Umr reservoir.The latent variables include shale volume and water saturation.The EM algorithm efficiently characterizes electrofacies through iterative machine learning to identify the local maximum likelihood estimates(MLE)of the observable and latent variables in the studied dataset.The optimized EM model developed successfully predicts the core-derived facies classification in two of the studied wells.The EM model clusters the data into three distinctive reservoir electrofacies(F1,F2,and F3).F1 represents a gas-bearing electrofacies with low shale volume(Vsh)and water saturation(Sw)and high porosity and permeability values identifying it as an attractive reservoir target.The results of the EM model are validated using nuclear magnetic resonance(NMR)data from the third studied well for which no cores were recovered.The NMR results confirm the effectiveness and accuracy of the EM model in predicting electrofacies.The utilization of the EM algorithm for electrofacies classification/cluster analysis is innovative.Specifically,the clusters it establishes are less rigidly constrained than those derived from the more commonly used K-means clustering method.The EM methodology developed generates dependable electrofacies estimates in the studied reservoir intervals where core samples are not available.Therefore,once calibrated with core data in some wells,the model is suitable for application to other wells that lack core data.
基金supported by the Science and Technology Project of State Grid Jiangxi Electric Power Corporation Limited‘Research on Key Technologies for Non-Intrusive Load Identification for Typical Power Industry Users in Jiangxi Province’(521852220004)。
文摘Studying user electricity consumption behavior is crucial for understanding their power usage patterns.However,the traditional clustering methods fail to identify emerging types of electricity consumption behavior.To address this issue,this paper introduces a statistical analysis of clusters and evaluates the set of indicators for power usage patterns.The fuzzy C-means clustering algorithm is then used to analyze 6 months of electricity consumption data in 2017 from energy storage equipment,agricultural drainage irrigation,port shore power,and electric vehicles.Finally,the proposed method is validated through experiments,where the Davies-Bouldin index and profile coefficient are calculated and compared.Experiments showed that the optimal number of clusters is 4.This study demonstrates the potential of using a fuzzy C-means clustering algorithmin identifying emerging types of electricity consumption behavior,which can help power system operators and policymakers to make informed decisions and improve energy efficiency.
文摘A significant portion of Landslide Early Warning Systems (LEWS) relies on the definition of operational thresholds and the monitoring of cumulative rainfall for alert issuance. These thresholds can be obtained in various ways, but most often they are based on previous landslide data. This approach introduces several limitations. For instance, there is a requirement for the location to have been previously monitored in some way to have this type of information recorded. Another significant limitation is the need for information regarding the location and timing of incidents. Despite the current ease of obtaining location information (GPS, drone images, etc.), the timing of the event remains challenging to ascertain for a considerable portion of landslide data. Concerning rainfall monitoring, there are multiple ways to consider it, for instance, examining accumulations over various intervals (1 h, 6 h, 24 h, 72 h), as well as in the calculation of effective rainfall, which represents the precipitation that actually infiltrates the soil. However, in the vast majority of cases, both the thresholds and the rain monitoring approach are defined manually and subjectively, relying on the operators’ experience. This makes the process labor-intensive and time-consuming, hindering the establishment of a truly standardized and rapidly scalable methodology on a large scale. In this work, we propose a Landslides Early Warning System (LEWS) based on the concept of rainfall half-life and the determination of thresholds using Cluster Analysis and data inversion. The system is designed to be applied in extensive monitoring networks, such as the one utilized by Cemaden, Brazil’s National Center for Monitoring and Early Warning of Natural Disasters.
基金This paper is supported by State Grid Gansu Electric Power Company Science and Technology Project(20220515003).
文摘To solve the problems of a few optical fibre line fault samples and the inefficiency of manual communication optical fibre fault diagnosis,this paper proposes a communication optical fibre fault diagnosis model based on variational modal decomposition(VMD),fuzzy entropy(FE)and fuzzy clustering(FC).Firstly,based on the OTDR curve data collected in the field,VMD is used to extract the different modal components(IMF)of the original signal and calculate the fuzzy entropy(FE)values of different components to characterize the subtle differences between them.The fuzzy entropy of each curve is used as the feature vector,which in turn constructs the communication optical fibre feature vector matrix,and the fuzzy clustering algorithm is used to achieve fault diagnosis of faulty optical fibre.The VMD-FE combination can extract subtle differences in features,and the fuzzy clustering algorithm does not require sample training.The experimental results show that the model in this paper has high accuracy and is relevant to the maintenance of communication optical fibre when compared with existing feature extraction models and traditional machine learning models.
文摘This paper investigates the design essence of Chinese classical private gardens,integrating their design elements and fundamental principles.It systematically analyzes the unique characteristics and differences among classical private gardens in the Northern,Jiangnan,and Lingnan regions.The study examines nine classical private gardens from Northern China,Jiangnan,and Lingnan by utilizing the advanced tool of principal component cluster analysis.Based on literature analysis and field research,273 variables were selected for principal component analysis,from which four components with higher contribution rates were chosen for further study.Subsequently,we employed clustering analysis techniques to compare the differences among the three types of gardens.The results reveal that the first principal component effectively highlights the differences between Jiangnan and Lingnan private gardens.The second principal component serves as the key to defining the types of Northern private gardens and distinguishing them from the other two types,and the third principal component indicates that Lingnan private gardens can be categorized into two distinct types as well.
基金supported in part by the National Natural Science Foundation of China(No.62276206)the Key Research and Development Program of Shaanxi under Grant S2022-YF-YBGY-0921+2 种基金the State Key Program of National Natural Science of China(No.62231027)supported by the Science and Technology on Communication Information Security Control Laboratory。
文摘With the development of information technology,radio communication technology has made rapid progress.Many radio signals that have appeared in space are difficult to classify without manually labeling.Unsupervised radio signal clustering methods have recently become an urgent need for this situation.Meanwhile,the high complexity of deep learning makes it difficult to understand the decision results of the clustering models,making it essential to conduct interpretable analysis.This paper proposed a combined loss function for unsupervised clustering based on autoencoder.The combined loss function includes reconstruction loss and deep clustering loss.Deep clustering loss is added based on reconstruction loss,which makes similar deep features converge more in feature space.In addition,a features visualization method for signal clustering was proposed to analyze the interpretability of autoencoder utilizing Saliency Map.Extensive experiments have been conducted on a modulated signal dataset,and the results indicate the superior performance of our proposed method over other clustering algorithms.In particular,for the simulated dataset containing six modulation modes,when the SNR is 20dB,the clustering accuracy of the proposed method is greater than 78%.The interpretability analysis of the clustering model was performed to visualize the significant features of different modulated signals and verified the high separability of the features extracted by clustering model.
基金supported by the Fundamental Research Funds for the Central Universities(22120240094)Humanities and Social Science Fund of Ministry of Education China(22YJA630082).
文摘Failure mode and effect analysis(FMEA)is a preven-tative risk evaluation method used to evaluate and eliminate fail-ure modes within a system.However,the traditional FMEA method exhibits many deficiencies that pose challenges in prac-tical applications.To improve the conventional FMEA,many modified FMEA models have been suggested.However,the majority of them inadequately address consensus issues and focus on achieving a complete ranking of failure modes.In this research,we propose a new FMEA approach that integrates a two-stage consensus reaching model and a density peak clus-tering algorithm for the assessment and clustering of failure modes.Firstly,we employ the interval 2-tuple linguistic vari-ables(I2TLVs)to express the uncertain risk evaluations provided by FMEA experts.Then,a two-stage consensus reaching model is adopted to enable FMEA experts to reach a consensus.Next,failure modes are categorized into several risk clusters using a density peak clustering algorithm.Finally,the proposed FMEA is illustrated by a case study of load-bearing guidance devices of subway systems.The results show that the proposed FMEA model can more easily to describe the uncertain risk information of failure modes by using the I2TLVs;the introduction of an endogenous feedback mechanism and an exogenous feedback mechanism can accelerate the process of consensus reaching;and the density peak clustering of failure modes successfully improves the practical applicability of FMEA.