期刊文献+
共找到5,146篇文章
< 1 2 250 >
每页显示 20 50 100
An Improved Kernel K-Mean Cluster Method and Its Application in Fault Diagnosis of Roller Bearing 被引量:2
1
作者 Ling-Li Jiang Yu-Xiang Cao +1 位作者 Hua-Kui Yin Kong-Shu Deng 《Engineering(科研)》 2013年第1期44-49,共6页
For the kernel K-mean cluster method is run in an implicit feature space, the initial and iterative cluster centers cannot be defined explicitly. Against the deficiency of the initial cluster centers selected in the o... For the kernel K-mean cluster method is run in an implicit feature space, the initial and iterative cluster centers cannot be defined explicitly. Against the deficiency of the initial cluster centers selected in the original space discretionarily in the existing methods, this paper proposes a new method for ensuring the clustering center that virtual clustering centers are defined in the feature space by the original classification as the initial cluster centers and the iteration clustering centers are ensured by the further virtual classification. The improved method is used for fault diagnosis of roller bearing that achieves a good cluster and diagnosis result, which demonstrates the effectiveness of the proposed method. 展开更多
关键词 IMPROVED KERNEL k-mean cluster FAULT Diagnosis ROLLER BEARING
暂未订购
Equivalent Modeling with Passive Filter Parameter Clustering for Photovoltaic Power Stations Based on a Particle Swarm Optimization K-Means Algorithm
2
作者 Binjiang Hu Yihua Zhu +3 位作者 Liang Tu Zun Ma Xian Meng Kewei Xu 《Energy Engineering》 2026年第1期431-459,共29页
This paper proposes an equivalent modeling method for photovoltaic(PV)power stations via a particle swarm optimization(PSO)K-means clustering(KMC)algorithm with passive filter parameter clustering to address the compl... This paper proposes an equivalent modeling method for photovoltaic(PV)power stations via a particle swarm optimization(PSO)K-means clustering(KMC)algorithm with passive filter parameter clustering to address the complexities,simulation time cost and convergence problems of detailed PV power station models.First,the amplitude–frequency curves of different filter parameters are analyzed.Based on the results,a grouping parameter set for characterizing the external filter characteristics is established.These parameters are further defined as clustering parameters.A single PV inverter model is then established as a prerequisite foundation.The proposed equivalent method combines the global search capability of PSO with the rapid convergence of KMC,effectively overcoming the tendency of KMC to become trapped in local optima.This approach enhances both clustering accuracy and numerical stability when determining equivalence for PV inverter units.Using the proposed clustering method,both a detailed PV power station model and an equivalent model are developed and compared.Simulation and hardwarein-loop(HIL)results based on the equivalent model verify that the equivalent method accurately represents the dynamic characteristics of PVpower stations and adapts well to different operating conditions.The proposed equivalent modeling method provides an effective analysis tool for future renewable energy integration research. 展开更多
关键词 Photovoltaic power station multi-machine equivalentmodeling particle swarmoptimization k-means clustering algorithm
在线阅读 下载PDF
Fuzzy k-Means Clustering-Based Machine Learning Models for LFO Damping in Electric Power System Networks
3
作者 Md Shafiullah 《Computer Modeling in Engineering & Sciences》 2026年第2期803-830,共28页
Various factors,including weak tie-lines into the electric power system(EPS)networks,can lead to low-frequency oscillations(LFOs),which are considered an instant,non-threatening situation,but slow-acting and poisonous... Various factors,including weak tie-lines into the electric power system(EPS)networks,can lead to low-frequency oscillations(LFOs),which are considered an instant,non-threatening situation,but slow-acting and poisonous.Considering the challenge mentioned,this article proposes a clustering-based machine learning(ML)framework to enhance the stability of EPS networks by suppressing LFOs through real-time tuning of key power system stabilizer(PSS)parameters.To validate the proposed strategy,two distinct EPS networks are selected:the single-machine infinite-bus(SMIB)with a single-stage PSS and the unified power flow controller(UPFC)coordinated SMIB with a double-stage PSS.To generate data under various loading conditions for both networks,an efficient but offline meta-heuristic algorithm,namely the grey wolf optimizer(GWO),is used,with the loading conditions as inputs and the key PSS parameters as outputs.The generated loading conditions are then clustered using the fuzzy k-means(FKM)clustering method.Finally,the group method of data handling(GMDH)and long short-term memory(LSTM)ML models are developed for clustered data to predict PSS key parameters in real time for any loading condition.A few well-known statistical performance indices(SPI)are considered for validation and robustness of the training and testing procedure of the developed FKM-GMDH and FKM-LSTM models based on the prediction of PSS parameters.The performance of the ML models is also evaluated using three stability indices(i.e.,minimum damping ratio,eigenvalues,and time-domain simulations)after optimally tuned PSS with real-time estimated parameters under changing operating conditions.Besides,the outputs of the offline(GWO-based)metaheuristic model,proposed real-time(FKM-GMDH and FKM-LSTM)machine learning models,and previously reported literature models are compared.According to the results,the proposed methodology outperforms the others in enhancing the stability of the selected EPS networks by damping out the observed unwanted LFOs under various loading conditions. 展开更多
关键词 Fuzzy k-means clustering grey wolf optimizer group method of data handling long short-term memory low-frequency oscillation power system stabilizer single machine infinite bus STABILITY unified power flow controller
在线阅读 下载PDF
Visual field prediction using K-means clustering in patients with primary open angle glaucoma
4
作者 Junyoung Lee Jihun Kim +5 位作者 Hwayoung Kim Sangwoo Moon EunAh Kim Sanghun Jeong Hojin Yang Jiwoong Lee 《International Journal of Ophthalmology(English edition)》 2026年第1期63-68,共6页
AIM:To evaluate long-term visual field(VF)prediction using K-means clustering in patients with primary open angle glaucoma(POAG).METHODS:Patients who underwent 24-2 VF tests≥10 were included in this study.Using 52 to... AIM:To evaluate long-term visual field(VF)prediction using K-means clustering in patients with primary open angle glaucoma(POAG).METHODS:Patients who underwent 24-2 VF tests≥10 were included in this study.Using 52 total deviation values(TDVs)from the first 10 VF tests of the training dataset,VF points were clustered into several regions using the hierarchical ordered partitioning and collapsing hybrid(HOPACH)and K-means clustering.Based on the clustering results,a linear regression analysis was applied to each clustered region of the testing dataset to predict the TDVs of the 10th VF test.Three to nine VF tests were used to predict the 10th VF test,and the prediction errors(root mean square error,RMSE)of each clustering method and pointwise linear regression(PLR)were compared.RESULTS:The training group consisted of 228 patients(mean age,54.20±14.38y;123 males and 105 females),and the testing group included 81 patients(mean age,54.88±15.22y;43 males and 38 females).All subjects were diagnosed with POAG.Fifty-two VF points were clustered into 11 and nine regions using HOPACH and K-means clustering,respectively.K-means clustering had a lower prediction error than PLR when n=1:3 and 1:4(both P≤0.003).The prediction errors of K-means clustering were lower than those of HOPACH in all sections(n=1:4 to 1:9;all P≤0.011),except for n=1:3(P=0.680).PLR outperformed K-means clustering only when n=1:8 and 1:9(both P≤0.020).CONCLUSION:K-means clustering can predict longterm VF test results more accurately in patients with POAG with limited VF data. 展开更多
关键词 k-means clustering hierarchical ordered partitioning and collapsing hybrid pointwise linear regression visual field prediction
原文传递
Geochemical and Geostatistical Studies for Estimating Gold Grade in Tarq Prospect Area by K-Means Clustering Method 被引量:7
5
作者 Adel Shirazy Aref Shirazi +1 位作者 Mohammad Hossein Ferdossi Mansour Ziaii 《Open Journal of Geology》 2019年第6期306-326,共21页
Tarq geochemical 1:100,000 Sheet is located in Isfahan province which is investigated by Iran’s Geological and Explorations Organization using stream sediment analyzes. This area has stratigraphy of Precambrian to Qu... Tarq geochemical 1:100,000 Sheet is located in Isfahan province which is investigated by Iran’s Geological and Explorations Organization using stream sediment analyzes. This area has stratigraphy of Precambrian to Quaternary rocks and is located in the Central Iran zone. According to the presence of signs of gold mineralization in this area, it is necessary to identify important mineral areas in this area. Therefore, finding information is necessary about the relationship and monitoring the elements of gold, arsenic, and antimony relative to each other in this area to determine the extent of geochemical halos and to estimate the grade. Therefore, a well-known and useful K-means method is used for monitoring the elements in the present study, this is a clustering method based on minimizing the total Euclidean distances of each sample from the center of the classes which are assigned to them. In this research, the clustering quality function and the utility rate of the sample have been used in the desired cluster (S(i)) to determine the optimum number of clusters. Finally, with regard to the cluster centers and the results, the equations were used to predict the amount of the gold element based on four parameters of arsenic and antimony grade, length and width of sampling points. 展开更多
关键词 GOLD Tarq k-meanS clustering method Estimation of the ELEMENTS GRADE k-meanS
暂未订购
Landslide susceptibility zonation method based on C5.0 decision tree and K-means cluster algorithms to improve the efficiency of risk management 被引量:26
6
作者 Zizheng Guo Yu Shi +2 位作者 Faming Huang Xuanmei Fan Jinsong Huang 《Geoscience Frontiers》 SCIE CAS CSCD 2021年第6期243-261,共19页
Machine learning algorithms are an important measure with which to perform landslide susceptibility assessments, but most studies use GIS-based classification methods to conduct susceptibility zonation.This study pres... Machine learning algorithms are an important measure with which to perform landslide susceptibility assessments, but most studies use GIS-based classification methods to conduct susceptibility zonation.This study presents a machine learning approach based on the C5.0 decision tree(DT) model and the K-means cluster algorithm to produce a regional landslide susceptibility map. Yanchang County, a typical landslide-prone area located in northwestern China, was taken as the area of interest to introduce the proposed application procedure. A landslide inventory containing 82 landslides was prepared and subsequently randomly partitioned into two subsets: training data(70% landslide pixels) and validation data(30% landslide pixels). Fourteen landslide influencing factors were considered in the input dataset and were used to calculate the landslide occurrence probability based on the C5.0 decision tree model.Susceptibility zonation was implemented according to the cut-off values calculated by the K-means cluster algorithm. The validation results of the model performance analysis showed that the AUC(area under the receiver operating characteristic(ROC) curve) of the proposed model was the highest, reaching 0.88,compared with traditional models(support vector machine(SVM) = 0.85, Bayesian network(BN) = 0.81,frequency ratio(FR) = 0.75, weight of evidence(WOE) = 0.76). The landslide frequency ratio and frequency density of the high susceptibility zones were 6.76/km^(2) and 0.88/km^(2), respectively, which were much higher than those of the low susceptibility zones. The top 20% interval of landslide occurrence probability contained 89% of the historical landslides but only accounted for 10.3% of the total area.Our results indicate that the distribution of high susceptibility zones was more focused without containing more " stable" pixels. Therefore, the obtained susceptibility map is suitable for application to landslide risk management practices. 展开更多
关键词 Landslide susceptibility Frequency ratio C5.0 decision tree k-means cluster Classification Risk management
在线阅读 下载PDF
Optimizing basis wave functions in the generator coordinate method for microscopic cluster models (Ⅰ)
7
作者 Yi‑Fan Liu Bo Zhou Yu‑Gang Ma 《Nuclear Science and Techniques》 2025年第10期183-191,共9页
We employed random distributions and gradient descent methods for the Generator Coordinate Method(GCM)to identify effective basis wave functions,taking halo nuclei ^(6)He and ^(6)Li as examples.By comparing the ground... We employed random distributions and gradient descent methods for the Generator Coordinate Method(GCM)to identify effective basis wave functions,taking halo nuclei ^(6)He and ^(6)Li as examples.By comparing the ground state(0^(+))energy of ^(6)He and the excited state(0^(+))energy of 6 Li calculated with various random distributions and manually selected generation coordinates,we found that the heavy tail characteristic of the logistic distribution better describes the features of the halo nuclei.Subsequently,the Adam algorithm from machine learning was applied to optimize the basis wave functions,indicating that a limited number of basis wave functions can approximate the converged values.These results offer some empirical insights for selecting basis wave functions and contribute to the broader application of machine learning methods in predicting effective basis wave functions. 展开更多
关键词 Generator Coordinate method Effective basis wave functions Nuclear cluster model Machine learning Halo nuclei
在线阅读 下载PDF
Level-shifted embedded cluster method may offer a viable alternative for the simulation of metal oxides
8
作者 Zi-Jian Zhou Xin-Ping Wu 《Chinese Journal of Structural Chemistry》 2025年第5期1-2,共2页
The use of metal oxides has been extensively documented in the literature and applied in a variety of contexts,including but not limited to energy storage,chemical sensors,and biomedical applications.One of the most s... The use of metal oxides has been extensively documented in the literature and applied in a variety of contexts,including but not limited to energy storage,chemical sensors,and biomedical applications.One of the most significant applications of metal oxides is heterogeneous catalysis,which represents a pivotal technology in industrial production on a global scale.Catalysts serve as the primary enabling agents for chemical reactions,and among the plethora of catalysts,metal oxides including magnesium oxide(MgO),ceria(CeO_(2))and titania(TiO_(2)),have been identified to be particularly effective in catalyzing a variety of reactions[1].Theoretical calculations based on density functional theory(DFT)and a multitude of other quantum chemistry methods have proven invaluable in elucidating the mechanisms of metal-oxide-catalyzed reactions,thereby facilitating the design of high-performance catalysts[2]. 展开更多
关键词 chemical reactionsand industrial production heterogeneous catalysiswhich metal oxides energy storagechemical biomedical applicationsone level shifted embedded cluster method catalystsmetal oxides
原文传递
Quantitative Method of Classification and Discrimination of a Porous Carbonate Reservoir Integrating K-means Clustering and Bayesian Theory
9
作者 FANG Xinxin ZHU Guotao +2 位作者 YANG Yiming LI Fengling FENG Hong 《Acta Geologica Sinica(English Edition)》 SCIE CAS CSCD 2023年第1期176-189,共14页
Reservoir classification is a key link in reservoir evaluation.However,traditional manual means are inefficient,subjective,and classification standards are not uniform.Therefore,taking the Mishrif Formation of the Wes... Reservoir classification is a key link in reservoir evaluation.However,traditional manual means are inefficient,subjective,and classification standards are not uniform.Therefore,taking the Mishrif Formation of the Western Iraq as an example,a new reservoir classification and discrimination method is established by using the K-means clustering method and the Bayesian discrimination method.These methods are applied to non-cored wells to calculate the discrimination accuracy of the reservoir type,and thus the main reasons for low accuracy of reservoir discrimination are clarified.The results show that the discrimination accuracy of reservoir type based on K-means clustering and Bayesian stepwise discrimination is strongly related to the accuracy of the core data.The discrimination accuracy rate of TypeⅠ,TypeⅡ,and TypeⅤreservoirs is found to be significantly higher than that of TypeⅢand TypeⅣreservoirs using the method of combining K-means clustering and Bayesian theory based on logging data.Although the recognition accuracy of the new methodology for the TypeⅣreservoir is low,with average accuracy the new method has reached more than 82%in the entire study area,which lays a good foundation for rapid and accurate discrimination of reservoir types and the fine evaluation of a reservoir. 展开更多
关键词 UPSTREAM resource exploration reservoir classification CARBONATE k-means clustering Bayesian discrimination CENOMANIAN-TURONIAN Iraq
在线阅读 下载PDF
Oversampling Method Based on Gaussian Distribution and K-Means Clustering
10
作者 Masoud Muhammed Hassan Adel Sabry Eesa +1 位作者 Ahmed Jameel Mohammed Wahab Kh.Arabo 《Computers, Materials & Continua》 SCIE EI 2021年第10期451-469,共19页
Learning from imbalanced data is one of the greatest challenging problems in binary classification,and this problem has gained more importance in recent years.When the class distribution is imbalanced,classical machin... Learning from imbalanced data is one of the greatest challenging problems in binary classification,and this problem has gained more importance in recent years.When the class distribution is imbalanced,classical machine learning algorithms tend to move strongly towards the majority class and disregard the minority.Therefore,the accuracy may be high,but the model cannot recognize data instances in the minority class to classify them,leading to many misclassifications.Different methods have been proposed in the literature to handle the imbalance problem,but most are complicated and tend to simulate unnecessary noise.In this paper,we propose a simple oversampling method based on Multivariate Gaussian distribution and K-means clustering,called GK-Means.The new method aims to avoid generating noise and control imbalances between and within classes.Various experiments have been carried out with six classifiers and four oversampling methods.Experimental results on different imbalanced datasets show that the proposed GK-Means outperforms other oversampling methods and improves classification performance as measured by F1-score and Accuracy. 展开更多
关键词 Class imbalance OVERSAMPLING GAUSSIAN multivariate distribution k-means clustering
在线阅读 下载PDF
Visitor segmentation in alpine tourism:Evidence from a survey-based cluster analysis in northern Italy
11
作者 Francesca VISINTIN Elisa TOMASINSIG +4 位作者 Laura PAGANI Ivana BASSI Vanessa DEOTTO Lucia MONTEFIORI Luca ISEPPI 《Journal of Mountain Science》 2026年第2期738-754,共17页
This study addresses the persistent scarcity of systematic and comparable data on mountain tourism,with particular reference to Northern Italy,as highlighted by FAO/UNWTO reports and recent academic literature.It aims... This study addresses the persistent scarcity of systematic and comparable data on mountain tourism,with particular reference to Northern Italy,as highlighted by FAO/UNWTO reports and recent academic literature.It aims to contribute to this gap by analyzing tourist flows,socio-demographic characteristics,preferences,and behaviors of domestic visitors to the Italian Alps.Data were collected through a survey conducted between December 2023 and January 2024 among 1,218 residents of Northwest and Northeast Italy and Friuli Venezia Giulia,using a stratified sampling approach.Descriptive statistics and inferential analyses were employed to examine visitation patterns,while K-means clustering was applied to identify distinct segments of mountain tourists based on activity preferences and motivations.Overall,82.5%of respondents reported visiting Alpine areas.Chi-square tests revealed statistically significant differences in visitation behavior according to age,occupational status,and income.Notably,spiritual activities,such as pilgrimages,elicited levels of interest comparable to those of more traditional mountain sports.The cluster analysis identified three visitor profiles:Active Young Enthusiasts,characterized by high engagement in multiple outdoor activities and motivated by psychological well-being and cultural enrichment;Well-being-Oriented Walkers,preferring low-intensity activities primarily driven by psychological relaxation;and Hiking-Oriented Explorers,exhibiting a strong propensity for mountain excursions associated with high levels of psychophysical well-being.These findings enhance understanding of the heterogeneous structure of mountain tourism demand in Northern Italy and offer insights relevant to sustainable destination planning and management in Alpine regions. 展开更多
关键词 Mountain tourism Visitor segmentation k-means clustering Tourist behavior Activity-based segmentation Italian Alps
原文传递
A Quantum-Inspired Algorithm for Clustering and Intrusion Detection
12
作者 Gang Xu Lefeng Wang +5 位作者 Yuwei Huang Yong Lu Xin Liu Weijie Tan Zongpeng Li Xiu-Bo Chen 《Computers, Materials & Continua》 2026年第4期1180-1215,共36页
The Intrusion Detection System(IDS)is a security mechanism developed to observe network traffic and recognize suspicious or malicious activities.Clustering algorithms are often incorporated into IDS;however,convention... The Intrusion Detection System(IDS)is a security mechanism developed to observe network traffic and recognize suspicious or malicious activities.Clustering algorithms are often incorporated into IDS;however,conventional clustering-based methods face notable drawbacks,including poor scalability in handling high-dimensional datasets and a strong dependence of outcomes on initial conditions.To overcome the performance limitations of existing methods,this study proposes a novel quantum-inspired clustering algorithm that relies on a similarity coefficient-based quantum genetic algorithm(SC-QGA)and an improved quantum artificial bee colony algorithm hybrid K-means(IQABC-K).First,the SC-QGA algorithmis constructed based on quantum computing and integrates similarity coefficient theory to strengthen genetic diversity and feature extraction capabilities.For the subsequent clustering phase,the process based on the IQABC-K algorithm is enhanced with the core improvement of adaptive rotation gate and movement exploitation strategies to balance the exploration capabilities of global search and the exploitation capabilities of local search.Simultaneously,the acceleration of convergence toward the global optimum and a reduction in computational complexity are facilitated by means of the global optimum bootstrap strategy and a linear population reduction strategy.Through experimental evaluation with multiple algorithms and diverse performance metrics,the proposed algorithm confirms reliable accuracy on three datasets:KDD CUP99,NSL_KDD,and UNSW_NB15,achieving accuracy of 98.57%,98.81%,and 98.32%,respectively.These results affirm its potential as an effective solution for practical clustering applications. 展开更多
关键词 Intrusion detection clusterING quantum artificial bee colony algorithm k-meanS quantum genetic algorithm
在线阅读 下载PDF
Service Quality Evaluation of Civil Airports Based on CRITIC‑Bidirectional Grey Possibility Clustering Model
13
作者 ZU Lili LI Xun +1 位作者 WANG Junjie DANG Yaoguo 《Transactions of Nanjing University of Aeronautics and Astronautics》 2026年第1期110-126,共17页
With the rapid development of the aviation industry,air travel has become one of the most important modes.Improving the service quality of civil aviation airports is crucial to their competitiveness.This study intends... With the rapid development of the aviation industry,air travel has become one of the most important modes.Improving the service quality of civil aviation airports is crucial to their competitiveness.This study intends to develop a scientific and rational evaluation methodology and framework for assessing service quality in civil aviation airports,thereby providing a theoretical foundation and practical guidance for enhancing service standards in the aviation industry.First,the study constructs a CRITIC-bidirectional grey possibility clustering model,which uses the CRITIC method to determine the weights of indicators and integrates the forward grey possibility clustering model and the inverse grey possibility clustering model to determine possibility functions from two perspectives.Second,a service quality evaluation index system for civil airports is constructed from four dimensions,and the weights of each index within the system are subsequently calculated.Finally,the constructed model is applied to evaluate the service quality of nine domestic civil airports.Based on the clustering results,targeted countermeasures and suggestions are proposed.Empirical results demonstrate that,compared to the traditional grey possibility clustering model,the proposed model balances the objectivity of indicator weighting,the objectivity of possibility function construction,and the simplicity of the computational process,thereby possessing significant theoretical and practical implications. 展开更多
关键词 CRITIC method grey clustering possibility functions civil airport service quality evaluation
在线阅读 下载PDF
Integrated diagnosis of abnormal energy consumption in converter steelmaking using GWO-SVM-K-means algorithms
14
作者 Fei-Xiang Dai Xiang-Jun Bao +2 位作者 Lu Zhang Xiao-Jing Yang Guang Chen 《Journal of Iron and Steel Research International》 2026年第1期458-468,共11页
To address the issue of abnormal energy consumption fluctuations in the converter steelmaking process,an integrated diagnostic method combining the gray wolf optimization(GWO)algorithm,support vector machine(SVM),and ... To address the issue of abnormal energy consumption fluctuations in the converter steelmaking process,an integrated diagnostic method combining the gray wolf optimization(GWO)algorithm,support vector machine(SVM),and K-means clustering was proposed.Eight input parameters—derived from molten iron conditions and external factors—were selected as feature variables.A GWO-SVM model was developed to accurately predict the energy consumption of individual heats.Based on the prediction results,the mean absolute percentage error and maximum relative error of the test set were employed as criteria to identify heats with abnormal energy usage.For these heats,the K-means clustering algorithm was used to determine benchmark values of influencing factors from similar steel grades,enabling root-cause diagnosis of excessive energy consumption.The proposed method was applied to real production data from a converter in a steel plant.The analysis reveals that heat sample No.44 exhibits abnormal energy consumption,due to gas recovery being 1430.28 kg of standard coal below the benchmark level.A secondary contributing factor is a steam recovery shortfall of 237.99 kg of standard coal.This integrated approach offers a scientifically grounded tool for energy management in converter operations and provides valuable guidance for optimizing process parameters and enhancing energy efficiency. 展开更多
关键词 Converter smelting process Abnormal energy diagnosis Gray wolf optimization algorithm Support vector machine k-means clustering algorithm
原文传递
Improved k-means clustering algorithm 被引量:16
15
作者 夏士雄 李文超 +2 位作者 周勇 张磊 牛强 《Journal of Southeast University(English Edition)》 EI CAS 2007年第3期435-438,共4页
In allusion to the disadvantage of having to obtain the number of clusters of data sets in advance and the sensitivity to selecting initial clustering centers in the k-means algorithm, an improved k-means clustering a... In allusion to the disadvantage of having to obtain the number of clusters of data sets in advance and the sensitivity to selecting initial clustering centers in the k-means algorithm, an improved k-means clustering algorithm is proposed. First, the concept of a silhouette coefficient is introduced, and the optimal clustering number Kopt of a data set with unknown class information is confirmed by calculating the silhouette coefficient of objects in clusters under different K values. Then the distribution of the data set is obtained through hierarchical clustering and the initial clustering-centers are confirmed. Finally, the clustering is completed by the traditional k-means clustering. By the theoretical analysis, it is proved that the improved k-means clustering algorithm has proper computational complexity. The experimental results of IRIS testing data set show that the algorithm can distinguish different clusters reasonably and recognize the outliers efficiently, and the entropy generated by the algorithm is lower. 展开更多
关键词 clusterING k-means algorithm silhouette coefficient
在线阅读 下载PDF
An efficient enhanced k-means clustering algorithm 被引量:30
16
作者 FAHIM A.M SALEM A.M +1 位作者 TORKEY F.A RAMADAN M.A 《Journal of Zhejiang University-Science A(Applied Physics & Engineering)》 SCIE EI CAS CSCD 2006年第10期1626-1633,共8页
In k-means clustering, we are given a set of n data points in d-dimensional space R^d and an integer k and the problem is to determine a set of k points in R^d, called centers, so as to minimize the mean squared dista... In k-means clustering, we are given a set of n data points in d-dimensional space R^d and an integer k and the problem is to determine a set of k points in R^d, called centers, so as to minimize the mean squared distance from each data point to its nearest center. In this paper, we present a simple and efficient clustering algorithm based on the k-means algorithm, which we call enhanced k-means algorithm. This algorithm is easy to implement, requiring a simple data structure to keep some information in each iteration to be used in the next iteration. Our experimental results demonstrated that our scheme can improve the computational speed of the k-means algorithm by the magnitude in the total number of distance calculations and the overall time of computation. 展开更多
关键词 clustering algorithms cluster analysis k-means algorithm Data analysis
在线阅读 下载PDF
Classification of Northeast China Cold Vortex Activity Paths in Early Summer Based on K-means Clustering and Their Climate Impact 被引量:13
17
作者 Yihe FANG Haishan CHEN +3 位作者 Yi LIN Chunyu ZHAO Yitong LIN Fang ZHOU 《Advances in Atmospheric Sciences》 SCIE CAS CSCD 2021年第3期400-412,共13页
The classification of the Northeast China Cold Vortex(NCCV)activity paths is an important way to analyze its characteristics in detail.Based on the daily precipitation data of the northeastern China(NEC)region,and the... The classification of the Northeast China Cold Vortex(NCCV)activity paths is an important way to analyze its characteristics in detail.Based on the daily precipitation data of the northeastern China(NEC)region,and the atmospheric circulation field and temperature field data of ERA-Interim for every six hours,the NCCV processes during the early summer(June)seasons from 1979 to 2018 were objectively identified.Then,the NCCV processes were classified using a machine learning method(k-means)according to the characteristic parameters of the activity path information.The rationality of the classification results was verified from two aspects,as follows:(1)the atmospheric circulation configuration of the NCCV on various paths;and(2)its influences on the climate conditions in the NEC.The obtained results showed that the activity paths of the NCCV could be divided into four types according to such characteristics as the generation origin,movement direction,and movement velocity of the NCCV.These included the generation-eastward movement type in the east of the Mongolia Plateau(eastward movement type or type A);generation-southeast longdistance movement type in the upstream of the Lena River(southeast long-distance movement type or type B);generationeastward less-movement type near Lake Baikal(eastward less-movement type or type C);and the generation-southward less-movement type in eastern Siberia(southward less-movement type or type D).There were obvious differences observed in the atmospheric circulation configuration and the climate impact of the NCCV on the four above-mentioned types of paths,which indicated that the classification results were reasonable. 展开更多
关键词 northeastern China early summer Northeast China Cold Vortex classification of activity paths machine learning method k-means clustering high-pressure blocking
在线阅读 下载PDF
Stable Initialization Scheme for K-Means Clustering 被引量:15
18
作者 XU Junling XU Baowen +2 位作者 ZHANG Weifeng ZHANG Wei HOU Jun 《Wuhan University Journal of Natural Sciences》 CAS 2009年第1期24-28,共5页
Though K-means is very popular for general clustering, its performance, which generally converges to numerous local minima, depends highly on initial cluster centers. In this paper a novel initialization scheme to sel... Though K-means is very popular for general clustering, its performance, which generally converges to numerous local minima, depends highly on initial cluster centers. In this paper a novel initialization scheme to select initial cluster centers for K-means clustering is proposed. This algorithm is based on reverse nearest neighbor (RNN) search which retrieves all points in a given data set whose nearest neighbor is a given query point. The initial cluster centers computed using this methodology are found to be very close to the desired cluster centers for iterative clustering algorithms. This procedure is applicable to clustering algorithms for continuous data. The application of the proposed algorithm to K-means clustering algorithm is demonstrated. An experiment is carried out on several popular datasets and the results show the advantages of the proposed method. 展开更多
关键词 clusterING unsupervised learning k-meanS INITIALIZATION
原文传递
Hierarchical hesitant fuzzy K-means clustering algorithm 被引量:21
19
作者 CHEN Na XU Ze-shui XIA Mei-mei 《Applied Mathematics(A Journal of Chinese Universities)》 SCIE CSCD 2014年第1期1-17,共17页
Due to the limitation and hesitation in one's knowledge, the membership degree of an element to a given set usually has a few different values, in which the conventional fuzzy sets are invalid. Hesitant fuzzy sets ar... Due to the limitation and hesitation in one's knowledge, the membership degree of an element to a given set usually has a few different values, in which the conventional fuzzy sets are invalid. Hesitant fuzzy sets are a powerful tool to treat this case. The present paper focuses on investigating the clustering technique for hesitant fuzzy sets based on the K-means clustering algorithm which takes the results of hierarchical clustering as the initial clusters. Finally, two examples demonstrate the validity of our algorithm. 展开更多
关键词 90B50 68T10 62H30 Hesitant fuzzy set hierarchical clustering k-means clustering intuitionisitc fuzzy set
在线阅读 下载PDF
Global Optimization Method Using SLE and Adaptive RBF Based on Fuzzy Clustering 被引量:8
20
作者 ZHU Huaguang LIU Li LONG Teng ZHAO Junfeng 《Chinese Journal of Mechanical Engineering》 SCIE EI CAS CSCD 2012年第4期768-775,共8页
High fidelity analysis models,which are beneficial to improving the design quality,have been more and more widely utilized in the modern engineering design optimization problems.However,the high fidelity analysis mode... High fidelity analysis models,which are beneficial to improving the design quality,have been more and more widely utilized in the modern engineering design optimization problems.However,the high fidelity analysis models are so computationally expensive that the time required in design optimization is usually unacceptable.In order to improve the efficiency of optimization involving high fidelity analysis models,the optimization efficiency can be upgraded through applying surrogates to approximate the computationally expensive models,which can greately reduce the computation time.An efficient heuristic global optimization method using adaptive radial basis function(RBF) based on fuzzy clustering(ARFC) is proposed.In this method,a novel algorithm of maximin Latin hypercube design using successive local enumeration(SLE) is employed to obtain sample points with good performance in both space-filling and projective uniformity properties,which does a great deal of good to metamodels accuracy.RBF method is adopted for constructing the metamodels,and with the increasing the number of sample points the approximation accuracy of RBF is gradually enhanced.The fuzzy c-means clustering method is applied to identify the reduced attractive regions in the original design space.The numerical benchmark examples are used for validating the performance of ARFC.The results demonstrates that for most application examples the global optima are effectively obtained and comparison with adaptive response surface method(ARSM) proves that the proposed method can intuitively capture promising design regions and can efficiently identify the global or near-global design optimum.This method improves the efficiency and global convergence of the optimization problems,and gives a new optimization strategy for engineering design optimization problems involving computationally expensive models. 展开更多
关键词 global optimization Latin hypercube design radial basis function fuzzy clustering adaptive response surface method
在线阅读 下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部