439 articles found
1. Blind source separation by weighted K-means clustering (cited by: 5)
Authors: Yi Qingming. Journal of Systems Engineering and Electronics (SCIE, EI, CSCD), 2008, No. 5, pp. 882-887 (6 pages).
Blind separation of sparse sources (BSSS) is discussed. The BSSS method based on conventional K-means clustering is very fast and easy to implement. However, the accuracy of this method is generally not satisfactory. The contributions of vectors x(t) with different moduli are theoretically proved to be unequal, and a weighted K-means clustering method is proposed on these grounds. The proposed algorithm is not only as fast as the conventional K-means clustering method but also achieves considerably accurate results, as demonstrated by numerical experiments.
Keywords: blind source separation; underdetermined mixing; sparse representation; weighted k-means clustering
2. Improved k-means clustering algorithm (cited by: 16)
Authors: 夏士雄, 李文超, 周勇, 张磊, 牛强. Journal of Southeast University (English Edition) (EI, CAS), 2007, No. 3, pp. 435-438 (4 pages).
To address two drawbacks of the k-means algorithm, namely that the number of clusters must be known in advance and that results are sensitive to the choice of initial cluster centers, an improved k-means clustering algorithm is proposed. First, the concept of a silhouette coefficient is introduced, and the optimal number of clusters Kopt for a data set with unknown class information is determined by calculating the silhouette coefficient of the objects in clusters under different K values. Then the distribution of the data set is obtained through hierarchical clustering and the initial cluster centers are determined. Finally, the clustering is completed by traditional k-means clustering. Theoretical analysis shows that the improved k-means clustering algorithm has acceptable computational complexity. Experimental results on the IRIS test data set show that the algorithm distinguishes different clusters reasonably, recognizes outliers efficiently, and generates lower entropy.
Keywords: clustering; k-means algorithm; silhouette coefficient
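The silhouette coefficient mentioned in this abstract can be sketched in a few lines. The following is a minimal illustration of the standard definition, s = (b - a) / max(a, b), averaged over all points; the function names and the toy data are assumptions made for this sketch and are not taken from the paper.

```python
import math

def euclidean(p, q):
    """Euclidean distance between two equal-length tuples."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

def silhouette_score(points, labels):
    """Mean silhouette coefficient; points in singleton clusters contribute 0."""
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")
    total = 0.0
    for p, lab in zip(points, labels):
        own = clusters[lab]
        if len(own) == 1:
            continue  # common convention: s = 0 for a singleton cluster
        # a: mean distance to the other members of the point's own cluster
        a = sum(euclidean(p, q) for q in own) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster
        b = min(sum(euclidean(p, q) for q in other) / len(other)
                for other_lab, other in clusters.items() if other_lab != lab)
        if max(a, b) > 0:
            total += (b - a) / max(a, b)
    return total / len(points)

# Hypothetical toy data: two well-separated groups on a line.
points = [(0.0,), (1.0,), (10.0,), (11.0,)]
good = silhouette_score(points, [0, 0, 1, 1])  # natural grouping
bad = silhouette_score(points, [0, 1, 0, 1])   # groups deliberately mixed
```

A natural grouping scores close to 1 and a mixed grouping scores below 0, which is why the score can be compared across candidate values of K.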
3. An efficient enhanced k-means clustering algorithm (cited by: 30)
Authors: FAHIM A.M, SALEM A.M, TORKEY F.A, RAMADAN M.A. Journal of Zhejiang University-Science A (Applied Physics & Engineering) (SCIE, EI, CAS, CSCD), 2006, No. 10, pp. 1626-1633 (8 pages).
In k-means clustering, we are given a set of n data points in d-dimensional space R^d and an integer k, and the problem is to determine a set of k points in R^d, called centers, so as to minimize the mean squared distance from each data point to its nearest center. In this paper, we present a simple and efficient clustering algorithm based on the k-means algorithm, which we call the enhanced k-means algorithm. This algorithm is easy to implement, requiring a simple data structure to keep some information in each iteration to be used in the next iteration. Our experimental results demonstrated that our scheme can improve the computational speed of the k-means algorithm by the magnitude in the total number of distance calculations and the overall time of computation.
Keywords: clustering algorithms; cluster analysis; k-means algorithm; data analysis
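The objective stated in this abstract (mean squared distance from each point to its nearest center) and the baseline iteration it refers to can be sketched as follows. This is plain Lloyd-style k-means, not the enhanced algorithm of the paper, whose bookkeeping data structure is not described in the abstract; the function names and toy data are assumptions for illustration.

```python
def squared_distance(p, q):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def mean_squared_distance(points, centers):
    """k-means objective: average squared distance to the nearest center."""
    return sum(min(squared_distance(p, c) for c in centers)
               for p in points) / len(points)

def kmeans(points, initial_centers, max_iter=100):
    """Standard k-means: alternate assignment and mean update until stable."""
    centers = [tuple(c) for c in initial_centers]
    for _ in range(max_iter):
        # Assignment step: attach every point to its nearest center.
        clusters = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)),
                          key=lambda i: squared_distance(p, centers[i]))
            clusters[nearest].append(p)
        # Update step: move each center to the mean of its members.
        new_centers = []
        for old, members in zip(centers, clusters):
            if members:
                new_centers.append(
                    tuple(sum(coord) / len(members) for coord in zip(*members)))
            else:
                new_centers.append(old)  # keep an empty cluster's center in place
        if new_centers == centers:
            break
        centers = new_centers
    return centers

# Hypothetical toy data: two groups of two points each.
points = [(0, 0), (0, 2), (10, 0), (10, 2)]
centers = kmeans(points, [(0, 0), (10, 0)])
objective = mean_squared_distance(points, centers)
```

Every iteration of this baseline recomputes all n x k distances, which is the cost that distance-saving variants aim to reduce.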
4. Multifactor diagnostic model of converter energy consumption based on K-means algorithm and its application
Authors: Fei-xiang Dai, Guang Chen, Xiang-jun Bao, Gong-guo Liu, Lu Zhang, Xiao-jing Yang. Journal of Iron and Steel Research International, 2025, No. 8, pp. 2359-2369 (11 pages).
To address the challenge of identifying the primary causes of energy consumption fluctuations and accurately assessing the influence of various factors in the converter unit of an iron and steel plant, the focus is placed on the critical components of material and heat balance. Through a thorough analysis of the interactions between various components and energy consumption, six pivotal factors have been identified: raw material composition, steel type, steel temperature, slag temperature, recycling practices, and operational parameters. Using a framework based on an equivalent energy consumption model, an integrated intelligent diagnostic model has been developed that encapsulates these factors, providing a comprehensive assessment tool for converter energy consumption. Using the K-means clustering algorithm, historical operational data from the converter have been analyzed to determine baseline values for essential variables such as energy consumption and recovery rates. Building on this data-driven foundation, an online system for the intelligent diagnosis of converter energy consumption has been built and implemented, enhancing the precision and efficiency of energy management. When applied to energy consumption data from a steel plant in 2023, the diagnostic analysis performed by the system exposed significant variations in energy usage across different converter units. The analysis revealed that the most significant factor influencing the variation in energy consumption for both furnaces was the steel grade, with contributions of −0.550 and 0.379.
Keywords: equivalent energy consumption model; intelligent diagnostic model; K-means clustering algorithm; online system; energy management
5. Similarity matrix-based K-means algorithm for text clustering (cited by: 1)
Authors: 曹奇敏, 郭巧, 吴向华. Journal of Beijing Institute of Technology (EI, CAS), 2015, No. 4, pp. 566-572 (7 pages).
The K-means algorithm is one of the most widely used algorithms in clustering analysis. To deal with the problem caused by the random selection of initial center points in the traditional algorithm, this paper proposes an improved K-means algorithm based on the similarity matrix. The improved algorithm avoids the random selection of initial center points, so it provides effective initial points for the clustering process and reduces the fluctuation of clustering results that results from initial point selection, yielding better clustering quality. The experimental results also show that the F-measure of the improved K-means algorithm is greatly improved and the clustering results are more stable.
Keywords: text clustering; K-means algorithm; similarity matrix; F-measure
6. Development of slope mass rating system using K-means and fuzzy c-means clustering algorithms (cited by: 1)
Authors: Jalali Zakaria. International Journal of Mining Science and Technology (SCIE, EI, CSCD), 2016, No. 6, pp. 959-966 (8 pages).
Classification systems such as Slope Mass Rating (SMR) are currently used to undertake slope stability analysis. In the SMR classification system, data are allocated to certain classes based on linguistic and experience-based criteria. To eliminate linguistic criteria resulting from experience-based judgments and to account for uncertainties in determining the class boundaries developed by the SMR system, the system's classification results were corrected using two clustering algorithms, namely K-means and fuzzy c-means (FCM), for the ratings obtained via continuous and discrete functions. By applying clustering algorithms in the SMR classification system, no advance experience-based judgment was made on the number of extracted classes, and only after all steps of the clustering algorithms were completed was a new classification scheme proposed for the SMR system under different failure modes, based on the ratings obtained via continuous and discrete functions. The results of this study showed that engineers can achieve more reliable and objective evaluations of slope stability by using the SMR system based on the ratings calculated via continuous and discrete functions.
Keywords: SMR based on continuous functions; slope stability analysis; K-means and FCM clustering algorithms; validation of clustering algorithms; Sangan iron ore mines
7. Hybrid Genetic Algorithm with K-Means for Clustering Problems (cited by: 1)
Authors: Ahamed Al Malki, Mohamed M. Rizk, M. A. El-Shorbagy, A. A. Mousa. Open Journal of Optimization, 2016, No. 2, pp. 71-83 (14 pages).
The K-means method is one of the most widely used clustering methods and has been implemented in many fields of science and technology. One of the major problems of the k-means algorithm is that it may produce empty clusters depending on the initial center vectors. Genetic Algorithms (GAs) are adaptive heuristic search algorithms based on the evolutionary principles of natural selection and genetics. This paper presents a hybrid version of the k-means algorithm with GAs that efficiently eliminates this empty cluster problem. Results of simulation experiments using several data sets prove our claim.
Keywords: cluster analysis; genetic algorithm; K-means
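The empty cluster problem described in this abstract is easy to reproduce: if an initial center lies far from every data point, no point is assigned to it. The sketch below only illustrates how the problem arises; it does not implement the GA hybrid of the paper, and the function names and toy data are assumptions for illustration.

```python
def squared_distance(p, q):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def assign(points, centers):
    """Return, for each point, the index of its nearest center."""
    return [min(range(len(centers)),
                key=lambda i: squared_distance(p, centers[i]))
            for p in points]

def empty_clusters(labels, k):
    """Indices of centers that received no points in the assignment step."""
    used = set(labels)
    return [i for i in range(k) if i not in used]

# Hypothetical toy data: the third initial center is far from all points.
points = [(0.0,), (1.0,), (2.0,)]
initial_centers = [(0.0,), (1.0,), (100.0,)]
labels = assign(points, initial_centers)
empties = empty_clusters(labels, len(initial_centers))
```

Here the center at 100.0 attracts no points, so its mean is undefined in the update step unless the implementation handles that case explicitly.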
8. Plant Leaf Diseases Classification Using Improved K-Means Clustering and SVM Algorithm for Segmentation
Authors: Mona Jamjoom, Ahmed Elhadad, Hussein Abulkasim, Safia Abbas. Computers, Materials & Continua (SCIE, EI), 2023, No. 7, pp. 367-382 (16 pages).
Several pests feed on leaves, stems, bases, and the entire plant, causing plant illnesses. As a result, it is vital to identify and eliminate the disease before it causes any damage to plants. Manually detecting and treating plant disease is quite challenging. Image processing is employed to detect plant disease, since manual detection requires much effort and an extended processing period. The main goal of this study is to discover the disease that affects the plants by creating an image processing system that can recognize and classify four different forms of plant diseases: Phytophthora infestans, Fusarium graminearum, Puccinia graminis, and tomato yellow leaf curl. This work uses the support vector machine (SVM) classifier to detect and classify the plant disease through steps including image acquisition, pre-processing, segmentation, feature extraction, and classification. The gray level co-occurrence matrix (GLCM) and local binary pattern (LBP) features are used to identify the disease-affected portion of the plant leaf. According to experimental data, the proposed technology can correctly detect and diagnose plant sickness with 97.2 percent accuracy.
Keywords: SVM; machine learning; GLCM algorithm; k-means clustering; LBP
9. An Improved K-Means Algorithm Based on Initial Clustering Center Optimization
Authors: LI Taihao, NAREN Tuya, ZHOU Jianshe, REN Fuji, LIU Shupeng. ZTE Communications, 2017, No. B12, pp. 43-46 (4 pages).
The K-means algorithm is widely known for its simplicity and speed in text clustering. However, the selection of the initial clustering center in the traditional K-means algorithm is somewhat random, and therefore the fluctuations and instability of the clustering results are strongly affected by the initial clustering center. This paper proposes an algorithm for selecting the initial clustering center that eliminates the uncertainty of central point selection. The experimental results show that the improved K-means clustering algorithm is superior to the traditional algorithm.
Keywords: clustering; K-means algorithm; initial clustering center
10. A State of Art Analysis of Telecommunication Data by k-Means and k-Medoids Clustering Algorithms
Authors: T. Velmurugan. Journal of Computer and Communications, 2018, No. 1, pp. 190-202 (13 pages).
Cluster analysis is one of the major data analysis methods widely used for many practical applications in emerging areas of data mining. A good clustering method will produce high quality clusters with high intra-cluster similarity and low inter-cluster similarity. Clustering techniques are applied in different domains to predict future trends of available data and its uses for the real world. This research work is carried out to find the performance of two of the most delegated, partition-based clustering algorithms, namely k-Means and k-Medoids. A state of the art analysis of these two algorithms is implemented, and performance is analyzed based on the quality of their clustering results by means of execution time and other components. Telecommunication data is the source data for this analysis. The connection-oriented broadband data is given as input to find the clustering quality of the algorithms. The distance between the server locations and their connections is considered for clustering. Execution time for each algorithm is analyzed and the results are compared with one another. Results found in the comparison study are satisfactory for the chosen application.
Keywords: k-Means algorithm; k-Medoids algorithm; data clustering; time complexity; telecommunication data
11. Fault Diagnosis Model Based on Fuzzy Support Vector Machine Combined with Weighted Fuzzy Clustering (cited by: 3)
Authors: 张俊红, 马文朋, 马梁, 何振鹏. Transactions of Tianjin University (EI, CAS), 2013, No. 3, pp. 174-181 (8 pages).
A fault diagnosis model is proposed based on a fuzzy support vector machine (FSVM) combined with fuzzy clustering (FC). Considering the relationship between the sample point and the non-self class, the FC algorithm is applied to generate fuzzy memberships. In the algorithm, sample weights based on a distribution density function of data points and a genetic algorithm (GA) are introduced to enhance the performance of FC. Then a multi-class FSVM with a radial basis function kernel is established according to the directed acyclic graph algorithm, the penalty factor and kernel parameter of which are optimized by GA. Finally, the model is executed for multi-class fault diagnosis of rolling element bearings. The results show that the presented model achieves high performance in identifying both fault types and fault degrees. Performance comparisons of the presented model with SVM and distance-based FSVM for the noisy case demonstrate its capacity for dealing with noise and for generalization.
Keywords: fuzzy support vector machine; fuzzy clustering; sample weight; genetic algorithm; parameter optimization; fault diagnosis
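12. K-means algorithm based on adaptive dynamic feature weighting
Authors: 薛雷, 王天放. 《吉林大学学报(理学版)》 (Journal of Jilin University (Science Edition)) (PKU Core), 2025, No. 5, pp. 1404-1410 (7 pages).
First, to address the problems of the traditional K-means algorithm on high-dimensional heterogeneous data, namely that the assumption of equally important features causes important features to be overlooked, that clustering results are highly sensitive to the preset number of clusters, and that the algorithm depends strongly on the choice of initial center points, an adaptive dynamic feature weighting K-means (ADFW-K-means) algorithm is proposed. The algorithm combines dynamic feature weighting, K-means++ initialization, elbow-method-assisted selection of the number of clusters, an empty-cluster handling mechanism, and an adaptive strategy for adjusting the number of clusters. Second, experiments are conducted on a dataset of Jilin University selected graduates (选调生) for 2022-2024. The results show that, compared with traditional clustering algorithms, ADFW-K-means improves significantly on three core metrics: silhouette coefficient, clustering stability, and business interpretability. ADFW-K-means effectively overcomes the inherent shortcomings of traditional methods and markedly improves the accuracy and robustness of clustering on complex high-dimensional heterogeneous data.
Keywords: adaptive number of clusters; dynamic feature weighting; k-means algorithm; clustering algorithm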
13. Genetic Algorithm Combined with the K-Means Algorithm: A Hybrid Technique for Unsupervised Feature Selection
Authors: Hachemi Bennaceur, Meznah Almutairy, Norah Alhussain. Intelligent Automation & Soft Computing (SCIE), 2023, No. 9, pp. 2687-2706 (20 pages).
The dimensionality of data is increasing very rapidly, which creates challenges for most current mining and learning algorithms, such as large memory requirements and high computational costs. The literature includes much research on feature selection for supervised learning. However, feature selection for unsupervised learning has only recently been studied. Finding the subset of features in unsupervised learning that enhances performance is challenging since the clusters are indeterminate. This work proposes a hybrid technique for unsupervised feature selection called GAk-MEANS, which combines the genetic algorithm (GA) approach with the classical k-Means algorithm. In the proposed algorithm, a new fitness function is designed in addition to new smart crossover and mutation operators. The effectiveness of this algorithm is demonstrated on various datasets. Furthermore, the performance of GAk-MEANS has been compared with other genetic algorithms, such as the genetic algorithm using the Sammon Error Function and the genetic algorithm using the Sum of Squared Error Function. Additionally, the performance of GAk-MEANS is compared with state-of-the-art statistical unsupervised feature selection techniques. Experimental results show that GAk-MEANS consistently selects subsets of features that result in better classification accuracy compared to the others. In particular, GAk-MEANS is able to significantly reduce the size of the subset of selected features by an average of 86.35% (72%-96.14%), which leads to an increase in accuracy by an average of 3.78% (1.05%-6.32%) compared to using all features. When compared with the genetic algorithm using the Sammon Error Function, GAk-MEANS is able to reduce the size of the subset of selected features by 41.29% on average, improve the accuracy by 5.37%, and reduce the time by 70.71%. When compared with the genetic algorithm using the Sum of Squared Error Function, GAk-MEANS on average is able to reduce the size of the subset of selected features by 15.91% and improve the accuracy by 9.81%, but the time is increased by a factor of 3. When compared with the machine-learning based methods, GAk-MEANS is able to increase the accuracy by 13.67% on average with an 88.76% average increase in time.
Keywords: genetic algorithm; unsupervised feature selection; k-means clustering
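14. An improved feature-weighted K-means clustering algorithm (cited by: 12)
Authors: 王慧, 申石磊. 《微电子学与计算机》 (Microelectronics & Computer) (CSCD, PKU Core), 2010, No. 7, pp. 161-163, 167 (4 pages).
An improved feature-weighted K-means clustering algorithm is proposed. The algorithm first selects initial cluster centers based on the distribution of the data samples, and then designs a feature-weighted K-means clustering algorithm. Experimental results show that the algorithm produces high-quality clustering results and can handle both numeric and symbolic data.
Keywords: clustering; k-means algorithm; cluster centers; feature weighting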
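15. Improved K-means algorithm based on initial cluster center optimization and inter-dimension weighting (cited by: 7)
Authors: 王越, 王泉, 吕奇峰, 曾晶. 《重庆理工大学学报(自然科学)》 (Journal of Chongqing University of Technology (Natural Science)) (CAS), 2013, No. 4, pp. 77-80 (4 pages).
To address the K-means algorithm's sensitivity to randomly selected initial cluster centers and its low partition accuracy, an improved K-means algorithm is presented. First, the selection of the initial cluster centers is improved; then the dimension with the largest difference among the sample points is weighted. The clustering results of the original algorithm and the improved K-means algorithm are compared on the Iris dataset. Experiments show that the improved algorithm is stable and achieves a clustering accuracy of 92%.
Keywords: clustering; K-means algorithm; initial cluster centers; inter-dimension weighting; Iris dataset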
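16. K-means clustering algorithm based on attribute weight optimization (cited by: 10)
Authors: 熊平, 顾霄. 《微电子学与计算机》 (Microelectronics & Computer) (CSCD, PKU Core), 2014, No. 4, pp. 40-43 (4 pages).
Clustering is one of the most commonly used data mining algorithms. To improve the quality of clustering results, a k-means clustering algorithm based on attribute weight optimization is proposed using the Lagrange multiplier method. When computing the distance between a sample and a centroid, the algorithm assigns each attribute a weight that indicates its importance, and in each iteration it automatically computes the optimal attribute weights according to the changes in the centroid vectors, so that the sum of the distances between all samples and their centroids is minimized. Experimental results confirm the advantage of this method over the traditional k-means algorithm.
Keywords: clustering algorithm; attribute weights; data mining; objective function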
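Attribute-weighted distances and a Lagrange-multiplier weight update can be sketched as below. The abstract does not give the paper's objective function, so this sketch assumes one common formulation: minimize the sum over attributes of w_j^beta * D_j subject to the weights summing to 1, where D_j is the within-cluster dispersion of attribute j. Under that assumption the Lagrange condition gives w_j proportional to D_j^(-1/(beta-1)). The function names, the exponent beta, and the toy data are all hypothetical.

```python
def weighted_sq_distance(x, c, weights, beta=2.0):
    """Squared distance in which attribute j is scaled by weights[j] ** beta."""
    return sum((w ** beta) * (a - b) ** 2 for w, a, b in zip(weights, x, c))

def update_weights(points, labels, centroids, beta=2.0):
    """Closed-form weights for the assumed objective sum_j w_j**beta * D_j.

    Requires beta > 1 and a strictly positive dispersion D_j for every attribute.
    """
    dims = len(points[0])
    dispersion = [0.0] * dims
    for x, lab in zip(points, labels):
        for j in range(dims):
            dispersion[j] += (x[j] - centroids[lab][j]) ** 2
    # Attributes with small within-cluster spread receive large weights.
    inverse = [d ** (-1.0 / (beta - 1.0)) for d in dispersion]
    total = sum(inverse)
    return [v / total for v in inverse]

# Hypothetical toy data: attribute 0 is tight within clusters, attribute 1 is noisy.
points = [(0.0, 0.0), (2.0, 4.0), (10.0, 0.0), (12.0, 4.0)]
labels = [0, 0, 1, 1]
centroids = [(1.0, 2.0), (11.0, 2.0)]
weights = update_weights(points, labels, centroids)
distance = weighted_sq_distance(points[0], centroids[0], weights)
```

In a full algorithm this update would alternate with the usual assignment and centroid steps, with the assignment step using the weighted distance.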
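17. An efficient global K-means clustering algorithm based on weighted space partitioning (cited by: 2)
Authors: 曲福恒, 潘曰涛, 杨勇, 胡雅婷, 宋剑飞, 魏成宇. 《吉林大学学报(工学版)》 (Journal of Jilin University (Engineering and Technology Edition)) (EI, CAS, CSCD, PKU Core), 2024, No. 5, pp. 1393-1400 (8 pages).
To address the heavy computation caused by the global K-means clustering algorithm exhaustively enumerating sample points, an efficient global K-means clustering algorithm based on weighted space partitioning is proposed. The algorithm first partitions the sample space into a grid, then applies a density criterion and a distance criterion to filter the grid cells, keeping cells that have high density and are far from one another as candidate center cells. To avoid the limitation that the global K-means algorithm selects candidate centers only from the sample set, a weight criterion and a center iteration strategy are proposed to expand the set of candidate centers and increase their diversity. Finally, the final clustering result is obtained by traversing the candidate centers in an incremental clustering manner. Experimental results on UCI datasets show that, compared with the global K-means algorithm, the new algorithm improves computational efficiency by 89.39%~95.79% on average while maintaining clustering accuracy. Compared with K-means++, IK-+, and the recently proposed CD algorithm, the new algorithm is more accurate and overcomes the instability of clustering results caused by random initialization.
Keywords: artificial intelligence; k-means algorithm; cluster centers; grid partitioning; weights; incremental clustering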
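18. Improved K-means algorithm with weighted distance based on maximizing between-cluster difference (cited by: 2)
Authors: 张雪凤, 刘鹏. 《山东大学学报(理学版)》 (Journal of Shandong University (Natural Science)) (CAS, CSCD, PKU Core), 2010, No. 7, pp. 28-33 (6 pages).
To improve the clustering quality of the K-means algorithm, the clustering criterion function is defined as a weighted within-cluster sum of the squared error (SSE), and the method for reassigning data objects during the K-means iteration is adjusted: a weighted distance that incorporates the number of data objects in the cluster is used as the basis for reassignment, and the parameter in the weighted distance is optimized under the criterion of maximizing between-cluster difference. Experiments show that the improved K-means algorithm largely reduces the splitting of large clusters and clearly improves clustering quality.
Keywords: k-means algorithm; clustering; between-cluster difference; weighted distance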
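19. A robust optimization algorithm for K-means cluster centers (cited by: 7)
Authors: 罗倩. 《计算机工程与设计》 (Computer Engineering and Design) (PKU Core), 2015, No. 9, pp. 2395-2400 (6 pages).
To address the K-means algorithm's sensitivity to randomly selected initial cluster centers and its unstable, insufficiently accurate clustering results, a robust cluster-center optimization algorithm based on distance weighting of neighborhood data is proposed. By establishing a data density constraint, cluster centers are optimized into data-dense regions, which effectively overcomes problems such as the poor stability of K-means clustering results. Experiments on simulated data and standard datasets verify that the cluster centers to which the algorithm converges are very close to the actual centers of the standard datasets, and that the algorithm has good clustering accuracy, robustness, and convergence speed.
Keywords: k-means clustering algorithm; initial cluster centers; neighborhood distance weighting; clustering optimization; robust algorithm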
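20. Weighted K-means clustering algorithm based on logistic regression functions (cited by: 8)
Authors: 林丽, 薛芳. 《集美大学学报(自然科学版)》 (Journal of Jimei University (Natural Science)) (CAS), 2021, No. 2, pp. 139-145 (7 pages).
The traditional K-means clustering algorithm computes sample similarity using Euclidean distance, treating all attribute features of the data equally and ignoring the different contribution of each feature, which lowers the accuracy of the similarity computation. To address this shortcoming, a feature-weighted K-means algorithm is proposed. First, the Softmax and Sigmoid logistic regression functions are used to compute feature weights, so that the weighted Euclidean distance represents sample similarity more accurately. Second, the strategy for selecting initial cluster centers is optimized: K samples that are far apart are chosen as the initial cluster centers, which effectively avoids misclustering of samples and the empty-cluster problem. Experimental results show that, on UCI standard datasets, the weighted K-means clustering algorithm effectively reduces the number of iterations and improves clustering accuracy, precision, and recall.
Keywords: Euclidean distance; feature-weighted k-means algorithm; logistic regression functions; initial cluster centers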
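Choosing K mutually distant samples as initial centers can be sketched with a farthest-first traversal. The abstract does not specify the paper's exact selection rule, so both the traversal and the choice of the first center (the sample farthest from the data mean) are assumptions made for this sketch, as are the function names and toy data.

```python
def squared_distance(p, q):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def farthest_first_centers(points, k):
    """Pick k samples that are far apart using a greedy max-min rule."""
    dims = len(points[0])
    mean = [sum(p[j] for p in points) / len(points) for j in range(dims)]
    # Assumed starting rule: the sample farthest from the overall mean.
    centers = [max(points, key=lambda p: squared_distance(p, mean))]
    while len(centers) < k:
        # Next center: the sample whose nearest chosen center is farthest away.
        centers.append(max(
            points,
            key=lambda p: min(squared_distance(p, c) for c in centers)))
    return centers

# Hypothetical toy data: three visually separate groups.
points = [(0, 0), (1, 0), (10, 0), (11, 0), (5, 8)]
initial_centers = farthest_first_centers(points, 3)
```

Because every chosen center is itself a data point and the centers are spread out, each one starts with at least its own sample nearby, which is one reason this style of initialization tends to avoid empty clusters.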