Water quality is a critical global issue,especially in urban and semi-urban regions where natural and anthropogenic factors significantly influence surface water systems.This study evaluates the hydrochemical characte...Water quality is a critical global issue,especially in urban and semi-urban regions where natural and anthropogenic factors significantly influence surface water systems.This study evaluates the hydrochemical characteristics of surface water in the North of Tehran Rivers(NTRs),an essential water resource in a rapidly urbanizing region,using advanced clustering techniques,including Hierarchical Clustering Analysis(HCA),Fuzzy CMeans(FCM),Genetic Algorithm Fuzzy C-Means(GAFCM),and Self-Organizing Map(SOM).The research aims to address the scientific challenge of understanding spatial and temporal variability in water quality,focusing on physicochemical parameters,hydrochemical facies,and contamination sources.Water samples from six rivers collected over four seasons in 2020 were analyzed and classified into distinct clusters based on their chemical composition,revealing significant seasonal and spatial differences.Results showed that FCM and GAFCM consistently categorized the NTRs into two clusters during winter and spring and three in summer and autumn.These findings were supported by HCA and SOM,which identified clusters corresponding to specific river segments and contamination levels.The primary hydrochemical processes identified were mineral dissolution and weathering,with calcite,dolomite,and aragonite significantly influencing water chemistry.Additionally,human activities,such as wastewater discharge,were shown to contribute to elevated sulfate,nitrate,and phosphate concentrations,further corroborated by microbial analyses.By integrating HCA,FCM,and GAFCM with an artificial neural network(ANN)-based clustering method(SOM),this study provides a robust framework for evaluating surface water quality.The findings,supported by Gibbs diagrams,Hounslow ion ratio,and saturation indices,highlight the dominance of rock weathering and human impacts in shaping the hydrochemical dynamics of the NTRs.These insights contribute to the scientific understanding of water quality dynamics and offer practical guidance for sustainable water resource management and environmental protection in developing urban areas.展开更多
The title of the online version of the original article was revised.The title of the original article has been revised to:Hydrochemical characterization of surface waters in Northern Tehran:Integrating cluster-based t...The title of the online version of the original article was revised.The title of the original article has been revised to:Hydrochemical characterization of surface waters in Northern Tehran:Integrating cluster-based techniques with Self-Organizing Maps.展开更多
Determination of homogenous precipitation-based regions is a very important task in effective management of water resources. The present study tried to propose an effective precipitation-based regionalization methodol...Determination of homogenous precipitation-based regions is a very important task in effective management of water resources. The present study tried to propose an effective precipitation-based regionalization methodology by conjugating both temporal pre-processing and spatial clustering approaches in a way to take advantage of multiscale properties of precipitation time series. Annual precipitation data of 51 years(1960-2010) for 31 rain gauges(RGs) were collected and used in proposed clustering approaches. Discreet wavelet transform(DWT) was used to capture the time-frequency attributes of the time series and multiscale regionalization was performed by using k-means and Self Organizing Maps(SOM) clustering techniques. Daubechies function(db) was selected as mother wavelet to decompose the precipitation time series. Also, proper boundary extensions and decomposition level were applied. Different combinations of the approximation(A) and detail(D) coefficients were used to determine the input dataset as a basis of spatial clustering. The proposed model's efficiency in spatial clustering stage was verified using three different indexes namely, Silhouette Coefficient(SC), Dunn index and Davis Bouldin index(DB). Results approved superior performance of k-means technique in comparison to SOM. It was also deduced that DWT-based regionalization methodology showed improvements in comparison to historical-based models. Cross mutual information was used to investigate the RGs of cluster 3's homogeneousness in DWT-k-means approach. Results of non-linear correlation approach verified homogeneity of cluster 3. Verifications based on mean annual precipitation values of rain gauges in each cluster also approved the capability of multiscale approach in precipitation regionalization.展开更多
A model integrating geo-information and self-organizing map(SOM) for exploring the database of soil environmental surveys was established. The dataset of 5 heavy metals(As, Cd, Cr, Hg, and Pb) was built by the regular...A model integrating geo-information and self-organizing map(SOM) for exploring the database of soil environmental surveys was established. The dataset of 5 heavy metals(As, Cd, Cr, Hg, and Pb) was built by the regular grid sampling in Hechi, Guangxi Zhuang Autonomous Region in southern China. Auxiliary datasets were collected throughout the study area to help interpret the potential causes of pollution. The main findings are as follows:(1) Soil samples of 5 elements exhibited strong variation and high skewness. High pollution risk existed in the case study area, especially Hg and Cd.(2) As and Pb had a similar topological distribution pattern, meaning they behaved similarly in the soil environment. Cr had behaviours in soil different from those of the other 4 elements.(3) From the U-matrix of SOM networks, 3 levels of SEQ were identified, and 11 high risk areas of soil heavy metal-contaminated were found throughout the study area, which were basically near rivers,factories, and ore zones.(4) The variations of contamination index(CI) followed the trend of construction land(1.353)> forestland(1.267)> cropland(1.175)> grassland(1.056), which suggest that decision makers should focus more on the problem of soil pollution surrounding industrial and mining enterprises and farmland.展开更多
To design microstructure and microhardness in the additive manufacturing(AM)of nickel(Ni)-based superalloys,the present work develops a novel data-driven approach that combines physics-based models,experimental measur...To design microstructure and microhardness in the additive manufacturing(AM)of nickel(Ni)-based superalloys,the present work develops a novel data-driven approach that combines physics-based models,experimental measurements,and a data-mining method.The simulation is based on a computational thermal-fluid dynamics(CtFD)model,which can obtain thermal behavior,solidification parameters such as cooling rate,and the dilution of solidified clad.Based on the computed thermal information,dendrite arm spacing and microhardness are estimated using well-tested mechanistic models.Experimental microstructure and microhardness are determined and compared with the simulated values for validation.To visualize process-structure-properties(PSPs)linkages,the simulation and experimental datasets are input to a data-mining model-a self-organizing map(SOM).The design windows of the process parameters under multiple objectives can be obtained from the visualized maps.The proposed approaches can be utilized in AM and other data-intensive processes.Data-driven linkages between process,structure,and properties have the potential to benefit online process monitoring control in order to derive an ideal microstructure and mechanical properties.展开更多
Varieties of approaches and algorithms have been presented to identify the distribution of elements. Previous researches based on the type of problem, categorized their data in proper clusters or classes. This means t...Varieties of approaches and algorithms have been presented to identify the distribution of elements. Previous researches based on the type of problem, categorized their data in proper clusters or classes. This means that the process of solution could be supervised or unsupervised. In cases, where there is no idea about dependency of samples to specific groups, clustering methods (unsupervised) are applied. About geochemistry data, since various elements are involved, in addition to the complex nature of geochemical data, clustering algorithms would be useful for recognition of elements distribution. In this paper, Self-Organizing Map (SOM) algorithm, as an unsupervised method, is applied for clustering samples based on REEs contents. For this reason the Choghart Fe-REE deposit (Bafq district, central Iran), was selected as study area and dataset was a collection of 112 lithology samples that were assayed with laboratory tests such as ICP-MS and XRF analysis. In this study, input vectors include 19 features which are coordinates x, y, z and concentrations of REEs as well as the concentration of Phosphate (P<sub>2</sub>O<sub>5</sub>) since the apatite is the main source of REEs in this particular research. Four clusters were determined as an optimal number of clusters using silhouette criterion as well as k-means clustering method and SOM. Therefore, using self-organizing map, study area was subdivided in four zones. These four zones can be described as phosphate type, albitofyre type, metasomatic and phosphorus iron ore, and Iron Ore type. Phosphate type is the most prone to rare earth elements. Eventually, results were validated with laboratory analysis.展开更多
Due to rapid urbanization, waterlogging induced by torrential rainfall has become a global concern and a potential risk affecting urban habitant's safety. Widespread waterlogging disasters haveoccurred almost annu...Due to rapid urbanization, waterlogging induced by torrential rainfall has become a global concern and a potential risk affecting urban habitant's safety. Widespread waterlogging disasters haveoccurred almost annuallyinthe urban area of Beijing, the capital of China. Based on a selforganizing map(SOM) artificial neural network(ANN), a graded waterlogging risk assessment was conducted on 56 low-lying points in Beijing, China. Social risk factors, such as Gross domestic product(GDP), population density, and traffic congestion, were utilized as input datasets in this study. The results indicate that SOM-ANNis suitable for automatically and quantitatively assessing risks associated with waterlogging. The greatest advantage of SOM-ANN in the assessment of waterlogging risk is that a priori knowledge about classification categories and assessment indicator weights is not needed. As a result, SOM-ANN can effectively overcome interference from subjective factors,producing classification results that are more objective and accurate. In this paper, the risk level of waterlogging in Beijing was divided into five grades. The points that were assigned risk grades of IV or Vwere located mainly in the districts of Chaoyang, Haidian, Xicheng, and Dongcheng.展开更多
Characterization of unknown groundwater contaminant sources in terms of location, magnitude and duration of source activity is a complex problem. In this study, to increase the efficiency and accuracy of source charac...Characterization of unknown groundwater contaminant sources in terms of location, magnitude and duration of source activity is a complex problem. In this study, to increase the efficiency and accuracy of source characterization an alternative methodology to the methodologies proposed earlier is developed. This methodology, Adaptive Surrogate Modeling Based Optimization (ASMBO) uses the capabilities of Self Organizing Map (SOM) algorithm to design the surrogate models and adaptive surrogate models for source characterization. The most important advantage of this methodology is its direct utilization for groundwater contaminant characterization without the necessity of utilizing a linked simulation optimization model. The validation of the SOM based surrogate models and SOM based adaptive surrogate models demonstrates that the quantity and quality of initial sample sizes have crucial role on the accuracy of solutions as the designed monitoring locations. The performance evaluation results of the proposed methodology are obtained using error free and erroneous concentration measurement data. These results demonstrate that the developed methodology could approximate groundwater flow and transport simulation models, and substitute the optimization model for characterization of unknown groundwater contaminant sources in terms of location, magnitude and duration of source activity.展开更多
This paper presents a real rough sets space and corresponding concepts of real lower and upper approximation sets which correspond to the real-valued attributes. Therefore, the real rough sets space can be investigate...This paper presents a real rough sets space and corresponding concepts of real lower and upper approximation sets which correspond to the real-valued attributes. Therefore, the real rough sets space can be investigated directly. A rhombus neighborhood for SOM is proposed, and the combination of SOM and rough sets theory is explored. According to the distance between the weight of winner node and the input vector in the real rough sets space, new weight learning rules are defined. The modified method makes the classification of the output of SOM clearer and the intervals of different classes larger. Finally, an example based on fault identification of an aircraft actuator is presented, The result of the simulation shows that this method is right and effective.展开更多
Considering that growing hierarchical self-organizing map(GHSOM) ignores the influence of individual component in sample vector analysis, and its accurate rate in detecting unknown network attacks is relatively lower,...Considering that growing hierarchical self-organizing map(GHSOM) ignores the influence of individual component in sample vector analysis, and its accurate rate in detecting unknown network attacks is relatively lower, an improved GHSOM method combined with mutual information is proposed. After theoretical analysis, experiments are conducted to illustrate the effectiveness of the proposed method by accurately clustering the input data. Based on different clusters, the complex relationship within the data can be revealed effectively.展开更多
A new clustering algorithm called fuzzy self-organizing feature maps is introduced. It can process not only the exact digital inputs, but also the inexact or fuzzy non-digital inputs, such as natural language inputs. ...A new clustering algorithm called fuzzy self-organizing feature maps is introduced. It can process not only the exact digital inputs, but also the inexact or fuzzy non-digital inputs, such as natural language inputs. Simulation results show that the new algorithm is superior to original Kohonen’s algorithm in clustering performance and learning rate.展开更多
The detailed analysis of individual rain events characteristics is an essential step for improving our understanding of variation in precipitation over different topographies. In this study, the homogeneity among rain...The detailed analysis of individual rain events characteristics is an essential step for improving our understanding of variation in precipitation over different topographies. In this study, the homogeneity among rain gauges was investigated using the concept of “rain event properties,” linking them to the main atmospheric system that affects the rainfall in the region. For this, eight properties of more than 23,000 rain events recorded at 47 meteorological stations in Mumbai, India, were analyzed utilizing seasonal (June-September) rainfall records over 2006-2016. The high similarities among the properties indicated the similarities among the rain gauges. Furthermore, similar rain gauges were distinguished, investigated and characterized by cluster analysis using self-organizing maps (SOM). The cluster analysis results show six clusters of similarly behaving rain gauges, where each cluster addresses one isolated class of variables for the rain gauge. Additionally, the clusters confirm the spatial variation of rainfall caused by the complex topography of Mumbai, comprising the flatland near the Arabian Sea, high-rise buildings (urban area) and mountain and hills areas (Sanjay Gandhi National Park located in the northern part of Mumbai).展开更多
This paper describes the evaluation method of the gait motion in walk rehabilitation. We assume that the evaluation consists of the classification of the measured data and the prediction of the feature of the gait mot...This paper describes the evaluation method of the gait motion in walk rehabilitation. We assume that the evaluation consists of the classification of the measured data and the prediction of the feature of the gait motion. The method may enable a doctor and a physical therapist to recognize the condition of the patients more easily, and increase the motivation of patient further for rehabilitation. However, it is difficult to divide the gait motion into discrete categories, since the gait motion continuously changes and does not have the clear boundaries. Therefore, the self-organizing map (SOM) that is able to arrange the continuous data on the almost continuous map is employed in order to classify them. And, the feature of the gait motion is predicted by the classification. In this study, we adopt the gravity-center fluctuation (GCF) on the sole as the measured data. First, it is shown that the pattern of the CCF that is obtained by our developed measurement system includes the feature of the gait motion. Secondly, the relation between the pattern of the GCF and the feature of the gait motion that the doctor and the physical therapist evaluate by visual inspection is considered using the SOM. Next, we describe the prediction of following features measured by numerical values: the length of stride, the velocity of walk and the difference of steps that are important for the doctor and the physical therapist to make a diagnosis of the condition of the gait motion in walk rehabilitation. Finally, it is investigated that the position of a new test data that is arranged on the map accords with the prediction. As a consequence, we confirm that the method using the SOM is often useful to classify and predict the condition of the patient.展开更多
The artificial neural networks (ANNs), among different soft computing methodologies are widely used to meet the challenges thrown by the main objectives of data mining classification techniques, due to their robust, p...The artificial neural networks (ANNs), among different soft computing methodologies are widely used to meet the challenges thrown by the main objectives of data mining classification techniques, due to their robust, powerful, distributed, fault tolerant computing and capability to learn in a data-rich environment. ANNs has been used in several fields, showing high performance as classifiers. The problem of dealing with non numerical data is one major obstacle prevents using them with various data sets and several domains. Another problem is their complex structure and how hands to interprets. Self-Organizing Map (SOM) is type of neural systems that can be easily interpreted, but still can’t be used with non numerical data directly. This paper presents an enhanced SOM structure to cope with non numerical data. It used DNA sequences as the training dataset. Results show very good performance compared to other classifiers. For better evaluation both micro-array structure and their sequential representation as proteins were targeted as dataset accuracy is measured accordingly.展开更多
Self-organizing map(SOM) proposed by Kohonen has obtained certain achievements in solving the traveling salesman problem(TSP).To improve Kohonen SOM,an effective initialization and parameter modification method is dis...Self-organizing map(SOM) proposed by Kohonen has obtained certain achievements in solving the traveling salesman problem(TSP).To improve Kohonen SOM,an effective initialization and parameter modification method is discussed to obtain a faster convergence rate and better solution.Therefore,a new improved self-organizing map(ISOM)algorithm is introduced and applied to four traveling salesman problem instances for experimental simulation,and then the result of ISOM is compared with those of four SOM algorithms:AVL,KL,KG and MSTSP.Using ISOM,the average error of four travelingsalesman problem instances is only 2.895 0%,which is greatly better than the other four algorithms:8.51%(AVL),6.147 5%(KL),6.555%(KG) and 3.420 9%(MSTSP).Finally,ISOM is applied to two practical problems:the Chinese 100 cities-TSP and102 counties-TSP in Shanxi Province,and the two optimal touring routes are provided to the tourists.展开更多
In this paper we applied the technique of Self Organizing Map (SOM) to segment individuals based on their credit information. SOM is an unsupervised machine learning method that reduces data complexity and dimensional...In this paper we applied the technique of Self Organizing Map (SOM) to segment individuals based on their credit information. SOM is an unsupervised machine learning method that reduces data complexity and dimensionality while keeping sits original topology, which is superior to other dimension reduction methods especially when features in data have unclear nonlinear relations. Through this method we provide more clear and intuitive segmentation that other traditional methods cannot achieve.展开更多
Several studies were devoted to investigate the effects of meteorological factors on the occurrence of stroke. Regression models had been mostly used to assess the correlation between weather and stroke incidence. How...Several studies were devoted to investigate the effects of meteorological factors on the occurrence of stroke. Regression models had been mostly used to assess the correlation between weather and stroke incidence. However, these methods could not describe the process proceeding in the back-ground of stroke incidence. The purpose of this study was to provide a new approach based on Hidden Markov Models (HMMs) and self-organizing maps (SOM), interpreting the background from the viewpoint of weather variability. Based on meteorological data, SOM was performed to classify weather patterns. Using these classes by SOM as randomly changing “states”, our Hidden Markov Models were constructed with “observation data” that were extracted from the daily data of emergency transport at Nagoya City in Japan. We showed that SOM was an effective method to get weather patterns that would serve as “states” of Hidden Markov Models. Our Hidden Markov Models provided effective models to clarify background process for stroke incidence. The effectiveness of these Hidden Markov Models was estimated by stochastic test for root mean square errors (RMSE). “HMMs with states by SOM” would serve as a description of the background process of stroke incidence and were useful to show the influence of weather on stroke onset. This finding will contribute to an improvement of our understanding for links between weather variability and stroke incidence.展开更多
We propose an integrative self-organizing map (iSOM) for exploring differential expression patterns across multiple microarray experiments. The algorithm is based on the assumption that observed differential expressio...We propose an integrative self-organizing map (iSOM) for exploring differential expression patterns across multiple microarray experiments. The algorithm is based on the assumption that observed differential expressions are random samples of a mean pattern model which is unknowna priori. The learning mechanism of iSOM is similar to the conventional SOM. The mean pattern model which underlies the proposed iSOM models mean differential expressions using a one-dimension of mean differential expressions for the mean differential expressions. The feature map of an iSOM model can be used to reveal correlation between multiple medically/biologically related disease types or multiple platform experiments for one disease. We illustrate applications of iSOM using simulated data and real data.展开更多
文摘Water quality is a critical global issue,especially in urban and semi-urban regions where natural and anthropogenic factors significantly influence surface water systems.This study evaluates the hydrochemical characteristics of surface water in the North of Tehran Rivers(NTRs),an essential water resource in a rapidly urbanizing region,using advanced clustering techniques,including Hierarchical Clustering Analysis(HCA),Fuzzy CMeans(FCM),Genetic Algorithm Fuzzy C-Means(GAFCM),and Self-Organizing Map(SOM).The research aims to address the scientific challenge of understanding spatial and temporal variability in water quality,focusing on physicochemical parameters,hydrochemical facies,and contamination sources.Water samples from six rivers collected over four seasons in 2020 were analyzed and classified into distinct clusters based on their chemical composition,revealing significant seasonal and spatial differences.Results showed that FCM and GAFCM consistently categorized the NTRs into two clusters during winter and spring and three in summer and autumn.These findings were supported by HCA and SOM,which identified clusters corresponding to specific river segments and contamination levels.The primary hydrochemical processes identified were mineral dissolution and weathering,with calcite,dolomite,and aragonite significantly influencing water chemistry.Additionally,human activities,such as wastewater discharge,were shown to contribute to elevated sulfate,nitrate,and phosphate concentrations,further corroborated by microbial analyses.By integrating HCA,FCM,and GAFCM with an artificial neural network(ANN)-based clustering method(SOM),this study provides a robust framework for evaluating surface water quality.The findings,supported by Gibbs diagrams,Hounslow ion ratio,and saturation indices,highlight the dominance of rock weathering and human impacts in shaping the hydrochemical dynamics of the NTRs.These insights contribute to the scientific understanding of water quality dynamics and offer practical guidance for sustainable water resource management and environmental protection in developing urban areas.
文摘The title of the online version of the original article was revised.The title of the original article has been revised to:Hydrochemical characterization of surface waters in Northern Tehran:Integrating cluster-based techniques with Self-Organizing Maps.
文摘Determination of homogenous precipitation-based regions is a very important task in effective management of water resources. The present study tried to propose an effective precipitation-based regionalization methodology by conjugating both temporal pre-processing and spatial clustering approaches in a way to take advantage of multiscale properties of precipitation time series. Annual precipitation data of 51 years(1960-2010) for 31 rain gauges(RGs) were collected and used in proposed clustering approaches. Discreet wavelet transform(DWT) was used to capture the time-frequency attributes of the time series and multiscale regionalization was performed by using k-means and Self Organizing Maps(SOM) clustering techniques. Daubechies function(db) was selected as mother wavelet to decompose the precipitation time series. Also, proper boundary extensions and decomposition level were applied. Different combinations of the approximation(A) and detail(D) coefficients were used to determine the input dataset as a basis of spatial clustering. The proposed model's efficiency in spatial clustering stage was verified using three different indexes namely, Silhouette Coefficient(SC), Dunn index and Davis Bouldin index(DB). Results approved superior performance of k-means technique in comparison to SOM. It was also deduced that DWT-based regionalization methodology showed improvements in comparison to historical-based models. Cross mutual information was used to investigate the RGs of cluster 3's homogeneousness in DWT-k-means approach. Results of non-linear correlation approach verified homogeneity of cluster 3. Verifications based on mean annual precipitation values of rain gauges in each cluster also approved the capability of multiscale approach in precipitation regionalization.
基金Strategic Priority Research Program of the Chinese Academy of Sciences,No.XDA19040302The Key Research Program of the Chinese Academy of Sciences,No.KFZD-SW-111
文摘A model integrating geo-information and self-organizing map(SOM) for exploring the database of soil environmental surveys was established. The dataset of 5 heavy metals(As, Cd, Cr, Hg, and Pb) was built by the regular grid sampling in Hechi, Guangxi Zhuang Autonomous Region in southern China. Auxiliary datasets were collected throughout the study area to help interpret the potential causes of pollution. The main findings are as follows:(1) Soil samples of 5 elements exhibited strong variation and high skewness. High pollution risk existed in the case study area, especially Hg and Cd.(2) As and Pb had a similar topological distribution pattern, meaning they behaved similarly in the soil environment. Cr had behaviours in soil different from those of the other 4 elements.(3) From the U-matrix of SOM networks, 3 levels of SEQ were identified, and 11 high risk areas of soil heavy metal-contaminated were found throughout the study area, which were basically near rivers,factories, and ore zones.(4) The variations of contamination index(CI) followed the trend of construction land(1.353)> forestland(1.267)> cropland(1.175)> grassland(1.056), which suggest that decision makers should focus more on the problem of soil pollution surrounding industrial and mining enterprises and farmland.
基金Jian Cao,Gregory J.Wagner,and Wing K.Liu acknowledge support from the National Science Foundation(NSF)Cyber-Physical Systems(CPS)(CPS/CMMI-1646592)Hengyang Li acknowledges support from the Northwestern Data Science Initiative(DSI+6 种基金171474500210043324)Jian Cao,Gregory J.Wagner,Wing K.Liu,Jennifer L.Bennett,and Sarah J.Wolff acknowledge support from the Digital Manufacturing and Design Innovation Institute(DMDII15-07)Jian Cao,Wing K.Liu,Zhengtao Gan,and Jennifer L.Bennett acknowledge support from the Center for Hierarchical Materials Design(CHiMaD70NANB14H012)This work made use of facilities at DMG MORI and Northwestern UniversityIt also made use of the MatCI Facility,which receives support from the MRSEC Program(NSF DMR-168 1720139)of the Materials Research Center at Northwestern University.
文摘To design microstructure and microhardness in the additive manufacturing(AM)of nickel(Ni)-based superalloys,the present work develops a novel data-driven approach that combines physics-based models,experimental measurements,and a data-mining method.The simulation is based on a computational thermal-fluid dynamics(CtFD)model,which can obtain thermal behavior,solidification parameters such as cooling rate,and the dilution of solidified clad.Based on the computed thermal information,dendrite arm spacing and microhardness are estimated using well-tested mechanistic models.Experimental microstructure and microhardness are determined and compared with the simulated values for validation.To visualize process-structure-properties(PSPs)linkages,the simulation and experimental datasets are input to a data-mining model-a self-organizing map(SOM).The design windows of the process parameters under multiple objectives can be obtained from the visualized maps.The proposed approaches can be utilized in AM and other data-intensive processes.Data-driven linkages between process,structure,and properties have the potential to benefit online process monitoring control in order to derive an ideal microstructure and mechanical properties.
文摘Varieties of approaches and algorithms have been presented to identify the distribution of elements. Previous researches based on the type of problem, categorized their data in proper clusters or classes. This means that the process of solution could be supervised or unsupervised. In cases, where there is no idea about dependency of samples to specific groups, clustering methods (unsupervised) are applied. About geochemistry data, since various elements are involved, in addition to the complex nature of geochemical data, clustering algorithms would be useful for recognition of elements distribution. In this paper, Self-Organizing Map (SOM) algorithm, as an unsupervised method, is applied for clustering samples based on REEs contents. For this reason the Choghart Fe-REE deposit (Bafq district, central Iran), was selected as study area and dataset was a collection of 112 lithology samples that were assayed with laboratory tests such as ICP-MS and XRF analysis. In this study, input vectors include 19 features which are coordinates x, y, z and concentrations of REEs as well as the concentration of Phosphate (P<sub>2</sub>O<sub>5</sub>) since the apatite is the main source of REEs in this particular research. Four clusters were determined as an optimal number of clusters using silhouette criterion as well as k-means clustering method and SOM. Therefore, using self-organizing map, study area was subdivided in four zones. These four zones can be described as phosphate type, albitofyre type, metasomatic and phosphorus iron ore, and Iron Ore type. Phosphate type is the most prone to rare earth elements. Eventually, results were validated with laboratory analysis.
基金supported by the National Key R&D Program of China (GrantN o.2016YFC0401407)National Natural Science Foundation of China (Grant Nos. 51479003 and 51279006)
文摘Due to rapid urbanization, waterlogging induced by torrential rainfall has become a global concern and a potential risk affecting urban habitant's safety. Widespread waterlogging disasters haveoccurred almost annuallyinthe urban area of Beijing, the capital of China. Based on a selforganizing map(SOM) artificial neural network(ANN), a graded waterlogging risk assessment was conducted on 56 low-lying points in Beijing, China. Social risk factors, such as Gross domestic product(GDP), population density, and traffic congestion, were utilized as input datasets in this study. The results indicate that SOM-ANNis suitable for automatically and quantitatively assessing risks associated with waterlogging. The greatest advantage of SOM-ANN in the assessment of waterlogging risk is that a priori knowledge about classification categories and assessment indicator weights is not needed. As a result, SOM-ANN can effectively overcome interference from subjective factors,producing classification results that are more objective and accurate. In this paper, the risk level of waterlogging in Beijing was divided into five grades. The points that were assigned risk grades of IV or Vwere located mainly in the districts of Chaoyang, Haidian, Xicheng, and Dongcheng.
文摘Characterization of unknown groundwater contaminant sources in terms of location, magnitude and duration of source activity is a complex problem. In this study, to increase the efficiency and accuracy of source characterization an alternative methodology to the methodologies proposed earlier is developed. This methodology, Adaptive Surrogate Modeling Based Optimization (ASMBO) uses the capabilities of Self Organizing Map (SOM) algorithm to design the surrogate models and adaptive surrogate models for source characterization. The most important advantage of this methodology is its direct utilization for groundwater contaminant characterization without the necessity of utilizing a linked simulation optimization model. The validation of the SOM based surrogate models and SOM based adaptive surrogate models demonstrates that the quantity and quality of initial sample sizes have crucial role on the accuracy of solutions as the designed monitoring locations. The performance evaluation results of the proposed methodology are obtained using error free and erroneous concentration measurement data. These results demonstrate that the developed methodology could approximate groundwater flow and transport simulation models, and substitute the optimization model for characterization of unknown groundwater contaminant sources in terms of location, magnitude and duration of source activity.
文摘This paper presents a real rough sets space and corresponding concepts of real lower and upper approximation sets which correspond to the real-valued attributes. Therefore, the real rough sets space can be investigated directly. A rhombus neighborhood for SOM is proposed, and the combination of SOM and rough sets theory is explored. According to the distance between the weight of winner node and the input vector in the real rough sets space, new weight learning rules are defined. The modified method makes the classification of the output of SOM clearer and the intervals of different classes larger. Finally, an example based on fault identification of an aircraft actuator is presented, The result of the simulation shows that this method is right and effective.
基金Supported by the Natural Science Foundation of Tianjin(No.15JCQNJC00200)
文摘Considering that growing hierarchical self-organizing map(GHSOM) ignores the influence of individual component in sample vector analysis, and its accurate rate in detecting unknown network attacks is relatively lower, an improved GHSOM method combined with mutual information is proposed. After theoretical analysis, experiments are conducted to illustrate the effectiveness of the proposed method by accurately clustering the input data. Based on different clusters, the complex relationship within the data can be revealed effectively.
文摘A new clustering algorithm called fuzzy self-organizing feature maps is introduced. It can process not only the exact digital inputs, but also the inexact or fuzzy non-digital inputs, such as natural language inputs. Simulation results show that the new algorithm is superior to original Kohonen’s algorithm in clustering performance and learning rate.
文摘The detailed analysis of individual rain events characteristics is an essential step for improving our understanding of variation in precipitation over different topographies. In this study, the homogeneity among rain gauges was investigated using the concept of “rain event properties,” linking them to the main atmospheric system that affects the rainfall in the region. For this, eight properties of more than 23,000 rain events recorded at 47 meteorological stations in Mumbai, India, were analyzed utilizing seasonal (June-September) rainfall records over 2006-2016. The high similarities among the properties indicated the similarities among the rain gauges. Furthermore, similar rain gauges were distinguished, investigated and characterized by cluster analysis using self-organizing maps (SOM). The cluster analysis results show six clusters of similarly behaving rain gauges, where each cluster addresses one isolated class of variables for the rain gauge. Additionally, the clusters confirm the spatial variation of rainfall caused by the complex topography of Mumbai, comprising the flatland near the Arabian Sea, high-rise buildings (urban area) and mountain and hills areas (Sanjay Gandhi National Park located in the northern part of Mumbai).
基金supported by JSPS KAKENHI(Nos.JP26730118 and JP16K12486)
文摘This paper describes the evaluation method of the gait motion in walk rehabilitation. We assume that the evaluation consists of the classification of the measured data and the prediction of the feature of the gait motion. The method may enable a doctor and a physical therapist to recognize the condition of the patients more easily, and increase the motivation of patient further for rehabilitation. However, it is difficult to divide the gait motion into discrete categories, since the gait motion continuously changes and does not have the clear boundaries. Therefore, the self-organizing map (SOM) that is able to arrange the continuous data on the almost continuous map is employed in order to classify them. And, the feature of the gait motion is predicted by the classification. In this study, we adopt the gravity-center fluctuation (GCF) on the sole as the measured data. First, it is shown that the pattern of the CCF that is obtained by our developed measurement system includes the feature of the gait motion. Secondly, the relation between the pattern of the GCF and the feature of the gait motion that the doctor and the physical therapist evaluate by visual inspection is considered using the SOM. Next, we describe the prediction of following features measured by numerical values: the length of stride, the velocity of walk and the difference of steps that are important for the doctor and the physical therapist to make a diagnosis of the condition of the gait motion in walk rehabilitation. Finally, it is investigated that the position of a new test data that is arranged on the map accords with the prediction. As a consequence, we confirm that the method using the SOM is often useful to classify and predict the condition of the patient.
文摘The artificial neural networks (ANNs), among different soft computing methodologies are widely used to meet the challenges thrown by the main objectives of data mining classification techniques, due to their robust, powerful, distributed, fault tolerant computing and capability to learn in a data-rich environment. ANNs has been used in several fields, showing high performance as classifiers. The problem of dealing with non numerical data is one major obstacle prevents using them with various data sets and several domains. Another problem is their complex structure and how hands to interprets. Self-Organizing Map (SOM) is type of neural systems that can be easily interpreted, but still can’t be used with non numerical data directly. This paper presents an enhanced SOM structure to cope with non numerical data. It used DNA sequences as the training dataset. Results show very good performance compared to other classifiers. For better evaluation both micro-array structure and their sequential representation as proteins were targeted as dataset accuracy is measured accordingly.
文摘Self-organizing map(SOM) proposed by Kohonen has obtained certain achievements in solving the traveling salesman problem(TSP).To improve Kohonen SOM,an effective initialization and parameter modification method is discussed to obtain a faster convergence rate and better solution.Therefore,a new improved self-organizing map(ISOM)algorithm is introduced and applied to four traveling salesman problem instances for experimental simulation,and then the result of ISOM is compared with those of four SOM algorithms:AVL,KL,KG and MSTSP.Using ISOM,the average error of four travelingsalesman problem instances is only 2.895 0%,which is greatly better than the other four algorithms:8.51%(AVL),6.147 5%(KL),6.555%(KG) and 3.420 9%(MSTSP).Finally,ISOM is applied to two practical problems:the Chinese 100 cities-TSP and102 counties-TSP in Shanxi Province,and the two optimal touring routes are provided to the tourists.
文摘In this paper we applied the technique of Self Organizing Map (SOM) to segment individuals based on their credit information. SOM is an unsupervised machine learning method that reduces data complexity and dimensionality while keeping sits original topology, which is superior to other dimension reduction methods especially when features in data have unclear nonlinear relations. Through this method we provide more clear and intuitive segmentation that other traditional methods cannot achieve.
文摘Several studies were devoted to investigate the effects of meteorological factors on the occurrence of stroke. Regression models had been mostly used to assess the correlation between weather and stroke incidence. However, these methods could not describe the process proceeding in the back-ground of stroke incidence. The purpose of this study was to provide a new approach based on Hidden Markov Models (HMMs) and self-organizing maps (SOM), interpreting the background from the viewpoint of weather variability. Based on meteorological data, SOM was performed to classify weather patterns. Using these classes by SOM as randomly changing “states”, our Hidden Markov Models were constructed with “observation data” that were extracted from the daily data of emergency transport at Nagoya City in Japan. We showed that SOM was an effective method to get weather patterns that would serve as “states” of Hidden Markov Models. Our Hidden Markov Models provided effective models to clarify background process for stroke incidence. The effectiveness of these Hidden Markov Models was estimated by stochastic test for root mean square errors (RMSE). “HMMs with states by SOM” would serve as a description of the background process of stroke incidence and were useful to show the influence of weather on stroke onset. This finding will contribute to an improvement of our understanding for links between weather variability and stroke incidence.
文摘We propose an integrative self-organizing map (iSOM) for exploring differential expression patterns across multiple microarray experiments. The algorithm is based on the assumption that observed differential expressions are random samples of a mean pattern model which is unknowna priori. The learning mechanism of iSOM is similar to the conventional SOM. The mean pattern model which underlies the proposed iSOM models mean differential expressions using a one-dimension of mean differential expressions for the mean differential expressions. The feature map of an iSOM model can be used to reveal correlation between multiple medically/biologically related disease types or multiple platform experiments for one disease. We illustrate applications of iSOM using simulated data and real data.