This paper summarizes the fuel type systems currently adopted by the fire danger rating systems or fire behavior prediction systems of several countries, including Canada, the United States, Australia, Greece, and Switzerland. As an example, the Canadian Forest Fire Danger Rating System organizes fuel types into five major groups, with a total of 16 discrete fuel types recognized. In the United States National Fire Danger Rating System, fuel models are divided into four vegetation groups and twenty fire behavior fuel models. The Prometheus System (Greece) divides fuels into 7 types, and Australia has adopted only three distinct fuel types: open grasslands, dry eucalyptus forests, and heath/shrublands. Four approaches to mapping fuels are accepted: field reconnaissance, direct mapping methods, indirect mapping methods, and gradient modeling. Satellite remote-sensing techniques offer an alternative means of obtaining fuel data quickly, since they provide comprehensive spatial coverage and enough temporal resolution to update fuel maps more efficiently and in a more timely manner than traditional aerial photography or fieldwork. Satellite sensors also provide digital information that can be easily tied into other spatial databases through Geographic Information System (GIS) analysis and used as input to fire behavior and growth models. Various fuel-mapping methods based on satellite remote sensing are discussed in the paper. Based on an analysis of fuel mapping techniques worldwide, this paper suggests that China should first create appropriate fuel types for its fire agencies before embarking on the development of a national fire danger rating system, in order to improve the current data situation for its fire management programs.
Wetland research has become a focal point linking multiple disciplines. Wetland classification and mapping are the basis for wetland research. Wetland data sets are difficult to generate with traditional methods because wetlands are hard to access; hence remote sensing data have become one of the primary data sources in wetland research. This paper presents a case study conducted in the core area of Honghe National Nature Reserve in the Sanjiang Plain, Northeast China. In this study, three images, acquired from an airship, from the Thematic Mapper (TM), and from SPOT 5, were selected to produce wetland maps at three different wetland landscape levels. After assessing the classification accuracies of the three maps, we compared the mapping results: 11 plant communities from the airship image, 6 plant ecotypes from the TM image, and 9 landscape classes from the SPOT 5 image. We discuss the different characteristics of the hierarchical ecosystem classifications in relation to the spatial scales of the different images. The results indicate that the spatial scale of remote sensing data is closely linked to the hierarchy of wetland plant ecosystems displayed on the wetland landscape maps. The richness of wetland landscape information derived from an image relates closely to its spatial resolution. This study can enrich the ecological classification methods and mapping techniques that deal with the spatial scales of different remote sensing images. With a better understanding of classification accuracies in mapping wetlands from remote sensing data at different scales, an appropriate approach can be chosen for dealing with the scale issue of remote sensing images.
This study focused on land cover mapping based on synthetic images, in particular on spatial and temporal classification and on validating the accuracy of its results. Our experimental results indicate that land cover maps based on synthetic imagery and on actual observations reach a similar level of accuracy when compared with actual land cover survey data. These findings facilitate land cover mapping with synthetic data in areas where actual observations are missing. Furthermore, to improve the quality of the land cover mapping, this research employed a spatial and temporal Markov random field classification approach. Test results show that overall mapping accuracy can be increased by approximately 5% after applying spatial and temporal classification. This finding contributes to higher-quality land cover mapping of areas with missing data by using spatial and temporal information.
An extended self-organizing map (SOM) for supervised classification is proposed in this paper. Unlike traditional SOMs, the model has an input layer, a Kohonen layer, and an output layer. The number of neurons in the input layer depends on the dimensionality of the input patterns. The number of neurons in the output layer equals the number of desired classes. The number of neurons in the Kohonen layer may range from a few to several thousand, depending on the complexity of the classification problem and the required classification precision. Each training sample is expressed by a pair of vectors: an input vector and a class codebook vector. When a training sample is input into the model, Kohonen's competitive learning rule is applied to select the winning neuron from the Kohonen layer. The weight coefficients connecting all the neurons in the input layer with both the winning neuron and its neighbors in the Kohonen layer are modified to be closer to the input vector, and those connecting all the neurons around the winning neuron within a certain diameter in the Kohonen layer with all the neurons in the output layer are adjusted to be closer to the class codebook vector. If the number of training samples is sufficiently large and the learning runs for enough epochs, the model can serve as a supervised classifier. The model has been tentatively applied to the supervised classification of multispectral remotely sensed data. The author compared the performance of the extended SOM with that of a back-propagation network (BPN) in remotely sensed data classification. The investigation shows that the extended SOM is feasible for supervised classification.
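The training rule described in this abstract can be sketched in a few lines. The following is only an illustrative sketch under stated assumptions, not the author's implementation: the Kohonen layer is taken to be one-dimensional, the neighborhood is a fixed index radius, the learning rate decays linearly, and the class codebook vector is a one-hot vector. The class name `ExtendedSOM` and all parameter names are hypothetical.

```python
import math
import random


class ExtendedSOM:
    """Sketch: input layer -> 1-D Kohonen layer -> output layer (assumed layout)."""

    def __init__(self, n_inputs, n_kohonen, n_classes, seed=0):
        rng = random.Random(seed)
        # w_in[j]: weights from every input neuron to Kohonen neuron j
        self.w_in = [[rng.random() for _ in range(n_inputs)] for _ in range(n_kohonen)]
        # w_out[j]: weights from Kohonen neuron j to every output neuron
        self.w_out = [[0.0] * n_classes for _ in range(n_kohonen)]
        self.n_classes = n_classes

    def winner(self, x):
        # Competitive step: the Kohonen neuron whose weights are closest to x wins
        return min(range(len(self.w_in)), key=lambda j: math.dist(self.w_in[j], x))

    def train(self, samples, epochs=50, lr=0.5, radius=1):
        for epoch in range(epochs):
            rate = lr * (1.0 - epoch / epochs)  # assumed linear decay
            for x, label in samples:
                # Assumed one-hot class codebook vector
                code = [1.0 if k == label else 0.0 for k in range(self.n_classes)]
                win = self.winner(x)
                lo, hi = max(0, win - radius), min(len(self.w_in), win + radius + 1)
                for j in range(lo, hi):
                    # Pull input-side weights toward the input vector
                    self.w_in[j] = [w + rate * (xi - w) for w, xi in zip(self.w_in[j], x)]
                    # Pull output-side weights toward the class codebook vector
                    self.w_out[j] = [w + rate * (c - w) for w, c in zip(self.w_out[j], code)]

    def predict(self, x):
        # The class is the strongest output weight of the winning Kohonen neuron
        out = self.w_out[self.winner(x)]
        return max(range(self.n_classes), key=lambda k: out[k])
```

A typical call would build the model with `ExtendedSOM(n_inputs, n_kohonen, n_classes)`, pass a list of `(input_vector, class_index)` pairs to `train`, and then call `predict` on new input vectors. A shrinking neighborhood radius, as used in many SOM variants, would be a natural refinement.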
Among soft computing methodologies, artificial neural networks (ANNs) are widely used to meet the challenges posed by data mining classification, owing to their robust, powerful, distributed, and fault-tolerant computing and their capability to learn in data-rich environments. ANNs have been used in several fields and have shown high performance as classifiers. Handling non-numerical data is one major obstacle that prevents their use with various data sets and in several domains. Another problem is their complex structure and how hard they are to interpret. The Self-Organizing Map (SOM) is a type of neural system that can be easily interpreted, but it still cannot be used directly with non-numerical data. This paper presents an enhanced SOM structure that copes with non-numerical data. DNA sequences were used as the training dataset. Results show very good performance compared to other classifiers. For a better evaluation, both the micro-array structure and its sequential representation as proteins were used as datasets, and accuracy was measured accordingly.
In future fifth generation (5G) communication systems, channel models may be very complicated, and it is difficult to calculate the equivalent signal to interference plus noise ratio (SINR) of a random fading channel. Methods for calculating the equivalent SINR of a random fading channel are therefore necessary. In this paper, an enhanced algorithm based on the exponential effective SINR mapping (EESM) model is proposed for random fading channels. First, the optimal adjustment parameters of typical channel fading models, including the extended pedestrian A (EPA) model, the extended vehicular A (EVA) model, and the extended typical urban (ETU) model, were obtained by simulation. Then the proposed solution classifies the random channel according to its maximum multipath delay and average power. The solution determines the typical channel closest to the random channel in order to obtain the optimal adjustment value for EESM. The evaluation results indicate that the proposed solution significantly improves overall system throughput while the accuracy of the link prediction algorithm is maintained.
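For orientation, the standard EESM mapping compresses a set of per-subcarrier SINRs into one effective value, SINR_eff = -β · ln((1/N) · Σ exp(-SINR_i / β)), where β is the adjustment parameter that the abstract calibrates per channel model. The sketch below shows only this baseline formula, with SINRs assumed to be in linear scale; it does not reproduce the paper's channel classification step or its calibrated β values. The function name is hypothetical.

```python
import math


def eesm_effective_sinr(sinrs_linear, beta):
    """Standard EESM: map per-subcarrier SINRs (linear scale) to one effective SINR."""
    if beta <= 0:
        raise ValueError("beta must be positive")
    if not sinrs_linear:
        raise ValueError("at least one SINR value is required")
    # Average the exponentially weighted SINRs, then invert the mapping
    mean_exp = sum(math.exp(-s / beta) for s in sinrs_linear) / len(sinrs_linear)
    return -beta * math.log(mean_exp)
```

When all subcarriers share the same SINR, the effective SINR equals that value; for unequal SINRs it falls below the arithmetic mean, which reflects the penalty that deep fades impose on link performance.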
Classifying high-dimensional hyperspectral images is a challenging task because of the spectral feature vectors. The high correlation between these features, together with noise, greatly affects classification performance. To overcome this, dimensionality reduction techniques are widely used. Numerous deep learning models have recently been proposed for traditional image processing applications. However, in hyperspectral image classification, the features of deep learning models are less explored. Thus, for efficient hyperspectral image classification, a depth-wise convolutional neural network is presented in this research work. To handle the dimensionality issue in the classification process, an optimized self-organized map (SOM) model is employed using a water strider optimization algorithm. The network parameters of the self-organized map are optimized by water strider optimization, which reduces the dimensionality issues and enhances classification performance. Standard datasets, Indian Pines and the University of Pavia (UP), are used for experimental analysis. Existing dimensionality reduction methods, namely Enhanced Hybrid-Graph Discriminant Learning (EHGDL), local geometric structure Fisher analysis (LGSFA), Discriminant Hyper-Laplacian projection (DHLP), Group-based tensor model (GBTM), and Lower rank tensor approximation (LRTA), are compared with the proposed optimized SOM model. Results confirm the superior performance of the proposed model, with 98.22% accuracy on the Indian Pines dataset and 98.21% accuracy on the University of Pavia dataset, over the existing maximum likelihood classifier and support vector machine (SVM).
Due to rapid development in the software industry, it has become necessary to reduce the time and effort spent in the software development process. Software reusability is an important measure that can be applied to improve software development and software quality. Reusability reduces time, effort, errors, and hence the overall cost of the development process. Reusability prediction models are established in the early stage of the system development cycle to support an early reusability assessment. In object-oriented systems, the reusability of software components (classes) can be assessed by investigating their metric values. Analyzing software metric values can help to avoid developing components from scratch. In this paper, we use the Chidamber and Kemerer (CK) metrics suite to identify the reuse level of object-oriented classes. A Self-Organizing Map (SOM) was used to cluster datasets of CK metric values extracted from three different Java-based systems. The goal was to find the relationship between CK metric values and the reusability level of a class. The reusability level of a class was classified into three main categories (High Reusable, Medium Reusable, and Low Reusable). The clustering was based on the metric threshold values used in the experiments. The proposed methodology succeeds in assigning classes to their reusability level. The experiments show how SOM can be applied to software CK metrics with different SOM grid sizes to provide different levels of metric detail. The results show that the Depth of Inheritance Tree (DIT) and Number of Children (NOC) metrics dominated the clustering process, so these two metrics were discarded from the experiments to achieve a successful clustering. The most efficient SOM topology, a [2 × 2] grid, is used to predict the reusability of classes.
This work used a multi-technique analysis comprising GIS models, geostatistics, and clustering analysis, as opposed to the traditional single-layer thematic approach, to characterize the Mambilla Plateau, a cold highland in Taraba State, Nigeria, into the biogeoclimatic ecosystem zones needed for natural resources management. The principal criteria used in classifying and mapping the ecosystems are climate and bioclimate data, physiography (slope, relief, and aspect), and vegetation/landcover data. To classify and map the physiography of the Mambilla Plateau, the topographic layer was produced from the 90 m NASA/SRTM digital elevation model. Principal component analysis, agglomerative hierarchical cluster analysis (AHC), and geostatistical techniques (Kriging) were applied to develop the bioclimatic layer. A combination of a vegetation field survey conducted on selected sites, unsupervised and supervised classification, and NDVI values was used to produce the landcover map and to delineate the Plateau into vegetation units. The physiographic, bioclimatic, and vegetation layers were spatially combined using fuzzy (sum) overlay in ArcGIS 10.2 to produce 5 major and 1 subunit biogeoclimatic ecosystem zones on the Mambilla Plateau and the adjoining landscape, namely: humid lowland forest/humid shrubby forest, montane forest, escarpment stream valley forest, grass cypress cool mountain, and grass eucalyptus cold high mountain. It is expected that this approach to biogeoclimatic ecosystem mapping will form the basis for vegetal/forest resources management not only in the region but also in most of the highlands of the world.
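Two of the per-cell computations named in this abstract have simple closed forms. NDVI is (NIR - Red) / (NIR + Red), and the fuzzy sum overlay combines membership values μ_i in [0, 1] as 1 - Π(1 - μ_i). The sketch below applies both to single cells; the function names are hypothetical, and the way the study rescaled its layers into fuzzy memberships is not stated in the abstract, so it is not modeled here.

```python
def ndvi(nir, red):
    """Normalized Difference Vegetation Index for one pixel's reflectances."""
    total = nir + red
    if total == 0:
        return 0.0  # assumed convention for pixels with no signal
    return (nir - red) / total


def fuzzy_sum(memberships):
    """Fuzzy sum overlay of per-layer membership values, each in [0, 1]."""
    product = 1.0
    for m in memberships:
        if not 0.0 <= m <= 1.0:
            raise ValueError("membership values must lie in [0, 1]")
        product *= 1.0 - m
    return 1.0 - product
```

The fuzzy sum is an increasive operator: the combined value is never lower than the largest input membership, so evidence from the physiographic, bioclimatic, and vegetation layers reinforces rather than cancels.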
In the Saharan domain, the Tarfaya-Laayoune coastal basin developed on a stable passive margin, where asymmetrical sedimentation increases from east to west and reaches a sediment stack of about 14 kilometers. The morphology of the studied area, however, corresponds to a vast plateau (hamada) with only occasional major reliefs. A remote sensing approach was therefore applied to find the most reliable method for lithological mapping. Two supervised classification methods (Artificial Neural Network and Spectral Information Divergence) were evaluated to identify the most accurate classification for our lithofacies mapping. The latest geological maps and pseudo-color RGB composites were used to identify important areas and to collect the ROIs that serve as facies samples for the classifications. The results show a clear distinction between the various formation units, and the ANN classification of the studied area is very close to the field reality. The ANN method is thus the more accurate, with an overall accuracy of 92.56% and a Kappa coefficient of 0.9143.
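The two accuracy figures reported here, overall accuracy and the Kappa coefficient, are both derived from a confusion matrix. The sketch below shows the standard computation; the function name is hypothetical, and the matrix layout (rows as reference classes, columns as predicted classes) is an assumption. The paper's own confusion matrix is not given in the abstract.

```python
def overall_accuracy_and_kappa(confusion):
    """Return (overall accuracy, Cohen's kappa) for a square confusion matrix."""
    n = len(confusion)
    total = sum(sum(row) for row in confusion)
    # Observed agreement: share of samples on the diagonal
    observed = sum(confusion[i][i] for i in range(n)) / total
    # Chance agreement: product of row and column marginals for each class
    expected = sum(
        sum(confusion[i]) * sum(row[i] for row in confusion) for i in range(n)
    ) / (total * total)
    kappa = (observed - expected) / (1.0 - expected)
    return observed, kappa
```

Kappa discounts the agreement that would be expected by chance, which is why it is lower than the overall accuracy for the same map.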
The traveling salesman problem (TSP) is a classic non-deterministic polynomial-hard optimization problem. Based on the characteristics of the self-organizing mapping (SOM) network, this paper proposes an improved SOM network from the perspectives of network update strategy, initialization method, and parameter selection. The paper compares the performance of the proposed algorithms with that of existing SOM network algorithms on the TSP and with several heuristic algorithms. Simulations show that, compared with existing SOM networks, the improved SOM network improves the convergence rate and algorithm accuracy. Compared with iterated local search and heuristic algorithms, the improved SOM network algorithms have the advantage of fast calculation speed on medium-scale TSP.
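As background for this abstract, the baseline idea of solving the TSP with a SOM can be sketched as a ring of neurons that is gradually pulled onto the cities; the visiting order is then read off the ring. The sketch below is this basic ring SOM only, not the improved update strategy, initialization, or parameter selection proposed in the paper. City coordinates are assumed to lie in the unit square, and the function names, neuron count, and decay constants are all hypothetical choices.

```python
import math
import random


def som_tsp(cities, iterations=2000, seed=0):
    """Return a visiting order (list of city indices) from a basic ring SOM."""
    rng = random.Random(seed)
    n = len(cities) * 3  # assumed: three ring neurons per city
    ring = [[rng.random(), rng.random()] for _ in range(n)]
    lr, radius = 0.8, n / 4.0

    def winner(city):
        # Index of the ring neuron closest to the city
        return min(range(n), key=lambda j: (ring[j][0] - city[0]) ** 2
                   + (ring[j][1] - city[1]) ** 2)

    for _ in range(iterations):
        cx, cy = rng.choice(cities)
        win = winner((cx, cy))
        for j in range(n):
            d = min(abs(j - win), n - abs(j - win))  # distance along the ring
            g = math.exp(-(d * d) / (2.0 * max(radius, 0.1) ** 2))
            ring[j][0] += lr * g * (cx - ring[j][0])
            ring[j][1] += lr * g * (cy - ring[j][1])
        lr *= 0.9997      # slow learning-rate decay
        radius *= 0.997   # shrinking neighborhood
    # Cities are visited in the order of their winning neurons on the ring
    return sorted(range(len(cities)), key=lambda i: winner(cities[i]))


def tour_length(cities, order):
    """Length of the closed tour that visits cities in the given order."""
    return sum(math.dist(cities[order[i]], cities[order[(i + 1) % len(order)]])
               for i in range(len(order)))
```

Calling `som_tsp(cities)` followed by `tour_length(cities, order)` gives a tour and its length. The result is a heuristic tour with no optimality guarantee, and its quality depends on the decay constants chosen above.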
Recently, machine learning-based technologies have been developed to automate the classification of wafer map defect patterns during semiconductor manufacturing. Existing approaches to wafer map pattern classification include learning directly from the image with a convolutional neural network and applying an ensemble method after extracting image features. This study aims to classify wafer map defects more effectively and to derive algorithms that remain robust even for datasets with insufficient defect patterns. Because the number of defects observed during the actual process may be limited, additional data for the insufficient patterns are generated using a convolutional auto-encoder (CAE), and the expanded data are verified using the structural similarity index measure (SSIM). After handcrafted features are extracted, a boosted stacking ensemble model that integrates four base-level classifiers, with the extreme gradient boosting classifier as the meta-level classifier, is designed and trained on the expanded data for final prediction. Since the proposed algorithm performs better than existing ensemble classifiers even for insufficient defect patterns, the results of this study will contribute to improving the product quality and yield of actual semiconductor manufacturing processes.
To solve the mapping problem for mobile robots in unknown environments, a dynamic growing self-organizing map with automatic growing-threshold tuning (DGSOMGT), based on the Self-Organizing Map, is proposed. It introduces a spread factor to describe how the growing threshold changes dynamically. The network structure grows through training as the mobile robot moves continuously through the unknown environment. The proposed algorithm adjusts the growing-threshold value as the number of network neurons increases, which avoids repeated manual tuning of the parameters. The experimental results show that the proposed method detects complex environments quickly, effectively, and correctly, and that the robot can map the environment automatically. Compared with other methods, the proposed mapping strategy has better topological properties and time performance.
Intrusion attempts against Internet of Things (IoT) devices have increased significantly in the last few years. These devices are now easy targets for hackers because of their built-in security flaws. A hybrid anomaly detection system that combines a Self-Organizing Map (SOM), used for dimensionality reduction and for its inherent clustering, with Extreme Gradient Boosting (XGBoost) for multi-class classification can improve network traffic intrusion detection. The proposed model is evaluated on the NSL-KDD dataset. The hybrid approach outperforms the baseline models, a multilayer perceptron model and a SOM-KNN (k-nearest neighbors) model, in precision, recall, and F1-score, highlighting the proposed approach's scalability, potential, adaptability, and real-world applicability. This paper therefore proposes a highly efficient deployment strategy for resource-constrained network edges. The results reveal that precision, recall, and F1-scores rise 10%-30% for the benign, probing, and Denial of Service (DoS) classes. In particular, the DoS, probe, and benign classes improved their F1-scores by 7.91%, 32.62%, and 12.45%, respectively.
Flower image classification is a fine-grained classification problem. The main difficulty of fine-grained classification is the large inter-class similarity and intra-class difference. In this paper, we propose a new algorithm based on a saliency map and PCANet to overcome this difficulty. The algorithm consists of two parts: flower region selection and flower feature learning. In the first part, we combine the saliency map with the gray-scale map to select the flower region. In the second part, we use the flower region as input to train PCANet, a simple deep learning network that learns flower features automatically; a 102-way softmax layer that follows the PCANet then performs the classification. Our approach achieves 84.12% accuracy on the Oxford 17 Flowers dataset. The results show that a combination of a saliency map and the simple deep learning network PCANet can be applied to the flower image classification problem.
Information on Land Use and Land Cover Maps (LULCM) is essential for environmental and socioeconomic applications. Such maps are generally derived from Multispectral Remote Sensing Images (MRSI) via classification. The classification process can be described as an information flow from images to maps through a trained classifier. Characterizing the information flow is essential for understanding the classification mechanism and for addressing such theoretical issues as "what is the maximum number of classes that can be classified from a given MRSI?" and "how much information gain can be obtained?" Two questions naturally arise: (i) How can we characterize the information flow? and (ii) What is the mathematical form of the information flow? To answer these two questions, this study first hypothesizes that thermodynamic entropy is the appropriate measure of information for both MRSI and LULCM. This hypothesis is then supported by kinetic-theory-based experiments. Based on this entropy, a generalized Jarzynski equation is formulated to model the information flow mathematically; it contains such parameters as the thermodynamic entropy of MRSI, the thermodynamic entropy of LULCM, the weighted F1-score (classification accuracy), and the total number of classes. This generalized Jarzynski equation has been successfully validated by hypothesis-driven experiments in which 694 Sentinel-2 images are classified into 10 classes by four classical classifiers. This study provides a way of linking thermodynamic laws and concepts to the characterization and understanding of information flow in land cover classification, opening a new door for constructing domain knowledge.
Map recognition is an essential means of data input for Geographic Information Systems (GIS). Solving the problems that arise in this procedure, such as recognizing maps with crisscross pipeline networks, classifying buildings and roads, and processing connected text, is critical to the continued rapid development of GIS. In this paper, a new recognition method for pipeline maps is presented, and some common patterns of pipeline connections and component labels are established. Through pattern matching, pipelines and component labels are recognized and peeled off from the maps. After this step, the maps consist simply of buildings and roads, which are recognized and classified with a fuzzy classification method. In addition, the Double Sides Scan (DSS) technique is described, through which the effect of connected text can be eliminated.
There is increasing interest in developing automatic procedures to segment landscapes into soil spatial entities that replace conventional, expensive manual procedures for delineating and classifying soils. Geographic object-based image analysis (GEOBIA) partitions remote sensing imagery or digital elevation models into homogeneous image objects based on image segmentation. We used an object-based methodology for the detailed delineation and classification of soil types using digital maps of topography and vegetation as soil covariates, based on the Random Forests (RF) classifier. We compared the object-based method's results with those of a pixel-based classification using the same classifier. We used 18 digital elevation model derivatives and 5 remote sensing indices related to vegetation cover and soil. Using 171 soil profiles with their associated environmental variable values, the RF method was used to identify the most important soil type predictors for use in the segmentation process. A stack of raster geodatasets corresponding to the selected predictors was segmented using a multi-resolution segmentation algorithm, which resulted in homogeneous objects related to soil types. These objects were further classified as soil types using the same method, RF. We also conducted a pixel-based classification using the same classifier and soil profiles, and the resulting maps were assessed for accuracy using 30% of the soil profiles for validation. We found that GEOBIA was an effective method for soil type mapping and was superior to the pixel-based approach. The optimized object-based soil map had an overall accuracy of 58%, which was 10% higher than that of the optimized pixel-based map.
Funding (fuel type systems and fuel mapping paper): This paper was supported by the Beijing Fund of Nature Science (No. 6042025), the China NKBRSF Project (No. 2001CB409600), and the Laboratory of Forest Protection, State Forestry Administration.
Funding (wetland classification and mapping paper): Under the auspices of the National Natural Science Foundation of China (No. 40871241, 40771170) and the National High Technology Research and Development Program of China (No. 2007AA12Z176).
Funding (land cover mapping with synthetic images paper): Supported in part by the National High-Tech R&D Program (863 Program) under grant number 2009AA122004, the National Natural Science Foundation of China under grant number 60171009, and the Hong Kong Research Grant Council under grant number CUHK 444612.
Abstract: This study focused on land cover mapping based on synthetic images, especially using the method of spatial and temporal classification, as well as the accuracy validation of the results. Our experimental results indicate that the accuracy of land cover maps based on synthetic imagery and on actual observation is similar when compared with actual land cover survey data. These findings facilitate land cover mapping with synthetic data in areas where actual observation is missing. Furthermore, to improve the quality of the land cover mapping, this research employed the spatial and temporal Markov random field classification approach. Test results show that overall mapping accuracy can be increased by approximately 5% after applying spatial and temporal classification. This finding contributes towards higher quality land cover mapping of areas with missing data by using spatial and temporal information.
Funding: Supported by the National Natural Science Foundation of China (No. 40872193)
Abstract: An extended self-organizing map (SOM) for supervised classification is proposed in this paper. Unlike traditional SOMs, the model has an input layer, a Kohonen layer, and an output layer. The number of neurons in the input layer depends on the dimensionality of the input patterns. The number of neurons in the output layer equals the number of desired classes. The number of neurons in the Kohonen layer may range from a few to several thousand, depending on the complexity of the classification problem and the required classification precision. Each training sample is expressed by a pair of vectors: an input vector and a class codebook vector. When a training sample is input into the model, Kohonen's competitive learning rule is applied to select the winning neuron from the Kohonen layer. The weight coefficients connecting all the neurons in the input layer with both the winning neuron and its neighbors in the Kohonen layer are modified to be closer to the input vector, and those connecting all the neurons around the winning neuron within a certain diameter in the Kohonen layer with all the neurons in the output layer are adjusted to be closer to the class codebook vector. If the number of training samples is sufficiently large and the learning epochs iterate enough times, the model will be able to serve as a supervised classifier. The model has been tentatively applied to the supervised classification of multispectral remotely sensed data. The author compared the performances of the extended SOM and BPN in remotely sensed data classification. The investigation shows that the extended SOM is feasible for supervised classification.
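The update rule described in this abstract can be sketched in a few lines of Python. This is only an illustrative sketch, not the author's implementation: the one-dimensional Kohonen layer, the shared neighborhood radius for both weight sets, the linear decay schedule, and the names `winner`, `train_step`, `fit`, and `predict` are all assumptions introduced here.

```python
def winner(w_in, x):
    """Index of the Kohonen unit whose input weights are closest to x."""
    return min(
        range(len(w_in)),
        key=lambda u: sum((w - xi) ** 2 for w, xi in zip(w_in[u], x)),
    )


def train_step(w_in, w_out, x, code, lr, radius):
    """One competitive-learning update; modifies w_in and w_out in place."""
    win = winner(w_in, x)
    lo, hi = max(0, win - radius), min(len(w_in), win + radius + 1)
    for u in range(lo, hi):  # winner and its neighbors on an assumed 1-D layer
        # Input -> Kohonen weights move toward the input vector.
        w_in[u] = [w + lr * (xi - w) for w, xi in zip(w_in[u], x)]
        # Kohonen -> output weights move toward the class codebook vector.
        w_out[u] = [w + lr * (ci - w) for w, ci in zip(w_out[u], code)]
    return win


def fit(w_in, w_out, samples, labels, n_classes, epochs=20, lr0=0.5, radius0=1):
    """Repeat train_step with a linearly decaying learning rate and radius."""
    for epoch in range(epochs):
        frac = 1.0 - epoch / epochs
        for x, label in zip(samples, labels):
            # One-hot class codebook vector (an assumption about its encoding).
            code = [1.0 if c == label else 0.0 for c in range(n_classes)]
            train_step(w_in, w_out, x, code, lr0 * frac, int(radius0 * frac))


def predict(w_in, w_out, x):
    """Class whose output weight is largest at the winning unit."""
    out = w_out[winner(w_in, x)]
    return max(range(len(out)), key=lambda c: out[c])
```

A caller supplies initial weights, for example `w_in = [[0.0, 0.0], [1.0, 1.0], [2.0, 2.0]]` with all-zero `w_out`, runs `fit`, and then classifies new vectors with `predict`. A two-dimensional Kohonen grid would only change how the neighborhood is enumerated in `train_step`.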
Abstract: Among soft computing methodologies, artificial neural networks (ANNs) are widely used to meet the challenges posed by the main objectives of data mining classification techniques, owing to their robust, powerful, distributed, fault-tolerant computing and their capability to learn in a data-rich environment. ANNs have been used in several fields, showing high performance as classifiers. Dealing with non-numerical data is one major obstacle that prevents their use with various data sets and in several domains. Another problem is their complex structure and how hard they are to interpret. The Self-Organizing Map (SOM) is a type of neural system that can be easily interpreted, but it still cannot be used with non-numerical data directly. This paper presents an enhanced SOM structure to cope with non-numerical data. It uses DNA sequences as the training dataset. Results show very good performance compared to other classifiers. For better evaluation, both the micro-array structure and its sequential representation as proteins were targeted as datasets, and accuracy is measured accordingly.
Funding: Institute of Nonlinear Science of Donghua University, China
Abstract: In future fifth generation (5G) communication systems, channel models may be very complicated, and it is difficult to calculate the equivalent signal to interference plus noise ratio (SINR) of a random fading channel. Therefore, methods for calculating the equivalent SINR of a random fading channel are necessary. In this paper, an enhanced algorithm based on the exponential effective SINR mapping (EESM) model for random fading channels is proposed. First, the optimal adjustment parameters of typical channel fading models, including the extended pedestrian A (EPA) model, the extended vehicular A (EVA) model, and the extended typical urban (ETU) model, were obtained by simulation. Then the proposed solution was used to classify channels according to the maximum multipath delay and the average power of the random channel. The solution can determine the typical channel closest to the random channel in order to obtain the optimal adjustment value of EESM. The evaluation results indicate that the proposed algorithm can significantly improve overall system throughput while preserving the accuracy of the link prediction algorithm.
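For readers unfamiliar with EESM, the standard mapping compresses a set of per-subcarrier SINR values into one effective value using a single adjustment parameter, usually written β. The sketch below shows only that standard formula, not the enhanced channel-classification algorithm of the paper; the function names and the linear-scale convention are assumptions made for illustration.

```python
import math


def db_to_linear(value_db):
    """Convert a ratio from decibels to linear scale."""
    return 10.0 ** (value_db / 10.0)


def eesm_effective_sinr(sinrs_linear, beta):
    """Standard EESM: SINR_eff = -beta * ln(mean_i exp(-SINR_i / beta)).

    sinrs_linear: per-subcarrier SINR values in linear scale (not dB).
    beta: adjustment parameter, normally calibrated by simulation.
    """
    if beta <= 0:
        raise ValueError("beta must be positive")
    if not sinrs_linear:
        raise ValueError("at least one SINR value is required")
    mean_exp = sum(math.exp(-s / beta) for s in sinrs_linear) / len(sinrs_linear)
    return -beta * math.log(mean_exp)
```

On a flat channel, where every subcarrier has the same SINR, the effective value equals that SINR. On a frequency-selective channel it falls between the minimum and the arithmetic mean, and β controls how strongly the weak subcarriers dominate.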
Abstract: High-dimensional hyperspectral image classification is a challenging task because of the spectral feature vectors. The high correlation between these features and the noise greatly affects classification performance. To overcome this, dimensionality reduction techniques are widely used. Numerous deep learning models have recently been proposed for traditional image processing applications. However, in hyperspectral image classification, the features of deep learning models are less explored. Thus, for efficient hyperspectral image classification, a depth-wise convolutional neural network is presented in this research work. To handle the dimensionality issue in the classification process, an optimized self-organized map model is employed using a water strider optimization algorithm. The network parameters of the self-organized map are optimized by water strider optimization, which reduces the dimensionality issues and enhances classification performance. Standard datasets such as Indian Pines and the University of Pavia (UP) are considered for experimental analysis. Existing dimensionality reduction methods, namely Enhanced Hybrid-Graph Discriminant Learning (EHGDL), local geometric structure Fisher analysis (LGSFA), Discriminant Hyper-Laplacian projection (DHLP), Group-based tensor model (GBTM), and Lower rank tensor approximation (LRTA), are compared with the proposed optimized SOM model. Results confirm the superior performance of the proposed model, with 98.22% accuracy for the Indian Pines dataset and 98.21% accuracy for the University of Pavia dataset, over the existing maximum likelihood classifier and support vector machine (SVM).
Abstract: Due to rapid development in the software industry, it has become necessary to reduce time and effort in the software development process. Software reusability is an important measure that can be applied to improve software development and software quality. Reusability reduces time, effort, errors, and hence the overall cost of the development process. Reusability prediction models are established in the early stage of the system development cycle to support an early reusability assessment. In object-oriented systems, the reusability of software components (classes) can be assessed by investigating their metric values. Analyzing software metric values can help to avoid developing components from scratch. In this paper, we use the Chidamber and Kemerer (CK) metrics suite to identify the reuse level of object-oriented classes. A Self-Organizing Map (SOM) was used to cluster datasets of CK metric values that were extracted from three different Java-based systems. The goal was to find the relationship between CK metric values and the reusability level of the class. The reusability level of the class was classified into three main categories (High Reusable, Medium Reusable and Low Reusable). The clustering was based on metric threshold values that were used to conduct the experiments. The proposed methodology succeeds in assigning classes to their reusability level. The experiments show how SOM can be applied to software CK metrics with different sizes of SOM grids to provide different levels of metric detail. The results show that the Depth of Inheritance Tree (DIT) and Number of Children (NOC) metrics dominated the clustering process, so these two metrics were discarded from the experiments to achieve a successful clustering. The most efficient SOM topology, a [2 × 2] grid, is used to predict the reusability of classes.
Abstract: This work used a multi-technique analysis comprising GIS models, geostatistics, and clustering analysis, as against the traditional single-layer thematic approach, to characterize the Mambilla Plateau, a cold highland in Taraba State, Nigeria, into the biogeoclimatic ecosystem zones needed for natural resources management. The principal criteria used in classifying and mapping the ecosystems are climate and bioclimate data, physiography (slope, relief and aspect), and vegetation/landcover data. In classifying and mapping the physiography of the Mambilla Plateau, the topographic layer was produced from the 90 m NASA/SRTM digital elevation model. Principal component analysis, agglomerative hierarchical cluster analysis (AHC) and geostatistical techniques (Kriging) were applied to develop the bioclimatic layer. A combination of vegetation field surveys conducted on selected sites, unsupervised and supervised classification, and NDVI values was used to produce the landcover map and to delineate the Plateau into vegetation units. These physiographic, bioclimatic and vegetation layers were spatially combined using fuzzy (sum) overlay in ArcGIS 10.2 to produce 5 major and 1 subunit biogeoclimatic ecosystem zones on the Mambilla Plateau and the adjoining landscape, namely: humid lowland forest/humid shrubby forest, montane forest, escarpment stream valley forest, grass cypress cool mountain, and grass eucalyptus cold high mountain. It is expected that this approach to biogeoclimatic ecosystem mapping will form the bedrock for vegetal/forest resources management not only in the region but also in most of the highlands of the world.
Abstract: In the Saharan domain, the Tarfaya-Laayoune coastal basin developed on a stable passive margin, where asymmetrical sedimentation increases from east to west and reaches a sediment stack of about 14 kilometers. However, the morphology of the studied area corresponds to a vast plateau (hamada) presenting occasional major reliefs. For this reason, a remote sensing approach was applied to find the best approaches for accurate lithological mapping. Two supervised classification methods by machine learning (Artificial Neural Network and Spectral Information Divergence) were evaluated to find the most accurate classification for our lithofacies mapping. The latest geological maps and RGB images were used for pseudo-color groups to identify important areas and collect the ROIs that serve as samples for the classifications. The results showed a clear distinction between the various formation units, and the ANN classification of the studied area was very close to the field reality. Thus, the ANN method is more accurate, with an overall accuracy of 92.56% and a Kappa coefficient of 0.9143.
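Overall accuracy and the Kappa coefficient, the two figures reported here, are both derived from a confusion matrix. The following sketch shows the usual computation (Cohen's kappa); the function name and the small example matrix are hypothetical and are not taken from the study.

```python
def overall_accuracy_and_kappa(cm):
    """Return (overall accuracy, Cohen's kappa) for a square confusion matrix.

    cm[i][j] is the number of samples of reference class i assigned to class j.
    """
    n = len(cm)
    total = sum(sum(row) for row in cm)
    if total == 0:
        raise ValueError("confusion matrix is empty")
    # Observed agreement: fraction of samples on the diagonal.
    observed = sum(cm[i][i] for i in range(n)) / total
    row_totals = [sum(row) for row in cm]
    col_totals = [sum(cm[i][j] for i in range(n)) for j in range(n)]
    # Agreement expected by chance from the row and column marginals.
    expected = sum(r * c for r, c in zip(row_totals, col_totals)) / total ** 2
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    kappa = (observed - expected) / (1.0 - expected)
    return observed, kappa
```

For the hypothetical matrix `[[45, 5], [10, 40]]`, observed agreement is 0.85 and chance agreement is 0.5, so kappa is 0.7. Kappa is lower than overall accuracy because it discounts the agreement that would occur by chance.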
Funding: The National Natural Science Foundation of China (No. 61627810), the National Science and Technology Major Program of China (No. 2018YFB1305003), and the National Defense Science and Technology Outstanding Youth Science Foundation (No. 2017-JCJQ-ZQ-031).
Abstract: The traveling salesman problem (TSP) is a classic non-deterministic polynomial-hard optimization problem. Based on the characteristics of the self-organizing mapping (SOM) network, this paper proposes an improved SOM network from the perspectives of network update strategy, initialization method, and parameter selection. This paper compares the performance of the proposed algorithms with that of existing SOM network algorithms on the TSP and with several heuristic algorithms. Simulations show that, compared with existing SOM networks, the improved SOM network proposed in this paper improves the convergence rate and algorithm accuracy. Compared with iterated local search and heuristic algorithms, the improved SOM network algorithms proposed in this paper have the advantage of fast calculation speed on medium-scale TSP.
Funding: Supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. NRF-2021R1A5A8033165) and by the “Human Resources Program in Energy Technology” of the Korea Institute of Energy Technology Evaluation and Planning (KETEP), granted financial resources from the Ministry of Trade, Industry & Energy, Republic of Korea (No. 20214000000200).
Abstract: Recently, machine learning-based technologies have been developed to automate the classification of wafer map defect patterns during semiconductor manufacturing. The existing approaches used in wafer map pattern classification include directly learning the image through a convolutional neural network and applying the ensemble method after extracting image features. This study aims to classify wafer map defects more effectively and derive robust algorithms even for datasets with insufficient defect patterns. First, the number of defects during the actual process may be limited. Therefore, insufficient data are generated using a convolutional auto-encoder (CAE), and the expanded data are verified using the structural similarity index measure (SSIM) as the evaluation technique. After extracting handcrafted features, a boosted stacking ensemble model that integrates four base-level classifiers with the extreme gradient boosting classifier as a meta-level classifier is designed and built, and the model is trained on the expanded data for final prediction. Since the proposed algorithm performs better than existing ensemble classifiers even for insufficient defect patterns, the results of this study will contribute to improving the product quality and yield of the actual semiconductor manufacturing process.
Abstract: To solve the mapping problem for mobile robots in unknown environments, a dynamic growing self-organizing map with automatic growing-threshold tuning (DGSOMGT), based on the self-organizing map, is proposed. It introduces a spread factor to describe the changing process of the growing threshold dynamically. The method grows the network structure by training continually as the mobile robot moves through the unknown environment. The proposed algorithm is based on the self-organizing map and can adjust the growing-threshold value as the number of network neurons increases. It avoids repeated manual tuning of the parameters. The experimental results show that the proposed method detects the complex environment quickly, effectively and correctly. The robot can realize environment mapping automatically. Compared with other methods, the proposed mapping strategy has better topological properties and time performance.
Funding: Researcher Supporting Project number (RSPD2025R582), King Saud University, Riyadh, Saudi Arabia.
Abstract: Intrusion attempts against Internet of Things (IoT) devices have significantly increased in the last few years. These devices are now easy targets for hackers because of their built-in security flaws. Combining a Self-Organizing Map (SOM) hybrid anomaly detection system for dimensionality reduction with the inherited nature of clustering and Extreme Gradient Boosting (XGBoost) for multi-class classification can improve network traffic intrusion detection. The proposed model is evaluated on the NSL-KDD dataset. The hybrid approach outperforms the baseline models, the multilayer perceptron model, and the SOM-KNN (k-nearest neighbors) model in precision, recall, and F1-score, highlighting the proposed approach's scalability, potential, adaptability, and real-world applicability. Therefore, this paper proposes a highly efficient deployment strategy for resource-constrained network edges. The results reveal that precision, recall, and F1-scores rise 10%-30% for the benign, probing, and Denial of Service (DoS) classes. In particular, the DoS, probe, and benign classes improved their F1-scores by 7.91%, 32.62%, and 12.45%, respectively.
Abstract: Flower image classification is a fine-grained classification problem. The main difficulty of fine-grained classification is the large inter-class similarity and intra-class difference. In this paper, we propose a new algorithm based on a saliency map and PCANet to overcome this difficulty. The algorithm consists of two parts: flower region selection and flower feature learning. In the first part, we combine the saliency map with the gray-scale map to select the flower region. In the second part, we use the flower region as input to train PCANet, a simple deep learning network that learns flower features automatically; a 102-way softmax layer that follows the PCANet then performs classification. Our approach achieves 84.12% accuracy on the Oxford 17 Flowers dataset. The results show that a combination of a saliency map and the simple deep learning network PCANet can be applied to the flower image classification problem.
Funding: Supported by the National Natural Science Foundation of China [grant number 41930104] and by the Research Grants Council of Hong Kong [grant number PolyU 152219/18E].
Abstract: Information on Land Use and Land Cover Map (LULCM) is essential for environmental and socioeconomic applications. Such maps are generally derived from Multispectral Remote Sensing Images (MRSI) via classification. The classification process can be described as information flow from images to maps through a trained classifier. Characterizing the information flow is essential for understanding the classification mechanism, providing solutions that address such theoretical issues as “what is the maximum number of classes that can be classified from a given MRSI?” and “how much information gain can be obtained?” Consequently, two interesting questions naturally arise: (i) How can we characterize the information flow? and (ii) What is the mathematical form of the information flow? To answer these two questions, this study first hypothesizes that thermodynamic entropy is the appropriate measure of information for both MRSI and LULCM. This hypothesis is then supported by kinetic-theory-based experiments. Thereafter, based on this entropy, a generalized Jarzynski equation is formulated to mathematically model the information flow; it contains such parameters as the thermodynamic entropy of MRSI, the thermodynamic entropy of LULCM, the weighted F1-score (classification accuracy), and the total number of classes. This generalized Jarzynski equation has been successfully validated by hypothesis-driven experiments in which 694 Sentinel-2 images are classified into 10 classes by four classical classifiers. This study provides a way of linking thermodynamic laws and concepts to the characterization and understanding of information flow in land cover classification, opening a new door for constructing domain knowledge.
Abstract: Map recognition is an essential means of data input for Geographic Information Systems (GIS). Solving the problems in this procedure, such as recognition of maps with crisscross pipeline networks, classification of buildings and roads, and processing of connected text, is critical for the continued rapid development of GIS. In this paper, a new recognition method for pipeline maps is presented, and some common patterns of pipeline connection and component labels are established. Through pattern matching, pipelines and component labels are recognized and peeled off from maps. After this step, the maps consist simply of buildings and roads, which are recognized and classified with a fuzzy classification method. In addition, the Double Sides Scan (DSS) technique is also described, through which the effect of connected text can be eliminated.
Funding: Supported by the Romanian Government through a doctoral scholarship
Abstract: There is increasing interest in developing automatic procedures to segment landscapes into soil spatial entities that replace conventional, expensive manual procedures for delineating and classifying soils. Geographic object-based image analysis (GEOBIA) partitions remote sensing imagery or digital elevation models into homogeneous image objects based on image segmentation. We used an object-based methodology for the detailed delineation and classification of soil types using digital maps of topography and vegetation as soil covariates, based on the Random Forests (RF) classifier. We compared the object-based method's results with those of a pixel-based classification using the same classifier. We used 18 digital elevation model derivatives and 5 remote sensing indices that were related to vegetation cover and soil. Using 171 soil profiles with their associated environmental variable values, the RF method was used to identify the most important soil type predictors for use in the segmentation process. A stack of raster geodatasets corresponding to the selected predictors was segmented using a multi-resolution segmentation algorithm, which resulted in homogeneous objects related to soil types. These objects were further classified as soil types using the same RF method. We also conducted a pixel-based classification using the same classifier and soil profiles, and the resulting maps were assessed in terms of their accuracy using 30% of the soil profiles for validation. We found that GEOBIA was an effective method for soil type mapping and was superior to the pixel-based approach. The optimized object-based soil map had an overall accuracy of 58%, which was 10% higher than that of the optimized pixel-based map.