South America's climatic diversity is a product of its vast geographical expanse, which spans tropical to subtropical latitudes. The variations in precipitation and temperature across the region stem from the influence of distinct atmospheric systems. While some studies have characterized the prevailing systems over South America, they often did not apply statistical techniques for homogenization. Other research has employed multivariate statistical methods to identify homogeneous regions of temperature and precipitation, but only for specific areas such as the south, southeast, and northeast. There is a lack of work comparing multivariate statistical techniques for determining homogeneous temperature and precipitation regions across the whole of South America. This paper aims to address this gap by comparing three such techniques, two cluster-analysis methods (K-means and Ward) and Self-Organizing Maps, using data from different sources for temperature (ERA5, ERA5-Land, and CRU) and precipitation (ERA5, ERA5-Land, and CPC). Spatial patterns and time series were generated for each region over the period 1981-2010. The resulting spatially homogeneous regions have the potential to significantly benefit climate analysis and forecasts. Moreover, they can offer valuable insights for climatological studies and guide decision-making in fields that rely on climate information, such as agriculture, disaster management, and water resources planning.
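As a rough illustration of how one of the compared techniques partitions grid cells into homogeneous regions, the following is a minimal K-means sketch. The six grid cells, their (temperature, precipitation) values, and the deterministic farthest-point initialization are all assumptions made for this example; the abstract does not describe the authors' preprocessing or initialization.

```python
def sq_dist(p, q):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def standardize(rows):
    """Z-score each column so temperature and precipitation share a scale."""
    cols = list(zip(*rows))
    means = [sum(c) / len(c) for c in cols]
    stds = [(sum((v - m) ** 2 for v in c) / len(c)) ** 0.5 or 1.0
            for c, m in zip(cols, means)]
    return [tuple((v - m) / s for v, m, s in zip(r, means, stds)) for r in rows]


def kmeans(points, k, max_iter=100):
    """Lloyd's algorithm with a deterministic farthest-point start (an assumption)."""
    centers = [points[0]]
    while len(centers) < k:
        # Next center: the point farthest from all centers chosen so far.
        centers.append(max(points, key=lambda p: min(sq_dist(p, c) for c in centers)))
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest center.
        new_labels = [min(range(k), key=lambda j: sq_dist(p, centers[j]))
                      for p in points]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: each center moves to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centers[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return labels, centers


# Hypothetical grid cells: (mean temperature in deg C, annual precipitation in mm).
cells = [(26, 2200), (27, 2400), (25, 2100), (10, 300), (12, 400), (9, 250)]
labels, centers = kmeans(standardize(cells), k=2)
print(labels)
```

Standardizing first keeps precipitation (hundreds to thousands of mm) from dominating temperature in the distance; whether the paper standardizes its inputs is not stated.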
Detection of Autism Spectrum Disorder (ASD) is a crucial area of research and a foundational aspect of psychological studies. The advancement of technology and the widespread adoption of machine learning have brought significant attention to this field in recent years, and interdisciplinary efforts have further propelled research into detection methods. This study therefore aims to contribute to both psychology and computer science. Specifically, the goal is to apply machine learning techniques to limited data for the detection of ASD. The study is structured into two phases: data preprocessing and classification. In the preprocessing phase, four datasets (Toddler, Children, Adolescent, and Adult) were converted into numerical form, adjusted as necessary, and then clustered. Clustering was performed using six methods: K-means, agglomerative, DBSCAN (Density-Based Spatial Clustering of Applications with Noise), mean shift, spectral, and Birch. In the second phase, the clustered ASD data were classified using ten machine learning algorithms, and model accuracy was assessed with 5-fold cross-validation. The findings indicate that all clustering methods were successful with various classifiers. Notably, the K-means algorithm was particularly effective, achieving consistent and significant results across all datasets. This study is expected to serve as a guide for improving ASD detection performance, even when little data is available.
The use of metal oxides has been extensively documented in the literature and applied in a variety of contexts, including energy storage, chemical sensors, and biomedical applications. One of the most significant applications of metal oxides is heterogeneous catalysis, a pivotal technology in industrial production worldwide. Catalysts are the primary enabling agents for chemical reactions, and among the many available catalysts, metal oxides including magnesium oxide (MgO), ceria (CeO2), and titania (TiO2) have been found to be particularly effective in catalyzing a variety of reactions [1]. Theoretical calculations based on density functional theory (DFT) and other quantum chemistry methods have proven invaluable in elucidating the mechanisms of metal-oxide-catalyzed reactions, thereby facilitating the design of high-performance catalysts [2].
Web data extraction has become a key technology for extracting valuable data from websites. At present, most extraction methods based on rule learning, visual patterns, or tree matching have limited performance on complex web pages. By analyzing various statistical characteristics of HTML elements in web documents, this paper proposes an unsupervised web data extraction method based on statistical features. The method first traverses the HTML DOM parse tree, then calculates and generates the statistical matrix of the elements, and finally locates data records using a clustering method and heuristic rules that reveal the inherent links between the visual characteristics of the data record areas and the statistical characteristics of the HTML nodes. The method is suitable for extracting data records from both single pages and multiple pages, has strong generality, and needs no training. The experiments show that both the accuracy and the efficiency of this method are better than those of current data extraction methods.
Performing analytics on the load curve (LC) of customers is the foundation of demand response, which requires a better understanding of customers' consumption patterns (CPs). However, previous widely used LC clustering methods perform poorly in two respects: they produce a large number of clusters, and the variance within each cluster is large (a CP is extracted from each cluster), which makes it very difficult to understand customers' electricity consumption patterns. In this paper, to improve the performance of LC clustering, a clustering framework that incorporates community detection is proposed. The framework includes three parts: network construction, community detection, and CP extraction. According to the cluster validity index (CVI), the integrated approach outperforms the previous state-of-the-art method with the same number of clusters, and it needs fewer clusters to achieve the same CVI performance.
Neutron shielding materials face an imbalance among shielding, strength, and ductility properties. Based on the requirement for high-performance shielding particles, a superior semi-coherent τ (Al4MgGd) phase was designed and predicted by the cluster expansion (CE) method using density functional theory calculations. To realize its shielding property, powder-metallurgy routes (i.e., powder fabrication, spark plasma sintering, and hot extrusion) were used to fabricate a 6TiB2/Al-6Mg-5Gd (wt.%) composite with dispersed, refined τ phases and a homogenized TiB2 distribution. The atomic structure of the ternary τ phase was examined by aberration-corrected high-angle annular dark-field (HAADF) scanning transmission electron microscopy (STEM) and energy dispersive X-ray spectroscopy (EDXS) STEM experiments, and it agrees well with the calculated compound (Al4MgGd). In detail, the τ (Al4MgGd) phase has a semi-coherent interface with both α-Al and TiB2, which is consistent with the predicted interface relationships. With the optimized interfaces, the TiB2 and τ phases effectively promote recrystallization and suppress grain growth, leading to the formation of an ultra-fine grain structure. The composite exhibits advanced shielding properties (macroscopic transmission cross section ~24.1/cm, higher than that of 30% B4C/Al) and optimized synergic mechanical properties (ultimate tensile strength ~506 MPa, elongation ~12.9%), which are far higher than those of available Al-based neutron shielding materials. Finally, the underlying strength-ductility mechanisms are discussed. Critically, the design and optimization of shielding particle interfaces are reliable strategies for developing novel structural-functional integrated materials.
We study the thermodynamic properties of the classical one-dimensional generalized nonlinear Klein-Gordon lattice model (n ≥ 2) by using the cluster variation method with linear response theory. The results of this method are exact in the thermodynamic limit. We present the single-site reduced density ρ^(1)(z), averages such as ⟨z^2⟩, ⟨|z^n|⟩, and ⟨(z_1 − z_2)^2⟩, the specific heat C_v, and the static correlation functions. We analyze the scaling behavior of these quantities and obtain the exact scaling powers at low and high temperatures. Using these results, we gauge the accuracy of the projective truncation approximation for the φ^4 lattice model.
In this study, we used the multi-resolution graph-based clustering (MRGC) method to determine the electrofacies (EF) and lithofacies (LF) from well log data obtained from the intraplatform bank gas fields located in the Amu Darya Basin. The MRGC could automatically determine the optimal number of clusters without prior knowledge of the structure or number of clusters in the analyzed data set, and it allowed users to control the level of detail needed to define the EF. Based on the LF identification and successful EF calibration using core data, an MRGC EF partition model with five clusters and a quantitative LF interpretation chart were constructed. EF clusters 1 to 5 were interpreted as lagoon, anhydrite flat, interbank, low-energy bank, and high-energy bank, respectively, and the coincidence rate in the cored interval reached 85%. We concluded that the MRGC could be accurately applied to predict the LF in non-cored but logged wells. Therefore, continuous EF clusters were partitioned and the corresponding LF were interpreted, and the distribution and petrophysical characteristics of the different LF were analyzed in the framework of sequence stratigraphy.
[Objective] The aim was to assess regional eco-environmental quality by means of a grey clustering method based on normalized index values. [Method] Eco-environmental quality in the Chaohu basin was assessed using the grey clustering method based on normalized index values, and the evaluation results were compared with those of the unascertained measure method to verify the feasibility of using grey clustering to evaluate regional eco-environmental quality. [Result] In the grey clustering assessment method based on normalized index values, indices whose standard normalized values in the same grade were close to each other were grouped into one class and shared the same whitening function, which reduced the number of whitening functions. The evaluation results for the Chaohu basin were basically in accordance with those of the unascertained measure method: eco-environmental quality in Hefei, Chaohu, and Lu'an belonged to the third (pass), fourth (worse), and fifth (bad) grades, respectively, with a difference of one grade only for the basin as a whole. The results showed that the method is practical and can be applied to assess regional eco-environmental quality. [Conclusion] The study could provide a theoretical foundation for establishing comprehensive management countermeasures for the regional ecological environment.
A new recommendation method based on memetic algorithm-based clustering is presented. The proposed method was tested on four highly sparse real-world datasets. Its recommendation performance was evaluated and compared with that of the frequency-based, user-based, item-based, k-means clustering-based, and genetic algorithm-based methods in terms of precision, recall, and F1 score. The results show that the proposed method yields better performance under the new-user cold-start problem, when each new active user selects only one or two items into the basket. The average F1 scores on all four datasets are improved by 225.0%, 61.6%, 54.6%, 49.3%, 28.8%, and 6.3% over the frequency-based, user-based, item-based, k-means clustering-based, and two genetic algorithm-based methods, respectively.
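For readers unfamiliar with the metrics named above, the sketch below computes precision, recall, and F1 for a single recommendation list. The item names and the two F1 values passed to `relative_improvement` are hypothetical, and reading the reported percentages as relative improvements over a baseline is an assumption; the abstract does not define them.

```python
def precision_recall_f1(recommended, relevant):
    """Precision, recall and F1 of one recommendation list against the relevant items."""
    recommended, relevant = set(recommended), set(relevant)
    hits = len(recommended & relevant)
    precision = hits / len(recommended) if recommended else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    # F1 is the harmonic mean of precision and recall (0 when there are no hits).
    f1 = 2 * precision * recall / (precision + recall) if hits else 0.0
    return precision, recall, f1


def relative_improvement(new, baseline):
    """Percentage gain of `new` over `baseline` (assumed meaning of 'improved by x%')."""
    return 100.0 * (new - baseline) / baseline


# Hypothetical basket: four recommended items, three truly relevant ones.
p, r, f1 = precision_recall_f1(["a", "b", "c", "d"], ["b", "d", "e"])
print(p, r, f1)

# Hypothetical F1 scores: a method at 0.13 against a baseline at 0.04.
print(relative_improvement(0.13, 0.04))
```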
Knowledge of bubble profiles in gas-liquid two-phase flows is crucial for analyzing kinetic processes such as heat and mass transfer, and this knowledge is contained in the field data obtained by surface-resolved computational fluid dynamics (CFD) simulations. To obtain this information, an efficient bubble profile reconstruction method based on an improved agglomerative hierarchical clustering (AHC) algorithm is proposed in this paper. The reconstruction method features a binary space division preprocessing step, which reduces the computational complexity; an adaptive linkage criterion, which guarantees the applicability of the AHC algorithm to datasets involving non-uniform or distorted grids; and a stepwise execution strategy, which enables the separation of attached bubbles. To illustrate and verify the method, it was applied to three datasets, two with pre-specified spherical bubbles and one obtained by a surface-resolved CFD simulation. The results indicate that the proposed method is effective even when the data involve non-uniform or distorted grids.
To make the quantitative results of nuclear magnetic resonance (NMR) transverse relaxation (T2) spectra reflect the reservoir type and pore structure more directly, an unsupervised clustering method based on the Gaussian mixture model (GMM) was developed to obtain quantitative pore structure information from NMR T2 spectra. First, principal component analysis was conducted on the T2 spectra to reduce the dimensionality of the data and the dependence among the original variables. Second, the dimension-reduced data were fitted using the GMM probability density function, and the model parameters and optimal number of clusters were obtained according to the expectation-maximization algorithm and the change in the Akaike information criterion. Finally, the T2 spectrum features and pore structure types of the different clustering groups were analyzed and compared with the T2 geometric mean and T2 arithmetic mean. The effectiveness of the algorithm has been verified by numerical simulation and field NMR logging data. The research shows that the clustering results of the GMM method correlate well with the shape and distribution of the T2 spectrum, the pore structure, and petroleum productivity, providing a new means for quantitative identification of pore structure, reservoir grading, and oil and gas productivity evaluation.
In this study, we apply the single-link cluster (SLC) method to analyze the characteristics of earthquake clustering in Japan. Among clustering algorithms, the SLC method is effective in characterizing earthquake clusters or isolated events at both global and local scales. The results indicate that the link lengths for the whole investigated area in and around Japan follow a negative exponential function or a Gamma distribution. In addition, some differences are revealed among different areas: for example, link lengths in the area of the Japan Trench and the Kuril Trench are shorter than those in other areas, and the densest spatial distribution of the SLC framework is also found in the area of the Japan Trench and the Kuril Trench. Closer investigation indicates that the a and 1/θ values, estimated from the exponential function and the Gamma distribution respectively, may relate to spatial clustering, which is supported by the distribution of link lengths and the spatial distribution of the SLC framework in the investigated areas.
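The idea of a framework of links whose lengths are then analyzed can be sketched with a minimum spanning tree, which is the standard structure underlying single-linkage grouping. This is a simplified illustration, not the study's procedure: the six planar points, the Euclidean distance (standing in for whatever space or space-time metric the study uses), and the cut-off length of 5 are all assumptions.

```python
import math


def single_link_framework(points):
    """Return the N-1 minimum-spanning-tree links as (length, i, j), via Prim's algorithm."""
    n = len(points)
    if n < 2:
        return []
    in_tree = [False] * n
    best = [math.inf] * n   # shortest known link from the tree to each point
    parent = [-1] * n       # tree endpoint of that shortest link
    best[0] = 0.0
    links = []
    for _ in range(n):
        # Attach the not-yet-linked point that is closest to the tree.
        i = min((j for j in range(n) if not in_tree[j]), key=lambda j: best[j])
        in_tree[i] = True
        if parent[i] >= 0:
            links.append((best[i], parent[i], i))
        for j in range(n):
            if not in_tree[j]:
                d = math.dist(points[i], points[j])
                if d < best[j]:
                    best[j], parent[j] = d, i
    return links


def cut_into_clusters(n, links, max_length):
    """Group points connected by links no longer than max_length (union-find)."""
    root = list(range(n))

    def find(x):
        while root[x] != x:
            root[x] = root[root[x]]  # path compression
            x = root[x]
        return x

    for length, i, j in links:
        if length <= max_length:
            root[find(i)] = find(j)
    groups = {}
    for i in range(n):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


# Hypothetical epicenters on a plane (arbitrary units).
events = [(0, 0), (1, 0), (0, 1), (10, 10), (11, 10), (30, 30)]
links = single_link_framework(events)
clusters = cut_into_clusters(len(events), links, max_length=5.0)
print(sorted(round(length, 2) for length, _, _ in links))
print(clusters)
```

The sorted link lengths are the quantity whose distribution would be fitted with an exponential or Gamma form; a point left alone after the cut corresponds to an isolated event.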
A total of 10 indices of regional economic development in Guangxi are selected. Using the relevant economic data, regional economic development in Guangxi is analyzed with the System Clustering Method and the Principal Component Analysis Method. The two methods give similar results for the level of economic development. The overall economic strength of Guangxi is weak, and Nanning has relatively high factor scores owing to its position as the political, economic, and cultural center. The comprehensive scores of the other regions are all lower than 1, showing a large gap with the development of Nanning. The overall development strategy indicates that Guangxi should accelerate the construction of the Ring Northern Bay Economic Zone, create a strong logistics system of strategic significance to national development, and use its unique location advantage and the modern transportation system to establish a logistics and business center connecting the hinterland and the ASEAN market. To address the unbalanced regional economic development in Guangxi, we should speed up the development of the service industry in Nanning, construct a circular economy system for industrial cities, and accelerate the industrialization of tourism cities in order to achieve balanced regional economic development in Guangxi, China.
The cluster analysis method needs continuous improvement in the research and application of the spatial differentiation and change of pollutants. In this paper, the date of the monthly highest ozone (O3) concentration and the concentration value on that day were selected as the similarity coefficient between classes. Single-factor cluster analysis was performed on O3 during 2016-2019 and during the COVID-19 outbreak of 2020 in Hunan Province using the Ward method. The clustering results showed that the spatial distribution of atmospheric O3 in the 14 regions of Hunan Province was most suitably classified according to class III clustering areas. That is, the high-value area was in northern Hunan, centered on the Changsha-Zhuzhou-Xiangtan urban agglomeration; the transition area was in central and southern Hunan; and the low-value area was centered in western Hunan. The partition results were in good agreement with the homogeneous subsets of one-way ANOVA and with the distribution of monitoring values during the same period. The comparison showed that the inter-class plates in the two periods corresponded well, that the intra-class areas showed a continuous geographical distribution, and that the spatial differentiation of the O3 plates changed dynamically between periods. In 2020, the center of the O3 high-value plate in Hunan Province moved eastward and extended southward, focusing on the middle and lower reaches of the Xiangjiang River basin and extending to the upstream area; the transition-area plate expanded significantly; and the low-value plate shrank to the two cities in western Hunan. The abnormal emissions and abnormal climate during the COVID-19 epidemic had an impact on the spatial differentiation of O3 in Hunan Province.
In this paper, we study a novel natural image reconstruction and representation algorithm based on clustering and a modified neural network. Image resolution enhancement is one of the earliest research topics in single-image interpolation. Although traditional interpolation methods for single-image magnification are effective, they do not provide additional useful information. Our method combines the neural network with the clustering approach. The experiments show that our method performs well and gives satisfactory results.
In the field of data mining and machine learning, clustering is a typical problem that has been widely studied, and many effective algorithms have been proposed, including K-means, fuzzy c-means (FCM), and DBSCAN. However, traditional clustering methods are easily trapped in local optima, so many evolutionary clustering methods have been investigated. Considering the effectiveness of brain storm optimization (BSO) in increasing diversity during optimization, in this paper we propose a new clustering model based on BSO that exploits its global search ability. In our experiments, we apply the novel binary model to solve the problem. During data processing, BSO was mainly used for iteration, and in the K-means step we selected more appropriate parameters to match it. Four datasets were used in our experiments. In our model, BSO is introduced for the first time to solve the clustering problem. With the algorithm run repeatedly on each dataset, the experimental results show good convergence and diversity. In addition, a comparison with other clustering models shows that the BSO clustering model also achieves high accuracy. The simulation results therefore show that the proposed model performs well in several respects.
Nanometer-sized metal clusters were prepared inside single-crystalline MgO films by vacuum co-deposition of metals and MgO. The atomic structure was studied by high-resolution electron microscopy (HREM) and nm-area electron diffraction. The clusters range in size from 1 nm to 3 nm, with none larger than 5 nm, and most of them have definite epitaxial orientations with respect to the MgO matrix films. This character makes the composite films very useful for studying various kinds of anisotropic physical properties. Physical properties such as electric transport, magnetic properties, optical absorption, sintering, and catalytic properties were therefore measured, using high-sensitivity apparatus, on the same samples analyzed by HREM, with the aim of clarifying the relationship between the atomic structure and the physical properties.
With the development of green data centers, a large number of Uninterruptible Power Supply (UPS) resources in Internet Data Centers (IDCs) are becoming idle assets owing to their low utilization rate. The revitalization of these idle UPS resources is an urgent problem that must be addressed. Based on the energy storage type of UPS (EUPS) and the use of renewable sources, a solution for IDCs is proposed in this study. An EUPS cluster classification method based on the concept of shared mechanism niche (CSMN) is proposed to effectively solve the EUPS control problem. The classified EUPS aggregation units are then used to determine the optimal operation of the IDC. An IDC cost minimization model was established and solved with the Quantum Particle Swarm Optimization (QPSO) algorithm. Finally, the economy and effectiveness of the three-tier optimization framework and model were verified through three case studies.
Based on the Regional Specialized Meteorological Center (RSMC) Tokyo-Typhoon Center best-track data and the NCEP-NCAR reanalysis dataset, extratropical transitioning (ET) tropical cyclones (ETCs) over the western North Pacific (WNP) during 1951–2021 are classified into six clusters according to their track patterns using the fuzzy c-means clustering method (FCM). The characteristics of the six hard-clustered groups of ETCs, assigned by highest membership coefficient, are shown. Most tropical cyclones (TCs) assigned to clusters C2, C5, and C6 made landfall over eastern Asian countries, severely threatening these regions. Among landfalling TCs, 93.2% completed their ET after landfall, whereas 39.8% of ETCs completed their transition within one day. The frequency of ETCs over the WNP has decreased in the past four decades, with cluster C5 showing a significant decrease on both interannual and interdecadal timescales as the western Pacific subtropical high (WPSH) expanded and intensified. This large-scale circulation pattern is favorable for C2 and makes it the dominant track pattern, owing to it containing the largest number of intensifying ETCs among the six clusters, a number that has increased insignificantly over the past four decades. The surface roughness variation and three-dimensional background circulation led to C5 containing the maximum number of landfalling TCs and the minimum number of intensifying ETCs. Our results will facilitate a better understanding of the spatiotemporal distributions of ET events and the associated environmental background fields, which will benefit the effective monitoring of these events over the WNP.
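The step from fuzzy membership coefficients to hard clusters can be sketched with a small fuzzy c-means implementation. This is only an illustration under stated assumptions: the six 2-D points stand in for whatever feature vectors encode a track (the abstract does not say how tracks are represented), and the fuzzifier m = 2, the random initialization, and the seed are choices made for this example.

```python
import math
import random


def fuzzy_c_means(points, c, m=2.0, max_iter=200, tol=1e-6, seed=0):
    """Basic fuzzy c-means; returns (membership matrix, cluster centers)."""
    rng = random.Random(seed)
    dims = len(points[0])
    # Random initial memberships, each row normalized to sum to 1.
    u = []
    for _ in points:
        row = [rng.random() + 1e-9 for _ in range(c)]
        total = sum(row)
        u.append([v / total for v in row])
    centers = []
    for _ in range(max_iter):
        # Centers are means weighted by membership raised to the power m.
        centers = []
        for k in range(c):
            w = [row[k] ** m for row in u]
            total = sum(w)
            centers.append(tuple(
                sum(wi * p[d] for wi, p in zip(w, points)) / total
                for d in range(dims)))
        # Memberships are proportional to distance ** (-2 / (m - 1)).
        new_u = []
        for p in points:
            dist = [max(math.dist(p, ctr), 1e-12) for ctr in centers]
            inv = [dk ** (-2.0 / (m - 1.0)) for dk in dist]
            total = sum(inv)
            new_u.append([v / total for v in inv])
        shift = max(abs(a - b) for ra, rb in zip(u, new_u) for a, b in zip(ra, rb))
        u = new_u
        if shift < tol:
            break
    return u, centers


# Hypothetical feature vectors forming two well-separated groups.
tracks = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
u, centers = fuzzy_c_means(tracks, c=2)
# Hard clustering: each sample goes to the cluster with its highest membership.
hard = [max(range(2), key=row.__getitem__) for row in u]
print(hard)
```

Which cluster index each group receives depends on the random start, so only the grouping, not the numbering, is meaningful.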
Funding: Financial support from the National Key R&D Program of China (2021YFB3500700) and the National Natural Science Foundation of China (22473042, 22003016, and 92145302).
文摘The use of metal oxides has been extensively documented in the literature and applied in a variety of contexts,including but not limited to energy storage,chemical sensors,and biomedical applications.One of the most significant applications of metal oxides is heterogeneous catalysis,which represents a pivotal technology in industrial production on a global scale.Catalysts serve as the primary enabling agents for chemical reactions,and among the plethora of catalysts,metal oxides including magnesium oxide(MgO),ceria(CeO_(2))and titania(TiO_(2)),have been identified to be particularly effective in catalyzing a variety of reactions[1].Theoretical calculations based on density functional theory(DFT)and a multitude of other quantum chemistry methods have proven invaluable in elucidating the mechanisms of metal-oxide-catalyzed reactions,thereby facilitating the design of high-performance catalysts[2].
Abstract: Web data extraction has become a key technology for extracting valuable data from websites. At present, most extraction methods based on rule learning, visual patterns, or tree matching have limited performance on complex web pages. By analyzing various statistical characteristics of HTML elements in web documents, this paper proposes an unsupervised web data extraction method based on statistical features. The method first traverses the HTML DOM parse tree, calculates and generates the statistical matrix of the elements, and then locates data records using a clustering method and heuristic rules that reveal inherent links between the visual characteristics of the data record areas and the statistical characteristics of the HTML nodes. It is suitable for extracting data records from both single pages and multiple pages, generalizes well, and needs no training. The experiments show that the accuracy and efficiency of this method are both better than those of current data extraction methods.
Funding: Supported by the Major Program of National Natural Science Foundation of China (No. 61432006).
Abstract: Performing analytics on customers' load curves (LCs) is the foundation of demand response, which requires a better understanding of customers' consumption patterns (CPs). However, previous widely used LC clustering methods perform poorly in two respects: they produce a large number of clusters and large variance within each cluster (a CP is extracted from a cluster), which makes customers' electricity consumption patterns hard to understand. In this paper, to improve the performance of LC clustering, a clustering framework incorporating community detection is proposed. The framework includes three parts: network construction, community detection, and CP extraction. According to the cluster validity index (CVI), the integrated approach outperforms the previous state-of-the-art method with the same number of clusters, and it needs fewer clusters to achieve the same performance as measured by CVI.
Funding: Financially supported by the National Key Research and Development Program of China (Grant No. 2021YFA1600900), the National Natural Science Foundation of China (Grant Nos. U22A20174, 52071207, 51971137, 11875192, and U1930101), and the Natural Science Foundation of Shanghai (China, Grant No. 22ZR1432800).
Abstract: Neutron shielding materials face a trade-off among shielding, strength, and ductility. To meet the requirement for high-performance shielding particles, a superior semi-coherent τ (Al₄MgGd) phase was designed and predicted by the cluster expansion (CE) method using density functional theory calculations. To realize its shielding property, powder metallurgy-based routes (i.e., powder fabrication, spark plasma sintering, and hot extrusion) were used to fabricate a 6TiB₂/Al-6Mg-5Gd (wt.%) composite with dispersed, refined τ phases and a homogenized TiB₂ distribution. The atomic structure of the ternary τ phase was examined by aberration-corrected high-angle annular dark-field (HAADF) scanning transmission electron microscopy (STEM) and energy-dispersive X-ray spectroscopy (EDXS) STEM experiments, and it agrees well with the calculated compound (Al₄MgGd). In detail, the τ (Al₄MgGd) phase has a semi-coherent interface with both α-Al and TiB₂, which is consistent with the predicted interface relationships. With the optimized interfaces, the TiB₂ and τ phases can effectively promote recrystallization and suppress grain growth, leading to the formation of an ultra-fine grain structure. As a result, the composite exhibits advanced shielding properties (macroscopic transmission cross section ~24.1/cm, higher than that of 30% B₄C/Al) and optimized synergistic mechanical properties (ultimate tensile strength ~506 MPa, elongation ~12.9%), which are far higher than those of available Al-based neutron shielding materials. Finally, the underlying strength-ductility mechanisms are discussed. Critically, the design and optimization of shielding particle interfaces are reliable strategies for developing novel structural-functional integrated materials.
Funding: Supported by the National Natural Science Foundation of China (Grant No. 11974420).
Abstract: We study the thermodynamic properties of the classical one-dimensional generalized nonlinear Klein-Gordon lattice model ($n \ge 2$) by using the cluster variation method with linear response theory. The results of this method are exact in the thermodynamic limit. We present the single-site reduced density $\rho^{(1)}(z)$, averages such as $\langle z^{2} \rangle$, $\langle |z^{n}| \rangle$, and $\langle (z_{1} - z_{2})^{2} \rangle$, the specific heat $C_{v}$, and the static correlation functions. We analyze the scaling behavior of these quantities and obtain the exact scaling powers at low and high temperatures. Using these results, we gauge the accuracy of the projective truncation approximation for the $\phi^{4}$ lattice model.
Funding: Supported by the National Science and Technology Major Project of China (No. 2011ZX05029-003) and the CNPC Science Research and Technology Development Project, China (No. 2013D-0904).
Abstract: In this study, we used the multi-resolution graph-based clustering (MRGC) method to determine the electrofacies (EF) and lithofacies (LF) from well log data obtained from the intraplatform bank gas fields located in the Amu Darya Basin. The MRGC could automatically determine the optimal number of clusters without prior knowledge about the structure or cluster numbers of the analyzed data set, and it allowed the users to control the level of detail actually needed to define the EF. Based on the LF identification and successful EF calibration using core data, an MRGC EF partition model including five clusters and a quantitative LF interpretation chart were constructed. The EF clusters 1 to 5 were interpreted as lagoon, anhydrite flat, interbank, low-energy bank, and high-energy bank, respectively, and the coincidence rate in the cored interval could reach 85%. We concluded that the MRGC could be accurately applied to predict the LF in non-cored but logged wells. Therefore, continuous EF clusters were partitioned and the corresponding LF were interpreted, and the distribution and petrophysical characteristics of different LF were analyzed in the framework of sequence stratigraphy.
Funding: Supported by the National Natural Science Foundation of China (50739002).
Abstract: [Objective] The aim was to assess regional eco-environmental quality by means of a grey clustering method based on normalized index values. [Method] Eco-environmental quality in the Chaohu basin was assessed using the grey clustering method based on normalized index values, and the evaluation results were compared with those of the unascertained measure method to verify the feasibility of using grey clustering to evaluate regional eco-environmental quality. [Result] In the grey clustering assessment method based on normalized index values, indices whose standard normalized values in the same grade were close to each other were grouped into one class and shared the same whitening function, which reduced the number of whitening functions. The evaluation results for the Chaohu basin were basically in accordance with those of the unascertained measure method: eco-environmental quality in Hefei, Chaohu, and Lu'an belonged to the third (pass), fourth (worse), and fifth (bad) grades, respectively, with a one-grade difference only for the basin as a whole. The results showed that the method is practical and can be applied to assess regional eco-environmental quality. [Conclusion] The study could provide a theoretical foundation for establishing comprehensive management countermeasures for the regional ecological environment.
Funding: Supported by a grant under the Strategic Scholarships for Frontier Research Network for the PhD Program Thai Doctoral degree.
Abstract: A new recommendation method based on memetic algorithm-based clustering is presented. The proposed method was tested on four highly sparse real-world datasets. Its recommendation performance was evaluated and compared with that of the frequency-based, user-based, item-based, k-means clustering-based, and genetic algorithm-based methods in terms of precision, recall, and F1 score. The results show that the proposed method yields better performance under the new-user cold-start problem, when each new active user places only one or two items in the basket. The average F1 scores on all four datasets are improved by 225.0%, 61.6%, 54.6%, 49.3%, 28.8%, and 6.3% over the frequency-based, user-based, item-based, k-means clustering-based, and two genetic algorithm-based methods, respectively.
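The three evaluation metrics named in this abstract, and the relative improvement used to report its gains, follow standard definitions. A minimal sketch, assuming each user's recommendations and held-out items are given as collections of item IDs (the function names and example items are hypothetical, not taken from the paper):

```python
def precision_recall_f1(recommended, relevant):
    """Return (precision, recall, F1) for one user's recommendation list."""
    recommended, relevant = set(recommended), set(relevant)
    hits = len(recommended & relevant)
    precision = hits / len(recommended) if recommended else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    # F1 is the harmonic mean of precision and recall.
    return precision, recall, 2 * precision * recall / (precision + recall)

def relative_improvement(new_score, baseline_score):
    """Percentage improvement of new_score over baseline_score."""
    return 100.0 * (new_score - baseline_score) / baseline_score

# Four recommended items, two of which are among the three relevant ones.
p, r, f1 = precision_recall_f1(["a", "b", "c", "d"], ["b", "d", "e"])
```

Here precision is 2/4, recall is 2/3, and F1 is 4/7. An improvement of 225.0% means the new F1 score is 3.25 times the baseline.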
Funding: Projects (51634010, 51676211) supported by the National Natural Science Foundation of China; Project (2017SK2253) supported by the Key Research and Development Program of Hunan Province, China.
Abstract: Knowledge of bubble profiles in gas-liquid two-phase flows is crucial for analyzing kinetic processes such as heat and mass transfer, and this knowledge is contained in field data obtained by surface-resolved computational fluid dynamics (CFD) simulations. To obtain this information, an efficient bubble profile reconstruction method based on an improved agglomerative hierarchical clustering (AHC) algorithm is proposed in this paper. The reconstruction method features three components: a binary space division preprocessing step, which aims to reduce the computational complexity; an adaptive linkage criterion, which guarantees the applicability of the AHC algorithm when dealing with datasets involving either non-uniform or distorted grids; and a stepwise execution strategy, which enables the separation of attached bubbles. To illustrate and verify this method, it was applied to three datasets, two of them with pre-specified spherical bubbles and the other obtained by a surface-resolved CFD simulation. The results indicate that the proposed method is effective even when the data include some non-uniformity and distortion.
Funding: Supported by the National Natural Science Foundation of China (42174142), the National Science and Technology Major Project (2017ZX05039-002), the Operation Fund of China National Petroleum Corporation Logging Key Laboratory (2021DQ20210107-11), the Fundamental Research Funds for Central Universities (19CX02006A), and the Major Science and Technology Project of China National Petroleum Corporation (ZD2019-183-006).
Abstract: To make the quantitative results of nuclear magnetic resonance (NMR) transverse relaxation (T₂) spectra reflect the type and pore structure of a reservoir more directly, an unsupervised clustering method based on the Gaussian mixture model (GMM) was developed to obtain quantitative pore structure information from NMR T₂ spectra. Firstly, we conducted principal component analysis on the T₂ spectra to reduce the dimensionality of the data and the dependence among the original variables. Secondly, the dimension-reduced data were fitted using the GMM probability density function, and the model parameters and optimal number of clusters were obtained according to the expectation-maximization algorithm and the change in the Akaike information criterion. Finally, the T₂ spectrum features and pore structure types of the different clustering groups were analyzed and compared with the T₂ geometric mean and T₂ arithmetic mean. The effectiveness of the algorithm has been verified by numerical simulation and field NMR logging data. The research shows that the clustering results based on the GMM method correlate well with the shape and distribution of the T₂ spectrum, pore structure, and petroleum productivity, providing a new means for quantitative identification of pore structure, reservoir grading, and oil and gas productivity evaluation.
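The fitting and model-selection steps described in this abstract (expectation-maximization for the GMM, then the Akaike information criterion to choose the number of clusters) can be illustrated in one dimension. This is a sketch under assumptions, not the authors' implementation: the input `xs` stands for a single principal-component score per spectrum, the quantile-based initialization is a simplification, and `fit_gmm_1d` is a hypothetical name.

```python
import math

def _log_normal(x, mean, var):
    """Log density of a 1-D Gaussian."""
    return -0.5 * (math.log(2 * math.pi * var) + (x - mean) ** 2 / var)

def _log_mix(x, weights, means, variances):
    """Per-component log(weight * density) and their log-sum-exp."""
    logs = [math.log(w) + _log_normal(x, mu, v)
            for w, mu, v in zip(weights, means, variances)]
    top = max(logs)
    return logs, top + math.log(sum(math.exp(l - top) for l in logs))

def fit_gmm_1d(xs, k, iters=200, var_floor=1e-6):
    """Fit a k-component 1-D GMM by EM; return (weights, means, variances, AIC)."""
    n = len(xs)  # assumes n >= k
    srt = sorted(xs)
    # Initialise each mean from one of k equal-size chunks of the sorted data.
    chunks = [srt[i * n // k:(i + 1) * n // k] for i in range(k)]
    means = [sum(c) / len(c) for c in chunks]
    centre = sum(xs) / n
    spread = max(sum((x - centre) ** 2 for x in xs) / n, var_floor)
    variances, weights = [spread] * k, [1.0 / k] * k
    for _ in range(iters):
        # E-step: responsibility of each component for each point.
        resp = []
        for x in xs:
            logs, total = _log_mix(x, weights, means, variances)
            resp.append([math.exp(l - total) for l in logs])
        # M-step: re-estimate weights, means, and variances.
        for j in range(k):
            nj = max(sum(r[j] for r in resp), 1e-12)
            weights[j] = nj / n
            means[j] = sum(r[j] * x for r, x in zip(resp, xs)) / nj
            variances[j] = max(
                sum(r[j] * (x - means[j]) ** 2 for r, x in zip(resp, xs)) / nj,
                var_floor)
    loglik = sum(_log_mix(x, weights, means, variances)[1] for x in xs)
    # A k-component 1-D mixture has 3k - 1 free parameters.
    aic = 2 * (3 * k - 1) - 2 * loglik
    return weights, means, variances, aic
```

Fitting the model for several values of `k` and keeping the one with the lowest AIC mirrors the selection rule described above. The variance floor guards against a component collapsing onto a single point.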
Abstract: In this study, we apply the single-link cluster (SLC) method to analyze the characteristics of earthquake clustering in Japan. Among the clustering algorithms, the SLC method is effective in characterizing earthquake clusters or isolated events at both global and local scales. The results indicate that the distribution of link lengths for the whole investigated area in and around Japan follows a negative exponential function or a Gamma distribution. In addition, some differences are revealed among areas: link lengths in the area of the Japan Trench and the Kuril Trench are shorter than those in other areas, and the densest spatial distribution of the SLC framework is also in the area of the Japan Trench and the Kuril Trench. Closer investigation indicates that the a and 1/θ values, estimated from the exponential function and the Gamma distribution respectively, may relate to spatial clustering, which is supported by the distribution of link lengths and the spatial distribution of the SLC framework in the investigated areas.
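The link lengths analyzed in this abstract can be illustrated with a small sketch. It assumes that the SLC framework for N events coincides with the minimum spanning tree of N - 1 links under the chosen distance measure. Plain Euclidean distance between coordinate tuples is used here as a placeholder for whatever space-time metric the study applies, and both function names are hypothetical.

```python
import math

def single_link_lengths(points):
    """Prim's algorithm: return the N-1 links of the minimum spanning tree
    as (i, j, length) tuples."""
    n = len(points)
    if n < 2:
        return []
    in_tree = [False] * n
    best = [math.inf] * n   # shortest known link from the tree to each point
    parent = [-1] * n
    best[0] = 0.0
    links = []
    for _ in range(n):
        u = min((i for i in range(n) if not in_tree[i]), key=lambda i: best[i])
        in_tree[u] = True
        if parent[u] >= 0:
            links.append((parent[u], u, best[u]))
        for v in range(n):
            if not in_tree[v]:
                d = math.dist(points[u], points[v])
                if d < best[v]:
                    best[v], parent[v] = d, u
    return links

def clusters_from_links(n, links, max_length):
    """Keep only links no longer than max_length and return the resulting
    groups of point indices (union-find)."""
    root = list(range(n))

    def find(x):
        while root[x] != x:
            root[x] = root[root[x]]
            x = root[x]
        return x

    for i, j, length in links:
        if length <= max_length:
            root[find(i)] = find(j)
    groups = {}
    for i in range(n):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())
```

A histogram of the returned lengths is the quantity that would be fitted with an exponential or Gamma distribution. Cutting long links separates clusters from isolated events.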
Abstract: A total of 10 indices of regional economic development in Guangxi are selected. Using the relevant economic data, regional economic development in Guangxi is analyzed with the System Clustering Method and the Principal Component Analysis Method. The results show that the two methods yield similar assessments of the level of economic development. The overall economic strength of Guangxi is weak, and Nanning has relatively high factor scores owing to its position as the political, economic, and cultural center. The comprehensive scores of the other regions are all lower than 1, a large gap compared with the development of Nanning. The overall development strategy indicates that Guangxi should accelerate the construction of the Ring Northern Bay Economic Zone, create a strong logistics system of strategic significance to national development, and use its unique location advantage and the modern transportation system to establish a logistics center and business center connecting the hinterland and the ASEAN market. To address the unbalanced regional economic development in Guangxi, we should speed up the development of the service industry in Nanning, construct the circular economy system of the industrial cities, and accelerate the industrialization process of the tourism cities in order to realize balanced development of the regional economy in Guangxi, China.
Abstract: The cluster analysis method needs continuous improvement in the research and application of the spatial differentiation and change of pollutants. In this paper, the date of the monthly highest concentration of ozone (O₃) and the concentration value on that day were selected as the similarity coefficient between classes. Single-factor cluster analysis was performed on O₃ during 2016-2019 and during the COVID-19 outbreak of 2020 in Hunan Province using the Ward method. The clustering results showed that the spatial distribution of atmospheric O₃ in the 14 regions of Hunan Province was best classified into three cluster areas. That is, the Changsha-Zhuzhou-Xiangtan urban agglomeration was the center, and the high-value area was in northern Hunan. The transition area was in central and southern Hunan, while the low-value area was centered in western Hunan. The partition results were in good agreement with the homogeneous subsets of one-way ANOVA and the distribution of monitoring values during the same period. The comparison showed that the inter-class plates in the two periods corresponded well, the intra-class areas showed a continuous geographical distribution, and there were dynamic changes in the spatial differentiation of the O₃ plates between periods. In 2020, the center of the O₃ high-value plate in Hunan Province moved eastward and extended southward, focusing on the middle and lower reaches of the Xiangjiang River basin and extending to the upstream area; the transition-area plate expanded significantly; and the low-value plate shrank to the two cities in western Hunan. The abnormal emissions and abnormal climate during the COVID-19 epidemic had an impact on the spatial differentiation of O₃ in Hunan Province.
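The Ward method used in this abstract merges, at each step, the two clusters whose union gives the smallest increase in within-cluster sum of squared errors. A minimal sketch of that criterion, assuming each region is represented by a numeric feature vector (the example values are invented for illustration and are not from the paper):

```python
def ward_clusters(points, k):
    """Merge clusters greedily by Ward's criterion until k clusters remain.
    Assumes 1 <= k <= len(points). Returns sorted lists of point indices."""
    # Each cluster is (member indices, centroid).
    clusters = [([i], list(p)) for i, p in enumerate(points)]
    while len(clusters) > k:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                (ma, ca), (mb, cb) = clusters[a], clusters[b]
                na, nb = len(ma), len(mb)
                sq = sum((x - y) ** 2 for x, y in zip(ca, cb))
                # Increase in within-cluster SSE if a and b are merged.
                cost = na * nb / (na + nb) * sq
                if best is None or cost < best[0]:
                    best = (cost, a, b)
        _, a, b = best
        (ma, ca), (mb, cb) = clusters[a], clusters[b]
        na, nb = len(ma), len(mb)
        centroid = [(na * x + nb * y) / (na + nb) for x, y in zip(ca, cb)]
        clusters = [c for i, c in enumerate(clusters) if i not in (a, b)]
        clusters.append((ma + mb, centroid))
    return sorted(sorted(members) for members, _ in clusters)

# Hypothetical single-factor values for six regions, grouped into three areas.
areas = ward_clusters([[50], [52], [51], [80], [82], [100]], 3)
```

With these invented values the three areas are regions 0-2, regions 3-4, and region 5. This naive version recomputes all pairwise costs at each step, which is adequate for a small number of regions such as the 14 considered here.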
Abstract: In this paper, we study a novel natural image reconstruction and representation algorithm based on clustering and a modified neural network. Image resolution enhancement is one of the earliest research topics in single-image interpolation. Although traditional interpolation methods for single-image amplification are effective, they do not provide more useful information. Our method combines the neural network and the clustering approach. The experiment shows that our method performs well and gives satisfactory results.
Funding: Supported by the Natural Science Foundation of Jiangsu Province (Grant No. BK20141005) and by the Natural Science Foundation of the Jiangsu Higher Education Institutions of China (Grant No. 14KJB520025).
Abstract: In the field of data mining and machine learning, clustering is a typical problem that has been widely studied, and many effective algorithms have been proposed, including K-means, fuzzy c-means (FCM), and DBSCAN. However, traditional clustering methods are easily trapped in local optima, so many evolutionary clustering methods have been investigated. Considering the effectiveness of brain storm optimization (BSO) in increasing diversity during optimization, in this paper we propose a new clustering model based on BSO that exploits its global search ability. In our experiment, we apply the novel binary model to solve the problem. During data processing, BSO was mainly used for iteration. In addition, for the K-means step, we selected parameters suited to it. Four datasets were used in our experiment. In our model, BSO is first introduced to solve the clustering problem. With the algorithm run repeatedly on each dataset, our experimental results show good convergence and diversity. In addition, a comparison with other clustering models shows that the BSO clustering model also achieves high accuracy. Overall, the simulation results show that the proposed model performs well.
Abstract: Nanometer-sized metal clusters were prepared inside single-crystalline MgO films by vacuum co-deposition of metals and MgO. The atomic structure was studied by high-resolution electron microscopy (HREM) and nm-area electron diffraction. The clusters range in size from 1 nm to 3 nm, with none larger than 5 nm, and most of them have definite epitaxial orientations with respect to the MgO matrix films. This character makes the composite films very useful for studying various physical properties with anisotropy. Physical properties such as electric transport, magnetic, optical absorption, sintering, and catalytic ones were thus measured with high-sensitivity apparatus on the same samples analyzed by HREM, with the aim of clarifying the relationship between the atomic structure and the physical properties.
Funding: Supported by the Key Technology Projects of the China Southern Power Grid Corporation (STKJXM20200059) and the Key Support Project of the Joint Fund of the National Natural Science Foundation of China (U22B20123).
Abstract: With the development of green data centers, a large number of Uninterruptible Power Supply (UPS) resources in Internet Data Centers (IDCs) are becoming idle assets owing to their low utilization rate. The revitalization of these idle UPS resources is an urgent problem that must be addressed. Based on the energy storage type of the UPS (EUPS) and using renewable sources, a solution for IDCs is proposed in this study. Subsequently, an EUPS cluster classification method based on the concept of shared mechanism niche (CSMN) is proposed to effectively solve the EUPS control problem. Accordingly, the classified EUPS aggregation unit is used to determine the optimal operation of the IDC. An IDC cost minimization model is established, and the Quantum Particle Swarm Optimization (QPSO) algorithm is adopted to solve it. Finally, the economy and effectiveness of the three-tier optimization framework and model are verified through three case studies.
Funding: Supported by the National Natural Science Foundation of China (Grant Nos. 42075053 and 41975128).
Abstract: Based on the Regional Specialized Meteorological Center (RSMC) Tokyo-Typhoon Center best-track data and the NCEP-NCAR reanalysis dataset, extratropical transitioning (ET) tropical cyclones (ETCs) over the western North Pacific (WNP) during 1951–2021 are classified into six clusters using the fuzzy c-means clustering method (FCM) according to their track patterns. The characteristics of the six hard-clustered ETCs with the highest membership coefficient are shown. Most tropical cyclones (TCs) that were assigned to clusters C2, C5, and C6 made landfall over eastern Asian countries, which severely threatened these regions. Among landfalling TCs, 93.2% completed their ET after landfall, whereas 39.8% of ETCs completed their transition within one day. The frequency of ETCs over the WNP has decreased in the past four decades, wherein cluster C5 demonstrated a significant decrease on both interannual and interdecadal timescales with the expansion and intensification of the western Pacific subtropical high (WPSH). This large-scale circulation pattern is favorable for C2 and causes it to become the dominant track pattern, owing to its containing the largest number of intensifying ETCs among the six clusters, a number that has increased insignificantly over the past four decades. The surface roughness variation and three-dimensional background circulation led to C5 containing the maximum number of landfalling TCs and a minimum number of intensifying ETCs. Our results will facilitate a better understanding of the spatiotemporal distributions of ET events and associated environmental background fields, which will benefit the effective monitoring of these events over the WNP.
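The fuzzy c-means step in this abstract assigns every track a membership coefficient in each cluster, and a hard cluster is then obtained by taking the highest coefficient. A minimal sketch of the standard FCM updates, assuming each track has already been reduced to a fixed-length numeric vector (how the study encodes tracks is not stated here, and the function names and parameters are hypothetical):

```python
import math
import random

def memberships(points, centers, m=2.0):
    """Membership of each point in each cluster; each row sums to 1."""
    rows = []
    for p in points:
        # Floor the distance so a point lying on a centre does not divide by zero.
        d = [max(math.dist(p, c), 1e-12) for c in centers]
        rows.append([1.0 / sum((di / dj) ** (2.0 / (m - 1.0)) for dj in d)
                     for di in d])
    return rows

def fuzzy_c_means(points, c, m=2.0, iters=100, tol=1e-6, init=None, seed=0):
    """Alternate membership and centre updates until the centres stop moving.
    `init` optionally gives starting centres; otherwise c points are sampled."""
    start = init or random.Random(seed).sample(points, c)
    centers = [list(p) for p in start]
    dim = len(points[0])
    for _ in range(iters):
        u = memberships(points, centers, m)
        new_centers = []
        for i in range(c):
            w = [row[i] ** m for row in u]   # fuzzified weights
            total = sum(w)
            new_centers.append(
                [sum(wk * p[a] for wk, p in zip(w, points)) / total
                 for a in range(dim)])
        shift = max(math.dist(a, b) for a, b in zip(centers, new_centers))
        centers = new_centers
        if shift < tol:
            break
    return centers, memberships(points, centers, m)

def harden(u):
    """Hard label per point: index of the cluster with the highest membership."""
    return [max(range(len(row)), key=row.__getitem__) for row in u]
```

The fuzzifier `m` controls how soft the assignment is; `m = 2` is a common default and is an assumption here, not a value taken from the study.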