The sea-surface height (SSH) signatures of internal tides extracted from the TOPEX/Poseidon (T/P) altimeter data along satellite tracks are fitted with superposition of several plane waves which have different wav...The sea-surface height (SSH) signatures of internal tides extracted from the TOPEX/Poseidon (T/P) altimeter data along satellite tracks are fitted with superposition of several plane waves which have different wavenumber vectors. The key problem of plane wave fitting with iterative method is how to determine the initial value of wavenumber of each plane wave. The previous solving method is to analyze the internal tidal SSH signatures along each track with wavenumber spectrum. But it is found that the problem cannot be solved completely with the wavenumber spectrum analysis method only. The method based on the combination of wavenumber spectrum analysis method and the exhaustive method is proposed to determine the initial values of wavenumbers for iteration. Numerical results indicate that the proposed method is not only reasonable and feasible but also better than the previous method. The proposed method is an improvement of the previous one, which is beneficial to improving the precision of plane wave fitting of the T/P internal tidal SSH signatures and deepening the understanding of the internal tides in ocean.展开更多
With the development of large-scale spectral surveys, fiber positioning technology has been developing rapidly. Because of the performance advantages of a four-quadrant(4Q) detector, a fiber positioning and real-tim...With the development of large-scale spectral surveys, fiber positioning technology has been developing rapidly. Because of the performance advantages of a four-quadrant(4Q) detector, a fiber positioning and real-time monitoring system based on the 4Q detector is proposed. The detection accuracy of this system is directly determined by the precision of the center of the spot. A Gaussian fitting algorithm based on the 4Q detector is studied and applied in the fiber positioning process to improve the calculated accuracy of the spot center. The relationship between the center position of the incident spot and the detector output signal is deduced. An experimental platform is built to complete the simulated experiment. Then we use the Gaussian fitting method to process experimental data, compare the fitting value with the theoretical one and calculate the corresponding error.展开更多
Solving large radial basis function (RBF) interpolation problem with non-customized methods is computationally expensive and the matrices that occur are typically badly conditioned. In order to avoid these difficult...Solving large radial basis function (RBF) interpolation problem with non-customized methods is computationally expensive and the matrices that occur are typically badly conditioned. In order to avoid these difficulties, we present a fitting based on radial basis functions satisfying side conditions by least squares, although compared with interpolation the method loses some accuracy, it reduces the computational cost largely. Since the fitting accuracy and the non-singularity of coefficient matrix in normal equation are relevant to the uniformity of chosen centers of the fitted RBE we present a choice method of uniform centers. Numerical results confirm the fitting efficiency.展开更多
Background: There has been a great interest in tracking health-related fitness across the United States. The NFL PLAY 60 FITNESSGRAM Partnership Project (NFL P60FGPP) is a large participatory research network that inv...Background: There has been a great interest in tracking health-related fitness across the United States. The NFL PLAY 60 FITNESSGRAM Partnership Project (NFL P60FGPP) is a large participatory research network that involves the surveillance of fitness among more than 1000 schools spread throughout the country. Fitness data are collected by school staff and therefore these data can vary in quality and representativeness. Therefore, careful screening procedures are needed to ensure that the data can reflect actual patterns in the schools. This study examined the impact of different data screening procedures on outcomes of aerobic fitness (AF) collected from the NFL P60FGPP. Methods: Data were compiled from 149,101 youth from 504 schools and were processed using the established age- and gender-specific AF FITNESSGRAM health-related standards. Data were subjected to three different screening procedures (based on grade size and boy-to-girl ratio per grade). Linear models were computed to obtain unadjusted and adjusted (for age, BMI-Z, and socio-economic status) estimates of % youth in the Healthy Fitness Zone (HFZ) in order to determine if, 1) there were differences in % in the HFZ and 2) if differences could be explained by changes in the representativeness of the sample due to the different data screening procedures. Results: Depending on the screening procedure used, the final sample ranged from 96,999 (no screening) to 46,572 youth (most stringent criteria). The proportion of youth achieving appropriate levels of AF ranged from 56% to 61% with unscreened data resulting in consistently lower percentages of youth achieving the standard (P < 0.05). Overall, these differences were not explained by possible changes in demographic characteristics as the result of applying different screening criteria. Conclusions: The findings demonstrate the importance of establishing appropriate screening procedures that maximize sample size while also ensuring generalizability of the findings.展开更多
The question of how to choose a copula model that best fits a given dataset is a predominant limitation of the copula approach, and the present study aims to investigate the techniques of goodness-of-fit tests for mul...The question of how to choose a copula model that best fits a given dataset is a predominant limitation of the copula approach, and the present study aims to investigate the techniques of goodness-of-fit tests for multi-dimensional copulas. A goodness-of-fit test based on Rosenblatt's transformation was mathematically expanded from two dimensions to three dimensions and procedures of a bootstrap version of the test were provided. Through stochastic copula simulation, an empirical application of historical drought data at the Lintong Gauge Station shows that the goodness-of-fit tests perform well, revealing that both trivariate Gaussian and Student t copulas are acceptable for modeling the dependence structures of the observed drought duration, severity, and peak. The goodness-of-fit tests for multi-dimensional copulas can provide further support and help a lot in the potential applications of a wider range of copulas to describe the associations of correlated hydrological variables. However, for the application of copulas with the number of dimensions larger than three, more complicated computational efforts as well as exploration and parameterization of corresponding copulas are required.展开更多
Reverse engineering in the manufacturing field is a process in which the digitized data are obtained from an existing object model or a part of it, and then the CAD model is reconstructed. This paper presents an RBF n...Reverse engineering in the manufacturing field is a process in which the digitized data are obtained from an existing object model or a part of it, and then the CAD model is reconstructed. This paper presents an RBF neural network approach to modify and fit the digitized data. The centers for the RBF are selected by using the orthogonal least squares learning algorithm. A mathematically known surface is used for generating a number of samples for training the networks. The trained networks then generated a number of new points which were compared with the calculating points from the equations. Moreover, a series of practice digitizing curves are used to test the approach. The results showed that this approach is effective in modifying and fitting digitized data and generating data points to reconstruct the surface model.展开更多
By using the method of least square linear fitting to analyze data do not exist errors under certain conditions, in order to make the linear data fitting method that can more accurately solve the relationship expressi...By using the method of least square linear fitting to analyze data do not exist errors under certain conditions, in order to make the linear data fitting method that can more accurately solve the relationship expression between the volume and quantity in scientific experiments and engineering practice, this article analyzed data error by commonly linear data fitting method, and proposed improved process of the least distance squ^re method based on least squares method. Finally, the paper discussed the advantages and disadvantages through the example analysis of two kinds of linear data fitting method, and given reasonable control conditions for its application.展开更多
我国新一代中国频谱射电日像仪(Chinese Spectral Radio Heliograph,CSRH)原始观测数据采用自定义格式,在进行后续处理与共享使用时必须转换相应的格式。在分析FITS-IDI(FITS Interferometry Data Interchange)格式的基础上,结合CSRH的...我国新一代中国频谱射电日像仪(Chinese Spectral Radio Heliograph,CSRH)原始观测数据采用自定义格式,在进行后续处理与共享使用时必须转换相应的格式。在分析FITS-IDI(FITS Interferometry Data Interchange)格式的基础上,结合CSRH的实际观测模式与数据产出方式,定义与设计了符合项目情况的FITS-IDI格式及字段,并对FITS-IDI文件中若干字段的值如何获取、计算进行了深入讨论。根据定义生成的FITS-IDI文件已成功导入CASA软件,并可以进行后续处理。经过对CASA测量集文件的核实,证明了数据生成的正确性。本研究有效地推进了CSRH的建设工作,也对其他射电干涉阵数据存储有一定的参考价值。展开更多
Based on the definition of MQ-B-Splines,this article constructs five types of univariate quasi-interpolants to non-uniformly distributed data. The error estimates and the shape-preserving properties are shown in detai...Based on the definition of MQ-B-Splines,this article constructs five types of univariate quasi-interpolants to non-uniformly distributed data. The error estimates and the shape-preserving properties are shown in details.And examples are shown to demonstrate the capacity of the quasi-interpolants for curve representation.展开更多
Due to the complex nature of multi-source geological data, it is difficult to rebuild every geological structure through a single 3D modeling method. The multi-source data interpretation method put forward in this ana...Due to the complex nature of multi-source geological data, it is difficult to rebuild every geological structure through a single 3D modeling method. The multi-source data interpretation method put forward in this analysis is based on a database-driven pattern and focuses on the discrete and irregular features of geological data. The geological data from a variety of sources covering a range of accuracy, resolution, quantity and quality are classified and integrated according to their reliability and consistency for 3D modeling. The new interpolation-approximation fitting construction algorithm of geological surfaces with the non-uniform rational B-spline(NURBS) technique is then presented. The NURBS technique can retain the balance among the requirements for accuracy, surface continuity and data storage of geological structures. Finally, four alternative 3D modeling approaches are demonstrated with reference to some examples, which are selected according to the data quantity and accuracy specification. The proposed approaches offer flexible modeling patterns for different practical engineering demands.展开更多
A successful mechanical property data-driven prediction model is the core of the optimal design of hot rolling process for hot-rolled strips. However, the original industrial data, usually unbalanced, are inevitably m...A successful mechanical property data-driven prediction model is the core of the optimal design of hot rolling process for hot-rolled strips. However, the original industrial data, usually unbalanced, are inevitably mixed with fluctuant and abnormal values. Models established on the basis of the data without data processing can cause misleading results, which cannot be used for the optimal design of hot rolling process. Thus, a method of industrial data processing of C-Mn steel was proposed based on the data analysis. The Bayesian neural network was employed to establish the reliable mechanical property prediction models for the optimal design of hot rolling process. By using the multi-objective optimization algorithm and considering the individual requirements of costumers and the constraints of the equipment, the optimal design of hot rolling process was successfully applied to the rolling process design for Q345B steel with 0.017% Nb and 0.046% Ti content removed. The optimal process design results were in good agreement with the industrial trials results, which verify the effectiveness of the optimal design of hot rolling process.展开更多
Recent advances in intelligent transportation system allow traffic safety studies to extend from historic data-based analyses to real-time applications. The study presents a new method to predict crash likelihood with...Recent advances in intelligent transportation system allow traffic safety studies to extend from historic data-based analyses to real-time applications. The study presents a new method to predict crash likelihood with traffic data collected by discrete loop detectors as well as the web-crawl weather data. Matched case-control method and support vector machines (SVMs) technique were employed to identify the risk status. The adaptive synthetic over-sampling technique was applied to solve the imbalanced dataset issues. Random forest technique was applied to select the contributing factors and avoid the over-fitting issues. The results indicate that the SVMs classifier could successfully classify 76.32% of the crashes on the test dataset and 87.52% of the crashes on the overall dataset, which were relatively satisfactory compared with the results of the previous studies. Compared with the SVMs classifier without the data, the SVMs classifier with the web-crawl weather data increased the crash prediction accuracy by 1.32% and decreased the false alarm rate by 1.72%, showing the potential value of the massive web weather data. Mean impact value method was employed to evaluate the variable effects, and the results are identical with the results of most of previous studies. The emerging technique based on the discrete traffic data and web weather data proves to be more applicable on real- time safety management on freeways.展开更多
Purpose:To develop a set of metrics and identify criteria for assessing the functionality of LOD KOS products while providing common guiding principles that can be used by LOD KOS producers and users to maximize the f...Purpose:To develop a set of metrics and identify criteria for assessing the functionality of LOD KOS products while providing common guiding principles that can be used by LOD KOS producers and users to maximize the functions and usages of LOD KOS products.Design/methodology/approach:Data collection and analysis were conducted at three time periods in 2015–16,2017 and 2019.The sample data used in the comprehensive data analysis comprises all datasets tagged as types of KOS in the Datahub and extracted through their respective SPARQL endpoints.A comparative study of the LOD KOS collected from terminology services Linked Open Vocabularies(LOV)and BioPortal was also performed.Findings:The study proposes a set of Functional,Impactful and Transformable(FIT)metrics for LOD KOS as value vocabularies.The FAIR principles,with additional recommendations,are presented for LOD KOS as open data.Research limitations:The metrics need to be further tested and aligned with the best practices and international standards of both open data and various types of KOS.Practical implications:Assessment performed with FAIR and FIT metrics support the creation and delivery of user-friendly,discoverable and interoperable LOD KOS datasets which can be used for innovative applications,act as a knowledge base,become a foundation of semantic analysis and entity extractions and enhance research in science and the humanities.Originality/value:Our research provides best practice guidelines for LOD KOS as value vocabularies.展开更多
The Tibetan Plateau(TP)is undergoing rapid urbanization.To improve urban sustainability and construct eco-logical security barriers,it is essential to quantify the spatial patterns of urbanization level on the TP,but ...The Tibetan Plateau(TP)is undergoing rapid urbanization.To improve urban sustainability and construct eco-logical security barriers,it is essential to quantify the spatial patterns of urbanization level on the TP,but the existing studies on the topic have been limited by the lack of socioeconomic data.This study aims to quantify the urbanization level on the TP in 2018 with Luojia1-01(LJ1-01)high-resolution nighttime light(NTL)data.Specifically,the compounded night light index is used to quantify spatial patterns of urbanization level at mul-tiple scales.The results showed that the TP had a low overall urbanization level with a large internal difference.The urbanization level in the northeast,southeast and south of the TP was relatively high,forming three hotspots centered in Xining City,Lhasa City and Shangri-La City,while the urbanization level in the central and western regions was relatively low.The analysis of influencing factors,based on the random forest model,showed that transportation and topography were the main factors affecting the TP’s spatial patterns of urbanization level.The comparison analysis with socioeconomic statistics and traditional NTL data showed that LJ1-01 NTL data can be used to more effectively quantify the urbanization level since it is more advantageous for reflecting the spatial extent of urban land and describing the spatial structure of socioeconomic activities within urban areas.These advantages are attributed to the high spatial resolution of the data,appropriate imaging time and unaf-fected by saturation phenomena.Thus,the proposed LJ1-01 NTL-based urbanization level measurement method has the potential for wide applications around the world,especially in less-developed regions lacking statistical data.Using this method,we refined the measurement of the TP’s urbanization level in 2018 for multiple scales including the region,basin,prefecture and county levels,which provides basic information for the further urban sustainability research on the TP.展开更多
A field-programmable gate array(FPGA)based high-speed broadband data acquisition system is designed.The system has a dual channel simultaneous acquisition function.The maximum sampling rate is 500 MSa/s and bandwidth ...A field-programmable gate array(FPGA)based high-speed broadband data acquisition system is designed.The system has a dual channel simultaneous acquisition function.The maximum sampling rate is 500 MSa/s and bandwidth is200 MHz,which solves the large bandwidth,high-speed signal acquisition and processing problems.At present,the data acquisition system is successfully used in broadband receiver test systems.展开更多
The dominant source of error in VLBI phase-referencing is the troposphere at observing frequencies above 5 GHz. We compare the tropospheric zenith delays derived from VLBI and GPS data at VLBA stations collocated with...The dominant source of error in VLBI phase-referencing is the troposphere at observing frequencies above 5 GHz. We compare the tropospheric zenith delays derived from VLBI and GPS data at VLBA stations collocated with GPS antennas. The systematic biases and standard deviations both are at the level of sub-centimeter. Based on this agreement, we suggest a new method of tropospheric correction in phase-referencing using combined VLBI and GPS data.展开更多
The quality of the low frequency electromagnetic data is affected by the spike and the trend noises.Failure in removal of the spikes and the trends reduces the credibility of data explanation.Based on the analyses of ...The quality of the low frequency electromagnetic data is affected by the spike and the trend noises.Failure in removal of the spikes and the trends reduces the credibility of data explanation.Based on the analyses of the causes and characteristics of these noises,this paper presents the results of a preset statistics stacking method(PSSM)and a piecewise linear fitting method(PLFM)in de-noising the spikes and trends,respectively.The magnitudes of the spikes are either higher or lower than the normal values,which leads to distortion of the useful signal.Comparisons have been performed in removing of the spikes among the average,the statistics and the PSSM methods,and the results indicate that only the PSSM can remove the spikes successfully.On the other hand,the spectrums of the linear and nonlinear trends mainly lie in the low frequency band and can change the calculated resistivity significantly.No influence of the trends is observed when the frequency is higher than a certain threshold value.The PLSM can remove effectively both the linear and nonlinear trends with errors around 1% in the power spectrum.The proposed methods present an effective way for de-noising the spike and the trend noises in the low frequency electromagnetic data,and establish a research basis for de-noising the low frequency noises.展开更多
基金The National Natural Science Foundation of China under contract No. 41076006the State Ministry of Science and Technology of China under contract No. 2008AA09A402the Ministry of Education’s "111" Project of China under contract No. B07036
文摘The sea-surface height (SSH) signatures of internal tides extracted from the TOPEX/Poseidon (T/P) altimeter data along satellite tracks are fitted with superposition of several plane waves which have different wavenumber vectors. The key problem of plane wave fitting with iterative method is how to determine the initial value of wavenumber of each plane wave. The previous solving method is to analyze the internal tidal SSH signatures along each track with wavenumber spectrum. But it is found that the problem cannot be solved completely with the wavenumber spectrum analysis method only. The method based on the combination of wavenumber spectrum analysis method and the exhaustive method is proposed to determine the initial values of wavenumbers for iteration. Numerical results indicate that the proposed method is not only reasonable and feasible but also better than the previous method. The proposed method is an improvement of the previous one, which is beneficial to improving the precision of plane wave fitting of the T/P internal tidal SSH signatures and deepening the understanding of the internal tides in ocean.
基金support by the Fundamental Research Funds for the Central Universities of China (2013/B15020271)the National Natural Science Foundation of China (1014/515029111)the National Undergraduate Training Program for Innovation and Entrepreneurship (201610294069)
文摘With the development of large-scale spectral surveys, fiber positioning technology has been developing rapidly. Because of the performance advantages of a four-quadrant(4Q) detector, a fiber positioning and real-time monitoring system based on the 4Q detector is proposed. The detection accuracy of this system is directly determined by the precision of the center of the spot. A Gaussian fitting algorithm based on the 4Q detector is studied and applied in the fiber positioning process to improve the calculated accuracy of the spot center. The relationship between the center position of the incident spot and the detector output signal is deduced. An experimental platform is built to complete the simulated experiment. Then we use the Gaussian fitting method to process experimental data, compare the fitting value with the theoretical one and calculate the corresponding error.
基金Supported by National Natural Science Youth Foundation (10401021).
文摘Solving large radial basis function (RBF) interpolation problem with non-customized methods is computationally expensive and the matrices that occur are typically badly conditioned. In order to avoid these difficulties, we present a fitting based on radial basis functions satisfying side conditions by least squares, although compared with interpolation the method loses some accuracy, it reduces the computational cost largely. Since the fitting accuracy and the non-singularity of coefficient matrix in normal equation are relevant to the uniformity of chosen centers of the fitted RBE we present a choice method of uniform centers. Numerical results confirm the fitting efficiency.
文摘Background: There has been a great interest in tracking health-related fitness across the United States. The NFL PLAY 60 FITNESSGRAM Partnership Project (NFL P60FGPP) is a large participatory research network that involves the surveillance of fitness among more than 1000 schools spread throughout the country. Fitness data are collected by school staff and therefore these data can vary in quality and representativeness. Therefore, careful screening procedures are needed to ensure that the data can reflect actual patterns in the schools. This study examined the impact of different data screening procedures on outcomes of aerobic fitness (AF) collected from the NFL P60FGPP. Methods: Data were compiled from 149,101 youth from 504 schools and were processed using the established age- and gender-specific AF FITNESSGRAM health-related standards. Data were subjected to three different screening procedures (based on grade size and boy-to-girl ratio per grade). Linear models were computed to obtain unadjusted and adjusted (for age, BMI-Z, and socio-economic status) estimates of % youth in the Healthy Fitness Zone (HFZ) in order to determine if, 1) there were differences in % in the HFZ and 2) if differences could be explained by changes in the representativeness of the sample due to the different data screening procedures. Results: Depending on the screening procedure used, the final sample ranged from 96,999 (no screening) to 46,572 youth (most stringent criteria). The proportion of youth achieving appropriate levels of AF ranged from 56% to 61% with unscreened data resulting in consistently lower percentages of youth achieving the standard (P < 0.05). Overall, these differences were not explained by possible changes in demographic characteristics as the result of applying different screening criteria. Conclusions: The findings demonstrate the importance of establishing appropriate screening procedures that maximize sample size while also ensuring generalizability of the findings.
基金supported by the Program of Introducing Talents of Disciplines to Universities of the Ministry of Education and State Administration of the Foreign Experts Affairs of China (the 111 Project, Grant No.B08048)the Special Basic Research Fund for Methodology in Hydrology of the Ministry of Sciences and Technology of China (Grant No. 2011IM011000)
文摘The question of how to choose a copula model that best fits a given dataset is a predominant limitation of the copula approach, and the present study aims to investigate the techniques of goodness-of-fit tests for multi-dimensional copulas. A goodness-of-fit test based on Rosenblatt's transformation was mathematically expanded from two dimensions to three dimensions and procedures of a bootstrap version of the test were provided. Through stochastic copula simulation, an empirical application of historical drought data at the Lintong Gauge Station shows that the goodness-of-fit tests perform well, revealing that both trivariate Gaussian and Student t copulas are acceptable for modeling the dependence structures of the observed drought duration, severity, and peak. The goodness-of-fit tests for multi-dimensional copulas can provide further support and help a lot in the potential applications of a wider range of copulas to describe the associations of correlated hydrological variables. However, for the application of copulas with the number of dimensions larger than three, more complicated computational efforts as well as exploration and parameterization of corresponding copulas are required.
文摘Reverse engineering in the manufacturing field is a process in which the digitized data are obtained from an existing object model or a part of it, and then the CAD model is reconstructed. This paper presents an RBF neural network approach to modify and fit the digitized data. The centers for the RBF are selected by using the orthogonal least squares learning algorithm. A mathematically known surface is used for generating a number of samples for training the networks. The trained networks then generated a number of new points which were compared with the calculating points from the equations. Moreover, a series of practice digitizing curves are used to test the approach. The results showed that this approach is effective in modifying and fitting digitized data and generating data points to reconstruct the surface model.
文摘By using the method of least square linear fitting to analyze data do not exist errors under certain conditions, in order to make the linear data fitting method that can more accurately solve the relationship expression between the volume and quantity in scientific experiments and engineering practice, this article analyzed data error by commonly linear data fitting method, and proposed improved process of the least distance squ^re method based on least squares method. Finally, the paper discussed the advantages and disadvantages through the example analysis of two kinds of linear data fitting method, and given reasonable control conditions for its application.
文摘我国新一代中国频谱射电日像仪(Chinese Spectral Radio Heliograph,CSRH)原始观测数据采用自定义格式,在进行后续处理与共享使用时必须转换相应的格式。在分析FITS-IDI(FITS Interferometry Data Interchange)格式的基础上,结合CSRH的实际观测模式与数据产出方式,定义与设计了符合项目情况的FITS-IDI格式及字段,并对FITS-IDI文件中若干字段的值如何获取、计算进行了深入讨论。根据定义生成的FITS-IDI文件已成功导入CASA软件,并可以进行后续处理。经过对CASA测量集文件的核实,证明了数据生成的正确性。本研究有效地推进了CSRH的建设工作,也对其他射电干涉阵数据存储有一定的参考价值。
基金Supported by the National Natural Science Foundation of China( 1 9971 0 1 7,1 0 1 2 5 1 0 2 )
文摘Based on the definition of MQ-B-Splines,this article constructs five types of univariate quasi-interpolants to non-uniformly distributed data. The error estimates and the shape-preserving properties are shown in details.And examples are shown to demonstrate the capacity of the quasi-interpolants for curve representation.
基金Supported by the National Natural Science Foundation of China(No.51379006 and No.51009106)the Program for New Century Excellent Talents in University of Ministry of Education of China(No.NCET-12-0404)the National Basic Research Program of China("973"Program,No.2013CB035903)
文摘Due to the complex nature of multi-source geological data, it is difficult to rebuild every geological structure through a single 3D modeling method. The multi-source data interpretation method put forward in this analysis is based on a database-driven pattern and focuses on the discrete and irregular features of geological data. The geological data from a variety of sources covering a range of accuracy, resolution, quantity and quality are classified and integrated according to their reliability and consistency for 3D modeling. The new interpolation-approximation fitting construction algorithm of geological surfaces with the non-uniform rational B-spline(NURBS) technique is then presented. The NURBS technique can retain the balance among the requirements for accuracy, surface continuity and data storage of geological structures. Finally, four alternative 3D modeling approaches are demonstrated with reference to some examples, which are selected according to the data quantity and accuracy specification. The proposed approaches offer flexible modeling patterns for different practical engineering demands.
文摘A successful mechanical property data-driven prediction model is the core of the optimal design of hot rolling process for hot-rolled strips. However, the original industrial data, usually unbalanced, are inevitably mixed with fluctuant and abnormal values. Models established on the basis of the data without data processing can cause misleading results, which cannot be used for the optimal design of hot rolling process. Thus, a method of industrial data processing of C-Mn steel was proposed based on the data analysis. The Bayesian neural network was employed to establish the reliable mechanical property prediction models for the optimal design of hot rolling process. By using the multi-objective optimization algorithm and considering the individual requirements of costumers and the constraints of the equipment, the optimal design of hot rolling process was successfully applied to the rolling process design for Q345B steel with 0.017% Nb and 0.046% Ti content removed. The optimal process design results were in good agreement with the industrial trials results, which verify the effectiveness of the optimal design of hot rolling process.
基金supported by the National Natural Science Foundation (71301119)the Shanghai Natural Science Foundation (12ZR1434100)
文摘Recent advances in intelligent transportation system allow traffic safety studies to extend from historic data-based analyses to real-time applications. The study presents a new method to predict crash likelihood with traffic data collected by discrete loop detectors as well as the web-crawl weather data. Matched case-control method and support vector machines (SVMs) technique were employed to identify the risk status. The adaptive synthetic over-sampling technique was applied to solve the imbalanced dataset issues. Random forest technique was applied to select the contributing factors and avoid the over-fitting issues. The results indicate that the SVMs classifier could successfully classify 76.32% of the crashes on the test dataset and 87.52% of the crashes on the overall dataset, which were relatively satisfactory compared with the results of the previous studies. Compared with the SVMs classifier without the data, the SVMs classifier with the web-crawl weather data increased the crash prediction accuracy by 1.32% and decreased the false alarm rate by 1.72%, showing the potential value of the massive web weather data. Mean impact value method was employed to evaluate the variable effects, and the results are identical with the results of most of previous studies. The emerging technique based on the discrete traffic data and web weather data proves to be more applicable on real- time safety management on freeways.
基金College of Communication and Information(CCI)Research and Creative Activity Fund,Kent State University
文摘Purpose:To develop a set of metrics and identify criteria for assessing the functionality of LOD KOS products while providing common guiding principles that can be used by LOD KOS producers and users to maximize the functions and usages of LOD KOS products.Design/methodology/approach:Data collection and analysis were conducted at three time periods in 2015–16,2017 and 2019.The sample data used in the comprehensive data analysis comprises all datasets tagged as types of KOS in the Datahub and extracted through their respective SPARQL endpoints.A comparative study of the LOD KOS collected from terminology services Linked Open Vocabularies(LOV)and BioPortal was also performed.Findings:The study proposes a set of Functional,Impactful and Transformable(FIT)metrics for LOD KOS as value vocabularies.The FAIR principles,with additional recommendations,are presented for LOD KOS as open data.Research limitations:The metrics need to be further tested and aligned with the best practices and international standards of both open data and various types of KOS.Practical implications:Assessment performed with FAIR and FIT metrics support the creation and delivery of user-friendly,discoverable and interoperable LOD KOS datasets which can be used for innovative applications,act as a knowledge base,become a foundation of semantic analysis and entity extractions and enhance research in science and the humanities.Originality/value:Our research provides best practice guidelines for LOD KOS as value vocabularies.
基金the Second Tibetan Plateau Scientific Expedition and Research Program(Grant No.2019QZKK0405)the National Natural Science Foundation of China(Grant No.41871185&41971270)。
文摘The Tibetan Plateau(TP)is undergoing rapid urbanization.To improve urban sustainability and construct eco-logical security barriers,it is essential to quantify the spatial patterns of urbanization level on the TP,but the existing studies on the topic have been limited by the lack of socioeconomic data.This study aims to quantify the urbanization level on the TP in 2018 with Luojia1-01(LJ1-01)high-resolution nighttime light(NTL)data.Specifically,the compounded night light index is used to quantify spatial patterns of urbanization level at mul-tiple scales.The results showed that the TP had a low overall urbanization level with a large internal difference.The urbanization level in the northeast,southeast and south of the TP was relatively high,forming three hotspots centered in Xining City,Lhasa City and Shangri-La City,while the urbanization level in the central and western regions was relatively low.The analysis of influencing factors,based on the random forest model,showed that transportation and topography were the main factors affecting the TP’s spatial patterns of urbanization level.The comparison analysis with socioeconomic statistics and traditional NTL data showed that LJ1-01 NTL data can be used to more effectively quantify the urbanization level since it is more advantageous for reflecting the spatial extent of urban land and describing the spatial structure of socioeconomic activities within urban areas.These advantages are attributed to the high spatial resolution of the data,appropriate imaging time and unaf-fected by saturation phenomena.Thus,the proposed LJ1-01 NTL-based urbanization level measurement method has the potential for wide applications around the world,especially in less-developed regions lacking statistical data.Using this method,we refined the measurement of the TP’s urbanization level in 2018 for multiple scales including the region,basin,prefecture and county levels,which provides basic information for the further urban sustainability research on the TP.
文摘A field-programmable gate array(FPGA)based high-speed broadband data acquisition system is designed.The system has a dual channel simultaneous acquisition function.The maximum sampling rate is 500 MSa/s and bandwidth is200 MHz,which solves the large bandwidth,high-speed signal acquisition and processing problems.At present,the data acquisition system is successfully used in broadband receiver test systems.
基金Supported by the National Natural Science Foundation of China.
文摘The dominant source of error in VLBI phase-referencing is the troposphere at observing frequencies above 5 GHz. We compare the tropospheric zenith delays derived from VLBI and GPS data at VLBA stations collocated with GPS antennas. The systematic biases and standard deviations both are at the level of sub-centimeter. Based on this agreement, we suggest a new method of tropospheric correction in phase-referencing using combined VLBI and GPS data.
文摘The quality of the low frequency electromagnetic data is affected by the spike and the trend noises.Failure in removal of the spikes and the trends reduces the credibility of data explanation.Based on the analyses of the causes and characteristics of these noises,this paper presents the results of a preset statistics stacking method(PSSM)and a piecewise linear fitting method(PLFM)in de-noising the spikes and trends,respectively.The magnitudes of the spikes are either higher or lower than the normal values,which leads to distortion of the useful signal.Comparisons have been performed in removing of the spikes among the average,the statistics and the PSSM methods,and the results indicate that only the PSSM can remove the spikes successfully.On the other hand,the spectrums of the linear and nonlinear trends mainly lie in the low frequency band and can change the calculated resistivity significantly.No influence of the trends is observed when the frequency is higher than a certain threshold value.The PLSM can remove effectively both the linear and nonlinear trends with errors around 1% in the power spectrum.The proposed methods present an effective way for de-noising the spike and the trend noises in the low frequency electromagnetic data,and establish a research basis for de-noising the low frequency noises.