As a less time-consuming procedure, subsampling technology has been widely used in biological monitoring and assessment programs. It is clear that subsampling counts af fect the value of traditional biodiversity indic...As a less time-consuming procedure, subsampling technology has been widely used in biological monitoring and assessment programs. It is clear that subsampling counts af fect the value of traditional biodiversity indices, but its ef fect on taxonomic distinctness(TD) indices is less well studied. Here, we examined the responses of traditional(species richness, Shannon-Wiener diversity) and TD(average taxonomic distinctness: Δ +, and variation in taxonomic distinctness: Λ +) indices to subsample counts using a random subsampling procedure from 50 to 400 individuals, based on macroinvertebrate datasets from three dif ferent river systems in China. At regional scale, taxa richness asymptotically increased with ?xed-count size; ≥250–300 individuals to express 95% information of the raw data. In contrast, TD indices were less sensitive to the subsampling procedure. At local scale, TD indices were more stable and had less deviation than species richness and Shannon-Wiener index, even at low subsample counts, with ≥100 individuals needed to estimate 95% of the information of the actual Δ + and Λ + in the three river basins. We also found that abundance had a certain ef fect on diversity indices during the subsampling procedure, with dif ferent subsampling counts for species richness and TD indices varying by regions. Therefore, we suggest that TD indices are suitable for biodiversity assessment and environment monitoring. Meanwhile, pilot analyses are necessary when to determine the appropriate subsample counts for bioassessment in a new region or habitat type.展开更多
We propose a subsampling method for robust estimation of regression models which is built on classical methods such as the least squares method. It makes use of the non-robust nature of the underlying classical method...We propose a subsampling method for robust estimation of regression models which is built on classical methods such as the least squares method. It makes use of the non-robust nature of the underlying classical method to find a good sample from regression data contaminated with outliers, and then applies the classical method to the good sample to produce robust estimates of the regression model parameters. The subsampling method is a computational method rooted in the bootstrap methodology which trades analytical treatment for intensive computation;it finds the good sample through repeated fitting of the regression model to many random subsamples of the contaminated data instead of through an analytical treatment of the outliers. The subsampling method can be applied to all regression models for which non-robust classical methods are available. In the present paper, we focus on the basic formulation and robustness property of the subsampling method that are valid for all regression models. We also discuss variations of the method and apply it to three examples involving three different regression models.展开更多
Conventional full-waveform inversion is computationally intensive because it considers all shots in each iteration. To tackle this, we establish the number of shots needed and propose multiscale inversion in the frequ...Conventional full-waveform inversion is computationally intensive because it considers all shots in each iteration. To tackle this, we establish the number of shots needed and propose multiscale inversion in the frequency domain while using only the shots that are positively correlated with frequency. When using low-frequency data, the method considers only a small number of shots and raw data. More shots are used with increasing frequency. The random-in-group subsampling method is used to rotate the shots between iterations and avoid the loss of shot information. By reducing the number of shots in the inversion, we decrease the computational cost. There is no crosstalk between shots, no noise addition, and no observational limits. Numerical modeling suggests that the proposed method reduces the computing time, is more robust to noise, and produces better velocity models when using data with noise.展开更多
A new faster block-matching algorithm (BMA) by using both search candidate and pixd sulzsamplings is proposed. Firstly a pixd-subsampling approach used in adjustable partial distortion search (APDS) is adjusted to...A new faster block-matching algorithm (BMA) by using both search candidate and pixd sulzsamplings is proposed. Firstly a pixd-subsampling approach used in adjustable partial distortion search (APDS) is adjusted to visit about half points of all search candidates by subsampling them, using a spiral-scanning path with one skip. Two sdected candidates that have minimal and second minimal block distortion measures are obtained. Then a fine-tune step is taken around them to find the best one. Some analyses are given to approve the rationality of the approach of this paper. Experimental results show that, as compared to APDS, the proposed algorithm can enhance the block-matching speed by about 30% while maintaining its MSE performance very close to that of it. And it performs much better than many other BMAs such as TSS, NTSS, UCDBS and NPDS.展开更多
The IPCC has drawn attention to an apparent leveling-off of globally-averaged temperatures over the past 15 years or so. Measuring the duration of the hiatus has implications for determining if the underlying trend ha...The IPCC has drawn attention to an apparent leveling-off of globally-averaged temperatures over the past 15 years or so. Measuring the duration of the hiatus has implications for determining if the underlying trend has changed, and for evaluating climate models. Here, I propose a method for estimating the duration of the hiatus that is robust to unknown forms of heteroskedasticity and autocorrelation (HAC) in the temperature series and to cherry-picking of endpoints. For the specific case of global average temperatures I also add the requirement of spatial consistency between hemispheres. The method makes use of the Vogelsang-Franses (2005) HAC-robust trend variance estimator which is valid as long as the underlying series is trend stationary, which is the case for the data used herein. Application of the method shows that there is now a trendless interval of 19 years duration at the end of the HadCRUT4 surface temperature series, and of 16 - 26 years in the lower troposphere. Use of a simple AR1 trend model suggests a shorter hiatus of 14 - 20 years but is likely unreliable.展开更多
Contraposing the need of the robust digital watermark for the copyright protection field, a new digital watermarking algorithm in the non-subsampled contourlet transform (NSCT) domain is proposed. The largest energy...Contraposing the need of the robust digital watermark for the copyright protection field, a new digital watermarking algorithm in the non-subsampled contourlet transform (NSCT) domain is proposed. The largest energy sub-band after NSCT is selected to embed watermark. The watermark is embedded into scaleinvariant feature transform (SIFT) regions. During embedding, the initial region is divided into some cirque sub-regions with the same area, and each watermark bit is embedded into one sub-region. Extensive simulation results and comparisons show that the algorithm gets a good trade-off of invisibility, robustness and capacity, thus obtaining good quality of the image while being able to effectively resist common image processing, and geometric and combo attacks, and normalized similarity is almost all reached.展开更多
This paper presents a new class of test procedures for two-sample location problem based on subsample quantiles. The class includes Mann-Whitney test as a special case. The asymptotic normality of the class of tests p...This paper presents a new class of test procedures for two-sample location problem based on subsample quantiles. The class includes Mann-Whitney test as a special case. The asymptotic normality of the class of tests proposed is established. The asymptotic relative performance of the proposed class of test with respect to the optimal member of Xie and Priebe (2000) is studied in terms of Pitman efficiency for various underlying distributions.展开更多
When the observed price process is the true underlying price process plus microstructure noise, it is known that realized volatility (RV) estimates will be overwhelmed by the noise when the sampling frequency approach...When the observed price process is the true underlying price process plus microstructure noise, it is known that realized volatility (RV) estimates will be overwhelmed by the noise when the sampling frequency approaches infinity. Therefore, it may be optimal to sample less frequently, and averaging the less frequently sampled subsamples can improve estimation for quadratic variation. In this paper, we extend this idea to forecasting daily realized volatility. While subsample averaging has been proposed and used in estimating RV, this paper is the first that uses subsample averaging for forecasting RV. The subsample averaging method we examine incorporates the high frequency data in different levels of systematic sampling. It first pools the high frequency data into several subsamples, then generates forecasts from each subsample, and then combines these forecasts. We find that in daily S&P 500 return realized volatility forecasts, subsample averaging generates better forecasts than those using only one subsample.展开更多
Multimodal medical image fusion is a powerful tool for diagnosing diseases in medical field. The main objective is to capture the relevant information from input images into a single output image, which plays an impor...Multimodal medical image fusion is a powerful tool for diagnosing diseases in medical field. The main objective is to capture the relevant information from input images into a single output image, which plays an important role in clinical applications. In this paper, an image fusion technique for the fusion of multimodal medical images is proposed based on Non-Subsampled Contourlet Transform. The proposed technique uses the Non-Subsampled Contourlet Transform (NSCT) to decompose the images into lowpass and highpass subbands. The lowpass and highpass subbands are fused by using mean based and variance based fusion rules. The reconstructed image is obtained by taking Inverse Non-Subsampled Contourlet Transform (INSCT) on fused subbands. The experimental results on six pairs of medical images are compared in terms of entropy, mean, standard deviation, Q<sup>AB/F</sup> as performance parameters. It reveals that the proposed image fusion technique outperforms the existing image fusion techniques in terms of quantitative and qualitative outcomes of the images. The percentage improvement in entropy is 0% - 40%, mean is 3% - 42%, standard deviation is 1% - 42%, Q<sup>AB/F</sup>is 0.4% - 48% in proposed method comparing to conventional methods for six pairs of medical images.展开更多
基金Supported by the National Natural Science Foundation of China(Nos.31400469,41571495,31770460)the National Science and Technology Basic Research Program(No.2015FY110400-4)+2 种基金the China Three Gorges Corporation Research Project(No.JGJ/0272015)the Key Program of the Chinese Academy of Sciences(Comprehensive Assessment Technology of River Ecology and Environment for the Water Source Region of "South-toNorth Water Diversion Central Route")the Program for Biodiversity Protection(No.2017HB2096001006)
文摘As a less time-consuming procedure, subsampling technology has been widely used in biological monitoring and assessment programs. It is clear that subsampling counts af fect the value of traditional biodiversity indices, but its ef fect on taxonomic distinctness(TD) indices is less well studied. Here, we examined the responses of traditional(species richness, Shannon-Wiener diversity) and TD(average taxonomic distinctness: Δ +, and variation in taxonomic distinctness: Λ +) indices to subsample counts using a random subsampling procedure from 50 to 400 individuals, based on macroinvertebrate datasets from three dif ferent river systems in China. At regional scale, taxa richness asymptotically increased with ?xed-count size; ≥250–300 individuals to express 95% information of the raw data. In contrast, TD indices were less sensitive to the subsampling procedure. At local scale, TD indices were more stable and had less deviation than species richness and Shannon-Wiener index, even at low subsample counts, with ≥100 individuals needed to estimate 95% of the information of the actual Δ + and Λ + in the three river basins. We also found that abundance had a certain ef fect on diversity indices during the subsampling procedure, with dif ferent subsampling counts for species richness and TD indices varying by regions. Therefore, we suggest that TD indices are suitable for biodiversity assessment and environment monitoring. Meanwhile, pilot analyses are necessary when to determine the appropriate subsample counts for bioassessment in a new region or habitat type.
文摘We propose a subsampling method for robust estimation of regression models which is built on classical methods such as the least squares method. It makes use of the non-robust nature of the underlying classical method to find a good sample from regression data contaminated with outliers, and then applies the classical method to the good sample to produce robust estimates of the regression model parameters. The subsampling method is a computational method rooted in the bootstrap methodology which trades analytical treatment for intensive computation;it finds the good sample through repeated fitting of the regression model to many random subsamples of the contaminated data instead of through an analytical treatment of the outliers. The subsampling method can be applied to all regression models for which non-robust classical methods are available. In the present paper, we focus on the basic formulation and robustness property of the subsampling method that are valid for all regression models. We also discuss variations of the method and apply it to three examples involving three different regression models.
基金financially supported by the Fundamental Research Funds for the Central Universities(No.201822011)the National Natural Science Foundation of China(No.41674118)the National Science and Technology Major Project(No.2016ZX05027002)
文摘Conventional full-waveform inversion is computationally intensive because it considers all shots in each iteration. To tackle this, we establish the number of shots needed and propose multiscale inversion in the frequency domain while using only the shots that are positively correlated with frequency. When using low-frequency data, the method considers only a small number of shots and raw data. More shots are used with increasing frequency. The random-in-group subsampling method is used to rotate the shots between iterations and avoid the loss of shot information. By reducing the number of shots in the inversion, we decrease the computational cost. There is no crosstalk between shots, no noise addition, and no observational limits. Numerical modeling suggests that the proposed method reduces the computing time, is more robust to noise, and produces better velocity models when using data with noise.
基金This project was supported by the National Natural Science Foundation of China (60272099) .
文摘A new faster block-matching algorithm (BMA) by using both search candidate and pixd sulzsamplings is proposed. Firstly a pixd-subsampling approach used in adjustable partial distortion search (APDS) is adjusted to visit about half points of all search candidates by subsampling them, using a spiral-scanning path with one skip. Two sdected candidates that have minimal and second minimal block distortion measures are obtained. Then a fine-tune step is taken around them to find the best one. Some analyses are given to approve the rationality of the approach of this paper. Experimental results show that, as compared to APDS, the proposed algorithm can enhance the block-matching speed by about 30% while maintaining its MSE performance very close to that of it. And it performs much better than many other BMAs such as TSS, NTSS, UCDBS and NPDS.
基金Acknowledgment This work was supported by Beijing Natural Science Foundation Funded Project (No.4110001), National S&T Major Project (No. 2011ZX03003-002), Tsinghua Independent Research (No. 2010TH203-02) and Samsung Company.
文摘The IPCC has drawn attention to an apparent leveling-off of globally-averaged temperatures over the past 15 years or so. Measuring the duration of the hiatus has implications for determining if the underlying trend has changed, and for evaluating climate models. Here, I propose a method for estimating the duration of the hiatus that is robust to unknown forms of heteroskedasticity and autocorrelation (HAC) in the temperature series and to cherry-picking of endpoints. For the specific case of global average temperatures I also add the requirement of spatial consistency between hemispheres. The method makes use of the Vogelsang-Franses (2005) HAC-robust trend variance estimator which is valid as long as the underlying series is trend stationary, which is the case for the data used herein. Application of the method shows that there is now a trendless interval of 19 years duration at the end of the HadCRUT4 surface temperature series, and of 16 - 26 years in the lower troposphere. Use of a simple AR1 trend model suggests a shorter hiatus of 14 - 20 years but is likely unreliable.
基金supported by the National Natural Science Foundation of China(61379010)the Natural Science Basic Research Plan in Shaanxi Province of China(2015JM6293)
文摘Contraposing the need of the robust digital watermark for the copyright protection field, a new digital watermarking algorithm in the non-subsampled contourlet transform (NSCT) domain is proposed. The largest energy sub-band after NSCT is selected to embed watermark. The watermark is embedded into scaleinvariant feature transform (SIFT) regions. During embedding, the initial region is divided into some cirque sub-regions with the same area, and each watermark bit is embedded into one sub-region. Extensive simulation results and comparisons show that the algorithm gets a good trade-off of invisibility, robustness and capacity, thus obtaining good quality of the image while being able to effectively resist common image processing, and geometric and combo attacks, and normalized similarity is almost all reached.
文摘This paper presents a new class of test procedures for two-sample location problem based on subsample quantiles. The class includes Mann-Whitney test as a special case. The asymptotic normality of the class of tests proposed is established. The asymptotic relative performance of the proposed class of test with respect to the optimal member of Xie and Priebe (2000) is studied in terms of Pitman efficiency for various underlying distributions.
文摘When the observed price process is the true underlying price process plus microstructure noise, it is known that realized volatility (RV) estimates will be overwhelmed by the noise when the sampling frequency approaches infinity. Therefore, it may be optimal to sample less frequently, and averaging the less frequently sampled subsamples can improve estimation for quadratic variation. In this paper, we extend this idea to forecasting daily realized volatility. While subsample averaging has been proposed and used in estimating RV, this paper is the first that uses subsample averaging for forecasting RV. The subsample averaging method we examine incorporates the high frequency data in different levels of systematic sampling. It first pools the high frequency data into several subsamples, then generates forecasts from each subsample, and then combines these forecasts. We find that in daily S&P 500 return realized volatility forecasts, subsample averaging generates better forecasts than those using only one subsample.
文摘Multimodal medical image fusion is a powerful tool for diagnosing diseases in medical field. The main objective is to capture the relevant information from input images into a single output image, which plays an important role in clinical applications. In this paper, an image fusion technique for the fusion of multimodal medical images is proposed based on Non-Subsampled Contourlet Transform. The proposed technique uses the Non-Subsampled Contourlet Transform (NSCT) to decompose the images into lowpass and highpass subbands. The lowpass and highpass subbands are fused by using mean based and variance based fusion rules. The reconstructed image is obtained by taking Inverse Non-Subsampled Contourlet Transform (INSCT) on fused subbands. The experimental results on six pairs of medical images are compared in terms of entropy, mean, standard deviation, Q<sup>AB/F</sup> as performance parameters. It reveals that the proposed image fusion technique outperforms the existing image fusion techniques in terms of quantitative and qualitative outcomes of the images. The percentage improvement in entropy is 0% - 40%, mean is 3% - 42%, standard deviation is 1% - 42%, Q<sup>AB/F</sup>is 0.4% - 48% in proposed method comparing to conventional methods for six pairs of medical images.