Funding: Funded by the National Science and Technology Council, Taiwan, under grant numbers 110-2401-H-002-094-MY2 and 112-2221-E-130-001.
Abstract: As data analysis often incurs significant communication and computational costs, these tasks are increasingly outsourced to cloud computing platforms. However, this introduces privacy concerns, as sensitive data must be transmitted to and processed by untrusted parties. To address this, fully homomorphic encryption (FHE) has emerged as a promising solution for privacy-preserving Machine-Learning-as-a-Service (MLaaS), enabling computation on encrypted data without revealing the plaintext. Nevertheless, FHE remains computationally expensive. As a result, approximate homomorphic encryption (AHE) schemes, such as CKKS, have attracted attention due to their efficiency. In our previous work, we proposed RP-OKC, a CKKS-based clustering scheme implemented via TenSEAL. However, errors inherent to CKKS operations (termed CKKS-errors) can affect the accuracy of the result after decryption. Since these errors can be mitigated through post-decryption rounding, we propose a data pre-scaling technique to increase the number of significant digits and reduce CKKS-errors. Furthermore, we introduce an Operation-Error-Estimation (OEE) table that quantifies upper-bound error estimates for various CKKS operations. This table enables error-aware decryption correction, ensuring alignment between encrypted and plaintext results. We validate our method on K-means clustering using the Kaggle Customer Segmentation dataset. Experimental results confirm that the proposed scheme enhances the accuracy and reliability of privacy-preserving data analysis in cloud environments.
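The interplay of pre-scaling and post-decryption rounding mentioned in this abstract can be illustrated with a small simulation. This is only a sketch under stated assumptions: it does not call TenSEAL or any CKKS library, the function `noisy` is a hypothetical stand-in that adds a bounded error to each value, and the bound `1e-3` is an invented placeholder for an entry of an error table, not a figure from the paper.

```python
import random

def noisy(value, rng, bound=1e-3):
    """Hypothetical stand-in for a CKKS encrypt/decrypt round trip:
    returns the value plus a small bounded error (an assumption)."""
    return value + rng.uniform(-bound, bound)

def prescaled_sum(values, decimals, rng):
    """Scale inputs to integers, add them under simulated noise,
    then round after 'decryption' and undo the scaling."""
    scale = 10 ** decimals
    scaled = [round(v * scale) for v in values]   # integers before "encryption"
    total = sum(noisy(s, rng) for s in scaled)    # each term carries a small error
    return round(total) / scale                   # rounding removes error below 0.5

rng = random.Random(1)
values = [12.34, 5.67, 8.9, 0.01]
result = prescaled_sum(values, 2, rng)
```

As long as the accumulated error stays below 0.5 in the scaled domain, rounding recovers the exact integer sum, which is why an upper bound on the per-operation error is useful when choosing the scale.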
Abstract: Approximate Bayesian Computation (ABC) is a popular sampling method in applications involving intractable likelihood functions. Instead of evaluating the likelihood function, ABC approximates the posterior distribution by a set of accepted samples simulated from a generating model. Simulated samples are accepted if the distance between the samples and the observation is smaller than some threshold. The distance is calculated in terms of summary statistics. This paper proposes Local Gradient Kernel Dimension Reduction (LGKDR) to construct low-dimensional summary statistics for ABC. The proposed method identifies a sufficient subspace of the original summary statistics by implicitly considering all non-linear transforms therein, and a weighting kernel is used for the concentration of the projections. No strong assumptions are made on the marginal distributions or the regression models, permitting usage in a wide range of applications. Experiments are done with simple rejection ABC and sequential Monte Carlo ABC methods. Results are reported as competitive in the former and substantially better in the latter cases, in which Monte Carlo errors are compressed as much as possible.
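The accept/reject rule described in this abstract can be sketched as plain rejection ABC. The example below is a minimal illustration, not the LGKDR method: the Gaussian model, the uniform prior, the sample mean as the summary statistic, and the threshold `eps=0.1` are all assumptions chosen for the demonstration.

```python
import random
import statistics

def rejection_abc(observed, prior_sample, simulate, summary, eps, n_draws, rng):
    """Keep prior draws whose simulated summary lies within eps of the observed one."""
    s_obs = summary(observed)
    accepted = []
    for _ in range(n_draws):
        theta = prior_sample(rng)                 # draw a parameter from the prior
        s_sim = summary(simulate(theta, rng))     # simulate data, reduce to a summary
        if abs(s_sim - s_obs) < eps:              # distance on summary statistics
            accepted.append(theta)
    return accepted

rng = random.Random(0)
observed = [rng.gauss(2.0, 1.0) for _ in range(50)]   # synthetic observation
posterior = rejection_abc(
    observed,
    prior_sample=lambda r: r.uniform(-5.0, 5.0),
    simulate=lambda mu, r: [r.gauss(mu, 1.0) for _ in range(50)],
    summary=statistics.fmean,
    eps=0.1,
    n_draws=5000,
    rng=rng,
)
```

The accepted draws in `posterior` approximate the posterior of the mean. With higher-dimensional summaries the acceptance rate drops quickly, which is the motivation for constructing low-dimensional summary statistics.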
Funding: Project supported by the National High Technology Research and Development Program of China (863 Program) (2015AA050203) and the National Talents Training Base for Basic Research and Teaching of Natural Science of China (J1103105).
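Abstract: To characterize the sparse, irregular daily electricity consumption data of a large number of power users and to classify those users, this paper proposes a functional data clustering method. First, a kernel method is applied to reconstruct continuous curves from the discrete consumption data; then, inspired by the Sobolev space distance, a new functional distance is defined and used in the k-means algorithm for clustering. Experiments on real consumption data from 10,000 households in a city over 538 days yield cluster prototypes under different distances and numbers of clusters. The results show that, because the selected users are mainly urban residents, their consumption patterns are fairly similar: the major peak falls mainly in June to September, the minor peak mainly in January to February, and daily consumption fluctuates little. The main difference between user classes lies in the range of consumption: low-consumption users are below 13 kW·h/day overall, whereas high-consumption users approach 100 kW·h/day.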
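The two steps named in this abstract, kernel smoothing of irregular readings and a Sobolev-inspired distance between the resulting curves, might look roughly like the sketch below. The abstract does not give the exact formulas, so everything here is an assumption: a Nadaraya-Watson smoother with a Gaussian kernel, and a discrete H^1-style distance that adds a weighted slope term to the usual L2 gap. The sample readings are invented.

```python
import math

def nw_smooth(ts, ys, grid, h):
    """Nadaraya-Watson estimate on `grid` with a Gaussian kernel of bandwidth h."""
    out = []
    for t in grid:
        w = [math.exp(-0.5 * ((t - ti) / h) ** 2) for ti in ts]
        out.append(sum(wi * yi for wi, yi in zip(w, ys)) / sum(w))
    return out

def sobolev_distance(f, g, dt, lam=1.0):
    """Discrete H^1-style distance: L2 gap of values plus lam times L2 gap of slopes."""
    diff = [a - b for a, b in zip(f, g)]
    slope = [(diff[i + 1] - diff[i]) / dt for i in range(len(diff) - 1)]
    l2 = sum(d * d for d in diff) * dt
    h1 = sum(s * s for s in slope) * dt
    return math.sqrt(l2 + lam * h1)

# Invented, irregularly spaced readings for two users; user_b is user_a shifted by 1.
days = [0.0, 1.5, 2.0, 4.5, 6.0, 7.5, 10.0]
user_a = [3.0, 3.4, 3.1, 5.0, 5.6, 4.2, 3.3]
user_b = [y + 1.0 for y in user_a]
grid = [0.5 * i for i in range(21)]
curve_a = nw_smooth(days, user_a, grid, h=1.0)
curve_b = nw_smooth(days, user_b, grid, h=1.0)
distance = sobolev_distance(curve_a, curve_b, dt=0.5)
```

Because the two curves differ only by a constant offset, the slope term vanishes and the distance reduces to the L2 part. Two curves with the same level but different shapes would instead be separated mainly by the slope term, which is the appeal of a Sobolev-type distance for usage patterns.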
Abstract: Because the kernel K-means clustering method runs in an implicit feature space, the initial and iterative cluster centers cannot be defined explicitly. To address the shortcoming of existing methods, which select the initial cluster centers arbitrarily in the original space, this paper proposes a new method for determining the cluster centers: virtual cluster centers defined in the feature space by the original classification serve as the initial cluster centers, and the iterative cluster centers are determined by further virtual classification. The improved method is applied to fault diagnosis of roller bearings and achieves good clustering and diagnosis results, which demonstrates its effectiveness.
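The point that cluster centers exist only implicitly in the feature space can be made concrete with textbook kernel K-means, where the squared distance from a point to a center is expanded entirely in kernel values: K_ii - (2/|C|) Σ_j K_ij + (1/|C|²) Σ_{j,l} K_jl, with j and l ranging over the members of cluster C. The sketch below is that standard algorithm started from an initial labelling, not the paper's specific procedure; the RBF kernel, `gamma`, and the toy points are assumptions.

```python
import math

def rbf(x, y, gamma=1.0):
    """Gaussian (RBF) kernel between two points."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(x, y)))

def kernel_kmeans(points, labels, k, gamma=1.0, max_iter=50):
    """Kernel K-means: centers are never formed, only kernel averages over members."""
    n = len(points)
    K = [[rbf(p, q, gamma) for q in points] for p in points]
    labels = list(labels)
    for _ in range(max_iter):
        clusters = [[i for i in range(n) if labels[i] == c] for c in range(k)]
        # Squared norm of each implicit center (depends only on the cluster).
        within = [
            sum(K[j][l] for j in m for l in m) / len(m) ** 2 if m else 0.0
            for m in clusters
        ]
        new_labels = []
        for i in range(n):
            best_c, best_d = labels[i], float("inf")
            for c, m in enumerate(clusters):
                if not m:
                    continue  # skip empty clusters
                cross = sum(K[i][j] for j in m) / len(m)
                d = K[i][i] - 2.0 * cross + within[c]
                if d < best_d:
                    best_c, best_d = c, d
            new_labels.append(best_c)
        if new_labels == labels:
            break  # assignments are stable
        labels = new_labels
    return labels

points = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1), (3.0, 3.0), (3.1, 2.9), (2.9, 3.2)]
labels = kernel_kmeans(points, [0, 1, 0, 1, 0, 1], k=2)
```

Here the initial labelling plays the role of an initial classification that defines the first implicit centers; a poor initial labelling can still lead to a poor local optimum, which is the issue the abstract targets.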
Abstract: Wavelets, a powerful tool for signal processing, can be used to approximate a target function. To enhance the sparsity of wavelet approximation, a new algorithm based on wavelet kernel Support Vector Machines (SVM) is proposed, which can converge to the minimum error with better sparsity. Wavelet functions are first used to construct an admissible kernel for the SVM according to Mercer's theorem; the SVM with this kernel is then used to approximate the target function with better sparsity than wavelet approximation itself. Simulation results show the feasibility and validity of wavelet kernel support vector machines.
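One commonly used translation-invariant wavelet kernel, built from the Morlet-type mother wavelet h(u) = cos(1.75u)·exp(-u²/2), gives a feel for what an admissible wavelet kernel looks like. Whether this abstract's paper uses exactly this form is not stated, so the choice of mother wavelet, the dilation parameter `a`, and the random test points are assumptions made for illustration.

```python
import math
import random

def wavelet_kernel(x, y, a=1.0):
    """Product over coordinates of h((x_i - y_i) / a),
    with h(u) = cos(1.75 u) * exp(-u^2 / 2)."""
    k = 1.0
    for xi, yi in zip(x, y):
        u = (xi - yi) / a
        k *= math.cos(1.75 * u) * math.exp(-0.5 * u * u)
    return k

rng = random.Random(0)
points = [(rng.uniform(-2, 2), rng.uniform(-2, 2)) for _ in range(8)]
gram = [[wavelet_kernel(p, q) for q in points] for p in points]

# Mercer's condition implies c^T K c >= 0 for every coefficient vector c;
# this spot-checks one random c rather than proving positive semidefiniteness.
coeffs = [rng.uniform(-1, 1) for _ in points]
quad_form = sum(
    coeffs[i] * gram[i][j] * coeffs[j]
    for i in range(len(points))
    for j in range(len(points))
)
```

The Gram matrix `gram` could be passed to any SVM solver that accepts a precomputed kernel; the quadratic-form check is only a numerical sanity test of admissibility.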
Funding: Supported by the National Natural Science Foundation of China (61272023, 91330118).
Abstract: This paper investigates some approximation properties and learning rates of the Lipschitz kernel on the sphere. A perfect convergence rate for the shifts of the Lipschitz kernel on the sphere, which is faster than $O(n^{-1/2})$, is obtained, where $n$ is the number of parameters needed in the approximation. By means of this approximation, a learning rate of the regularized least squares algorithm with the Lipschitz kernel on the sphere is also deduced.
Funding: Supported by the Natural Science Foundation of Beijing City and the National Natural Science Foundation of China (2 2 30 4 1 0 0 1 30 1
Abstract: In this paper, the normal approximation rate and the random weighting approximation rate of the error distribution of the kernel estimator of the conditional density function f(y|x) are studied. The results may be used to construct confidence intervals for f(y|x).
Abstract: Let $\Omega$ be homogeneous of degree zero, integrable on $S^{d-1}$ and have vanishing moment of order one, and let $a$ be a function on $\mathbb{R}^d$ such that $\nabla a \in L^{\infty}(\mathbb{R}^d)$. Let $T^*_{\Omega,a}$ be the maximal operator associated with the $d$-dimensional Calderón commutator, defined by $T^*_{\Omega,a}f(x) := \sup_{\varepsilon>0}\left|\int_{|x-y|>\varepsilon}\frac{\Omega(x-y)}{|x-y|^{d+1}}\,(a(x)-a(y))\,f(y)\,dy\right|$. In this paper, the authors establish bilinear sparse domination for $T^*_{\Omega,a}$ under the assumption $\Omega \in L^{\infty}(S^{d-1})$. As applications, some quantitative weighted bounds for $T^*_{\Omega,a}$ are obtained.
Funding: Sponsored by the National Natural Science Foundation of China (No. 41204086), the Self-governed Innovative Project of China University of Petroleum (No. 13CX02041A), the Doctoral Fund of National Ministry of Education (No. 20110133120001), the National 863 Project (2011AA060301), and the Major National Science and Technology Program (No. 2011ZX05006-002).
Abstract: This paper discusses Born/Rytov approximation tomographic velocity inversion methods constrained by the Fresnel zone. Calculations of the sensitivity kernel function and traveltime residuals are critical in tomographic velocity inversion. Based on the Born/Rytov approximation of the frequency-domain wave equation, we derive the traveltime sensitivity kernels of the wave equation on the band-limited wave field and simultaneously obtain the traveltime residuals based on the Rytov approximation. In contrast to single-ray tomography, the modified velocity inversion method improves the inversion stability. Tests on a near-surface velocity model and field data show that the proposed method has higher accuracy and computational efficiency than ray theory tomography and full waveform inversion methods.
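Abstract: This paper proposes a fuzzy rough c-means clustering algorithm based on the knowledge measure (KFRCM). Traditional clustering algorithms have limitations when handling data with fuzzy boundaries: they are sensitive to the initial cluster centers and inefficient in high-dimensional spaces. To address these problems, a feature-weighted knowledge measure is introduced and combined with fuzzy membership functions and rough set approximation operators, and Gaussian kernel similarity is used to enhance boundary characteristics. Experiments on 14 datasets show that KFRCM outperforms six mainstream clustering algorithms in clustering accuracy, stability, and computational efficiency. This study is the first to combine the knowledge measure with fuzzy rough clustering, offering new ideas and algorithms for developing more reliable and adaptable clustering algorithms.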
Abstract: Using reproducing kernels for Hilbert spaces, we give the best approximation for the Weierstrass transform associated with the spherical mean operator. Estimates of the extremal functions are also checked.
Funding: Funded by the National Natural Science Foundation of China under Grant Nos. 61972057 and U1836208; the Hunan Provincial Natural Science Foundation of China under Grant No. 2019JJ50655; the Scientific Research Foundation of Hunan Provincial Education Department of China under Grant No. 18B160; the Open Fund of Hunan Key Laboratory of Smart Roadway and Cooperative Vehicle Infrastructure Systems (Changsha University of Science and Technology) under Grant No. kfj180402; the "Double First-class" International Cooperation and Development Scientific Research Project of Changsha University of Science and Technology under Grant No. 2018IC25; and the Researchers Supporting Project No. (RSP-2020/102), King Saud University, Riyadh, Saudi Arabia.
Abstract: Multiple kernel clustering is an unsupervised data analysis method that has been used in various scenarios where data is easy to collect but hard to label. However, multiple kernel clustering for incomplete data is a critical yet challenging task. Although existing absent multiple kernel clustering methods have achieved remarkable performance on this task, they may fail when data has a high value-missing rate, and they may easily fall into a local optimum. To address these problems, in this paper we propose an absent multiple kernel clustering (AMKC) method on incomplete data. The AMKC method first clusters the initialized incomplete data. Then, it constructs a new multiple-kernel-based data space, referred to as K-space, from multiple sources to learn kernel combination coefficients. Finally, it seamlessly integrates an incomplete-kernel-imputation objective, a multiple-kernel-learning objective, and a kernel-clustering objective in order to achieve absent multiple kernel clustering. The three stages in this process are carried out simultaneously until the convergence condition is met. Experiments on six datasets with various characteristics demonstrate that the kernel imputation and clustering performance of the proposed method is significantly better than that of state-of-the-art competitors. Meanwhile, the proposed method converges quickly.
Funding: Project supported by the National Natural Science Foundation of China (No. 10461005).
Abstract: Solving partial differential equations is important in many fields. The exact solution to a class of partial differential equations with variable coefficients is obtained in a reproducing kernel space. To obtain the approximate solution, an iterative method is given and its convergence is proved. A numerical example shows that the method is effective and practical.
Abstract: In this paper we study the viscosity analysis of the spatially homogeneous Boltzmann equation for Maxwellian molecules. We first show the global existence in time of the mild solution of the viscosity equation . We then study the asymptotic behaviour of the mild solution as the coefficients , and an estimate on is derived.
Abstract: In this paper, we apply a new algorithm of the reproducing kernel method to obtain approximate solutions to some functional-differential equations. The numerical results demonstrate the accuracy of the proposed algorithm.
Abstract: This paper investigates the nonlinear boundary value problem resulting from the exact reduction of the Navier-Stokes equations for unsteady magnetohydrodynamic boundary layer flow over a stretching/shrinking permeable sheet submerged in a moving fluid. To solve this equation, a numerical method is proposed based on Laguerre functions combined with the reproducing kernel Hilbert space method. Using the operational matrices of derivatives, we reduce the problem to a set of algebraic equations. We also compare this work with some other numerical results and present a solution that proves to be highly accurate.