Leveraging the extraordinary phenomena of quantum superposition and quantum correlation,quantum computing offers unprecedented potential for addressing challenges beyond the reach of classical computers.This paper tac...Leveraging the extraordinary phenomena of quantum superposition and quantum correlation,quantum computing offers unprecedented potential for addressing challenges beyond the reach of classical computers.This paper tackles two pivotal challenges in the realm of quantum computing:firstly,the development of an effective encoding protocol for translating classical data into quantum states,a critical step for any quantum computation.Different encoding strategies can significantly influence quantum computer performance.Secondly,we address the need to counteract the inevitable noise that can hinder quantum acceleration.Our primary contribution is the introduction of a novel variational data encoding method,grounded in quantum regression algorithm models.By adapting the learning concept from machine learning,we render data encoding a learnable process.This allowed us to study the role of quantum correlation in data encoding.Through numerical simulations of various regression tasks,we demonstrate the efficacy of our variational data encoding,particularly post-learning from instructional data.Moreover,we delve into the role of quantum correlation in enhancing task performance,especially in noisy environments.Our findings underscore the critical role of quantum correlation in not only bolstering performance but also in mitigating noise interference,thus advancing the frontier of quantum computing.展开更多
MicroRNAs(miRNAs)are closely related to numerous complex human diseases,therefore,exploring miRNA-disease associations(MDAs)can help people gain a better understanding of complex disease mechanism.An increasing number...MicroRNAs(miRNAs)are closely related to numerous complex human diseases,therefore,exploring miRNA-disease associations(MDAs)can help people gain a better understanding of complex disease mechanism.An increasing number of computational methods have been developed to predict MDAs.However,the sparsity of the MDAs may hinder the performance of many methods.In addition,many methods fail to capture the nonlinear relationships of miRNA-disease network and inadequately leverage the features of network and neighbor nodes.In this study,we propose a deep matrix factorization model with variational autoencoder(DMFVAE)to predict potential MDAs.DMFVAE first decomposes the original association matrix and the enhanced association matrix,in which the enhanced association matrix is enhanced by self-adjusting the nearest neighbor method,to obtain sparse vectors and dense vectors,respectively.Then,the variational encoder is employed to obtain the nonlinear latent vectors of miRNA and disease for the sparse vectors,and meanwhile,node2vec is used to obtain the network structure embedding vectors of miRNA and disease for the dense vectors.Finally,sample features are acquired by combining the latent vectors and network structure embedding vectors,and the final prediction is implemented by convolutional neural network with channel attention.To evaluate the performance of DMFVAE,we conduct five-fold cross validation on the HMDD v2.0 and HMDD v3.2 datasets and the results show that DMFVAE performs well.Furthermore,case studies on lung neoplasms,colon neoplasms,and esophageal neoplasms confirm the ability of DMFVAE in identifying potential miRNAs for human diseases.展开更多
In this study,we employ advanced data-driven techniques to investigate the complex relationships between the yields of five major crops and various geographical and spatiotemporal features in Senegal.We analyze how th...In this study,we employ advanced data-driven techniques to investigate the complex relationships between the yields of five major crops and various geographical and spatiotemporal features in Senegal.We analyze how these features influence crop yields by utilizing remotely sensed data.Our methodology incorporates clustering algorithms and correlation matrix analysis to identify significant patterns and dependencies,offering a comprehensive understanding of the factors affecting agricultural productivity in Senegal.To optimize the model's performance and identify the optimal hyperparameters,we implemented a comprehensive grid search across four distinct machine learning regressors:Random Forest,Extreme Gradient Boosting(XGBoost),Categorical Boosting(CatBoost),and Light Gradient-Boosting Machine(LightGBM).Each regressor offers unique functionalities,enhancing our exploration of potential model configurations.The top-performing models were selected based on evaluating multiple performance metrics,ensuring robust and accurate predictive capabilities.The results demonstrated that XGBoost and CatBoost perform better than the other two.We introduce synthetic crop data generated using a Variational Auto Encoder to address the challenges posed by limited agricultural datasets.By achieving high similarity scores with real-world data,our synthetic samples enhance model robustness,mitigate overfitting,and provide a viable solution for small dataset issues in agriculture.Our approach distinguishes itself by creating a flexible model applicable to various crops together.By integrating five crop datasets and generating high-quality synthetic data,we improve model performance,reduce overfitting,and enhance realism.Our findings provide crucial insights for productivity drivers in key cropping systems,enabling robust recommendations and strengthening the decision-making capabilities of policymakers and farmers in datascarce regions.展开更多
基金the National Natural Science Foun-dation of China(Grant Nos.12105090 and 12175057).
文摘Leveraging the extraordinary phenomena of quantum superposition and quantum correlation,quantum computing offers unprecedented potential for addressing challenges beyond the reach of classical computers.This paper tackles two pivotal challenges in the realm of quantum computing:firstly,the development of an effective encoding protocol for translating classical data into quantum states,a critical step for any quantum computation.Different encoding strategies can significantly influence quantum computer performance.Secondly,we address the need to counteract the inevitable noise that can hinder quantum acceleration.Our primary contribution is the introduction of a novel variational data encoding method,grounded in quantum regression algorithm models.By adapting the learning concept from machine learning,we render data encoding a learnable process.This allowed us to study the role of quantum correlation in data encoding.Through numerical simulations of various regression tasks,we demonstrate the efficacy of our variational data encoding,particularly post-learning from instructional data.Moreover,we delve into the role of quantum correlation in enhancing task performance,especially in noisy environments.Our findings underscore the critical role of quantum correlation in not only bolstering performance but also in mitigating noise interference,thus advancing the frontier of quantum computing.
基金the National Natural Science Foundation of China(Grant Nos.62202004,and 62322301)the Natural Science Foundation of Anhui Province(No.2108085QF267)+1 种基金the University Synergy Innovation Program of Anhui Province(No.GXXT-2021-039)the Anhui University Outstanding Youth Research Project(No.2022AH020010)。
文摘MicroRNAs(miRNAs)are closely related to numerous complex human diseases,therefore,exploring miRNA-disease associations(MDAs)can help people gain a better understanding of complex disease mechanism.An increasing number of computational methods have been developed to predict MDAs.However,the sparsity of the MDAs may hinder the performance of many methods.In addition,many methods fail to capture the nonlinear relationships of miRNA-disease network and inadequately leverage the features of network and neighbor nodes.In this study,we propose a deep matrix factorization model with variational autoencoder(DMFVAE)to predict potential MDAs.DMFVAE first decomposes the original association matrix and the enhanced association matrix,in which the enhanced association matrix is enhanced by self-adjusting the nearest neighbor method,to obtain sparse vectors and dense vectors,respectively.Then,the variational encoder is employed to obtain the nonlinear latent vectors of miRNA and disease for the sparse vectors,and meanwhile,node2vec is used to obtain the network structure embedding vectors of miRNA and disease for the dense vectors.Finally,sample features are acquired by combining the latent vectors and network structure embedding vectors,and the final prediction is implemented by convolutional neural network with channel attention.To evaluate the performance of DMFVAE,we conduct five-fold cross validation on the HMDD v2.0 and HMDD v3.2 datasets and the results show that DMFVAE performs well.Furthermore,case studies on lung neoplasms,colon neoplasms,and esophageal neoplasms confirm the ability of DMFVAE in identifying potential miRNAs for human diseases.
文摘In this study,we employ advanced data-driven techniques to investigate the complex relationships between the yields of five major crops and various geographical and spatiotemporal features in Senegal.We analyze how these features influence crop yields by utilizing remotely sensed data.Our methodology incorporates clustering algorithms and correlation matrix analysis to identify significant patterns and dependencies,offering a comprehensive understanding of the factors affecting agricultural productivity in Senegal.To optimize the model's performance and identify the optimal hyperparameters,we implemented a comprehensive grid search across four distinct machine learning regressors:Random Forest,Extreme Gradient Boosting(XGBoost),Categorical Boosting(CatBoost),and Light Gradient-Boosting Machine(LightGBM).Each regressor offers unique functionalities,enhancing our exploration of potential model configurations.The top-performing models were selected based on evaluating multiple performance metrics,ensuring robust and accurate predictive capabilities.The results demonstrated that XGBoost and CatBoost perform better than the other two.We introduce synthetic crop data generated using a Variational Auto Encoder to address the challenges posed by limited agricultural datasets.By achieving high similarity scores with real-world data,our synthetic samples enhance model robustness,mitigate overfitting,and provide a viable solution for small dataset issues in agriculture.Our approach distinguishes itself by creating a flexible model applicable to various crops together.By integrating five crop datasets and generating high-quality synthetic data,we improve model performance,reduce overfitting,and enhance realism.Our findings provide crucial insights for productivity drivers in key cropping systems,enabling robust recommendations and strengthening the decision-making capabilities of policymakers and farmers in datascarce regions.