Orthogonal Time Frequency and Space (OTFS) modulation is expected to provide high-speed and ultra-reliable communications for emerging mobile applications, including low-orbit satellite communications. Using the Doppler frequency for positioning is a promising research direction in the integration of communication and navigation. To tackle the high Doppler frequency and low signal-to-noise ratio (SNR) in satellite communication, this paper proposes a Red and Blue Frequency Shift Discriminator (RBFSD) based on the pseudo-noise (PN) sequence. The paper shows that the cross-correlation function in the Doppler domain has the shape of a Sinc function. It therefore applies modulation in the Delay-Doppler domain using a PN sequence and adjusts the Doppler frequency estimate by red-shifting or blue-shifting. Simulation results show that the performance of Doppler frequency estimation is close to the Cramér-Rao Lower Bound when the SNR is greater than -15 dB. The proposed algorithm is about 1/D times less complex than the existing PN pilot sequence algorithm, where D is the resolution of the fractional Doppler.
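The Sinc-shaped Doppler correlation mentioned in the abstract above can be illustrated with a small sketch. It is not the paper's RBFSD: it assumes a noise-free, unit-amplitude complex tone observed over `n` samples, measures offsets in DFT bins, and uses a generic early-late style discriminator that compares a blue-shifted and a red-shifted hypothesis. All function names, the `delta` spacing, and the `gain` value are hypothetical.

```python
import cmath
import math


def doppler_correlation(offset_bins: float, n: int = 64) -> float:
    """Normalized |correlation| between a tone and a replica offset by `offset_bins` DFT bins."""
    acc = sum(cmath.exp(2j * math.pi * offset_bins * k / n) for k in range(n))
    return abs(acc) / n


def dirichlet_magnitude(offset_bins: float, n: int = 64) -> float:
    """Closed form |sin(pi x) / (n sin(pi x / n))|, which approaches |sinc(x)| for large n."""
    if offset_bins == 0:
        return 1.0
    x = math.pi * offset_bins
    return abs(math.sin(x) / (n * math.sin(x / n)))


def shift_discriminator(f_true: float, f_hat: float, delta: float = 0.5, n: int = 64) -> float:
    """Assumed early-late rule: positive output means the estimate should move up (blue shift)."""
    blue = doppler_correlation(f_true - (f_hat + delta), n)  # hypothesis shifted upward
    red = doppler_correlation(f_true - (f_hat - delta), n)   # hypothesis shifted downward
    return (blue - red) / (blue + red)


def track(f_true: float, f_hat: float = 0.0, gain: float = 0.5, steps: int = 10) -> float:
    """Iteratively nudge the estimate in the direction indicated by the discriminator."""
    for _ in range(steps):
        f_hat += gain * shift_discriminator(f_true, f_hat)
    return f_hat
```

For an ideal Sinc main lobe with `delta = 0.5`, the normalized discriminator output equals twice the residual offset, which is why a gain of 0.5 is used in this sketch; a noisy setting would need a smaller gain and averaging.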
In recent years, self-supervised learning has achieved great success in areas such as computer vision and natural language processing because it can mine supervisory signals from unlabeled data and reduce the reliance on manual labels. However, currently generated self-supervised signals are based on either neighbor discrimination or self-discrimination, and no model integrates the two. This paper therefore proposes Fu-Rec, which integrates neighbor-discrimination contrastive learning and self-discrimination contrastive learning and consists of three modules: (1) neighbor-discrimination contrastive learning, (2) self-discrimination contrastive learning, and (3) a recommendation module. The neighbor-discrimination and self-discrimination contrastive learning tasks serve as auxiliary tasks that assist the recommendation task. Fu-Rec exploits the respective advantages of neighbor discrimination and self-discrimination by considering the information of the user's neighbors as well as that of the user and the item themselves, which improves the performance of the recommendation module. Experimental results on several public datasets demonstrate the effectiveness of the proposed Fu-Rec.
Deep neural networks (DNNs) are effective in solving both forward and inverse problems for nonlinear partial differential equations (PDEs). However, conventional DNNs are not effective in handling problems such as delay differential equations (DDEs) and delay integro-differential equations (DIDEs) with constant delays, primarily because of their low regularity at delay-induced breaking points. In this paper, a DNN method combined with multi-task learning (MTL) is proposed to solve both the forward and inverse problems of DIDEs. The core idea is to divide the original equation into multiple tasks based on the delay, use auxiliary outputs to represent the integral terms, and then use MTL to incorporate the properties at the breaking points into the loss function. Furthermore, given the increased training difficulty associated with multiple tasks and outputs, we employ a sequential training scheme to reduce training complexity and provide reference solutions for subsequent tasks. This approach significantly enhances the approximation accuracy of solving DIDEs with DNNs, as demonstrated by comparisons with traditional DNN methods. We validate the effectiveness of the method through several numerical experiments, test various parameter-sharing structures in MTL, and compare their results. Finally, the method is applied to the inverse problem of a nonlinear DIDE, and the results show that the unknown parameters of the DIDE can be discovered from sparse or noisy data.
Recovering high-quality inscription images from inscription images corrupted by unknown and complex noise is a challenging research problem. Unlike natural images, character images depend more heavily on stroke information. However, existing models mainly consider pixel-level information while ignoring structural information of the character, such as its edge and glyph, resulting in reconstructed images with mottled local structure and character damage. To solve these problems, we propose a novel generative adversarial network (GAN) framework, EDU-GAN, based on an edge-guided generator and a discriminator constructed from a dual-domain U-Net. Unlike existing frameworks, the generator introduces an edge extraction module whose output guides the denoising process through an attention mechanism, which preserves the edge detail of the restored inscription image. Moreover, a dual-domain U-Net-based discriminator is proposed to learn the global and local discrepancies between the denoised and label images in both the image and morphological domains, which is helpful for blind denoising tasks. Adversarial training of the proposed dual-domain discriminator and generator reduces local artifacts and keeps the denoised character structure intact. Because real inscription image data are lacking, we built a real-inscription dataset to provide an effective benchmark for studying inscription image denoising. The experimental results show the superiority of our method on both the synthetic and real-inscription datasets.
Zero-shot learning enables the recognition of samples from new classes by transferring models learned from semantic features and existing sample features to classes that have never been seen before. The consistency of different types of features and domain shift are two of the critical issues in zero-shot learning. To address both issues, this paper proposes a new model structure. The traditional approach maps semantic features and visual features into the same feature space; building on this, the proposed model uses a dual-discriminator approach, which further enhances the consistency between semantic and visual features. At the same time, this approach aligns unseen-class semantic features with training-set samples, providing a portion of the information about the unseen classes. In addition, a new feature fusion method is proposed. This method is equivalent to adding a perturbation to the seen-class features, which reduces the degree to which the classification results are biased towards the seen classes. The feature fusion method also provides part of the information about the unseen classes, improving classification accuracy in generalized zero-shot learning and reducing domain bias. The proposed method is validated and compared with other methods on four datasets, and the experimental results show that it achieves promising results.
Cooperative multi-agent reinforcement learning (MARL) is a key technology for enabling cooperation in complex multi-agent systems. It has achieved remarkable progress in areas such as gaming, autonomous driving, and multi-robot control. Empowering cooperative MARL with multi-task decision-making capabilities is expected to further broaden its application scope. In multi-task scenarios, cooperative MARL algorithms need to address three types of multi-task problems: reward-related multi-task problems, arising from different reward functions; multi-domain multi-task problems, caused by differences in state and action spaces and state transition functions; and scalability-related multi-task problems, resulting from dynamic variation in the number of agents. Most existing studies focus on scalability-related multi-task problems. However, with the increasing integration of large language models (LLMs) and multi-agent systems, a growing number of LLM-based multi-agent systems have emerged, enabling more complex multi-task cooperation. This paper provides a comprehensive review of the latest advances in this field. By combining multi-task reinforcement learning with cooperative MARL, we categorize and analyze the three major types of multi-task problems under multi-agent settings, offering more fine-grained classifications and summarizing key insights for each. In addition, we summarize commonly used benchmarks and discuss future research directions in this area, which hold promise for further enhancing the multi-task cooperation capabilities of multi-agent systems and expanding their practical applications in the real world.
A human motion generation model can extract structural features from existing human motion capture data, and the generated data can drive animated characters. 3D human motion capture sequences contain complex spatial-temporal structures, and a deep learning model can fully describe the latent semantic structure of human motion. To improve the authenticity of the generated human motion sequences, we propose a multi-task motion generation model that consists of a discriminator and a generator. The discriminator classifies motion sequences into different styles according to their similarity to mean spatial-temporal templates computed from motion sequences of 17 crucial human joints, each with three degrees of freedom. The generator then creates target motion sequences in these styles. Unlike traditional related works, our model can handle multiple tasks, such as identifying styles and generating data. In addition, by extracting 17 crucial joints from 29 human joints, our model avoids data redundancy and improves recognition accuracy. The experimental results show that the discriminator can effectively recognize diversified movements and that the generated data correctly fit the actual data. The combination of discriminator and generator solves the problem of the low reuse rate of motion data, and the generated motion sequences are closer to actual movement.
Yellow rust (Puccinia striiformis f. sp. tritici, YR) and fusarium head blight (Fusarium graminearum, FHB) are the two main diseases affecting wheat in the main grain-producing areas of East China, and it is common for the two diseases to appear simultaneously in some main production areas. It is necessary to discriminate wheat YR and FHB at the regional scale in order to accurately locate the disease in space, conduct detailed disease severity monitoring, and apply scientific control. Four images on different dates were acquired from Sentinel-2, Landsat-8, and Gaofen-1 during the critical period of winter wheat, and 22 remote sensing features that characterize the wheat growth status were then calculated. Meanwhile, 6 meteorological parameters that reflect wheat phenological information were obtained by combining site meteorological data with spatial interpolation. The principal components (PCs) of the combined remote sensing and meteorological features were then extracted with principal component analysis (PCA). PCs-based discrimination models were established to map YR and FHB damage using random forest (RF) and a backpropagation neural network (BPNN). Model performance was verified against disease field truth data (57 plots during the filling period) with 5-fold cross-validation. The results revealed that the PCs obtained after PCA dimensionality reduction outperformed the initial features (IFs) from remote sensing and meteorology in discriminating between the two diseases. Compared to the IFs, the average area under the curve for both micro-average and macro-average ROC curves increased by 0.07 in the PCs-based RF models and by 0.16 and 0.13, respectively, in the PCs-based BPNN models. Notably, the PCs-based BPNN discrimination model emerged as the most effective, achieving an overall accuracy of 83.9%. Our proposed discrimination model for wheat YR and FHB, coupled with multi-source remote sensing images and meteorological data, overcomes the limitations of single-sensor and single-phase remote sensing information in multiple-stress discrimination in cloudy and rainy areas. It performs well in revealing the spatial distribution of damage from the two diseases at a regional scale, providing a basis for detailed disease severity monitoring and for scientific prevention and control.
Predicting the health status of stroke patients at different stages of the disease is a critical clinical task. The onset and development of stroke are affected by an array of factors, encompassing genetic predisposition, environmental exposure, unhealthy lifestyle habits, and existing medical conditions. Although existing machine learning-based methods for predicting stroke patients' health status have made significant progress, limitations remain in terms of prediction accuracy, model explainability, and system optimization. This paper proposes a multi-task learning approach based on Explainable Artificial Intelligence (XAI) for predicting the health status of stroke patients. First, we design a comprehensive multi-task learning framework that exploits the correlation among the tasks of predicting various health status indicators, enabling the parallel prediction of multiple health indicators. Second, we develop a multi-task Area Under Curve (AUC) optimization algorithm based on adaptive low-rank representation, which removes irrelevant information from the model structure to enhance the performance of multi-task AUC optimization. Additionally, the model's explainability is analyzed through a stability analysis of SHAP values. Experimental results demonstrate that our approach outperforms the comparison algorithms on the key prognostic metrics of F1 score and Efficiency.
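As a reminder of what the AUC objective in the abstract above measures, the following minimal sketch computes AUC for a single binary task as the fraction of positive-negative pairs that are ranked correctly, and then averages it over tasks. It does not reproduce the paper's adaptive low-rank optimization; the function names and the unweighted (macro) averaging across tasks are assumptions made for illustration.

```python
def auc(labels, scores):
    """AUC as the probability that a positive sample outscores a negative one (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def multitask_auc(task_labels, task_scores):
    """Unweighted mean of per-task AUC values (assumed averaging rule)."""
    values = [auc(y, s) for y, s in zip(task_labels, task_scores)]
    return sum(values) / len(values)
```

The pairwise form makes it clear why AUC is insensitive to class imbalance in a way that accuracy is not, which is one common reason for optimizing it directly in clinical prediction tasks.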
Tropical cyclones (TCs) are one of the most serious types of natural disasters, and accurate TC activity predictions are key to disaster prevention and mitigation. Recently, TC track predictions have made significant progress, but the ability to predict TC intensity clearly lags behind. At present, research on TC intensity prediction takes atmospheric reanalysis data as the research object and mines the relationship between TC-related environmental factors and intensity through deep learning. However, reanalysis data are not available in real time, which does not meet the requirements of operational forecasting. Therefore, a TC intensity prediction model named TC-Rolling is proposed, which can simultaneously extract the degree of symmetry of strong TC convective cloud and the convection intensity, and fuses the deviation-angle variance with satellite images to construct the correlation between TC convection structure and intensity. For the complex dynamic processes of TCs, a convolutional neural network (CNN) is used to learn their temporal and spatial features. For real-time intensity estimation, multi-task learning acts as an implicit time-series enhancement. The model is designed with a rolling strategy that aims to mitigate the decay of long-term dependencies and improve the accuracy of short-term intensity predictions. Since the multiple tasks are correlated, the loss functions for the 12 h and 24 h tasks are corrected. Tested on a sample of TCs in the Northwest Pacific, the model achieves a root-mean-square error (RMSE) of 4.48 kt for 6 h intensity prediction, 5.78 kt for 12 h, and 13.94 kt for 24 h; TC records from official agencies are used to assess the validity of TC-Rolling.
As the group-buying model shows significant progress in attracting new users, enhancing user engagement, and increasing platform profitability, providing personalized recommendations for group-buying users has emerged as a new challenge in the field of recommendation systems. This paper introduces a group-buying recommendation model based on multi-head attention mechanisms and multi-task learning, termed the Multi-head Attention Mechanisms and Multi-task Learning Group-Buying Recommendation (MAMGBR) model, specifically designed to optimize group-buying recommendations on e-commerce platforms. The core dataset of this study comes from the Chinese maternal and infant e-commerce platform “Beibei,” encompassing approximately 430,000 successful group-buying actions and over 120,000 users. The model focuses on two main tasks: recommending items for group organizers (Task I) and recommending participants for a given group-buying event (Task II). In model evaluation, MAMGBR achieves an MRR@10 of 0.7696 for Task I, a 20.23% improvement over baseline models. Furthermore, in Task II, where complex interaction patterns prevail, MAMGBR uses auxiliary loss functions to model the multifaceted roles of users, items, and participants, leading to a 24.08% increase in MRR@100 under a 1:99 sample ratio. Experimental results show that, compared to benchmark models such as NGCF and EATNN, MAMGBR's integration of multi-head attention mechanisms, expert networks, and gating mechanisms enables more accurate modeling of user preferences and social associations within group-buying scenarios, significantly enhancing recommendation accuracy and platform group-buying success rates.
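The MRR@10 and MRR@100 figures quoted above are mean reciprocal rank values truncated at a cutoff k. A minimal sketch of the metric follows, assuming one relevant target per query; the function name and the toy data in the usage note are hypothetical and unrelated to the Beibei dataset.

```python
def mrr_at_k(ranked_lists, targets, k=10):
    """Mean reciprocal rank of each query's single target, counting only the top-k positions."""
    total = 0.0
    for ranking, target in zip(ranked_lists, targets):
        for rank, item in enumerate(ranking[:k], start=1):
            if item == target:
                total += 1.0 / rank  # reciprocal of the 1-based rank of the hit
                break
        # a target outside the top k contributes 0
    return total / len(ranked_lists)
```

For example, with three queries whose targets appear at rank 1, at rank 2, and not at all, the score is (1 + 0.5 + 0) / 3 = 0.5. Under a 1:99 sample ratio, each ranked list would hold the one positive candidate together with 99 sampled negatives.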
Aerosol optical depth (AOD) and fine particulate matter with a diameter of less than or equal to 2.5 μm (PM2.5) play crucial roles in air quality, human health, and climate change. However, the complex AOD-PM2.5 correlation and the limitations of existing algorithms pose a significant challenge to the accurate joint retrieval of these two parameters at the same location. To address this, a multi-task learning (MTL) model that enables the joint retrieval of PM2.5 concentration and AOD is proposed and applied to the top-of-the-atmosphere reflectance data gathered by the Fengyun-4A Advanced Geosynchronous Radiation Imager (FY-4A AGRI), and its performance is compared with that of two single-task learning models, namely Random Forest (RF) and Deep Neural Network (DNN). Specifically, MTL achieves a coefficient of determination (R²) of 0.88 and a root-mean-square error (RMSE) of 0.10 in AOD retrieval. In comparison to RF, the R² increases by 0.04, the RMSE decreases by 0.02, and the percentage of retrieval results falling within the expected error range (Within-EE) rises by 5.55%. The R² and RMSE of PM2.5 retrieval by MTL are 0.84 and 13.76 μg m⁻³, respectively. Compared with RF, the R² increases by 0.06, the RMSE decreases by 4.55 μg m⁻³, and the Within-EE increases by 7.28%. Additionally, compared to DNN, MTL shows an increase of 0.01 in R² and a decrease of 0.02 in RMSE in AOD retrieval, with a corresponding increase of 2.89% in Within-EE. For PM2.5 retrieval, MTL exhibits an increase of 0.05 in R², a decrease of 1.76 μg m⁻³ in RMSE, and an increase of 6.83% in Within-EE. The evaluation suggests that MTL can simultaneously provide improved AOD and PM2.5 retrievals, demonstrating a significant advantage in efficiently capturing the spatial distribution of PM2.5 concentration and AOD.
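The three evaluation metrics named in this abstract (R², RMSE, and Within-EE) can be sketched as follows. Two points are assumptions, since the abstract does not define them: R² is computed here as 1 - SS_res / SS_tot (some retrieval studies report the squared Pearson correlation instead), and the expected-error envelope is taken to be ±(0.05 + 0.15 × observed AOD), a form often used for satellite AOD validation. Function and parameter names are hypothetical.

```python
import math


def rmse(obs, pred):
    """Root-mean-square error between observations and retrievals."""
    return math.sqrt(sum((p - o) ** 2 for o, p in zip(obs, pred)) / len(obs))


def r_squared(obs, pred):
    """Coefficient of determination, 1 - SS_res / SS_tot (assumed definition)."""
    mean_obs = sum(obs) / len(obs)
    ss_res = sum((o - p) ** 2 for o, p in zip(obs, pred))
    ss_tot = sum((o - mean_obs) ** 2 for o in obs)
    return 1.0 - ss_res / ss_tot


def within_ee(obs, pred, abs_term=0.05, rel_term=0.15):
    """Percentage of retrievals inside the assumed envelope +/-(abs_term + rel_term * obs)."""
    hits = sum(abs(p - o) <= abs_term + rel_term * o for o, p in zip(obs, pred))
    return 100.0 * hits / len(obs)
```

Reporting Within-EE alongside RMSE is useful because the envelope widens with the observed AOD, so it judges errors at high aerosol loading less harshly than a fixed absolute threshold would.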
Accurate short-term forecasting of offshore wind fields is still challenging for numerical weather prediction models. Based on three years of 48-hour forecast data from the European Centre for Medium-Range Weather Forecasts Integrated Forecasting System global model (ECMWF-IFS) over 14 offshore weather stations along the coast of Shandong Province, this study introduces a multi-task learning (MTL) model (TabNet-MTL), which significantly reduces the forecast bias of near-surface wind direction and speed simultaneously. TabNet-MTL adopts feature engineering, uses mean square error as the loss function, and employs 5-fold cross-validation to ensure the generalization ability of the trained model. It demonstrates superior skill in wind field correction across different forecast lead times over all stations compared to its single-task version (TabNet-STL) and three other popular single-task learning models (Random Forest, LightGBM, and XGBoost). Results show that it significantly reduces the root mean square error of the ECMWF-IFS wind speed forecast from 2.20 to 1.25 m s⁻¹ and increases the forecast accuracy of wind direction from 50% to 65%. Because TabNet-MTL is an explainable deep learning model, the weather stations and long-term temporal statistics of near-surface wind speed can be identified as the most influential variables in its feature engineering.
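Scoring wind direction requires handling the wrap-around at 0°/360°, which a plain absolute difference gets wrong. The sketch below shows one way to do that. The abstract does not say how wind direction accuracy is defined, so the ±22.5° tolerance used here is purely an assumption, and both function names are hypothetical.

```python
def angular_error(forecast_deg, observed_deg):
    """Smallest absolute angle, in degrees, between two wind directions."""
    diff = abs(forecast_deg - observed_deg) % 360.0
    return min(diff, 360.0 - diff)


def direction_accuracy(forecasts, observations, tolerance_deg=22.5):
    """Percentage of forecasts within the (assumed) tolerance of the observed direction."""
    hits = sum(angular_error(f, o) <= tolerance_deg for f, o in zip(forecasts, observations))
    return 100.0 * hits / len(forecasts)
```

With this definition, a forecast of 350° against an observation of 10° counts as a 20° error rather than 340°.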
The accurate prediction of drug absorption, distribution, metabolism, excretion, and toxicity (ADMET) properties is a crucial step in early drug development for reducing failure risk. Current deep learning approaches face challenges with data sparsity and information loss due to the limitations of single molecular representations and isolated predictive tasks. This research proposes molecular properties prediction with parallel-view and collaborative learning (MolP-PC), a multi-view fusion and multi-task deep learning framework that integrates 1D molecular fingerprints (MFs), 2D molecular graphs, and 3D geometric representations, incorporating an attention-gated fusion mechanism and a multi-task adaptive learning strategy for precise ADMET property predictions. Experimental results demonstrate that MolP-PC achieves optimal performance in 27 of 54 tasks, with its multi-task learning (MTL) mechanism significantly enhancing predictive performance on small-scale datasets and surpassing single-task models in 41 of 54 tasks. Additional ablation studies and interpretability analyses confirm the significance of multi-view fusion in capturing multi-dimensional molecular information and enhancing model generalization. A case study examining the anticancer compound Oroxylin A demonstrates MolP-PC's effective generalization in predicting key pharmacokinetic parameters such as half-life (T0.5) and clearance (CL), indicating its practical utility in drug modeling. However, the model tends to underestimate volume of distribution (VD), indicating potential for improvement in analyzing compounds with high tissue distribution. This study presents an efficient and interpretable approach for ADMET property prediction, establishing a novel framework for molecular optimization and risk assessment in drug development.
Polymethyl methacrylate (PMMA) is an optically transparent thermoplastic with favorable processing conditions. In this study, a series of plastic scintillators are prepared via thermal polymerization, and the impact of PMMA content on their transparency and pulse shape discrimination (PSD) ability is investigated. The fabricated samples, comprising a polystyrene (PS)-PMMA matrix, 30.0 wt% 2,5-diphenyloxazole (PPO), and 0.2 wt% 9,10-diphenylanthracene (DPA), exhibit high transparency, with transmissivity ranging from 70.0 to 90.0% (above 415.0 nm), and demonstrate excellent n/γ discrimination capability. Transparency increased with increasing PMMA content across the entire visible light spectrum. However, the PSD performance gradually deteriorated as the aromatic matrix was replaced with PMMA. The scintillator containing 20.0 wt% PMMA demonstrated the best stability with respect to PSD properties and relative light yields.
INTRODUCTION Reports indicating that culturally and linguistically diverse (CALD) people (often with migrant backgrounds) in Australia and New Zealand are more likely to be placed in compulsory community treatment (CCT) have rightly raised concerns that such action might be discriminatory.
Previous studies have found associations between color discrimination deficits and cognitive impairments, in addition to aging. However, investigations into the microstructural pathology of brain white matter (WM) associated with these deficits remain limited. This study aimed to examine the microstructural characteristics of WM in a non-demented population with abnormal color discrimination, using Neurite Orientation Dispersion and Density Imaging (NODDI), and to explore their correlations with cognitive functions and cognition-related plasma biomarkers. Tract-based spatial statistics analysis revealed significant differences in specific brain regions between the abnormal color discrimination group and the healthy controls, characterized by increased isotropic volume fraction and decreased neurite density index and orientation dispersion index. Further analysis of region-of-interest parameters revealed that the isotropic volume fraction in the bilateral anterior thalamic radiation, superior longitudinal fasciculus, cingulum, and forceps minor was significantly correlated with poorer performance on neuropsychological assessments and, to varying degrees, with various cognition-related plasma biomarkers. These findings provide neuroimaging evidence that WM microstructural abnormalities in non-demented individuals with abnormal color discrimination are associated with cognitive dysfunction, potentially serving as early markers of cognitive decline.
This study discusses the challenges in the logging evaluation of low-resistivity oil reservoirs, especially the difficulties in calculating their saturation. A correction method for equivalent water conductivity is proposed to account for the high conductivity caused by small amounts of water distributed in a network within the low-resistivity reservoir, which mimics the effects of high water saturation. This approach significantly improves the accuracy of hydrocarbon saturation calculations in these low-resistivity reservoirs. The corrected hydrocarbon saturation values closely match the porosity and are consistent with experimental results. This study also establishes a discrimination process to determine whether corrections are required, and verifies the effectiveness and accuracy of the method through an application example.
Background: Three-dimensional (3D) shape representation using mesh data is essential in various applications, such as virtual reality and simulation technologies. Current methods for extracting features from mesh edges or faces struggle with complex 3D models because edge-based approaches miss global contexts and face-based methods overlook variations in adjacent areas, which affects the overall precision. To address these issues, we propose the Feature Discrimination and Context Propagation Network (FDCPNet), a novel approach that synergistically integrates local and global features in mesh datasets. Methods: FDCPNet is composed of two modules: (1) the Feature Discrimination Module, which employs an attention mechanism to enhance the identification of key local features, and (2) the Context Propagation Module, which enriches key local features by integrating global contextual information, thereby facilitating a more detailed and comprehensive representation of crucial areas within the mesh model. Results: Experiments on popular datasets validated the effectiveness of FDCPNet, showing an improvement in classification accuracy over the baseline MeshNet. Furthermore, even with reduced mesh face numbers and limited training data, FDCPNet achieved promising results, demonstrating its robustness in scenarios of variable complexity.
Mineral identification and discrimination play a significant role in geological study. Intelligent mineral discrimination based on deep learning has the advantages of automation, low cost, short processing time, and a low error rate. In this article, the characteristics of quartz, biotite, and K-feldspar from granite thin sections under cross-polarized light were studied for the intelligent classification of mineral images using the Inception-v3 deep convolutional neural network (CNN) and transfer learning. Dynamic images taken from multiple angles were employed to enhance the accuracy and reproducibility of mineral discrimination. Test results show that the average discrimination accuracies for quartz, biotite, and K-feldspar are 100.00%, 96.88%, and 90.63%, respectively. These results demonstrate the feasibility and reliability of applying convolutional neural networks to mineral image classification. This study could have a significant impact on the intelligent discrimination of complicated minerals using deep learning methods, and it provides a new perspective for the development of more professional and practical intelligent mineral discrimination tools.
Abstract: Orthogonal Time Frequency Space (OTFS) modulation is expected to provide high-speed and ultra-reliable communications for emerging mobile applications, including low-orbit satellite communications. Using the Doppler frequency for positioning is a promising research direction in the integration of communication and navigation. To tackle the high Doppler frequency and low signal-to-noise ratio (SNR) in satellite communication, this paper proposes a Red and Blue Frequency Shift Discriminator (RBFSD) based on the pseudo-noise (PN) sequence. The paper derives that the cross-correlation function in the Doppler domain has the shape of a sinc function. It therefore modulates a PN sequence onto the delay-Doppler domain and adjusts the Doppler frequency estimate by red-shifting or blue-shifting. Simulation results show that the Doppler frequency estimation performance is close to the Cramér-Rao Lower Bound when the SNR is greater than -15 dB. The complexity of the proposed algorithm is about 1/D of that of the existing PN pilot sequence algorithm, where D is the resolution of the fractional Doppler.
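The sinc-shaped Doppler correlation and the red/blue shift idea can be illustrated with a small sketch. This is not the paper's RBFSD algorithm: the helper names `doppler_correlation` and `shift_discriminator`, the unit-modulus sequence, the sequence length of 1024, and the half-bin test shift are all assumptions made for illustration. The sketch only shows that the correlation magnitude against a Doppler-offset copy follows |sinc|, and that comparing two replicas shifted in opposite directions yields the sign of the residual offset.

```python
import cmath
import math


def doppler_correlation(offset_bins: float, n: int = 1024) -> float:
    """Normalised |correlation| of a unit-modulus sequence with a copy
    rotated by `offset_bins` Doppler bins (hypothetical helper)."""
    total = sum(cmath.exp(2j * math.pi * offset_bins * k / n) for k in range(n))
    return abs(total) / n


def sinc(x: float) -> float:
    """Normalised sinc, sin(pi x) / (pi x), with sinc(0) = 1."""
    return 1.0 if x == 0 else math.sin(math.pi * x) / (math.pi * x)


def shift_discriminator(residual_bins: float, delta: float = 0.5) -> float:
    """Difference between the correlations seen by two replicas shifted by
    -delta and +delta bins (assumed early/late-style comparison).
    Positive output suggests a positive residual offset, negative output
    a negative one, and zero output no residual."""
    return (doppler_correlation(residual_bins - delta)
            - doppler_correlation(residual_bins + delta))


if __name__ == "__main__":
    for offset in (0.0, 0.25, 0.5, 1.0):
        print(offset, doppler_correlation(offset), abs(sinc(offset)))
    print(shift_discriminator(0.2), shift_discriminator(-0.2))
```

For long sequences the exact sum is a Dirichlet kernel, which is why it only approximates the sinc shape; the approximation tightens as the sequence length grows.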
Funding: Supported by the Scientific and Technological Innovation 2030-Major Project of New Generation Artificial Intelligence (2020AAA0109300), the Science and Technology Commission of Shanghai Municipality (21DZ2203100), and the 2023 Anhui Province Key Research and Development Plan Project-Special Project of Science and Technology Cooperation (2023i11020002).
Abstract: In recent years, self-supervised learning has achieved great success in areas such as computer vision and natural language processing because it can mine supervised signals from unlabeled data and reduce the reliance on manual labels. However, the self-supervised signals generated by current methods come from either neighbor discrimination or self-discrimination, and no model integrates the two. This paper therefore proposes Fu-Rec, which integrates neighbor-discrimination contrastive learning and self-discrimination contrastive learning and consists of three modules: (1) neighbor-discrimination contrastive learning, (2) self-discrimination contrastive learning, and (3) a recommendation module. The two contrastive learning tasks serve as auxiliary tasks that assist the recommendation task. Fu-Rec exploits the respective advantages of neighbor discrimination and self-discrimination to consider information from the user’s neighbors as well as from the user and the item themselves, which improves the performance of the recommendation module. Experimental results on several public datasets demonstrate the effectiveness of the proposed Fu-Rec.
Abstract: Deep neural networks (DNNs) are effective in solving both forward and inverse problems for nonlinear partial differential equations (PDEs). However, conventional DNNs are not effective in handling problems such as delay differential equations (DDEs) and delay integro-differential equations (DIDEs) with constant delays, primarily due to their low regularity at delay-induced breaking points. In this paper, a DNN method combined with multi-task learning (MTL) is proposed to solve both the forward and inverse problems of DIDEs. The core idea of this approach is to divide the original equation into multiple tasks based on the delay, using auxiliary outputs to represent the integral terms, and then to use MTL to seamlessly incorporate the properties at the breaking points into the loss function. Furthermore, given the increased training difficulty associated with multiple tasks and outputs, we employ a sequential training scheme to reduce training complexity and provide reference solutions for subsequent tasks. This approach significantly enhances the approximation accuracy of solving DIDEs with DNNs, as demonstrated by comparisons with traditional DNN methods. We validate the effectiveness of this method through several numerical experiments, test various parameter-sharing structures in MTL, and compare the testing results of these structures. Finally, the method is applied to the inverse problem of a nonlinear DIDE, and the results show that the unknown parameters of the DIDE can be discovered from sparse or noisy data.
Funding: Supported by the Key R&D Program of Shaanxi Province, China (Grant Nos. 2022GY-274, 2023-YBSF-505) and the National Natural Science Foundation of China (Grant No. 62273273).
Abstract: Recovering high-quality inscription images from inscription images with unknown and complex noise is a challenging research issue. Unlike natural images, character images depend more heavily on stroke information. However, existing models mainly consider pixel-level information while ignoring structural information of the character, such as its edge and glyph, resulting in reconstructed images with mottled local structure and character damage. To solve these problems, we propose a novel generative adversarial network (GAN) framework, EDU-GAN, based on an edge-guided generator and a discriminator constructed from a dual-domain U-Net framework. Unlike existing frameworks, the generator introduces an edge extraction module and guides its output into the denoising process through an attention mechanism, which maintains the edge detail of the restored inscription image. Moreover, a dual-domain U-Net-based discriminator is proposed to learn the global and local discrepancies between the denoised and the label images in both the image and morphological domains, which is helpful for blind denoising tasks. Adversarial training with the proposed dual-domain discriminator and generator can reduce local artifacts and keep the denoised character structure intact. Because real inscription image data were lacking, we built a real-inscription dataset to provide an effective benchmark for studying inscription image denoising. The experimental results show the superiority of our method on both the synthetic and real-inscription datasets.
Abstract: Zero-shot learning enables the recognition of new class samples by transferring models learned from semantic features and existing sample features to things that have never been seen before. The consistency of different types of features and domain shift are two of the critical issues in zero-shot learning. To address both issues, this paper proposes a new modeling structure. The traditional approach maps semantic features and visual features into the same feature space; building on this, the proposed model uses a dual discriminator approach. This dual discriminator approach can further enhance the consistency between semantic and visual features. At the same time, it can align unseen class semantic features with training set samples, providing a portion of information about the unseen classes. In addition, a new feature fusion method is proposed in the model. This method is equivalent to adding perturbation to the seen class features, which can reduce the degree to which the classification results of the model are biased towards the seen classes. The feature fusion method can also provide part of the information of the unseen classes, improving classification accuracy in generalized zero-shot learning and reducing domain bias. The proposed method is validated and compared with other methods on four datasets, and the experimental results show that it achieves promising results.
Funding: The National Natural Science Foundation of China (62136008, 62293541), the Beijing Natural Science Foundation (4232056), and the Beijing Nova Program (20240484514).
Abstract: Cooperative multi-agent reinforcement learning (MARL) is a key technology for enabling cooperation in complex multi-agent systems. It has achieved remarkable progress in areas such as gaming, autonomous driving, and multi-robot control. Empowering cooperative MARL with multi-task decision-making capabilities is expected to further broaden its application scope. In multi-task scenarios, cooperative MARL algorithms need to address three types of multi-task problems: reward-related multi-task, arising from different reward functions; multi-domain multi-task, caused by differences in state and action spaces and state transition functions; and scalability-related multi-task, resulting from the dynamic variation in the number of agents. Most existing studies focus on scalability-related multi-task problems. However, with the increasing integration between large language models (LLMs) and multi-agent systems, a growing number of LLM-based multi-agent systems have emerged, enabling more complex multi-task cooperation. This paper provides a comprehensive review of the latest advances in this field. By combining multi-task reinforcement learning with cooperative MARL, we categorize and analyze the three major types of multi-task problems under multi-agent settings, offering more fine-grained classifications and summarizing key insights for each. In addition, we summarize commonly used benchmarks and discuss future research directions in this area, which hold promise for further enhancing the multi-task cooperation capabilities of multi-agent systems and expanding their practical applications in the real world.
Abstract: A human motion generation model can extract structural features from existing human motion capture data, and the generated data can drive animated characters. 3D human motion capture sequences contain complex spatial-temporal structures, and a deep learning model can fully describe the latent semantic structure of human motion. To improve the authenticity of the generated human motion sequences, we propose a multi-task motion generation model that consists of a discriminator and a generator. The discriminator classifies motion sequences into different styles according to their similarity to the mean spatial-temporal templates computed from motion sequences of 17 crucial human joints in three degrees of freedom. The generator then creates target motion sequences in these styles. Unlike traditional related works, our model can handle multiple tasks, such as identifying styles and generating data. In addition, by extracting 17 crucial joints from 29 human joints, our model avoids data redundancy and improves recognition accuracy. The experimental results show that the discriminator can effectively recognize diversified movements and that the generated data fit the actual data well. The combination of discriminator and generator addresses the low reuse rate of motion data, and the generated motion sequences are better suited to actual movement.
Funding: Supported by the National Key R&D Program of China (2022YFD2000100), the National Natural Science Foundation of China (42401400), and the Zhejiang Provincial Key Research and Development Program (2023C02018).
Abstract: Yellow rust (Puccinia striiformis f. sp. tritici, YR) and fusarium head blight (Fusarium graminearum, FHB) are the two main diseases affecting wheat in the main grain-producing areas of East China, and the two diseases commonly appear simultaneously in some main production areas. It is necessary to discriminate wheat YR and FHB at the regional scale to accurately locate the disease in space, conduct detailed disease severity monitoring, and apply scientific control. Four images on different dates were acquired from Sentinel-2, Landsat-8, and Gaofen-1 during the critical period of winter wheat, and 22 remote sensing features that characterize the wheat growth status were then calculated. Meanwhile, 6 meteorological parameters that reflect the wheat phenological information were obtained by combining the site meteorological data with spatial interpolation. The principal components (PCs) of the combined remote sensing and meteorological features were then extracted with principal component analysis (PCA). PCs-based discrimination models were established to map YR and FHB damage using random forest (RF) and a backpropagation neural network (BPNN). The models’ performance was verified with field truth data on the diseases (57 plots during the filling period) and 5-fold cross-validation. The results revealed that the PCs obtained after PCA dimensionality reduction outperformed the initial features (IFs) from remote sensing and meteorology in discriminating between the two diseases. Compared to the IFs, the average area under the curve for both micro-average and macro-average ROC curves increased by 0.07 in the PCs-based RF models and by 0.16 and 0.13, respectively, in the PCs-based BPNN models. Notably, the PCs-based BPNN discrimination model emerged as the most effective, achieving an overall accuracy of 83.9%. Our proposed discrimination model for wheat YR and FHB, coupled with multi-source remote sensing images and meteorological data, overcomes the limitations of single-sensor and single-phase remote sensing information in multiple stress discrimination in cloudy and rainy areas. It performs well in revealing the spatial distribution of damage from the two diseases at a regional scale, providing a basis for detailed disease severity monitoring and scientific prevention and control.
Funding: Funded by the Excellent Talent Training Funding Project in Dongcheng District, Beijing (project number 2024-dchrcpyzz-9).
Abstract: Predicting the health status of stroke patients at different stages of the disease is a critical clinical task. The onset and development of stroke are affected by an array of factors, encompassing genetic predisposition, environmental exposure, unhealthy lifestyle habits, and existing medical conditions. Although existing machine learning-based methods for predicting stroke patients’ health status have made significant progress, limitations remain in terms of prediction accuracy, model explainability, and system optimization. This paper proposes a multi-task learning approach based on Explainable Artificial Intelligence (XAI) for predicting the health status of stroke patients. First, we design a comprehensive multi-task learning framework that exploits the correlation between the tasks of predicting various health status indicators, enabling the parallel prediction of multiple health indicators. Second, we develop a multi-task Area Under Curve (AUC) optimization algorithm based on adaptive low-rank representation, which removes irrelevant information from the model structure to enhance the performance of multi-task AUC optimization. Additionally, the model’s explainability is analyzed through the stability analysis of SHAP values. Experimental results demonstrate that our approach outperforms the comparison algorithms on the key prognostic metrics of F1 score and Efficiency.
Funding: Jointly supported by the National Natural Science Foundation of China (Grant Nos. 42075138 and 42375147) and the Program on Key Basic Research Project of Jiangsu (Grant No. BE2023829).
Abstract: Tropical cyclones (TCs) are one of the most serious types of natural disasters, and accurate TC activity predictions are key to disaster prevention and mitigation. Recently, TC track predictions have made significant progress, but the ability to predict TC intensity lags clearly behind. At present, research on TC intensity prediction takes atmospheric reanalysis data as the research object and mines the relationship between TC-related environmental factors and intensity through deep learning. However, reanalysis data are not available in real time, which does not meet the requirements of operational forecasting. Therefore, a TC intensity prediction model named TC-Rolling is proposed, which can simultaneously extract the degree of symmetry of strong TC convective cloud and the convection intensity, and fuse the deviation-angle variance with satellite images to construct the correlation between TC convection structure and intensity. For TCs' complex dynamic processes, a convolutional neural network (CNN) is used to learn their temporal and spatial features. For real-time intensity estimation, multi-task learning acts as an implicit time-series enhancement. The model is designed with a rolling strategy that aims to moderate the long-term dependency decay problem and improve accuracy for short-term intensity predictions. Since the multiple tasks are correlated, the loss functions for the 12 h and 24 h predictions are corrected. Tested on a sample of TCs in the Northwest Pacific, the model yields a root-mean-square error (RMSE) of 4.48 kt for 6 h intensity prediction, 5.78 kt for 12 h, and 13.94 kt for 24 h; TC records from official agencies are used to assess the validity of TC-Rolling.
Funding: Supported by the Key Research and Development Program of Heilongjiang Province (No. 2022ZX01A35).
Abstract: As the group-buying model shows significant progress in attracting new users, enhancing user engagement, and increasing platform profitability, providing personalized recommendations for group-buying users has emerged as a new challenge in the field of recommendation systems. This paper introduces a group-buying recommendation model based on multi-head attention mechanisms and multi-task learning, termed the Multi-head Attention Mechanisms and Multi-task Learning Group-Buying Recommendation (MAMGBR) model, specifically designed to optimize group-buying recommendations on e-commerce platforms. The core dataset of this study comes from the Chinese maternal and infant e-commerce platform “Beibei,” encompassing approximately 430,000 successful group-buying actions and over 120,000 users. The model focuses on two main tasks: recommending items for group organizers (Task I) and recommending participants for a given group-buying event (Task II). In model evaluation, MAMGBR achieves an MRR@10 of 0.7696 for Task I, marking a 20.23% improvement over baseline models. Furthermore, in Task II, where complex interaction patterns prevail, MAMGBR utilizes auxiliary loss functions to effectively model the multifaceted roles of users, items, and participants, leading to a 24.08% increase in MRR@100 under a 1:99 sample ratio. Experimental results show that, compared to benchmark models such as NGCF and EATNN, MAMGBR’s integration of multi-head attention mechanisms, expert networks, and gating mechanisms enables more accurate modeling of user preferences and social associations within group-buying scenarios, significantly enhancing recommendation accuracy and platform group-buying success rates.
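For readers unfamiliar with the MRR@k metric quoted above, the following is a minimal sketch of its standard definition: the reciprocal rank of the correct item is averaged over queries, counting zero when the item does not appear in the top k. The function name `mrr_at_k` and the toy data are illustrative assumptions, not the evaluation code of the MAMGBR paper.

```python
from typing import Hashable, Sequence


def mrr_at_k(ranked_lists: Sequence[Sequence[Hashable]],
             targets: Sequence[Hashable],
             k: int) -> float:
    """Mean reciprocal rank truncated at k (hypothetical helper).

    ranked_lists[i] is the model's ranking for query i, best first;
    targets[i] is the single correct item for that query.
    """
    if len(ranked_lists) != len(targets) or not targets:
        raise ValueError("need one non-empty ranking per target")
    total = 0.0
    for ranked, target in zip(ranked_lists, targets):
        top_k = list(ranked[:k])
        if target in top_k:
            # Ranks are 1-based, so the first position contributes 1.0.
            total += 1.0 / (top_k.index(target) + 1)
    return total / len(targets)


if __name__ == "__main__":
    rankings = [["a", "b", "c"], ["b", "a", "c"], ["c", "b", "a"]]
    truth = ["a", "a", "x"]
    print(mrr_at_k(rankings, truth, k=2))
```

Under a 1:99 sample ratio, each ranked list would hold the one positive item and 99 sampled negatives, so MRR@100 ranks the positive within the full candidate list.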
Funding: Supported by the National Natural Science Foundation of China (Grant Nos. 42030708, 42375138, 42030608, 42105128, 42075079), the Opening Foundation of the Key Laboratory of Atmospheric Sounding, China Meteorological Administration (CMA), and the CMA Research Center on Meteorological Observation Engineering Technology (Grant No. U2021Z03), and the Opening Foundation of the Key Laboratory of Atmospheric Chemistry, CMA (Grant No. 2022B02).
Abstract: Aerosol optical depth (AOD) and fine particulate matter with a diameter of less than or equal to 2.5 μm (PM2.5) play crucial roles in air quality, human health, and climate change. However, the complex correlation between AOD and PM2.5 and the limitations of existing algorithms pose a significant challenge to the accurate joint retrieval of these two parameters at the same location. To address this, a multi-task learning (MTL) model that enables the joint retrieval of PM2.5 concentration and AOD is proposed and applied to the top-of-the-atmosphere reflectance data gathered by the Fengyun-4A Advanced Geosynchronous Radiation Imager (FY-4A AGRI), and its performance is compared with that of two single-task learning models, namely Random Forest (RF) and Deep Neural Network (DNN). Specifically, MTL achieves a coefficient of determination (R²) of 0.88 and a root-mean-square error (RMSE) of 0.10 in AOD retrieval. In comparison to RF, the R² increases by 0.04, the RMSE decreases by 0.02, and the percentage of retrieval results falling within the expected error range (Within-EE) rises by 5.55%. The R² and RMSE of PM2.5 retrieval by MTL are 0.84 and 13.76 μg m⁻³, respectively. Compared with RF, the R² increases by 0.06, the RMSE decreases by 4.55 μg m⁻³, and the Within-EE increases by 7.28%. Additionally, compared to DNN, MTL shows an increase of 0.01 in R² and a decrease of 0.02 in RMSE in AOD retrieval, with a corresponding increase of 2.89% in Within-EE. For PM2.5 retrieval, MTL exhibits an increase of 0.05 in R², a decrease of 1.76 μg m⁻³ in RMSE, and an increase of 6.83% in Within-EE. The evaluation suggests that MTL can provide simultaneously improved AOD and PM2.5 retrievals, demonstrating a significant advantage in efficiently capturing the spatial distribution of PM2.5 concentration and AOD.
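The two headline metrics in this abstract, the coefficient of determination and the root-mean-square error, have standard definitions that can be sketched as follows. The function names and the toy values are illustrative assumptions and are unrelated to the FY-4A AGRI data; the Within-EE metric is omitted because the abstract does not state the expected-error envelope.

```python
import math
from typing import Sequence


def rmse(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Root-mean-square error between observations and retrievals."""
    squared = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))


def r_squared(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Coefficient of determination, 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


if __name__ == "__main__":
    observed = [1.0, 2.0, 3.0, 4.0]   # toy values, not real AOD or PM2.5
    retrieved = [1.0, 2.0, 3.0, 5.0]
    print(r_squared(observed, retrieved), rmse(observed, retrieved))
```

Note that RMSE carries the unit of the retrieved quantity (dimensionless for AOD, μg m⁻³ for PM2.5), whereas R² is unitless, which is why the abstract reports the two side by side.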
Funding: The National Key Research and Development Plan of China [Grant No. 2023YFB3002400]; the Shanghai 2021 Natural Science Foundation [Grant Nos. 21ZR1420400 and 21ZR1419800]; the Shanghai 2023 Natural Science Foundation [Grant No. 23ZR1463000]; the Shandong Provincial Meteorological Bureau Scientific Research Project [Grant No. 2023SDBD05].
Abstract: Accurate short-term forecasting of offshore wind fields is still challenging for numerical weather prediction models. Based on three years of 48-hour forecast data from the European Centre for Medium-Range Weather Forecasts Integrated Forecasting System global model (ECMWF-IFS) over 14 offshore weather stations along the coast of Shandong Province, this study introduces a multi-task learning (MTL) model (TabNet-MTL), which significantly reduces the forecast bias of near-surface wind direction and speed simultaneously. TabNet-MTL adopts feature engineering, uses mean square error as the loss function, and employs 5-fold cross-validation to ensure the generalization ability of the trained model. It demonstrates superior skill in wind field correction across different forecast lead times over all stations compared to its single-task version (TabNet-STL) and three other popular single-task learning models (Random Forest, LightGBM, and XGBoost). Results show that it significantly reduces the root mean square error of the ECMWF-IFS wind speed forecast from 2.20 to 1.25 m s⁻¹ and increases the forecast accuracy of wind direction from 50% to 65%. As an explainable deep learning model, TabNet-MTL identifies the weather stations and long-term temporal statistics of near-surface wind speed as the most influential variables in its feature engineering.
Funding: Supported by the research on key technologies for monitoring and identifying drug abuse of anesthetic drugs and psychotropic drugs, and intervention for addiction (No. 2023YFC3304200); the program of a study on the diagnosis of addiction to synthetic cannabinoids and methods of assessing the risk of abuse (No. 2022YFC3300905); the program of Ab initio design and generation of AI models for small molecule ligands based on target structures (No. 2022PE0AC03); and Zhijiang Lab.
Abstract: The accurate prediction of drug absorption, distribution, metabolism, excretion, and toxicity (ADMET) properties represents a crucial step in early drug development for reducing failure risk. Current deep learning approaches face challenges with data sparsity and information loss due to single-molecule representation limitations and isolated predictive tasks. This research proposes molecular properties prediction with parallel-view and collaborative learning (MolP-PC), a multi-view fusion and multi-task deep learning framework that integrates 1D molecular fingerprints (MFs), 2D molecular graphs, and 3D geometric representations, incorporating an attention-gated fusion mechanism and a multi-task adaptive learning strategy for precise ADMET property predictions. Experimental results demonstrate that MolP-PC achieves optimal performance in 27 of 54 tasks, with its multi-task learning (MTL) mechanism significantly enhancing predictive performance on small-scale datasets and surpassing single-task models in 41 of 54 tasks. Additional ablation studies and interpretability analyses confirm the significance of multi-view fusion in capturing multi-dimensional molecular information and enhancing model generalization. A case study examining the anticancer compound Oroxylin A demonstrates MolP-PC’s effective generalization in predicting key pharmacokinetic parameters such as half-life (T0.5) and clearance (CL), indicating its practical utility in drug modeling. However, the model tends to underestimate volume of distribution (VD), indicating potential for improvement in analyzing compounds with high tissue distribution. This study presents an efficient and interpretable approach for ADMET property prediction, establishing a novel framework for molecular optimization and risk assessment in drug development.
Funding: Supported by the National Natural Science Foundation of China (No. 12027813) and the fund of the National Innovation Center of Radiation Application of China (Nos. KFZC2020020501, KFZC2021010101).
Abstract: Polymethyl methacrylate (PMMA) is an optically transparent thermoplastic with favorable processing conditions. In this study, a series of plastic scintillators are prepared via thermal polymerization, and the impact of PMMA content on their transparency and pulse shape discrimination (PSD) ability is investigated. The fabricated samples, comprising a polystyrene (PS)-PMMA matrix, 30.0 wt% 2,5-diphenyloxazole (PPO), and 0.2 wt% 9,10-diphenylanthracene (DPA), exhibit high transparency, with transmissivity ranging from 70.0 to 90.0% (above 415.0 nm), and demonstrate excellent n/γ discrimination capability. Transparency increased with increasing PMMA content across the entire visible light spectrum. However, the PSD performance gradually deteriorated as the aromatic matrix was replaced with PMMA. The scintillator containing 20.0 wt% PMMA demonstrated the best stability in terms of PSD properties and relative light yield.
Abstract: INTRODUCTION Reports indicating that culturally and linguistically diverse (CALD) people (often with migrant backgrounds) in Australia and New Zealand are more likely to be placed in compulsory community treatment (CCT) have rightly raised concerns that such action might be discriminatory.
Funding: Supported by the Joint Funds for Innovation of Science and Technology, Fujian Province (2021Y9037) and a National Clinical Key Special Subject of China (21281003).
Abstract: Previous studies have found associations between color discrimination deficits and cognitive impairments beyond those explained by aging. However, investigations into the microstructural pathology of brain white matter (WM) associated with these deficits remain limited. This study aimed to examine the microstructural characteristics of WM in the non-demented population with abnormal color discrimination, utilizing Neurite Orientation Dispersion and Density Imaging (NODDI), and to explore their correlations with cognitive functions and cognition-related plasma biomarkers. The tract-based spatial statistics analysis revealed significant differences in specific brain regions between the abnormal color discrimination group and the healthy controls, characterized by increased isotropic volume fraction and decreased neurite density index and orientation dispersion index. Further analysis of region-of-interest parameters revealed that the isotropic volume fraction in the bilateral anterior thalamic radiation, superior longitudinal fasciculus, cingulum, and forceps minor was significantly correlated with poorer performance on neuropsychological assessments and, to varying degrees, with various cognition-related plasma biomarkers. These findings provide neuroimaging evidence that WM microstructural abnormalities in non-demented individuals with abnormal color discrimination are associated with cognitive dysfunction, potentially serving as early markers for cognitive decline.
Abstract: This study discusses the challenges in the logging evaluation of low-resistivity oil reservoirs, especially the difficulties in calculating their saturation. A correction method for equivalent water conductivity is proposed to account for the high conductivity caused by small amounts of water distributed in a network within the low-resistivity reservoir, which mimics the effects of high water saturation. This approach significantly improves the accuracy of hydrocarbon saturation calculations in these low-resistivity reservoirs. The corrected hydrocarbon saturation values closely match the porosity and are consistent with experimental results. This study also establishes a discrimination process to determine whether corrections are required, and verifies the effectiveness and accuracy of the method through an application example.
Funding: Supported by the National Key R&D Program of China (2022YFC3803600).
Abstract: Background: Three-dimensional (3D) shape representation using mesh data is essential in various applications, such as virtual reality and simulation technologies. Current methods for extracting features from mesh edges or faces struggle with complex 3D models because edge-based approaches miss global contexts and face-based methods overlook variations in adjacent areas, which affects the overall precision. To address these issues, we propose the Feature Discrimination and Context Propagation Network (FDCPNet), a novel approach that integrates local and global features in mesh datasets. Methods: FDCPNet is composed of two modules: (1) the Feature Discrimination Module, which employs an attention mechanism to enhance the identification of key local features, and (2) the Context Propagation Module, which enriches key local features by integrating global contextual information, thereby facilitating a more detailed and comprehensive representation of crucial areas within the mesh model. Results: Experiments on popular datasets validated the effectiveness of FDCPNet, showing an improvement in classification accuracy over the baseline MeshNet. Furthermore, even with reduced mesh face numbers and limited training data, FDCPNet achieved promising results, demonstrating its robustness in scenarios of variable complexity.
Funding: Funded by the National Natural Science Foundation of China (Nos. 41672082, 42030809).
Abstract: Mineral identification and discrimination play a significant role in geological study. Intelligent mineral discrimination based on deep learning has the advantages of automation, low cost, low time consumption, and a low error rate. In this article, the characteristics of quartz, biotite, and K-feldspar from granite thin sections under cross-polarized light were studied for intelligent classification of mineral images using the Inception-v3 deep learning convolutional neural network (CNN) and transfer learning. Dynamic images from multiple angles were employed to enhance the accuracy and reproducibility of mineral discrimination. Test results show that the average discrimination accuracies for quartz, biotite, and K-feldspar are 100.00%, 96.88%, and 90.63%, respectively. The results of this study demonstrate the feasibility and reliability of applying convolutional neural networks to mineral image classification. This study could have a significant impact on the exploration of intelligent discrimination of complicated minerals using deep learning methods, and it provides a new perspective for the development of more professional and practical intelligent mineral discrimination tools.