Neural networks, especially convolutional neural networks (CNNs), have recently proven effective in natural language processing, where sentiment classification of online reviews is an important and challenging task. Existing CNNs extract the important features of a sentence but ignore local features and the order of the features, so they perform poorly, especially on transition sentences. To address this, we propose a Piecewise Pooling Convolutional Neural Network (PPCNN) for sentiment classification. First, with the sentence represented by word vectors, a convolution operation produces the convolution feature map vectors. Second, these vectors are segmented according to the positions of transition words in the sentence. Third, max pooling extracts the most significant feature of each local segment, so that different aspects of the features are captured while their relative order is preserved. Finally, after dropout is applied, a softmax classifier is trained for sentiment classification. Experimental results show that PPCNN is effective and outperforms the baseline methods, especially on datasets containing transition sentences.
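The segment-then-pool step can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `piecewise_max_pool`, the `cut_points` argument (standing in for the positions of transition words), and the toy feature map are all assumptions made for illustration.

```python
from typing import List, Sequence


def piecewise_max_pool(feature_map: Sequence[float],
                       cut_points: Sequence[int]) -> List[float]:
    """Split one convolution feature map at cut_points and max-pool each piece.

    cut_points are hypothetical indices where a new segment starts
    (for example, the position of a transition word such as "but").
    The pooled values keep the left-to-right order of the segments.
    """
    if not feature_map:
        return []
    n = len(feature_map)
    # Keep only cuts that fall strictly inside the map, without duplicates.
    inner = sorted({p for p in cut_points if 0 < p < n})
    bounds = [0] + inner + [n]
    return [max(feature_map[a:b]) for a, b in zip(bounds, bounds[1:])]


feature_map = [0.1, 0.9, 0.3, 0.2, 0.4, 0.8]
print(piecewise_max_pool(feature_map, cut_points=[3]))  # [0.9, 0.8]
print(piecewise_max_pool(feature_map, cut_points=[]))   # [0.9]
```

With no cut points the function reduces to ordinary global max pooling, which returns a single value and so loses the information that the two halves of the sentence each had their own strongest feature.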
Phase classification provides clear guidance for the design of high-entropy alloys. For both mutually exclusive and non-mutually exclusive classification, four kinds of descriptors were collected: composition descriptors, commonly used physical-parameter descriptors, elemental-property descriptors, and descriptors extracted by a convolutional neural network from the periodic table representation (PTR). Appropriate selection among these information-rich features helps phase classification. With a random forest, the accuracy of the four-label classification and the balanced accuracy of the five-label classification were improved to 0.907 and 0.876, respectively. Interpretability analysis summarized the roles of four important features and identified a new important feature. The extrapolation ability of the model and the influence of Mo were demonstrated by phase prediction in (CoFeNiMn)_(1-x)Mo_(x). Phase information also helps hardness prediction: coupling the classification results with the PTR of the hardness data reduced the prediction error (root mean square error) to 56.69.
This article proposes a VGG network with histogram of oriented gradients (HOG) feature fusion (HOG-VGG) for terrain classification of polarimetric synthetic aperture radar (PolSAR) images. VGG-Net has a strong capacity for deep feature extraction and can fully extract the global deep features of different terrains in PolSAR images, so it is widely used in PolSAR terrain classification. However, VGG-Net ignores local edge and shape features, which leaves the feature representation of PolSAR terrains incomplete and limits classification accuracy, even though edge and shape features play an important role in this task. To solve this problem, HOG-VGG extracts both the global deep semantic features and the local edge and shape features of PolSAR terrains, which makes the terrain feature representation far more complete. HOG-VGG then fuses the global deep features with the local edge and shape features to achieve the best classification results. The superiority of HOG-VGG is verified on the Flevoland, San Francisco, and Oberpfaffenhofen datasets, where it achieves overall accuracies of 97.54%, 94.63%, and 96.07%, respectively.
To address the low classification accuracy and poor use of spatial information in traditional hyperspectral image classification methods, we propose a new method, abbreviated GNWSF–SRC, that combines Gabor spatial texture features and nonparametric weighted spectral features (Gabor–NWSF) with sparse representation classification (SRC). The method first combines the Gabor spatial features and the nonparametric weighted spectral features to describe the hyperspectral image, then applies sparse representation, and finally obtains the classification by analyzing the reconstruction error. We apply the method to two typical hyperspectral data sets with different percentages of training samples. Theoretical analysis and simulation show that the proposed method improves classification accuracy and the Kappa coefficient compared with traditional classification methods.
Few-shot learning has emerged as a crucial technique for coral species classification because it addresses the shortage of labeled data in underwater environments. This study introduces an optimized few-shot learning model that improves classification accuracy while reducing reliance on extensive data collection. The model uses a hybrid similarity measure that combines Euclidean distance and cosine similarity, capturing both feature magnitude and directional relationships. It achieves an accuracy of 71.8% under a 5-way 5-shot evaluation, outperforming state-of-the-art models such as Prototypical Networks, FEAT, and ESPT by up to 10%. The model is notably precise in classifying Siderastreidae (87.52%) and Fungiidae (88.95%), which underscores its ability to distinguish subtle morphological differences. To further improve performance, we incorporate a self-supervised mechanism based on contrastive learning, which lets the model extract robust representations from local structural patterns in corals. This improves classification accuracy, particularly for species with high intra-class variation, and yields an overall accuracy of 76.52% under a 5-way 10-shot evaluation. The model also exploits the repetitive structures inherent in corals through a local feature aggregation strategy that refines classification by integrating spatial information. Beyond its technical contributions, this study presents a scalable and efficient approach to automated coral reef monitoring that reduces annotation costs while maintaining high classification accuracy. By improving few-shot learning performance in underwater environments, our model improves monitoring accuracy by up to 15% compared with traditional methods, offering a practical solution for large-scale coral conservation efforts.
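The abstract does not say how the Euclidean and cosine terms are combined, so the following is only a sketch under an assumed form: a weighted sum with a hypothetical weight `alpha`, applied to class prototypes in the style of a prototype-based few-shot classifier. The class names and vectors are made up for illustration.

```python
import math
from typing import Dict, Sequence


def euclidean(a: Sequence[float], b: Sequence[float]) -> float:
    """Straight-line distance; sensitive to feature magnitude."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity; sensitive only to direction. Returns 0.0 for a zero vector."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def hybrid_score(query: Sequence[float], prototype: Sequence[float],
                 alpha: float = 0.5) -> float:
    """Assumed combination: higher is better.

    alpha weights the cosine term; (1 - alpha) weights the negated distance.
    """
    return alpha * cosine(query, prototype) - (1 - alpha) * euclidean(query, prototype)


def classify(query: Sequence[float], prototypes: Dict[str, Sequence[float]],
             alpha: float = 0.5) -> str:
    """Return the name of the prototype with the highest hybrid score."""
    return max(prototypes, key=lambda name: hybrid_score(query, prototypes[name], alpha))


# Hypothetical two-class example with 2-D embeddings.
prototypes = {"class_a": [1.0, 0.0], "class_b": [0.0, 1.0]}
print(classify([0.9, 0.2], prototypes))  # class_a
```

Setting `alpha` to 0 or 1 recovers a purely distance-based or purely cosine-based classifier, which makes it easy to check how much each term contributes on a given dataset.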
Big data analytics is one of the fastest-growing research areas in the information technology industry. Big data is generated by social websites such as Facebook, WhatsApp, and Twitter, where opinions about products, people, initiatives, political issues, research achievements, and entertainment are discussed. A single data analytics method cannot be applied across social websites because their data formats differ. Several approaches, techniques, and tools have been used for big data analytics, opinion mining, and sentiment analysis, but their accuracy still needs improvement. This work performs sentiment analysis on Twitter data about clothing products using Simulated Annealing combined with a Multiclass Support Vector Machine (SA-MSVM). SA-MSVM is a hybrid heuristic approach that selects and classifies text-based sentiment words after Natural Language Processing (NLP) is applied to tweets extracted from the Twitter dataset. The simulated annealing algorithm searches for relevant features and identifies the sentiment terms that customers use in their criticism. SA-MSVM was implemented and tested in MATLAB, and the results were verified. The results indicate that SA-MSVM has more potential for sentiment analysis and classification than the existing Support Vector Machine (SVM) approach. In the comparison with existing systems, SA-MSVM achieved 96.34% accuracy in classifying product reviews.
Symbolic Aggregate approXimation (SAX) is an efficient symbolic representation method that is widely used in time series data mining. Its major limitation is that it derives the symbols exclusively from the mean values of the time series segments, so many important features of a time series, such as extreme values, trend, and fluctuation, are not considered. To solve this issue, we propose an improved Symbolic Aggregate approXimation based on multiple features and Vector Frequency Difference (SAX_VFD). SAX_VFD discriminates between time series by adopting an adaptive feature selection method. Furthermore, SAX_VFD is equipped with a new distance that takes into account the vector frequency difference between symbolic sequences. We demonstrate the utility of SAX_VFD on the time series classification task. The experimental results show that the proposed method performs better in terms of accuracy and dimensionality reduction than previously published SAX-based reduction techniques.
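The mean-only limitation is easy to see in a small sketch of classic SAX (not of SAX_VFD, whose details the abstract does not give). The function name `sax`, the four-letter alphabet, and the two toy series are assumptions; the sketch also assumes the series length is divisible by the number of segments.

```python
from statistics import NormalDist, mean, pstdev
from typing import Sequence


def sax(series: Sequence[float], n_segments: int, alphabet: str = "abcd") -> str:
    """Classic SAX: z-normalize, average each segment (PAA), map means to symbols.

    Assumes len(series) is divisible by n_segments.
    """
    mu, sigma = mean(series), pstdev(series)
    z = [(x - mu) / sigma if sigma else 0.0 for x in series]
    size = len(z) // n_segments
    # Piecewise Aggregate Approximation: one mean per segment.
    paa = [mean(z[i * size:(i + 1) * size]) for i in range(n_segments)]
    # Breakpoints split the standard normal into equiprobable regions.
    k = len(alphabet)
    breakpoints = [NormalDist().inv_cdf(i / k) for i in range(1, k)]
    return "".join(alphabet[sum(v > b for b in breakpoints)] for v in paa)


flat = [0.0, 0.0, 0.0, 0.0, 10.0, 10.0, 10.0, 10.0]
jagged = [-5.0, 5.0, -5.0, 5.0, 5.0, 15.0, 5.0, 15.0]
print(sax(flat, n_segments=2))    # ad
print(sax(jagged, n_segments=2))  # ad
```

Both series map to the same word even though the second one oscillates strongly inside each segment, which is the kind of information (extreme values, fluctuation) that a mean-only representation discards.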
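To address the problem that the projection matrix learned by latent low-rank representation can neither explain the importance of the extracted features nor preserve the local geometric structure of the data, a latent low-rank and sparse projection algorithm based on dual neighborhoods and feature selection (LLRSP: Latent Low-Rank And Sparse Projection) is proposed. The algorithm first combines a low-rank constraint with orthogonal reconstruction to retain the main energy of the data, and then imposes a row-sparsity constraint on the projection matrix to perform feature selection, making the features more compact and interpretable. In addition, the l_(2,1) norm is introduced to regularize the error component, making the model more robust to noise. Finally, neighborhood-preserving regularization is applied to the low-dimensional data and to the low-rank representation coefficient matrix to retain the local geometric structure of the data. Extensive experiments on public datasets show that the proposed method performs better than other state-of-the-art algorithms.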
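To address the limitations of existing sentiment classification models in understanding deep sentiment, the unidirectional constraint of traditional attention mechanisms, and class imbalance in natural language processing (NLP), a sentiment classification model that fuses multi-scale BERT (Bidirectional Encoder Representations from Transformers) features with a bidirectional cross-attention mechanism, M-BCA (Multi-scale BERT features with Bidirectional Cross Attention), is proposed. First, multi-scale features are extracted from the lower, middle, and upper layers of BERT to capture the surface, syntactic, and deep semantic information of the sentence text. Second, a three-channel gated recurrent unit (GRU) further extracts deep semantic features, strengthening the model's understanding of the text. Finally, to promote interaction and learning among features of different scales, a bidirectional cross-attention mechanism is introduced to strengthen the interplay among the multi-scale features. In addition, for the data imbalance problem, a data augmentation strategy is designed and a hybrid loss function is used to optimize the model's learning of minority-class samples. Experimental results show that M-BCA performs well on fine-grained sentiment classification. On multi-class sentiment datasets with imbalanced distributions, its performance is significantly better than that of most baseline models. M-BCA is also strong on minority-class samples: on the NLPCC 2014 and Online_Shopping_10_Cats datasets in particular, its minority-class Macro-Recall leads all compared models. The model therefore achieves a significant performance gain on fine-grained sentiment classification and is suitable for imbalanced datasets.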
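To address insufficient interaction between aspect terms and context and low classification accuracy in existing aspect-level sentiment classification models, an aspect-level sentiment classification method based on multi-interaction feature fusion (ASMFF: Aspect-level Sentiment classification method based on Multi-interaction Feature Fusion) is proposed. First, the context and the aspect terms are each marked with special tokens and fed into a BERT (Bidirectional Encoder Representations from Transformers) encoding layer to extract text feature vectors. Second, the text feature vectors are fed into AOA (Attention Over Attention) and IAN (Interactive Attention Networks) networks to extract interactive attention feature vectors. Finally, the two kinds of interactive feature vectors are fused and learned jointly, with a cross-entropy loss function used for probability computation, loss backpropagation, and parameter updates. Experiments on three public datasets, Laptop, Restaurant, and Twitter, show that ASMFF achieves classification accuracies of 80.25%, 84.38%, and 75.29%, respectively, a significant improvement over the baseline models.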
Funding: This paper is supported by the National Natural Science Foundation of China (No. 61074078) and the Fundamental Research Funds for the Central Universities (No. 12MS121).
Funding: This work is supported in part by the Natural Science Foundation of China under grants 61503112, 61673152, and 61503116.
Funding: Supported by the National Natural Science Foundation of China (Nos. 51671075, 51971086) and the Natural Science Foundation of Heilongjiang Province, China (No. LH2022E081).
Funding: Sponsored by the Fundamental Research Funds for the Central Universities of China (Grant No. PA2023IISL0098), the Hefei Municipal Natural Science Foundation (Grant No. 202201), the National Natural Science Foundation of China (Grant No. 62071164), and the Open Fund of Information Materials and Intelligent Sensing Laboratory of Anhui Province (Anhui University) (Grant Nos. IMIS202214 and IMIS202102).
Funding: Supported by the National Natural Science Foundation of China (No. 61275010), the Ph.D. Programs Foundation of Ministry of Education of China (No. 20132304110007), the Heilongjiang Natural Science Foundation (No. F201409), and the Fundamental Research Funds for the Central Universities (No. HEUCFD1410).
Funding: Funded by the National Science and Technology Council (NSTC), Taiwan, under grant numbers NSTC 112-2634-F-019-001 and NSTC 113-2634-F-A49-007.