期刊文献+
共找到408篇文章
< 1 2 21 >
每页显示 20 50 100
Joint Feature Encoding and Task Alignment Mechanism for Emotion-Cause Pair Extraction
1
作者 Shi Li Didi Sun 《Computers, Materials & Continua》 SCIE EI 2025年第1期1069-1086,共18页
With the rapid expansion of social media,analyzing emotions and their causes in texts has gained significant importance.Emotion-cause pair extraction enables the identification of causal relationships between emotions... With the rapid expansion of social media,analyzing emotions and their causes in texts has gained significant importance.Emotion-cause pair extraction enables the identification of causal relationships between emotions and their triggers within a text,facilitating a deeper understanding of expressed sentiments and their underlying reasons.This comprehension is crucial for making informed strategic decisions in various business and societal contexts.However,recent research approaches employing multi-task learning frameworks for modeling often face challenges such as the inability to simultaneouslymodel extracted features and their interactions,or inconsistencies in label prediction between emotion-cause pair extraction and independent assistant tasks like emotion and cause extraction.To address these issues,this study proposes an emotion-cause pair extraction methodology that incorporates joint feature encoding and task alignment mechanisms.The model consists of two primary components:First,joint feature encoding simultaneously generates features for emotion-cause pairs and clauses,enhancing feature interactions between emotion clauses,cause clauses,and emotion-cause pairs.Second,the task alignment technique is applied to reduce the labeling distance between emotion-cause pair extraction and the two assistant tasks,capturing deep semantic information interactions among tasks.The proposed method is evaluated on a Chinese benchmark corpus using 10-fold cross-validation,assessing key performance metrics such as precision,recall,and F1 score.Experimental results demonstrate that the model achieves an F1 score of 76.05%,surpassing the state-of-the-art by 1.03%.The proposed model exhibits significant improvements in emotion-cause pair extraction(ECPE)and cause extraction(CE)compared to existing methods,validating its effectiveness.This research introduces a novel approach based on joint feature encoding and task alignment mechanisms,contributing to advancements in emotion-cause pair extraction.However,the study’s limitation lies in the data sources,potentially restricting the generalizability of the findings. 展开更多
关键词 Emotion-cause pair extraction interactive information enhancement joint feature encoding label consistency task alignment mechanisms
在线阅读 下载PDF
Self-FAGCFN:Graph-Convolution Fusion Network Based on Feature Fusion and Self-Supervised Feature Alignment for Pneumonia and Tuberculosis Diagnosis
2
作者 Junding Sun Wenhao Tang +5 位作者 Lei Zhao Chaosheng Tang Xiaosheng Wu Zhaozhao Xu Bin Pu Yudong Zhang 《Journal of Bionic Engineering》 2025年第4期2012-2029,共18页
Feature fusion is an important technique in medical image classification that can improve diagnostic accuracy by integrating complementary information from multiple sources.Recently,Deep Learning(DL)has been widely us... Feature fusion is an important technique in medical image classification that can improve diagnostic accuracy by integrating complementary information from multiple sources.Recently,Deep Learning(DL)has been widely used in pulmonary disease diagnosis,such as pneumonia and tuberculosis.However,traditional feature fusion methods often suffer from feature disparity,information loss,redundancy,and increased complexity,hindering the further extension of DL algorithms.To solve this problem,we propose a Graph-Convolution Fusion Network with Self-Supervised Feature Alignment(Self-FAGCFN)to address the limitations of traditional feature fusion methods in deep learning-based medical image classification for respiratory diseases such as pneumonia and tuberculosis.The network integrates Convolutional Neural Networks(CNNs)for robust feature extraction from two-dimensional grid structures and Graph Convolutional Networks(GCNs)within a Graph Neural Network branch to capture features based on graph structure,focusing on significant node representations.Additionally,an Attention-Embedding Ensemble Block is included to capture critical features from GCN outputs.To ensure effective feature alignment between pre-and post-fusion stages,we introduce a feature alignment loss that minimizes disparities.Moreover,to address the limitations of proposed methods,such as inappropriate centroid discrepancies during feature alignment and class imbalance in the dataset,we develop a Feature-Centroid Fusion(FCF)strategy and a Multi-Level Feature-Centroid Update(MLFCU)algorithm,respectively.Extensive experiments on public datasets LungVision and Chest-Xray demonstrate that the Self-FAGCFN model significantly outperforms existing methods in diagnosing pneumonia and tuberculosis,highlighting its potential for practical medical applications. 展开更多
关键词 feature fusion Self-supervised feature alignment Convolutional neural networks Graph convolutional networks Class imbalance feature-centroid fusion
在线阅读 下载PDF
Multi-Layer Feature Extraction with Deformable Convolution for Fabric Defect Detection 被引量:1
3
作者 Jielin Jiang Chao Cui +1 位作者 Xiaolong Xu Yan Cui 《Intelligent Automation & Soft Computing》 2024年第4期725-744,共20页
In the textile industry,the presence of defects on the surface of fabric is an essential factor in determining fabric quality.Therefore,identifying fabric defects forms a crucial part of the fabric production process.... In the textile industry,the presence of defects on the surface of fabric is an essential factor in determining fabric quality.Therefore,identifying fabric defects forms a crucial part of the fabric production process.Traditional fabric defect detection algorithms can only detect specific materials and specific fabric defect types;in addition,their detection efficiency is low,and their detection results are relatively poor.Deep learning-based methods have many advantages in the field of fabric defect detection,however,such methods are less effective in identifying multiscale fabric defects and defects with complex shapes.Therefore,we propose an effective algorithm,namely multilayer feature extraction combined with deformable convolution(MFDC),for fabric defect detection.In MFDC,multi-layer feature extraction is used to fuse the underlying location features with high-level classification features through a horizontally connected top-down architecture to improve the detection of multi-scale fabric defects.On this basis,a deformable convolution is added to solve the problem of the algorithm’s weak detection ability of irregularly shaped fabric defects.In this approach,Roi Align and Cascade-RCNN are integrated to enhance the adaptability of the algorithm in materials with complex patterned backgrounds.The experimental results show that the MFDC algorithm can achieve good detection results for both multi-scale fabric defects and defects with complex shapes,at the expense of a small increase in detection time. 展开更多
关键词 Fabric defect detection multi-layer features deformable convolution
在线阅读 下载PDF
Digital modulation classification using multi-layer perceptron and time-frequency features
4
作者 Yuan Ye Mei Wenbo 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2007年第2期249-254,共6页
Considering that real communication signals corrupted by noise are generally nonstationary, and timefrequency distributions are especially suitable for the analysis of nonstationary signals, time-frequency distributio... Considering that real communication signals corrupted by noise are generally nonstationary, and timefrequency distributions are especially suitable for the analysis of nonstationary signals, time-frequency distributions are introduced for the modulation classification of communication signals: The extracted time-frequency features have good classification information, and they are insensitive to signal to noise ratio (SNR) variation. According to good classification by the correct rate of a neural network classifier, a multilayer perceptron (MLP) classifier with better generalization, as well as, addition of time-frequency features set for classifying six different modulation types has been proposed. Computer simulations show that the MLP classifier outperforms the decision-theoretic classifier at low SNRs, and the classification experiments for real MPSK signals verify engineering significance of the MLP classifier. 展开更多
关键词 Digital modulation classification Time-frequency feature Time-frequency distribution multi-layer perceptron.
在线阅读 下载PDF
Application of response surface method for optimal transfer conditions of multi-layer ceramic capacitor alignment system
5
作者 PARK Su-seong KIM Jae-min +1 位作者 CHUNG Won-jee SHIN O-chul 《Journal of Central South University》 SCIE EI CAS 2011年第3期726-730,共5页
The multi-layer ceramic capacitor (MLCC) alignment system aims at the inter-process automation between the first and the second plastic processes.As a result of testing performance verification of MLCC alignment syste... The multi-layer ceramic capacitor (MLCC) alignment system aims at the inter-process automation between the first and the second plastic processes.As a result of testing performance verification of MLCC alignment system,the average alignment rates are 95% for 3216 chip,88.5% for 2012 chip and 90.8% for 3818 chip.The MLCC alignment system can be accepted for practical use because the average manual alignment is just 80%.In other words,the developed MLCC alignment system has been upgraded to a great extent,compared with manual alignment.Based on the successfully developed MLCC alignment system,the optimal transfer conditions have been explored by using RSM.The simulations using ADAMS has been performed according to the cube model of CCD.By using MiniTAB,the model of response surface has been established based on the simulation results.The optimal conditions resulted from the response optimization tool of MiniTAB has been verified by being assigned to the prototype of MLCC alignment system. 展开更多
关键词 multi-layer ceramic capacitor (MLCC) alignment system response surface method (RSM) MiniTAB ADAMS
在线阅读 下载PDF
Hierarchical Optimization Method for Federated Learning with Feature Alignment and Decision Fusion
6
作者 Ke Li Xiaofeng Wang Hu Wang 《Computers, Materials & Continua》 SCIE EI 2024年第10期1391-1407,共17页
In the realm of data privacy protection,federated learning aims to collaboratively train a global model.However,heterogeneous data between clients presents challenges,often resulting in slow convergence and inadequate... In the realm of data privacy protection,federated learning aims to collaboratively train a global model.However,heterogeneous data between clients presents challenges,often resulting in slow convergence and inadequate accuracy of the global model.Utilizing shared feature representations alongside customized classifiers for individual clients emerges as a promising personalized solution.Nonetheless,previous research has frequently neglected the integration of global knowledge into local representation learning and the synergy between global and local classifiers,thereby limiting model performance.To tackle these issues,this study proposes a hierarchical optimization method for federated learning with feature alignment and the fusion of classification decisions(FedFCD).FedFCD regularizes the relationship between global and local feature representations to achieve alignment and incorporates decision information from the global classifier,facilitating the late fusion of decision outputs from both global and local classifiers.Additionally,FedFCD employs a hierarchical optimization strategy to flexibly optimize model parameters.Through experiments on the Fashion-MNIST,CIFAR-10 and CIFAR-100 datasets,we demonstrate the effectiveness and superiority of FedFCD.For instance,on the CIFAR-100 dataset,FedFCD exhibited a significant improvement in average test accuracy by 6.83%compared to four outstanding personalized federated learning approaches.Furthermore,extended experiments confirm the robustness of FedFCD across various hyperparameter values. 展开更多
关键词 Federated learning data heterogeneity feature alignment decision fusion hierarchical optimization
在线阅读 下载PDF
Feature pyramid attention network for audio-visual scene classification 被引量:1
7
作者 Liguang Zhou Yuhongze Zhou +3 位作者 Xiaonan Qi Junjie Hu Tin Lun Lam Yangsheng Xu 《CAAI Transactions on Intelligence Technology》 2025年第2期359-374,共16页
Audio-visual scene classification(AVSC)poses a formidable challenge owing to the intricate spatial-temporal relationships exhibited by audio-visual signals,coupled with the complex spatial patterns of objects and text... Audio-visual scene classification(AVSC)poses a formidable challenge owing to the intricate spatial-temporal relationships exhibited by audio-visual signals,coupled with the complex spatial patterns of objects and textures found in visual images.The focus of recent studies has predominantly revolved around extracting features from diverse neural network structures,inadvertently neglecting the acquisition of semantically meaningful regions and crucial components within audio-visual data.The authors present a feature pyramid attention network(FPANet)for audio-visual scene understanding,which extracts semantically significant characteristics from audio-visual data.The authors’approach builds multi-scale hierarchical features of sound spectrograms and visual images using a feature pyramid representation and localises the semantically relevant regions with a feature pyramid attention module(FPAM).A dimension alignment(DA)strategy is employed to align feature maps from multiple layers,a pyramid spatial attention(PSA)to spatially locate essential regions,and a pyramid channel attention(PCA)to pinpoint significant temporal frames.Experiments on visual scene classification(VSC),audio scene classification(ASC),and AVSC tasks demonstrate that FPANet achieves performance on par with state-of-the-art(SOTA)approaches,with a 95.9 F1-score on the ADVANCE dataset and a relative improvement of 28.8%.Visualisation results show that FPANet can prioritise semantically meaningful areas in audio-visual signals. 展开更多
关键词 dimension alignment feature pyramid attention network pyramid channel attention pyramid spatial attention semantic relevant regions
在线阅读 下载PDF
Multi-Modal Pre-Synergistic Fusion Entity Alignment Based on Mutual Information Strategy Optimization
8
作者 Huayu Li Xinxin Chen +3 位作者 Lizhuang Tan Konstantin I.Kostromitin Athanasios V.Vasilakos Peiying Zhang 《Computers, Materials & Continua》 2025年第11期4133-4153,共21页
To address the challenge of missing modal information in entity alignment and to mitigate information loss or bias arising frommodal heterogeneity during fusion,while also capturing shared information acrossmodalities... To address the challenge of missing modal information in entity alignment and to mitigate information loss or bias arising frommodal heterogeneity during fusion,while also capturing shared information acrossmodalities,this paper proposes a Multi-modal Pre-synergistic Entity Alignmentmodel based on Cross-modalMutual Information Strategy Optimization(MPSEA).The model first employs independent encoders to process multi-modal features,including text,images,and numerical values.Next,a multi-modal pre-synergistic fusion mechanism integrates graph structural and visual modal features into the textual modality as preparatory information.This pre-fusion strategy enables unified perception of heterogeneous modalities at the model’s initial stage,reducing discrepancies during the fusion process.Finally,using cross-modal deep perception reinforcement learning,the model achieves adaptive multilevel feature fusion between modalities,supporting learningmore effective alignment strategies.Extensive experiments on multiple public datasets show that the MPSEA method achieves gains of up to 7% in Hits@1 and 8.2% in MRR on the FBDB15K dataset,and up to 9.1% in Hits@1 and 7.7% in MRR on the FBYG15K dataset,compared to existing state-of-the-art methods.These results confirm the effectiveness of the proposed model. 展开更多
关键词 Knowledge graph MULTI-MODAL entity alignment feature fusion pre-synergistic fusion
在线阅读 下载PDF
A Dual Stream Multimodal Alignment and Fusion Network for Classifying Short Videos
9
作者 ZHOU Ming WANG Tong 《Journal of Donghua University(English Edition)》 2025年第1期88-95,共8页
Video classification is an important task in video understanding and plays a pivotal role in intelligent monitoring of information content.Most existing methods do not consider the multimodal nature of the video,and t... Video classification is an important task in video understanding and plays a pivotal role in intelligent monitoring of information content.Most existing methods do not consider the multimodal nature of the video,and the modality fusion approach tends to be too simple,often neglecting modality alignment before fusion.This research introduces a novel dual stream multimodal alignment and fusion network named DMAFNet for classifying short videos.The network uses two unimodal encoder modules to extract features within modalities and exploits a multimodal encoder module to learn interaction between modalities.To solve the modality alignment problem,contrastive learning is introduced between two unimodal encoder modules.Additionally,masked language modeling(MLM)and video text matching(VTM)auxiliary tasks are introduced to improve the interaction between video frames and text modalities through backpropagation of loss functions.Diverse experiments prove the efficiency of DMAFNet in multimodal video classification tasks.Compared with other two mainstream baselines,DMAFNet achieves the best results on the 2022 WeChat Big Data Challenge dataset. 展开更多
关键词 video classification multimodal fusion feature alignment
在线阅读 下载PDF
Advancing Sports Image Classification and Analysis:Effective Data Augmentation and Feature Alignment Strategies
10
作者 Ping Liu Chao Zhao +2 位作者 Bin Zang Sifeng Wang Shigen Shen 《Tsinghua Science and Technology》 2026年第1期577-589,共13页
Sport plays a crucial role in society,influencing physical health,entertainment,and community engagement.As artificial intelligence advances,the ability to classify sport images accurately becomes increasingly crucial... Sport plays a crucial role in society,influencing physical health,entertainment,and community engagement.As artificial intelligence advances,the ability to classify sport images accurately becomes increasingly crucial.Effective sport image classification enhances applications,such as performance analysis,athlete tracking,and fan engagement.Despite its significance,current methods face challenges due to limited labeled datasets and issues with feature misalignment.This paper introduces a novel Contrastive Language-Image Pre-training(CLIP)based framework specifically designed for sport image classification.By incorporating data augmentation techniques,the approach addresses data sparsity and enriches the diversity of image-text pairings,reducing the need for extensive manual annotation.Additionally,feature alignment strategies tackle text-image misalignment issues that affect classification accuracy.This approach fills a significant research gap and offers practical solutions to improve classification performance in sport image analysis.The results of extensive experiments validate the effectiveness of the framework,demonstrating its potential to advance sports analytics and contribute to more precise and scalable solutions in sport image classification. 展开更多
关键词 sport image classification sports analytics data augmentation feature alignment
原文传递
Feature Extraction of Kernel Regress Reconstruction for Fault Diagnosis Based on Self-organizing Manifold Learning 被引量:3
11
作者 CHEN Xiaoguang LIANG Lin +1 位作者 XU Guanghua LIU Dan 《Chinese Journal of Mechanical Engineering》 SCIE EI CAS CSCD 2013年第5期1041-1049,共9页
The feature space extracted from vibration signals with various faults is often nonlinear and of high dimension.Currently,nonlinear dimensionality reduction methods are available for extracting low-dimensional embeddi... The feature space extracted from vibration signals with various faults is often nonlinear and of high dimension.Currently,nonlinear dimensionality reduction methods are available for extracting low-dimensional embeddings,such as manifold learning.However,these methods are all based on manual intervention,which have some shortages in stability,and suppressing the disturbance noise.To extract features automatically,a manifold learning method with self-organization mapping is introduced for the first time.Under the non-uniform sample distribution reconstructed by the phase space,the expectation maximization(EM) iteration algorithm is used to divide the local neighborhoods adaptively without manual intervention.After that,the local tangent space alignment(LTSA) algorithm is adopted to compress the high-dimensional phase space into a more truthful low-dimensional representation.Finally,the signal is reconstructed by the kernel regression.Several typical states include the Lorenz system,engine fault with piston pin defect,and bearing fault with outer-race defect are analyzed.Compared with the LTSA and continuous wavelet transform,the results show that the background noise can be fully restrained and the entire periodic repetition of impact components is well separated and identified.A new way to automatically and precisely extract the impulsive components from mechanical signals is proposed. 展开更多
关键词 feature extraction manifold learning self-organize mapping kernel regression local tangent space alignment
在线阅读 下载PDF
Class conditional distribution alignment for domain adaptation 被引量:2
12
作者 Kai CAO Zhipeng TU Yang MING 《Control Theory and Technology》 EI CSCD 2020年第1期72-80,共9页
In this paper,we study the problem of domain adaptation,which is a crucial ingredient in transfer learning with two domains,that is,the source domain with labeled data and the target domain with none or few labels.Dom... In this paper,we study the problem of domain adaptation,which is a crucial ingredient in transfer learning with two domains,that is,the source domain with labeled data and the target domain with none or few labels.Domain adaptation aims to extract knowledge from the source domain to improve the performance of the learning task in the target domain.A popular approach to handle this problem is via adversarial training,which is explained by the H△H-distance theory.However,traditional adversarial network architectures just align the marginal feature distribution in the feature space.The alignment of class condition distribution is not guaranteed.Therefore,we proposed a novel method based on pseudo labels and the cluster assumption to avoid the incorrect class alignment in the feature space.The experiments demonstrate that our framework improves the accuracy on typical transfer learning tasks. 展开更多
关键词 DOMAIN ADAPTATION distribution alignment feature CLUSTER
原文传递
Dynamic Multi-Layer Perceptron for Fetal Health Classification Using Cardiotocography Data
13
作者 Uddagiri Sirisha Parvathaneni Naga Srinivasu +4 位作者 Panguluri Padmavathi Seongki Kim Aruna Pavate Jana Shafi Muhammad Fazal Ijaz 《Computers, Materials & Continua》 SCIE EI 2024年第8期2301-2330,共30页
Fetal health care is vital in ensuring the health of pregnant women and the fetus.Regular check-ups need to be taken by the mother to determine the status of the fetus’growth and identify any potential problems.To kn... Fetal health care is vital in ensuring the health of pregnant women and the fetus.Regular check-ups need to be taken by the mother to determine the status of the fetus’growth and identify any potential problems.To know the status of the fetus,doctors monitor blood reports,Ultrasounds,cardiotocography(CTG)data,etc.Still,in this research,we have considered CTG data,which provides information on heart rate and uterine contractions during pregnancy.Several researchers have proposed various methods for classifying the status of fetus growth.Manual processing of CTG data is time-consuming and unreliable.So,automated tools should be used to classify fetal health.This study proposes a novel neural network-based architecture,the Dynamic Multi-Layer Perceptron model,evaluated from a single layer to several layers to classify fetal health.Various strategies were applied,including pre-processing data using techniques like Balancing,Scaling,Normalization hyperparameter tuning,batch normalization,early stopping,etc.,to enhance the model’s performance.A comparative analysis of the proposed method is done against the traditional machine learning models to showcase its accuracy(97%).An ablation study without any pre-processing techniques is also illustrated.This study easily provides valuable interpretations for healthcare professionals in the decision-making process. 展开更多
关键词 Fetal health cardiotocography data deep learning dynamic multi-layer perceptron feature engineering
在线阅读 下载PDF
A Power Data Anomaly Detection Model Based on Deep Learning with Adaptive Feature Fusion
14
作者 Xiu Liu Liang Gu +3 位作者 Xin Gong Long An Xurui Gao Juying Wu 《Computers, Materials & Continua》 SCIE EI 2024年第6期4045-4061,共17页
With the popularisation of intelligent power,power devices have different shapes,numbers and specifications.This means that the power data has distributional variability,the model learning process cannot achieve suffi... With the popularisation of intelligent power,power devices have different shapes,numbers and specifications.This means that the power data has distributional variability,the model learning process cannot achieve sufficient extraction of data features,which seriously affects the accuracy and performance of anomaly detection.Therefore,this paper proposes a deep learning-based anomaly detection model for power data,which integrates a data alignment enhancement technique based on random sampling and an adaptive feature fusion method leveraging dimension reduction.Aiming at the distribution variability of power data,this paper developed a sliding window-based data adjustment method for this model,which solves the problem of high-dimensional feature noise and low-dimensional missing data.To address the problem of insufficient feature fusion,an adaptive feature fusion method based on feature dimension reduction and dictionary learning is proposed to improve the anomaly data detection accuracy of the model.In order to verify the effectiveness of the proposed method,we conducted effectiveness comparisons through elimination experiments.The experimental results show that compared with the traditional anomaly detection methods,the method proposed in this paper not only has an advantage in model accuracy,but also reduces the amount of parameter calculation of the model in the process of feature matching and improves the detection speed. 展开更多
关键词 Data alignment dimension reduction feature fusion data anomaly detection deep learning
在线阅读 下载PDF
Impact of Portable Executable Header Features on Malware Detection Accuracy
15
作者 Hasan H.Al-Khshali Muhammad Ilyas 《Computers, Materials & Continua》 SCIE EI 2023年第1期153-178,共26页
One aspect of cybersecurity,incorporates the study of Portable Executables(PE)files maleficence.Artificial Intelligence(AI)can be employed in such studies,since AI has the ability to discriminate benign from malicious... One aspect of cybersecurity,incorporates the study of Portable Executables(PE)files maleficence.Artificial Intelligence(AI)can be employed in such studies,since AI has the ability to discriminate benign from malicious files.In this study,an exclusive set of 29 features was collected from trusted implementations,this set was used as a baseline to analyze the presented work in this research.A Decision Tree(DT)and Neural Network Multi-Layer Perceptron(NN-MLPC)algorithms were utilized during this work.Both algorithms were chosen after testing a few diverse procedures.This work implements a method of subgrouping features to answer questions such as,which feature has a positive impact on accuracy when added?Is it possible to determine a reliable feature set to distinguish a malicious PE file from a benign one?when combining features,would it have any effect on malware detection accuracy in a PE file?Results obtained using the proposed method were improved and carried few observations.Generally,the obtained results had practical and numerical parts,for the practical part,the number of features and which features included are the main factors impacting the calculated accuracy,also,the combination of features is as crucial in these calculations.Numerical results included,finding accuracies with enhanced values,for example,NN_MLPC attained 0.979 and 0.98;for DT an accuracy of 0.9825 and 0.986 was attained. 展开更多
关键词 AI driven cybersecurity artificial intelligence CYBERSECURITY Decision Tree Neural Network multi-layer Perceptron Classifier portable executable(PE)file header features
在线阅读 下载PDF
DM-L Based Feature Extraction and Classifier Ensemble for Object Recognition
16
作者 Hamayun A. Khan 《Journal of Signal and Information Processing》 2018年第2期92-110,共19页
Deep Learning is a powerful technique that is widely applied to Image Recognition and Natural Language Processing tasks amongst many other tasks. In this work, we propose an efficient technique to utilize pre-trained ... Deep Learning is a powerful technique that is widely applied to Image Recognition and Natural Language Processing tasks amongst many other tasks. In this work, we propose an efficient technique to utilize pre-trained Convolutional Neural Network (CNN) architectures to extract powerful features from images for object recognition purposes. We have built on the existing concept of extending the learning from pre-trained CNNs to new databases through activations by proposing to consider multiple deep layers. We have exploited the progressive learning that happens at the various intermediate layers of the CNNs to construct Deep Multi-Layer (DM-L) based Feature Extraction vectors to achieve excellent object recognition performance. Two popular pre-trained CNN architecture models i.e. the VGG_16 and VGG_19 have been used in this work to extract the feature sets from 3 deep fully connected multiple layers namely “fc6”, “fc7” and “fc8” from inside the models for object recognition purposes. Using the Principal Component Analysis (PCA) technique, the Dimensionality of the DM-L feature vectors has been reduced to form powerful feature vectors that have been fed to an external Classifier Ensemble for classification instead of the Softmax based classification layers of the two original pre-trained CNN models. The proposed DM-L technique has been applied to the Benchmark Caltech-101 object recognition database. Conventional wisdom may suggest that feature extractions based on the deepest layer i.e. “fc8” compared to “fc6” will result in the best recognition performance but our results have proved it otherwise for the two considered models. Our experiments have revealed that for the two models under consideration, the “fc6” based feature vectors have achieved the best recognition performance. State-of-the-Art recognition performances of 91.17% and 91.35% have been achieved by utilizing the “fc6” based feature vectors for the VGG_16 and VGG_19 models respectively. The recognition performance has been achieved by considering 30 sample images per class whereas the proposed system is capable of achieving improved performance by considering all sample images per class. Our research shows that for feature extraction based on CNNs, multiple layers should be considered and then the best layer can be selected that maximizes the recognition performance. 展开更多
关键词 DEEP Learning Object Recognition CNN DEEP multi-layer feature Extraction Principal Component Analysis CLASSIFIER ENSEMBLE Caltech-101 BENCHMARK Database
在线阅读 下载PDF
A Robust Rating Prediction Model for Recommendation Systems Based on Fake User Detection and Multi-Layer Feature Fusion
17
作者 Zhigeng Han Ting Zhou +2 位作者 Geng Chen Jian Chen Chunshuo Fu 《Big Data Mining and Analytics》 2025年第2期292-309,共18页
The effectiveness of recommendation systems heavily relies on accurately predicting user ratings for items based on user preferences and item attributes derived from ratings and reviews.However,the increasing presence... The effectiveness of recommendation systems heavily relies on accurately predicting user ratings for items based on user preferences and item attributes derived from ratings and reviews.However,the increasing presence of fake user data in these ratings and reviews poses significant challenges,hindering feature extraction,diminishing rating prediction accuracy,and eroding user trust in the system.To tackle this issue,we propose a robust rating prediction model for recommendation systems that integrates fake user detection and multi-layer feature fusion.Our model utilizes a GraphSAGE-based submodel to filter out fake user data from rating data and review texts.To strengthen fake user detection,we enhance GraphSAGE by selecting aggregation neighbors based on the collusion fraud degree among users,and employ an attention mechanism to weigh the contribution of each neighbor during representation aggregation.Furthermore,we introduce a multi-layer feature fusion submodel to integrate diverse features extracted from the filtered ratings and reviews.For deep feature extraction from review texts,we implement a temporal attention mechanism to analyze the relevance of reviews over time.For shallow feature extraction from rating data,we incorporate trust evaluation mechanism and cloud model to assess the influence of trusted neighbors’ratings.In our evaluation,we compare our model against six baseline models for fake user detection and four rating prediction models across five datasets.The results demonstrate that our model exhibits significant performance advantages in both fake user detection and rating prediction. 展开更多
关键词 recommendation system rating prediction fake user detection multi-layer feature fusion Graph Neural Network(GNN)
原文传递
隐式特征图引导的文生图:三向注意力融合
18
作者 马栋林 马晓珍 赵宏 《计算机技术与发展》 2026年第2期101-108,共8页
针对多阶段生成模型因显式中间图像导致误差累积的问题,提出基于隐式特征图与三向注意力融合的双阶段解耦框架。在隐式特征图生成阶段,通过递归注意力迭代生成64×64非可视化特征图,替代AttnGAN等模型的显式中间图像,有效规避多阶... 针对多阶段生成模型因显式中间图像导致误差累积的问题,提出基于隐式特征图与三向注意力融合的双阶段解耦框架。在隐式特征图生成阶段,通过递归注意力迭代生成64×64非可视化特征图,替代AttnGAN等模型的显式中间图像,有效规避多阶段可视化误差传递;在三向注意力增强阶段,扩展Triplet Attention为通道-空间-文本交互机制,实现像素级细粒度控制。实验表明,在细粒度要求最高的CUB数据集上,CLIP分数达0.82,优于基线模型AttnGAN(0.62)、MediaPipe(0.71)等模型,较基线模型提升32.3%;IS分数提升至5.05,较基线模型提升15.8%,且模型推理速度较StackGAN++等多阶段生成方法提升20%。在NVIDIA RTX 4090(24 GB显存)环境下,单张图像生成时间降低至0.96秒,该框架有效解决了多阶段误差传递与细粒度对齐问题。 展开更多
关键词 文本到图像生成 三向注意力 隐式特征图 细粒度对齐 双阶段生成
在线阅读 下载PDF
图文跨模态检索双重过滤与动态补全的注意力区域优化方法研究
19
作者 孟凡奇 田凯迪 田研 《现代信息科技》 2026年第1期41-46,共6页
当前图文跨模态检索存在两个主要瓶颈:传统注意力机制往往包含大量冗余区域,引入无关语义噪声;过度筛选又会导致有效区域不足,造成关键视觉信息丢失。这两种情况均会显著降低模型的匹配精度与鲁棒性。针对该问题,提出一种双重优化策略:... 当前图文跨模态检索存在两个主要瓶颈:传统注意力机制往往包含大量冗余区域,引入无关语义噪声;过度筛选又会导致有效区域不足,造成关键视觉信息丢失。这两种情况均会显著降低模型的匹配精度与鲁棒性。针对该问题,提出一种双重优化策略:首先通过双重过滤机制自适应保留高响应区域,有效抑制冗余噪声;同时创新性引入Top-K动态补全方法,在检测到特征不足时自动补充关键语义区域。实验验证表明,该方法在保持特征选择精度的同时,有效避免关键信息丢失,显著提升了模型在复杂场景下的跨模态匹配性能。 展开更多
关键词 跨模态检索 图文检索 特征对齐 阈值过滤 注意力优化
在线阅读 下载PDF
基于SMPL模态分解与嵌入融合的多模态步态识别
20
作者 吴越 梁铮 +4 位作者 高巍 杨茂达 赵培森 邓红霞 常媛媛 《浙江大学学报(工学版)》 北大核心 2026年第1期52-60,共9页
针对现有步态识别研究中步态信息挖掘不足和跨模态特征对齐不充分导致真实场景中识别性能受限的问题,提出基于蒙皮多人线性(SMPL)模态分解与嵌入融合的多模态步态识别方法.通过将SMPL模型分解为形状分支和姿势分支,全面提取人体静态形... 针对现有步态识别研究中步态信息挖掘不足和跨模态特征对齐不充分导致真实场景中识别性能受限的问题,提出基于蒙皮多人线性(SMPL)模态分解与嵌入融合的多模态步态识别方法.通过将SMPL模型分解为形状分支和姿势分支,全面提取人体静态形状特征和动态运动特征;构建自适应帧关节注意力模块,自适应聚焦关键帧与重要关节,增强姿势特征表达能力;设计模态嵌入融合模块,将不同模态特征投影至统一语义空间,并构建模态一致性损失函数,优化跨模态特征对齐,提升融合效果.在Gait3D数据集上的实验结果表明,与6种基于轮廓的方法、2种基于骨骼的方法以及5种基于轮廓和骨骼或SMPL模型的多模态方法比较,所提方法 Rank-1准确率达到70.4%,在复杂真实场景中表现出更高鲁棒性,验证了所提方法在模态特征提取和跨模态特征对齐方面的有效性. 展开更多
关键词 步态识别 SMPL模型 自适应注意力 特征对齐 模态融合
在线阅读 下载PDF
上一页 1 2 21 下一页 到第
使用帮助 返回顶部