期刊文献+
共找到23篇文章
< 1 2 >
每页显示 20 50 100
Multi-Stage-Based Siamese Neural Network for Seal Image Recognition
1
作者 Jianfeng Lu Xiangye Huang +3 位作者 Caijin Li Renlin Xin Shanqing Zhang Mahmoud Emam 《Computer Modeling in Engineering & Sciences》 SCIE EI 2025年第1期405-423,共19页
Seal authentication is an important task for verifying the authenticity of stamped seals used in various domains to protect legal documents from tampering and counterfeiting.Stamped seal inspection is commonly audited... Seal authentication is an important task for verifying the authenticity of stamped seals used in various domains to protect legal documents from tampering and counterfeiting.Stamped seal inspection is commonly audited manually to ensure document authenticity.However,manual assessment of seal images is tedious and laborintensive due to human errors,inconsistent placement,and completeness of the seal.Traditional image recognition systems are inadequate enough to identify seal types accurately,necessitating a neural network-based method for seal image recognition.However,neural network-based classification algorithms,such as Residual Networks(ResNet)andVisualGeometryGroup with 16 layers(VGG16)yield suboptimal recognition rates on stamp datasets.Additionally,the fixed training data categories make handling new categories to be a challenging task.This paper proposes amulti-stage seal recognition algorithmbased on Siamese network to overcome these limitations.Firstly,the seal image is pre-processed by applying an image rotation correction module based on Histogram of Oriented Gradients(HOG).Secondly,the similarity between input seal image pairs is measured by utilizing a similarity comparison module based on the Siamese network.Finally,we compare the results with the pre-stored standard seal template images in the database to obtain the seal type.To evaluate the performance of the proposed method,we further create a new seal image dataset that contains two subsets with 210,000 valid labeled pairs in total.The proposed work has a practical significance in industries where automatic seal authentication is essential as in legal,financial,and governmental sectors,where automatic seal recognition can enhance document security and streamline validation processes.Furthermore,the experimental results show that the proposed multi-stage method for seal image recognition outperforms state-of-the-art methods on the two established datasets. 展开更多
关键词 Seal recognition seal authentication document tampering siamese network spatial transformer network similarity comparison network
在线阅读 下载PDF
An Image Inpainting Approach Based on Parallel Dual-Branch Learnable Transformer Network
2
作者 Rongrong Gong Tingxian Zhang +2 位作者 Yawen Wei Dengyong Zhang Yan Li 《Computers, Materials & Continua》 2025年第10期1221-1234,共14页
Image inpainting refers to synthesizing missing content in an image based on known information to restore occluded or damaged regions,which is a typical manifestation of this trend.With the increasing complexity of im... Image inpainting refers to synthesizing missing content in an image based on known information to restore occluded or damaged regions,which is a typical manifestation of this trend.With the increasing complexity of image in tasks and the growth of data scale,existing deep learning methods still have some limitations.For example,they lack the ability to capture long-range dependencies and their performance in handling multi-scale image structures is suboptimal.To solve this problem,the paper proposes an image inpainting method based on the parallel dual-branch learnable Transformer network.The encoder of the proposed model generator consists of a dual-branch parallel structure with stacked CNN blocks and Transformer blocks,aiming to extract global and local feature information from images.Furthermore,a dual-branch fusion module is adopted to combine the features obtained from both branches.Additionally,a gated full-scale skip connection module is proposed to further enhance the coherence of the inpainting results and alleviate information loss.Finally,experimental results from the three public datasets demonstrate the superior performance of the proposed method. 展开更多
关键词 Artificial intelligence image inpainting transformer network dual-branch fusion gated full-scale skip connection
在线阅读 下载PDF
Appearance consistency and motion coherence learning for internal video inpainting 被引量:1
3
作者 Ruixin Liu Yuesheng Zhu GuiBo Luo 《CAAI Transactions on Intelligence Technology》 2025年第3期827-841,共15页
Internal learning-based video inpainting methods have shown promising results by exploiting the intrinsic properties of the video to fill in the missing region without external dataset supervision.However,existing int... Internal learning-based video inpainting methods have shown promising results by exploiting the intrinsic properties of the video to fill in the missing region without external dataset supervision.However,existing internal learning-based video inpainting methods would produce inconsistent structures or blurry textures due to the insufficient utilisation of motion priors within the video sequence.In this paper,the authors propose a new internal learning-based video inpainting model called appearance consistency and motion coherence network(ACMC-Net),which can not only learn the recurrence of appearance prior but can also capture motion coherence prior to improve the quality of the inpainting results.In ACMC-Net,a transformer-based appearance network is developed to capture global context information within the video frame for representing appearance consistency accurately.Additionally,a novel motion coherence learning scheme is proposed to learn the motion prior in a video sequence effectively.Finally,the learnt internal appearance consistency and motion coherence are implicitly propagated to the missing regions to achieve inpainting well.Extensive experiments conducted on the DAVIS dataset show that the proposed model obtains the superior performance in terms of quantitative measurements and produces more visually plausible results compared with the state-of-the-art methods. 展开更多
关键词 deep internal learning motion coherence spatial-temporal priors transformer network video inpainting
在线阅读 下载PDF
Network Decomposition and Maximum Independent Set Part Ⅱ: Application Research
4
作者 朱松年 朱嫱 《Journal of Southwest Jiaotong University(English Edition)》 2004年第1期1-14,共14页
According to the researches on theoretic basis in part Ⅰ of the paper, the spanning tree algorithms solving the maximum independent set both in even network and in odd network have been developed in this part, part ... According to the researches on theoretic basis in part Ⅰ of the paper, the spanning tree algorithms solving the maximum independent set both in even network and in odd network have been developed in this part, part Ⅱ of the paper. The algorithms transform first the general network into the pair sets network, and then decompose the pair sets network into a series of pair subsets by use of the characteristic of maximum flow passing through the pair sets network. As for the even network, the algorithm requires only one time of transformation and decomposition, the maximum independent set can be gained without any iteration processes, and the time complexity of the algorithm is within the bound of O(V3). However, as for the odd network, the algorithm consists of two stages. In the first stage, the general odd network is transformed and decomposed into the pseudo-negative envelope graphs and generalized reverse pseudo-negative envelope graphs alternately distributed at first; then the algorithm turns to the second stage, searching for the negative envelope graphs within the pseudo-negative envelope graphs only. Each time as a negative envelope graph has been found, renew the pair sets network by iteration at once, and then turn back to the first stage. So both stages form a circulation process up to the optimum. Two available methods, the adjusting search and the picking-off search are specially developed to deal with the problems resulted from the odd network. Both of them link up with each other harmoniously and are embedded together in the algorithm. Analysis and study indicate that the time complexity of this algorithm is within the bound of O(V5). 展开更多
关键词 network transformation and decomposition Negative envelope graph Pseudo-negative envelope graph Spanning tree algorithm Adjusting search Picking-off search Polynomial time bound.
在线阅读 下载PDF
Network Decomposition and Maximum Independent Set Part Ⅰ:Theoretic Basis
5
作者 朱松年 朱嫱 《Journal of Southwest Jiaotong University(English Edition)》 2003年第2期103-121,共19页
The structure and characteristics of a connected network are analyzed, and a special kind of sub-network, which can optimize the iteration processes, is discovered. Then, the sufficient and necessary conditions for o... The structure and characteristics of a connected network are analyzed, and a special kind of sub-network, which can optimize the iteration processes, is discovered. Then, the sufficient and necessary conditions for obtaining the maximum independent set are deduced. It is found that the neighborhood of this sub-network possesses the similar characters, but both can never be allowed incorporated together. Particularly, it is identified that the network can be divided into two parts by a certain style, and then both of them can be transformed into a pair sets network, where the special sub-networks and their neighborhoods appear alternately distributed throughout the entire pair sets network. By use of this characteristic, the network decomposed enough without losing any solutions is obtained. All of these above will be able to make well ready for developing a much better algorithm with polynomial time bound for an odd network in the the application research part of this subject. 展开更多
关键词 odd network network transformation and decomposition negative envelope graph and pseudo-negative envelope graph the sufficient and necessary conditions polynomial time.
在线阅读 下载PDF
Densely-connected Decoder Transformer for unsupervised anomaly detection of power electronic systems
6
作者 Zhichen Zhang Gen Qiu +1 位作者 Yuhua Cheng Min Wang 《Journal of Automation and Intelligence》 2025年第3期217-226,共10页
Reliable electricity infrastructure is critical for modern society,highlighting the importance of securing the stability of fundamental power electronic systems.However,as such systems frequently involve high-current ... Reliable electricity infrastructure is critical for modern society,highlighting the importance of securing the stability of fundamental power electronic systems.However,as such systems frequently involve high-current and high-voltage conditions,there is a greater likelihood of failures.Consequently,anomaly detection of power electronic systems holds great significance,which is a task that properly-designed neural networks can well undertake,as proven in various scenarios.Transformer-like networks are promising for such application,yet with its structure initially designed for different tasks,features extracted by beginning layers are often lost,decreasing detection performance.Also,such data-driven methods typically require sufficient anomalous data for training,which could be difficult to obtain in practice.Therefore,to improve feature utilization while achieving efficient unsupervised learning,a novel model,Densely-connected Decoder Transformer(DDformer),is proposed for unsupervised anomaly detection of power electronic systems in this paper.First,efficient labelfree training is achieved based on the concept of autoencoder with recursive-free output.An encoder-decoder structure with densely-connected decoder is then adopted,merging features from all encoder layers to avoid possible loss of mined features while reducing training difficulty.Both simulation and real-world experiments are conducted to validate the capabilities of DDformer,and the average FDR has surpassed baseline models,reaching 89.39%,93.91%,95.98%in different experiment setups respectively. 展开更多
关键词 Power electronic systems Anomaly detection Transformer network Dense connection Unsupervised learning DDformer
在线阅读 下载PDF
Transient Stability Assessment Model and Its Updating Based on Dual-Tower Transformer
7
作者 Nan Li Jingxiong Dong +1 位作者 Liang Tao Liang Huang 《Energy Engineering》 2025年第7期2957-2975,共19页
With the continuous expansion and increasing complexity of power system scales,the binary classifica-tion for transient stability assessment in power systems can no longer meet the safety requirements of power system ... With the continuous expansion and increasing complexity of power system scales,the binary classifica-tion for transient stability assessment in power systems can no longer meet the safety requirements of power system control and regulation.Therefore,this paper proposes a multi-class transient stability assessment model based on an improved Transformer.The model is designed with a dual-tower encoder structure:one encoder focuses on the time dependency of data,while the other focuses on the dynamic correlations between variables.Feature extraction is conducted from both time and variable perspectives to ensure the completeness of the feature extraction process,thereby enhancing the accuracy of multi-class evaluation in power systems.Additionally,this paper introduces a hybrid sampling strategy based on sample boundaries,which addresses the issue of sample imbalance by increasing the number of boundary samples in the minority class and reducing the number of non-boundary samples in the majority class.Considering the frequent changes in power grid topology or operation modes,this paper proposes a two-stage updating scheme based on self-supervised learning:In the first stage,self-supervised learning is employed to mine the structural information from unlabeled data in the target domain,enhancing the model’s generalization capability in new scenarios.In the second stage,a sample screening mechanism is used to select key samples,which are labeled through long-term simulation techniques for fine-tuning the model parameters.This allows for rapid model updates without relying on many labeled samples.This paper’s proposed model and update scheme have been simulated and verified on two node systems,the IEEE New England 10-machine 39-bus system and the IEEE 47-machine 140-bus system,demonstrating their effectiveness and reliability. 展开更多
关键词 Transient stability assessment sample imbalance dual-tower transformer network self-supervised learning
在线阅读 下载PDF
Person Re-Identification Based on Spatial Feature Learning and Multi-Granularity Feature Fusion
8
作者 DIAO Zijian CAO Shuai +4 位作者 LI Wenwei LIANG Jianan WEN Guilin HUANG Weici ZHANG Shouming 《Journal of Shanghai Jiaotong university(Science)》 2025年第2期363-374,共12页
In view of the weak ability of the convolutional neural networks to explicitly learn spatial invariance and the probabilistic loss of discriminative features caused by occlusion and background interference in pedestri... In view of the weak ability of the convolutional neural networks to explicitly learn spatial invariance and the probabilistic loss of discriminative features caused by occlusion and background interference in pedestrian re-identification tasks,a person re-identification method combining spatial feature learning and multi-granularity feature fusion was proposed.First,an attention spatial transformation network(A-STN)is proposed to learn spatial features and solve the problem of misalignment of pedestrian spatial features.Then the network was divided into a global branch,a local coarse-grained fusion branch,and a local fine-grained fusion branch to extract pedestrian global features,coarse-grained fusion features,and fine-grained fusion features,respectively.Among them,the global branch enriches the global features by fusing different pooling features.The local coarse-grained fusion branch uses an overlay pooling to enhance each local feature while learning the correlation relationship between multi-granularity features.The local fine-grained fusion branch uses a differential pooling to obtain the differential features that were fused with global features to learn the relationship between pedestrian local features and pedestrian global features.Finally,the proposed method was compared on three public datasets:Market1501,DukeMTMC-ReID and CUHK03.The experimental results were better than those of the comparative methods,which verifies the effectiveness of the proposed method. 展开更多
关键词 pedestrian re-identification spatial features attention spatial transformation network multi-branch network relation features
原文传递
Research on YOLO algorithm for lightweight PCB defect detection based on MobileViT
9
作者 LIU Yuchen LIU Fuzheng JIANG Mingshun 《Optoelectronics Letters》 2025年第8期483-490,共8页
Current you only look once(YOLO)-based algorithm model is facing the challenge of overwhelming parameters and calculation complexity under the printed circuit board(PCB)defect detection application scenario.In order t... Current you only look once(YOLO)-based algorithm model is facing the challenge of overwhelming parameters and calculation complexity under the printed circuit board(PCB)defect detection application scenario.In order to solve this problem,we propose a new method,which combined the lightweight network mobile vision transformer(Mobile Vi T)with the convolutional block attention module(CBAM)mechanism and the new regression loss function.This method needed less computation resources,making it more suitable for embedded edge detection devices.Meanwhile,the new loss function improved the positioning accuracy of the bounding box and enhanced the robustness of the model.In addition,experiments on public datasets demonstrate that the improved model achieves an average accuracy of 87.9%across six typical defect detection tasks,while reducing computational costs by nearly 90%.It significantly reduces the model's computational requirements while maintaining accuracy,ensuring reliable performance for edge deployment. 展开更多
关键词 YOLO lightweight network mobile vision transformer mobile Lightweight network convolutional block attention module cbam mechanism MobileViT CBAM PCB Defect Detection Regression Loss Function
原文传递
A Novel Action Transformer Network for Hybrid Multimodal Sign Language Recognition
10
作者 Sameena Javaid Safdar Rizvi 《Computers, Materials & Continua》 SCIE EI 2023年第1期523-537,共15页
Sign language fills the communication gap for people with hearing and speaking ailments.It includes both visual modalities,manual gestures consisting of movements of hands,and non-manual gestures incorporating body mo... Sign language fills the communication gap for people with hearing and speaking ailments.It includes both visual modalities,manual gestures consisting of movements of hands,and non-manual gestures incorporating body movements including head,facial expressions,eyes,shoulder shrugging,etc.Previously both gestures have been detected;identifying separately may have better accuracy,butmuch communicational information is lost.Aproper sign language mechanism is needed to detect manual and non-manual gestures to convey the appropriate detailed message to others.Our novel proposed system contributes as Sign LanguageAction Transformer Network(SLATN),localizing hand,body,and facial gestures in video sequences.Here we are expending a Transformer-style structural design as a“base network”to extract features from a spatiotemporal domain.Themodel impulsively learns to track individual persons and their action context inmultiple frames.Furthermore,a“head network”emphasizes hand movement and facial expression simultaneously,which is often crucial to understanding sign language,using its attention mechanism for creating tight bounding boxes around classified gestures.The model’s work is later compared with the traditional identification methods of activity recognition.It not only works faster but achieves better accuracy as well.Themodel achieves overall 82.66%testing accuracy with a very considerable performance of computation with 94.13 Giga-Floating Point Operations per Second(G-FLOPS).Another contribution is a newly created dataset of Pakistan Sign Language forManual and Non-Manual(PkSLMNM)gestures. 展开更多
关键词 Sign language gesture recognition manual signs non-manual signs action transformer network
在线阅读 下载PDF
Poisson Image Restoration via Transformed Network
11
作者 XU Xiaoling ZHENG Haiyu +2 位作者 ZHANG Fengqin LI Hechen ZHANG Minghui 《Journal of Shanghai Jiaotong university(Science)》 EI 2021年第6期857-868,共12页
There is a Poisson inverse problem in biomedical imaging,fluorescence microscopy and so on.Since the observed measurements are damaged by a linear operator and further destroyed by Poisson noise,recovering the approxi... There is a Poisson inverse problem in biomedical imaging,fluorescence microscopy and so on.Since the observed measurements are damaged by a linear operator and further destroyed by Poisson noise,recovering the approximate original image is difficult.Motivated by the decouple scheme and the variance-stabilizing transformation(VST)strategy,we propose a method of transformed convolutional neural network(CNN)to restore the observed image.In the network,the Conv-layers play the role of a linear inverse filter and the distribution transformation simultaneously.Furthermore,there is no batch normalization(BN)layer in the residual block of the network,which is devoted to tackling with the non-Gaussian recovery procedure.The proposed method is compared with state-of-the-art Poisson deblurring algorithms,and the experimental results show the effectiveness of the method. 展开更多
关键词 DECONVOLUTION Poisson noise transformed network decouple scheme variance-stabilizing transformation(VST)
原文传递
View interpolation networks for reproducing the material appearance of specular objects
12
作者 Chihiro HOSHIZAWA Takashi KOMURO 《Virtual Reality & Intelligent Hardware》 2023年第1期1-10,共10页
Background In this study, we propose view interpolation networks to reproduce changes in the brightness of an object′s surface depending on the viewing direction, which is important for reproducing the material appea... Background In this study, we propose view interpolation networks to reproduce changes in the brightness of an object′s surface depending on the viewing direction, which is important for reproducing the material appearance of a real object. Method We used an original and modified version of U-Net for image transformation. The networks were trained to generate images from the intermediate viewpoints of four cameras placed at the corners of a square. We conducted an experiment using with three different combinations of methods and training data formats. Result We determined that inputting the coordinates of the viewpoints together with the four camera images and using images from random viewpoints as the training data produces the best results. 展开更多
关键词 View synthesis Image transformation network Reflectance reproduction Material appearance U-Net
在线阅读 下载PDF
Method to generate training samples for neural network used in target recognition
13
作者 何灏 罗庆生 +2 位作者 罗霄 徐如强 李钢 《Journal of Beijing Institute of Technology》 EI CAS 2012年第3期400-407,共8页
Training neural network to recognize targets needs a lot of samples.People usually get these samples in a non-systematic way,which can miss or overemphasize some target information.To improve this situation,a new meth... Training neural network to recognize targets needs a lot of samples.People usually get these samples in a non-systematic way,which can miss or overemphasize some target information.To improve this situation,a new method based on virtual model and invariant moments was proposed to generate training samples.The method was composed of the following steps:use computer and simulation software to build target object's virtual model and then simulate the environment,light condition,camera parameter,etc.;rotate the model by spin and nutation of inclination to get the image sequence by virtual camera;preprocess each image and transfer them into binary image;calculate the invariant moments for each image and get a vectors' sequence.The vectors' sequence which was proved to be complete became the training samples together with the target outputs.The simulated results showed that the proposed method could be used to recognize the real targets and improve the accuracy of target recognition effectively when the sampling interval was short enough and the circumstance simulation was close enough. 展开更多
关键词 pattern recognition training samples for neural network model emulation space coordinate transform invariant moments
在线阅读 下载PDF
SIT.net: SAR Deforestation Classification of Amazon Forest for Land Use Land Cover Application 被引量:2
14
作者 Priyanka Darbari Ankush Agarwal Manoj Kumar 《Journal of Computer and Communications》 2024年第3期68-83,共16页
The process of turning forest area into land is known as deforestation or forest degradation. Reforestation as a fraction of deforestation is extremely low. For improved qualitative and quantitative classification, we... The process of turning forest area into land is known as deforestation or forest degradation. Reforestation as a fraction of deforestation is extremely low. For improved qualitative and quantitative classification, we used Sentinel-1 dataset of State of Para, Brazil to precisely and closely monitor deforestation between June 2019 and June 2023. This research aimed to find out suitable model for classification called Satellite Imaging analysis by Transpose deep neural transformation network (SIT-net) using mathematical model based on Band math approach to classify deforestation applying transpose deep neural network. The main advantage of proposed model is easy to handle SAR images. The study concludes that SAR satellite gives high-resolution images to improve deforestation monitoring and proposed model takes less computational time compared to other techniques. 展开更多
关键词 Brazilian Amazon Sentinel-1 Band Math Transpose CNN transformation network
在线阅读 下载PDF
Transformer-based correction scheme for short-term bus load prediction in holidays
15
作者 Tang Ningkai Lu Jixiang +3 位作者 Chen Tianyu Shu Jiao Chang Li Chen Tao 《Journal of Southeast University(English Edition)》 EI CAS 2024年第3期304-312,共9页
To tackle the problem of inaccurate short-term bus load prediction,especially during holidays,a Transformer-based scheme with tailored architectural enhancements is proposed.First,the input data are clustered to reduc... To tackle the problem of inaccurate short-term bus load prediction,especially during holidays,a Transformer-based scheme with tailored architectural enhancements is proposed.First,the input data are clustered to reduce complexity and capture inherent characteristics more effectively.Gated residual connections are then employed to selectively propagate salient features across layers,while an attention mechanism focuses on identifying prominent patterns in multivariate time-series data.Ultimately,a pre-trained structure is incorporated to reduce computational complexity.Experimental results based on extensive data show that the proposed scheme achieves improved prediction accuracy over comparative algorithms by at least 32.00%consistently across all buses evaluated,and the fitting effect of holiday load curves is outstanding.Meanwhile,the pre-trained structure drastically reduces the training time of the proposed algorithm by more than 65.75%.The proposed scheme can efficiently predict bus load results while enhancing robustness for holiday predictions,making it better adapted to real-world prediction scenarios. 展开更多
关键词 short-term bus load prediction Transformer network holiday load pre-training model load clustering
在线阅读 下载PDF
Mathematical Modeling and Control Algorithm Development for Bidirectional Power Flow in CCS-CNT System
16
作者 Sinqobile Wiseman Nene 《Journal of Power and Energy Engineering》 2024年第9期131-143,共12页
As the demand for more efficient and adaptable power distribution systems intensifies, especially in rural areas, innovative solutions like the Capacitor-Coupled Substation with a Controllable Network Transformer (CCS... As the demand for more efficient and adaptable power distribution systems intensifies, especially in rural areas, innovative solutions like the Capacitor-Coupled Substation with a Controllable Network Transformer (CCS-CNT) are becoming increasingly critical. Traditional power distribution networks, often limited by unidirectional flow capabilities and inflexibility, struggle to meet the complex demands of modern energy systems. The CCS-CNT system offers a transformative approach by enabling bidirectional power flow between high-voltage transmission lines and local distribution networks, a feature that is essential for integrating renewable energy sources and ensuring reliable electrification in underserved regions. This paper presents a detailed mathematical representation of power flow within the CCS-CNT system, emphasizing the control of both active and reactive power through the adjustment of voltage levels and phase angles. A control algorithm is developed to dynamically manage power flow, ensuring optimal performance by minimizing losses and maintaining voltage stability across the network. The proposed CCS-CNT system demonstrates significant potential in enhancing the efficiency and reliability of power distribution, making it particularly suited for rural electrification and other applications where traditional methods fall short. The findings underscore the system's capability to adapt to varying operational conditions, offering a robust solution for modern power distribution challenges. 展开更多
关键词 Capacitor Couple Substation Ferroresonance Power Flow Control Controllable network Controller Capacitor-Coupled Substation Incorporating Controllable network Transformer (CCS-CNT) System System Modeling
在线阅读 下载PDF
Automated Facial Expression Recognition and Age Estimation Using Deep Learning 被引量:3
17
作者 Syeda Amna Rizwan Yazeed Yasin Ghadi +1 位作者 Ahmad Jalal Kibum Kim 《Computers, Materials & Continua》 SCIE EI 2022年第6期5235-5252,共18页
With the advancement of computer vision techniques in surveillance systems,the need for more proficient,intelligent,and sustainable facial expressions and age recognition is necessary.The main purpose of this study is... With the advancement of computer vision techniques in surveillance systems,the need for more proficient,intelligent,and sustainable facial expressions and age recognition is necessary.The main purpose of this study is to develop accurate facial expressions and an age recognition system that is capable of error-free recognition of human expression and age in both indoor and outdoor environments.The proposed system first takes an input image pre-process it and then detects faces in the entire image.After that landmarks localization helps in the formation of synthetic face mask prediction.A novel set of features are extracted and passed to a classifier for the accurate classification of expressions and age group.The proposed system is tested over two benchmark datasets,namely,the Gallagher collection person dataset and the Images of Groups dataset.The system achieved remarkable results over these benchmark datasets about recognition accuracy and computational time.The proposed system would also be applicable in different consumer application domains such as online business negotiations,consumer behavior analysis,E-learning environments,and emotion robotics. 展开更多
关键词 Feature extraction face expression model local transform features and recurrent neural network(RNN)
在线阅读 下载PDF
Effectively Integrating CNN and Low-Complexity Transformer for Lung Cancer Tumor Prediction After Neoadjuvant Chemoimmunotherapy
18
作者 Jiancun Zhou Xianzhen Tan +1 位作者 Hulin Kuang Jianxin Wang 《Big Data Mining and Analytics》 2025年第5期981-996,共16页
A novel hybrid model combining a convolutional neural network(CNN)and a low-complexity Transformer network is introduced for predicting lung cancer response to neoadjuvant chemoimmunotherapy using computed tomography ... A novel hybrid model combining a convolutional neural network(CNN)and a low-complexity Transformer network is introduced for predicting lung cancer response to neoadjuvant chemoimmunotherapy using computed tomography scans.This approach is crucial as it assists clinicians in identifying patients likely to benefit from treatment and in assessing their prognosis.The model employs channel splitting to minimize parameter count.It then leverages both CNN for local feature extraction and a streamlined Transformer for global feature comprehension.To enhance efficiency,a novel self-attention mechanism is implemented,focusing on feature aggregation and element-wise multiplication.To address the different semantic meanings of features,an attention-based module is designed to seamlessly integrate features from both networks,employing a process of coarse fusion,attention computation,and fine fusion.When evaluated with data from 232 lung cancer patients who have undergone neoadjuvant chemoimmunotherapy,the model demonstrates exceptional performance,achieving a Dice score of 47.04%and a 95.00%Hausdorff distance of 25.12 mm,outperforming existing methods.Additionally,it has only 2.91×106 parameters and 52.95×109 floating point operations.Moreover,the model’s predictive accuracy in tumor diameter estimation is beneficial for treatment planning.Its robustness is further validated through its application in stroke lesion prediction,indicating its broad applicability. 展开更多
关键词 lung cancer tumor prediction neoadjuvant chemoimmunotherapy hybrid convolutional neural network(CNN)and Transformer network low-complexity self-attention
原文传递
Desensitization of Private Text Dataset Based on Gradient Strategy Trans-WTGAN
19
作者 Zhen Guo Ying Zhou +1 位作者 Jun Ye Yongxu Hou 《Tsinghua Science and Technology》 2025年第5期2081-2096,共16页
Privacy-sensitive data encounter immense security and usability challenges in processing,analyzing,and sharing.Meanwhile,traditional privacy data desensitization methods suffer from issues such as poor quality and low... Privacy-sensitive data encounter immense security and usability challenges in processing,analyzing,and sharing.Meanwhile,traditional privacy data desensitization methods suffer from issues such as poor quality and low usability after desensitization.Therefore,a text data desensitization model that combines Transformer and Wasserstein Text convolutional Generative Adversarial Network(Trans-WTGAN)is proposed.Transformer as the generator and its self-attention mechanism can handle long-range dependencies,enabling the generated of higher-quality text;Text Convolutional Neural Network(TextCNN)integrates the idea of Wasserstein as the discriminator to enhance the stability of model training;and the strategy gradient scheme of reinforcement learning is employed.Reinforcement learning utilizes the policy gradient scheme as the updating method of generator parameters,ensuring the generated data retains the original key features and maintains a certain level of usability.The experimental results indicate that the proposed model scheme holds a greater advantage over existing methods in terms of text quality and structural consistency,can guarantee the desensitization effect,and ensures the usability of the privacy-sensitive data to a certain extent after desensitization,facilitates the simulation of the development environment for the use of real data and the analysis and sharing of data. 展开更多
关键词 DESENSITIZATION gradient strategy Transformer and Wasserstein Text convolutional Generative Adversarial network(Trans-WTGAN) USABILITY
原文传递
A transformer-based Siamese network and an open optical dataset for semantic change detection of remote sensing images 被引量:3
20
作者 Panli Yuan Qingzhan Zhao +3 位作者 Xingbiao Zhao Xuewen Wang Xuefeng Long Yuchen Zheng 《International Journal of Digital Earth》 SCIE EI 2022年第1期1506-1525,共20页
Recent change detection(CD)methods focus on the extraction of deep change semantic features.However,existing methods overlook the fine-grained features and have the poor ability to capture long-range space–time infor... Recent change detection(CD)methods focus on the extraction of deep change semantic features.However,existing methods overlook the fine-grained features and have the poor ability to capture long-range space–time information,which leads to the micro changes missing and the edges of change types smoothing.In this paper,a potential transformer-based semantic change detection(SCD)model,Pyramid-SCDFormer is proposed,which precisely recognizes the small changes and fine edges details of the changes.The SCD model selectively merges different semantic tokens in multi-head self-attention block to obtain multiscale features,which is crucial for extraction information of remote sensing images(RSIs)with multiple changes from different scales.Moreover,we create a well-annotated SCD dataset,Landsat-SCD with unprecedented time series and change types in complex scenarios.Comparing with three Convolutional Neural Network-based,one attention-based,and two transformer-based networks,experimental results demonstrate that the Pyramid-SCDFormer stably outperforms the existing state-of-the-art CD models and obtains an improvement in MIoU/F1 of 1.11/0.76%,0.57/0.50%,and 8.75/8.59%on the LEVIR-CD,WHU_CD,and Landsat-SCD dataset respectively.For change classes proportion less than 1%,the proposed model improves the MIoU by 7.17–19.53%on Landsat-SCD dataset.The recognition performance for small-scale and fine edges of change types has greatly improved. 展开更多
关键词 Semantic change detection(SCD) change detection dataset transformer siamese network self-attention mechanism bitemporal remote sensing
原文传递
上一页 1 2 下一页 到第
使用帮助 返回顶部