期刊文献+
共找到23篇文章
< 1 2 >
每页显示 20 50 100
An Image Inpainting Approach Based on Parallel Dual-Branch Learnable Transformer Network
1
作者 Rongrong Gong Tingxian Zhang +2 位作者 Yawen Wei Dengyong Zhang Yan Li 《Computers, Materials & Continua》 2025年第10期1221-1234,共14页
Image inpainting refers to synthesizing missing content in an image based on known information to restore occluded or damaged regions,which is a typical manifestation of this trend.With the increasing complexity of im... Image inpainting refers to synthesizing missing content in an image based on known information to restore occluded or damaged regions,which is a typical manifestation of this trend.With the increasing complexity of image in tasks and the growth of data scale,existing deep learning methods still have some limitations.For example,they lack the ability to capture long-range dependencies and their performance in handling multi-scale image structures is suboptimal.To solve this problem,the paper proposes an image inpainting method based on the parallel dual-branch learnable Transformer network.The encoder of the proposed model generator consists of a dual-branch parallel structure with stacked CNN blocks and Transformer blocks,aiming to extract global and local feature information from images.Furthermore,a dual-branch fusion module is adopted to combine the features obtained from both branches.Additionally,a gated full-scale skip connection module is proposed to further enhance the coherence of the inpainting results and alleviate information loss.Finally,experimental results from the three public datasets demonstrate the superior performance of the proposed method. 展开更多
关键词 Artificial intelligence image inpainting transformer network dual-branch fusion gated full-scale skip connection
在线阅读 下载PDF
A Novel Action Transformer Network for Hybrid Multimodal Sign Language Recognition
2
作者 Sameena Javaid Safdar Rizvi 《Computers, Materials & Continua》 SCIE EI 2023年第1期523-537,共15页
Sign language fills the communication gap for people with hearing and speaking ailments.It includes both visual modalities,manual gestures consisting of movements of hands,and non-manual gestures incorporating body mo... Sign language fills the communication gap for people with hearing and speaking ailments.It includes both visual modalities,manual gestures consisting of movements of hands,and non-manual gestures incorporating body movements including head,facial expressions,eyes,shoulder shrugging,etc.Previously both gestures have been detected;identifying separately may have better accuracy,butmuch communicational information is lost.Aproper sign language mechanism is needed to detect manual and non-manual gestures to convey the appropriate detailed message to others.Our novel proposed system contributes as Sign LanguageAction Transformer Network(SLATN),localizing hand,body,and facial gestures in video sequences.Here we are expending a Transformer-style structural design as a“base network”to extract features from a spatiotemporal domain.Themodel impulsively learns to track individual persons and their action context inmultiple frames.Furthermore,a“head network”emphasizes hand movement and facial expression simultaneously,which is often crucial to understanding sign language,using its attention mechanism for creating tight bounding boxes around classified gestures.The model’s work is later compared with the traditional identification methods of activity recognition.It not only works faster but achieves better accuracy as well.Themodel achieves overall 82.66%testing accuracy with a very considerable performance of computation with 94.13 Giga-Floating Point Operations per Second(G-FLOPS).Another contribution is a newly created dataset of Pakistan Sign Language forManual and Non-Manual(PkSLMNM)gestures. 展开更多
关键词 Sign language gesture recognition manual signs non-manual signs action transformer network
在线阅读 下载PDF
Multi-Stage-Based Siamese Neural Network for Seal Image Recognition
3
作者 Jianfeng Lu Xiangye Huang +3 位作者 Caijin Li Renlin Xin Shanqing Zhang Mahmoud Emam 《Computer Modeling in Engineering & Sciences》 SCIE EI 2025年第1期405-423,共19页
Seal authentication is an important task for verifying the authenticity of stamped seals used in various domains to protect legal documents from tampering and counterfeiting.Stamped seal inspection is commonly audited... Seal authentication is an important task for verifying the authenticity of stamped seals used in various domains to protect legal documents from tampering and counterfeiting.Stamped seal inspection is commonly audited manually to ensure document authenticity.However,manual assessment of seal images is tedious and laborintensive due to human errors,inconsistent placement,and completeness of the seal.Traditional image recognition systems are inadequate enough to identify seal types accurately,necessitating a neural network-based method for seal image recognition.However,neural network-based classification algorithms,such as Residual Networks(ResNet)andVisualGeometryGroup with 16 layers(VGG16)yield suboptimal recognition rates on stamp datasets.Additionally,the fixed training data categories make handling new categories to be a challenging task.This paper proposes amulti-stage seal recognition algorithmbased on Siamese network to overcome these limitations.Firstly,the seal image is pre-processed by applying an image rotation correction module based on Histogram of Oriented Gradients(HOG).Secondly,the similarity between input seal image pairs is measured by utilizing a similarity comparison module based on the Siamese network.Finally,we compare the results with the pre-stored standard seal template images in the database to obtain the seal type.To evaluate the performance of the proposed method,we further create a new seal image dataset that contains two subsets with 210,000 valid labeled pairs in total.The proposed work has a practical significance in industries where automatic seal authentication is essential as in legal,financial,and governmental sectors,where automatic seal recognition can enhance document security and streamline validation processes.Furthermore,the experimental results show that the proposed multi-stage method for seal image recognition outperforms state-of-the-art methods on the two established datasets. 展开更多
关键词 Seal recognition seal authentication document tampering siamese network spatial transformer network similarity comparison network
在线阅读 下载PDF
Densely-connected Decoder Transformer for unsupervised anomaly detection of power electronic systems
4
作者 Zhichen Zhang Gen Qiu +1 位作者 Yuhua Cheng Min Wang 《Journal of Automation and Intelligence》 2025年第3期217-226,共10页
Reliable electricity infrastructure is critical for modern society,highlighting the importance of securing the stability of fundamental power electronic systems.However,as such systems frequently involve high-current ... Reliable electricity infrastructure is critical for modern society,highlighting the importance of securing the stability of fundamental power electronic systems.However,as such systems frequently involve high-current and high-voltage conditions,there is a greater likelihood of failures.Consequently,anomaly detection of power electronic systems holds great significance,which is a task that properly-designed neural networks can well undertake,as proven in various scenarios.Transformer-like networks are promising for such application,yet with its structure initially designed for different tasks,features extracted by beginning layers are often lost,decreasing detection performance.Also,such data-driven methods typically require sufficient anomalous data for training,which could be difficult to obtain in practice.Therefore,to improve feature utilization while achieving efficient unsupervised learning,a novel model,Densely-connected Decoder Transformer(DDformer),is proposed for unsupervised anomaly detection of power electronic systems in this paper.First,efficient labelfree training is achieved based on the concept of autoencoder with recursive-free output.An encoder-decoder structure with densely-connected decoder is then adopted,merging features from all encoder layers to avoid possible loss of mined features while reducing training difficulty.Both simulation and real-world experiments are conducted to validate the capabilities of DDformer,and the average FDR has surpassed baseline models,reaching 89.39%,93.91%,95.98%in different experiment setups respectively. 展开更多
关键词 Power electronic systems Anomaly detection transformer network Dense connection Unsupervised learning DDformer
在线阅读 下载PDF
Transient Stability Assessment Model and Its Updating Based on Dual-Tower Transformer
5
作者 Nan Li Jingxiong Dong +1 位作者 Liang Tao Liang Huang 《Energy Engineering》 2025年第7期2957-2975,共19页
With the continuous expansion and increasing complexity of power system scales,the binary classifica-tion for transient stability assessment in power systems can no longer meet the safety requirements of power system ... With the continuous expansion and increasing complexity of power system scales,the binary classifica-tion for transient stability assessment in power systems can no longer meet the safety requirements of power system control and regulation.Therefore,this paper proposes a multi-class transient stability assessment model based on an improved Transformer.The model is designed with a dual-tower encoder structure:one encoder focuses on the time dependency of data,while the other focuses on the dynamic correlations between variables.Feature extraction is conducted from both time and variable perspectives to ensure the completeness of the feature extraction process,thereby enhancing the accuracy of multi-class evaluation in power systems.Additionally,this paper introduces a hybrid sampling strategy based on sample boundaries,which addresses the issue of sample imbalance by increasing the number of boundary samples in the minority class and reducing the number of non-boundary samples in the majority class.Considering the frequent changes in power grid topology or operation modes,this paper proposes a two-stage updating scheme based on self-supervised learning:In the first stage,self-supervised learning is employed to mine the structural information from unlabeled data in the target domain,enhancing the model’s generalization capability in new scenarios.In the second stage,a sample screening mechanism is used to select key samples,which are labeled through long-term simulation techniques for fine-tuning the model parameters.This allows for rapid model updates without relying on many labeled samples.This paper’s proposed model and update scheme have been simulated and verified on two node systems,the IEEE New England 10-machine 39-bus system and the IEEE 47-machine 140-bus system,demonstrating their effectiveness and reliability. 展开更多
关键词 Transient stability assessment sample imbalance dual-tower transformer network self-supervised learning
在线阅读 下载PDF
Transformer-based correction scheme for short-term bus load prediction in holidays
6
作者 Tang Ningkai Lu Jixiang +3 位作者 Chen Tianyu Shu Jiao Chang Li Chen Tao 《Journal of Southeast University(English Edition)》 EI CAS 2024年第3期304-312,共9页
To tackle the problem of inaccurate short-term bus load prediction,especially during holidays,a Transformer-based scheme with tailored architectural enhancements is proposed.First,the input data are clustered to reduc... To tackle the problem of inaccurate short-term bus load prediction,especially during holidays,a Transformer-based scheme with tailored architectural enhancements is proposed.First,the input data are clustered to reduce complexity and capture inherent characteristics more effectively.Gated residual connections are then employed to selectively propagate salient features across layers,while an attention mechanism focuses on identifying prominent patterns in multivariate time-series data.Ultimately,a pre-trained structure is incorporated to reduce computational complexity.Experimental results based on extensive data show that the proposed scheme achieves improved prediction accuracy over comparative algorithms by at least 32.00%consistently across all buses evaluated,and the fitting effect of holiday load curves is outstanding.Meanwhile,the pre-trained structure drastically reduces the training time of the proposed algorithm by more than 65.75%.The proposed scheme can efficiently predict bus load results while enhancing robustness for holiday predictions,making it better adapted to real-world prediction scenarios. 展开更多
关键词 short-term bus load prediction transformer network holiday load pre-training model load clustering
在线阅读 下载PDF
Appearance consistency and motion coherence learning for internal video inpainting 被引量:1
7
作者 Ruixin Liu Yuesheng Zhu GuiBo Luo 《CAAI Transactions on Intelligence Technology》 2025年第3期827-841,共15页
Internal learning-based video inpainting methods have shown promising results by exploiting the intrinsic properties of the video to fill in the missing region without external dataset supervision.However,existing int... Internal learning-based video inpainting methods have shown promising results by exploiting the intrinsic properties of the video to fill in the missing region without external dataset supervision.However,existing internal learning-based video inpainting methods would produce inconsistent structures or blurry textures due to the insufficient utilisation of motion priors within the video sequence.In this paper,the authors propose a new internal learning-based video inpainting model called appearance consistency and motion coherence network(ACMC-Net),which can not only learn the recurrence of appearance prior but can also capture motion coherence prior to improve the quality of the inpainting results.In ACMC-Net,a transformer-based appearance network is developed to capture global context information within the video frame for representing appearance consistency accurately.Additionally,a novel motion coherence learning scheme is proposed to learn the motion prior in a video sequence effectively.Finally,the learnt internal appearance consistency and motion coherence are implicitly propagated to the missing regions to achieve inpainting well.Extensive experiments conducted on the DAVIS dataset show that the proposed model obtains the superior performance in terms of quantitative measurements and produces more visually plausible results compared with the state-of-the-art methods. 展开更多
关键词 deep internal learning motion coherence spatial-temporal priors transformer network video inpainting
在线阅读 下载PDF
Person Re-Identification Based on Spatial Feature Learning and Multi-Granularity Feature Fusion
8
作者 DIAO Zijian CAO Shuai +4 位作者 LI Wenwei LIANG Jianan WEN Guilin HUANG Weici ZHANG Shouming 《Journal of Shanghai Jiaotong university(Science)》 2025年第2期363-374,共12页
In view of the weak ability of the convolutional neural networks to explicitly learn spatial invariance and the probabilistic loss of discriminative features caused by occlusion and background interference in pedestri... In view of the weak ability of the convolutional neural networks to explicitly learn spatial invariance and the probabilistic loss of discriminative features caused by occlusion and background interference in pedestrian re-identification tasks,a person re-identification method combining spatial feature learning and multi-granularity feature fusion was proposed.First,an attention spatial transformation network(A-STN)is proposed to learn spatial features and solve the problem of misalignment of pedestrian spatial features.Then the network was divided into a global branch,a local coarse-grained fusion branch,and a local fine-grained fusion branch to extract pedestrian global features,coarse-grained fusion features,and fine-grained fusion features,respectively.Among them,the global branch enriches the global features by fusing different pooling features.The local coarse-grained fusion branch uses an overlay pooling to enhance each local feature while learning the correlation relationship between multi-granularity features.The local fine-grained fusion branch uses a differential pooling to obtain the differential features that were fused with global features to learn the relationship between pedestrian local features and pedestrian global features.Finally,the proposed method was compared on three public datasets:Market1501,DukeMTMC-ReID and CUHK03.The experimental results were better than those of the comparative methods,which verifies the effectiveness of the proposed method. 展开更多
关键词 pedestrian re-identification spatial features attention spatial transformation network multi-branch network relation features
原文传递
Research on YOLO algorithm for lightweight PCB defect detection based on MobileViT
9
作者 LIU Yuchen LIU Fuzheng JIANG Mingshun 《Optoelectronics Letters》 2025年第8期483-490,共8页
Current you only look once(YOLO)-based algorithm model is facing the challenge of overwhelming parameters and calculation complexity under the printed circuit board(PCB)defect detection application scenario.In order t... Current you only look once(YOLO)-based algorithm model is facing the challenge of overwhelming parameters and calculation complexity under the printed circuit board(PCB)defect detection application scenario.In order to solve this problem,we propose a new method,which combined the lightweight network mobile vision transformer(Mobile Vi T)with the convolutional block attention module(CBAM)mechanism and the new regression loss function.This method needed less computation resources,making it more suitable for embedded edge detection devices.Meanwhile,the new loss function improved the positioning accuracy of the bounding box and enhanced the robustness of the model.In addition,experiments on public datasets demonstrate that the improved model achieves an average accuracy of 87.9%across six typical defect detection tasks,while reducing computational costs by nearly 90%.It significantly reduces the model's computational requirements while maintaining accuracy,ensuring reliable performance for edge deployment. 展开更多
关键词 YOLO lightweight network mobile vision transformer mobile Lightweight network convolutional block attention module cbam mechanism MobileViT CBAM PCB Defect Detection Regression Loss Function
原文传递
Poisson Image Restoration via Transformed Network
10
作者 XU Xiaoling ZHENG Haiyu +2 位作者 ZHANG Fengqin LI Hechen ZHANG Minghui 《Journal of Shanghai Jiaotong university(Science)》 EI 2021年第6期857-868,共12页
There is a Poisson inverse problem in biomedical imaging,fluorescence microscopy and so on.Since the observed measurements are damaged by a linear operator and further destroyed by Poisson noise,recovering the approxi... There is a Poisson inverse problem in biomedical imaging,fluorescence microscopy and so on.Since the observed measurements are damaged by a linear operator and further destroyed by Poisson noise,recovering the approximate original image is difficult.Motivated by the decouple scheme and the variance-stabilizing transformation(VST)strategy,we propose a method of transformed convolutional neural network(CNN)to restore the observed image.In the network,the Conv-layers play the role of a linear inverse filter and the distribution transformation simultaneously.Furthermore,there is no batch normalization(BN)layer in the residual block of the network,which is devoted to tackling with the non-Gaussian recovery procedure.The proposed method is compared with state-of-the-art Poisson deblurring algorithms,and the experimental results show the effectiveness of the method. 展开更多
关键词 DECONVOLUTION Poisson noise transformed network decouple scheme variance-stabilizing transformation(VST)
原文传递
View interpolation networks for reproducing the material appearance of specular objects
11
作者 Chihiro HOSHIZAWA Takashi KOMURO 《Virtual Reality & Intelligent Hardware》 2023年第1期1-10,共10页
Background In this study, we propose view interpolation networks to reproduce changes in the brightness of an object′s surface depending on the viewing direction, which is important for reproducing the material appea... Background In this study, we propose view interpolation networks to reproduce changes in the brightness of an object′s surface depending on the viewing direction, which is important for reproducing the material appearance of a real object. Method We used an original and modified version of U-Net for image transformation. The networks were trained to generate images from the intermediate viewpoints of four cameras placed at the corners of a square. We conducted an experiment using with three different combinations of methods and training data formats. Result We determined that inputting the coordinates of the viewpoints together with the four camera images and using images from random viewpoints as the training data produces the best results. 展开更多
关键词 View synthesis Image transformation network Reflectance reproduction Material appearance U-Net
在线阅读 下载PDF
Network Decomposition and Maximum Independent Set Part Ⅱ: Application Research
12
作者 朱松年 朱嫱 《Journal of Southwest Jiaotong University(English Edition)》 2004年第1期1-14,共14页
According to the researches on theoretic basis in part Ⅰ of the paper, the spanning tree algorithms solving the maximum independent set both in even network and in odd network have been developed in this part, part ... According to the researches on theoretic basis in part Ⅰ of the paper, the spanning tree algorithms solving the maximum independent set both in even network and in odd network have been developed in this part, part Ⅱ of the paper. The algorithms transform first the general network into the pair sets network, and then decompose the pair sets network into a series of pair subsets by use of the characteristic of maximum flow passing through the pair sets network. As for the even network, the algorithm requires only one time of transformation and decomposition, the maximum independent set can be gained without any iteration processes, and the time complexity of the algorithm is within the bound of O(V3). However, as for the odd network, the algorithm consists of two stages. In the first stage, the general odd network is transformed and decomposed into the pseudo-negative envelope graphs and generalized reverse pseudo-negative envelope graphs alternately distributed at first; then the algorithm turns to the second stage, searching for the negative envelope graphs within the pseudo-negative envelope graphs only. Each time as a negative envelope graph has been found, renew the pair sets network by iteration at once, and then turn back to the first stage. So both stages form a circulation process up to the optimum. Two available methods, the adjusting search and the picking-off search are specially developed to deal with the problems resulted from the odd network. Both of them link up with each other harmoniously and are embedded together in the algorithm. Analysis and study indicate that the time complexity of this algorithm is within the bound of O(V5). 展开更多
关键词 network transformation and decomposition Negative envelope graph Pseudo-negative envelope graph Spanning tree algorithm Adjusting search Picking-off search Polynomial time bound.
在线阅读 下载PDF
Network Decomposition and Maximum Independent Set Part Ⅰ:Theoretic Basis
13
作者 朱松年 朱嫱 《Journal of Southwest Jiaotong University(English Edition)》 2003年第2期103-121,共19页
The structure and characteristics of a connected network are analyzed, and a special kind of sub-network, which can optimize the iteration processes, is discovered. Then, the sufficient and necessary conditions for o... The structure and characteristics of a connected network are analyzed, and a special kind of sub-network, which can optimize the iteration processes, is discovered. Then, the sufficient and necessary conditions for obtaining the maximum independent set are deduced. It is found that the neighborhood of this sub-network possesses the similar characters, but both can never be allowed incorporated together. Particularly, it is identified that the network can be divided into two parts by a certain style, and then both of them can be transformed into a pair sets network, where the special sub-networks and their neighborhoods appear alternately distributed throughout the entire pair sets network. By use of this characteristic, the network decomposed enough without losing any solutions is obtained. All of these above will be able to make well ready for developing a much better algorithm with polynomial time bound for an odd network in the the application research part of this subject. 展开更多
关键词 odd network network transformation and decomposition negative envelope graph and pseudo-negative envelope graph the sufficient and necessary conditions polynomial time.
在线阅读 下载PDF
Method to generate training samples for neural network used in target recognition
14
作者 何灏 罗庆生 +2 位作者 罗霄 徐如强 李钢 《Journal of Beijing Institute of Technology》 EI CAS 2012年第3期400-407,共8页
Training neural network to recognize targets needs a lot of samples.People usually get these samples in a non-systematic way,which can miss or overemphasize some target information.To improve this situation,a new meth... Training neural network to recognize targets needs a lot of samples.People usually get these samples in a non-systematic way,which can miss or overemphasize some target information.To improve this situation,a new method based on virtual model and invariant moments was proposed to generate training samples.The method was composed of the following steps:use computer and simulation software to build target object's virtual model and then simulate the environment,light condition,camera parameter,etc.;rotate the model by spin and nutation of inclination to get the image sequence by virtual camera;preprocess each image and transfer them into binary image;calculate the invariant moments for each image and get a vectors' sequence.The vectors' sequence which was proved to be complete became the training samples together with the target outputs.The simulated results showed that the proposed method could be used to recognize the real targets and improve the accuracy of target recognition effectively when the sampling interval was short enough and the circumstance simulation was close enough. 展开更多
关键词 pattern recognition training samples for neural network model emulation space coordinate transform invariant moments
在线阅读 下载PDF
SIT.net: SAR Deforestation Classification of Amazon Forest for Land Use Land Cover Application 被引量:2
15
作者 Priyanka Darbari Ankush Agarwal Manoj Kumar 《Journal of Computer and Communications》 2024年第3期68-83,共16页
The process of turning forest area into land is known as deforestation or forest degradation. Reforestation as a fraction of deforestation is extremely low. For improved qualitative and quantitative classification, we... The process of turning forest area into land is known as deforestation or forest degradation. Reforestation as a fraction of deforestation is extremely low. For improved qualitative and quantitative classification, we used Sentinel-1 dataset of State of Para, Brazil to precisely and closely monitor deforestation between June 2019 and June 2023. This research aimed to find out suitable model for classification called Satellite Imaging analysis by Transpose deep neural transformation network (SIT-net) using mathematical model based on Band math approach to classify deforestation applying transpose deep neural network. The main advantage of proposed model is easy to handle SAR images. The study concludes that SAR satellite gives high-resolution images to improve deforestation monitoring and proposed model takes less computational time compared to other techniques. 展开更多
关键词 Brazilian Amazon Sentinel-1 Band Math Transpose CNN Transformation network
在线阅读 下载PDF
Mathematical Modeling and Control Algorithm Development for Bidirectional Power Flow in CCS-CNT System
16
作者 Sinqobile Wiseman Nene 《Journal of Power and Energy Engineering》 2024年第9期131-143,共12页
As the demand for more efficient and adaptable power distribution systems intensifies, especially in rural areas, innovative solutions like the Capacitor-Coupled Substation with a Controllable Network Transformer (CCS... As the demand for more efficient and adaptable power distribution systems intensifies, especially in rural areas, innovative solutions like the Capacitor-Coupled Substation with a Controllable Network Transformer (CCS-CNT) are becoming increasingly critical. Traditional power distribution networks, often limited by unidirectional flow capabilities and inflexibility, struggle to meet the complex demands of modern energy systems. The CCS-CNT system offers a transformative approach by enabling bidirectional power flow between high-voltage transmission lines and local distribution networks, a feature that is essential for integrating renewable energy sources and ensuring reliable electrification in underserved regions. This paper presents a detailed mathematical representation of power flow within the CCS-CNT system, emphasizing the control of both active and reactive power through the adjustment of voltage levels and phase angles. A control algorithm is developed to dynamically manage power flow, ensuring optimal performance by minimizing losses and maintaining voltage stability across the network. The proposed CCS-CNT system demonstrates significant potential in enhancing the efficiency and reliability of power distribution, making it particularly suited for rural electrification and other applications where traditional methods fall short. The findings underscore the system's capability to adapt to varying operational conditions, offering a robust solution for modern power distribution challenges. 展开更多
关键词 Capacitor Couple Substation Ferroresonance Power Flow Control Controllable network Controller Capacitor-Coupled Substation Incorporating Controllable network transformer (CCS-CNT) System System Modeling
在线阅读 下载PDF
A transformer-based Siamese network and an open optical dataset for semantic change detection of remote sensing images 被引量:3
17
作者 Panli Yuan Qingzhan Zhao +3 位作者 Xingbiao Zhao Xuewen Wang Xuefeng Long Yuchen Zheng 《International Journal of Digital Earth》 SCIE EI 2022年第1期1506-1525,共20页
Recent change detection(CD)methods focus on the extraction of deep change semantic features.However,existing methods overlook the fine-grained features and have the poor ability to capture long-range space–time infor... Recent change detection(CD)methods focus on the extraction of deep change semantic features.However,existing methods overlook the fine-grained features and have the poor ability to capture long-range space–time information,which leads to the micro changes missing and the edges of change types smoothing.In this paper,a potential transformer-based semantic change detection(SCD)model,Pyramid-SCDFormer is proposed,which precisely recognizes the small changes and fine edges details of the changes.The SCD model selectively merges different semantic tokens in multi-head self-attention block to obtain multiscale features,which is crucial for extraction information of remote sensing images(RSIs)with multiple changes from different scales.Moreover,we create a well-annotated SCD dataset,Landsat-SCD with unprecedented time series and change types in complex scenarios.Comparing with three Convolutional Neural Network-based,one attention-based,and two transformer-based networks,experimental results demonstrate that the Pyramid-SCDFormer stably outperforms the existing state-of-the-art CD models and obtains an improvement in MIoU/F1 of 1.11/0.76%,0.57/0.50%,and 8.75/8.59%on the LEVIR-CD,WHU_CD,and Landsat-SCD dataset respectively.For change classes proportion less than 1%,the proposed model improves the MIoU by 7.17–19.53%on Landsat-SCD dataset.The recognition performance for small-scale and fine edges of change types has greatly improved. 展开更多
关键词 Semantic change detection(SCD) change detection dataset transformer siamese network self-attention mechanism bitemporal remote sensing
原文传递
Automated Facial Expression Recognition and Age Estimation Using Deep Learning 被引量:3
18
作者 Syeda Amna Rizwan Yazeed Yasin Ghadi +1 位作者 Ahmad Jalal Kibum Kim 《Computers, Materials & Continua》 SCIE EI 2022年第6期5235-5252,共18页
With the advancement of computer vision techniques in surveillance systems,the need for more proficient,intelligent,and sustainable facial expressions and age recognition is necessary.The main purpose of this study is... With the advancement of computer vision techniques in surveillance systems,the need for more proficient,intelligent,and sustainable facial expressions and age recognition is necessary.The main purpose of this study is to develop accurate facial expressions and an age recognition system that is capable of error-free recognition of human expression and age in both indoor and outdoor environments.The proposed system first takes an input image pre-process it and then detects faces in the entire image.After that landmarks localization helps in the formation of synthetic face mask prediction.A novel set of features are extracted and passed to a classifier for the accurate classification of expressions and age group.The proposed system is tested over two benchmark datasets,namely,the Gallagher collection person dataset and the Images of Groups dataset.The system achieved remarkable results over these benchmark datasets about recognition accuracy and computational time.The proposed system would also be applicable in different consumer application domains such as online business negotiations,consumer behavior analysis,E-learning environments,and emotion robotics. 展开更多
关键词 Feature extraction face expression model local transform features and recurrent neural network(RNN)
在线阅读 下载PDF
Integrated Multi-Head Self-Attention Transformer model for electricity demand prediction incorporating local climate variables 被引量:1
19
作者 Sujan Ghimire Thong Nguyen-Huy +3 位作者 Mohanad S.AL-Musaylh Ravinesh C.Deo David Casillas-Perez Sancho Salcedo-Sanz 《Energy and AI》 2023年第4期620-644,共25页
This paper develops a trustworthy deep learning model that considers electricity demand(G)and local climate conditions.The model utilises Multi-Head Self-Attention Transformer(TNET)to capture critical information from... This paper develops a trustworthy deep learning model that considers electricity demand(G)and local climate conditions.The model utilises Multi-Head Self-Attention Transformer(TNET)to capture critical information from𝐻,to attain reliable predictions with local climate(rainfall,radiation,humidity,evaporation,and maximum and minimum temperatures)data from Energex substations in Queensland,Australia.The TNET model is then evaluated with deep learning models(Long-Short Term Memory LSTM,Bidirectional LSTM BILSTM,Gated Recurrent Unit GRU,Convolutional Neural Networks CNN,and Deep Neural Network DNN)based on robust model assessment metrics.The Kernel Density Estimation method is used to generate the prediction interval(PI)of electricity demand forecasts and derive probability metrics and results to show the developed TNET model is accurate for all the substations.The study concludes that the proposed TNET model is a reliable electricity demand predictive tool that has high accuracy and low predictive errors and could be employed as a stratagem by demand modellers and energy policy-makers who wish to incorporate climatic factors into electricity demand patterns and develop national energy market insights and analysis systems. 展开更多
关键词 Electricity demand forecasting Sustainable energy Artificial Intelligence Deep learning transformer networks Kernel Density Estimation
在线阅读 下载PDF
Graph Enhanced Transformer for Aspect Category Detection
20
作者 陈晨 王厚峰 +1 位作者 朱晴晴 柳军飞 《Journal of Computer Science & Technology》 SCIE EI CSCD 2023年第3期612-625,共14页
Aspect category detection is one challenging subtask of aspect based sentiment analysis, which categorizes a review sentence into a set of predefined aspect categories. Most existing methods regard the aspect category... Aspect category detection is one challenging subtask of aspect based sentiment analysis, which categorizes a review sentence into a set of predefined aspect categories. Most existing methods regard the aspect category detection as a flat classification problem. However, aspect categories are inter-related, and they are usually organized with a hierarchical tree structure. To leverage the structure information, this paper proposes a hierarchical multi-label classification model to detect aspect categories and uses a graph enhanced transformer network to integrate label dependency information into prediction features. Experiments have been conducted on four widely-used benchmark datasets, showing that the proposed model outperforms all strong baselines. 展开更多
关键词 aspect based sentiment analysis aspect category detection hierarchical multi-label classification transformer network
原文传递
上一页 1 2 下一页 到第
使用帮助 返回顶部