期刊文献+
共找到477篇文章
< 1 2 24 >
每页显示 20 50 100
Elimination of the Background Noise of the Decoded Image in Fresnel Zone Plate Scanning Holography
1
作者 孙萍 谢敬辉 《Journal of Beijing Institute of Technology》 EI CAS 2005年第3期225-228,共4页
A method of digitally high pass filtering in frequency domain is proposed to eliminate the background noise of the decoded image in Fresnel zone plate scanning holography. The high pass filter is designed as a circula... A method of digitally high pass filtering in frequency domain is proposed to eliminate the background noise of the decoded image in Fresnel zone plate scanning holography. The high pass filter is designed as a circular stop, which should be suitable to suppressing the background noise significantly and remain much low frequency information of the object. The principle of high pass filtering is that the Fourier transform of the decoded image is multiplied with the high pass filter. Thus the frequency spectrum of the decoded image without the background noise is achieved. By inverse Fourier transform of the spectrum of the decoded image after multiplying operation, the decoded image without the background noise is obtained. Both of the computer simulations and the experimental results show that the contrast and the signal-to-noise ratio of the decoded image are significantly improved with digital filtering. 展开更多
关键词 Fresnel zone plate scanning holography digitally high pass filtering coded image decoded image
在线阅读 下载PDF
Differences in Mean Number of Consonant-Vowel-Consonant Words Decoded between Letter-Sound Readers and Non Letter-Sound Readers
2
作者 Gail M. Wolf 《Open Journal of Nursing》 2014年第6期409-450,共42页
Children’s failure to develop simple word decoding skills in early years is linked to future poor reading, school dropout, and poor health [1] [2]. Letter-sound knowledge is needed for word decoding development;howev... Children’s failure to develop simple word decoding skills in early years is linked to future poor reading, school dropout, and poor health [1] [2]. Letter-sound knowledge is needed for word decoding development;however questions remain on what types of letter-sound knowledge help children decode simple words [3]. This study investigated the differences in mean number of consonant-vowel-consonant (CVC) words decoded between two groups of children, a letter-sound reading group and non letter-sound reading group. Children aged 4 to 6 in both groups, attempted to decode a variety of simple words such as tan, sit, hen, pig, dot, and fun. Analysis determined word decoding differences existed between the two groups. The alternate hypothesis was accepted;the letter-sound reading group had a significantly higher mean in number of consonant-vowel-consonant words decoded compared to the non letter-sound reading group. The study informs the teaching approaches needed to improve early decoding skills showing letter-sound reading ability is an important step for learning to decode simple consonant-vowel-consonant words. 展开更多
关键词 Letter-Sound READING Decoding Automaticity PRACTICING READING
暂未订购
大视场下火龙果目标检测与计数方法
3
作者 欧阳春凡 高嘉正 +5 位作者 陈桥 曾春林 李文涛 肖明玮 罗陈迪 周学成 《中国农业科技导报(中英文)》 北大核心 2025年第8期100-109,共10页
为解决大视场条件下小目标火龙果识别精度低、实时性差、果实计数困难的问题,提出一种大视场下火龙果目标检测与计数方法,实现火龙果小目标精准识别与计数,完善火龙果机器人采前指导性工作。在目标特征提取阶段,采用动态可变形卷积C2F_D... 为解决大视场条件下小目标火龙果识别精度低、实时性差、果实计数困难的问题,提出一种大视场下火龙果目标检测与计数方法,实现火龙果小目标精准识别与计数,完善火龙果机器人采前指导性工作。在目标特征提取阶段,采用动态可变形卷积C2F_DCNV2_Dynamic替换YOLOv8主干网络的C2F模块,引入Conv_offer_mask获取输入特征图的可变形偏移和掩码,使网络能够更好地适应目标形状的特征,提升复杂背景的目标特征提取能力;以多路协调注意力(multipath coordinate attention,MPCA)机制模块对输入进行多路径处理,使模型可以同时关注输入张量的空间信息与通道信息,提高网络对不同尺度和语境的特征感知能力,进而提升小目标识别精度;在目标预测阶段,使用基于端到端Transformer的检测器RT-DETR的Decoder Head替换YOLO Head,通过集合预测方法直接对目标进行预测和关联,去除传统非极大抑制步骤,提高推理速度,进一步提升网络实时性能;在目标计数阶段,结合Deep Sort算法实现果实区域计数。结果表明,改进的目标检测网络对火龙果果实检测的平均精度可达99.0%,在实时性测试中每秒传输32帧,模型大小为11.8 MB,果实计数精度达82.96%,平均检测速度为17帧·s^(−1)。该方法能够精准识别与计数大视场条件下的小目标火龙果,且实时性满足果园实际生产环境。 展开更多
关键词 小目标火龙果识别 果实计数 YOLOv8 C2F_DCNV2_Dynamic MPCA Decoder Head
在线阅读 下载PDF
Research on deep learning decoding method for polar codes in ACO-OFDM spatial optical communication system
4
作者 LIU Kangrui LI Ming +2 位作者 CHEN Sizhe QU Jiashun ZHOU Ming’ou 《Optoelectronics Letters》 2025年第7期427-433,共7页
Aiming at the problem that the bit error rate(BER)of asymmetrically clipped optical orthogonal frequency division multiplexing(ACO-OFDM)space optical communication system is significantly affected by different turbule... Aiming at the problem that the bit error rate(BER)of asymmetrically clipped optical orthogonal frequency division multiplexing(ACO-OFDM)space optical communication system is significantly affected by different turbulence intensities,the deep learning technique is proposed to the polarization code decoding in ACO-OFDM space optical communication system.Moreover,this system realizes the polarization code decoding and signal demodulation without frequency conduction with superior performance and robustness compared with the performance of traditional decoder.Simulations under different turbulence intensities as well as different mapping orders show that the convolutional neural network(CNN)decoder trained under weak-medium-strong turbulence atmospheric channels achieves a performance improvement of about 10^(2)compared to the conventional decoder at 4-quadrature amplitude modulation(4QAM),and the BERs for both 16QAM and 64QAM are in between those of the conventional decoder. 展开更多
关键词 frequency conduction polar codes deep learning signal demodulation deep learning technique DECODING ACO OFDM polarization code decoding
原文传递
Spectral Efficiency of Cell-Free Massive MIMO with LSFD Receiver Under Spatially Correlated Rayleigh Fading Channels
5
作者 Li Ning Fan Pingzhi 《China Communications》 2025年第8期1-18,共18页
This paper investigates the uplink spectral efficiency of distributed cell-free(CF)massive multiple-input multiple-output(mMIMO)networks with correlated Rayleigh fading channels based on three different channel estima... This paper investigates the uplink spectral efficiency of distributed cell-free(CF)massive multiple-input multiple-output(mMIMO)networks with correlated Rayleigh fading channels based on three different channel estimation schemes.Specifically,each access point(AP)first uses embedded pilots to estimate the channels of all users based on minimum mean-squared error(MMSE)estimation.Given the high computational cost of MMSE estimation,the low-complexity element-wise MMSE(EW-MMSE)channel estimator and the least-squares(LS)channel estimator without prior statistical information are also analyzed.To reduce non-coherent and coherent interference during uplink payload data transmission,simple centralized decoding(SCD)and large-scale fading decoding(LSFD)are examined.Then,the closedform expressions for uplink spectral efficiency(SE)using MMSE,EW-MMSE,and LS estimators are developed for maximum ratio(MR)combining under LSFD,where each AP may have any number of antennas.The sum SE maximization problem with uplink power control is formulated.Since the maximization problem is non-convex and challenging,a block coordinate descent approach based on the weighted MMSE method is used to get the optimal local solution.Numerical studies demonstrate that LSFD and efficient uplink power control can considerably increase SE in distributed CF m MIMO networks. 展开更多
关键词 CF mMIMO channel estimation largescale fading decoding simple centralized decoding sum SE optimization
在线阅读 下载PDF
Design of improved error-rate sliding window decoder for SC-LDPC codes: reliable termination and channel value reuse
6
作者 JIA Xishan LI Jining +3 位作者 YAO Yuan WANG Yifan LIU Bo XU Degang 《Optoelectronics Letters》 2025年第4期212-217,共6页
In this paper,an improved error-rate sliding window decoder is proposed for spatially coupled low-density parity-check(SC-LDPC)codes.For the conventional sliding window decoder,the message retention mechanism causes u... In this paper,an improved error-rate sliding window decoder is proposed for spatially coupled low-density parity-check(SC-LDPC)codes.For the conventional sliding window decoder,the message retention mechanism causes unreliable messages along the edges of belief propagation(BP)decoding in the current window to be kept for subsequent window decoding.To improve the reliability of the retained messages during the window transition,a reliable termination method is embedded,where the retained messages undergo more reliable parity checks.Additionally,decoding failure is unavoidable and even causes error propagation when the number of errors exceeds the error-correcting capability of the window.To mitigate this problem,a channel value reuse mechanism is designed,where the received channel values are utilized to reinitialize the window.Furthermore,considering the complexity and performance of decoding,a feasible sliding optimized window decoding(SOWD)scheme is introduced.Finally,simulation results confirm the superior performance of the proposed SOWD scheme in both the waterfall and error floor regions.This work has great potential in the applications of wireless optical communication and fiber optic communication. 展开更多
关键词 reliable termination message retention mechanism reliable termination method sliding window decoderthe error rate sliding window decoder belief propagation bp decoding retained messages
原文传递
Large Language Models With Contrastive Decoding Algorithm for Hallucination Mitigation in Low-Resource Languages
7
作者 Zan Hongying Arifa Javed +2 位作者 Muhammad Abdullah Javed Rashid Muhammad Faheem 《CAAI Transactions on Intelligence Technology》 2025年第4期1104-1117,共14页
Neural machine translation(NMT)has advanced with deep learning and large-scale multilingual models,yet translating lowresource languages often lacks sufficient training data and leads to hallucinations.This often resu... Neural machine translation(NMT)has advanced with deep learning and large-scale multilingual models,yet translating lowresource languages often lacks sufficient training data and leads to hallucinations.This often results in translated content that diverges significantly from the source text.This research proposes a refined Contrastive Decoding(CD)algorithm that dynamically adjusts weights of log probabilities from strong expert and weak amateur models to mitigate hallucinations in lowresource NMT and improve translation quality.Advanced large language NMT models,including ChatGLM and LLaMA,are fine-tuned and implemented for their superior contextual understanding and cross-lingual capabilities.The refined CD algorithm evaluates multiple candidate translations using BLEU score,semantic similarity,and Named Entity Recognition accuracy.Extensive experimental results show substantial improvements in translation quality and a significant reduction in hallucination rates.Fine-tuned models achieve higher evaluation metrics compared to baseline models and state-of-the-art models.An ablation study confirms the contributions of each methodological component and highlights the effectiveness of the refined CD algorithm and advanced models in mitigating hallucinations.Notably,the refined methodology increased the BLEU score by approximately 30%compared to baseline models. 展开更多
关键词 ChatGLM contrastive decoding HALLUCINATION LLAMA LLM low resource NMT
在线阅读 下载PDF
An Efficient Temporal Decoding Module for Action Recognition
8
作者 HUANG Qiubo MEI Jianmin +3 位作者 ZHAO Wupeng LU Yiru WANG Mei CHEN Dehua 《Journal of Donghua University(English Edition)》 2025年第2期187-196,共10页
Action recognition,a fundamental task in the field of video understanding,has been extensively researched and applied.In contrast to an image,a video introduces an extra temporal dimension.However,many existing action... Action recognition,a fundamental task in the field of video understanding,has been extensively researched and applied.In contrast to an image,a video introduces an extra temporal dimension.However,many existing action recognition networks either perform simple temporal fusion through averaging or rely on pre-trained models from image recognition,resulting in limited temporal information extraction capabilities.This work proposes a highly efficient temporal decoding module that can be seamlessly integrated into any action recognition backbone network to enhance the focus on temporal relationships between video frames.Firstly,the decoder initializes a set of learnable queries,termed video-level action category prediction queries.Then,they are combined with the video frame features extracted by the backbone network after self-attention learning to extract video context information.Finally,these prediction queries with rich temporal features are used for category prediction.Experimental results on HMDB51,MSRDailyAct3D,Diving48 and Breakfast datasets show that using TokShift-Transformer and VideoMAE as encoders results in a significant improvement in Top-1 accuracy compared to the original models(TokShift-Transformer and VideoMAE),after introducing the proposed temporal decoder.The introduction of the temporal decoder results in an average performance increase exceeding 11%for TokShift-Transformer and nearly 5%for VideoMAE across the four datasets.Furthermore,the work explores the combination of the decoder with various action recognition networks,including Timesformer,as encoders.This results in an average accuracy improvement of more than 3.5%on the HMDB51 dataset.The code is available at https://github.com/huangturbo/TempDecoder. 展开更多
关键词 action recognition video understanding temporal relationship temporal decoder TRANSFORMER
在线阅读 下载PDF
Anti-Interference High-Speed Modulation Decoder for Quantum Key Distribution
9
作者 Hua-Xing Xu Shao-Hua Wang +1 位作者 Chang-Lei Wang Ping Zhang 《Chinese Physics Letters》 2025年第1期34-39,共6页
Quantum key distribution is increasingly transitioning toward network applications,necessitating advancements in system performance,including photonic integration for compact designs,enhanced stability against environ... Quantum key distribution is increasingly transitioning toward network applications,necessitating advancements in system performance,including photonic integration for compact designs,enhanced stability against environmental disturbances,higher key rates,and improved efficiency.In this letter,we propose an orthogonal polarization exchange reflector Michelson interferometer model to address quantum channel disturbances caused by environmental factors.Based on this model,we designed a Sagnac reflector-Michelson interferometer decoder and verified its performance through an interference system.The interference fringe visibility exceeded 98%across all four coding phases at 625 MHz.These results indicate that the decoder effectively mitigates environmental interference while supporting high-speed modulation frequencies.In addition,the proposed anti-interference decoder,which does not rely on magneto-optical devices,is well-suited for photonic integration,aligning with the development trajectory for next-generation quantum communication devices. 展开更多
关键词 DECODER INTERFEROMETER POLARIZATION
原文传递
轻量级低光照图像增强算法研究
10
作者 单慧 《电脑编程技巧与维护》 2025年第7期154-156,共3页
提出一种结合注意力机制与U-Net结构的低光照图像增强算法,由MSFE、Encoder、Bottle-neck、Decoder和RFRM 5个模块组成。通过联合注意力、轴向注意力和细节增强模块,有效提升图像清晰度并降低噪声。实验结果验证了该方法的有效性。
关键词 MSFE模块 Encoder模块 Bottleneck模块 Decoder模块 RFRM模块
在线阅读 下载PDF
Low Complexity Successive Cancellation List Decoding of U-UV Codes
11
作者 Chen Wenhao Chen Li +1 位作者 Lin Jingyu Zhang Huazi 《China Communications》 2025年第1期41-60,共20页
Constituted by BCH component codes and its ordered statistics decoding(OSD),the successive cancellation list(SCL)decoding of U-UV structural codes can provide competent error-correction performance in the short-to-med... Constituted by BCH component codes and its ordered statistics decoding(OSD),the successive cancellation list(SCL)decoding of U-UV structural codes can provide competent error-correction performance in the short-to-medium length regime.However,this list decoding complexity becomes formidable as the decoding output list size increases.This is primarily incurred by the OSD.Addressing this challenge,this paper proposes the low complexity SCL decoding through reducing the complexity of component code decoding,and pruning the redundant SCL decoding paths.For the former,an efficient skipping rule is introduced for the OSD so that the higher order decoding can be skipped when they are not possible to provide a more likely codeword candidate.It is further extended to the OSD variant,the box-andmatch algorithm(BMA),in facilitating the component code decoding.Moreover,through estimating the correlation distance lower bounds(CDLBs)of the component code decoding outputs,a path pruning(PP)-SCL decoding is proposed to further facilitate the decoding of U-UV codes.In particular,its integration with the improved OSD and BMA is discussed.Simulation results show that significant complexity reduction can be achieved.Consequently,the U-UV codes can outperform the cyclic redundancy check(CRC)-polar codes with a similar decoding complexity. 展开更多
关键词 ordered statistics decoding successive cancellation list decoding U-UV codes
在线阅读 下载PDF
Decoding of Surface Meteorological Observation Data Files and Application Research on Climatic Data
12
作者 Hui LIANG Xianqiang SU Qingyun ZHU 《Meteorological and Environmental Research》 2025年第2期16-21,25,共7页
In this paper,Wuzhou City of Guangxi was taken as the research object.Through the design of a climatic data warehousing system,the decoding methods of surface meteorological data and their application in the managemen... In this paper,Wuzhou City of Guangxi was taken as the research object.Through the design of a climatic data warehousing system,the decoding methods of surface meteorological data and their application in the management of climatic data were explored.Based on the parsing technology of the monthly report of surface meteorological records(A-file),the design of Wuzhou climatic data warehousing system was realized,completing the precise extraction and database construction of observational elements such as regional temperature,wind direction,and weather phenomena.Based on this system,the meteorological data in 2024 were analyzed,and the probabilistic characteristics of dominant wind direction in Wuzhou(northeast wind accounting for the largest proportion),the spatiotemporal distribution patterns of extreme temperatures(annual extreme high temperature of 37.1℃in August and extreme low temperature of 1.9℃in January),and the general climatic overview of Wuzhou City(annual precipitation 3.2%higher than the standard value)were revealed.The research shows that climate change has a significant impact on agricultural production and economic development in Wuzhou City,and the construction of a reasonable climatic data database is of great significance for enhancing professional meteorological service capabilities in the context of climate change.This study not only provides a scientific basis for the economic development of Wuzhou region,but also offers reference ideas for other regions to cope with regional climate adaptation planning. 展开更多
关键词 Surface meteorological observation A-file decoding Climatic database Climate change
在线阅读 下载PDF
Molecular Retrosynthesis Top-K Prediction Based on the Latent Generation Process
13
作者 Yupeng Liu Han Zhang Rui Hu 《CAAI Transactions on Intelligence Technology》 2025年第3期902-911,共10页
In the field of organic synthesis,the core objective of retrosynthetic methods is to deduce possible synthetic routes and precursor molecules for complex target molecules.Traditional retrosynthetic methods,such as tem... In the field of organic synthesis,the core objective of retrosynthetic methods is to deduce possible synthetic routes and precursor molecules for complex target molecules.Traditional retrosynthetic methods,such as template-based retrosynthesis,have high accuracy and interpretability in specific types of reactions but are limited by the scope of the template library,making it difficult to adapt to new or uncommon reaction types.Moreover,sequence-to-sequence retrosynthetic prediction methods,although they enhance the flexibility of prediction,often overlook the complexity of molecular graph structures and the actual interactions between atoms,which limits the accuracy and reliability of the predictions.To address these limitations,this paper proposes a Molecular Retrosynthesis Top-k Prediction based on the Latent Generation Process(MRLGP)that uses latent variables from graph neural networks to model the generation process and produce diverse set of reactants.Utilising an encoding method based on Graphormer,the authors have also introduced topology-aware positional encoding to better capture the interactions between atomic nodes in the molecular graph structure,thereby more accurately simulating the retrosynthetic process.The MRLGP model significantly enhances the accuracy and diversity of predictions by correlating discrete latent variables with the reactant generation process and progressively constructing molecular graphs using a variational autoregressive decoder.Experimental results on benchmark datasets such as USPTO-50k,USPTO-Full,and USPTO-DIVERSE demonstrate that MRLGP outperforms baseline models on multiple Top-k evaluation metrics.Additionally,ablation experiments conducted on the USPTO-50K dataset further validate the effectiveness of the methods used in the encoder and decoder parts of the model. 展开更多
关键词 latent variable molecular retrosynthesis TOPOLOGY-AWARE variational autoregressive decoder
在线阅读 下载PDF
Modulation of tRNA^(Cln)decoding efficacy by metal ion binding and glutamine supply
14
作者 Yuxuan Shen Tianchang Wang +3 位作者 Hua Qiao Qing Liang Jingru Lv Qing Xia 《Journal of Chinese Pharmaceutical Sciences》 2025年第1期28-40,共13页
Transfer RNAs(tRNAs)adopt a stable L-shaped tertiary structure crucial for their involvement in protein translation.Among various divalent metal ions,magnesium ions play a pivotal role in preserving the tertiary struc... Transfer RNAs(tRNAs)adopt a stable L-shaped tertiary structure crucial for their involvement in protein translation.Among various divalent metal ions,magnesium ions play a pivotal role in preserving the tertiary structure of tRNA.However,the precise location of the Mg^(2+)binding pocket in human tRNA remains elusive.In this investigation,we identified the Mg^(2+)binding site within human tRNAGln using suppressor tRNA^(Gln).This variant of tRNA recognizes premature stop codons(specificlly UAG)and facilitates the expression of fll-length proteis.By mutating sites 8 and C72 in supprssr tRNAcl,we assessed the decoding efficiency of the resulting mutant suppressor tRNAs,which serves as a measure of tRNA's ability to decode genetic information.Our analysis revealed that the U8C mutant suppressor tRNA exhibited a significantly lower Mg^(2+)content compared to the C72U mutant.Furthermore,we observed a notable reduction in decoding efficiency in the U8-mutated suppressor tRNA,as evidenced by GFP fluorescence and Western blotting analysis.Conversely,mutations at the C72 site had a comparatively minor impact on decoding efficiency.These findings underscored the tight binding of Mg^(2+)to the U8 site of human tRNAGln,crucial for maintaining the stability of tRNA tertiary structure and translation efficacy.Additionally,our investigation delved into the influence of glutamine availability on tRNA decoding efficiency at the cellular level.The results indicated that both the concentration of amino acids and the codon context of TAG could modulate tRNA decoding efficiency.This study provided valuable insights into the structure and function of tRNA,laying the groundwork for further exploration in this field. 展开更多
关键词 Metal ions tRNA tertiary structure Glutamine supply Decoding efficacy
原文传递
An Optimization of Weak Key Attacks Based on the BGF Decoding Algorithm
15
作者 Bing Liu Ting Nie +1 位作者 Yansong Liu Weibo Hu 《Computers, Materials & Continua》 2025年第9期4583-4599,共17页
Among the four candidate algorithms in the fourth round of NIST standardization,the BIKE(Bit Flipping Key Encapsulation)scheme has a small key size and high efficiency,showing good prospects for application.However,th... Among the four candidate algorithms in the fourth round of NIST standardization,the BIKE(Bit Flipping Key Encapsulation)scheme has a small key size and high efficiency,showing good prospects for application.However,the BIKE scheme based on QC-MDPC(Quasi Cyclic Medium Density Parity Check)codes still faces challenges such as the GJS attack and weak key attacks targeting the decoding failure rate(DFR).This paper analyzes the BGF decoding algorithm of the BIKE scheme,revealing two deep factors that lead to DFR,and proposes a weak key optimization attack method for the BGF decoding algorithm based on these two factors.The proposed method constructs a new weak key set,and experiment results eventually indicate that,considering BIKE’s parameter set targeting 128-bit security,the average decryption failure rate is lowerly bounded by.This result not only highlights a significant vulnerability in the BIKE scheme but also provides valuable insights for future improvements in its design.By addressing these weaknesses,the robustness of QC-MDPC code-based cryptographic systems can be enhanced,paving the way for more secure post-quantum cryptographic solutions. 展开更多
关键词 BIKE BGF decoding algorithm weak key attack GJS attack
在线阅读 下载PDF
Remote sensing image semantic segmentation algorithm based on improved DeepLabv3+
16
作者 SONG Xirui GE Hongwei LI Ting 《Journal of Measurement Science and Instrumentation》 2025年第2期205-215,共11页
The convolutional neural network(CNN)method based on DeepLabv3+has some problems in the semantic segmentation task of high-resolution remote sensing images,such as fixed receiving field size of feature extraction,lack... The convolutional neural network(CNN)method based on DeepLabv3+has some problems in the semantic segmentation task of high-resolution remote sensing images,such as fixed receiving field size of feature extraction,lack of semantic information,high decoder magnification,and insufficient detail retention ability.A hierarchical feature fusion network(HFFNet)was proposed.Firstly,a combination of transformer and CNN architectures was employed for feature extraction from images of varying resolutions.The extracted features were processed independently.Subsequently,the features from the transformer and CNN were fused under the guidance of features from different sources.This fusion process assisted in restoring information more comprehensively during the decoding stage.Furthermore,a spatial channel attention module was designed in the final stage of decoding to refine features and reduce the semantic gap between shallow CNN features and deep decoder features.The experimental results showed that HFFNet had superior performance on UAVid,LoveDA,Potsdam,and Vaihingen datasets,and its cross-linking index was better than DeepLabv3+and other competing methods,showing strong generalization ability. 展开更多
关键词 semantic segmentation high-resolution remote sensing image deep learning transformer model attention mechanism feature fusion ENCODER DECODER
在线阅读 下载PDF
Deep Learning-Based Lip-Reading for Vocal Impaired Patient Rehabilitation
17
作者 Chiara Innocente Matteo Boemio +6 位作者 Gianmarco Lorenzetti Ilaria Pulito Diego Romagnoli Valeria Saponaro Giorgia Marullo Luca Ulrich Enrico Vezzetti 《Computer Modeling in Engineering & Sciences》 2025年第5期1355-1379,共25页
Lip-reading technology,based on visual speech decoding and automatic speech recognition,offers a promising solution to overcoming communication barriers,particularly for individuals with temporary or permanent speech ... Lip-reading technology,based on visual speech decoding and automatic speech recognition,offers a promising solution to overcoming communication barriers,particularly for individuals with temporary or permanent speech impairments.However,most Visual Speech Recognition(VSR)research has primarily focused on the English language and general-purpose applications,limiting its practical applicability in medical and rehabilitative settings.This study introduces the first Deep Learning(DL)based lip-reading system for the Italian language designed to assist individuals with vocal cord pathologies in daily interactions,facilitating communication for patients recovering from vocal cord surgeries,whether temporarily or permanently impaired.To ensure relevance and effectiveness in real-world scenarios,a carefully curated vocabulary of twenty-five Italian words was selected,encompassing critical semantic fields such as Needs,Questions,Answers,Emergencies,Greetings,Requests,and Body Parts.These words were chosen to address both essential daily communication and urgent medical assistance requests.Our approach combines a spatiotemporal Convolutional Neural Network(CNN)with a bidirectional Long Short-Term Memory(BiLSTM)recurrent network,and a Connectionist Temporal Classification(CTC)loss function to recognize individual words,without requiring predefined words boundaries.The experimental results demonstrate the system’s robust performance in recognizing target words,reaching an average accuracy of 96.4%in individual word recognition,suggesting that the system is particularly well-suited for offering support in constrained clinical and caregiving environments,where quick and reliable communication is critical.In conclusion,the study highlights the importance of developing language-specific,application-driven VSR solutions,particularly for non-English languages with limited linguistic resources.By bridging the gap between deep learning-based lip-reading and real-world clinical needs,this research advances assistive communication technologies,paving the way for more inclusive and medically relevant applications of VSR in rehabilitation and healthcare. 展开更多
关键词 LIP-READING deep learning automatic speech recognition visual speech decoding 3D convolutional neural network
在线阅读 下载PDF
Effects of Different Representations for Solving Integrated Production and Transportation Scheduling Problem
18
作者 Youjie Yao Qingzheng Wang +1 位作者 Cuiyu Wang Xinyu Li 《Chinese Journal of Mechanical Engineering》 2025年第3期209-228,共20页
With the growing adoption of automated guided vehicles(AGVs)in various industries,the integrated production and transportation scheduling problem(IPTSP)has emerged as a critical research focus.The IPTSP is classified ... With the growing adoption of automated guided vehicles(AGVs)in various industries,the integrated production and transportation scheduling problem(IPTSP)has emerged as a critical research focus.The IPTSP is classified as a strongly NP-hard problem due to the simultaneous scheduling of two resources:machines and transportation equipment.Meta-heuristic algorithms are one of the most popular and effective approaches to solving this problem.However,their effectiveness heavily depends on the choice of solution representation,which influences both the algorithm’s search space and convergence speed.This paper reviews the existing encoding and decoding methods and proposes a novel active decoding approach.Based on different combinations of encoding and decoding methods,six solution representations are identified,among which the newly proposed representation offers a trade-off between the search space and the algorithm’s efficiency.Specifically,four scenarios of IPTSP under different assumptions are first analyzed.Next,the variations in the six solution representations across unused scenarios and different layouts,as well as their respective encoding spaces and qualities,are summarized.Subsequently,the search efficiency of the six solution representations is evaluated using a genetic algorithm to analyze their performance under different scenarios,layouts,time ratios,and number of AGVs.Finally,the advantages,disadvantages and applicable scenes for each solution representation are summarized based on the experimental results and analysis.These findings provide valuable insights for designing more efficient algorithms to address the IPTSP. 展开更多
关键词 Integrated scheduling Processing and transportation Encoding and decoding Genetic algorithm
在线阅读 下载PDF
Digital Dissemination and Consumption of Cultural Symbols in Lingnan’s Traditional Villages
19
作者 Xinrui Cai 《Proceedings of Business and Economic Studies》 2025年第6期24-31,共8页
Ancient villages in Lingnan serve as crucial carriers of Lingnan culture.Their abundant cultural symbols now face the dual task of inheritance and innovation in the digital era.Drawing on Stuart Hall’s encoding/decod... Ancient villages in Lingnan serve as crucial carriers of Lingnan culture.Their abundant cultural symbols now face the dual task of inheritance and innovation in the digital era.Drawing on Stuart Hall’s encoding/decoding theory,this study explores how representative cultural symbols of Lingnan’s ancient villages are digitally translated and disseminated.By analyzing specific cases,it elucidates the logic of audience interaction and consumption during the decoding of these digital cultural symbols.This study aims to offer valuable insights for revitalizing ancient village culture and informing its sustainable industrial development. 展开更多
关键词 Ancient villages of Lingnan Cultural symbols Digital dissemination Encoding and decoding Symbolic consumption
在线阅读 下载PDF
Pyramid–MixNet: Integrate Attention into Encoder-Decoder Transformer Framework for Automatic Railway Surface Damage Segmentation
20
作者 Hui Luo Wenqing Li Wei Zeng 《Computers, Materials & Continua》 2025年第7期1567-1580,共14页
Rail surface damage is a critical component of high-speed railway infrastructure,directly affecting train operational stability and safety.Existing methods face limitations in accuracy and speed for small-sample,multi... Rail surface damage is a critical component of high-speed railway infrastructure,directly affecting train operational stability and safety.Existing methods face limitations in accuracy and speed for small-sample,multi-category,and multi-scale target segmentation tasks.To address these challenges,this paper proposes Pyramid-MixNet,an intelligent segmentation model for high-speed rail surface damage,leveraging dataset construction and expansion alongside a feature pyramid-based encoder-decoder network with multi-attention mechanisms.The encoding net-work integrates Spatial Reduction Masked Multi-Head Attention(SRMMHA)to enhance global feature extraction while reducing trainable parameters.The decoding network incorporates Mix-Attention(MA),enabling multi-scale structural understanding and cross-scale token group correlation learning.Experimental results demonstrate that the proposed method achieves 62.17%average segmentation accuracy,80.28%Damage Dice Coefficient,and 56.83 FPS,meeting real-time detection requirements.The model’s high accuracy and scene adaptability significantly improve the detection of small-scale and complex multi-scale rail damage,offering practical value for real-time monitoring in high-speed railway maintenance systems. 展开更多
关键词 Pyramid vision transformer encoder–decoder architecture railway damage segmentation masked multi-head attention mix-attention
在线阅读 下载PDF
上一页 1 2 24 下一页 到第
使用帮助 返回顶部