期刊文献+
共找到981篇文章
< 1 2 50 >
每页显示 20 50 100
Smart phone-based context-aware augmentative and alternative communications system
1
作者 PARK DongGyu SONG Sejun LEE DoHoon 《Journal of Central South University》 SCIE EI CAS 2014年第9期3551-3558,共8页
A smartphone-based context-aware augmentative and alternative communication(AAC) was applied was in order to enhance the user's experience by providing simple, adaptive, and intuitive interfaces. Various potential... A smartphone-based context-aware augmentative and alternative communication(AAC) was applied was in order to enhance the user's experience by providing simple, adaptive, and intuitive interfaces. Various potential context-aware technologies and AAC usage scenarios were studied, and an efficient communication system was developed by combining smartphone's multimedia functions and its optimized sensor technologies. The experimental results show that context-awareness accuracy is achieved up to 97%. 展开更多
关键词 augmentative alternative communication system context awareness mobile system location based services
在线阅读 下载PDF
Deep Neural Network-based Speaker-Aware Information Logging for Augmentative and Alternative Communication
2
作者 Gang Hu Szu-Han Kay Chen Neal Mazur 《Journal of Artificial Intelligence and Technology》 2021年第2期138-143,共6页
People with complex communication needs can use a high-technology augmentative and alternative communication device to communicate with others.Currently,researchers and clinicians often use data logging from high-tech... People with complex communication needs can use a high-technology augmentative and alternative communication device to communicate with others.Currently,researchers and clinicians often use data logging from high-tech augmentative and alternative communication devices to analyze augmentative and alternative communication user performance.However,existing automated data logging systems cannot differentiate the authorship of the data log when more than one user accesses the device.This issue reduces the validity of the data logs and increases the difficulties of performance analysis.Therefore,this paper presents a solution using a deep neural network-based visual analysis approach to process videos to detect different augmentative and alternative communication users in practice sessions.This approach has significant potential to improve the validity of data logs and ultimately to enhance augmentative and alternative communication outcome measures. 展开更多
关键词 augmentative and alternative communication(AAC) outcome measures visual logs hand tracking deep learning
在线阅读 下载PDF
Effectiveness and safety of augmentative plating technique in managing nonunion following intramedullary nailing of long bones in the lower extremity:A systematic review and meta-analysis
3
作者 Cong-Xiao Fu Hao Gao +8 位作者 Jun Ren Hu Wang Shuai-Kun Lu Guo-Liang Wang Zhen-Feng Zhu Yun-Yan Liu Wen Luo Yong Zhang Yun-Fei Zhang 《Chinese Journal of Traumatology》 2025年第3期164-174,共11页
Purpose:To methodically assess the effectiveness of augmentative plating(AP)and exchange nailing(EN)in managing nonunion following intramedullary nailing for long bone fractures of the lower extremity.Methods:PubMed,E... Purpose:To methodically assess the effectiveness of augmentative plating(AP)and exchange nailing(EN)in managing nonunion following intramedullary nailing for long bone fractures of the lower extremity.Methods:PubMed,EMBASE,Web of Science,and the Cochrane Library were searched to gather clinical studies regarding the use of AP and EN techniques in the treatment of nonunion following intramedullary nailing of lower extremity long bones.The search was conducted up until May 2023.The original studies underwent an independent assessment of their quality,a process conducted utilizing the Newcastle-Ottawa scale.Data were retrieved from these studies,and meta-analysis was executed utilizing Review Manager 5.3.Results:This meta-analysis included 8 studies involving 661 participants,with 305 in the AP group and 356 in the EN group.The results of the meta-analysis demonstrated that the AP group exhibited a higher rate of union(odds ratio:8.61,95%confidence intervals(CI):4.1217.99,p<0.001),shorter union time(standardized mean difference(SMD):-1.08,95%CI:-1.79--0.37,p=0.003),reduced duration of the surgical procedure(SMD:-0.56,95%CI:-0.93--0.19,p=0.003),less bleeding(SMD:-1.5,95%CI:-2.81--0.18,p=0.03),and a lower incidence of complications(relative risk:-0.17,95%CI:-0.27--0.06,p=0.001).In the subgroup analysis,the time for union in the AP group in nonisthmal and isthmal nonunion of lower extremity long bones was shorter compared to the EN group(nonisthmal SMD:-1.94,95%CI:-3.28--0.61,p<0.001;isthmal SMD:-1.08,95%CI:-1.64--0.52,p=0.002).Conclusion:In the treatment of nonunion in diaphyseal fractures of the long bones in the lower extremity,the AP approach is superior to EN,both intraoperatively(with reduced duration of the surgical procedure and diminished blood loss)and postoperatively(with an elevated union rate,shorter union time,and lower incidence of complications).Specifically,in the management of nonunion of lower extremity long bones with non-isthmal and isthmal intramedullary nails,AP demonstrated shorter union time in comparison to EN. 展开更多
关键词 augmentative plating Exchange nailing Lower extremity long bone fractures Intramedullary nail NONUNION
原文传递
Design of Navigation Message Authentication for BDSBAS System
4
作者 Chen Xiao Tian Xiang +2 位作者 Luo Ruidan Liu Ting Wu Haitao 《China Communications》 2026年第1期189-203,共15页
The satellite-based augmentation system(SBAS)provides differential and integrity augmentation services for life safety fields of aviation and navigation.However,the signal structure of SBAS is public,which incurs a ri... The satellite-based augmentation system(SBAS)provides differential and integrity augmentation services for life safety fields of aviation and navigation.However,the signal structure of SBAS is public,which incurs a risk of spoofing attacks.To improve the anti-spoofing capability of the SBAS,European Union and the United States conduct research on navigation message authentication,and promote the standardization of SBAS message authentication.For the development of Beidou satellite-based augmentation system(BDSBAS),this paper proposes navigation message authentication based on the Chinese commercial cryptographic standards.Firstly,this paper expounds the architecture and principles of the SBAS message authentication,and then carries out the design of timed efficient streaming losstolerant authentication scheme(TESLA)and elliptic curve digital signature algorithm(ECDSA)authentication schemes based on Chinese commercial cryptographic standards,message arrangement and the design of over-the-air rekeying(OTAR)message.Finally,this paper conducts a theoretical analysis of the time between authentications(TBA)and maximum authentication latency(MAL)for L5 TESLA-I and L5 ECDSA-Q,and further simulates the reception time of OTAR message,TBA and MAL from the aspects of OTAR message weight and demodulation error rate.The simulation results can provide theoretical supports for the standardization of BDSBAS message authentication. 展开更多
关键词 Beidou satellite-based augmentation system ECDSA message authentication satellite navigation TESLA
在线阅读 下载PDF
A Fine-Grained RecognitionModel based on Discriminative Region Localization and Efficient Second-Order Feature Encoding
5
作者 Xiaorui Zhang Yingying Wang +3 位作者 Wei Sun Shiyu Zhou Haoming Zhang Pengpai Wang 《Computers, Materials & Continua》 2026年第4期946-965,共20页
Discriminative region localization and efficient feature encoding are crucial for fine-grained object recognition.However,existing data augmentation methods struggle to accurately locate discriminative regions in comp... Discriminative region localization and efficient feature encoding are crucial for fine-grained object recognition.However,existing data augmentation methods struggle to accurately locate discriminative regions in complex backgrounds,small target objects,and limited training data,leading to poor recognition.Fine-grained images exhibit“small inter-class differences,”and while second-order feature encoding enhances discrimination,it often requires dual Convolutional Neural Networks(CNN),increasing training time and complexity.This study proposes a model integrating discriminative region localization and efficient second-order feature encoding.By ranking feature map channels via a fully connected layer,it selects high-importance channels to generate an enhanced map,accurately locating discriminative regions.Cropping and erasing augmentations further refine recognition.To improve efficiency,a novel second-order feature encoding module generates an attention map from the fourth convolutional group of Residual Network 50 layers(ResNet-50)and multiplies it with features from the fifth group,producing second-order features while reducing dimensionality and training time.Experiments on Caltech-University of California,San Diego Birds-200-2011(CUB-200-2011),Stanford Car,and Fine-Grained Visual Classification of Aircraft(FGVC Aircraft)datasets show state-of-the-art accuracy of 88.9%,94.7%,and 93.3%,respectively. 展开更多
关键词 Fine-grained recognition feature encoding data augmentation second-order feature discriminative regions
在线阅读 下载PDF
Integrated metasurface-freeform system enabled multi-focal planes augmented reality display
6
作者 Shifei Zhang Lina Gao +9 位作者 Yidan Zhao Yongdong Wang Bo Wang Junjie Li Jiaxi Duan Dewen Cheng Cheng-Wei Qiu Yongtian Wang Tong Yang Lingling Huang 《Opto-Electronic Science》 2026年第1期1-12,共12页
The advent of artificial intelligence(AI)has propelled augmented reality(AR)display technology to a pivotal juncture,positioning it as a contender for the next generation of mobile intelligent terminals.However,the pu... The advent of artificial intelligence(AI)has propelled augmented reality(AR)display technology to a pivotal juncture,positioning it as a contender for the next generation of mobile intelligent terminals.However,the pursuit of advanced AR displays,particularly those capable of delivering immersive 3D experiences,is significantly hindered by the performance limitations of current hardware and the complexity of system integration.In this study,we present an innovative multi-focal plane AR display system that integrates a non-orthogonal polarization-multiplexing metasurface,freeform optical elements,and an OLED display screen.All optical elements are integrated into a single solid-state architecture,based on a joint optimization design approach of ray tracing and diffraction theory.The multi-focal plane AR visual effect is realized by the compact and multiplexing metasurface,which performs distinct phase functions across diverse polarization channels.Meanwhile,freeform surfaces offer ample design flexibility for the collaborative optimization of multi-focal plane imaging and the see-through systems.Followed by a mechanical design and prototype assembly,we demonstrate the system's capabilities in real-time and multi-focal plane display.The digital images at all virtual image distances seamlessly integrate with the real environment,fully exhibiting the system's high parallelism and real-time interactivity.With the innovative design concept and joint design method,we believe that our work will spur more innovative and compact intelligent solutions for AR displays and inject new vitality into hybrid optical systems. 展开更多
关键词 augmented reality metasurface-freeform multi-focal planes display non-orthogonal polarizationmultiplexing metasurfaces
在线阅读 下载PDF
An Unsupervised Online Detection Method for Foreign Objects in Complex Environments
7
作者 YANG Xiaoyang YANG Yanzhu DENG Haiping 《Journal of Donghua University(English Edition)》 2026年第1期140-151,共12页
In modern industrial production,foreign object detection in complex environments is crucial to ensure product quality and production safety.Detection systems based on deep-learning image processing algorithms often fa... In modern industrial production,foreign object detection in complex environments is crucial to ensure product quality and production safety.Detection systems based on deep-learning image processing algorithms often face challenges with handling high-resolution images and achieving accurate detection against complex backgrounds.To address these issues,this study employs the PatchCore unsupervised anomaly detection algorithm combined with data augmentation techniques to enhance the system’s generalization capability across varying lighting conditions,viewing angles,and object scales.The proposed method is evaluated in a complex industrial detection scenario involving the bogie of an electric multiple unit(EMU).A dataset consisting of complex backgrounds,diverse lighting conditions,and multiple viewing angles is constructed to validate the performance of the detection system in real industrial environments.Experimental results show that the proposed model achieves an average area under the receiver operating characteristic curve(AUROC)of 0.92 and an average F1 score of 0.85.Combined with data augmentation,the proposed model exhibits improvements in AUROC by 0.06 and F1 score by 0.03,demonstrating enhanced accuracy and robustness for foreign object detection in complex industrial settings.In addition,the effects of key factors on detection performance are systematically analyzed,providing practical guidance for parameter selection in real industrial applications. 展开更多
关键词 foreign object detection unsupervised learning data augmentation complex environment BOGIE DATASET
在线阅读 下载PDF
Electrochromic retina E-paper:defining the ultimate display at the human vision limit
8
作者 Tongqing Zhou Jianmin Li +1 位作者 Shujuan Liu Qiang Zhao 《Journal of Semiconductors》 2026年第3期1-5,共5页
In an era dominated by visual information,the display interface serves as a critical gateway between the human and digital worlds.The relentless pursuit of visual immersion has driven display technology from cinema sc... In an era dominated by visual information,the display interface serves as a critical gateway between the human and digital worlds.The relentless pursuit of visual immersion has driven display technology from cinema screens to smart-phones and now to virtual and augmented reality(VR/AR)headsets,progressively moving closer to the human eye.This evolution places unprecedented demands on pixel density,power efficiency,and form factor,pushing up against funda-mental physical and physiological limits. 展开更多
关键词 display interface display technology e paper electrochromic cinema screens RETINA virtual augmented visual immersion
在线阅读 下载PDF
Penile shaft reconstruction after cream self-injection:a case report
9
作者 Léa Bollen Stéphane Rysselinck +1 位作者 Jean-Philippe Salmin Gilles Dosin 《The Canadian Journal of Urology》 2026年第1期221-225,共5页
Background:Penile augmentation through injectable substances is becoming increasingly common.A growing number of aesthetic clinics are developing penile enlargement procedures using various injectable materials.Althou... Background:Penile augmentation through injectable substances is becoming increasingly common.A growing number of aesthetic clinics are developing penile enlargement procedures using various injectable materials.Although these procedures are now performed in more controlled and medically supervised environments,their long-term outcomes remain poorly understood.The promotion of such medical treatments contributes to an increasing interest among adult males in self-injection as a method to alleviate psychological distress associated with penile size concerns.At the same time,access to injectable substances through unofficial or unregulated sources has become increasingly easy.Tor our knowledge,we report the first documented case of self-injection with Garamycin®(gentamicin)cream,contributing to the literature on the often multidisciplinary management of penile enlargement injections,a field still lacking well-established guidelines.Case Description:This case report describes a young patient who self-injected Garamycin®into the penis for the purpose of enlargement.He presented to our urology department with worsening symptoms,including severe and poorly tolerated pain.His primary request was prompt relief of pain while preserving,as much as possible,the aesthetic appearance and functional integrity of his penis.This case required a multi-stage surgical approach to salvage the penis and preserve both its structural integrity and functional outcome.Conclusions:To our knowledge,this case report documents the first reported instance of Garamycin®injection performed for the purpose of penile enlargement.It provides insight into the clinical course of such penile cream injections,demonstrates that a two-stage scrotal flap can achieve both functional and aesthetic outcomes,and highlights the importance of comprehensive management particularly addressing the traumatic impact of penile deformity secondary to inflammation and/or infection,as well as the body dysmorphic concerns often associated with these cases. 展开更多
关键词 penile augmentation foreign body injection penile reconstruction scrotal flap complications case report
暂未订购
Effective Token Masking Augmentation Using Term-Document Frequency for Language Model-Based Legal Case Classification
10
作者 Ye-Chan Park Mohd Asyraf Zulkifley +1 位作者 Bong-Soo Sohn Jaesung Lee 《Computers, Materials & Continua》 2026年第4期928-945,共18页
Legal case classification involves the categorization of legal documents into predefined categories,which facilitates legal information retrieval and case management.However,real-world legal datasets often suffer from... Legal case classification involves the categorization of legal documents into predefined categories,which facilitates legal information retrieval and case management.However,real-world legal datasets often suffer from class imbalances due to the uneven distribution of case types across legal domains.This leads to biased model performance,in the form of high accuracy for overrepresented categories and underperformance for minority classes.To address this issue,in this study,we propose a data augmentation method that masks unimportant terms within a document selectively while preserving key terms fromthe perspective of the legal domain.This approach enhances data diversity and improves the generalization capability of conventional models.Our experiments demonstrate consistent improvements achieved by the proposed augmentation strategy in terms of accuracy and F1 score across all models,validating the effectiveness of the proposed method in legal case classification. 展开更多
关键词 Legal case classification class imbalance data augmentation token masking legal NLP
在线阅读 下载PDF
Augmented reality surgical navigation:Clinical applications,key technologies,and future directions
11
作者 Yuanyuan WANG Dawei LU +9 位作者 Jingfan FAN Deqiang XIAO Danni AI Tianyu FU Yucong LIN Long SHAO Tao CHEN Hong SONG Yongtian WANG Jian YANG 《虚拟现实与智能硬件(中英文)》 2026年第1期1-27,共27页
Surgical navigation has evolved significantly through advances in augmented reality,virtual reality,and mixed reality,improving precision and safety across many clinical applications,including neurosurgery,maxillofaci... Surgical navigation has evolved significantly through advances in augmented reality,virtual reality,and mixed reality,improving precision and safety across many clinical applications,including neurosurgery,maxillofacial,spinal,and arthroplasty procedures.By integrating preoperative imaging with real-time intraoperative data,these systems provide dynamic guidance,reduce radiation exposure,and minimize tissue damage.Key challenges persist,including intraoperative registration accuracy,flexible tissue deformation,respiratory compensation,and real-time imaging quality.Emerging solutions include artificial intelligence-driven segmentation,deformation-field modeling,and hybrid registration techniques.Future developments will include lightweight,portable systems,improved non-rigid registration algorithms,and greater clinical adoption.Despite advances in rigid-tissue applications,soft-tissue navigation requires additional innovation to address motion variability and registration reliability,ultimately advancing minimally invasive surgery and precision medicine. 展开更多
关键词 Surgical navigation Augmented reality Multimodal image registration Artificial intelligence
在线阅读 下载PDF
Multiple PointMedSAM Prompting for Enhanced Medical Image Segmentation
12
作者 Wasfieh Nazzal Ezequiel López-Rubio +1 位作者 Miguel A.Molina-Cabello Karl Thurnhofer-Hemsi 《Computers, Materials & Continua》 2026年第5期2100-2115,共16页
Automatic and accurate medical image segmentation remains a fundamental task in computer-aided diagnosis and treatment planning.Recent advances in foundation models,such as the medical-focused Segment AnythingModel(Me... Automatic and accurate medical image segmentation remains a fundamental task in computer-aided diagnosis and treatment planning.Recent advances in foundation models,such as the medical-focused Segment AnythingModel(MedSAM),have demonstrated strong performance but face challenges inmanymedical applications due to anatomical complexity and a limited domain-specific prompt.Thiswork introduces amethodology that enhances segmentation robustness and precision by automatically generating multiple informative point prompts,rather than relying on single inputs.The proposed approach randomly samples sets of spatially distributed point prompts based on image features,enabling MedSAM to better capture fine-grained anatomical structures and boundaries.During inference,probability maps are aggregated to reduce local misclassifications without additional model training.Extensive experiments on various computed tomography(CT)and magnetic resonance imaging(MRI)datasets demonstrate improvements in Dice Similarity Coefficient(DSC)and Normalized Surface Dice(NSD)metrics compared to baseline SAM and Scribble Prompt models.A semi-automatic point sampling version based on the ground truth segmentations yielded enhanced results,achieving up to 92.1%DSC and 86.6%NSD,with significant gains in delineating complex organs such as the pancreas,colon,kidney,and brain tumours.The main novelty of our method consists of effectively combining the results of multiple point prompts into the medical segmentation pipeline so that single-point prompt methods are outperformed.Overall,the proposed model offers a straightforward yet effective approach to improve medical image segmentation performance while maintaining computational efficiency. 展开更多
关键词 Medical image segmentation deep learning test-time augmentation point prompt
在线阅读 下载PDF
Korean Sign Language Recognition and Sentence Generation through Data Augmentation
13
作者 Soo-Yeon Jeong Ho-Yeon Jeong Sun-Young Ihm 《Computers, Materials & Continua》 2026年第5期2005-2019,共15页
Sign language is a primary mode of communication for individuals with hearing impairments,conveying meaning through hand shapes and hand movements.Contrary to spoken or written languages,sign language relies on the re... Sign language is a primary mode of communication for individuals with hearing impairments,conveying meaning through hand shapes and hand movements.Contrary to spoken or written languages,sign language relies on the recognition and interpretation of hand gestures captured in video data.However,sign language datasets remain relatively limited compared to those of other languages,which hinders the training and performance of deep learning models.Additionally,the distinct word order of sign language,unlike that of spoken language,requires context-aware and natural sentence generation.To address these challenges,this study applies data augmentation techniques to build a Korean Sign Language dataset and train recognition models.Recognized words are then reconstructed into complete sentences.The sign recognition process uses OpenCV and MediaPipe to extract hand landmarks from sign language videos and analyzes hand position,orientation,and motion.The extracted features are converted into time-series data and fed into a Long Short-Term Memory(LSTM)model.The proposed recognition framework achieved an accuracy of up to 81.25%,while the sentence generation achieved an accuracy of up to 95%.The proposed approach is expected to be applicable not only to Korean Sign Language but also to other low-resource sign languages for recognition and translation tasks. 展开更多
关键词 Korean sign language recognition LSTM data augmentation sentence completion
在线阅读 下载PDF
Individual Software Expertise Formalization and Assessment from Project Management Tool Databases
14
作者 Traian-Radu Plosca Alexandru-Mihai Pescaru +1 位作者 Bianca-Valeria Rus Daniel-Ioan Curiac 《Computers, Materials & Continua》 2026年第1期389-411,共23页
Objective expertise evaluation of individuals,as a prerequisite stage for team formation,has been a long-term desideratum in large software development companies.With the rapid advancements in machine learning methods... Objective expertise evaluation of individuals,as a prerequisite stage for team formation,has been a long-term desideratum in large software development companies.With the rapid advancements in machine learning methods,based on reliable existing data stored in project management tools’datasets,automating this evaluation process becomes a natural step forward.In this context,our approach focuses on quantifying software developer expertise by using metadata from the task-tracking systems.For this,we mathematically formalize two categories of expertise:technology-specific expertise,which denotes the skills required for a particular technology,and general expertise,which encapsulates overall knowledge in the software industry.Afterward,we automatically classify the zones of expertise associated with each task a developer has worked on using Bidirectional Encoder Representations from Transformers(BERT)-like transformers to handle the unique characteristics of project tool datasets effectively.Finally,our method evaluates the proficiency of each software specialist across already completed projects from both technology-specific and general perspectives.The method was experimentally validated,yielding promising results. 展开更多
关键词 Expertise formalization transformer-based models natural language processing augmented data project management tool skill classification
在线阅读 下载PDF
Diffusion-Driven Generation of Synthetic Complex Concrete Crack Images for Segmentation Tasks
15
作者 Pengwei Guo Xiao Tan Yiming Liu 《Structural Durability & Health Monitoring》 2026年第1期47-69,共23页
Crack detection accuracy in computer vision is often constrained by limited annotated datasets.Although Generative Adversarial Networks(GANs)have been applied for data augmentation,they frequently introduce blurs and ... Crack detection accuracy in computer vision is often constrained by limited annotated datasets.Although Generative Adversarial Networks(GANs)have been applied for data augmentation,they frequently introduce blurs and artifacts.To address this challenge,this study leverages Denoising Diffusion Probabilistic Models(DDPMs)to generate high-quality synthetic crack images,enriching the training set with diverse and structurally consistent samples that enhance the crack segmentation.The proposed framework involves a two-stage pipeline:first,DDPMs are used to synthesize high-fidelity crack images that capture fine structural details.Second,these generated samples are combined with real data to train segmentation networks,thereby improving accuracy and robustness in crack detection.Compared with GAN-based approaches,DDPM achieved the best fidelity,with the highest Structural Similarity Index(SSIM)(0.302)and lowest Learned Perceptual Image Patch Similarity(LPIPS)(0.461),producing artifact-free images that preserve fine crack details.To validate its effectiveness,six segmentation models were tested,among which LinkNet consistently achieved the best performance,excelling in both region-level accuracy and structural continuity.Incorporating DDPM-augmented data further enhanced segmentation outcomes,increasing F1 scores by up to 1.1%and IoU by 1.7%,while also improving boundary alignment and skeleton continuity compared with models trained on real images alone.Experiments with varying augmentation ratios showed consistent improvements,with F1 rising from 0.946(no augmentation)to 0.957 and IoU from 0.897 to 0.913 at the highest ratio.These findings demonstrate the effectiveness of diffusion-based augmentation for complex crack detection in structural health monitoring. 展开更多
关键词 Crack monitoring complex cracks denoising diffusion models generative artificial intelligence synthetic data augmentation
在线阅读 下载PDF
Attention-Enhanced YOLOv8-Seg with WGAN-GP-Based Generative Data Augmentation for High-Precision Surface Defect Detection on Coarsely Ground SiC Wafers
16
作者 Chih-Yung Huang Hong-Ru Shi Min-Yan Xie 《Computers, Materials & Continua》 2026年第5期1431-1455,共25页
Quality control plays a critical role in modern manufacturing.With the rapid development of electric vehicles,5G communications,and the semiconductor industry,high-speed and high-precision detection of surface defects... Quality control plays a critical role in modern manufacturing.With the rapid development of electric vehicles,5G communications,and the semiconductor industry,high-speed and high-precision detection of surface defects on silicon carbide(SiC)wafers has become essential.This study developed an automated inspection framework for identifying surface defects on SiC wafers during the coarse grinding stage.Thecomplex machining textures on wafer surfaces hinder conventional machine vision models,often leading to misjudgment.To address this,deep learning algorithms were applied for defect classification.Because defects are rare and imbalanced across categories,data augmentation was performed using aWasserstein generative adversarial network with gradient penalty(WGAN-GP),along with conventionalmethods.An improved YOLOv8-seg instance segmentationmodel was then trained and tested on datasets with different augmentation strategies.Experimental results showed that,when trained withWGAN-GP–generated data,YOLOv8-seg achieved mean average precision values of 87.0%(bounding box)and 86.6%(segmentation mask).Compared with the traditional WGAN-GP,the proposed model reduced Frechet inception distance by 32.2%and multiscale structural similarity index by 29.8%,generating more realistic and diverse defect images.The proposed framework effectively improves defect detection accuracy under limited data conditions and shows strong potential for industrial applications. 展开更多
关键词 Data augmentation defect detection silicon carbide(SiC )wafer WGAN-GP YOLOv8-seg
在线阅读 下载PDF
Advancing living Bacillus spore identification:Multi-head self-attention mechanism-enabled deep learning combined with single-cell Raman spectroscopy
17
作者 Mengjiao Xue Fusheng Du +5 位作者 Lin He Junhui Hu Yuanpeng Li Yuan Lu Shuwen Zeng Yufeng Yuan 《Journal of Innovative Optical Health Sciences》 2026年第1期139-155,共17页
Many spore-forming Bacillus species can cause serious human diseases,because of accidental Bacillusspore infection.Thus,developing an identification strategy with both high sensitivity and specificity is greatly in de... Many spore-forming Bacillus species can cause serious human diseases,because of accidental Bacillusspore infection.Thus,developing an identification strategy with both high sensitivity and specificity is greatly in demand.In this work,we proposed a novel approach named multi-head self-attention mechanism-guided neural network Raman platform to identify living Bacillus spores within a single-cell resolution.The multi-head self-attention mechanism-guided neural network Raman platform was created by combining single-cell Raman spectroscopy,convolutional neural network(CNN),and multi-head self-attention mechanism.To address the limited size of the original spectra dataset,Gaussian noise-based spectra augmentation was employed to increase the number of single-cell Raman spectra datasets for CNN training.Owing to the assistance of both spectra augmentation and multi-head self-attention mechanism,the obtained prediction accuracy of five Bacillus spore species was further improved from 92.29±0.82%to 99.43±0.15%.To figure out the spectra differences covered by the multi-head self-attention mechanism-guided CNN,the relative classification weight from typical Raman bands was visualized via multi-head self-attention mechanism curve.In the process of spectra augmentation from 0 to 1000,the distribution of relative classification weight varied from a discrete state to a more concentrated phase.More importantly,these highlighted four Raman bands(1017,1449,1576,and 1660 cm^(-1))were assigned large weights,showing that the spectra differences in the Raman bands produced the largest contribution to prediction accuracy.It can be foreseen that,our proposed sorting platform has great potential in accurately identifying Bacillus and its related genera species at a single-cell level. 展开更多
关键词 Multi-head self-attention mechanism CNN single-cell Raman spectroscopy spectra augmentation advanced Bacillus spore identification
原文传递
An Augmentation Method for Small-Sample Imbalanced Industrial IoT Detection Data
18
作者 SU Zhilong SHEN Zhidong SUN Hui 《Wuhan University Journal of Natural Sciences》 2026年第1期25-34,共10页
IoT devices are highly vulnerable to cyberattacks due to their widespread,distributed nature and limited security features.Intrusion detection can counter these threats,but class imbalance between normal and abnormal ... IoT devices are highly vulnerable to cyberattacks due to their widespread,distributed nature and limited security features.Intrusion detection can counter these threats,but class imbalance between normal and abnormal traffic often degrades model performance.We propose a novel multi-generator adversarial data augmentation method that blends the strengths of TMG-GAN(Tabular Multi-Generator Generative Adversarial Network)and R3GAN(Re-GAN).Our approach uses multiple class-specific generators to create diverse,high-quality synthetic samples,improving training stability and minority-class detection.A dual-branch discriminator-classifier enhances authenticity and class prediction,while feature similarity and decoupling techniques ensure clear class separation.Experiments on TON-IoT and Edge-IIoTset datasets show our method outperforms existing techniques like hybrid sampling,SNGAN(Spectral Normalization GAN),and TMG-GAN,achieving higher detection accuracy and better minority-class recall for imbalanced IoT intrusion detection. 展开更多
关键词 Internet of Things(IoT) intrusion detection system generative adversarial networks class imbalance data augmentation
原文传递
Enhanced sparse RCNN for transmission line bolt defect detection via text-to-image data augmentation and quality filtering
19
作者 Chen Zhenyu Yan Huaguang +2 位作者 Du Jianguang Xue Meng Zhao Shuai 《High Technology Letters》 2026年第1期11-20,共10页
To address the issue of inconsistent image quality and data scarcity in bolt defect detection for transmission lines,this paper proposes an improved sparse region-based convolutional neural network(RCNN) based detecti... To address the issue of inconsistent image quality and data scarcity in bolt defect detection for transmission lines,this paper proposes an improved sparse region-based convolutional neural network(RCNN) based detection framework integrating image quality evaluation and text-to-image data augmentation.First,a HyperNetwork-based image quality assessment module is introduced to filter low-quality inspection images in terms of clarity and structural integrity,resulting in a high-quality training dataset.Second,a text-to-image diffusion model is utilized for sample augmentation.By designing text prompts that describe various bolt defect types under diverse lighting and viewing conditions,the model automatically generates realistic synthetic samples.The generated images are further filtered using a combination of quality and perceptual similarity metrics to ensure consistency with the real data distribution.Building upon the sparse RCNN baseline,a dynamic label assignment mechanism and a random decision path detection head are incorporated to enhance bounding box matching and prediction accuracy.Experimental results demonstrate that the proposed method significantly improves detection accuracy(mAP@0.5) over the original sparse RCNN while maintaining low computational cost,enabling more efficient and intelligent inspection of transmission line components. 展开更多
关键词 sparse region-based convolutional neural network HyperNetwork image quality assessment text-to-image generation data augmentation bolt defect detection transmission line inspection
在线阅读 下载PDF
Research on Integrating Deep Learning-Based Vehicle Brand and Model Recognition into a Police Intelligence Analysis Platform
20
作者 Shih-Lin Lin Cheng-Wei Li 《Computers, Materials & Continua》 2026年第2期785-804,共20页
This study focuses on developing a deep learning model capable of recognizing vehicle brands and models,integrated with a law enforcement intelligence platform to overcome the limitations of existing license plate rec... This study focuses on developing a deep learning model capable of recognizing vehicle brands and models,integrated with a law enforcement intelligence platform to overcome the limitations of existing license plate recognition techniques—particularly in handling counterfeit,obscured,or absent plates.The research first entailed collecting,annotating,and classifying images of various vehiclemodels,leveraging image processing and feature extraction methodologies to train themodel on Microsoft Custom Vision.Experimental results indicate that,formost brands and models,the system achieves stable and relatively high performance in Precision,Recall,and Average Precision(AP).Furthermore,simulated tests involving illicit vehicles reveal that,even in cases of reassigned,concealed,or missing license plates,the model can rely on exterior body features to effectively identify vehicles,reducing dependence on plate-specific data.In practical law enforcement scenarios,these findings can accelerate investigations of stolen or forged plates and enhance overall accuracy.In conclusion,continued collection of vehicle images across broadermodel types,production years,and modification levels—along with refined annotation processes and parameter adjustment strategies—will further strengthen themethod’s applicability within law enforcement intelligence platforms,facilitating more precise and comprehensive vehicle recognition and control in real-world operations. 展开更多
关键词 Deep learning vehicle brand-model recognition license plate anomalies(counterfeit/obscured) law enforcement intelligence data augmentation
在线阅读 下载PDF
上一页 1 2 50 下一页 到第
使用帮助 返回顶部