期刊文献+
共找到37,417篇文章
< 1 2 250 >
每页显示 20 50 100
Total score of the computer vision syndrome questionnaire predicts refractive errors and binocular vision anomalies
1
作者 Mosaad Alhassan Tasneem Samman +5 位作者 Hatoun Badukhen Muhamad Alrashed Balsam Alabdulkader Essam Almutleb Tahani Alqahtani Ali Almustanyir 《International Journal of Ophthalmology(English edition)》 2026年第1期90-96,共7页
AIM:To evaluate the efficacy of the total computer vision syndrome questionnaire(CVS-Q)score as a predictive tool for identifying individuals with symptomatic binocular vision anomalies and refractive errors.METHODS:A... AIM:To evaluate the efficacy of the total computer vision syndrome questionnaire(CVS-Q)score as a predictive tool for identifying individuals with symptomatic binocular vision anomalies and refractive errors.METHODS:A total of 141 healthy computer users underwent comprehensive clinical visual function assessments,including evaluations of refractive errors,accommodation(amplitude of accommodation,positive relative accommodation,negative relative accommodation,accommodative accuracy,and accommodative facility),and vergence(phoria,positive and negative fusional vergence,near point of convergence,and vergence facility).Total CVS-Q scores were recorded to explore potential associations between symptom scores and the aforementioned clinical visual function parameters.RESULTS:The cohort included 54 males(38.3%)with a mean age of 23.9±0.58y and 87 age-matched females(61.7%)with a mean age of 23.9±0.53y.The multiple regression model was statistically significant[R²=0.60,F=13.28,degrees of freedom(DF=17122,P<0.001].This indicates that 60%of the variance in total CVS-Q scores(reflecting reported symptoms)could be explained by four clinical measurements:amplitude of accommodation,positive relative accommodation,exophoria at distance and near,and positive fusional vergence at near.CONCLUSION:The total CVS-Q score is a valid and reliable tool for predicting the presence of various nonstrabismic binocular vision anomalies and refractive errors in symptomatic computer users. 展开更多
关键词 computer vision syndrome refractive errors ACCOMMODATION VERGENCE binocular vision SYMPTOMS
原文传递
卷积神经网络与Vision Transformer在胶质瘤中的研究进展
2
作者 杨浩辉 徐涛 +3 位作者 王伟 安良良 敖用芳 朱家宝 《磁共振成像》 北大核心 2026年第1期168-174,共7页
胶质瘤因高度异质性、强侵袭性及预后差,传统诊疗面临巨大挑战。深度学习技术的引入为其精准诊疗提供了新路径,其中卷积神经网络(convolutional neural network,CNN)与Vision Transformer(ViT)是核心工具。CNN凭借层级化卷积操作在局部... 胶质瘤因高度异质性、强侵袭性及预后差,传统诊疗面临巨大挑战。深度学习技术的引入为其精准诊疗提供了新路径,其中卷积神经网络(convolutional neural network,CNN)与Vision Transformer(ViT)是核心工具。CNN凭借层级化卷积操作在局部特征提取(如肿瘤边缘、纹理细节)上具有天然优势,而ViT基于自注意力机制在全局上下文建模(如肿瘤跨区域异质性、多模态关联)方面表现突出,二者的融合策略通过整合局部精细特征与全局关联信息,在应对胶质瘤边界模糊、跨模态数据异构性等临床难题中展现出显著优势。本文综述了二者在胶质瘤检测与分割、病理分级、分子分型、预后评估等关键临床任务中的研究进展,阐述了原理、单独应用及融合策略。同时,本文也探讨了当前研究中存在的挑战,诸如对数据标注的强依赖性、模型可解释性不足等问题,并展望了未来的发展方向,例如构建轻量化架构、发展自监督学习以及推进多组学融合等前沿,以期为胶质瘤智能诊断提供系统性参考。 展开更多
关键词 胶质瘤 深度学习 卷积神经网络 vision Transformer 磁共振成像
暂未订购
基于条件生成对抗网络和Vision Transformer的胎儿颅脑超声标准切面识别方法
3
作者 李惠莲 林艺榕 +1 位作者 刘中华 柳培忠 《临床超声医学杂志》 2026年第2期164-169,共6页
胎儿颅脑超声检查是产前常规筛查中至关重要的一环,准确识别标准切面对于评估胎儿大脑发育状况具有重要意义。然而,由于超声图像质量差异和切面获取的复杂性,准确识别标准切面具有较大的挑战性。本文提出了一种基于条件对抗生成网络(CG... 胎儿颅脑超声检查是产前常规筛查中至关重要的一环,准确识别标准切面对于评估胎儿大脑发育状况具有重要意义。然而,由于超声图像质量差异和切面获取的复杂性,准确识别标准切面具有较大的挑战性。本文提出了一种基于条件对抗生成网络(CGAN)和Vision Transformer的胎儿颅脑超声标准切面识别方法,利用CGAN对原始数据进行增强,生成额外的标准切面和非标准切面图像,解决数据不足的问题;同时采用YOLOv9模型对超声图像中的颅骨区域进行自动裁剪,去除无关信息,确保模型专注于关键区域。在分类模型中采用Vision Transformer对所有输入图像进行归一化和尺寸调整,使用了数据增强技术如随机水平或垂直翻转、调整图像对比度、中心裁剪和调整图像饱和度等。结果显示,相较于现有最优模型CSwin Transformer的方法,本文提出的方法在胎儿颅脑超声标准切面识别任务中表现出色,其精确率、召回率、F1分数及准确率分别为92.5%、92.3%、92.4%和93.3%。该方法在提升识别精度方面具有显著优势,为临床超声检查提供了有效技术支持。 展开更多
关键词 条件生成对抗网络 vision Transformer 颅脑超声 胎儿 标准切面识别方法
暂未订购
孪生多级Vision Transformer高分遥感影像变化检测方法
4
作者 黄英杰 《测绘与空间地理信息》 2026年第2期123-126,130,共5页
针对现有遥感变化检测模型捕获特征不全面,深、浅层特征利用不充分,导致分割精度不高的问题,提出一种结合Vision Transformer与孪生架构的遥感影像变化检测模型。在编码器端,采用孪生多级Vision Transformer实现空间特征提取与全局上下... 针对现有遥感变化检测模型捕获特征不全面,深、浅层特征利用不充分,导致分割精度不高的问题,提出一种结合Vision Transformer与孪生架构的遥感影像变化检测模型。在编码器端,采用孪生多级Vision Transformer实现空间特征提取与全局上下文特征建模,同时采用haar小波下采样层进行特征图尺寸压缩,减少细节特征的丢失;在特征解码过程中,引入全尺度特征连接机制,充分利用不同来源的深、浅层特征。实验结果表明,所提出模型在分割精度上优于当前的主流模型,能够准确地捕获变化目标的边界与细节信息。 展开更多
关键词 遥感变化检测 孪生架构 vision Transformer haar小波下采样 全尺度特征连接
在线阅读 下载PDF
From microstructure to performance optimization:Innovative applications of computer vision in materials science
5
作者 Chunyu Guo Xiangyu Tang +10 位作者 Yu’e Chen Changyou Gao Qinglin Shan Heyi Wei Xusheng Liu Chuncheng Lu Meixia Fu Enhui Wang Xinhong Liu Xinmei Hou Yanglong Hou 《International Journal of Minerals,Metallurgy and Materials》 2026年第1期94-115,共22页
The rapid advancements in computer vision(CV)technology have transformed the traditional approaches to material microstructure analysis.This review outlines the history of CV and explores the applications of deep-lear... The rapid advancements in computer vision(CV)technology have transformed the traditional approaches to material microstructure analysis.This review outlines the history of CV and explores the applications of deep-learning(DL)-driven CV in four key areas of materials science:microstructure-based performance prediction,microstructure information generation,microstructure defect detection,and crystal structure-based property prediction.The CV has significantly reduced the cost of traditional experimental methods used in material performance prediction.Moreover,recent progress made in generating microstructure images and detecting microstructural defects using CV has led to increased efficiency and reliability in material performance assessments.The DL-driven CV models can accelerate the design of new materials with optimized performance by integrating predictions based on both crystal and microstructural data,thereby allowing for the discovery and innovation of next-generation materials.Finally,the review provides insights into the rapid interdisciplinary developments in the field of materials science and future prospects. 展开更多
关键词 MICROSTRUCTURE deep learning computer vision performance prediction image generation
在线阅读 下载PDF
Advances and Prospects in Body-Size Measurement of Sheep:From 2D Vision to 3D Reconstruction and 2D-3D Fusion
6
作者 DAI Weijiao LIANG Yudongchen +5 位作者 ZHOU Yong YAO Chao ZHANG Cheng SONG Yongjian LI Guoliang TIAN Fang 《智慧农业(中英文)》 2026年第1期120-147,共28页
[Significance]In alignment with the national germplasm security strategy,current research efforts are accelerating the adoption of precision breeding in sheep.Within the whole-genome selection,accurate phenotyping of ... [Significance]In alignment with the national germplasm security strategy,current research efforts are accelerating the adoption of precision breeding in sheep.Within the whole-genome selection,accurate phenotyping of body morphometrics is critical for assessing growth performance and breeding value.Traditional manual measurements are inefficient,prone to human error,and may cause stress to sheep,limiting their suitability for precision sheep management.By summarizing the applications of sheep body size measurement technologies and analyzing their development directions,this paper provides theoretical references and practical guidance for the research and application of non contact sheep body size measurement.[Progress]This review synthesizes progress across three principal methodological paradigms:two-dimensional(2D)image-based techniques,three-dimensional(3D)point cloud-based approaches,and integrated 2D-3D fusion systems.2D methods,employing either handcrafted geometric features or deep learning-based keypoint detector algorithms,are cost-effective and operationally simple but sensitive to variation in imaging conditions and unable to capture critical circumference metrics.3D point-cloud approaches enable precise reconstruction of full animal morphology,supporting comprehensive body-size acquisition with higher accuracy,yet face challenges including high hardware costs,complex data workflows,and sensitivity to posture variability.Hybrid 2D-3D fusion systems combine semantic richness from RGB imagery with geometric completeness from point clouds.Having been effectively validated in other livestock specise,e.g.,cattle and pigs,these fusion systems have demonstrated excellent performance,providing important technical references and practical insights for sheep body size measurement.[Conclusions and Prospects]Firstly,future research should focus on constructing large-scale,high-quality datasets for sheep body size measurement that encompass diverse breeds,growth stages,and environmental conditions,thereby enhancing model robustness and generalization.Secondly,the development of lightweight artificial intelligence models is essential.Techniques such as model compression,quantization,and algorithmic optimization can substantially reduce computational complexity and storage requirements,facilitating deployment in resource-constrained environments.Thirdly,the 3D point cloud processing pipeline should be streamlined to improve the efficiency of data acquisition,filtering,registration,and segmentation,while promoting the integration of low-cost,high-resilience vision systems into practical farming scenarios.Fourthly,specific emphasis should be placed on improving the accuracy of curved-dimensional measurements,such as chest circumference,abdominal circumference,and shank circumference,through advances in pose standardization,refined 3D segmentation strategies,and multimodal data fusion.Finally,the cross-fertilization of sheep body size measurement technologies with analogous methods for other livestock species offers a promising pathway for mutual learning and collaborative innovation,accelerating the industrialization of automated sheep morphometric systems and supporting the development of intelligent,data-driven pasture management practices. 展开更多
关键词 smart breeding computer vision image recognition three-dimensional reconstruction 2D-3D body measurement
在线阅读 下载PDF
Functional outcome and patient satisfaction 5y after laser vision correction
7
作者 Ran Gao Yu Han +4 位作者 Jie Qin Yu-Shan Xu Yu Li Xiao-Tong Lyu Feng-Ju Zhang 《International Journal of Ophthalmology(English edition)》 2026年第1期123-131,共9页
AIM:To investigate the association between functionaloutcomes and postoperative patient satisfaction 5y aftersmall incision lenticule extraction(SMILE)and femtosecondlaser-assisted in situ keratomileusis(FS-LASIK).MET... AIM:To investigate the association between functionaloutcomes and postoperative patient satisfaction 5y aftersmall incision lenticule extraction(SMILE)and femtosecondlaser-assisted in situ keratomileusis(FS-LASIK).METHODS:This is a cross-sectional study.Thepatients underwent basic ophthalmic examinations,axiallength measurement,wide-field fundus photography,andaccommodation function testing.Behavioral habits datawere collected using a self-administered questionnaire,andvisual symptoms were assessed with the Quality of Vision(QoV)questionnaire.Postoperative satisfaction was alsorecorded.RESULTS:Totally 410 subjects[820 eyes,160males(39.02%)and 250 females(60.98%)]who hadundergone SMILE or FS-LASIK 5y ago were enrolled.Themean(standard deviation,SD)age of all patients was29.83y(6.69).The mean(SD)preoperative manifest SEwas-5.80(2.04)diopters(D;range:-0.88 to-13.75).Patient satisfaction at 5y after undergoing SMILE or FSLASIKwas 91.70%.Patients were categorized into twogroups:dissatisfied group and satisfied group.Significantdifferences were observed between the two groups in termsof age(P=0.012),sex(P=0.021),preoperative degreeof myopia(P=0.049),postoperative visual symptoms(frequency,P=0.043;severity,P<0.001;bothersome,P=0.018),difficulty driving at night(P=0.001),andaccommodative amplitude(AMP,P=0.020).Multivariateanalysis confirmed that female sex(P=0.024),severityof visual symptoms(P=0.009),and difficulty driving atnight(P=0.006)were significantly associated with lowersatisfaction.The dissatisfied group showed higher rates ofstarbursts,double or multiple images,and high myopia,but lower age.The frequency,severity,and bothersome ofdistortion exhibited decreased with increasing age.CONCLUSION:Patient satisfaction 5y after SMILEand FS-LASIK is high and stable.Difficulty driving at night,sex,and severity of visual symptoms are important factorsinfluencing patient satisfaction.Special attention should bepaid to younger highly myopic female patients,particularlythose with starbursts and double or multiple images.It is crucial to monitor postoperative visual outcomesand provide patients with comprehensive preoperativecounseling to enhance long-term satisfaction. 展开更多
关键词 patient satisfaction MYOPIA vision small incision lenticule extraction femtosecond laser-assisted in situ keratomileusis
原文传递
基于注意机制优化的Vision Transformer在虫草等级识别中的应用
8
作者 刘惠文 《消费电子》 2026年第4期248-250,共3页
在数字时代背景下,深度学习驱动了图像识别的创新,但目前对虫草等级识别的研究主要还是依靠人工经验,存在效率低、主观性强等问题。文章采用视觉转换器(Vision Transformer,ViT)模型对虫草图像进行分级识别。首先,阐述视觉知觉、注意机... 在数字时代背景下,深度学习驱动了图像识别的创新,但目前对虫草等级识别的研究主要还是依靠人工经验,存在效率低、主观性强等问题。文章采用视觉转换器(Vision Transformer,ViT)模型对虫草图像进行分级识别。首先,阐述视觉知觉、注意机制和层次划分的理论依据,并从注意机制和模型结构两个角度对ViT进行调整和优化;在此基础上,利用PyTorch框架对包含5000幅图像的数据集进行5重交叉验证。实验结果显示,该模型的预测精度达到95.2%,召回率达到94.5%,F1值达到94.8%,为虫草行业智能化发展提供了技术支持。 展开更多
关键词 vision Transformer 虫草等级识别 图像分类 深度学习 计算机视觉
在线阅读 下载PDF
Privacy-Preserving Gender-Based Customer Behavior Analytics in Retail Spaces Using Computer Vision
9
作者 Ginanjar Suwasono Adi Samsul Huda +4 位作者 Griffani Megiyanto Rahmatullah Dodit Suprianto Dinda Qurrota Aini Al-Sefy Ivon Sandya Sari Putri Lalu Tri Wijaya Nata Kusuma 《Computers, Materials & Continua》 2026年第1期1839-1861,共23页
In the competitive retail industry of the digital era,data-driven insights into gender-specific customer behavior are essential.They support the optimization of store performance,layout design,product placement,and ta... In the competitive retail industry of the digital era,data-driven insights into gender-specific customer behavior are essential.They support the optimization of store performance,layout design,product placement,and targeted marketing.However,existing computer vision solutions often rely on facial recognition to gather such insights,raising significant privacy and ethical concerns.To address these issues,this paper presents a privacypreserving customer analytics system through two key strategies.First,we deploy a deep learning framework using YOLOv9s,trained on the RCA-TVGender dataset.Cameras are positioned perpendicular to observation areas to reduce facial visibility while maintaining accurate gender classification.Second,we apply AES-128 encryption to customer position data,ensuring secure access and regulatory compliance.Our system achieved overall performance,with 81.5%mAP@50,77.7%precision,and 75.7%recall.Moreover,a 90-min observational study confirmed the system’s ability to generate privacy-protected heatmaps revealing distinct behavioral patterns between male and female customers.For instance,women spent more time in certain areas and showed interest in different products.These results confirm the system’s effectiveness in enabling personalized layout and marketing strategies without compromising privacy. 展开更多
关键词 Business intelligence customer behavior privacy-preserving analytics computer vision deep learning smart retail gender recognition heatmap privacy RCA-TVGender dataset
在线阅读 下载PDF
Comparison of binocular vision indices in Parkinson’s disease patients vs age-sex-matched healthy controls
10
作者 Reyhaneh Shariati-Moghaddam Ali Shoeibi +6 位作者 Morad Amir Ahmad Hadi Ostadimoghaddam Hassan Hashemi Akbar Derakhshan Zahra Hemmatian Abbasali Yekta Mehdi Khabazkhoob 《International Journal of Ophthalmology(English edition)》 2026年第3期549-555,共7页
AIM:To evaluate the differences in near point of convergence(NPC),fusional vergence,saccadic eye movements,versional eye movements,and heterophoria between patients diagnosed with Parkinson’s disease(PD)and healthy s... AIM:To evaluate the differences in near point of convergence(NPC),fusional vergence,saccadic eye movements,versional eye movements,and heterophoria between patients diagnosed with Parkinson’s disease(PD)and healthy subjects.METHODS:A cross-sectional comparative study was conducted,enrolling two cohorts:a PD group and a healthy control group.The PD group was recruited via non-random convenience sampling,while the control group was selected randomly from individuals without PD.All participants were screened according to predefined inclusion and exclusion criteria before undergoing a comprehensive optometric assessment,which included measurements of uncorrected visual acuity,corrected visual acuity,and objective and subjective refraction.Subsequently,binocular vision function evaluations were performed,covering NPC measurement,fusional vergence reserve assessment at both distance and near,saccadic eye movement testing,and versional eye movement and heterophoria assessment.RESULTS:A total of 42 PD patients and 41 healthy controls were included in the final analysis.The two groups were well-matched in terms of sex distribution[29 males(69.0%)in the PD group vs 29 males(70.7%)in the control group,P=0.867]and mean age(55.3±9.6y in the PD group vs 54.9±9.8y in the control group,P=0.866).The prevalence of abnormal versional eye movements was significantly higher in the PD group than in the control group(23.81%,95%CI:12.05%-39.45%vs 7.32%,95%CI:1.54%-19.92%;P=0.025).Near exophoria was more prevalent in PD patients(61.90%,95%CI:45.64%-76.43%)than in controls(17.07%,95%CI:7.15%-32.06%),with a significant difference[odds ratio(OR)=7.99;95%CI:2.83-21.99;P<0.001].The mean NPC was significantly greater(more receded)in the PD group than in the control group(9.01±3.74 cm vs 7.20±2.15 cm;P=0.007).A statistically significant positive correlation was observed between PD severity and NPC values(Pearson’s correlation coefficient=0.309;P=0.046).Except for distance baseout break and distance base-out recovery values,all other fusional vergence parameters were significantly lower in the PD group than in the control group(P<0.05).The mean saccadic test score was significantly lower in PD patients than in controls(3.29±0.57 vs 3.78±0.42;P<0.001).Among all fusional vergence indices,near base-in blur yielded the highest area under the curve(AUC=0.877),with a sensitivity of 69%and specificity of 90%,followed by distance base-out blur(AUC=0.824,sensitivity=97.6%,specificity=66.7%),near base-out blur(AUC=0.814,sensitivity=76.2%,specificity=72.7%),near base-out break(AUC=0.749,sensitivity=78.6%,specificity=67.6%),and near base-out recovery(AUC=0.749,sensitivity=95.2%,specificity=50%).CONCLUSION:PD is associated with significant binocular vision function impairment,with receded NPC and reduced near fusional vergence reserves being the most prominent disorders.These findings highlight the potential value of binocular vision assessment as a non-invasive biomarker for the early detection and clinical monitoring of PD. 展开更多
关键词 Parkinson’s disease binocular vision near point of convergence fusional vergence saccadic eye movement HETEROPHORIA
原文传递
基于Vision Transformer的轻量化单目深度估计
11
作者 张凯 唐嘉宁 +2 位作者 李叶嘉 马孟星 周思达 《现代电子技术》 北大核心 2026年第4期64-72,共9页
深度估计能为无人机提供精确的三维环境感知能力,而对边缘设备而言,实时推理与极低的计算资源消耗至关重要。目前大多数单目深度估计网络都侧重于提高在高端GPU上运行时的精度,难以满足边缘设备的实时性要求。为解决该问题,提出一种新... 深度估计能为无人机提供精确的三维环境感知能力,而对边缘设备而言,实时推理与极低的计算资源消耗至关重要。目前大多数单目深度估计网络都侧重于提高在高端GPU上运行时的精度,难以满足边缘设备的实时性要求。为解决该问题,提出一种新型编码器-解码器网络,以实现边缘设备上的实时单目深度估计。所提网络通过一个高效的语义模块合并全局的语义信息,为深度估计提供更多的物体边缘细节;并将基于Transformer的模块集成到编码器-解码器架构的最低分辨率层级,从而大大减少视觉变换器(ViT)的参数。此外,还提出了用于深度解码的Upconv层。该网络在精度和速度之间实现了较好的权衡,通过TensorRT优化,在NVIDIA Jetson Orin设备上具备实时推理性能,优于目前多数先进的实时性算法。 展开更多
关键词 单目深度估计网络 边缘设备 编码器 解码器 Transformer技术 视觉变换器
在线阅读 下载PDF
基于改进Vision Transformer的水稻叶片病害图像识别 被引量:1
12
作者 朱周华 周怡纳 +1 位作者 侯智杰 田成源 《电子测量技术》 北大核心 2025年第10期153-160,共8页
水稻叶片病害智能识别在现代农业生产中具有重要意义。针对传统Vision Transformer网络缺乏归纳偏置,难以有效捕捉图像局部细节特征的问题,提出了一种改进的Vision Transformer模型。该模型通过引入内在归纳偏置,增强了对多尺度上下文... 水稻叶片病害智能识别在现代农业生产中具有重要意义。针对传统Vision Transformer网络缺乏归纳偏置,难以有效捕捉图像局部细节特征的问题,提出了一种改进的Vision Transformer模型。该模型通过引入内在归纳偏置,增强了对多尺度上下文以及局部与全局依赖关系的建模能力,同时降低了对大规模数据集的需求。此外,Vision Transformer中的多层感知器模块被Kolmogorov-Arnold网络结构取代,从而提升了模型对复杂特征的提取能力和可解释性。实验结果表明,所提模型在水稻叶片病害识别任务中取得了优异的性能,识别准确率达到了98.62%,较原始ViT模型提升了6.2%,显著提高了对水稻叶片病害的识别性能。 展开更多
关键词 水稻叶片病害 图像识别 vision Transformer网络 归纳偏置 局部特征
原文传递
基于残差注意力TCN与vision transformer的齿轮剩余寿命预测
13
作者 胡爱军 李晨阳 +2 位作者 邢磊 周卓浩 向玲 《航空动力学报》 北大核心 2025年第12期14-24,共11页
齿轮系统的运行状况受到多个因素的影响,这些因素在时间上存在长期依赖关系,并在局部和全局特征之间存在差异。为了有效地捕捉数据中的时间依赖性并自适应调整对特征的关注度,提出具有残差卷积块注意力机制的时间卷积网络(RCMTCN)。通... 齿轮系统的运行状况受到多个因素的影响,这些因素在时间上存在长期依赖关系,并在局部和全局特征之间存在差异。为了有效地捕捉数据中的时间依赖性并自适应调整对特征的关注度,提出具有残差卷积块注意力机制的时间卷积网络(RCMTCN)。通过在卷积块注意力机制中引入残差连接,模型能够同时关注原始输入和注意力加权的信息,提高了模型对局部信息的感知能力。在此基础上,将vision transformer(ViT)模型与RCMTCN相结合对齿轮的剩余使用寿命(RUL)预测,ViT模型能有效地捕获数据中的全局信息。两者融合后能充分展现在处理时间序列数据局部特征提取能力和全局信息关注方面的优势,提高对多维度特征的感知能力。最后,通过在两种工况齿轮性能退化数据集上对模型进行验证,选用点蚀故障数据进行训练,分别对点蚀和断齿故障进行测试。实验结果表明:与其他方法相比,所提出的方法能更充分地提取关键特征信息,在点蚀故障上评分函数得分为0.8898,且在断齿故障上得分为0.8587,表现出良好的工况、故障适应能力。 展开更多
关键词 齿轮 剩余使用寿命 时序网络 注意力机制 vision transformer模型
原文传递
Vision Transformer模型在中医舌诊图像分类中的应用研究
14
作者 周坚和 王彩雄 +3 位作者 李炜 周晓玲 张丹璇 吴玉峰 《广西科技大学学报》 2025年第5期89-98,共10页
舌诊作为中医望诊中的一项重要且常规的检查手段,在中医临床诊断中发挥着不可或缺的作用。为突破传统舌诊依赖主观经验及卷积神经网络(convolutional neural network,CNN)模型分类性能不足的局限,本文基于高质量舌象分类数据集,提出基于... 舌诊作为中医望诊中的一项重要且常规的检查手段,在中医临床诊断中发挥着不可或缺的作用。为突破传统舌诊依赖主观经验及卷积神经网络(convolutional neural network,CNN)模型分类性能不足的局限,本文基于高质量舌象分类数据集,提出基于Vision Transformer(ViT)深度学习模型,通过预训练与微调策略优化特征提取能力,并结合数据增强技术解决类别分布不平衡问题。实验结果表明,该模型在6项关键舌象特征分类任务中,5项指标的准确率(苔色85.6%、瘀斑98.0%、质地99.6%、舌色96.6%、裂纹87.8%)显著优于现有CNN方法(如ResNet50对应准确率分别为78.0%、91.0%、92.0%、68.0%、80.1%),验证了该模型在突破传统性能瓶颈、提升中医临床智能诊断可靠性方面的有效性和应用潜力。 展开更多
关键词 舌诊 vision Transformer(ViT) 深度学习 医学图像分类
在线阅读 下载PDF
Steel Surface Defect Detection Using Learnable Memory Vision Transformer
15
作者 Syed Tasnimul Karim Ayon Farhan Md.Siraj Jia Uddin 《Computers, Materials & Continua》 SCIE EI 2025年第1期499-520,共22页
This study investigates the application of Learnable Memory Vision Transformers(LMViT)for detecting metal surface flaws,comparing their performance with traditional CNNs,specifically ResNet18 and ResNet50,as well as o... This study investigates the application of Learnable Memory Vision Transformers(LMViT)for detecting metal surface flaws,comparing their performance with traditional CNNs,specifically ResNet18 and ResNet50,as well as other transformer-based models including Token to Token ViT,ViT withoutmemory,and Parallel ViT.Leveraging awidely-used steel surface defect dataset,the research applies data augmentation and t-distributed stochastic neighbor embedding(t-SNE)to enhance feature extraction and understanding.These techniques mitigated overfitting,stabilized training,and improved generalization capabilities.The LMViT model achieved a test accuracy of 97.22%,significantly outperforming ResNet18(88.89%)and ResNet50(88.90%),aswell as the Token to TokenViT(88.46%),ViT without memory(87.18),and Parallel ViT(91.03%).Furthermore,LMViT exhibited superior training and validation performance,attaining a validation accuracy of 98.2%compared to 91.0%for ResNet 18,96.0%for ResNet50,and 89.12%,87.51%,and 91.21%for Token to Token ViT,ViT without memory,and Parallel ViT,respectively.The findings highlight the LMViT’s ability to capture long-range dependencies in images,an areawhere CNNs struggle due to their reliance on local receptive fields and hierarchical feature extraction.The additional transformer-based models also demonstrate improved performance in capturing complex features over CNNs,with LMViT excelling particularly at detecting subtle and complex defects,which is critical for maintaining product quality and operational efficiency in industrial applications.For instance,the LMViT model successfully identified fine scratches and minor surface irregularities that CNNs often misclassify.This study not only demonstrates LMViT’s potential for real-world defect detection but also underscores the promise of other transformer-based architectures like Token to Token ViT,ViT without memory,and Parallel ViT in industrial scenarios where complex spatial relationships are key.Future research may focus on enhancing LMViT’s computational efficiency for deployment in real-time quality control systems. 展开更多
关键词 Learnable Memory vision Transformer(LMViT) Convolutional Neural Networks(CNN) metal surface defect detection deep learning computer vision image classification learnable memory gradient clipping label smoothing t-SNE visualization
在线阅读 下载PDF
Causes and factors associated with vision impairment in the elderly population in Mangxin town,Kashgar region,Xinjiang,China
16
作者 Lingling Chen Ruilian Liao +6 位作者 Yuanyuan Liu Ling Jin Jun Fu Xun Wang Hongwen Jiang Lin Ding Qianyun Chen 《Eye Science》 2025年第1期12-24,共13页
Objective:This study aimed to investigate the prevalence,causes,and influencing factors of vision impairment in the elderly population aged 60 years and above in Mangxin Town,Kashgar region,Xinjiang,China.Located in a... Objective:This study aimed to investigate the prevalence,causes,and influencing factors of vision impairment in the elderly population aged 60 years and above in Mangxin Town,Kashgar region,Xinjiang,China.Located in a region characterized by intense ultraviolet radiation and arid climatic conditions,Mangxin Town presents unique environmental challenges that may exacerbate ocular health issues.Despite the global emphasis on addressing vision impairment among aging populations,there remains a paucity of updated and region-specific data in Xinjiang,necessitating this comprehensive assessment to inform targeted interventions.Methods:A cross-sectional study was conducted from May to June 2024,involving 1,311 elderly participants(76.76%participation rate)out of a total eligible population of 1,708 individuals aged≥60 years.Participants underwent detailed ocular examinations,including assessments of uncorrected visual acuity(UVA)and best-corrected visual acuity(BCVA)using standard logarithmic charts,slit-lamp biomicroscopy,optical coherence tomography(OCT,Topcon DRI OCT Triton),fundus photography,and intraocular pressure measurement(Canon TX-20 Tonometer).A multidisciplinary team of 10 ophthalmologists and 2 local village doctors,trained rigorously in standardized protocols,ensured consistent data collection.Demographic,lifestyle,and medical history data were collected via questionnaires.Statistical analyses,performed using STATA 16,included multivariate logistic regression to identify risk factors,with significance defined as P<0.05.Results:The overall prevalence of vision impairment was 13.21%(95%CI:11.37%-15.04%),with low vision at 11.76%(95%CI:10.01%-13.50%)and blindness at 1.45%(95%CI:0.80%-2.10%).Cataract emerged as the leading cause,responsible for 68.20%of cases,followed by glaucoma(5.80%),optic atrophy(5.20%),and age-related macular degeneration(2.90%).Vision impairment prevalence escalated significantly with age:7.74%in the 60–69 age group,17.79%in 70–79,and 33.72%in those≥80.Males exhibited higher prevalence than females(15.84%vs.10.45%,P=0.004).Multivariate analysis revealed age≥80 years(OR=6.43,95%CI:3.79%-10.90%),male sex(OR=0.53,95%CI:0.34%-0.83%),and daily exercise(OR=0.44,95%CI:0.20%-0.95%)as significant factors.History of eye disease showed a non-significant trend toward increased risk(OR=1.49,P=0.107).Education level,income,and smoking status showed no significant associations.Conclusions:This study underscores cataract as the predominant cause of vision impairment in Mangxin Town’s elderly population,with age and sex as critical determinants.The findings align with global patterns but highlight region-specific challenges,such as environmental factors contributing to cataract prevalence.Public health strategies should prioritize improving access to cataract surgery,enhancing grassroots ophthalmic infrastructure,and integrating portable screening technologies for early detection of fundus diseases.Additionally,promoting health education on UV protection and lifestyle modifications,such as regular exercise,may mitigate risks.Future research should expand to broader regions in Xinjiang,employ advanced diagnostic tools for complex conditions like glaucoma,and explore longitudinal trends to refine intervention strategies.These efforts are vital to reducing preventable blindness and improving quality of life for aging populations in underserved areas. 展开更多
关键词 low vision BLINDNESS vision impairment elderly XINJIANG CATARACT
暂未订购
AARPose:Real-time and accurate drogue pose measurement based on monocular vision for autonomous aerial refueling
17
作者 Shuyuan WEN Yang GAO +3 位作者 Bingrui HU Zhongyu LUO Zhenzhong WEI Guangjun ZHANG 《Chinese Journal of Aeronautics》 2025年第6期552-572,共21页
Real-time and accurate drogue pose measurement during docking is basic and critical for Autonomous Aerial Refueling(AAR).Vision measurement is the best practicable technique,but its measurement accuracy and robustness... Real-time and accurate drogue pose measurement during docking is basic and critical for Autonomous Aerial Refueling(AAR).Vision measurement is the best practicable technique,but its measurement accuracy and robustness are easily affected by limited computing power of airborne equipment,complex aerial scenes and partial occlusion.To address the above challenges,we propose a novel drogue keypoint detection and pose measurement algorithm based on monocular vision,and realize real-time processing on airborne embedded devices.Firstly,a lightweight network is designed with structural re-parameterization to reduce computational cost and improve inference speed.And a sub-pixel level keypoints prediction head and loss functions are adopted to improve keypoint detection accuracy.Secondly,a closed-form solution of drogue pose is computed based on double spatial circles,followed by a nonlinear refinement based on Levenberg-Marquardt optimization.Both virtual simulation and physical simulation experiments have been used to test the proposed method.In the virtual simulation,the mean pixel error of the proposed method is 0.787 pixels,which is significantly superior to that of other methods.In the physical simulation,the mean relative measurement error is 0.788%,and the mean processing time is 13.65 ms on embedded devices. 展开更多
关键词 Autonomous aerial refueling vision measurement Deep learning REAL-TIME LIGHTWEIGHT ACCURATE Monocular vision Drogue pose measurement
原文传递
Long-Term Vision in a Rapidly Changing World
18
作者 JOHN QUELCH 《China Today》 2025年第8期43-45,共3页
China’s five-year plans crystallize a governance model that merges long-term strategic vision with adaptive execution.AS China prepares to unveil its 15th Five-Year Plan in 2026,policymakers,investors,and scholars ar... China’s five-year plans crystallize a governance model that merges long-term strategic vision with adaptive execution.AS China prepares to unveil its 15th Five-Year Plan in 2026,policymakers,investors,and scholars around the world are watching closely.For over 70 years,these plans have guided the country’s economic and social development. 展开更多
关键词 economic social development long term vision China economic development five year plans adaptive execution strategic vision governance model
在线阅读 下载PDF
Automated Concrete Bridge Damage Detection Using an Efficient Vision Transformer-Enhanced Anchor-Free YOLO
19
作者 Xiaofei Yang Enrique del Rey Castillo +3 位作者 Yang Zou Liam Wotherspoon Jianxi Yang Hao Li 《Engineering》 2025年第8期311-326,共16页
Deep learning techniques have recently been the most popular method for automatically detecting bridge damage captured by unmanned aerial vehicles(UAVs).However,their wider application to real-world scenarios is hinde... Deep learning techniques have recently been the most popular method for automatically detecting bridge damage captured by unmanned aerial vehicles(UAVs).However,their wider application to real-world scenarios is hindered by three challenges:①defect scale variance,motion blur,and strong illumination significantly affect the accuracy and reliability of damage detectors;②existing commonly used anchor-based damage detectors struggle to effectively generalize to harsh real-world scenarios;and③convolutional neural networks(CNNs)lack the capability to model long-range dependencies across the entire image.This paper presents an efficient Vision Transformer-enhanced anchor-free YOLO(you only look once)method to address these challenges.First,a concrete bridge damage dataset was established,augmented by motion blur and varying brightness.Four key enhancements were then applied to an anchor-based YOLO method:①Four detection heads were introduced to alleviate the multi-scale damage detection issue;②decoupled heads were employed to address the conflict between classification and bounding box regression tasks inherent in the original coupled head design;③an anchor-free mechanism was incorporated to reduce the computational complexity and improve generalization to real-world scenarios;and④a novel Vision Transformer block,C3MaxViT,was added to enable CNNs to model long-range dependencies.These enhancements were integrated into an advanced anchor-based YOLOv5l algorithm,and the proposed Vision Transformer-enhanced anchor-free YOLO method was then compared against cutting-edge damage detection methods.The experimental results demonstrated the effectiveness of the proposed method,with an increase of 8.1%in mean average precision at intersection over union threshold of 0.5(mAP_(50))and an improvement of 8.4%in mAP@[0.5:.05:.95]respectively.Furthermore,extensive ablation studies revealed that the four detection heads,decoupled head design,anchor-free mechanism,and C3MaxViT contributed improvements of 2.4%,1.2%,2.6%,and 1.9%in mAP50,respectively. 展开更多
关键词 Computer vision Deep learning techniques vision Transformer Object detection Bridge visual inspection
在线阅读 下载PDF
Vision care and the sustainable development goals: a brief review and suggested research agenda
20
作者 Nathan Congdon Brad Wong +1 位作者 Xinxing Guo Graeme MacKenzie 《Eye Science》 2025年第2期103-110,共8页
Blindness affected 45 million people globally in 2021,and moderate to severe vision loss a further 295 million.[1]The most common causes,cataract and uncorrected refractive error,are generally the easiest to treat,and... Blindness affected 45 million people globally in 2021,and moderate to severe vision loss a further 295 million.[1]The most common causes,cataract and uncorrected refractive error,are generally the easiest to treat,and are among the most cost-effective procedures in all of medicine and international development.[1-2]Thus,vision impairment is both extremely common and,in principle,readily manageable. 展开更多
关键词 vision care CATARACT cost effective procedures uncorrected refractive error BLINDNESS moderate severe vision loss uncorrected refractive errorare sustainable development goals
暂未订购
上一页 1 2 250 下一页 到第
使用帮助 返回顶部