AIM:To evaluate the efficacy of the total computer vision syndrome questionnaire(CVS-Q)score as a predictive tool for identifying individuals with symptomatic binocular vision anomalies and refractive errors.METHODS:A...AIM:To evaluate the efficacy of the total computer vision syndrome questionnaire(CVS-Q)score as a predictive tool for identifying individuals with symptomatic binocular vision anomalies and refractive errors.METHODS:A total of 141 healthy computer users underwent comprehensive clinical visual function assessments,including evaluations of refractive errors,accommodation(amplitude of accommodation,positive relative accommodation,negative relative accommodation,accommodative accuracy,and accommodative facility),and vergence(phoria,positive and negative fusional vergence,near point of convergence,and vergence facility).Total CVS-Q scores were recorded to explore potential associations between symptom scores and the aforementioned clinical visual function parameters.RESULTS:The cohort included 54 males(38.3%)with a mean age of 23.9±0.58y and 87 age-matched females(61.7%)with a mean age of 23.9±0.53y.The multiple regression model was statistically significant[R²=0.60,F=13.28,degrees of freedom(DF=17122,P<0.001].This indicates that 60%of the variance in total CVS-Q scores(reflecting reported symptoms)could be explained by four clinical measurements:amplitude of accommodation,positive relative accommodation,exophoria at distance and near,and positive fusional vergence at near.CONCLUSION:The total CVS-Q score is a valid and reliable tool for predicting the presence of various nonstrabismic binocular vision anomalies and refractive errors in symptomatic computer users.展开更多
The rapid advancements in computer vision(CV)technology have transformed the traditional approaches to material microstructure analysis.This review outlines the history of CV and explores the applications of deep-lear...The rapid advancements in computer vision(CV)technology have transformed the traditional approaches to material microstructure analysis.This review outlines the history of CV and explores the applications of deep-learning(DL)-driven CV in four key areas of materials science:microstructure-based performance prediction,microstructure information generation,microstructure defect detection,and crystal structure-based property prediction.The CV has significantly reduced the cost of traditional experimental methods used in material performance prediction.Moreover,recent progress made in generating microstructure images and detecting microstructural defects using CV has led to increased efficiency and reliability in material performance assessments.The DL-driven CV models can accelerate the design of new materials with optimized performance by integrating predictions based on both crystal and microstructural data,thereby allowing for the discovery and innovation of next-generation materials.Finally,the review provides insights into the rapid interdisciplinary developments in the field of materials science and future prospects.展开更多
To overcome the limitations of low efficiency and reliance on manual processes in the measurement of geometric parameters for bridge prefabricated components,a method based on deep learning and computer vision is deve...To overcome the limitations of low efficiency and reliance on manual processes in the measurement of geometric parameters for bridge prefabricated components,a method based on deep learning and computer vision is developed to identify the geometric parameters.The study utilizes a common precast element for highway bridges as the research subject.First,edge feature points of the bridge component section are extracted from images of the precast component cross-sections by combining the Canny operator with mathematical morphology.Subsequently,a deep learning model is developed to identify the geometric parameters of the precast components using the extracted edge coordinates from the images as input and the predefined control parameters of the bridge section as output.A dataset is generated by varying the control parameters and noise levels for model training.Finally,field measurements are conducted to validate the accuracy of the developed method.The results indicate that the developed method effectively identifies the geometric parameters of bridge precast components,with an error rate maintained within 5%.展开更多
[Significance]In alignment with the national germplasm security strategy,current research efforts are accelerating the adoption of precision breeding in sheep.Within the whole-genome selection,accurate phenotyping of ...[Significance]In alignment with the national germplasm security strategy,current research efforts are accelerating the adoption of precision breeding in sheep.Within the whole-genome selection,accurate phenotyping of body morphometrics is critical for assessing growth performance and breeding value.Traditional manual measurements are inefficient,prone to human error,and may cause stress to sheep,limiting their suitability for precision sheep management.By summarizing the applications of sheep body size measurement technologies and analyzing their development directions,this paper provides theoretical references and practical guidance for the research and application of non contact sheep body size measurement.[Progress]This review synthesizes progress across three principal methodological paradigms:two-dimensional(2D)image-based techniques,three-dimensional(3D)point cloud-based approaches,and integrated 2D-3D fusion systems.2D methods,employing either handcrafted geometric features or deep learning-based keypoint detector algorithms,are cost-effective and operationally simple but sensitive to variation in imaging conditions and unable to capture critical circumference metrics.3D point-cloud approaches enable precise reconstruction of full animal morphology,supporting comprehensive body-size acquisition with higher accuracy,yet face challenges including high hardware costs,complex data workflows,and sensitivity to posture variability.Hybrid 2D-3D fusion systems combine semantic richness from RGB imagery with geometric completeness from point clouds.Having been effectively validated in other livestock specise,e.g.,cattle and pigs,these fusion systems have demonstrated excellent performance,providing important technical references and practical insights for sheep body size measurement.[Conclusions and Prospects]Firstly,future research should focus on constructing large-scale,high-quality datasets for sheep body size measurement that encompass diverse breeds,growth stages,and environmental conditions,thereby enhancing model robustness and generalization.Secondly,the development of lightweight artificial intelligence models is essential.Techniques such as model compression,quantization,and algorithmic optimization can substantially reduce computational complexity and storage requirements,facilitating deployment in resource-constrained environments.Thirdly,the 3D point cloud processing pipeline should be streamlined to improve the efficiency of data acquisition,filtering,registration,and segmentation,while promoting the integration of low-cost,high-resilience vision systems into practical farming scenarios.Fourthly,specific emphasis should be placed on improving the accuracy of curved-dimensional measurements,such as chest circumference,abdominal circumference,and shank circumference,through advances in pose standardization,refined 3D segmentation strategies,and multimodal data fusion.Finally,the cross-fertilization of sheep body size measurement technologies with analogous methods for other livestock species offers a promising pathway for mutual learning and collaborative innovation,accelerating the industrialization of automated sheep morphometric systems and supporting the development of intelligent,data-driven pasture management practices.展开更多
In the competitive retail industry of the digital era,data-driven insights into gender-specific customer behavior are essential.They support the optimization of store performance,layout design,product placement,and ta...In the competitive retail industry of the digital era,data-driven insights into gender-specific customer behavior are essential.They support the optimization of store performance,layout design,product placement,and targeted marketing.However,existing computer vision solutions often rely on facial recognition to gather such insights,raising significant privacy and ethical concerns.To address these issues,this paper presents a privacypreserving customer analytics system through two key strategies.First,we deploy a deep learning framework using YOLOv9s,trained on the RCA-TVGender dataset.Cameras are positioned perpendicular to observation areas to reduce facial visibility while maintaining accurate gender classification.Second,we apply AES-128 encryption to customer position data,ensuring secure access and regulatory compliance.Our system achieved overall performance,with 81.5%mAP@50,77.7%precision,and 75.7%recall.Moreover,a 90-min observational study confirmed the system’s ability to generate privacy-protected heatmaps revealing distinct behavioral patterns between male and female customers.For instance,women spent more time in certain areas and showed interest in different products.These results confirm the system’s effectiveness in enabling personalized layout and marketing strategies without compromising privacy.展开更多
critical for guiding treatment and improving patient outcomes.Traditional molecular subtyping via immuno-histochemistry(IHC)test is invasive,time-consuming,and may not fully represent tumor heterogeneity.This study pr...critical for guiding treatment and improving patient outcomes.Traditional molecular subtyping via immuno-histochemistry(IHC)test is invasive,time-consuming,and may not fully represent tumor heterogeneity.This study proposes a non-invasive approach using digital mammography images and deep learning algorithm for classifying breast cancer molecular subtypes.Four pretrained models,including two Convolutional Neural Networks(MobileNet_V3_Large and VGG-16)and two Vision Transformers(ViT_B_16 and ViT_Base_Patch16_Clip_224)were fine-tuned to classify images into HER2-enriched,Luminal,Normal-like,and Triple Negative subtypes.Hyperparameter tuning,including learning rate adjustment and layer freezing strategies,was applied to optimize performance.Among the evaluated models,ViT_Base_Patch16_Clip_224 achieved the highest test accuracy(94.44%),with equally high precision,recall,and F1-score of 0.94,demonstrating excellent generalization.MobileNet_V3_Large achieved the same accuracy but showed less training stability.In contrast,VGG-16 recorded the lowest performance,indicating a limitation in its generalizability for this classification task.The study also highlighted the superior performance of the Vision Transformer models over CNNs,particularly due to their ability to capture global contextual features and the benefit of CLIP-based pretraining in ViT_Base_Patch16_Clip_224.To enhance clinical applicability,a graphical user interface(GUI)named“BCMS Dx”was developed for streamlined subtype prediction.Deep learning applied to mammography has proven effective for accurate and non-invasive molecular subtyping.The proposed Vision Transformer-based model and supporting GUI offer a promising direction for augmenting diagnostic workflows,minimizing the need for invasive procedures,and advancing personalized breast cancer management.展开更多
Recent years have witnessed the ever-increasing performance of Deep Neural Networks(DNNs)in computer vision tasks.However,researchers have identified a potential vulnerability:carefully crafted adversarial examples ca...Recent years have witnessed the ever-increasing performance of Deep Neural Networks(DNNs)in computer vision tasks.However,researchers have identified a potential vulnerability:carefully crafted adversarial examples can easily mislead DNNs into incorrect behavior via the injection of imperceptible modification to the input data.In this survey,we focus on(1)adversarial attack algorithms to generate adversarial examples,(2)adversarial defense techniques to secure DNNs against adversarial examples,and(3)important problems in the realm of adversarial examples beyond attack and defense,including the theoretical explanations,trade-off issues and benign attacks in adversarial examples.Additionally,we draw a brief comparison between recently published surveys on adversarial examples,and identify the future directions for the research of adversarial examples,such as the generalization of methods and the understanding of transferability,that might be solutions to the open problems in this field.展开更多
To improve the safety of construction workers and help workers remotely control humanoid robots in construc-tion,this study designs and implements a computer vision based virtual construction simulation system.For thi...To improve the safety of construction workers and help workers remotely control humanoid robots in construc-tion,this study designs and implements a computer vision based virtual construction simulation system.For this pur-pose,human skeleton motion data are collected using a Ki-nect depth camera,and the obtained data are optimized via abnormal data elimination,smoothing,and normalization.MediaPipe extracts three-dimensional hand motion coordi-nates for accurate human posture tracking.Blender is used to build a virtual worker and site model,and the virtual worker motion is controlled based on the quaternion inverse kinematics algorithm while limiting the joint angle to en-hance the authenticity of motion simulation.Experimental results show that the system frame rate is stable at 60 frame/s,end-to-end delay is less than 20 ms,and virtual task comple-tion time is close to the real scene,verifying its engineering applicability.The proposed system can drive virtual work-ers to perform tasks and provide technical support for con-struction safety training.展开更多
Accurate estimation on the state of health(SOH)is essential for ensuring the safe and reliable operation of batteries.Traditional assessment methods primarily focus on electrical attributes for capacity decay,often ov...Accurate estimation on the state of health(SOH)is essential for ensuring the safe and reliable operation of batteries.Traditional assessment methods primarily focus on electrical attributes for capacity decay,often overlooking the impact of thermal distribution on battery aging.However,thermal effect is a critical factor for degradation process and associated risks throughout their service life.In this paper,we introduce a novel deep learning framework specially designed to estimate the capacity and thermal risks of lithium-ion batteries(LIBs).This model consists of two main components that leverage computer vision technology.One predicts battery capacity by integrating the advantages of thermal and electrical features using a temporal pattern attention(TPA)mechanism,while the other assesses thermal risk by incorporating temperature variation to provide early warnings of potential hazards.An infrared camera is deployed to record temperature evolution of LIBs during the electrochemical process.The thermal heterogeneities are recorded by infrared camera,and the corresponding temperature evolutions are extracted as representative features for analysis.The proposed model demonstrates high accuracy and stability,with an average root mean square error(RMSE)of 0.67% for capacity estimation and accuracy exceeding 93.9% for risk prediction,underscoring the importance of integrating spatial temperature distribution into battery health assessments.This work offers valuable insights for the development of intelligent and robust battery management systems.展开更多
The classification of seedlings is important to ensure the viability of seedlings after transplantation and is acknowledged as a key factor in forestation and environmental improvement. Based on numerous papers on aut...The classification of seedlings is important to ensure the viability of seedlings after transplantation and is acknowledged as a key factor in forestation and environmental improvement. Based on numerous papers on automatic seedling classification (ASC), the seedling grading theory, traditional grading methods, the background and the proceeding of ASC techniques are described. The automation of the measurement of seedling morphological characteristics by photoelectric meters and computer vision is studied, and the automatic methods of the current grading systems are described respectively. And the further researches on ASC by computer vision are proposed.展开更多
Variety identification is important for maize breeding, processing and trade. The computer vision technique has been widely applied to maize variety identification. In this paper, computer vision technique has been su...Variety identification is important for maize breeding, processing and trade. The computer vision technique has been widely applied to maize variety identification. In this paper, computer vision technique has been summarized from the following technical aspects including image acquisition, image processing, characteristic parameter extraction, pattern recognition and programming softwares. In addition, the existing problems during the application of this technique to maize variety identification have also been analyzed and its development tendency is forecasted.展开更多
With the development of image processing technology and computer, computer vision technology has been widely used in the production of agriculture,and has made many important achievements. This paper reviews its-resea...With the development of image processing technology and computer, computer vision technology has been widely used in the production of agriculture,and has made many important achievements. This paper reviews its-research progress on diagnosis of agricultural products, water diagnosis, weed identification,product quality testing and grading, agricultural picking and sorting and other as- pects, and finally put forward its existing problems and prospects for the future.展开更多
Damage detection is a key procedure in maintenance throughout structures′life cycles and post-disaster loss assessment.Due to the complex types of structural damages and the low efficiency and safety of manual detect...Damage detection is a key procedure in maintenance throughout structures′life cycles and post-disaster loss assessment.Due to the complex types of structural damages and the low efficiency and safety of manual detection,detecting damages with high efficiency and accuracy is the most popular research direction in civil engineering.Computer vision(CV)technology and deep learning(DL)algorithms are considered as promising tools to address the aforementioned challenges.The paper aims to systematically summarized the research and applications of DL-based CV technology in the field of damage detection in recent years.The basic concepts of DL-based CV technology are introduced first.The implementation steps of creating a damage detection dataset and some typical datasets are reviewed.CV-based structural damage detection algorithms are divided into three categories,namely,image classification-based(IC-based)algorithms,object detection-based(OD-based)algorithms,and semantic segmentation-based(SS-based)algorithms.Finally,the problems to be solved and future research directions are discussed.The foundation for promoting the deep integration of DL-based CV technology in structural damage detection and structural seismic damage identification has been laid.展开更多
The behavioral responses of a tilapia (Oreochromis niloticus) school to low (0.13 mg/L), moderate (0.79 mg/L) and high (2.65 mg/L) levels of unionized ammonia (UIA) concentration were monitored using a computer vision...The behavioral responses of a tilapia (Oreochromis niloticus) school to low (0.13 mg/L), moderate (0.79 mg/L) and high (2.65 mg/L) levels of unionized ammonia (UIA) concentration were monitored using a computer vision system. The swimming activity and geometrical parameters such as location of the gravity center and distribution of the fish school were calculated continuously. These behavioral parameters of tilapia school responded sensitively to moderate and high UIA concen-tration. Under high UIA concentration the fish activity showed a significant increase (P<0.05), exhibiting an avoidance reaction to high ammonia condition, and then decreased gradually. Under moderate and high UIA concentration the school’s vertical location had significantly large fluctuation (P<0.05) with the school moving up to the water surface then down to the bottom of the aquarium alternately and tending to crowd together. After several hours’ exposure to high UIA level, the school finally stayed at the aquarium bottom. These observations indicate that alterations in fish behavior under acute stress can provide important in-formation useful in predicting the stress.展开更多
The structure, function and working principle of JLUIV-3, which is a new typeof auto-mated guided vehicle (AGV) with computer vision, is described. The white stripe line withcertain width is used as inductive mark for...The structure, function and working principle of JLUIV-3, which is a new typeof auto-mated guided vehicle (AGV) with computer vision, is described. The white stripe line withcertain width is used as inductive mark for JLUIV-3 automated navigation. JULIV-3 can automaticallyrecognize the Arabic numeral codes which mark the multi-branch paths and multi-operation buffers,and autonomously select the correct path for destination. Compared with the traditional AGV, it hasmuch more navigation flexibility and less cost, and provides higher-level intelligence. Theidentification method of navigation path by using neural network and the optimal control method ofthe AGV are introduced in detail.展开更多
In recent years, aquaculture industry in China is developing rapidly, and especially, China has the largest aquaculture area and the most output in the world. In the past, traditional aquaculture mainly depended on ma...In recent years, aquaculture industry in China is developing rapidly, and especially, China has the largest aquaculture area and the most output in the world. In the past, traditional aquaculture mainly depended on manual labour to breed and gain aquatic organisms. However, with the increasing scale of production and the continuous improvement of science and technology, the traditional aquaculture approach has become more and more unsuitable for the development of the times. With the rapid development of computer technology, computer vision technology infiltrates through the traditional aquaculture industry quickly and improves the aquaculture production efficiency .This paper mainly introduces the basic situation of computer vision technology and summarizes the application of computer vision technology in aquaculture industry at home and abroad. The paper concludes with the expectation of application of computer vision in the aquaculture.展开更多
Tomato leaf diseases significantly reduce crop yield;therefore,early and accurate disease detection is required.Traditional detection methods are laborious and error-prone,particularly in large-scale farms,whereas exi...Tomato leaf diseases significantly reduce crop yield;therefore,early and accurate disease detection is required.Traditional detection methods are laborious and error-prone,particularly in large-scale farms,whereas existing hybrid deep learning models often face computational inefficiencies and poor generalization over diverse environmental and disease conditions.This study presents a unified U-Net-Vision Mamba Model with Hierarchical Bottleneck AttentionMechanism(U-net-Vim-HBAM),which integrates U-Net’s high-resolution segmentation,Vision Mamba’s efficient contextual processing,and a Hierarchical Bottleneck Attention Mechanism to address the challenges of disease detection accuracy,computational complexity,and efficiency in existing models.The model was trained on the Tomato Leaves and PlantVillage combined datasets from Kaggle and achieved 98.63% accuracy,98.24% precision,96.41% recall,and 97.31%F1 score,outperforming baselinemodels.Simulation tests demonstrated the model’s compatibility across devices with computational efficacy,ensuring its potential for integration into real-time mobile agricultural applications.The model’s adaptability to diverse datasets and conditions suggests that it is a versatile and high-precision instrument for disease management in agriculture,supporting sustainable agricultural practices.This offers a promising solution for crop health management and contributes to food security.展开更多
AIM:To evaluate the effects of refractive errors and binocular vision anomalies on the quality of life(QOL)of university students.METHODS:This cross-sectional analytical study was conducted on university students usin...AIM:To evaluate the effects of refractive errors and binocular vision anomalies on the quality of life(QOL)of university students.METHODS:This cross-sectional analytical study was conducted on university students using simple random sampling.Objective refraction,ocular alignment,vergence and accommodative performance were measured and assessed in all participants.Data on QOL were collected using the College of Optometrists in Vision Development-Quality of Life(COVD-QOL)Questionnaire.The effect of mentioned parameters on the QOL were evaluated.RESULTS:Totally 726 students with mean age of 21.35±1.88y were evaluated in this study,51.5%of whom were female.Esophoria was caused significantly lower QOL in the domains of somatic symptoms and occupationalphysical symptoms(P<0.05);Besides,esotropia decreased QOL in domains of somatic symptoms P=0.002 and psychological factors(P=0.023).Students with accommodation insufficiency experienced more symptoms in all domains(P<0.05)except for psychological factors(P=0.07).Increasing in the near point of convergence and accommodation and decreases QOL and increasing accommodative facility increases QOL(all P<0.05).Myopia and astigmatism cause decrease in QOL(both P<0.05),but hyperopic students had better QOL in comparison with others(P<0.05).CONCLUSION:Screening programs and treatment of refractive errors and binocular vision anomalies,especially phoria and accommodative insufficiency,positively impact the QOL and academic achievements of university students.展开更多
Spodoptera frugiperda(Lepidoptera:Noctuidae)is an important migratory agricultural pest worldwide,which has invaded many countries in the Old World since 2016 and now poses a serious threat to world food security.The ...Spodoptera frugiperda(Lepidoptera:Noctuidae)is an important migratory agricultural pest worldwide,which has invaded many countries in the Old World since 2016 and now poses a serious threat to world food security.The present monitoring and early warning strategies for the fall army worm(FAW)mainly focus on adult population density,but lack an information technology platform for precisely forecasting the reproductive dynamics of the adults.In this study,to identify the developmental status of the adults,we first utilized female ovarian images to extract and screen five features combined with the support vector machine(SVM)classifier and employed male testes images to obtain the testis circular features.Then,we established models for the relationship between oviposition dynamics and the developmental time of adult reproductive organs using laboratory tests.The results show that the accuracy of female ovary development stage determination reached 91%.The mean standard error(MSE)between the actual and predicted values of the ovarian developmental time was 0.2431,and the mean error rate between the actual and predicted values of the daily oviposition quantity was 12.38%.The error rate for the recognition of testis diameter was 3.25%,and the predicted and actual values of the testis developmental time in males had an MSE of 0.7734.A WeChat applet for identifying the reproductive developmental state and predicting reproduction of S.frugiperda was developed by integrating the above research results,and it is now available for use by anyone involved in plant protection.This study developed an automated method for accurately forecasting the reproductive dynamics of S.frugiperda populations,which can be helpful for the construction of a population monitoring and early warning system for use by both professional experts and local people at the county level.展开更多
Age-related Macular Degeneration(AMD)and Diabetic Macular Edema(DME)are two com-mon retinal diseases for elder people that may ultimately cause irreversible blindness.Timely and accurate diagnosis is essential for the...Age-related Macular Degeneration(AMD)and Diabetic Macular Edema(DME)are two com-mon retinal diseases for elder people that may ultimately cause irreversible blindness.Timely and accurate diagnosis is essential for the treatment of these diseases.In recent years,computer-aided diagnosis(CAD)has been deeply investigated and effectively used for rapid and early diagnosis.In this paper,we proposed a method of CAD using vision transformer to analyze optical co-herence tomography(OCT)images and to automatically discriminate AMD,DME,and normal eyes.A classification accuracy of 99.69%was achieved.After the model pruning,the recognition time reached 0.010 s and the classification accuracy did not drop.Compared with the Con-volutional Neural Network(CNN)image classification models(VGG16,Resnet50,Densenet121,and EfficientNet),vision transformer after pruning exhibited better recognition ability.Results show that vision transformer is an improved alternative to diagnose retinal diseases more accurately.展开更多
基金Supported by Ongoing Research Funding Program(ORFFT-2025-054-1),King Saud University,Riyadh,Saudi Arabia.
文摘AIM:To evaluate the efficacy of the total computer vision syndrome questionnaire(CVS-Q)score as a predictive tool for identifying individuals with symptomatic binocular vision anomalies and refractive errors.METHODS:A total of 141 healthy computer users underwent comprehensive clinical visual function assessments,including evaluations of refractive errors,accommodation(amplitude of accommodation,positive relative accommodation,negative relative accommodation,accommodative accuracy,and accommodative facility),and vergence(phoria,positive and negative fusional vergence,near point of convergence,and vergence facility).Total CVS-Q scores were recorded to explore potential associations between symptom scores and the aforementioned clinical visual function parameters.RESULTS:The cohort included 54 males(38.3%)with a mean age of 23.9±0.58y and 87 age-matched females(61.7%)with a mean age of 23.9±0.53y.The multiple regression model was statistically significant[R²=0.60,F=13.28,degrees of freedom(DF=17122,P<0.001].This indicates that 60%of the variance in total CVS-Q scores(reflecting reported symptoms)could be explained by four clinical measurements:amplitude of accommodation,positive relative accommodation,exophoria at distance and near,and positive fusional vergence at near.CONCLUSION:The total CVS-Q score is a valid and reliable tool for predicting the presence of various nonstrabismic binocular vision anomalies and refractive errors in symptomatic computer users.
基金financially supported by the National Science Fund for Distinguished Young Scholars,China(No.52025041)the National Natural Science Foundation of China(Nos.52450003,U2341267,and 52174294)+1 种基金the National Postdoctoral Program for Innovative Talents,China(No.BX20240437)the Fundamental Research Funds for the Central Universities,China(Nos.FRF-IDRY-23-037 and FRF-TP-20-02C2)。
文摘The rapid advancements in computer vision(CV)technology have transformed the traditional approaches to material microstructure analysis.This review outlines the history of CV and explores the applications of deep-learning(DL)-driven CV in four key areas of materials science:microstructure-based performance prediction,microstructure information generation,microstructure defect detection,and crystal structure-based property prediction.The CV has significantly reduced the cost of traditional experimental methods used in material performance prediction.Moreover,recent progress made in generating microstructure images and detecting microstructural defects using CV has led to increased efficiency and reliability in material performance assessments.The DL-driven CV models can accelerate the design of new materials with optimized performance by integrating predictions based on both crystal and microstructural data,thereby allowing for the discovery and innovation of next-generation materials.Finally,the review provides insights into the rapid interdisciplinary developments in the field of materials science and future prospects.
基金The National Natural Science Foundation of China(No.52338011,52378291)Young Elite Scientists Sponsorship Program by CAST(No.2022-2024QNRC0101).
文摘To overcome the limitations of low efficiency and reliance on manual processes in the measurement of geometric parameters for bridge prefabricated components,a method based on deep learning and computer vision is developed to identify the geometric parameters.The study utilizes a common precast element for highway bridges as the research subject.First,edge feature points of the bridge component section are extracted from images of the precast component cross-sections by combining the Canny operator with mathematical morphology.Subsequently,a deep learning model is developed to identify the geometric parameters of the precast components using the extracted edge coordinates from the images as input and the predefined control parameters of the bridge section as output.A dataset is generated by varying the control parameters and noise levels for model training.Finally,field measurements are conducted to validate the accuracy of the developed method.The results indicate that the developed method effectively identifies the geometric parameters of bridge precast components,with an error rate maintained within 5%.
文摘[Significance]In alignment with the national germplasm security strategy,current research efforts are accelerating the adoption of precision breeding in sheep.Within the whole-genome selection,accurate phenotyping of body morphometrics is critical for assessing growth performance and breeding value.Traditional manual measurements are inefficient,prone to human error,and may cause stress to sheep,limiting their suitability for precision sheep management.By summarizing the applications of sheep body size measurement technologies and analyzing their development directions,this paper provides theoretical references and practical guidance for the research and application of non contact sheep body size measurement.[Progress]This review synthesizes progress across three principal methodological paradigms:two-dimensional(2D)image-based techniques,three-dimensional(3D)point cloud-based approaches,and integrated 2D-3D fusion systems.2D methods,employing either handcrafted geometric features or deep learning-based keypoint detector algorithms,are cost-effective and operationally simple but sensitive to variation in imaging conditions and unable to capture critical circumference metrics.3D point-cloud approaches enable precise reconstruction of full animal morphology,supporting comprehensive body-size acquisition with higher accuracy,yet face challenges including high hardware costs,complex data workflows,and sensitivity to posture variability.Hybrid 2D-3D fusion systems combine semantic richness from RGB imagery with geometric completeness from point clouds.Having been effectively validated in other livestock specise,e.g.,cattle and pigs,these fusion systems have demonstrated excellent performance,providing important technical references and practical insights for sheep body size measurement.[Conclusions and Prospects]Firstly,future research should focus on constructing large-scale,high-quality datasets for sheep body size measurement that encompass diverse breeds,growth stages,and environmental conditions,thereby enhancing model robustness and generalization.Secondly,the development of lightweight artificial intelligence models is essential.Techniques such as model compression,quantization,and algorithmic optimization can substantially reduce computational complexity and storage requirements,facilitating deployment in resource-constrained environments.Thirdly,the 3D point cloud processing pipeline should be streamlined to improve the efficiency of data acquisition,filtering,registration,and segmentation,while promoting the integration of low-cost,high-resilience vision systems into practical farming scenarios.Fourthly,specific emphasis should be placed on improving the accuracy of curved-dimensional measurements,such as chest circumference,abdominal circumference,and shank circumference,through advances in pose standardization,refined 3D segmentation strategies,and multimodal data fusion.Finally,the cross-fertilization of sheep body size measurement technologies with analogous methods for other livestock species offers a promising pathway for mutual learning and collaborative innovation,accelerating the industrialization of automated sheep morphometric systems and supporting the development of intelligent,data-driven pasture management practices.
文摘In the competitive retail industry of the digital era,data-driven insights into gender-specific customer behavior are essential.They support the optimization of store performance,layout design,product placement,and targeted marketing.However,existing computer vision solutions often rely on facial recognition to gather such insights,raising significant privacy and ethical concerns.To address these issues,this paper presents a privacypreserving customer analytics system through two key strategies.First,we deploy a deep learning framework using YOLOv9s,trained on the RCA-TVGender dataset.Cameras are positioned perpendicular to observation areas to reduce facial visibility while maintaining accurate gender classification.Second,we apply AES-128 encryption to customer position data,ensuring secure access and regulatory compliance.Our system achieved overall performance,with 81.5%mAP@50,77.7%precision,and 75.7%recall.Moreover,a 90-min observational study confirmed the system’s ability to generate privacy-protected heatmaps revealing distinct behavioral patterns between male and female customers.For instance,women spent more time in certain areas and showed interest in different products.These results confirm the system’s effectiveness in enabling personalized layout and marketing strategies without compromising privacy.
基金funded by the Ministry of Higher Education(MoHE)Malaysia through the Fundamental Research Grant Scheme—Early Career Researcher(FRGS-EC),grant number FRGSEC/1/2024/ICT02/UNIMAP/02/8.
文摘critical for guiding treatment and improving patient outcomes.Traditional molecular subtyping via immuno-histochemistry(IHC)test is invasive,time-consuming,and may not fully represent tumor heterogeneity.This study proposes a non-invasive approach using digital mammography images and deep learning algorithm for classifying breast cancer molecular subtypes.Four pretrained models,including two Convolutional Neural Networks(MobileNet_V3_Large and VGG-16)and two Vision Transformers(ViT_B_16 and ViT_Base_Patch16_Clip_224)were fine-tuned to classify images into HER2-enriched,Luminal,Normal-like,and Triple Negative subtypes.Hyperparameter tuning,including learning rate adjustment and layer freezing strategies,was applied to optimize performance.Among the evaluated models,ViT_Base_Patch16_Clip_224 achieved the highest test accuracy(94.44%),with equally high precision,recall,and F1-score of 0.94,demonstrating excellent generalization.MobileNet_V3_Large achieved the same accuracy but showed less training stability.In contrast,VGG-16 recorded the lowest performance,indicating a limitation in its generalizability for this classification task.The study also highlighted the superior performance of the Vision Transformer models over CNNs,particularly due to their ability to capture global contextual features and the benefit of CLIP-based pretraining in ViT_Base_Patch16_Clip_224.To enhance clinical applicability,a graphical user interface(GUI)named“BCMS Dx”was developed for streamlined subtype prediction.Deep learning applied to mammography has proven effective for accurate and non-invasive molecular subtyping.The proposed Vision Transformer-based model and supporting GUI offer a promising direction for augmenting diagnostic workflows,minimizing the need for invasive procedures,and advancing personalized breast cancer management.
基金Supported by the National Natural Science Foundation of China(U1903214,62372339,62371350,61876135)the Ministry of Education Industry University Cooperative Education Project(202102246004,220800006041043,202002142012)the Fundamental Research Funds for the Central Universities(2042023kf1033)。
文摘Recent years have witnessed the ever-increasing performance of Deep Neural Networks(DNNs)in computer vision tasks.However,researchers have identified a potential vulnerability:carefully crafted adversarial examples can easily mislead DNNs into incorrect behavior via the injection of imperceptible modification to the input data.In this survey,we focus on(1)adversarial attack algorithms to generate adversarial examples,(2)adversarial defense techniques to secure DNNs against adversarial examples,and(3)important problems in the realm of adversarial examples beyond attack and defense,including the theoretical explanations,trade-off issues and benign attacks in adversarial examples.Additionally,we draw a brief comparison between recently published surveys on adversarial examples,and identify the future directions for the research of adversarial examples,such as the generalization of methods and the understanding of transferability,that might be solutions to the open problems in this field.
基金The Eighth National “Ten Thousand Talents Plan for Top Young Talents” of Chinathe National Natural Science Foundation of China (No. 52478117, 52378120)。
文摘To improve the safety of construction workers and help workers remotely control humanoid robots in construc-tion,this study designs and implements a computer vision based virtual construction simulation system.For this pur-pose,human skeleton motion data are collected using a Ki-nect depth camera,and the obtained data are optimized via abnormal data elimination,smoothing,and normalization.MediaPipe extracts three-dimensional hand motion coordi-nates for accurate human posture tracking.Blender is used to build a virtual worker and site model,and the virtual worker motion is controlled based on the quaternion inverse kinematics algorithm while limiting the joint angle to en-hance the authenticity of motion simulation.Experimental results show that the system frame rate is stable at 60 frame/s,end-to-end delay is less than 20 ms,and virtual task comple-tion time is close to the real scene,verifying its engineering applicability.The proposed system can drive virtual work-ers to perform tasks and provide technical support for con-struction safety training.
基金financial support of the Fundamental Research Funds for the Central Universities(SCU2023HGXY)Special Research Funds for Intelligent Battery Cell Multidimensional Signal Sensing Technology Project from Huawei Technologies Co.Ltd.(24H1117)。
文摘Accurate estimation on the state of health(SOH)is essential for ensuring the safe and reliable operation of batteries.Traditional assessment methods primarily focus on electrical attributes for capacity decay,often overlooking the impact of thermal distribution on battery aging.However,thermal effect is a critical factor for degradation process and associated risks throughout their service life.In this paper,we introduce a novel deep learning framework specially designed to estimate the capacity and thermal risks of lithium-ion batteries(LIBs).This model consists of two main components that leverage computer vision technology.One predicts battery capacity by integrating the advantages of thermal and electrical features using a temporal pattern attention(TPA)mechanism,while the other assesses thermal risk by incorporating temperature variation to provide early warnings of potential hazards.An infrared camera is deployed to record temperature evolution of LIBs during the electrochemical process.The thermal heterogeneities are recorded by infrared camera,and the corresponding temperature evolutions are extracted as representative features for analysis.The proposed model demonstrates high accuracy and stability,with an average root mean square error(RMSE)of 0.67% for capacity estimation and accuracy exceeding 93.9% for risk prediction,underscoring the importance of integrating spatial temperature distribution into battery health assessments.This work offers valuable insights for the development of intelligent and robust battery management systems.
基金This paper was supported by National Natural Science Foundation of China (Grant No. 39670607).
文摘The classification of seedlings is important to ensure the viability of seedlings after transplantation and is acknowledged as a key factor in forestation and environmental improvement. Based on numerous papers on automatic seedling classification (ASC), the seedling grading theory, traditional grading methods, the background and the proceeding of ASC techniques are described. The automation of the measurement of seedling morphological characteristics by photoelectric meters and computer vision is studied, and the automatic methods of the current grading systems are described respectively. And the further researches on ASC by computer vision are proposed.
基金Special Fund for Science & Technology Research of Education Commission,Chongqing(KJ101302)~~
文摘Variety identification is important for maize breeding, processing and trade. The computer vision technique has been widely applied to maize variety identification. In this paper, computer vision technique has been summarized from the following technical aspects including image acquisition, image processing, characteristic parameter extraction, pattern recognition and programming softwares. In addition, the existing problems during the application of this technique to maize variety identification have also been analyzed and its development tendency is forecasted.
文摘With the development of image processing technology and computer, computer vision technology has been widely used in the production of agriculture,and has made many important achievements. This paper reviews its-research progress on diagnosis of agricultural products, water diagnosis, weed identification,product quality testing and grading, agricultural picking and sorting and other as- pects, and finally put forward its existing problems and prospects for the future.
基金National Key R&D Program of China under Grant No.2017YFC1500606,National Natural Science Foundation of China under Grant No.52020105002Heilongjiang Touyan Innovation Team Program。
文摘Damage detection is a key procedure in maintenance throughout structures′life cycles and post-disaster loss assessment.Due to the complex types of structural damages and the low efficiency and safety of manual detection,detecting damages with high efficiency and accuracy is the most popular research direction in civil engineering.Computer vision(CV)technology and deep learning(DL)algorithms are considered as promising tools to address the aforementioned challenges.The paper aims to systematically summarized the research and applications of DL-based CV technology in the field of damage detection in recent years.The basic concepts of DL-based CV technology are introduced first.The implementation steps of creating a damage detection dataset and some typical datasets are reviewed.CV-based structural damage detection algorithms are divided into three categories,namely,image classification-based(IC-based)algorithms,object detection-based(OD-based)algorithms,and semantic segmentation-based(SS-based)algorithms.Finally,the problems to be solved and future research directions are discussed.The foundation for promoting the deep integration of DL-based CV technology in structural damage detection and structural seismic damage identification has been laid.
基金Project (Nos. 2001AA620104 and 2003AA603140) supported by theHi-Tech Research and Development Program (863) of China
文摘The behavioral responses of a tilapia (Oreochromis niloticus) school to low (0.13 mg/L), moderate (0.79 mg/L) and high (2.65 mg/L) levels of unionized ammonia (UIA) concentration were monitored using a computer vision system. The swimming activity and geometrical parameters such as location of the gravity center and distribution of the fish school were calculated continuously. These behavioral parameters of tilapia school responded sensitively to moderate and high UIA concen-tration. Under high UIA concentration the fish activity showed a significant increase (P<0.05), exhibiting an avoidance reaction to high ammonia condition, and then decreased gradually. Under moderate and high UIA concentration the school’s vertical location had significantly large fluctuation (P<0.05) with the school moving up to the water surface then down to the bottom of the aquarium alternately and tending to crowd together. After several hours’ exposure to high UIA level, the school finally stayed at the aquarium bottom. These observations indicate that alterations in fish behavior under acute stress can provide important in-formation useful in predicting the stress.
基金This project is supported by National Natural Science Foundation of China(No.50175046) Technology Foundation of Education Ministry of China(No.00037).
文摘The structure, function and working principle of JLUIV-3, which is a new typeof auto-mated guided vehicle (AGV) with computer vision, is described. The white stripe line withcertain width is used as inductive mark for JLUIV-3 automated navigation. JULIV-3 can automaticallyrecognize the Arabic numeral codes which mark the multi-branch paths and multi-operation buffers,and autonomously select the correct path for destination. Compared with the traditional AGV, it hasmuch more navigation flexibility and less cost, and provides higher-level intelligence. Theidentification method of navigation path by using neural network and the optimal control method ofthe AGV are introduced in detail.
文摘In recent years, aquaculture industry in China is developing rapidly, and especially, China has the largest aquaculture area and the most output in the world. In the past, traditional aquaculture mainly depended on manual labour to breed and gain aquatic organisms. However, with the increasing scale of production and the continuous improvement of science and technology, the traditional aquaculture approach has become more and more unsuitable for the development of the times. With the rapid development of computer technology, computer vision technology infiltrates through the traditional aquaculture industry quickly and improves the aquaculture production efficiency .This paper mainly introduces the basic situation of computer vision technology and summarizes the application of computer vision technology in aquaculture industry at home and abroad. The paper concludes with the expectation of application of computer vision in the aquaculture.
文摘Tomato leaf diseases significantly reduce crop yield;therefore,early and accurate disease detection is required.Traditional detection methods are laborious and error-prone,particularly in large-scale farms,whereas existing hybrid deep learning models often face computational inefficiencies and poor generalization over diverse environmental and disease conditions.This study presents a unified U-Net-Vision Mamba Model with Hierarchical Bottleneck AttentionMechanism(U-net-Vim-HBAM),which integrates U-Net’s high-resolution segmentation,Vision Mamba’s efficient contextual processing,and a Hierarchical Bottleneck Attention Mechanism to address the challenges of disease detection accuracy,computational complexity,and efficiency in existing models.The model was trained on the Tomato Leaves and PlantVillage combined datasets from Kaggle and achieved 98.63% accuracy,98.24% precision,96.41% recall,and 97.31%F1 score,outperforming baselinemodels.Simulation tests demonstrated the model’s compatibility across devices with computational efficacy,ensuring its potential for integration into real-time mobile agricultural applications.The model’s adaptability to diverse datasets and conditions suggests that it is a versatile and high-precision instrument for disease management in agriculture,supporting sustainable agricultural practices.This offers a promising solution for crop health management and contributes to food security.
文摘AIM:To evaluate the effects of refractive errors and binocular vision anomalies on the quality of life(QOL)of university students.METHODS:This cross-sectional analytical study was conducted on university students using simple random sampling.Objective refraction,ocular alignment,vergence and accommodative performance were measured and assessed in all participants.Data on QOL were collected using the College of Optometrists in Vision Development-Quality of Life(COVD-QOL)Questionnaire.The effect of mentioned parameters on the QOL were evaluated.RESULTS:Totally 726 students with mean age of 21.35±1.88y were evaluated in this study,51.5%of whom were female.Esophoria was caused significantly lower QOL in the domains of somatic symptoms and occupationalphysical symptoms(P<0.05);Besides,esotropia decreased QOL in domains of somatic symptoms P=0.002 and psychological factors(P=0.023).Students with accommodation insufficiency experienced more symptoms in all domains(P<0.05)except for psychological factors(P=0.07).Increasing in the near point of convergence and accommodation and decreases QOL and increasing accommodative facility increases QOL(all P<0.05).Myopia and astigmatism cause decrease in QOL(both P<0.05),but hyperopic students had better QOL in comparison with others(P<0.05).CONCLUSION:Screening programs and treatment of refractive errors and binocular vision anomalies,especially phoria and accommodative insufficiency,positively impact the QOL and academic achievements of university students.
基金supported by the National Natural Science Foundation of China(31727901)the National Key R&D Program of China(2021YFD1400702)the Science and Technology Innovation Program of the Chinese Academy of Agricultural Sciences.
文摘Spodoptera frugiperda(Lepidoptera:Noctuidae)is an important migratory agricultural pest worldwide,which has invaded many countries in the Old World since 2016 and now poses a serious threat to world food security.The present monitoring and early warning strategies for the fall army worm(FAW)mainly focus on adult population density,but lack an information technology platform for precisely forecasting the reproductive dynamics of the adults.In this study,to identify the developmental status of the adults,we first utilized female ovarian images to extract and screen five features combined with the support vector machine(SVM)classifier and employed male testes images to obtain the testis circular features.Then,we established models for the relationship between oviposition dynamics and the developmental time of adult reproductive organs using laboratory tests.The results show that the accuracy of female ovary development stage determination reached 91%.The mean standard error(MSE)between the actual and predicted values of the ovarian developmental time was 0.2431,and the mean error rate between the actual and predicted values of the daily oviposition quantity was 12.38%.The error rate for the recognition of testis diameter was 3.25%,and the predicted and actual values of the testis developmental time in males had an MSE of 0.7734.A WeChat applet for identifying the reproductive developmental state and predicting reproduction of S.frugiperda was developed by integrating the above research results,and it is now available for use by anyone involved in plant protection.This study developed an automated method for accurately forecasting the reproductive dynamics of S.frugiperda populations,which can be helpful for the construction of a population monitoring and early warning system for use by both professional experts and local people at the county level.
基金This work was supported by the Science and Technology innovation project of Shanghai Science and Technology Commission(19441905800)the Natural National Science Foundation of China(62175156,81827807,8210041176,82101177,61675134)+1 种基金the Project of State Key Laboratory of Ophthalmology,Optometry and Visual Science,Wenzhou Medical University(K181002)the Key R&D Program Projects in Zhejiang Province(2019C03045).
文摘Age-related Macular Degeneration(AMD)and Diabetic Macular Edema(DME)are two com-mon retinal diseases for elder people that may ultimately cause irreversible blindness.Timely and accurate diagnosis is essential for the treatment of these diseases.In recent years,computer-aided diagnosis(CAD)has been deeply investigated and effectively used for rapid and early diagnosis.In this paper,we proposed a method of CAD using vision transformer to analyze optical co-herence tomography(OCT)images and to automatically discriminate AMD,DME,and normal eyes.A classification accuracy of 99.69%was achieved.After the model pruning,the recognition time reached 0.010 s and the classification accuracy did not drop.Compared with the Con-volutional Neural Network(CNN)image classification models(VGG16,Resnet50,Densenet121,and EfficientNet),vision transformer after pruning exhibited better recognition ability.Results show that vision transformer is an improved alternative to diagnose retinal diseases more accurately.