期刊文献+
共找到38,227篇文章
< 1 2 250 >
每页显示 20 50 100
AARPose:Real-time and accurate drogue pose measurement based on monocular vision for autonomous aerial refueling
1
作者 Shuyuan WEN Yang GAO +3 位作者 Bingrui HU Zhongyu LUO Zhenzhong WEI Guangjun ZHANG 《Chinese Journal of Aeronautics》 2025年第6期552-572,共21页
Real-time and accurate drogue pose measurement during docking is basic and critical for Autonomous Aerial Refueling(AAR).Vision measurement is the best practicable technique,but its measurement accuracy and robustness... Real-time and accurate drogue pose measurement during docking is basic and critical for Autonomous Aerial Refueling(AAR).Vision measurement is the best practicable technique,but its measurement accuracy and robustness are easily affected by limited computing power of airborne equipment,complex aerial scenes and partial occlusion.To address the above challenges,we propose a novel drogue keypoint detection and pose measurement algorithm based on monocular vision,and realize real-time processing on airborne embedded devices.Firstly,a lightweight network is designed with structural re-parameterization to reduce computational cost and improve inference speed.And a sub-pixel level keypoints prediction head and loss functions are adopted to improve keypoint detection accuracy.Secondly,a closed-form solution of drogue pose is computed based on double spatial circles,followed by a nonlinear refinement based on Levenberg-Marquardt optimization.Both virtual simulation and physical simulation experiments have been used to test the proposed method.In the virtual simulation,the mean pixel error of the proposed method is 0.787 pixels,which is significantly superior to that of other methods.In the physical simulation,the mean relative measurement error is 0.788%,and the mean processing time is 13.65 ms on embedded devices. 展开更多
关键词 Autonomous aerial refueling vision measurement Deep learning REAL-TIME LIGHTWEIGHT ACCURATE monocular vision Drogue pose measurement
原文传递
Monocular Vision-based Two-stage Iterative Algorithm for Relative Position and Attitude Estimation of Docking Spacecraft 被引量:7
2
作者 张世杰 刘峰华 +1 位作者 曹喜滨 贺亮 《Chinese Journal of Aeronautics》 SCIE EI CAS CSCD 2010年第2期204-210,共7页
Visual sensors are used to measure the relative state of the chaser spacecraft to the target spacecraft during close range ren- dezvous phases. This article proposes a two-stage iterative algorithm based on an inverse... Visual sensors are used to measure the relative state of the chaser spacecraft to the target spacecraft during close range ren- dezvous phases. This article proposes a two-stage iterative algorithm based on an inverse projection ray approach to address the relative position and attitude estimation by using feature points and monocular vision. It consists of two stages: absolute orienta- tion and depth recovery. In the first stage, Umeyama's algorithm is used to fit the three-dimensional (3D) model set and estimate the 3D point set while in the second stage, the depths of the observed feature points are estimated. This procedure is repeated until the result converges. Moreover, the effectiveness and convergence of the proposed algorithm are verified through theoreti- cal analysis and mathematical simulation. 展开更多
关键词 SPACECRAFT relative position and attitude monocular vision depth recovery absolute orientation
原文传递
Design of a road vehicle detection system based on monocular vision 被引量:5
3
作者 王海 张为公 蔡英凤 《Journal of Southeast University(English Edition)》 EI CAS 2011年第2期169-173,共5页
In order to decrease vehicle crashes, a new rear view vehicle detection system based on monocular vision is designed. First, a small and flexible hardware platform based on a DM642 digtal signal processor (DSP) micr... In order to decrease vehicle crashes, a new rear view vehicle detection system based on monocular vision is designed. First, a small and flexible hardware platform based on a DM642 digtal signal processor (DSP) micro-controller is built. Then, a two-step vehicle detection algorithm is proposed. In the first step, a fast vehicle edge and symmetry fusion algorithm is used and a low threshold is set so that all the possible vehicles have a nearly 100% detection rate (TP) and the non-vehicles have a high false detection rate (FP), i. e., all the possible vehicles can be obtained. In the second step, a classifier using a probabilistic neural network (PNN) which is based on multiple scales and an orientation Gabor feature is trained to classify the possible vehicles and eliminate the false detected vehicles from the candidate vehicles generated in the first step. Experimental results demonstrate that the proposed system maintains a high detection rate and a low false detection rate under different road, weather and lighting conditions. 展开更多
关键词 vehicle detection monocular vision edge andsymmetry fusion Gabor feature PNN network
在线阅读 下载PDF
Mobile Robot Localization and Navigation System Based on Monocular Vision 被引量:2
4
作者 贾云伟 刘铁根 +1 位作者 高丽兰 王聃 《Transactions of Tianjin University》 EI CAS 2012年第5期335-342,共8页
A system for mobile robot localization and navigation was presented.With the proposed system,the robot can be located and navigated by a single landmark in a single image.And the navigation mode may be following-track... A system for mobile robot localization and navigation was presented.With the proposed system,the robot can be located and navigated by a single landmark in a single image.And the navigation mode may be following-track,teaching and playback,or programming.The basic idea is that the system computes the differences between the expected and the recognized position at each time and then controls the robot in a direction to reduce those differences.To minimize the robot sensor equipment,only one omnidirectional camera was used.Experiments in disturbing environments show that the presented algorithm is robust and easy to implement,without camera rectification.The rootmean-square error(RMSE) of localization is 1.4,cm,and the navigation error in teaching and playback is within 10,cm. 展开更多
关键词 localization algorithm NAVIGATION OMNI-vision monocular vision
在线阅读 下载PDF
Research on Vehicle Anti-collision Technique Based on Monocular Vision
5
作者 LU Weiwei XIAO Zhitao LEI Meilin WU Jun 《Semiconductor Photonics and Technology》 CAS 2010年第1期47-52,共6页
Vehicle anti-collision technique is a hot topic in the research area of Intelligent Transport System. The research on preceding vehicles detection and the distance measurement, which are the key techniques, makes grea... Vehicle anti-collision technique is a hot topic in the research area of Intelligent Transport System. The research on preceding vehicles detection and the distance measurement, which are the key techniques, makes great contributions to safe-driving. This paper presents a method which can be used to detect preceding vehicles and get the distance between own car and the car ahead. Firstly, an adaptive threshold method is used to get shadow feature, and a shadow!area merging approach is used to deal with the distortion of the shadow border. Region of interest(ROI) is obtained using shadow feature. Then in the ROI, symmetry feature is analyzed to verify whether there are vehicles and to locate the vehicles. Finally, using monocular vision distance measurement based on camera interior parameters and geometrical reasoning, we get the distance between own car and the preceding one. Experimental results show that the proposed method can detect the preceding vehicle effectively and get the distance between vehicles accurately. 展开更多
关键词 monocular vision shadow feature symmetry feature monocular measurement of distance
在线阅读 下载PDF
Real-time drogue recognition and 3D locating for UAV autonomous aerial refueling based on monocular machine vision 被引量:17
6
作者 Wang Xufeng Kong Xingwei +2 位作者 Zhi Jianhui Chen Yong Dong Xinmin 《Chinese Journal of Aeronautics》 SCIE EI CAS CSCD 2015年第6期1667-1675,共9页
Drogue recognition and 3D locating is a key problem during the docking phase of the autonomous aerial refueling (AAR). To solve this problem, a novel and effective method based on monocular vision is presented in th... Drogue recognition and 3D locating is a key problem during the docking phase of the autonomous aerial refueling (AAR). To solve this problem, a novel and effective method based on monocular vision is presented in this paper. Firstly, by employing computer vision with red-ring-shape feature, a drogue detection and recognition algorithm is proposed to guarantee safety and ensure the robustness to the drogue diversity and the changes in environmental condi- tions, without using a set of infrared light emitting diodes (LEDs) on the parachute part of the dro- gue. Secondly, considering camera lens distortion, a monocular vision measurement algorithm for drogue 3D locating is designed to ensure the accuracy and real-time performance of the system, with the drogue attitude provided. Finally, experiments are conducted to demonstrate the effective- ness of the proposed method. Experimental results show the performances of the entire system in contrast with other methods, which validates that the proposed method can recognize and locate the drogue three dimensionally, rapidly and precisely. 展开更多
关键词 Autonomous aerial refueling Drogue 3D locating Drogue attitudemeasurement Drogue detection Drogue recognition monocular machine vision
原文传递
A New Monocular Vision Measurement Method to Estimate 3D Positions of Objects on Floor 被引量:3
7
作者 Ling-Yi Xu Zhi-Qiang Cao +1 位作者 Peng Zhao Chao Zhou 《International Journal of Automation and computing》 EI CSCD 2017年第2期159-168,共10页
A new visual measurement method is proposed to estimate three-dimensional (3D) position of the object on the floor based on a single camera. The camera fixed on a robot is in an inclined position with respect to the... A new visual measurement method is proposed to estimate three-dimensional (3D) position of the object on the floor based on a single camera. The camera fixed on a robot is in an inclined position with respect to the floor. A measurement model with the camera's extrinsic parameters such as the height and pitch angle is described. Single image of a chessboard pattern placed on the floor is enough to calibrate the camera's extrinsic parameters after the camera's intrinsic parameters are calibrated. Then the position of object on the floor can be computed with the measurement model. Furthermore, the height of object can be calculated with the paired-points in the vertical line sharing the same position on the floor. Compared to the conventional method used to estimate the positions on the plane, this method can obtain the 3D positions. The indoor experiment testifies the accuracy and validity of the proposed method. 展开更多
关键词 Visual measurement calibration localization position estimation monocular vision.
原文传递
Mobile Robot Hierarchical Simultaneous Localization and Mapping Using Monocular Vision 被引量:1
8
作者 厉茂海 洪炳熔 罗荣华 《Journal of Shanghai Jiaotong university(Science)》 EI 2007年第6期765-772,共8页
A hierarchical mobile robot simultaneous localization and mapping (SLAM) method that allows us to obtain accurate maps was presented. The local map level is composed of a set of local metric feature maps that are guar... A hierarchical mobile robot simultaneous localization and mapping (SLAM) method that allows us to obtain accurate maps was presented. The local map level is composed of a set of local metric feature maps that are guaranteed to be statistically independent. The global level is a topological graph whose arcs are labeled with the relative location between local maps. An estimation of these relative locations is maintained with local map alignment algorithm, and more accurate estimation is calculated through a global minimization procedure using the loop closure constraint. The local map is built with Rao-Blackwellised particle filter (RBPF), where the particle filter is used to extending the path posterior by sampling new poses. The landmark position estimation and update is implemented through extended Kalman filter (EKF). Monocular vision mounted on the robot tracks the 3D natural point landmarks, which are structured with matching scale invariant feature transform (SIFT) feature pairs. The matching for multi-dimension SIFT features is implemented with a KD-tree in the time cost of O(lbN). Experiment results on Pioneer mobile robot in a real indoor environment show the superior performance of our proposed method. 展开更多
关键词 mobile robot HIERARCHICAL simultaneous localization and mapping (SLAM) Rao-Blackwellised particle filter (RBPF) monocular vision scale INVARIANT feature TRANSFORM
在线阅读 下载PDF
Calibration of laser beam direction based on monocular vision 被引量:3
9
作者 WANG Zhong YANG Tong-yu +2 位作者 WANG Lei FU Lu-hua LIU Chang-jie 《Journal of Measurement Science and Instrumentation》 CAS CSCD 2017年第4期354-363,共10页
In the laser displacement sensors measurement system,the laser beam direction is an important parameter.Particularly,the azimuth and pitch angles are the most important parameters to a laser beam.In this paper,based o... In the laser displacement sensors measurement system,the laser beam direction is an important parameter.Particularly,the azimuth and pitch angles are the most important parameters to a laser beam.In this paper,based on monocular vision,a laser beam direction measurement method is proposed.First,place the charge coupled device(CCD)camera above the base plane,and adjust and fix the camera position so that the optical axis is nearly perpendicular to the base plane.The monocular vision localization model is established by using circular aperture calibration board.Then the laser beam generating device is placed and maintained on the base plane at fixed position.At the same time a special target block is placed on the base plane so that the laser beam can project to the special target and form a laser spot.The CCD camera placed above the base plane can acquire the laser spot and the image of the target block clearly,so the two-dimensional(2D)image coordinate of the centroid of the laser spot can be extracted by correlation algorithm.The target is moved at an equal distance along the laser beam direction,and the spots and target images of each moving under the current position are collected by the CCD camera.By using the relevant transformation formula and combining the intrinsic parameters of the target block,the2D coordinates of the gravity center of the spot are converted to the three-dimensional(3D)coordinate in the base plane.Because of the moving of the target,the3D coordinates of the gravity center of the laser spot at different positions are obtained,and these3D coordinates are synthesized into a space straight line to represent the laser beam to be measured.In the experiment,the target parameters are measured by high-precision instruments,and the calibration parameters of the camera are calibrated by a high-precision calibration board to establish the corresponding positioning model.The measurement accuracy is mainly guaranteed by the monocular vision positioning accuracy and the gravity center extraction accuracy.The experimental results show the maximum error of the angle between laser beams reaches to0.04°and the maximum error of beam pitch angle reaches to0.02°. 展开更多
关键词 monocular vision laser beam direction coordinate transformation laser displacement sensor
在线阅读 下载PDF
Monocular Vision Based Boundary Avoidance for Non-Invasive Stray Control System for Cattle: A Conceptual Approach
10
作者 Adeniran Ishola Oluwaranti Seun Ayeni 《Journal of Sensor Technology》 2015年第3期63-71,共9页
Building fences to manage the cattle grazing can be very expensive;cost inefficient. These do not provide dynamic control over the area in which the cattle are grazing. Existing virtual fencing techniques for the cont... Building fences to manage the cattle grazing can be very expensive;cost inefficient. These do not provide dynamic control over the area in which the cattle are grazing. Existing virtual fencing techniques for the control of herds of cattle, based on polygon coordinate definition of boundaries is limited in the area of land mass coverage and dynamism. This work seeks to develop a more robust and an improved monocular vision based boundary avoidance for non-invasive stray control system for cattle, with a view to increase land mass coverage in virtual fencing techniques and dynamism. The monocular vision based depth estimation will be modeled using concept of global Fourier Transform (FT) and local Wavelet Transform (WT) of image structure of scenes (boundaries). The magnitude of the global Fourier Transform gives the dominant orientations and textual patterns of the image;while the local Wavelet Transform gives the dominant spectral features of the image and their spatial distribution. Each scene picture or image is defined by features v, which contain the set of global (FT) and local (WT) statistics of the image. Scenes or boundaries distances are given by estimating the depth D by means of the image features v. Sound cues of intensity equivalent to the magnitude of the depth D are applied to the animal ears as stimuli. This brings about the desired control as animals tend to move away from uncomfortable sounds. 展开更多
关键词 monocular vision Control Systems Global POSITIONING System Wireless Sensor Networks Depth Estimation
暂未订购
Monocular Dynamic Machine Vision-Based Pearl Shape Detection
11
作者 WANG Yuzong DENG Fei +3 位作者 ZHAO Daxu YE Jiaying WANG Peixin SHOU Guozhong 《Journal of Shanghai Jiaotong university(Science)》 EI 2019年第5期654-662,共9页
In terms of the requirement of automatically sorting pearls, the pearl contour feature extraction and shape recognition algorithm are studied in this paper to reckon with the rapid identification of pearls shape onlin... In terms of the requirement of automatically sorting pearls, the pearl contour feature extraction and shape recognition algorithm are studied in this paper to reckon with the rapid identification of pearls shape online,and a monocular dynamic machine vision-based pearl shape detection device is designed. Through blowing, the pearl is suspended in a funnel shaped container and flipped rapidly in the device. The entire surface image of the pearl to be measured can be promptly grasped by the camera placed right above the funnel. The results of illumination experiments conducted from different angles indicate that the image contour acquired by the medium angle illumination is better extracted. The pearl shape test indicates that the method is incorporated with the inflatable suspension device to classify the pearls into seven types according to the national standard,and additionally the average error rate is confined under 5.38%. The shape characteristic of the pearl can be detected promptly and reliably, and accordingly the high-speed automatic sorting can be satisfied. 展开更多
关键词 PEARL machine vision monocular dynamic SUSPENSION shape characteristics
原文传递
AB012. The effects of monocular deprivation do not accumulate across days in adults with normal vision
12
作者 Seung Hyun Min Alex S.Baldwin Robert F.Hess 《Annals of Eye Science》 2019年第1期187-187,共1页
Background:We investigate whether changes in visual plasticity induced by monocular deprivation can be maintained across multiple days.It has been known that monocular deprivation strengthens the deprived eye in adult... Background:We investigate whether changes in visual plasticity induced by monocular deprivation can be maintained across multiple days.It has been known that monocular deprivation strengthens the deprived eye in adults with normal vision for a short period of time(30-60 minutes).This has been shown through a variety of visual tasks such as binocular combination and rivalry.Methods:Ten subjects were recruited and patched for five consecutive days for two hours.We used a binocular phase combination task to measure the subjects’sensory eye balances.We initially measured their baseline of sensory eye balance,patched their dominant eye,and then conducted post-patching measurements at 0,3,6,12,24 and 48 minutes after patching.Results:We performed a 2-way ANOVA(Before vs.after patching×Day);we found that although the effect of monocular deprivation on the deprived eye was significant,F(1,9)=17.32,P=0.002,the effect of Day was not.Conclusions:Hence we found no accumulation of the patching effect across five days in healthy adults.This suggests that the degree of remnant neural plasticity in adult primary visual cortex may be too limited to be exploited therapeutically. 展开更多
关键词 monocular deprivation neural plasticity binocular vision
在线阅读 下载PDF
Autonomous Landing of Small Unmanned Aerial Rotorcraft Based on Monocular Vision in GPS-denied Area 被引量:6
13
作者 Cunxiao Miao Jingjing Li 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI 2015年第1期109-114,共6页
Focusing on the low-precision attitude of a current small unmanned aerial rotorcraft at the landing stage, the present paper proposes a new attitude control method for the GPS-denied scenario based on the monocular vi... Focusing on the low-precision attitude of a current small unmanned aerial rotorcraft at the landing stage, the present paper proposes a new attitude control method for the GPS-denied scenario based on the monocular vision. Primarily, a robust landmark detection technique is developed which leverages the well-documented merits of supporting vector machines (SVMs) to enable landmark detection. Then an algorithm of nonlinear optimization based on Newton iteration method for the attitude and position of camera is put forward to reduce the projection error and get an optimized solution. By introducing the wavelet analysis into the adaptive Kalman filter, the high frequency noise of vision is filtered out successfully. At last, automatic landing tests are performed to verify the method's feasibility and effectiveness. © 2014 Chinese Association of Automation. 展开更多
关键词 AIRCRAFT Attitude control Helicopter rotors LANDING Nonlinear programming Rotors vision Wavelet analysis
在线阅读 下载PDF
Depth-Guided Vision Transformer With Normalizing Flows for Monocular 3D Object Detection 被引量:2
14
作者 Cong Pan Junran Peng Zhaoxiang Zhang 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第3期673-689,共17页
Monocular 3D object detection is challenging due to the lack of accurate depth information.Some methods estimate the pixel-wise depth maps from off-the-shelf depth estimators and then use them as an additional input t... Monocular 3D object detection is challenging due to the lack of accurate depth information.Some methods estimate the pixel-wise depth maps from off-the-shelf depth estimators and then use them as an additional input to augment the RGB images.Depth-based methods attempt to convert estimated depth maps to pseudo-LiDAR and then use LiDAR-based object detectors or focus on the perspective of image and depth fusion learning.However,they demonstrate limited performance and efficiency as a result of depth inaccuracy and complex fusion mode with convolutions.Different from these approaches,our proposed depth-guided vision transformer with a normalizing flows(NF-DVT)network uses normalizing flows to build priors in depth maps to achieve more accurate depth information.Then we develop a novel Swin-Transformer-based backbone with a fusion module to process RGB image patches and depth map patches with two separate branches and fuse them using cross-attention to exchange information with each other.Furthermore,with the help of pixel-wise relative depth values in depth maps,we develop new relative position embeddings in the cross-attention mechanism to capture more accurate sequence ordering of input tokens.Our method is the first Swin-Transformer-based backbone architecture for monocular 3D object detection.The experimental results on the KITTI and the challenging Waymo Open datasets show the effectiveness of our proposed method and superior performance over previous counterparts. 展开更多
关键词 monocular 3D object detection normalizing flows Swin Transformer
在线阅读 下载PDF
Orientation Measurement for Objects with Planar Surface Based on Monocular Microscopic Vision 被引量:1
15
作者 Ying Li Xi-Long Liu +1 位作者 De Xu Da-Peng Zhang 《International Journal of Automation and computing》 EI CSCD 2020年第2期247-256,共10页
Orientation measurement of objects is vital in micro assembly.In this paper,we present a novel method based on monocular microscopic vision for 3-D orientation measurement of objects with planar surfaces.The proposed ... Orientation measurement of objects is vital in micro assembly.In this paper,we present a novel method based on monocular microscopic vision for 3-D orientation measurement of objects with planar surfaces.The proposed methods aim to measure the orientation of the object,which does not require calibrating the intrinsic parameters of microscopic camera.In our methods,the orientation of the object is firstly measured with analytical computation based on feature points.The results of the analytical computation are coarse because the information about feature points is not fully used.In order to improve the precision,the orientation measurement is converted into an optimization process base on the relationship between deviations in image space and in Cartesian space under microscopic vision.The results of the analytical computation are used as the initial values of the optimization process.The optimized variables are the three rotational angles of the object and the pixel equivalent coefficient.The objective of the optimization process is to minimize the coordinates differences of the feature points on the object.The precision of the orientation measurement is boosted effectively.Experimental and comparative results validate the effectiveness of the proposed methods. 展开更多
关键词 MICROSCOPIC vision micro assembly ORIENTATION MEASUREMENT optimization analytical computation
原文传递
Monocular vision based navigation method of mobile robot
16
作者 DONG Ji-wen YANG Sen LU Shou-yin 《重庆邮电大学学报(自然科学版)》 北大核心 2009年第2期158-161,共4页
A trajectory tracking method is presented for the visual navigation of the monocular mobile robot.The robot move along line trajectory drawn beforehand,recognized and stop on the stop-sign to finish special task.The r... A trajectory tracking method is presented for the visual navigation of the monocular mobile robot.The robot move along line trajectory drawn beforehand,recognized and stop on the stop-sign to finish special task.The robot uses a forward looking colorful digital camera to capture information in front of the robot,and by the use of HSI model partition the trajectory and the stop-sign out.Then the "sampling estimate" method was used to calculate the navigation parameters.The stop-sign is easily recognized and can identify 256 different signs.Tests indicate that the method can fit large-scale intensity of brightness and has more robustness and better real-time character. 展开更多
关键词 移动机器人 导航方法 单目视觉 HSI模型 跟踪方法 视觉导航 数码相机 导航参数
在线阅读 下载PDF
Total score of the computer vision syndrome questionnaire predicts refractive errors and binocular vision anomalies
17
作者 Mosaad Alhassan Tasneem Samman +5 位作者 Hatoun Badukhen Muhamad Alrashed Balsam Alabdulkader Essam Almutleb Tahani Alqahtani Ali Almustanyir 《International Journal of Ophthalmology(English edition)》 2026年第1期90-96,共7页
AIM:To evaluate the efficacy of the total computer vision syndrome questionnaire(CVS-Q)score as a predictive tool for identifying individuals with symptomatic binocular vision anomalies and refractive errors.METHODS:A... AIM:To evaluate the efficacy of the total computer vision syndrome questionnaire(CVS-Q)score as a predictive tool for identifying individuals with symptomatic binocular vision anomalies and refractive errors.METHODS:A total of 141 healthy computer users underwent comprehensive clinical visual function assessments,including evaluations of refractive errors,accommodation(amplitude of accommodation,positive relative accommodation,negative relative accommodation,accommodative accuracy,and accommodative facility),and vergence(phoria,positive and negative fusional vergence,near point of convergence,and vergence facility).Total CVS-Q scores were recorded to explore potential associations between symptom scores and the aforementioned clinical visual function parameters.RESULTS:The cohort included 54 males(38.3%)with a mean age of 23.9±0.58y and 87 age-matched females(61.7%)with a mean age of 23.9±0.53y.The multiple regression model was statistically significant[R²=0.60,F=13.28,degrees of freedom(DF=17122,P<0.001].This indicates that 60%of the variance in total CVS-Q scores(reflecting reported symptoms)could be explained by four clinical measurements:amplitude of accommodation,positive relative accommodation,exophoria at distance and near,and positive fusional vergence at near.CONCLUSION:The total CVS-Q score is a valid and reliable tool for predicting the presence of various nonstrabismic binocular vision anomalies and refractive errors in symptomatic computer users. 展开更多
关键词 computer vision syndrome refractive errors ACCOMMODATION VERGENCE binocular vision SYMPTOMS
原文传递
卷积神经网络与Vision Transformer在胶质瘤中的研究进展
18
作者 杨浩辉 徐涛 +3 位作者 王伟 安良良 敖用芳 朱家宝 《磁共振成像》 北大核心 2026年第1期168-174,共7页
胶质瘤因高度异质性、强侵袭性及预后差,传统诊疗面临巨大挑战。深度学习技术的引入为其精准诊疗提供了新路径,其中卷积神经网络(convolutional neural network,CNN)与Vision Transformer(ViT)是核心工具。CNN凭借层级化卷积操作在局部... 胶质瘤因高度异质性、强侵袭性及预后差,传统诊疗面临巨大挑战。深度学习技术的引入为其精准诊疗提供了新路径,其中卷积神经网络(convolutional neural network,CNN)与Vision Transformer(ViT)是核心工具。CNN凭借层级化卷积操作在局部特征提取(如肿瘤边缘、纹理细节)上具有天然优势,而ViT基于自注意力机制在全局上下文建模(如肿瘤跨区域异质性、多模态关联)方面表现突出,二者的融合策略通过整合局部精细特征与全局关联信息,在应对胶质瘤边界模糊、跨模态数据异构性等临床难题中展现出显著优势。本文综述了二者在胶质瘤检测与分割、病理分级、分子分型、预后评估等关键临床任务中的研究进展,阐述了原理、单独应用及融合策略。同时,本文也探讨了当前研究中存在的挑战,诸如对数据标注的强依赖性、模型可解释性不足等问题,并展望了未来的发展方向,例如构建轻量化架构、发展自监督学习以及推进多组学融合等前沿,以期为胶质瘤智能诊断提供系统性参考。 展开更多
关键词 胶质瘤 深度学习 卷积神经网络 vision Transformer 磁共振成像
暂未订购
基于条件生成对抗网络和Vision Transformer的胎儿颅脑超声标准切面识别方法
19
作者 李惠莲 林艺榕 +1 位作者 刘中华 柳培忠 《临床超声医学杂志》 2026年第2期164-169,共6页
胎儿颅脑超声检查是产前常规筛查中至关重要的一环,准确识别标准切面对于评估胎儿大脑发育状况具有重要意义。然而,由于超声图像质量差异和切面获取的复杂性,准确识别标准切面具有较大的挑战性。本文提出了一种基于条件对抗生成网络(CG... 胎儿颅脑超声检查是产前常规筛查中至关重要的一环,准确识别标准切面对于评估胎儿大脑发育状况具有重要意义。然而,由于超声图像质量差异和切面获取的复杂性,准确识别标准切面具有较大的挑战性。本文提出了一种基于条件对抗生成网络(CGAN)和Vision Transformer的胎儿颅脑超声标准切面识别方法,利用CGAN对原始数据进行增强,生成额外的标准切面和非标准切面图像,解决数据不足的问题;同时采用YOLOv9模型对超声图像中的颅骨区域进行自动裁剪,去除无关信息,确保模型专注于关键区域。在分类模型中采用Vision Transformer对所有输入图像进行归一化和尺寸调整,使用了数据增强技术如随机水平或垂直翻转、调整图像对比度、中心裁剪和调整图像饱和度等。结果显示,相较于现有最优模型CSwin Transformer的方法,本文提出的方法在胎儿颅脑超声标准切面识别任务中表现出色,其精确率、召回率、F1分数及准确率分别为92.5%、92.3%、92.4%和93.3%。该方法在提升识别精度方面具有显著优势,为临床超声检查提供了有效技术支持。 展开更多
关键词 条件生成对抗网络 vision Transformer 颅脑超声 胎儿 标准切面识别方法
暂未订购
基于Vision Transformer的高炉风口智能监测模型及应用
20
作者 王浩男 韩明博 +1 位作者 但家云 李强 《钢铁研究学报》 北大核心 2026年第1期25-37,共13页
高炉下部风口窥视孔可以实时监测高炉回旋区的燃烧特征与喷煤状态等关键冶炼状态信息,进而判断煤气流分布和炉缸活跃程度等重要参数。为解决风口监测过程中存在的主观性与时滞性问题,本工作基于风口图像非结构大数据与Vision Transforme... 高炉下部风口窥视孔可以实时监测高炉回旋区的燃烧特征与喷煤状态等关键冶炼状态信息,进而判断煤气流分布和炉缸活跃程度等重要参数。为解决风口监测过程中存在的主观性与时滞性问题,本工作基于风口图像非结构大数据与Vision Transformer架构,建立了高炉风口智能监测模型TI-ViT。首先,对采集到的风口图像进行预处理,通过特征辨析与标签标定形成典型炉况数据集;进而,基于Vision Transformer架构构建了TI-ViT风口图像识别模型;最后,对TI-ViT模型进行性能评估,重点探究了模型深度对准确率、参数量、训练时间与运行时间的影响,并与传统卷积神经网络模型进行比较。经验证,TI-ViT模型的准确率达到97.7%,相比基于卷积神经网络的模型提升了9.1%,单张图像的推理时间仅为15.75 ms。将基于本研究模型所开发的“智慧眼”系统应用于现场实践,其识别准确率可达95.2%,表明该系统实现了对高炉风口的实时监测、识别与预警,有助于降低钢铁企业对风口异常状态的监测与诊断成本,为高炉炼铁智能化提供了新的发展方向。 展开更多
关键词 高炉风口 计算机视觉 vision Transformer 图像识别 高炉炼铁
原文传递
上一页 1 2 250 下一页 到第
使用帮助 返回顶部