期刊文献+
共找到426篇文章
< 1 2 22 >
每页显示 20 50 100
Contribution of poses screen preimpregnated(PSP) installed at openings and eaves of dwellings in the reduction of malaria transmission in the commune of agugus in bnin
1
作者 F.Modeste Gouissi Sahidou Salifou +5 位作者 A.Patrick Edorh A.Rufine Sedjame S.G.Augustin Gouissi W.Anges Yadouleton Martin Akogbeto Michel Boko 《Asian Pacific Journal of Tropical Medicine》 SCIE CAS 2013年第1期61-67,共7页
Objective:To evaluate the contribution of poses screen pre-impregnated(PSP) installed at openings and eaves of dwellings in the reduction of malaria transmission in the commune of Aguegues in Benin.Methods:The PSP wer... Objective:To evaluate the contribution of poses screen pre-impregnated(PSP) installed at openings and eaves of dwellings in the reduction of malaria transmission in the commune of Aguegues in Benin.Methods:The PSP were manufactured from preimpregnated Olyset Net.They were installed at windows,eaves and doors of 70 dwellings.320 children aged 6-59 months were treated and 311 children were recruited in the control zone.Variables measured are:plasmodic index(IP),gametoeyte index,parasite density(PD),fever,hemoglobin,anemia. Results:The global IP was 16.62%with PSP and 72.20%without PSP.Gametoeyte index did not differ significantly between the treated zone(27.8) and the control zone(29.1).The total geometric mean of DP was 309 in the treated zone and 600 in the control zone.Hemoglobin level is 8.7 in the control zone and 9.5 in the treated zone.We noted a predominance of anemia in the control zone compared to the treated zone.Conclusions:The PSP have contributed to a significant reduction in morbidity in the commune of Aguegues. 展开更多
关键词 Malaria poses SCREEN pre-impregnated ANOPHELES Hemoglobin
暂未订购
Gaddafi’s Death Poses Challenges
2
《ChinAfrica》 2011年第12期11-11,共1页
The death of Muammar Gaddafi marks a new era for Libya.It also poses a huge challenge for Libyan authorities dealing with tribal conflicts.He Wenping, a researcher with the Institute of West-Asian and African Studies ... The death of Muammar Gaddafi marks a new era for Libya.It also poses a huge challenge for Libyan authorities dealing with tribal conflicts.He Wenping, a researcher with the Institute of West-Asian and African Studies at the Chinese Academy of Social Sciences, believes that Libya is in danger of falling into a period of internal strife and tribal conflict.Her thoughts are as follows: 展开更多
关键词 Gaddafi’s Death poses Challenges
原文传递
Effects of Dressing Poses on Clothing Thermal Insulation 被引量:2
3
作者 李俊 刘岩 张渭源 《Journal of Donghua University(English Edition)》 EI CAS 2004年第4期113-117,共5页
With a thermal manikin, the effects of dressing poses on clothing thermal insulation are studied. It is found that the thermal insulation of still air layer over human body has not been influenced by the dressing pose... With a thermal manikin, the effects of dressing poses on clothing thermal insulation are studied. It is found that the thermal insulation of still air layer over human body has not been influenced by the dressing poses, but the dressing poses have effects on the thermal insulation of clothing system. 展开更多
关键词 CLOTHING thermal insulation thermal manikin dressing pose.
在线阅读 下载PDF
Tracking Human Poses with Head Orientation Estimation 被引量:3
4
作者 TIAN Jinglan WANG Zhengyuan +1 位作者 LI Ling LIU Wanquan 《Instrumentation》 2017年第3期40-46,共7页
Lots of progress has been made recently on 2 D human pose tracking with tracking-by-detection approaches. However,several challenges still remain in this area which is due to self-occlusions and the confusion between ... Lots of progress has been made recently on 2 D human pose tracking with tracking-by-detection approaches. However,several challenges still remain in this area which is due to self-occlusions and the confusion between the left and right limbs during tracking. In this work,a head orientation detection step is introduced into the tracking framework to serve as a complementary tool to assist human pose estimation. With the face orientation determined,the system can decide whether the left or right side of the human body is exactly visible and infer the state of the symmetric counterpart. By granting a higher priority for the completely visible side,the system can avoid double counting to a great extent when inferring body poses. The proposed framework is evaluated on the HumanEva dataset. The results show that it largely reduces the occurrence of double counting and distinguishes the left and right sides consistently. 展开更多
关键词 Human Pose Tracking Head Orientation Tracking by Detection
原文传递
Waves of 3D Marine Structures Slamming at Different Initial Poses in Complex Wind-Wave-Flow Environments
5
作者 ZHU Liang-sheng YU Long-fei 《China Ocean Engineering》 SCIE EI CSCD 2016年第5期772-785,共14页
Aimed at the hydrodynamic response for marine structures slamming into water, based on the mechanism analysis to the slamming process, and by combining 3D N-S equation and k-ε turbulent kinetic equation with structur... Aimed at the hydrodynamic response for marine structures slamming into water, based on the mechanism analysis to the slamming process, and by combining 3D N-S equation and k-ε turbulent kinetic equation with structure fully 6DOF motion equation, a mathematical model for the wind-fluid-solid interaction is established in 3D marine structure slamming wave at free poses and wind-wave-flow complex environments. Compared with the results of physical model test, the numerical results from the slamming wave well correspond with the experimental results. Through the mathematical model, the wave-making issue of 3D marine structure at initial pose falls into water in different complex wind, wave and flow environments is investigated. The research results show that various kinds of natural factors and structure initial poses have different influence on the slamming wave, and there is an obvious rule in this process. 展开更多
关键词 marine structures slamming wave initial pose natural background 6DOF
在线阅读 下载PDF
Robust facial expression recognition system in higher poses
6
作者 Ebenezer Owusu Justice Kwame Appati Percy Okae 《Visual Computing for Industry,Biomedicine,and Art》 EI 2022年第1期159-173,共15页
Facial expression recognition(FER)has numerous applications in computer security,neuroscience,psychology,and engineering.Owing to its non-intrusiveness,it is considered a useful technology for combating crime.However,... Facial expression recognition(FER)has numerous applications in computer security,neuroscience,psychology,and engineering.Owing to its non-intrusiveness,it is considered a useful technology for combating crime.However,FER is plagued with several challenges,the most serious of which is its poor prediction accuracy in severe head poses.The aim of this study,therefore,is to improve the recognition accuracy in severe head poses by proposing a robust 3D head-tracking algorithm based on an ellipsoidal model,advanced ensemble of AdaBoost,and saturated vector machine(SVM).The FER features are tracked from one frame to the next using the ellipsoidal tracking model,and the visible expressive facial key points are extracted using Gabor filters.The ensemble algorithm(Ada-AdaSVM)is then used for feature selection and classification.The proposed technique is evaluated using the Bosphorus,BU-3DFE,MMI,CK^(+),and BP4D-Spontaneous facial expression databases.The overall performance is outstanding. 展开更多
关键词 Facial expressions Three-dimensional head pose Ellipsoidal model Gabor filters Ada-AdaSVM
在线阅读 下载PDF
Visualizing game dynamics at a specific time:Influence of the players’poses for tactical analyses in padel
7
作者 Mohammadreza Javadiha Carlos Andujar +3 位作者 Enrique Lacasa Gota Shirato Natalia Andrienko Gennady Andrienko 《Visual Informatics》 2025年第3期56-70,共15页
benefits from positional diagrams showing where the players are.These diagrams often show the layout of the players through simple symbols,which provide no information about their poses.This paper investigates if the ... benefits from positional diagrams showing where the players are.These diagrams often show the layout of the players through simple symbols,which provide no information about their poses.This paper investigates if the visualization of player poses is beneficial for tactical understanding of positional diagrams in padel.We propose a realistic,cartoon-like representation of the players and discuss its integration into a typical positional diagram.To overcome the cost of generating player representations depicting their pose,we propose a method to generate such representations from minimal user input.We conducted a user study to evaluate the effectiveness of our pose-aware diagrams.The tasks for the study were designed to encompass the main in-game scenarios in padel,which include the ballholder at the net with opponents defending,the reverse situation,and transitions between these two states.We found that our representation is preferred over a symbolic one that only indicates player orientation.The proposed method enables coaches to produce such representations within a matter of seconds,thereby significantly facilitating the creation of detailed and easily analyzable depictions of game situations. 展开更多
关键词 Padel Positional diagram Tactical analysis Player pose
原文传递
Eye location under different eye poses,scales,and illuminations 被引量:5
8
作者 袁景和 《Chinese Optics Letters》 SCIE EI CAS CSCD 2010年第1期59-62,共4页
Robust non-intrusive eye location plays an important role in vision-based man-mechine interaction. A modified Hausdorff distance based measure to localize the eyes is proposed, which could tolerate various changes in ... Robust non-intrusive eye location plays an important role in vision-based man-mechine interaction. A modified Hausdorff distance based measure to localize the eyes is proposed, which could tolerate various changes in eye pose, shape, and scale. To eliminate the effects of the illumination variations, an 8- neighbour-based transformation of the gray images is proposed. The transformed image is less sensitive to illumination changes while preserves the appearance information of eyes. All the localized candidates of eyes are identified by back-propagation neural networks. Experiments demonstrate that the robust method for eye location is able to localize eyes with different eye sizes, shapes, and poses under different illuminations. 展开更多
关键词 MHD Eye location under different eye poses scales and illuminations BPNN this FIGURE
原文传递
基于人体姿态估计的跌倒行为检测方法研究
9
作者 崔泽男 张俊林 王泽强 《工业控制计算机》 2026年第3期54-56,共3页
基于深度学习的方法在提高跌倒检测准确性和鲁棒性方面取得了显著进展,然而仍存在人体关键点检测困难、复杂场景下实时姿态追踪精度偏低和速度较慢等问题。针对这些问题,构建了一个人体姿态估计和跌倒检测模型。提出采用模型YOLOv8对运... 基于深度学习的方法在提高跌倒检测准确性和鲁棒性方面取得了显著进展,然而仍存在人体关键点检测困难、复杂场景下实时姿态追踪精度偏低和速度较慢等问题。针对这些问题,构建了一个人体姿态估计和跌倒检测模型。提出采用模型YOLOv8对运动目标进行深度学习;采用MobileNet网络结构对原有的DarkNet网络结构进行替换;采用MergeNMS,有效避免多次计算同一边界框之间的重叠度,提高网络处理速度。实验表明,所提算法对复杂场景下实时的人体姿态估计和跌倒检测具有较好的效果,模型运行具有较快速度。 展开更多
关键词 人体姿态估计 YOLOv8 Alpha Pose MobileNet Merge-NMS
在线阅读 下载PDF
UNO:Unified Self-Supervised Monocular Odometry for Platform-Agnostic Deployment
10
作者 Wentao Zhao Yihe Niu +5 位作者 Yanbo Wang Tianchen Deng Shenghai Yuan Zhenli Wang Rui Guo Jingchuan Wang 《CAAI Transactions on Intelligence Technology》 2026年第1期205-222,共18页
This work presents UNO,a unified monocular visual odometry framework that enables robust and adaptable pose estimation across diverse environments,platforms and motion patterns.Unlike traditional methods that rely on ... This work presents UNO,a unified monocular visual odometry framework that enables robust and adaptable pose estimation across diverse environments,platforms and motion patterns.Unlike traditional methods that rely on deploymentspecific tuning or predefined motion priors,our approach generalises effectively across a wide range of real-world scenarios,including autonomous vehicles,aerial drones,mobile robots and handheld devices.To this end,we introduce a mixture-of-experts strategy for local state estimation,with several specialised decoders that each handle a distinct class of ego-motion patterns.Moreover,we introduce a fully differentiable Gumbel-softmax module that constructs a robust interframe correlation graph,selects the optimal expert decoder and prunes erroneous estimates.These cues are then fed into a unified back-end that combines pretrained scale-independent depth priors with a lightweight bundling adjustment to enforce geometric consistency.We extensively evaluate our method on three major benchmark datasets:KITTI(outdoor/autonomous driving),EuRoC-MAV(indoor/aerial drones)and TUM-RGBD(indoor/handheld),demonstrating state-of-theart performance. 展开更多
关键词 computer vision pose estimation robotics unsupervised learning
在线阅读 下载PDF
An Attention-Based 6D Pose Estimation Network for Weakly Textured Industrial Parts
11
作者 Song Xu Liang Xuan +1 位作者 Yifeng Li Qiang Zhang 《Computers, Materials & Continua》 2026年第2期2148-2166,共19页
The 6D pose estimation of objects is of great significance for the intelligent assembly and sorting of industrial parts.In the industrial robot production scenarios,the 6D pose estimation of industrial parts mainly fa... The 6D pose estimation of objects is of great significance for the intelligent assembly and sorting of industrial parts.In the industrial robot production scenarios,the 6D pose estimation of industrial parts mainly faces two challenges:one is the loss of information and interference caused by occlusion and stacking in the sorting scenario,the other is the difficulty of feature extraction due to the weak texture of industrial parts.To address the above problems,this paper proposes an attention-based pixel-level voting network for 6D pose estimation of weakly textured industrial parts,namely CB-PVNet.On the one hand,the voting scheme can predict the keypoints of affected pixels,which improves the accuracy of keypoint localization even in scenarios such as weak texture and partial occlusion.On the other hand,the attention mechanism can extract interesting features of the object while suppressing useless features of surroundings.Extensive comparative experiments were conducted on both public datasets(including LINEMOD,Occlusion LINEMOD and T-LESS datasets)and self-made datasets.The experimental results indicate that the proposed network CB-PVNet can achieve accuracy of ADD(-s)comparable to state-of-the-art using only RGB images while ensuring real-time performance.Additionally,we also conducted robot grasping experiments in the real world.The balance between accuracy and computational efficiency makes the method well-suited for applications in industrial automation. 展开更多
关键词 Industrial robots pose estimation industrial parts attention mechanism weak texture
在线阅读 下载PDF
Towards Real-Time Multi-Person Pose Estimation via Feature Selection and Sharpening Mechanisms
12
作者 Chengang Dong Yongkang Ding Jianwei Hu 《Computer Modeling in Engineering & Sciences》 2026年第3期888-908,共21页
Real-time multi-person pose estimation(MPE)built upon neural network architectures aims to simultaneously detect multiple human instances and regress joint coordinates in dynamic scenes.However,due to factors such as ... Real-time multi-person pose estimation(MPE)built upon neural network architectures aims to simultaneously detect multiple human instances and regress joint coordinates in dynamic scenes.However,due to factors such as high model complexity and limited expression of keypoint information,both the efficiency and accuracy of real-time MPE remain to be improved.To mitigate the adverse impacts caused by the aforementioned issues,this work develops FSEM-Pose,a real-time MPE model rooted in the YOLOv10 framework.In detail,first,FSEM-Pose upgrades the backbone module of the baseline network by introducing the Feature Shuffling-Convolution(FS-Conv),which effectively reduces the backbone size while maximizing the retention of spatial information from the input image.Second,FSEM-Pose incorporates a Feature Saliency Enhancement Module(FSEM)to strengthen the feature encoding of human keypoints,thereby improving the accuracy of pose estimation.Finally,FSEM-Pose further enhances inference efficiency via a lightweight optimization of the head using shared convolutional layers.Our method achieves competitive results across multiple accuracy and efficiency metrics on the MS COCO 2017 and CrowdPose datasets.While being lightweight in design,it improves average precision(AP)by 2.1%and 2.5%,respectively. 展开更多
关键词 Pose estimation feature sharpening LIGHTWEIGHT YOLOv10
在线阅读 下载PDF
A pull-up scoring method based on skeleton point recognition and fuzzy comprehensive evaluation
13
作者 Jiacheng Fu Peng Yu +1 位作者 Yinghui Wang Jin Guo 《Control Theory and Technology》 2026年第1期156-169,共14页
Pull-ups are a very common fitness exercise that can be seen in many gyms.For athletes,it is very important to perform pull-ups correctly and scientifically.The pull-up scoring method designed in this paper can score ... Pull-ups are a very common fitness exercise that can be seen in many gyms.For athletes,it is very important to perform pull-ups correctly and scientifically.The pull-up scoring method designed in this paper can score the quality of pull-up movement scientifically and objectively,and provide guidance to help athletes better complete the pull-up movement.In this method,the OpenPose algorithm is used to identify the coordinates of skeleton points,and then the coordinate data are processed by a Kalman filter to obtain coordinates closer to the true values.Finally,the filtered data are input into the scoring algorithm designed based on the fuzzy comprehensive evaluation algorithm,and the results of the pull-up quality score and the corresponding guidance are obtained. 展开更多
关键词 Pull-ups Pose estimation Kalman filtering Fuzzy comprehensive evaluation
原文传递
Application of the Improved PF-Flow-Style-VTON in Virtual Try-On
14
作者 TIAN Jiajia HUANG Rong +1 位作者 DONG Aihua WANG Zhijie 《Journal of Donghua University(English Edition)》 2026年第1期104-117,共14页
During the image generation phase,the parserfree Flow-Style-VTON model(PF-Flow-Style-VTON),which utilizes distilled appearance flows,faces two main challenges:blurring,deformation,occlusion,or loss of the arm or palm ... During the image generation phase,the parserfree Flow-Style-VTON model(PF-Flow-Style-VTON),which utilizes distilled appearance flows,faces two main challenges:blurring,deformation,occlusion,or loss of the arm or palm regions in the generated image when these regions of the person occlude the garment;blurring and deformation in the generated image when the person performs large pose movements and the target garment is complex with detailed patterns.To solve these two problems,an improved virtual try-on network model,denoted as IPF-Flow-Style-VTON,is proposed.Firstly,a target warped garment mask refinement module(M-RM)is introduced to refine the warped garment mask and remove erroneous information in the arm and palm regions,thereby improving the quality of subsequent image generation.Secondly,an improved global attention module(GAM)is integrated into the original image generation network,enhancing the ResUNet’s understanding of global context and optimizing the fusion of local features and global information,thereby further improving image generation quality.Finally,the UniPose model is used to provide the pose keypoint information of the target person image,guiding the task execution during the image generation phase.Experiments conducted on the VITON dataset show that the proposed method outperforms the original method,Flow-Style-VTON,by 5.4%,0.3%,6.7%,and 2.2%in Frchet inception distance(FID),structural similarity index measure(SSIM),learned perceptual image patch similarity(LPIPS),and peak signal-to-noise ratio(PSNR),respectively.Overall,the proposed method effectively improves upon the shortcomings of the original network and achieves better visual results. 展开更多
关键词 virtual try-on image generation network pose keypoint deep learning
在线阅读 下载PDF
针对AI防呆的手部姿态估计算法研究
15
作者 孙世丹 《电子质量》 2026年第1期112-117,共6页
为满足实际生产中对于人工智能(AI)智能防呆的需求,对手部姿态估计算法进行了深入研究。通过对多种算法进行调研与实验对比,最终构建了一套基于YOLOv8Pose的手部姿态估计系统。首先,对COCO-WholeBody数据集中的手部训练数据进行整理,筛... 为满足实际生产中对于人工智能(AI)智能防呆的需求,对手部姿态估计算法进行了深入研究。通过对多种算法进行调研与实验对比,最终构建了一套基于YOLOv8Pose的手部姿态估计系统。首先,对COCO-WholeBody数据集中的手部训练数据进行整理,筛选并剔除错误标注,有效提升了数据质量与后续模型训练的准确率;其次,应用GroupNorm剪枝算法对YOLOv8Pose模型进行压缩,在较好地保持模型检测精度的同时,显著降低了其资源消耗,使其能够适配资源受限的边缘计算设备;最后,将优化后的模型成功部署在瑞芯微RK3588硬件平台,验证了所提算法在真实工业场景中的可行性与有效性。本研究为工业生产线上的实时、精准操作监控提供了一种高效的AI解决方案,对提升生产自动化水平和质量控制能力具有积极意义。 展开更多
关键词 姿态估计 YOLOv8Pose 边缘部署 防呆设计
在线阅读 下载PDF
An intelligent segmentation method for leakage points in central serous chorioretinopathy based on fluorescein angiography images
16
作者 Jian-Guo Xu Yong-Chi Liu +4 位作者 Fen Zhou Jian-Xin Shen Zhi-Peng Yan Xin-Ya Hu Wei-Hua Yang 《International Journal of Ophthalmology(English edition)》 2026年第3期421-433,共13页
AIM:To construct an intelligent segmentation scheme for precise localization of central serous chorioretinopathy(CSC)leakage points,thereby enabling ophthalmologists to deliver accurate laser treatment without navigat... AIM:To construct an intelligent segmentation scheme for precise localization of central serous chorioretinopathy(CSC)leakage points,thereby enabling ophthalmologists to deliver accurate laser treatment without navigational laser equipment.METHODS:A dataset with dual labels(point-level and pixel-level)was first established based on fundus fluorescein angiography(FFA)images of CSC and subsequently divided into training(102 images),validation(40 images),and test(40 images)datasets.An intelligent segmentation method was then developed,based on the You Only Look Once version 8 Pose Estimation(YOLOv8-Pose)model and segment anything model(SAM),to segment CSC leakage points.Next,the YOLOv8-Pose model was trained for 200 epochs,and the best-performing model was selected to form the optimal combination with SAM.Additionally,the classic five types of U-Net series models[i.e.,U-Net,recurrent residual U-Net(R2U-Net),attention U-Net(AttU-Net),recurrent residual attention U-Net(R2AttUNet),and nested U-Net(UNet^(++))]were initialized with three random seeds and trained for 200 epochs,resulting in a total of 15 baseline models for comparison.Finally,based on the metrics including Dice similarity coefficient(DICE),intersection over union(IoU),precision,recall,precisionrecall(PR)curve,and receiver operating characteristic(ROC)curve,the proposed method was compared with baseline models through quantitative and qualitative experiments for leakage point segmentation,thereby demonstrating its effectiveness.RESULTS:With the increase of training epochs,the mAP50-95,Recall,and precision of the YOLOv8-Pose model showed a significant increase and tended to stabilize,and it achieved a preliminary localization success rate of 90%(i.e.,36 images)for CSC leakage points in 40 test images.Using manually expert-annotated pixel-level labels as the ground truth,the proposed method achieved outcomes with a DICE of 57.13%,an IoU of 45.31%,a precision of 45.91%,a recall of 93.57%,an area under the PR curve(AUC-PR)of 0.78 and an area under the ROC curve(AUC-ROC)of 0.97,which enables more accurate segmentation of CSC leakage points.CONCLUSION:By combining the precise localization capability of the YOLOv8-Pose model with the robust and flexible segmentation ability of SAM,the proposed method not only demonstrates the effectiveness of the YOLOv8-Pose model in detecting keypoint coordinates of CSC leakage points from the perspective of application innovation but also establishes a novel approach for accurate segmentation of CSC leakage points through the“detect-then-segment”strategy,thereby providing a potential auxiliary means for the automatic and precise realtime localization of leakage points during traditional laser photocoagulation for CSC. 展开更多
关键词 You Only Look Once version 8 Pose Estimation segment anything model central serous chorioretinopathy leakage point segmentation
原文传递
Transformer-Driven Multimodal for Human-Object Detection and Recognition for Intelligent Robotic Surveillance
17
作者 Aman Aman Ullah Yanfeng Wu +3 位作者 Shaheryar Najam Nouf Abdullah Almujally Ahmad Jalal Hui Liu 《Computers, Materials & Continua》 2026年第4期1364-1383,共20页
Human object detection and recognition is essential for elderly monitoring and assisted living however,models relying solely on pose or scene context often struggle in cluttered or visually ambiguous settings.To addre... Human object detection and recognition is essential for elderly monitoring and assisted living however,models relying solely on pose or scene context often struggle in cluttered or visually ambiguous settings.To address this,we present SCENET-3D,a transformer-drivenmultimodal framework that unifies human-centric skeleton features with scene-object semantics for intelligent robotic vision through a three-stage pipeline.In the first stage,scene analysis,rich geometric and texture descriptors are extracted from RGB frames,including surface-normal histograms,angles between neighboring normals,Zernike moments,directional standard deviation,and Gabor-filter responses.In the second stage,scene-object analysis,non-human objects are segmented and represented using local feature descriptors and complementary surface-normal information.In the third stage,human-pose estimation,silhouettes are processed through an enhanced MoveNet to obtain 2D anatomical keypoints,which are fused with depth information and converted into RGB-based point clouds to construct pseudo-3D skeletons.Features from all three stages are fused and fed in a transformer encoder with multi-head attention to resolve visually similar activities.Experiments on UCLA(95.8%),ETRI-Activity3D(89.4%),andCAD-120(91.2%)demonstrate that combining pseudo-3D skeletonswith rich scene-object fusion significantly improves generalizable activity recognition,enabling safer elderly care,natural human–robot interaction,and robust context-aware robotic perception in real-world environments. 展开更多
关键词 Human object detection elderly care RGB-based pose estimation scene context analysis object recognition Gabor features point cloud reconstruction
在线阅读 下载PDF
Robust Human Pose Estimation and Action Recognition Utilizing Feature Extraction
18
作者 Sheng Luo Rashid Abbasi +7 位作者 Hao Wang Jinghua Xu Dongyang Lyu Aaron Zhang Farhan Amin Isabel de la Torre Gerardo Mendez Mezquita Henry Fabian Gongora 《Computer Modeling in Engineering & Sciences》 2026年第3期870-887,共18页
Human pose estimation is crucial across diverse applications,from healthcare to human-computer interaction.Integrating inertial measurement units(IMUs)with monocular vision methods holds great potential for leveraging... Human pose estimation is crucial across diverse applications,from healthcare to human-computer interaction.Integrating inertial measurement units(IMUs)with monocular vision methods holds great potential for leveraging complementary modalities;however,existing approaches are often limited by IMU drift,noise,and underutilization of visual information.To address these limitations,we propose a novel dual-stream feature extraction framework that effectively combines temporal IMU data and single-view image features for improved pose estimation.Short-term dependencies in IMU sequences are captured with convolutional layers,while a Transformerbased architecture models long-range temporal dynamics.To mitigate IMU drift and inter-sensor inconsistencies,a complementary filtering module is introduced alongside a cross-channel interaction mechanism.Features from the IMU and image streams are then fused via a dedicated fusion module and further refined utilizing a high-precision regression head for accurate pose prediction.Experimental results on benchmark datasets demonstrate that our method significantly outperforms existing techniques in terms of estimation,accuracy,and robustness,validating the effectiveness of our dual-stream architecture. 展开更多
关键词 Human pose estimation dual-stream network inertial measurement units(IMU)
在线阅读 下载PDF
Hybrid Directed Hypergraph Learning and Forecasting of Skeleton-Based Human Poses
19
作者 Qiongjie Cui Zongyuan Ding Fuhua Chen 《Cyborg and Bionic Systems》 2024年第1期665-675,共11页
Forecasting 3-dimensional skeleton-based human poses from the historical sequence is a classic task,which shows enormous potential in robotics,computer vision,and graphics.Currently,the state-of-theart methods resort ... Forecasting 3-dimensional skeleton-based human poses from the historical sequence is a classic task,which shows enormous potential in robotics,computer vision,and graphics.Currently,the state-of-theart methods resort to graph convolutional networks(GCNs)to access the relationships of human joint pairs to formulate this problem.However,human action involves complex interactions among multiple joints,which presents a higher-order correlation overstepping the pairwise(2-order)connection of GCNs.Moreover,joints are typically activated by the parent joint,rather than driving their parent joints,whereas in existing methods,this specific direction of information transmission is ignored.In this work,we propose a novel hybrid directed hypergraph convolution network(H-DHGCN)to model the high-order relationships of the human skeleton with directionality.Specifically,our H-DHGCN mainly involves 2 core components.One is the static directed hypergraph,which is pre-defined according to the human body structure,to effectively leverage the natural relations of human joints.The second is dynamic directed hypergraph(D-DHG).D-DHG is learnable and can be constructed adaptively,to learn the unique characteristics of the motion sequence.In contrast to the typical GCNs,our method brings a richer and more refined topological representation of skeleton data.On several large-scale benchmarks,experimental results show that the proposed model consistently surpasses the latest techniques. 展开更多
关键词 high order relationships forecasting graph convolutional networks gcns topological representation relationships human joint pairs skeleton based human poses hybrid directed hypergraph convolutional networks
原文传递
煤矿井下人员危险行为检测方法 被引量:1
20
作者 张旭辉 余恒翰 +6 位作者 杜昱阳 杨文娟 赵亦辉 万继成 王彦群 赵典 汤杜炜 《工矿自动化》 北大核心 2025年第5期64-71,共8页
井下人员危险行为检测是煤矿安全防控的关键环节。现有目标检测技术用于人员危险行为检测时,受煤矿井下复杂工况、设备遮挡、多目标密集、粉尘干扰等因素影响,存在特征提取不准确等问题,且未明确界定人员危险行为。以YOLOv8−pose模型为... 井下人员危险行为检测是煤矿安全防控的关键环节。现有目标检测技术用于人员危险行为检测时,受煤矿井下复杂工况、设备遮挡、多目标密集、粉尘干扰等因素影响,存在特征提取不准确等问题,且未明确界定人员危险行为。以YOLOv8−pose模型为基准架构,采用DCNv4和PConv模块融合的DCNv4−PConv混合模块代替标准卷积,添加混合局部通道注意力(MLCA)模块,并采用感受野注意力卷积(RFAConv)模块替换检测头,构建了PMR−YOLO模型,用于检测井下监控图像中人体关键点,提升检测精度和运算速度。在此基础上设计了人员行为识别算法,将井下人员行为划分为9种类别,基于YOLOv8−pose模型检测的人体关键点形成人体骨架,判断人员行为类别型。采用DsLMF+数据集进行消融实验、对比实验和人员行为识别实验,结果表明:DCNv4−PConv混合模块、MLCA模块、RFAConv模块的引入有效提高了YOLOv8−pose模型的精确度、召回率和平均精度均值(mAP);PMR−YOLO模型对人体关键点特征提取的精确度、召回率和mAP分别为0.893,0.841,0.852,较YOLOv8−pose模型分别提高了6.9%,14.4%,10.5%;基于PMR−YOLO模型的检测方法可有效识别井下人员9种行为类别,识别准确率均不低于96%。 展开更多
关键词 视频识别 危险行为检测 人员行为识别 YOLOv8−pose模型 人体关键点检测
在线阅读 下载PDF
上一页 1 2 22 下一页 到第
使用帮助 返回顶部