期刊文献+
共找到26篇文章
< 1 2 >
每页显示 20 50 100
Hourglass-GCN for 3D Human Pose Estimation Using Skeleton Structure and View Correlation
1
作者 Ange Chen Chengdong Wu Chuanjiang Leng 《Computers, Materials & Continua》 SCIE EI 2025年第1期173-191,共19页
Previous multi-view 3D human pose estimation methods neither correlate different human joints in each view nor model learnable correlations between the same joints in different views explicitly,meaning that skeleton s... Previous multi-view 3D human pose estimation methods neither correlate different human joints in each view nor model learnable correlations between the same joints in different views explicitly,meaning that skeleton structure information is not utilized and multi-view pose information is not completely fused.Moreover,existing graph convolutional operations do not consider the specificity of different joints and different views of pose information when processing skeleton graphs,making the correlation weights between nodes in the graph and their neighborhood nodes shared.Existing Graph Convolutional Networks(GCNs)cannot extract global and deeplevel skeleton structure information and view correlations efficiently.To solve these problems,pre-estimated multiview 2D poses are designed as a multi-view skeleton graph to fuse skeleton priors and view correlations explicitly to process occlusion problem,with the skeleton-edge and symmetry-edge representing the structure correlations between adjacent joints in each viewof skeleton graph and the view-edge representing the view correlations between the same joints in different views.To make graph convolution operation mine elaborate and sufficient skeleton structure information and view correlations,different correlation weights are assigned to different categories of neighborhood nodes and further assigned to each node in the graph.Based on the graph convolution operation proposed above,a Residual Graph Convolution(RGC)module is designed as the basic module to be combined with the simplified Hourglass architecture to construct the Hourglass-GCN as our 3D pose estimation network.Hourglass-GCNwith a symmetrical and concise architecture processes three scales ofmulti-viewskeleton graphs to extract local-to-global scale and shallow-to-deep level skeleton features efficiently.Experimental results on common large 3D pose dataset Human3.6M and MPI-INF-3DHP show that Hourglass-GCN outperforms some excellent methods in 3D pose estimation accuracy. 展开更多
关键词 3D human pose estimation multi-view skeleton graph elaborate graph convolution operation Hourglass-GCN
在线阅读 下载PDF
VMHPE:Human Pose Estimation for Virtual Maintenance Tasks
2
作者 Shuo Zhang Hanwu He Yueming Wu 《Computers, Materials & Continua》 2025年第10期801-826,共26页
Virtual maintenance,as an important means of industrial training and education,places strict requirements on the accuracy of participant pose perception and assessment of motion standardization.However,existing resear... Virtual maintenance,as an important means of industrial training and education,places strict requirements on the accuracy of participant pose perception and assessment of motion standardization.However,existing research mainly focuses on human pose estimation in general scenarios,lacking specialized solutions for maintenance scenarios.This paper proposes a virtual maintenance human pose estimation method based on multi-scale feature enhancement(VMHPE),which integrates adaptive input feature enhancement,multi-scale feature correction for improved expression of fine movements and complex poses,and multi-scale feature fusion to enhance keypoint localization accuracy.Meanwhile,this study constructs the first virtual maintenance-specific human keypoint dataset(VMHKP),which records standard action sequences of professional maintenance personnel in five typical maintenance tasks and provides a reliable benchmark for evaluating operator motion standardization.The dataset is publicly available at.Using high-precision keypoint prediction results,an action assessment system utilizing topological structure similarity was established.Experiments show that our method achieves significant performance improvements:average precision(AP)reaches 94.4%,an increase of 2.3 percentage points over baseline methods;average recall(AR)reaches 95.6%,an increase of 1.3 percentage points.This research establishes a scientific four-level evaluation standard based on comparative motion analysis and provides a reliable solution for standardizing industrial maintenance training. 展开更多
关键词 Virtual maintenance human pose estimation multi-scale feature fusion
在线阅读 下载PDF
Lightweight Multi-Resolution Network for Human Pose Estimation
3
作者 Pengxin Li Rong Wang +2 位作者 Wenjing Zhang Yinuo Liu Chenyue Xu 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第3期2239-2255,共17页
Human pose estimation aims to localize the body joints from image or video data.With the development of deeplearning,pose estimation has become a hot research topic in the field of computer vision.In recent years,huma... Human pose estimation aims to localize the body joints from image or video data.With the development of deeplearning,pose estimation has become a hot research topic in the field of computer vision.In recent years,humanpose estimation has achieved great success in multiple fields such as animation and sports.However,to obtainaccurate positioning results,existing methods may suffer from large model sizes,a high number of parameters,and increased complexity,leading to high computing costs.In this paper,we propose a new lightweight featureencoder to construct a high-resolution network that reduces the number of parameters and lowers the computingcost.We also introduced a semantic enhancement module that improves global feature extraction and networkperformance by combining channel and spatial dimensions.Furthermore,we propose a dense connected spatialpyramid pooling module to compensate for the decrease in image resolution and information loss in the network.Finally,ourmethod effectively reduces the number of parameters and complexitywhile ensuring high performance.Extensive experiments show that our method achieves a competitive performance while dramatically reducing thenumber of parameters,and operational complexity.Specifically,our method can obtain 89.9%AP score on MPIIVAL,while the number of parameters and the complexity of operations were reduced by 41%and 36%,respectively. 展开更多
关键词 LIGHTWEIGHT human pose estimation keypoint detection high resolution network
在线阅读 下载PDF
DAUNet: Detail-Aware U-Shaped Network for 2D Human Pose Estimation
4
作者 Xi Li Yuxin Li +2 位作者 Zhenhua Xiao Zhenghua Huang Lianying Zou 《Computers, Materials & Continua》 SCIE EI 2024年第11期3325-3349,共25页
Human pose estimation is a critical research area in the field of computer vision,playing a significant role in applications such as human-computer interaction,behavior analysis,and action recognition.In this paper,we... Human pose estimation is a critical research area in the field of computer vision,playing a significant role in applications such as human-computer interaction,behavior analysis,and action recognition.In this paper,we propose a U-shaped keypoint detection network(DAUNet)based on an improved ResNet subsampling structure and spatial grouping mechanism.This network addresses key challenges in traditional methods,such as information loss,large network redundancy,and insufficient sensitivity to low-resolution features.DAUNet is composed of three main components.First,we introduce an improved BottleNeck block that employs partial convolution and strip pooling to reduce computational load and mitigate feature loss.Second,after upsampling,the network eliminates redundant features,improving the overall efficiency.Finally,a lightweight spatial grouping attention mechanism is applied to enhance low-resolution semantic features within the feature map,allowing for better restoration of the original image size and higher accuracy.Experimental results demonstrate that DAUNet achieves superior accuracy compared to most existing keypoint detection models,with a mean PCKh@0.5 score of 91.6%on the MPII dataset and an AP of 76.1%on the COCO dataset.Moreover,real-world experiments further validate the robustness and generalizability of DAUNet for detecting human bodies in unknown environments,highlighting its potential for broader applications. 展开更多
关键词 human pose estimation keypoint detection U-shaped network architecture spatial grouping mechanism
在线阅读 下载PDF
Movement Function Assessment Based on Human Pose Estimation from Multi-View
5
作者 Lingling Chen Tong Liu +1 位作者 Zhuo Gong Ding Wang 《Computer Systems Science & Engineering》 2024年第2期321-339,共19页
Human pose estimation is a basic and critical task in the field of computer vision that involves determining the position(or spatial coordinates)of the joints of the human body in a given image or video.It is widely u... Human pose estimation is a basic and critical task in the field of computer vision that involves determining the position(or spatial coordinates)of the joints of the human body in a given image or video.It is widely used in motion analysis,medical evaluation,and behavior monitoring.In this paper,the authors propose a method for multi-view human pose estimation.Two image sensors were placed orthogonally with respect to each other to capture the pose of the subject as they moved,and this yielded accurate and comprehensive results of three-dimensional(3D)motion reconstruction that helped capture their multi-directional poses.Following this,we propose a method based on 3D pose estimation to assess the similarity of the features of motion of patients with motor dysfunction by comparing differences between their range of motion and that of normal subjects.We converted these differences into Fugl–Meyer assessment(FMA)scores in order to quantify them.Finally,we implemented the proposed method in the Unity framework,and built a Virtual Reality platform that provides users with human–computer interaction to make the task more enjoyable for them and ensure their active participation in the assessment process.The goal is to provide a suitable means of assessing movement disorders without requiring the immediate supervision of a physician. 展开更多
关键词 human pose estimation 3D pose reconstruction assessment of movement function plane of features of human motion
在线阅读 下载PDF
Squirrel Search Optimization with Deep Convolutional Neural Network for Human Pose Estimation 被引量:3
6
作者 K.Ishwarya A.Alice Nithya 《Computers, Materials & Continua》 SCIE EI 2023年第3期6081-6099,共19页
Human pose estimation(HPE)is a procedure for determining the structure of the body pose and it is considered a challenging issue in the computer vision(CV)communities.HPE finds its applications in several fields namel... Human pose estimation(HPE)is a procedure for determining the structure of the body pose and it is considered a challenging issue in the computer vision(CV)communities.HPE finds its applications in several fields namely activity recognition and human-computer interface.Despite the benefits of HPE,it is still a challenging process due to the variations in visual appearances,lighting,occlusions,dimensionality,etc.To resolve these issues,this paper presents a squirrel search optimization with a deep convolutional neural network for HPE(SSDCNN-HPE)technique.The major intention of the SSDCNN-HPE technique is to identify the human pose accurately and efficiently.Primarily,the video frame conversion process is performed and pre-processing takes place via bilateral filtering-based noise removal process.Then,the EfficientNet model is applied to identify the body points of a person with no problem constraints.Besides,the hyperparameter tuning of the EfficientNet model takes place by the use of the squirrel search algorithm(SSA).In the final stage,the multiclass support vector machine(M-SVM)technique was utilized for the identification and classification of human poses.The design of bilateral filtering followed by SSA based EfficientNetmodel for HPE depicts the novelty of the work.To demonstrate the enhanced outcomes of the SSDCNN-HPE approach,a series of simulations are executed.The experimental results reported the betterment of the SSDCNN-HPE system over the recent existing techniques in terms of different measures. 展开更多
关键词 Parameter tuning human pose estimation deep learning squirrel search algorithm activity recognition
在线阅读 下载PDF
Human Pose Estimation and Object Interaction for Sports Behaviour 被引量:3
7
作者 Ayesha Arif Yazeed Yasin Ghadi +3 位作者 Mohammed Alarfaj Ahmad Jalal Shaharyar Kamal Dong-Seong Kim 《Computers, Materials & Continua》 SCIE EI 2022年第7期1-18,共18页
In the new era of technology,daily human activities are becoming more challenging in terms of monitoring complex scenes and backgrounds.To understand the scenes and activities from human life logs,human-object interac... In the new era of technology,daily human activities are becoming more challenging in terms of monitoring complex scenes and backgrounds.To understand the scenes and activities from human life logs,human-object interaction(HOI)is important in terms of visual relationship detection and human pose estimation.Activities understanding and interaction recognition between human and object along with the pose estimation and interaction modeling have been explained.Some existing algorithms and feature extraction procedures are complicated including accurate detection of rare human postures,occluded regions,and unsatisfactory detection of objects,especially small-sized objects.The existing HOI detection techniques are instancecentric(object-based)where interaction is predicted between all the pairs.Such estimation depends on appearance features and spatial information.Therefore,we propose a novel approach to demonstrate that the appearance features alone are not sufficient to predict the HOI.Furthermore,we detect the human body parts by using the Gaussian Matric Model(GMM)followed by object detection using YOLO.We predict the interaction points which directly classify the interaction and pair them with densely predicted HOI vectors by using the interaction algorithm.The interactions are linked with the human and object to predict the actions.The experiments have been performed on two benchmark HOI datasets demonstrating the proposed approach. 展开更多
关键词 human object interaction human pose estimation object detection sports estimation sports prediction
在线阅读 下载PDF
Overview of 3D Human Pose Estimation 被引量:2
8
作者 Jianchu Lin Shuang Li +5 位作者 Hong Qin Hongchang Wang Ning Cui Qian Jiang Haifang Jian Gongming Wang 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第3期1621-1651,共31页
3D human pose estimation is a major focus area in the field of computer vision,which plays an important role in practical applications.This article summarizes the framework and research progress related to the estimat... 3D human pose estimation is a major focus area in the field of computer vision,which plays an important role in practical applications.This article summarizes the framework and research progress related to the estimation of monocular RGB images and videos.An overall perspective ofmethods integrated with deep learning is introduced.Novel image-based and video-based inputs are proposed as the analysis framework.From this viewpoint,common problems are discussed.The diversity of human postures usually leads to problems such as occlusion and ambiguity,and the lack of training datasets often results in poor generalization ability of the model.Regression methods are crucial for solving such problems.Considering image-based input,the multi-view method is commonly used to solve occlusion problems.Here,the multi-view method is analyzed comprehensively.By referring to video-based input,the human prior knowledge of restricted motion is used to predict human postures.In addition,structural constraints are widely used as prior knowledge.Furthermore,weakly supervised learningmethods are studied and discussed for these two types of inputs to improve the model generalization ability.The problem of insufficient training datasets must also be considered,especially because 3D datasets are usually biased and limited.Finally,emerging and popular datasets and evaluation indicators are discussed.The characteristics of the datasets and the relationships of the indicators are explained and highlighted.Thus,this article can be useful and instructive for researchers who are lacking in experience and find this field confusing.In addition,by providing an overview of 3D human pose estimation,this article sorts and refines recent studies on 3D human pose estimation.It describes kernel problems and common useful methods,and discusses the scope for further research. 展开更多
关键词 3D human pose estimation monocular camera deep learning MULTI-VIEW INDICATOR
在线阅读 下载PDF
A survey on monocular 3D human pose estimation 被引量:4
9
作者 Xiaopeng JI Qi FANG +3 位作者 Junting DONG Qing SHUAI Wen JIANG Xiaowei ZHOU 《Virtual Reality & Intelligent Hardware》 2020年第6期471-500,共30页
Recovering human pose from RGB images and videos has drawn increasing attention in recent years owing to minimum sensor requirements and applicability in diverse fields such as human-computer interaction,robotics,vide... Recovering human pose from RGB images and videos has drawn increasing attention in recent years owing to minimum sensor requirements and applicability in diverse fields such as human-computer interaction,robotics,video analytics,and augmented reality.Although a large amount of work has been devoted to this field,3D human pose estimation based on monocular images or videos remains a very challenging task due to a variety of difficulties such as depth ambiguities,occlusion,background clutters,and lack of training data.In this survey,we summarize recent advances in monocular 3D human pose estimation.We provide a general taxonomy to cover existing approaches and analyze their capabilities and limitations.We also present a summary of extensively used datasets and metrics,and provide a quantitative comparison of some representative methods.Finally,we conclude with a discussion on realistic challenges and open problems for future research directions. 展开更多
关键词 human pose estimation human motion capture
在线阅读 下载PDF
A Survey on Deep Learning-Based 2D Human Pose Estimation Models
10
作者 Sani Salisu A.S.A.Mohamed +2 位作者 M.H.Jaafar Ainun S.B.Pauzi Hussain A.Younis 《Computers, Materials & Continua》 SCIE EI 2023年第8期2385-2400,共16页
In this article,a comprehensive survey of deep learning-based(DLbased)human pose estimation(HPE)that can help researchers in the domain of computer vision is presented.HPE is among the fastest-growing research domains... In this article,a comprehensive survey of deep learning-based(DLbased)human pose estimation(HPE)that can help researchers in the domain of computer vision is presented.HPE is among the fastest-growing research domains of computer vision and is used in solving several problems for human endeavours.After the detailed introduction,three different human body modes followed by the main stages of HPE and two pipelines of twodimensional(2D)HPE are presented.The details of the four components of HPE are also presented.The keypoints output format of two popular 2D HPE datasets and the most cited DL-based HPE articles from the year of breakthrough are both shown in tabular form.This study intends to highlight the limitations of published reviews and surveys respecting presenting a systematic review of the current DL-based solution to the 2D HPE model.Furthermore,a detailed and meaningful survey that will guide new and existing researchers on DL-based 2D HPE models is achieved.Finally,some future research directions in the field of HPE,such as limited data on disabled persons and multi-training DL-based models,are revealed to encourage researchers and promote the growth of HPE research. 展开更多
关键词 human pose estimation deep learning 2D DATASET MODELS body parts
在线阅读 下载PDF
Local imperceptible adversarial attacks against human pose estimation networks
11
作者 Fuchang Liu Shen Zhang +2 位作者 Hao Wang Caiping Yan Yongwei Miao 《Visual Computing for Industry,Biomedicine,and Art》 EI 2023年第1期318-328,共11页
Deep neural networks are vulnerable to attacks from adversarial inputs.Corresponding attack research on human pose estimation(HPE),particularly for body joint detection,has been largely unexplored.Transferring classif... Deep neural networks are vulnerable to attacks from adversarial inputs.Corresponding attack research on human pose estimation(HPE),particularly for body joint detection,has been largely unexplored.Transferring classification-based attack methods to body joint regression tasks is not straightforward.Another issue is that the attack effectiveness and imperceptibility contradict each other.To solve these issues,we propose local imperceptible attacks on HPE networks.In particular,we reformulate imperceptible attacks on body joint regression into a constrained maximum allowable attack.Furthermore,we approximate the solution using iterative gradient-based strength refinement and greedy-based pixel selection.Our method crafts effective perceptual adversarial attacks that consider both human perception and attack effectiveness.We conducted a series of imperceptible attacks against state-of-the-art HPE methods,including HigherHRNet,DEKR,and ViTPose.The experimental results demonstrate that the proposed method achieves excellent imperceptibility while maintaining attack effectiveness by significantly reducing the number of perturbed pixels.Approximately 4%of the pixels can achieve sufficient attacks on HPE. 展开更多
关键词 Adversarial attack human pose estimation White-box attack IMPERCEPTIBILITY Local perturbation
在线阅读 下载PDF
Viewpoint Manipulation for Interactive Television by Using Human Pose Estimation
12
作者 仝明磊 《Journal of Shanghai Jiaotong university(Science)》 EI 2011年第5期538-542,共5页
In current interactive television schemes, the viewpoints should be manipulated by the user. However, there is no efficient method, to assist a user in automatically identifying and tracking the optimum viewpoint when... In current interactive television schemes, the viewpoints should be manipulated by the user. However, there is no efficient method, to assist a user in automatically identifying and tracking the optimum viewpoint when the user observes the object of interest because many objects, most often humans, move rapidly and frequently. This paper proposes a novel framework for determining and tracking the virtual camera to best capture the front of the person of interest (PoI). First, one PoI is interactively chosen in a segmented 3D scene reconstructed by space carving method. Second, key points of the human torso of the PoI are detected by using a model-based method and the human's global motion including rotation and translation is estimated by using a close-formed method with 3 corresponding points. At the last step, the front direction of PoI is tracked temporally by using the unscented particle filter (UPF). Experimental results show that the method can properly compute the front direction of the PoI and robustly track the best viewpoints. 展开更多
关键词 free viewpoint video human pose estimation voxel space carving
原文传递
A multi‑channel spatial information feature based human pose estimation algorithm
13
作者 Yinghong Xie Yan Hao +2 位作者 Xiaowei Han Qiang Gao Biao Yin 《Cybersecurity》 2025年第3期227-243,共17页
Human pose estimation is an important task in computer vision,which can provide key point detection of human body and obtain bone information.At present,human pose estimation is mainly utilized for detection of large ... Human pose estimation is an important task in computer vision,which can provide key point detection of human body and obtain bone information.At present,human pose estimation is mainly utilized for detection of large targets,and there is no solution for detection of small targets.This paper proposes a multi-channel spatial information feature based human pose(MCSF-Pose)estimation algorithm to address the issue of medium and small targets inaccurate detection of human key points in scenarios involving occlusion and multiple poses.The MCSF-Pose network is a bottom-up regression network.Firstly,an UP-Focus module is designed to expand the feature information while reducing parameter computation during the up-sampling process.Then,the channel segmentation strategy is adopted to cut the features,and the feature information of multiple dimensions is retained through different convolutional groups,which reduces the parameter lightweight network model and makes up for the loss of the feature information associated with the depth of the network.Finally,the three-layer PANet structure is designed to reduce the complexity of the model.With the aid of the structure,it also to improve the detection accuracy and anti-interference ability of human key points.The experimental results indicate that the proposed algorithm outperforms YOLO-Pose and other human pose estimation algorithms on COCO2017 and MPII human pose datasets. 展开更多
关键词 human pose estimation YOLO-pose Channel segmentation Key points detection CSPNet PANet
原文传递
Lightweight Human Pose Estimation Based on Multi-Attention Mechanism
14
作者 LIN Xiao LU Meichen +1 位作者 GAO Mufeng LI Yan 《Journal of Shanghai Jiaotong university(Science)》 2025年第5期899-910,共12页
Human pose estimation has received much attention from the research community because of its wide range of applications.However,current research for pose estimation is usually complex and computationally intensive,esp... Human pose estimation has received much attention from the research community because of its wide range of applications.However,current research for pose estimation is usually complex and computationally intensive,especially the feature loss problems in the feature fusion process.To address the above problems,we propose a lightweight human pose estimation network based on multi-attention mechanism(LMANet).In our method,network parameters can be significantly reduced by lightweighting the bottleneck blocks with depth-wise separable convolution on the high-resolution networks.After that,we also introduce a multi-attention mechanism to improve the model prediction accuracy,and the channel attention module is added in the initial stage of the network to enhance the local cross-channel information interaction.More importantly,we inject spatial crossawareness module in the multi-scale feature fusion stage to reduce the spatial information loss during feature extraction.Extensive experiments on COCO2017 dataset and MPII dataset show that LMANet can guarantee a higher prediction accuracy with fewer network parameters and computational effort.Compared with the highresolution network HRNet,the number of parameters and the computational complexity of the network are reduced by 67%and 73%,respectively. 展开更多
关键词 human pose estimation attention mechanisms multi-scale feature fusion high-resolution networks
原文传递
FaSRnet:a feature and semantics refinement network for human pose estimation
15
作者 Yuanhong ZHONG Qianfeng XU +2 位作者 Daidi ZHONG Xun YANG Shanshan WANG 《Frontiers of Information Technology & Electronic Engineering》 SCIE EI CSCD 2024年第4期513-526,共14页
Due to factors such as motion blur,video out-of-focus,and occlusion,multi-frame human pose estimation is a challenging task.Exploiting temporal consistency between consecutive frames is an efficient approach for addre... Due to factors such as motion blur,video out-of-focus,and occlusion,multi-frame human pose estimation is a challenging task.Exploiting temporal consistency between consecutive frames is an efficient approach for addressing this issue.Currently,most methods explore temporal consistency through refinements of the final heatmaps.The heatmaps contain the semantics information of key points,and can improve the detection quality to a certain extent.However,they are generated by features,and feature-level refinements are rarely considered.In this paper,we propose a human pose estimation framework with refinements at the feature and semantics levels.We align auxiliary features with the features of the current frame to reduce the loss caused by different feature distributions.An attention mechanism is then used to fuse auxiliary features with current features.In terms of semantics,we use the difference information between adjacent heatmaps as auxiliary features to refine the current heatmaps.The method is validated on the large-scale benchmark datasets PoseTrack2017 and PoseTrack2018,and the results demonstrate the effectiveness of our method. 展开更多
关键词 human pose estimation Multi-frame refinement Heatmap and offset estimation Feature alignment Multi-person
原文传递
Multipath affinage stacked-hourglass networks for human pose estimation 被引量:9
16
作者 Guoguang HUA Lihong LI Shiguang LIU 《Frontiers of Computer Science》 SCIE EI CSCD 2020年第4期155-165,共11页
Recently,stacked hourglass network has shown outstanding performance in human pose estimation.However,repeated bottom-up and top-down stride convolution operations in deep convolutional neural networks lead to a signi... Recently,stacked hourglass network has shown outstanding performance in human pose estimation.However,repeated bottom-up and top-down stride convolution operations in deep convolutional neural networks lead to a significant decrease in the initial image resolution.In order to address this problem,we propose to incorporate affinage module and residual attention module into stacked hourglass network for human pose estimation.This paper introduces a novel network architecture to replace the stacked hourglass network of up-sampling operation for getting high-resolution features.We refer to the architecture as an affinage module which is critical to improve the performance of the stacked hourglass network.Additionally,we also propose a novel residual attention module to increase the supervision of up-sample process.The effectiveness of the introduced module is evaluated on standard benchmarks.Various experimental results demonstrated that our method can achieve more accurate and more robust human pose estimation results in images with complex background. 展开更多
关键词 human pose estimation stacked hourglass network affinage module residual attention module
原文传递
RFPose-OT:RF-based 3D human pose estimation via optimal transport theory 被引量:2
17
作者 Cong YU Dongheng ZHANG +4 位作者 ZhiWU Zhi LU Chunyang XIE Yang HU Yan CHEN 《Frontiers of Information Technology & Electronic Engineering》 SCIE EI CSCD 2023年第10期1445-1457,共13页
This paper introduces a novel framework,i.e.,RFPose-OT,to enable three-dimensional(3D)human pose estimation from radio frequency(RF)signals.Different from existing methods that predict human poses from RF signals at t... This paper introduces a novel framework,i.e.,RFPose-OT,to enable three-dimensional(3D)human pose estimation from radio frequency(RF)signals.Different from existing methods that predict human poses from RF signals at the signal level directly,we consider the structure difference between the RF signals and the human poses,propose a transformation of the RF signals to the pose domain at the feature level based on the optimal transport(OT)theory,and generate human poses from the transformed features.To evaluate RFPose-OT,we build a radio system and a multi-view camera system to acquire the RF signal data and the ground-truth human poses.The experimental results in a basic indoor environment,an occlusion indoor environment,and an outdoor environment demonstrate that RFPose-OT can predict 3D human poses with higher precision than state-of-the-art methods. 展开更多
关键词 Radio frequency sensing human pose estimation Optimal transport Deep learning
原文传递
Human-pose estimation based on weak supervision 被引量:1
18
作者 Xiaoyan HU Xizhao BAO +1 位作者 Guoli WEI Zhaoyu LI 《Virtual Reality & Intelligent Hardware》 EI 2023年第4期366-377,共12页
Background In computer vision,simultaneously estimating human pose,shape,and clothing is a practical issue in real life,but remains a challenging task owing to the variety of clothing,complexity of de-formation,shorta... Background In computer vision,simultaneously estimating human pose,shape,and clothing is a practical issue in real life,but remains a challenging task owing to the variety of clothing,complexity of de-formation,shortage of large-scale datasets,and difficulty in estimating clothing style.Methods We propose a multistage weakly supervised method that makes full use of data with less labeled information for learning to estimate human body shape,pose,and clothing deformation.In the first stage,the SMPL human-body model parameters were regressed using the multi-view 2D key points of the human body.Using multi-view information as weakly supervised information can avoid the deep ambiguity problem of a single view,obtain a more accurate human posture,and access supervisory information easily.In the second stage,clothing is represented by a PCA-based model that uses two-dimensional key points of clothing as supervised information to regress the parameters.In the third stage,we predefine an embedding graph for each type of clothing to describe the deformation.Then,the mask information of the clothing is used to further adjust the deformation of the clothing.To facilitate training,we constructed a multi-view synthetic dataset that included BCNet and SURREAL.Results The Experiments show that the accuracy of our method reaches the same level as that of SOTA methods using strong supervision information while only using weakly supervised information.Because this study uses only weakly supervised information,which is much easier to obtain,it has the advantage of utilizing existing data as training data.Experiments on the DeepFashion2 dataset show that our method can make full use of the existing weak supervision information for fine-tuning on a dataset with little supervision information,compared with the strong supervision information that cannot be trained or adjusted owing to the lack of exact annotation information.Conclusions Our weak supervision method can accurately estimate human body size,pose,and several common types of clothing and overcome the issues of the current shortage of clothing data. 展开更多
关键词 human pose estimation Clothing estimation Weak supervision
在线阅读 下载PDF
SPPNet:Single-Person Human Parsing and Pose Estimation in RGB Videos
19
作者 Aditi Verma Vivek Tiwari Mayank Lovanshi 《Journal of Social Computing》 2025年第1期18-28,共11页
The human-centric visual analysis field thrives on rich video datasets that explore human behaviours and interactions.Yet,a gap persists in datasets covering both human pose estimation and parsing challenges.In this s... The human-centric visual analysis field thrives on rich video datasets that explore human behaviours and interactions.Yet,a gap persists in datasets covering both human pose estimation and parsing challenges.In this study,a notable effort has been made to develop a dedicated dataset named“Single Person Video-in-Person(SP-VIP)”to suit the research scenario,resolving a lack of a universal dataset to support three major human-centric visual analysis methods.The SP-VIP dataset was derived by extracting videos from the VIP dataset initially designed exclusively for parsing-related tasks.Furthermore,the VIP dataset did not encompass provisions for pose estimation and human activity recognition,which are crucial elements for human activity recognition.To bridge this gap,the SP-VIP dataset was meticulously curated with a specific focus on single-person activities.Videos in the newly created dataset are split into frames with semantic labels and joint values for each frame.To assess the performance of the tailored dataset,a novel architecture Single-person Parsing and Pose Network(SPPNet)was employed using a Deep ConvNet network for parsing while simultaneously performing pose estimation using the stacked hourglass method.To demonstrate the effectiveness of the newly created dataset,extensive experiments were performed on the discussed architecture,which produced favourable results with a pixel accuracy of 88.50%,a mean accuracy of 60.50%,and a mean Intersection over Union(IoU)of 49.30%signifying enhancement in performance. 展开更多
关键词 human parsing human pose estimation human activity recognition Single Person Video-in-Person(SP-VIP)
原文传递
Full Scale-Aware Balanced High-Resolution Network for Multi-Person Pose Estimation
20
作者 Shaohua Li Haixiang Zhang +2 位作者 HanjieMa Jie Feng Mingfeng Jiang 《Computers, Materials & Continua》 SCIE EI 2023年第9期3379-3392,共14页
Scale variation is amajor challenge inmulti-person pose estimation.In scenes where persons are present at various distances,models tend to perform better on larger-scale persons,while the performance for smaller-scale... Scale variation is amajor challenge inmulti-person pose estimation.In scenes where persons are present at various distances,models tend to perform better on larger-scale persons,while the performance for smaller-scale persons often falls short of expectations.Therefore,effectively balancing the persons of different scales poses a significant challenge.So this paper proposes a newmulti-person pose estimation model called FSANet to improve themodel’s performance in complex scenes.Our model utilizes High-Resolution Network(HRNet)as the backbone and feeds the outputs of the last stage’s four branches into the DCB module.The dilated convolution-based(DCB)module employs a parallel structure that incorporates dilated convolutions with different rates to expand the receptive field of each branch.Subsequently,the attention operation-based(AOB)module performs attention operations at both branch and channel levels to enhance high-frequency features and reduce the influence of noise.Finally,predictions are made using the heatmap representation.The model can recognize images with diverse scales and more complex semantic information.Experimental results demonstrate that FSA Net achieves competitive results on the MSCOCO and MPII datasets,validating the effectiveness of our proposed approach. 展开更多
关键词 Computer vision high-resolution network human pose estimation
在线阅读 下载PDF
上一页 1 2 下一页 到第
使用帮助 返回顶部