Detecting feature points on the human body in video frames is a key step for tracking human movements. There have been methods developed that leverage models of human pose and classification of pixels of the body imag...Detecting feature points on the human body in video frames is a key step for tracking human movements. There have been methods developed that leverage models of human pose and classification of pixels of the body image. Yet, occlusion and robustness are still open challenges. In this paper, we present an automatic, model-free feature point detection and action tracking method using a time-of-flight camera. Our method automatically detects feature points for movement abstraction. To overcome errors caused by miss-detection and occlusion, a refinement method is devised that uses the trajectory of the feature points to correct the erroneous detections. Experiments were conducted using videos acquired with a Microsoft Kinect camera and a publicly available video set and comparisons were conducted with the state-of-the-art methods. The results demonstrated that our proposed method delivered improved and reliable performance with an average accuracy in the range of 90 %.The trajectorybased refinement also demonstrated satisfactory effectiveness that recovers the detection with a success rate of 93.7 %. Our method processed a frame in an average time of 71.1 ms.展开更多
With the rapid advancements in artificial intelligence and computer vision technology,thefield of visual-based human pose detection has emerged as a highly sought-after research area in recent years.The identification of...With the rapid advancements in artificial intelligence and computer vision technology,thefield of visual-based human pose detection has emerged as a highly sought-after research area in recent years.The identification of human poses has practical applications in diverse domains,ranging from motion-sensing games for human-computer interaction to activity prediction and medical rehabil-itation.The present study is focused on the utilization of human pose detection forfitness movement counting.The ultimate aim of the system design is to accurately detect the skeletal key points of each body part in the image and subsequently con-nect them to form a human pose skeleton,which serves as a vital representation of the characteristics of human motion,particularly in the context of video data,where multiple human poses can be linked to form a certain movement trajectory.By judging the trajectory and angle changes,the system can determine whether people’sfitness movements are correct and help them improve theirfitness effec-tiveness.Hence,an increasing number of researchers are investing time and effort in thisfield.One common approach for human pose detection is OpenPose,but this model has a large and complex structure and low detection accuracy.Therefore,thisfitness movement detection and counting system uses a lightweight MediaPipe model and improves it to enhance the algorithm’s accuracy and recognition speed.The specific work in this paper includes three main points:(1)a suitable network structure to detect human skeletal points;(2)the appropriate skeletal structure forfitness movements through experiments to obtain accurate results;and(3)a Qt interface for human-computer interaction.展开更多
文摘Detecting feature points on the human body in video frames is a key step for tracking human movements. There have been methods developed that leverage models of human pose and classification of pixels of the body image. Yet, occlusion and robustness are still open challenges. In this paper, we present an automatic, model-free feature point detection and action tracking method using a time-of-flight camera. Our method automatically detects feature points for movement abstraction. To overcome errors caused by miss-detection and occlusion, a refinement method is devised that uses the trajectory of the feature points to correct the erroneous detections. Experiments were conducted using videos acquired with a Microsoft Kinect camera and a publicly available video set and comparisons were conducted with the state-of-the-art methods. The results demonstrated that our proposed method delivered improved and reliable performance with an average accuracy in the range of 90 %.The trajectorybased refinement also demonstrated satisfactory effectiveness that recovers the detection with a success rate of 93.7 %. Our method processed a frame in an average time of 71.1 ms.
文摘With the rapid advancements in artificial intelligence and computer vision technology,thefield of visual-based human pose detection has emerged as a highly sought-after research area in recent years.The identification of human poses has practical applications in diverse domains,ranging from motion-sensing games for human-computer interaction to activity prediction and medical rehabil-itation.The present study is focused on the utilization of human pose detection forfitness movement counting.The ultimate aim of the system design is to accurately detect the skeletal key points of each body part in the image and subsequently con-nect them to form a human pose skeleton,which serves as a vital representation of the characteristics of human motion,particularly in the context of video data,where multiple human poses can be linked to form a certain movement trajectory.By judging the trajectory and angle changes,the system can determine whether people’sfitness movements are correct and help them improve theirfitness effec-tiveness.Hence,an increasing number of researchers are investing time and effort in thisfield.One common approach for human pose detection is OpenPose,but this model has a large and complex structure and low detection accuracy.Therefore,thisfitness movement detection and counting system uses a lightweight MediaPipe model and improves it to enhance the algorithm’s accuracy and recognition speed.The specific work in this paper includes three main points:(1)a suitable network structure to detect human skeletal points;(2)the appropriate skeletal structure forfitness movements through experiments to obtain accurate results;and(3)a Qt interface for human-computer interaction.