期刊文献+
共找到1,182篇文章
< 1 2 60 >
每页显示 20 50 100
Human Interaction Recognition in Surveillance Videos Using Hybrid Deep Learning and Machine Learning Models
1
作者 Vesal Khean Chomyong Kim +5 位作者 Sunjoo Ryu Awais Khan Min Kyung Hong Eun Young Kim Joungmin Kim Yunyoung Nam 《Computers, Materials & Continua》 SCIE EI 2024年第10期773-787,共15页
Human Interaction Recognition(HIR)was one of the challenging issues in computer vision research due to the involvement of multiple individuals and their mutual interactions within video frames generated from their mov... Human Interaction Recognition(HIR)was one of the challenging issues in computer vision research due to the involvement of multiple individuals and their mutual interactions within video frames generated from their movements.HIR requires more sophisticated analysis than Human Action Recognition(HAR)since HAR focuses solely on individual activities like walking or running,while HIR involves the interactions between people.This research aims to develop a robust system for recognizing five common human interactions,such as hugging,kicking,pushing,pointing,and no interaction,from video sequences using multiple cameras.In this study,a hybrid Deep Learning(DL)and Machine Learning(ML)model was employed to improve classification accuracy and generalizability.The dataset was collected in an indoor environment with four-channel cameras capturing the five types of interactions among 13 participants.The data was processed using a DL model with a fine-tuned ResNet(Residual Networks)architecture based on 2D Convolutional Neural Network(CNN)layers for feature extraction.Subsequently,machine learning models were trained and utilized for interaction classification using six commonly used ML algorithms,including SVM,KNN,RF,DT,NB,and XGBoost.The results demonstrate a high accuracy of 95.45%in classifying human interactions.The hybrid approach enabled effective learning,resulting in highly accurate performance across different interaction types.Future work will explore more complex scenarios involving multiple individuals based on the application of this architecture. 展开更多
关键词 Convolutional neural network deep learning human interaction recognition ResNet skeleton joint key points human pose estimation hybrid deep learning and machine learning
在线阅读 下载PDF
Hand Gesture Recognition Using Appearance Features Based on 3D Point Cloud 被引量:2
2
作者 Yanwen Chong Jianfeng Huang Shaoming Pan 《Journal of Software Engineering and Applications》 2016年第4期103-111,共9页
This paper presents a method for hand gesture recognition based on 3D point cloud. Digital image processing technology is used in this research. Based on the 3D point from depth camera, the system firstly extracts som... This paper presents a method for hand gesture recognition based on 3D point cloud. Digital image processing technology is used in this research. Based on the 3D point from depth camera, the system firstly extracts some raw data of the hand. After the data segmentation and preprocessing, three kinds of appearance features are extracted, including the number of stretched fingers, the angles between fingers and the gesture region’s area distribution feature. Based on these features, the system implements the identification of the gestures by using decision tree method. The results of experiment demonstrate that the proposed method is pretty efficient to recognize common gestures with a high accuracy. 展开更多
关键词 Human-Computer-Interaction Gesture recognition 3D point Cloud Depth Image
在线阅读 下载PDF
基于改进PointNet++的集装箱船绑扎眼板智能识别算法
3
作者 闫家文 陈震 +2 位作者 马誉贤 钮东辉 郝传宏 《船舶工程》 北大核心 2025年第11期145-152,共8页
[目的]为了对绑扎桥点云数据实现端到端智能识别,[方法]提出一种改进PointNet++的绑扎眼板智能识别算法。基于实船扫描获取的集装箱船绑扎桥点云数据,使用CloudCompare软件对绑扎眼板进行语义分割标注,构建LashingNet数据集,基于绑扎眼... [目的]为了对绑扎桥点云数据实现端到端智能识别,[方法]提出一种改进PointNet++的绑扎眼板智能识别算法。基于实船扫描获取的集装箱船绑扎桥点云数据,使用CloudCompare软件对绑扎眼板进行语义分割标注,构建LashingNet数据集,基于绑扎眼板理论模型构建综合损失函数,通过该数据集训练改进的PointNet++网络。[结果]结果表明,网络模型在LashingNet训练集和验证集上的收敛结果趋于稳定,在验证集上的总体准确率(OA)达到99.15%,平均交并比(mIoU)达到96.61%,在全新测试集上眼板有效识别率达到94.64%,证明了模型良好的泛化性。[结论]该方法具有良好的识别能力,在集装箱船绑扎眼板的智能识别领域有良好的应用潜力。 展开更多
关键词 船舶绑扎桥 船舶建造 三维点云 pointNet++ 智能绑扎识别
原文传递
Recognition Method for Change Point of Traffic Flow Linear Regressions
4
作者 张敬磊 王晓原 马立云 《Journal of Donghua University(English Edition)》 EI CAS 2012年第1期59-61,共3页
Recognition method of traffic flow change point was put forward based on traffic flow theory and the statistical change point analysis of multiple linear regressions. The method was calibrated and tested with the fiel... Recognition method of traffic flow change point was put forward based on traffic flow theory and the statistical change point analysis of multiple linear regressions. The method was calibrated and tested with the field data of Liantong Road of Zibo city to verify the validity and the feasibility of the theory. The results show that change point method of multiple linear regression can make out the rule of quantitative changes in traffic flow more accurately than ordinary methods. So, the change point method can be applied to traffic information management system more effectively. 展开更多
关键词 traffic flow quantitative changes multiple linear regressions change point recognition
在线阅读 下载PDF
WADE-Net: Weighted Aggregation with Density Estimation for Point Cloud Place Recognition
5
作者 Ke Liu Xing Wang +2 位作者 Yaxin Peng Zhen Ye Chaozheng Zhou 《Advances in Pure Mathematics》 2021年第5期502-523,共22页
Point cloud based place recognition plays an important role in mobile robotics. In this paper, we propose a weighted aggregation method from structure information adaptively for point cloud place recognition. Firstly,... Point cloud based place recognition plays an important role in mobile robotics. In this paper, we propose a weighted aggregation method from structure information adaptively for point cloud place recognition. Firstly, to preserve the prior distributions and local geometric structures, we fuse learned hidden features with handcrafted features in the beginning. Secondly, we further extract and aggregate adaptively weighted features concerning density and relative spatial information from these fused features, named Weighted Aggregation with Density Estimation (WADE) module. Then, we conduct the WADE block iteratively to group the latent manifold structures. Finally, comparison results on two public datasets Oxford Robotcar and KITTI show that the proposed approach exceeds the comparison approaches on recall rate averagely 7% - 8%. 展开更多
关键词 point Cloud Place recognition Deep Learning Feature Extraction
在线阅读 下载PDF
Understanding Local Conformation in Cyclic and Linear Polymers Using Molecular Dynamics and Point Cloud Neural Network
6
作者 Wan-Chen Zhao Hai-Yang Huo +1 位作者 Zhong-Yuan Lu Zhao-Yan Sun 《Chinese Journal of Polymer Science》 2025年第5期695-710,共16页
Understanding the conformational characteristics of polymers is key to elucidating their physical properties.Cyclic polymers,defined by their closed-loop structures,inherently differ from linear polymers possessing di... Understanding the conformational characteristics of polymers is key to elucidating their physical properties.Cyclic polymers,defined by their closed-loop structures,inherently differ from linear polymers possessing distinct chain ends.Despite these structural differences,both types of polymers exhibit locally random-walk-like conformations,making it challenging to detect subtle spatial variations using conventional methods.In this study,we address this challenge by integrating molecular dynamics simulations with point cloud neural networks to analyze the spatial conformations of cyclic and linear polymers.By utilizing the Dynamic Graph CNN(DGCNN)model,we classify polymer conformations based on the 3D coordinates of monomers,capturing local and global topological differences without considering chain connectivity sequentiality.Our findings reveal that the optimal local structural feature unit size scales linearly with molecular weight,aligning with theoretical predictions.Additionally,interpretability techniques such as Grad-CAM and SHAP identify significant conformational differences:cyclic polymers tend to form prolate ellipsoid shapes with pronounced elongation along the major axis,while linear polymers show elongated ends with more spherical centers.These findings reveal subtle yet critical differences in local conformations between cyclic and linear polymers that were previously difficult to discern,providing deeper insights into polymer structure-property relationships and offering guidance for future polymer science advancements. 展开更多
关键词 Molecular dynamics simulation point cloud Interpretable deep learning Conformational recognition
原文传递
Pre-process algorithm for satellite laser ranging data based on curve recognition from points cloud
7
作者 Liu Yanyu Zhao Dongming Wu Shan 《Geodesy and Geodynamics》 2012年第2期53-59,共7页
The satellite laser ranging (SLR) data quality from the COMPASS was analyzed, and the difference between curve recognition in computer vision and pre-process of SLR data finally proposed a new algorithm for SLR was ... The satellite laser ranging (SLR) data quality from the COMPASS was analyzed, and the difference between curve recognition in computer vision and pre-process of SLR data finally proposed a new algorithm for SLR was discussed data based on curve recognition from points cloud is proposed. The results obtained by the new algorithm are 85 % (or even higher) consistent with that of the screen displaying method, furthermore, the new method can process SLR data automatically, which makes it possible to be used in the development of the COMPASS navigation system. 展开更多
关键词 satellite laser ranging (SLR) curve recognition points cloud pre-process algorithm COM- PASS screen displaying
原文传递
基于PointNet的钢板毛坯垛点云分割
8
作者 林振杨 《机电工程技术》 2025年第1期152-156,共5页
钢板毛坯垛拆垛工序中,钢板毛坯分层厚度的估计是推钢机准确且安全执行推钢动作的关键;当前不少企业此道生产工序仍主要靠操作员人工观察的方式估算钢板厚度及与传送辊道的相对高度,其估算不准易导致碰撞,造成输送设备损坏生产中断。提... 钢板毛坯垛拆垛工序中,钢板毛坯分层厚度的估计是推钢机准确且安全执行推钢动作的关键;当前不少企业此道生产工序仍主要靠操作员人工观察的方式估算钢板厚度及与传送辊道的相对高度,其估算不准易导致碰撞,造成输送设备损坏生产中断。提出一种钢板毛坯垛智能分层方法,该方法结合现场工况环境采用激光雷达,对钢板毛坯垛进行三维点云成像,然后对采集的点云数据,用PointNet神经网络框架进行特征识别、分层分割与提取,最后对分割的不同层根据标定值,换算成真实厚度。根据现场实验结果表明,PointNet对钢板毛坯垛分割的识别率达到了87.4%,厚度估算误差低于1.2cm,根据钢厂钢板毛坯规格表(3种规格150、160、180cm)可以准确估计出钢板毛坯厚度的规格,识别速度15帧/s,满足现场工况要求。 展开更多
关键词 钢板毛坯垛 点云分割 特征识别 pointNet
在线阅读 下载PDF
Study of Human Action Recognition Based on Improved Spatio-temporal Features 被引量:7
9
作者 Xiao-Fei Ji Qian-Qian Wu +1 位作者 Zhao-Jie Ju Yang-Yang Wang 《International Journal of Automation and computing》 EI CSCD 2014年第5期500-509,共10页
Most of the exist action recognition methods mainly utilize spatio-temporal descriptors of single interest point while ignoring their potential integral information, such as spatial distribution information. By combin... Most of the exist action recognition methods mainly utilize spatio-temporal descriptors of single interest point while ignoring their potential integral information, such as spatial distribution information. By combining local spatio-temporal feature and global positional distribution information(PDI) of interest points, a novel motion descriptor is proposed in this paper. The proposed method detects interest points by using an improved interest point detection method. Then, 3-dimensional scale-invariant feature transform(3D SIFT) descriptors are extracted for every interest point. In order to obtain a compact description and efficient computation, the principal component analysis(PCA) method is utilized twice on the 3D SIFT descriptors of single frame and multiple frames. Simultaneously, the PDI of the interest points are computed and combined with the above features. The combined features are quantified and selected and finally tested by using the support vector machine(SVM) recognition algorithm on the public KTH dataset. The testing results have showed that the recognition rate has been significantly improved and the proposed features can more accurately describe human motion with high adaptability to scenarios. 展开更多
关键词 Action recognition spatio-temporal interest points 3-dimensional scale-invariant feature transform (3D SIFT) positional distribution information dimension reduction
原文传递
Using multi-matching system based on a simplified deformable model of the human iris for iris recognition 被引量:2
10
作者 MING Xing , XU Tao , WANG Zheng-xuan 1 2 3 1. College of Computer Science and Technology, Nanling Campus,Jilin University, 5988 Renmin Street,Changchun 130022, P. R. China 2. College of Mechanical Science and Engineering, Nanling Campus,Jilin University, 5988 Renmin Street, Changchun 130022, P. R. China 3. College of Computer Science and Technology, Qianwei Campus,Jilin University, 10 Qianwei Road, Changchun 130012, P. R. China. 《Journal of Bionic Engineering》 SCIE EI CSCD 2004年第3期183-190,共8页
A new method for iris recognition using a multi-matching system based on a simplified deformable model of the human iris was proposed. The method defined iris feature points and formed the feature space based on a wa... A new method for iris recognition using a multi-matching system based on a simplified deformable model of the human iris was proposed. The method defined iris feature points and formed the feature space based on a wavelet transform. In the matching stage it worked in a crude manner. Driven by a simplified deformable iris model, the crude matching was refined. By means of such multi-matching system, the task of iris recognition was accomplished. This process can preserve the elastic deformation between an input iris image and a template and improve precision for iris recognition. The experimental results indicate the va- lidity of this method. 展开更多
关键词 iris recognition wavelet transform feature points deformable model
在线阅读 下载PDF
Summed volume region selection based three-dimensional automatic target recognition for airborne LIDAR 被引量:2
11
作者 Qi-shu Qian Yi-hua Hu +2 位作者 Nan-xiang Zhao Min-le Li Fu-cai Shao 《Defence Technology(防务技术)》 SCIE EI CAS CSCD 2020年第3期535-542,共8页
Airborne LIDAR can flexibly obtain point cloud data with three-dimensional structural information,which can improve its effectiveness of automatic target recognition in the complex environment.Compared with 2D informa... Airborne LIDAR can flexibly obtain point cloud data with three-dimensional structural information,which can improve its effectiveness of automatic target recognition in the complex environment.Compared with 2D information,3D information performs better in separating objects and background.However,an aircraft platform can have a negative influence on LIDAR obtained data because of various flight attitudes,flight heights and atmospheric disturbances.A structure of global feature based 3D automatic target recognition method for airborne LIDAR is proposed,which is composed of offline phase and online phase.The performance of four global feature descriptors is compared.Considering the summed volume region(SVR) discrepancy in real objects,SVR selection is added into the pre-processing operations to eliminate mismatching clusters compared with the interested target.Highly reliable simulated data are obtained under various sensor’s altitudes,detection distances and atmospheric disturbances.The final experiments results show that the added step increases the recognition rate by above 2.4% and decreases the execution time by about 33%. 展开更多
关键词 3D automatic target recognition point cloud LIDAR AIRBORNE Global feature descriptor
在线阅读 下载PDF
Optimal Deep Convolutional Neural Network with Pose Estimation for Human Activity Recognition 被引量:1
12
作者 S.Nandagopal G.Karthy +1 位作者 A.Sheryl Oliver M.Subha 《Computer Systems Science & Engineering》 SCIE EI 2023年第2期1719-1733,共15页
Human Action Recognition(HAR)and pose estimation from videos have gained significant attention among research communities due to its applica-tion in several areas namely intelligent surveillance,human robot interaction... Human Action Recognition(HAR)and pose estimation from videos have gained significant attention among research communities due to its applica-tion in several areas namely intelligent surveillance,human robot interaction,robot vision,etc.Though considerable improvements have been made in recent days,design of an effective and accurate action recognition model is yet a difficult process owing to the existence of different obstacles such as variations in camera angle,occlusion,background,movement speed,and so on.From the literature,it is observed that hard to deal with the temporal dimension in the action recognition process.Convolutional neural network(CNN)models could be used widely to solve this.With this motivation,this study designs a novel key point extraction with deep convolutional neural networks based pose estimation(KPE-DCNN)model for activity recognition.The KPE-DCNN technique initially converts the input video into a sequence of frames followed by a three stage process namely key point extraction,hyperparameter tuning,and pose estimation.In the keypoint extraction process an OpenPose model is designed to compute the accurate key-points in the human pose.Then,an optimal DCNN model is developed to classify the human activities label based on the extracted key points.For improving the training process of the DCNN technique,RMSProp optimizer is used to optimally adjust the hyperparameters such as learning rate,batch size,and epoch count.The experimental results tested using benchmark dataset like UCF sports dataset showed that KPE-DCNN technique is able to achieve good results compared with benchmark algorithms like CNN,DBN,SVM,STAL,T-CNN and so on. 展开更多
关键词 Human activity recognition pose estimation key point extraction classification deep learning RMSProp
在线阅读 下载PDF
Automatic Terrain Debris Recognition Network Based on 3D Remote Sensing Data
13
作者 Xu Han Huijun Yang +4 位作者 Qiufeng Shen Jiangtao Yang Huihui Liang Cancan Bao Shuang Cang 《Computers, Materials & Continua》 SCIE EI 2020年第10期579-596,共18页
Although predecessors have made great contributions to the semantic segmentation of 3D indoor scenes,there still exist some challenges in the debris recognition of terrain data.Compared with hundreds of thousands of i... Although predecessors have made great contributions to the semantic segmentation of 3D indoor scenes,there still exist some challenges in the debris recognition of terrain data.Compared with hundreds of thousands of indoor point clouds,the amount of terrain point cloud is up to millions.Apart from that,terrain point cloud data obtained from remote sensing is measured in meters,but the indoor scene is measured in centimeters.In this case,the terrain debris obtained from remote sensing mapping only have dozens of points,which means that sufficient training information cannot be obtained only through the convolution of points.In this paper,we build multi-attribute descriptors containing geometric information and color information to better describe the information in low-precision terrain debris.Therefore,our process is aimed at the multi-attribute descriptors of each point rather than the point.On this basis,an unsupervised classification algorithm is proposed to divide the point cloud into several terrain areas,and regard each area as a graph vertex named super point to form the graph structure,thus effectively reducing the number of the terrain point cloud from millions to hundreds.Then we proposed a graph convolution network by employing PointNet for graph embedding and recurrent gated graph convolutional network for classification.Our experiments show that the terrain point cloud can reduce the amount of data from millions to hundreds through the super point graph based on multi-attribute descriptor and our accuracy reached 91.74%and the IoU reached 94.08%,both of which were significantly better than the current methods such as SEGCloud(Acc:88.63%,IoU:89.29%)and PointCNN(Acc:86.35,IoU:87.26). 展开更多
关键词 Semantic segmentation low-precision point cloud large-scale terrain debris recognition
在线阅读 下载PDF
About a Method of Recognition of Race and Ethnicity of Individuals Based on Portrait Photographs
14
作者 Tofiq Kazimov Shafagat Mahmudova 《Intelligent Control and Automation》 2014年第3期120-125,共6页
This article is devoted to developing a recognition method of race and ethnicity of individual based on portrait photographs. The reference image is formed based on selected geometric points of the face and a special ... This article is devoted to developing a recognition method of race and ethnicity of individual based on portrait photographs. The reference image is formed based on selected geometric points of the face and a special algorithm for calculating the characteristic parameters of the images available in the database. Next, the original image is compared with the reference images of ethnic groups, and thus, the affiliation of the original image to a specific ethnic group is determined. 展开更多
关键词 recognition RACE ETHNICITY ETHNIC Group RACIAL Origin Identification points
暂未订购
Face Recognition from Incomplete Measurements via <i>l<sub>1</sub></i>-Optimization
15
作者 Miguel Argaez Reinaldo Sanchez Carlos Ramirez 《American Journal of Computational Mathematics》 2012年第4期287-294,共8页
In this work, we consider a homotopic principle for solving large-scale and dense l1underdetermined problems and its applications in image processing and classification. We solve the face recognition problem where the... In this work, we consider a homotopic principle for solving large-scale and dense l1underdetermined problems and its applications in image processing and classification. We solve the face recognition problem where the input image contains corrupted and/or lost pixels. The approach involves two steps: first, the incomplete or corrupted image is subject to an inpainting process, and secondly, the restored image is used to carry out the classification or recognition task. Addressing these two steps involves solving large scale l1minimization problems. To that end, we propose to solve a sequence of linear equality constrained multiquadric problems that depends on a regularization parameter that converges to zero. The procedure generates a central path that converges to a point on the solution set of the l1underdetermined problem. In order to solve each subproblem, a conjugate gradient algorithm is formulated. When noise is present in the model, inexact directions are taken so that an approximate solution is computed faster. This prevents the ill conditioning produced when the conjugate gradient is required to iterate until a zero residual is attained. 展开更多
关键词 SPARSE Representation l1Minimization Face recognition SPARSE Recovery INTERIOR point Methods SPARSE REGULARIZATION
在线阅读 下载PDF
Real-Time Safety Behavior Detection Technology of Indoors Power Personnel Based on Human Key Points
16
作者 杨坚 李聪敏 +5 位作者 洪道鉴 卢东祁 林秋佳 方兴其 喻谦 张乾 《Journal of Shanghai Jiaotong university(Science)》 EI 2024年第2期309-315,共7页
Safety production is of great significance to the development of enterprises and society.Accidents often cause great losses because of the particularity environment of electric power.Therefore,it is important to impro... Safety production is of great significance to the development of enterprises and society.Accidents often cause great losses because of the particularity environment of electric power.Therefore,it is important to improve the safety supervision and protection in the electric power environment.In this paper,we simulate the actual electric power operation scenario by monitoring equipment and propose a real-time detection method of illegal actions based on human body key points to ensure safety behavior in real time.In this method,the human body key points in video frames were first extracted by the high-resolution network,and then classified in real time by spatial-temporal graph convolutional network.Experimental results show that this method can effectively detect illegal actions in the simulated scene. 展开更多
关键词 real-time behavior recognition human key points high-resolution network spatial-temporal graph convolutional network
原文传递
基于改进PointRCNN模型的柑橘树识别算法
17
作者 王亭亭 谭德权 +5 位作者 戴宁湘 郭霆峰 吕柯 罗梁梁 胡文武 蒋蘋 《农业工程与装备》 2024年第4期43-49,共7页
针对农业机器人在果园作业中受环境因素影响导致的目标识别不准确等问题,提出了一种基于改进PointRCNN模型的3D目标检测与识别算法。该算法利用激光雷达采集点云数据,建立自制果园场景数据集,并通过高斯滤波对点云数据进行预处理,以去... 针对农业机器人在果园作业中受环境因素影响导致的目标识别不准确等问题,提出了一种基于改进PointRCNN模型的3D目标检测与识别算法。该算法利用激光雷达采集点云数据,建立自制果园场景数据集,并通过高斯滤波对点云数据进行预处理,以去除噪声干扰。为进一步提升检测性能,在算法中引入了交叉注意力(Cross Attention,CA)机制,用于优化点云特征并生成高质量的3D候选框;同时引入CSPNet结构对3D候选框进行优化选择以提高网络检测准确率。实验结果表明,与原始PointRCNN算法相比,该算法在柑橘树树干识别任务中的平均精度值得到了有效提高,并有效缓解了3D点云数据稀疏性和不规则性对检测精度的影响,为农业机器人在复杂果园环境下的精准作业提供了技术支持。 展开更多
关键词 pointRCNN算法 激光雷达 点云识别 深度学习
在线阅读 下载PDF
基于点与体素融合的人体步态识别方法
18
作者 郭剑 鲁捷敏 +2 位作者 韩崇 许棣华 孙力娟 《小型微型计算机系统》 北大核心 2025年第11期2643-2650,共8页
毫米波雷达具有分辨率高、抗干扰能力强和对人体隐私侵犯少等优点,在身份识别领域中具有较好的应用前景.其中,基于毫米波雷达点云的步态识别已成为热门的研究方向之一.但这类方法大多基于点模型进行全局信息处理,对局部信息感知不足,从... 毫米波雷达具有分辨率高、抗干扰能力强和对人体隐私侵犯少等优点,在身份识别领域中具有较好的应用前景.其中,基于毫米波雷达点云的步态识别已成为热门的研究方向之一.但这类方法大多基于点模型进行全局信息处理,对局部信息感知不足,从而导致算法的准确性不够.针对上述问题,该文提出了一种基于点体素交叉注意力机制的步态识别方法(gait recognition based on Point-Voxel fusion and Cross-attention,gaitPVC).该方法对数据采用了多帧融合的处理,利用双分支网络分别从点数据和体素数据协作提取并融合全局与局部特征,然后利用时序网络提取时序特征,以更好地提取人体步态信息.仿真结果表明,该文方法具有较好的鲁棒性和准确率. 展开更多
关键词 步态识别 点云 多模态 毫米波雷达
在线阅读 下载PDF
基于2D卷积神经网络的3D点云物体检测
19
作者 李晓丽 王乐 +1 位作者 杜振龙 陈东 《计算机工程与应用》 北大核心 2025年第23期297-304,共8页
激光雷达在自动驾驶和工业自动化领域已得到初步应用,获取了大量的场景、物体等点云数据,这些点云数据具有维度高、不规则的特性,已有的深度学习网络模型在处理这些数据时需用到计算代价高昂的三维卷积,其时空复杂度高且不能在线应用。... 激光雷达在自动驾驶和工业自动化领域已得到初步应用,获取了大量的场景、物体等点云数据,这些点云数据具有维度高、不规则的特性,已有的深度学习网络模型在处理这些数据时需用到计算代价高昂的三维卷积,其时空复杂度高且不能在线应用。针对传统网络模型处理点云数据的缺陷,提出一种基于2D卷积神经网络的3D点云物体识别方法,所提方法把不规则的点云数据统计规整为点云柱,用卷积、池化提取点云柱簇的特征,将三维的点云数据编码转化为二维的类图像特征数据;使用包含注意力机制的二维卷积神经网络在多个感受野提取充分表示点云的多尺度隐特征,解码网络根据位置、方向及物体种类识别点云物体。实验基于AscendAtlas 200DK边端设备,单次推理耗时291 ms,实验结果与传统点云目标检测网络进行比较,分别以14.7、13.2、3.4倍的性能提升优于Voxel-Net、F-PoitnNet以及Second网络模型;在KITTI数据集与ContFuse等14种点云目标检测算法进行精度对比,与次优算法相比,平均精度提升在2.3%以上;设计针对二维卷积以及注意力机制的消融实验,两个模块在模型大小与推理精度上分别提升50.9%和5.37%。实验结果表明,所提方法可高效、鲁棒、准确地检测3D点云数据的目标物体。 展开更多
关键词 3D点云 点云物体识别 深度学习 点云柱 类图像
在线阅读 下载PDF
面向无序分拣场景的工件6D位姿检测方法
20
作者 曹学鹏 李鑫 +4 位作者 冯艳丽 石瑞 葛天烨 张新荣 赵睿英 《工程科学与技术》 北大核心 2025年第5期298-308,共11页
目标6D位姿检测是实现机器人自主抓取的关键。为克服传统点对识别(PPF)方法检测性能差、耗时及难以检测到多平面特征工件的6D位姿等不足,提出面向无序分拣场景的工件6D位姿检测方法。首先,基于模型平面点分布筛选多平面特征工件,提取其... 目标6D位姿检测是实现机器人自主抓取的关键。为克服传统点对识别(PPF)方法检测性能差、耗时及难以检测到多平面特征工件的6D位姿等不足,提出面向无序分拣场景的工件6D位姿检测方法。首先,基于模型平面点分布筛选多平面特征工件,提取其边界特征进行6D位姿检测,并在多视点下提取模型点对以去除冗余点对,提高算法识别速度。其次,匹配场景与模型间的点对特征,利用快速投票方案获取无序场景中目标的位姿假设集合。接下来,通过位姿验证筛选方法,剔除重复和误匹配位姿,实现目标多实例位姿的粗略估计,并借助迭代最近点(ICP)算法完成目标位姿的精确估计。实验结果表明:在无序仿真场景中,单次识别时间小于等于1.15 s,平均平移偏差小于等于0.95 mm,平均旋转误差小于等于1.56°;在实际场景中,平均识别成功率为95.82%,平均单次识别时间为1.11 s。综上,该6D位姿检测方法在保证识别效率的同时兼顾了位姿估计精度,并在识别精度和速度上均优于同类算法,为机器人的精准抓取的实现提供了有力的保障。 展开更多
关键词 无序场景 6D位姿检测 点对特征 位姿估计精度 识别率
在线阅读 下载PDF
上一页 1 2 60 下一页 到第
使用帮助 返回顶部