期刊文献+
共找到17篇文章
< 1 >
每页显示 20 50 100
Research Progress on Multi-Modal Fusion Object Detection Algorithms for Autonomous Driving:A Review
1
作者 Peicheng Shi Li Yang +2 位作者 Xinlong Dong Heng Qi Aixi Yang 《Computers, Materials & Continua》 2025年第6期3877-3917,共41页
As the number and complexity of sensors in autonomous vehicles continue to rise,multimodal fusionbased object detection algorithms are increasingly being used to detect 3D environmental information,significantly advan... As the number and complexity of sensors in autonomous vehicles continue to rise,multimodal fusionbased object detection algorithms are increasingly being used to detect 3D environmental information,significantly advancing the development of perception technology in autonomous driving.To further promote the development of fusion algorithms and improve detection performance,this paper discusses the advantages and recent advancements of multimodal fusion-based object detection algorithms.Starting fromsingle-modal sensor detection,the paper provides a detailed overview of typical sensors used in autonomous driving and introduces object detection methods based on images and point clouds.For image-based detection methods,they are categorized into monocular detection and binocular detection based on different input types.For point cloud-based detection methods,they are classified into projection-based,voxel-based,point cluster-based,pillar-based,and graph structure-based approaches based on the technical pathways for processing point cloud features.Additionally,multimodal fusion algorithms are divided into Camera-LiDAR fusion,Camera-Radar fusion,Camera-LiDAR-Radar fusion,and other sensor fusion methods based on the types of sensors involved.Furthermore,the paper identifies five key future research directions in this field,aiming to provide insights for researchers engaged in multimodal fusion-based object detection algorithms and to encourage broader attention to the research and application of multimodal fusion-based object detection. 展开更多
关键词 multi-modal fusion 3d object detection deep learning autonomous driving
在线阅读 下载PDF
Multi-Modal Multi-View 3D Hand Pose Estimation
2
作者 WANG Hao WANG Ping +2 位作者 YU Haoran DING Dong XIANG Weiming 《Journal of Donghua University(English Edition)》 2025年第6期673-682,共10页
With the rapid progress of the artificial intelligence(AI)technology and mobile internet,3D hand pose estimation has become critical to various intelligent application areas,e.g.,human-computer interaction.To avoid th... With the rapid progress of the artificial intelligence(AI)technology and mobile internet,3D hand pose estimation has become critical to various intelligent application areas,e.g.,human-computer interaction.To avoid the low accuracy of single-modal estimation and the high complexity of traditional multi-modal 3D estimation,this paper proposes a novel multi-modal multi-view(MMV)3D hand pose estimation system,which introduces a registration before translation(RT)-translation before registration(TR)jointed conditional generative adversarial network(cGAN)to train a multi-modal registration network,and then employs the multi-modal feature fusion to achieve high-quality estimation,with low hardware and software costs both in data acquisition and processing.Experimental results demonstrate that the MMV system is effective and feasible in various scenarios.It is promising for the MMV system to be used in broad intelligent application areas. 展开更多
关键词 3d hand pose estimation registration network multi-modal MULTI-VIEW conditional generative adversarial network(cGAN)
在线阅读 下载PDF
Adaptive Fusion Neural Networks for Sparse-Angle X-Ray 3D Reconstruction
3
作者 Shaoyong Hong Bo Yang +4 位作者 Yan Chen Hao Quan Shan Liu Minyi Tang Jiawei Tian 《Computer Modeling in Engineering & Sciences》 2025年第7期1091-1112,共22页
3D medical image reconstruction has significantly enhanced diagnostic accuracy,yet the reliance on densely sampled projection data remains a major limitation in clinical practice.Sparse-angle X-ray imaging,though safe... 3D medical image reconstruction has significantly enhanced diagnostic accuracy,yet the reliance on densely sampled projection data remains a major limitation in clinical practice.Sparse-angle X-ray imaging,though safer and faster,poses challenges for accurate volumetric reconstruction due to limited spatial information.This study proposes a 3D reconstruction neural network based on adaptive weight fusion(AdapFusionNet)to achieve high-quality 3D medical image reconstruction from sparse-angle X-ray images.To address the issue of spatial inconsistency in multi-angle image reconstruction,an innovative adaptive fusion module was designed to score initial reconstruction results during the inference stage and perform weighted fusion,thereby improving the final reconstruction quality.The reconstruction network is built on an autoencoder(AE)framework and uses orthogonal-angle X-ray images(frontal and lateral projections)as inputs.The encoder extracts 2D features,which the decoder maps into 3D space.This study utilizes a lung CT dataset to obtain complete three-dimensional volumetric data,from which digitally reconstructed radiographs(DRR)are generated at various angles to simulate X-ray images.Since real-world clinical X-ray images rarely come with perfectly corresponding 3D“ground truth,”using CT scans as the three-dimensional reference effectively supports the training and evaluation of deep networks for sparse-angle X-ray 3D reconstruction.Experiments conducted on the LIDC-IDRI dataset with simulated X-ray images(DRR images)as training data demonstrate the superior performance of AdapFusionNet compared to other fusion methods.Quantitative results show that AdapFusionNet achieves SSIM,PSNR,and MAE values of 0.332,13.404,and 0.163,respectively,outperforming other methods(SingleViewNet:0.289,12.363,0.182;AvgFusionNet:0.306,13.384,0.159).Qualitative analysis further confirms that AdapFusionNet significantly enhances the reconstruction of lung and chest contours while effectively reducing noise during the reconstruction process.The findings demonstrate that AdapFusionNet offers significant advantages in 3D reconstruction of sparse-angle X-ray images. 展开更多
关键词 3d reconstruction adaptive fusion X-ray imaging medical imaging deep learning neural networks sparse angles autoencoder
暂未订购
Adaptive multi-modal feature fusion for far and hard object detection
4
作者 LI Yang GE Hongwei 《Journal of Measurement Science and Instrumentation》 CAS CSCD 2021年第2期232-241,共10页
In order to solve difficult detection of far and hard objects due to the sparseness and insufficient semantic information of LiDAR point cloud,a 3D object detection network with multi-modal data adaptive fusion is pro... In order to solve difficult detection of far and hard objects due to the sparseness and insufficient semantic information of LiDAR point cloud,a 3D object detection network with multi-modal data adaptive fusion is proposed,which makes use of multi-neighborhood information of voxel and image information.Firstly,design an improved ResNet that maintains the structure information of far and hard objects in low-resolution feature maps,which is more suitable for detection task.Meanwhile,semantema of each image feature map is enhanced by semantic information from all subsequent feature maps.Secondly,extract multi-neighborhood context information with different receptive field sizes to make up for the defect of sparseness of point cloud which improves the ability of voxel features to represent the spatial structure and semantic information of objects.Finally,propose a multi-modal feature adaptive fusion strategy which uses learnable weights to express the contribution of different modal features to the detection task,and voxel attention further enhances the fused feature expression of effective target objects.The experimental results on the KITTI benchmark show that this method outperforms VoxelNet with remarkable margins,i.e.increasing the AP by 8.78%and 5.49%on medium and hard difficulty levels.Meanwhile,our method achieves greater detection performance compared with many mainstream multi-modal methods,i.e.outperforming the AP by 1%compared with that of MVX-Net on medium and hard difficulty levels. 展开更多
关键词 3d object detection adaptive fusion multi-modal data fusion attention mechanism multi-neighborhood features
在线阅读 下载PDF
MFF-Net: Multimodal Feature Fusion Network for 3D Object Detection
5
作者 Peicheng Shi Zhiqiang Liu +1 位作者 Heng Qi Aixi Yang 《Computers, Materials & Continua》 SCIE EI 2023年第6期5615-5637,共23页
In complex traffic environment scenarios,it is very important for autonomous vehicles to accurately perceive the dynamic information of other vehicles around the vehicle in advance.The accuracy of 3D object detection ... In complex traffic environment scenarios,it is very important for autonomous vehicles to accurately perceive the dynamic information of other vehicles around the vehicle in advance.The accuracy of 3D object detection will be affected by problems such as illumination changes,object occlusion,and object detection distance.To this purpose,we face these challenges by proposing a multimodal feature fusion network for 3D object detection(MFF-Net).In this research,this paper first uses the spatial transformation projection algorithm to map the image features into the feature space,so that the image features are in the same spatial dimension when fused with the point cloud features.Then,feature channel weighting is performed using an adaptive expression augmentation fusion network to enhance important network features,suppress useless features,and increase the directionality of the network to features.Finally,this paper increases the probability of false detection and missed detection in the non-maximum suppression algo-rithm by increasing the one-dimensional threshold.So far,this paper has constructed a complete 3D target detection network based on multimodal feature fusion.The experimental results show that the proposed achieves an average accuracy of 82.60%on the Karlsruhe Institute of Technology and Toyota Technological Institute(KITTI)dataset,outperforming previous state-of-the-art multimodal fusion networks.In Easy,Moderate,and hard evaluation indicators,the accuracy rate of this paper reaches 90.96%,81.46%,and 75.39%.This shows that the MFF-Net network has good performance in 3D object detection. 展开更多
关键词 3d object detection multimodal fusion neural network autonomous driving attention mechanism
在线阅读 下载PDF
基于3D多模态卷积网络与跨模态特征集成的阿尔茨海默症分类
6
作者 朱厚元 郑乐乐 +5 位作者 商浩 臧雪峰 吴少琪 周广超 孙建德 乔建苹 《数据采集与处理》 北大核心 2025年第4期912-921,共10页
多模态神经影像技术为阿尔茨海默症(Alzheimer’s disease,AD)的早期精准诊断提供了重要的技术支撑。然而,由于不同模态神经影像数据在成像原理和特征表达上存在固有异质性,模态间的信息融合面临挑战。针对这一问题,提出了一种基于3D Re... 多模态神经影像技术为阿尔茨海默症(Alzheimer’s disease,AD)的早期精准诊断提供了重要的技术支撑。然而,由于不同模态神经影像数据在成像原理和特征表达上存在固有异质性,模态间的信息融合面临挑战。针对这一问题,提出了一种基于3D ResNet架构的多模态融合网络(Multi-modal fusion network,MFN),用于AD的早期辅助诊断。该方法首先采用3D ResNet网络分别提取T1加权和T2加权磁共振图像的特征表示,然后设计了一种创新的跨模态特征集成模块(Cross-modal feature integration module,CFIM)。相较于多模态数据直接串联,导致维度增长无法自适应调整模态权重的问题,CFIM采用分阶段融合策略,包括全局信息融合模块、局部特征学习模块和关键因素模块。最后,融合后的多模态特征通过全连接神经网络进行分类决策。相比早期拼接的固定权重叠加和后期融合的浅层聚合,该策略能更精准地筛选出疾病诊断相关的特征。通过在阿尔茨海默症神经影像倡议(ADNI)数据库上的实验结果表明,与现有方法相比,本文方法在AD分类任务中具有较高的准确率和显著优势,且消融实验进一步验证了各模块的有效性,为多模态神经影像分析提供了新的技术思路。 展开更多
关键词 阿尔茨海默症 3d多模态融合网络 核磁共振图像 跨模态特征集成模块 深度学习
在线阅读 下载PDF
Three-dimensional Fusion of Spaceborne and Ground Radar Reflectivity Data Using a Neural Network–Based Approach 被引量:5
7
作者 Leilei KOU Zhuihui WANG Fen XU 《Advances in Atmospheric Sciences》 SCIE CAS CSCD 2018年第3期346-359,共14页
The spaceborne precipitation radar onboard the Tropical Rainfall Measuring Mission satellite (TRMM PR) can provide good measurement of the vertical structure of reflectivity, while ground radar (GR) has a relative... The spaceborne precipitation radar onboard the Tropical Rainfall Measuring Mission satellite (TRMM PR) can provide good measurement of the vertical structure of reflectivity, while ground radar (GR) has a relatively high horizontal resolution and greater sensitivity. Fusion of TRMM PR and GR reflectivity data may maximize the advantages from both instruments. In this paper, TRMM PR and GR reflectivity data are fused using a neural network (NN)-based approach. The main steps included are: quality control of TRMM PR and GR reflectivity data; spatiotemporal matchup; GR calibration bias correction; conversion of TRMM PR data from Ku to S band; fusion of TRMM PR and GR reflectivity data with an NN method: interpolation of reflectivity data that are below PR's sensitivity; blind areas compensation with a distance weighting-based merging approach; combination of three types of data: data with the NN method, data below PR's sensitivity and data within compensated blind areas. During the NN fusion step, the TRMM PR data are taken as targets of the training NNs, and gridded GR data after horizontal downsampling at different heights are used as the input. The trained NNs are then used to obtain 3D high-resolution reflectivity from the original GR gridded data. After 3D fusion of the TRMM PR and GR reflectivity data, a more complete and finer-scale 3D radar reflectivity dataset incorporating characteristics from both the TRMM PR and GR observations can be obtained. The fused reflectivity data are evaluated based on a convective precipitation event through comparison with the high resolution TRMM PR and GR data with an interpolation algorithm. 展开更多
关键词 TRMM PR ground radar 3d fusion neural network
在线阅读 下载PDF
3D Vehicle Detection Algorithm Based onMultimodal Decision-Level Fusion
8
作者 Peicheng Shi Heng Qi +1 位作者 Zhiqiang Liu Aixi Yang 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第6期2007-2023,共17页
3D vehicle detection based on LiDAR-camera fusion is becoming an emerging research topic in autonomous driving.The algorithm based on the Camera-LiDAR object candidate fusion method(CLOCs)is currently considered to be... 3D vehicle detection based on LiDAR-camera fusion is becoming an emerging research topic in autonomous driving.The algorithm based on the Camera-LiDAR object candidate fusion method(CLOCs)is currently considered to be a more effective decision-level fusion algorithm,but it does not fully utilize the extracted features of 3D and 2D.Therefore,we proposed a 3D vehicle detection algorithm based onmultimodal decision-level fusion.First,project the anchor point of the 3D detection bounding box into the 2D image,calculate the distance between 2D and 3D anchor points,and use this distance as a new fusion feature to enhance the feature redundancy of the network.Subsequently,add an attention module:squeeze-and-excitation networks,weight each feature channel to enhance the important features of the network,and suppress useless features.The experimental results show that the mean average precision of the algorithm in the KITTI dataset is 82.96%,which outperforms previous state-ofthe-art multimodal fusion-based methods,and the average accuracy in the Easy,Moderate and Hard evaluation indicators reaches 88.96%,82.60%,and 77.31%,respectively,which are higher compared to the original CLOCs model by 1.02%,2.29%,and 0.41%,respectively.Compared with the original CLOCs algorithm,our algorithm has higher accuracy and better performance in 3D vehicle detection. 展开更多
关键词 3d vehicle detection multimodal fusion CLOCs network structure optimization attention module
在线阅读 下载PDF
Concrete Defects Inspection and 3D Mapping Using City Flyer Quadrotor Robot 被引量:8
9
作者 Liang Yang Bing Li +3 位作者 Wei Li Howard Brand Biao Jiang Jizhong Xiao 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2020年第4期991-1002,共12页
The concrete aging problem has gained more attention in recent years as more bridges and tunnels in the United States lack proper maintenance. Though the Federal Highway Administration requires these public concrete s... The concrete aging problem has gained more attention in recent years as more bridges and tunnels in the United States lack proper maintenance. Though the Federal Highway Administration requires these public concrete structures to be inspected regularly, on-site manual inspection by human operators is time-consuming and labor-intensive. Conventional inspection approaches for concrete inspection, using RGB imagebased thresholding methods, are not able to determine metric information as well as accurate location information for assessed defects for conditions. To address this challenge, we propose a deep neural network(DNN) based concrete inspection system using a quadrotor flying robot(referred to as City Flyer) mounted with an RGB-D camera. The inspection system introduces several novel modules. Firstly, a visual-inertial fusion approach is introduced to perform camera and robot positioning and structure 3 D metric reconstruction. The reconstructed map is used to retrieve the location and metric information of the defects.Secondly, we introduce a DNN model, namely Ada Net, to detect concrete spalling and cracking, with the capability of maintaining robustness under various distances between the camera and concrete surface. In order to train the model, we craft a new dataset, i.e., the concrete structure spalling and cracking(CSSC)dataset, which is released publicly to the research community.Finally, we introduce a 3 D semantic mapping method using the annotated framework to reconstruct the concrete structure for visualization. We performed comparative studies and demonstrated that our Ada Net can achieve 8.41% higher detection accuracy than Res Nets and VGGs. Moreover, we conducted five field tests, of which three are manual hand-held tests and two are drone-based field tests. These results indicate that our system is capable of performing metric field inspection,and can serve as an effective tool for civil engineers. 展开更多
关键词 3d reconstruction concrete inspection deep neural network quadrotor flying robot visual-inertial fusion
在线阅读 下载PDF
AV-FDTI:Audio-visual fusion for drone threat identification 被引量:1
10
作者 Yizhuo Yang Shenghai Yuan +5 位作者 Jianfei Yang Thien Hoang Nguyen Muqing Cao Thien-Minh Nguyen Han Wang Lihua Xie 《Journal of Automation and Intelligence》 2024年第3期144-151,共8页
In response to the evolving challenges posed by small unmanned aerial vehicles(UAVs),which have the potential to transport harmful payloads or cause significant damage,we present AV-FDTI,an innovative Audio-Visual Fus... In response to the evolving challenges posed by small unmanned aerial vehicles(UAVs),which have the potential to transport harmful payloads or cause significant damage,we present AV-FDTI,an innovative Audio-Visual Fusion system designed for Drone Threat Identification.AV-FDTI leverages the fusion of audio and omnidirectional camera feature inputs,providing a comprehensive solution to enhance the precision and resilience of drone classification and 3D localization.Specifically,AV-FDTI employs a CRNN network to capture vital temporal dynamics within the audio domain and utilizes a pretrained ResNet50 model for image feature extraction.Furthermore,we adopt a visual information entropy and cross-attention-based mechanism to enhance the fusion of visual and audio data.Notably,our system is trained based on automated Leica tracking annotations,offering accurate ground truth data with millimeter-level accuracy.Comprehensive comparative evaluations demonstrate the superiority of our solution over the existing systems.In our commitment to advancing this field,we will release this work as open-source code and wearable AV-FDTI design,contributing valuable resources to the research community. 展开更多
关键词 Audio-visual fusion Anti-UAV multi-modal fusion Classification 3d localization Self-attention
在线阅读 下载PDF
基于结构总体最小二乘的多传感器定位算法 被引量:5
11
作者 雷雨 冯新喜 +1 位作者 潘海峰 朱灿彬 《系统仿真学报》 CAS CSCD 北大核心 2013年第4期668-673,共6页
多部二维传感器组网定位时,为充分利用各传感器量测并减小地球曲率对观测的影响,建立了大场景下考虑地球曲率的三维空间观测定位模型,提出了此场景下融合测距信息的方位结构总体最小二乘定位算法。该算法首先根据各传感器的方位角量测... 多部二维传感器组网定位时,为充分利用各传感器量测并减小地球曲率对观测的影响,建立了大场景下考虑地球曲率的三维空间观测定位模型,提出了此场景下融合测距信息的方位结构总体最小二乘定位算法。该算法首先根据各传感器的方位角量测信息使用结构最小二乘法初步定位,然后利用观测模型融合各传感器的距离量测提高定位精度并最终得到目标位置估计。仿真实验证明了该方法在多部2D传感器组网情况下对三维空间内目标定位的实际性和有效性,适用于工程利用。 展开更多
关键词 多传感器组网 目标三维定位 融合 结构总体最小二乘
原文传递
基于深度学习的多模态融合三维人脸识别 被引量:3
12
作者 胡乃平 贾浩杰 《计算机系统应用》 2022年第8期152-159,共8页
二维人脸识别受光照、遮挡和姿态的影响较大.为了克服二维人脸识别的缺点,本文提出了一种基于深度学习的多模态融合三维人脸识别算法.该方法首先使用卷积自编码器将彩色图像和深度图进行融合,将融合后的图像作为网络的输入进行预训练,... 二维人脸识别受光照、遮挡和姿态的影响较大.为了克服二维人脸识别的缺点,本文提出了一种基于深度学习的多模态融合三维人脸识别算法.该方法首先使用卷积自编码器将彩色图像和深度图进行融合,将融合后的图像作为网络的输入进行预训练,并且设计了一种新的损失函数cluster loss,结合Softmax损失,预训练了一个精度非常高的模型.之后使用迁移学习将预训练的模型进行微调,得到了一个轻量级神经网络模型.将原始数据集进行一系列处理,使用处理之后的数据集作为测试集,测试的识别准确率为96.37%.实验证明,该方法弥补了二维人脸识别的一些缺点,受光照和遮挡的影响非常小,并且相对于使用高精度三维人脸图像的三维人脸识别,本文提出的算法速度快,并且鲁棒性高. 展开更多
关键词 三维人脸识别 多模态融合 深度学习 卷积神经网络 损失函数 迁移学习
在线阅读 下载PDF
基于双注意力融合和残差优化的点云语义分割 被引量:5
13
作者 于魁梧 宋玉琴 徐轩 《国外电子测量技术》 北大核心 2022年第8期12-18,共7页
在直接处理原始点云的语义分割算法中,存在局部特征提取模块细粒度特征描述能力不足和逐步下采样使得网络深度受到限制的问题。提出一种双注意力特征增强模块,其中点注意模块学习邻域点之间的相互关联性,捕捉上下文信息,提高局部特征的... 在直接处理原始点云的语义分割算法中,存在局部特征提取模块细粒度特征描述能力不足和逐步下采样使得网络深度受到限制的问题。提出一种双注意力特征增强模块,其中点注意模块学习邻域点之间的相互关联性,捕捉上下文信息,提高局部特征的分辨能力,通道注意模块聚合通道结构信息,减少噪声影响。构建编码器多尺度残差结构增加网络深度,避免下采样造成的关键点信息丢失。方法在S3DIS数据集上准确率为88.9%,平均交并比为70.7%;在Semantic3D数据集上准确率为95.8%,平均交并比为78.5%。实验结果表明,所提出的算法对物体边缘特征具有良好的区分性,具有更好的泛化性能。 展开更多
关键词 三维点云 语义分割 自注意力机制 残差网络 特征融合
原文传递
基于卷积神经网络的三维目标检测研究综述 被引量:21
14
作者 王亚东 田永林 +2 位作者 李国强 王坤峰 李大字 《模式识别与人工智能》 CSCD 北大核心 2021年第12期1103-1119,共17页
深度学习尤其卷积神经网络为精确目标检测提供可能,推动三维目标检测在自动驾驶、机器人等领域发挥重要作用.文中综述基于卷积神经网络的三维目标检测研究进展.首先总结三维目标检测的应用价值、基本流程及存在的挑战.再介绍卷积神经网... 深度学习尤其卷积神经网络为精确目标检测提供可能,推动三维目标检测在自动驾驶、机器人等领域发挥重要作用.文中综述基于卷积神经网络的三维目标检测研究进展.首先总结三维目标检测的应用价值、基本流程及存在的挑战.再介绍卷积神经网络基本原理、典型的二维目标检测网络结构、常用的开源数据集及点云表示形式等相关基础知识.然后介绍卷积神经网络在三维目标检测中的应用进展,根据不同数据模态及方法共性对方法进行梳理.最后对当前三维目标检测研究存在的问题进行论述,对未来的研究发展趋势进行展望. 展开更多
关键词 卷积神经网络 三维目标检测 点云 多模态融合
在线阅读 下载PDF
基于Kolmogorov-Arnold网络的多模态三维目标检测算法 被引量:1
15
作者 凌言武 饶俊民 +1 位作者 李燕 李范鸣 《激光与光电子学进展》 北大核心 2025年第16期212-220,共9页
针对传统多层感知机(MLP)模型中存在的可解释性不足、神经元数量增多导致训练成本增加,以及图像与点云数据融合效果欠佳等问题,提出了一种基于Kolmogorov‒Arnold网络(KAN)的多模态三维(3D)目标检测网络。该网络以KAN为主干,设计了一种... 针对传统多层感知机(MLP)模型中存在的可解释性不足、神经元数量增多导致训练成本增加,以及图像与点云数据融合效果欠佳等问题,提出了一种基于Kolmogorov‒Arnold网络(KAN)的多模态三维(3D)目标检测网络。该网络以KAN为主干,设计了一种结合融合层的体素特征编码器KANDyVFE,其中融合层采用自注意力机制动态融合图像和点云特征,还通过裁剪RGB图像和绘制生成彩色点云来增强点云特征表达。在KITTI数据集上的实验结果表明,与基线方法SECOND相比,所提网络在汽车类别的鸟瞰图检测中平均精度均值提升了3.78百分点,在3D目标检测中平均精度均值提升了3.75百分点。可视化结果显示,该网络在减少误报和漏检方面表现优异,验证了KAN在点云应用中的有效性。消融实验结果进一步证明了所提网络具有较好的检测性能。 展开更多
关键词 深度学习 Kolmogorov‒Arnold网络 多模态融合 点云 3d目标检测
原文传递
基于深度学习的三维人脸识别方法研究
16
作者 吴梦蝶 《陕西工业职业技术学院学报》 2017年第3期3-7,共5页
本文通过建立两个深度卷积网络模型,以二维人脸图片和人脸深度图作为输入,对两个DCNN进行训练以及识别测试,将两个DCNN提取的二维人脸图像及人脸深度图的高层抽象特征作为神经网络的输入,得到携带二维特征及人脸深度信息的融合特征... 本文通过建立两个深度卷积网络模型,以二维人脸图片和人脸深度图作为输入,对两个DCNN进行训练以及识别测试,将两个DCNN提取的二维人脸图像及人脸深度图的高层抽象特征作为神经网络的输入,得到携带二维特征及人脸深度信息的融合特征,将融合特征作为最终的人脸的高层描述,进而完成三维人脸识别任务。实验证明,与其他识别方法相比,该方法有较高的识别正确率。 展开更多
关键词 三维人脸识别 深层卷积网络 特征提取 特征融合
在线阅读 下载PDF
从二维视图识别三维目标的多网络融合方法 被引量:7
17
作者 贾财潮 戚飞虎 +1 位作者 于询 张季涛 《光学学报》 EI CAS CSCD 北大核心 2001年第2期177-180,共4页
提出了一种从二维视图识别三维目标的多网络融合方法 ,基于单个网络分类的置信度概念 ,有效地结合多个网络的输出结果作出最终分类判决。应用三个多层前向网络 (隐层神经元数、初始权值等取不同值 ) ,设计了基于分类确信度的多网络融合... 提出了一种从二维视图识别三维目标的多网络融合方法 ,基于单个网络分类的置信度概念 ,有效地结合多个网络的输出结果作出最终分类判决。应用三个多层前向网络 (隐层神经元数、初始权值等取不同值 ) ,设计了基于分类确信度的多网络融合结构。对四类车辆目标进行的识别实验表明 。 展开更多
关键词 多网络融合 三维目标识别 置信度 多层前向网络 二维视图 自动目标识别
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部