期刊文献+
共找到1,697篇文章
< 1 2 85 >
每页显示 20 50 100
Research Progress on Multi-Modal Fusion Object Detection Algorithms for Autonomous Driving:A Review
1
作者 Peicheng Shi Li Yang +2 位作者 Xinlong Dong Heng Qi Aixi Yang 《Computers, Materials & Continua》 2025年第6期3877-3917,共41页
As the number and complexity of sensors in autonomous vehicles continue to rise,multimodal fusionbased object detection algorithms are increasingly being used to detect 3D environmental information,significantly advan... As the number and complexity of sensors in autonomous vehicles continue to rise,multimodal fusionbased object detection algorithms are increasingly being used to detect 3D environmental information,significantly advancing the development of perception technology in autonomous driving.To further promote the development of fusion algorithms and improve detection performance,this paper discusses the advantages and recent advancements of multimodal fusion-based object detection algorithms.Starting fromsingle-modal sensor detection,the paper provides a detailed overview of typical sensors used in autonomous driving and introduces object detection methods based on images and point clouds.For image-based detection methods,they are categorized into monocular detection and binocular detection based on different input types.For point cloud-based detection methods,they are classified into projection-based,voxel-based,point cluster-based,pillar-based,and graph structure-based approaches based on the technical pathways for processing point cloud features.Additionally,multimodal fusion algorithms are divided into Camera-LiDAR fusion,Camera-Radar fusion,Camera-LiDAR-Radar fusion,and other sensor fusion methods based on the types of sensors involved.Furthermore,the paper identifies five key future research directions in this field,aiming to provide insights for researchers engaged in multimodal fusion-based object detection algorithms and to encourage broader attention to the research and application of multimodal fusion-based object detection. 展开更多
关键词 Multi-modal fusion 3D object detection deep learning autonomous driving
在线阅读 下载PDF
AMTS:Adaptive Multi-Objective Task Scheduling Strategy in Cloud Computing
2
作者 HE Hua XU Guangquan +1 位作者 PANG Shanchen ZHAO Zenghua 《China Communications》 SCIE CSCD 2016年第4期162-171,共10页
Task scheduling in cloud computing environments is a multi-objective optimization problem, which is NP hard. It is also a challenging problem to find an appropriate trade-off among resource utilization, energy consump... Task scheduling in cloud computing environments is a multi-objective optimization problem, which is NP hard. It is also a challenging problem to find an appropriate trade-off among resource utilization, energy consumption and Quality of Service(QoS) requirements under the changing environment and diverse tasks. Considering both processing time and transmission time, a PSO-based Adaptive Multi-objective Task Scheduling(AMTS) Strategy is proposed in this paper. First, the task scheduling problem is formulated. Then, a task scheduling policy is advanced to get the optimal resource utilization, task completion time, average cost and average energy consumption. In order to maintain the particle diversity, the adaptive acceleration coefficient is adopted. Experimental results show that the improved PSO algorithm can obtain quasi-optimal solutions for the cloud task scheduling problem. 展开更多
关键词 quality of service cloud computing multi-objective task scheduling particle swarm optimization(PSO) small position value(SPV)
在线阅读 下载PDF
An object oriented multi-robotic graphic simulation environment for programming the welding tasks
3
作者 崔泽 赵杰 +1 位作者 崔岩 蔡鹤皋 《China Welding》 EI CAS 2002年第1期77-83,共7页
An object oriented multi robotic graphic simulation environment is described in this paper. Object oriented programming is used to model the physical objects of the robotic workcell in the form of software objects ... An object oriented multi robotic graphic simulation environment is described in this paper. Object oriented programming is used to model the physical objects of the robotic workcell in the form of software objects or classes. The virtual objects are defined to provide the user with a user friendly interface including realistic graphic simulation and clarify the software architecture. The programming method of associating the task object with active object effectively increases the software reusability, maintainability and modifiability. Task level programming is also demonstrated through a multi robot welding task that allows the user to concentrate on the most important aspects of the tasks. The multi thread programming technique is used to simulate the interaction of multiple tasks. Finally, a virtual test is carried out in the graphic simulation environment to observe design and program errors and fix them before downloading the software to the real workcell. 展开更多
关键词 object oriented programming task level programming welding task 3D graphic simulation expert system
在线阅读 下载PDF
The Practice Research of Task Driving Method in Basic Teaching of Information Technology
4
作者 Liqiao Geng 《International Journal of Technology Management》 2013年第8期51-53,共3页
Purpose: It is used for judging the advantages and disadvantages of information technology foundation course teaching in health vocational colleges. Method: In teaching, it takes the two classes of 2012 grade nursin... Purpose: It is used for judging the advantages and disadvantages of information technology foundation course teaching in health vocational colleges. Method: In teaching, it takes the two classes of 2012 grade nursing major as the experiment object. The comparison class adopts traditonal and speaking-practice combination teaching method and the experiment class adopts task-driving teaching method. When the semester finishes, it conducts testing andd questionnaire survey, collecting the relevant data, analyzing the changes of students in the aspects of performance, learning interest and attitude, autonomous learning consciousness and ability after experiment class adopting new teaching methods. Result: The exam performance of experiment class is obviously higher than the comparison class, and the experiment class has an obvious improvement in the aspects of learning interest, autonomous learning consciousness and ability, and the difference has statistical significance. Conclusion: The task driving teaching method is suitable for the status of information foundation teaching in health vocational colleges, which improves students' performance significantly and is good for students' learning interest and enthusiasm, obtaining good classroom effect. Also, it makes students' autonomous learning consciousness and ability improve greatly. 展开更多
关键词 task driving teaching method teaching mode teaching design information tecbalology foundation
在线阅读 下载PDF
A Dangerous Driving Behaviors Detection Method for Car Driver Based on Improved YOLOv7 Model
5
作者 Md Tariqul Islam Akash Joarder Md Niaz Ahmed 《Journal of Computer and Communications》 2024年第12期289-317,共29页
The basic theory of YOLO series object detection algorithms is discussed, the dangerous driving behavior dataset is collected and produced, and then the YOLOv7 network is introduced in detail, the deep separable convo... The basic theory of YOLO series object detection algorithms is discussed, the dangerous driving behavior dataset is collected and produced, and then the YOLOv7 network is introduced in detail, the deep separable convolution and CA attention mechanism are introduced, the YOLOv7 bounding box loss function and clustering algorithm are optimized, and the DB-YOLOv7 network structure is constructed. In the first stage of the experiment, the PASCAL VOC public dataset was utilized for pre-training. A comparative analysis was conducted to assess the recognition accuracy and inference time before and after the proposed improvements. The experimental results demonstrated an increase of 1.4% in the average recognition accuracy, alongside a reduction in the inference time by 4 ms. Subsequently, a model for the recognition of dangerous driving behaviors was trained using a specialized dangerous driving behavior dataset. A series of experiments were performed to evaluate the efficacy of the DB-YOLOv7 algorithm in this context. The findings indicate a significant enhancement in detection performance, with a 4% improvement in accuracy compared to the baseline network. Furthermore, the model’s inference time was reduced by 20%, from 25 ms to 20 ms. These results substantiate the effectiveness of the DB-YOLOv7 recognition algorithm for detecting dangerous driving behaviors, providing comprehensive validation of its practical applicability. 展开更多
关键词 Dangerous driving Behaviors object Detection YOLOv7 Separable Convolution CA Attention Mechanism
在线阅读 下载PDF
车载视觉图像运动信息提取方法
6
作者 刘平 王硕翰 +1 位作者 张逸康 周子龙 《重庆交通大学学报(自然科学版)》 北大核心 2026年第1期106-112,共7页
运动目标检测是计算机视觉领域重要的研究内容,运动信息定义为运动目标对应图像中的像素点位置,然而在自动驾驶场景下,由于车载相机自身运动引起图像背景变化使得运动信息难以准确提取。提出了基于稀疏光流估计与深度学习的运动信息提... 运动目标检测是计算机视觉领域重要的研究内容,运动信息定义为运动目标对应图像中的像素点位置,然而在自动驾驶场景下,由于车载相机自身运动引起图像背景变化使得运动信息难以准确提取。提出了基于稀疏光流估计与深度学习的运动信息提取模型来克服背景变化带来的影响,检测环境中的运动信息。光流提取模块通过Shi-Tomasi角点检测及Lucas-Kanade(LK)稀疏光流估计初步得到全局稀疏光流;运动信息判别模块通过将图像深度信息和稀疏光流输入Transformer神经网络,推理出抑制信号,抑制背景运动带来的影响,从而提取出准确的运动信息。结果表明:该方法可以提取出图像中的运动信息,具有92%准确率,可用于自动驾驶车辆检测运动目标。 展开更多
关键词 车辆工程 运动目标检测 稀疏光流 深度学习 自动驾驶
在线阅读 下载PDF
EvRDETRG:融合事件与RGB图像的轻量级端到端目标检测
7
作者 周秉泉 蒋杰 +1 位作者 陈江民 詹礼新 《计算机科学》 北大核心 2026年第1期153-162,共10页
基于神经脉冲信号的事件摄像机可以提供光线变化的信息,以弥补传统RGB相机目标检测在恶劣环境性能下降的缺陷。然而,传统融合事件相机的现有方法存在模型参数大和非端到端训练方法的问题,限制了模态融合的有效性。因此,提出了一种事件与... 基于神经脉冲信号的事件摄像机可以提供光线变化的信息,以弥补传统RGB相机目标检测在恶劣环境性能下降的缺陷。然而,传统融合事件相机的现有方法存在模型参数大和非端到端训练方法的问题,限制了模态融合的有效性。因此,提出了一种事件与RGB信息融合的轻量级端到端对象检测框架,基于两种模态各级尺度特征进行不同细粒度的信息融合,同时基于重参数化卷积实现轻量级的融合模块并进行端到端的训练,从而提升模型对于两种模态互补信息的提取能力,以克服自动驾驶中具有挑战性的不利环境。所提出的模型在大规模数据集PKU-SOD上进行了测试,该数据集提供了低光、高速运动模糊与正常光照环境下车辆行驶的视觉数据。实验结果表明,与此前的多模态目标检测框架相比,所提方法在模型参数量上大幅下降,并提升了目标检测的准确率与推理速度,表现出优于现有方法的性能。 展开更多
关键词 目标检测 仿生相机 自动驾驶 深度学习 端到端目标检测 事件相目标机检测 轻量化目标检测
在线阅读 下载PDF
面向自动驾驶的多尺度目标三维检测算法
8
作者 刘嫚 陈晓楠 《现代电子技术》 北大核心 2026年第1期141-147,共7页
在自动驾驶场景中,使用单目相机进行三维目标检测是一项具有挑战性的任务,尤其是在复杂道路环境下,目标的尺度差异和遮挡现象容易导致误检或漏检。针对这一问题,文中提出一种基于特征融合与增强的单目三维目标检测算法。首先,构建Faster... 在自动驾驶场景中,使用单目相机进行三维目标检测是一项具有挑战性的任务,尤其是在复杂道路环境下,目标的尺度差异和遮挡现象容易导致误检或漏检。针对这一问题,文中提出一种基于特征融合与增强的单目三维目标检测算法。首先,构建FasterNet+作为骨干网络,通过优化嵌入层和块结构,增强细节信息的提取,提升网络的整体性能;其次,设计多维特征自适应融合模块,自适应地选择并融合高维与低维特征,解决高维特征丢失小目标信息和低维特征缺乏上下文信息的问题;最后,引入特征增强注意力模块,突出特定目标区域,进一步提升网络在目标定位和分类方面的精度。在nuScenes数据集上的实验结果表明,其mAP和NDS比基准方法分别提高0.038和0.035,可以有效检测出不同类型和尺度的目标,并展现出更强的鲁棒性,为自动驾驶场景中的多维目标检测提供了一种新思路。 展开更多
关键词 自动驾驶 单目相机 三维目标检测 多尺度感知 特征融合 注意力机制 机器视觉
在线阅读 下载PDF
Resource allocation optimization of equipment development task based on MOPSO algorithm 被引量:9
9
作者 ZHANG Xilin TAN Yuejin and YANG Zhiwei 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2019年第6期1132-1143,共12页
Resource allocation for an equipment development task is a complex process owing to the inherent characteristics,such as large amounts of input resources,numerous sub-tasks,complex network structures,and high degrees ... Resource allocation for an equipment development task is a complex process owing to the inherent characteristics,such as large amounts of input resources,numerous sub-tasks,complex network structures,and high degrees of uncertainty.This paper presents an investigation into the influence of resource allocation on the duration and cost of sub-tasks.Mathematical models are constructed for the relationships of the resource allocation quantity with the duration and cost of the sub-tasks.By considering the uncertainties,such as fluctuations in the sub-task duration and cost,rework iterations,and random overlaps,the tasks are simulated for various resource allocation schemes.The shortest duration and the minimum cost of the development task are first formulated as the objective function.Based on a multi-objective particle swarm optimization(MOPSO)algorithm,a multi-objective evolutionary algorithm is constructed to optimize the resource allocation scheme for the development task.Finally,an uninhabited aerial vehicle(UAV)is considered as an example of a development task to test the algorithm,and the optimization results of this method are compared with those based on non-dominated sorting genetic algorithm-II(NSGA-II),non-dominated sorting differential evolution(NSDE)and strength pareto evolutionary algorithm-II(SPEA-II).The proposed method is verified for its scientific approach and effectiveness.The case study shows that the optimization of the resource allocation can greatly aid in shortening the duration of the development task and reducing its cost effectively. 展开更多
关键词 resource allocation equipment development task multi-objective particle swarm optimization(MOPSO) develop ment task simulation.
在线阅读 下载PDF
AVL DRIVE在整车驾驶性能开发的运用 被引量:8
10
作者 陈铭 黄炯 魏喜乐 《汽车实用技术》 2021年第12期117-119,共3页
文章利用AVL DRIVE系统结合测试的加速度、电流、CAN网络信号对车辆驾驶性进行客观化测试与评估,使驾驶性问题可视化,通过分析客观数据得出车辆差异点及问题点,制定不同工况下的差异化目标。使用表明AVL DRIVE在整车驾驶性能开发中有良... 文章利用AVL DRIVE系统结合测试的加速度、电流、CAN网络信号对车辆驾驶性进行客观化测试与评估,使驾驶性问题可视化,通过分析客观数据得出车辆差异点及问题点,制定不同工况下的差异化目标。使用表明AVL DRIVE在整车驾驶性能开发中有良好的助力效果。 展开更多
关键词 AVL DRIVE 驾驶性 客观评估 整车性能
在线阅读 下载PDF
Research on Object Model-Based Architecture for Service Robot System
11
作者 邵鹏鸣 李成刚 吴翰声 《Journal of Southwest Jiaotong University(English Edition)》 2002年第1期21-32,共12页
An object model based software architecture for service robot system is presented, which addresses both software engineering issues such as reuse, extensibility, and management of complexity as well as system enginee... An object model based software architecture for service robot system is presented, which addresses both software engineering issues such as reuse, extensibility, and management of complexity as well as system engineering issues like scalability, reactivity, and robustness. A novel approach to the service robot system architecture is discussed. Cognitive psychology is considered in designing the software system, i.e., a humans way of vision and planning is applied. The planner can incorporate the users request into its task selection mechanism and generate plans biased toward picking the most reliable task execution in a given situation, and the planner can alter task selection based on changes that occur in dynamic and uncertain environments. 展开更多
关键词 ROBOTS software engineering ARCHITECTURE object model task plan
在线阅读 下载PDF
基于驾驶性主观评价和AVL-Drive客观测试的整车驾驶性主客观评价方法 被引量:5
12
作者 林彦名 蒋贤芳 蒋华梁 《时代汽车》 2022年第21期13-15,共3页
整车驾驶性评价是主机厂新研发车型过程中不可缺少的一环,是让工程师发现驾驶性问题的主要途径。将主观评价结果和AVL-Drive的驾驶性客观测试结果相互对比,能够精确识别出驾驶性问题,为工程师提供优化方向及依据,从根本上提升了工程师... 整车驾驶性评价是主机厂新研发车型过程中不可缺少的一环,是让工程师发现驾驶性问题的主要途径。将主观评价结果和AVL-Drive的驾驶性客观测试结果相互对比,能够精确识别出驾驶性问题,为工程师提供优化方向及依据,从根本上提升了工程师评审的准确度和置信度。本文详细介绍了基于驾驶性主观评价和AVL-Drive客观测试的整车驾驶性主客观评价方法。 展开更多
关键词 驾驶性 主观评价 客观测试
在线阅读 下载PDF
MFF-Net: Multimodal Feature Fusion Network for 3D Object Detection
13
作者 Peicheng Shi Zhiqiang Liu +1 位作者 Heng Qi Aixi Yang 《Computers, Materials & Continua》 SCIE EI 2023年第6期5615-5637,共23页
In complex traffic environment scenarios,it is very important for autonomous vehicles to accurately perceive the dynamic information of other vehicles around the vehicle in advance.The accuracy of 3D object detection ... In complex traffic environment scenarios,it is very important for autonomous vehicles to accurately perceive the dynamic information of other vehicles around the vehicle in advance.The accuracy of 3D object detection will be affected by problems such as illumination changes,object occlusion,and object detection distance.To this purpose,we face these challenges by proposing a multimodal feature fusion network for 3D object detection(MFF-Net).In this research,this paper first uses the spatial transformation projection algorithm to map the image features into the feature space,so that the image features are in the same spatial dimension when fused with the point cloud features.Then,feature channel weighting is performed using an adaptive expression augmentation fusion network to enhance important network features,suppress useless features,and increase the directionality of the network to features.Finally,this paper increases the probability of false detection and missed detection in the non-maximum suppression algo-rithm by increasing the one-dimensional threshold.So far,this paper has constructed a complete 3D target detection network based on multimodal feature fusion.The experimental results show that the proposed achieves an average accuracy of 82.60%on the Karlsruhe Institute of Technology and Toyota Technological Institute(KITTI)dataset,outperforming previous state-of-the-art multimodal fusion networks.In Easy,Moderate,and hard evaluation indicators,the accuracy rate of this paper reaches 90.96%,81.46%,and 75.39%.This shows that the MFF-Net network has good performance in 3D object detection. 展开更多
关键词 3D object detection multimodal fusion neural network autonomous driving attention mechanism
在线阅读 下载PDF
Adaptive Consistent Management to Prevent System Collapse on Shared Object Manipulation in Mixed Reality
14
作者 Jun Lee Hyun Kwon 《Computers, Materials & Continua》 SCIE EI 2023年第4期2025-2042,共18页
A concurrency control mechanism for collaborative work is akey element in a mixed reality environment. However, conventional lockingmechanisms restrict potential tasks or the support of non-owners, thusincreasing the ... A concurrency control mechanism for collaborative work is akey element in a mixed reality environment. However, conventional lockingmechanisms restrict potential tasks or the support of non-owners, thusincreasing the working time because of waiting to avoid conflicts. Herein, wepropose an adaptive concurrency control approach that can reduce conflictsand work time. We classify shared object manipulation in mixed reality intodetailed goals and tasks. Then, we model the relationships among goal,task, and ownership. As the collaborative work progresses, the proposedsystem adapts the different concurrency control mechanisms of shared objectmanipulation according to the modeling of goal–task–ownership. With theproposed concurrency control scheme, users can hold shared objects andmove and rotate together in a mixed reality environment similar to realindustrial sites. Additionally, this system provides MS Hololens and Myosensors to recognize inputs from a user and provides results in a mixed realityenvironment. The proposed method is applied to install an air conditioneras a case study. Experimental results and user studies show that, comparedwith the conventional approach, the proposed method reduced the number ofconflicts, waiting time, and total working time. 展开更多
关键词 Mixed reality upper body motion recognition shared object manipulation adaptive task concurrency control
在线阅读 下载PDF
3D Object Detection with Attention:Shell-Based Modeling
15
作者 Xiaorui Zhang Ziquan Zhao +1 位作者 Wei Sun Qi Cui 《Computer Systems Science & Engineering》 SCIE EI 2023年第7期537-550,共14页
LIDAR point cloud-based 3D object detection aims to sense the surrounding environment by anchoring objects with the Bounding Box(BBox).However,under the three-dimensional space of autonomous driving scenes,the previou... LIDAR point cloud-based 3D object detection aims to sense the surrounding environment by anchoring objects with the Bounding Box(BBox).However,under the three-dimensional space of autonomous driving scenes,the previous object detection methods,due to the pre-processing of the original LIDAR point cloud into voxels or pillars,lose the coordinate information of the original point cloud,slow detection speed,and gain inaccurate bounding box positioning.To address the issues above,this study proposes a new two-stage network structure to extract point cloud features directly by PointNet++,which effectively preserves the original point cloud coordinate information.To improve the detection accuracy,a shell-based modeling method is proposed.It roughly determines which spherical shell the coordinates belong to.Then,the results are refined to ground truth,thereby narrowing the localization range and improving the detection accuracy.To improve the recall of 3D object detection with bounding boxes,this paper designs a self-attention module for 3D object detection with a skip connection structure.Some of these features are highlighted by weighting them on the feature dimensions.After training,it makes the feature weights that are favorable for object detection get larger.Thus,the extracted features are more adapted to the object detection task.Extensive comparison experiments and ablation experiments conducted on the KITTI dataset verify the effectiveness of our proposed method in improving recall and precision. 展开更多
关键词 3D object detection autonomous driving point cloud shell-based modeling self-attention mechanism
在线阅读 下载PDF
基于进化多任务的稀疏大规模多目标优化 被引量:2
16
作者 梁正平 王侃 +2 位作者 周倩 王继刚 朱泽轩 《计算机学报》 北大核心 2025年第2期358-380,共23页
稀疏大规模多目标优化存在稀疏位置探测困难、搜索空间巨大等诸多挑战,现有为数不多的稀疏大规模多目标优化算法在稀疏位置的探测准确率和非零决策变量的优化程度方面尚存在较大提升空间.为进一步提升稀疏大规模多目标优化的性能,本文... 稀疏大规模多目标优化存在稀疏位置探测困难、搜索空间巨大等诸多挑战,现有为数不多的稀疏大规模多目标优化算法在稀疏位置的探测准确率和非零决策变量的优化程度方面尚存在较大提升空间.为进一步提升稀疏大规模多目标优化的性能,本文从辅助任务构建与优化、辅助任务重新初始化、知识迁移等三个方面,提出了基于进化多任务优化的稀疏大规模多目标优化算法(Evolutionary Multi-Task for Sparse Large-scale Multi-objective Op⁃timization,SLMO-EMT).其中,辅助任务构建与优化方面,基于主任务精英解的稀疏分布,采用两种不同的方式对决策变量的搜索空间进行限定,构建分别用于对稀疏位置和非零决策变量进行降维优化的两个辅助任务.辅助任务重新初始化方面,根据辅助任务在历史迭代中的知识迁移效果,对其搜索空间和当前种群进行更新,以使辅助任务可持续促进主任务的进化.知识迁移方面,首先基于轮询方式和各辅助任务的知识迁移概率,挑选用于知识迁移的辅助任务,再基于相似度挑选适合的知识受体,最后在子代生成过程中采用迁移知识引导的局部交叉,借助辅助任务的知识促进主任务的进化.为验证SLMO-EMT的性能,将其与8个先进的稀疏大规模多目标优化算法在1000-10000维的32个基准测试实例,以及8个应用测试实例上进行对比,实验结果表明SLMO-EMT对于稀疏大规模多目标优化问题的求解具有明显的竞争优势.SLMO-EMT的源代码已在Github上公开:https://github.com/CIA-SZU/WK. 展开更多
关键词 稀疏大规模多目标优化 进化多任务 辅助任务 知识迁移
在线阅读 下载PDF
Advancing Crowd Object Detection: A Review of YOLO, CNN and ViTs Hybrid Approach*
17
作者 Mahmoud Atta Mohammed Ali Tarek Aly +2 位作者 Atef Tayh Raslan Mervat Gheith Essam A. Amin 《Journal of Intelligent Learning Systems and Applications》 2024年第3期175-221,共47页
One of the most basic and difficult areas of computer vision and image understanding applications is still object detection. Deep neural network models and enhanced object representation have led to significant progre... One of the most basic and difficult areas of computer vision and image understanding applications is still object detection. Deep neural network models and enhanced object representation have led to significant progress in object detection. This research investigates in greater detail how object detection has changed in the recent years in the deep learning age. We provide an overview of the literature on a range of cutting-edge object identification algorithms and the theoretical underpinnings of these techniques. Deep learning technologies are contributing to substantial innovations in the field of object detection. While Convolutional Neural Networks (CNN) have laid a solid foundation, new models such as You Only Look Once (YOLO) and Vision Transformers (ViTs) have expanded the possibilities even further by providing high accuracy and fast detection in a variety of settings. Even with these developments, integrating CNN, YOLO and ViTs, into a coherent framework still poses challenges with juggling computing demand, speed, and accuracy especially in dynamic contexts. Real-time processing in applications like surveillance and autonomous driving necessitates improvements that take use of each model type’s advantages. The goal of this work is to provide an object detection system that maximizes detection speed and accuracy while decreasing processing requirements by integrating YOLO, CNN, and ViTs. Improving real-time detection performance in changing weather and light exposure circumstances, as well as detecting small or partially obscured objects in crowded cities, are among the goals. We provide a hybrid architecture which leverages CNN for robust feature extraction, YOLO for rapid detection, and ViTs for remarkable global context capture via self-attention techniques. Using an innovative training regimen that prioritizes flexible learning rates and data augmentation procedures, the model is trained on an extensive dataset of urban settings. Compared to solo YOLO, CNN, or ViTs models, the suggested model exhibits an increase in detection accuracy. This improvement is especially noticeable in difficult situations such settings with high occlusion and low light. In addition, it attains a decrease in inference time in comparison to baseline models, allowing real-time object detection without performance loss. This work introduces a novel method of object identification that integrates CNN, YOLO and ViTs, in a synergistic way. The resultant framework extends the use of integrated deep learning models in practical applications while also setting a new standard for detection performance under a variety of conditions. Our research advances computer vision by providing a scalable and effective approach to object identification problems. Its possible uses include autonomous navigation, security, and other areas. 展开更多
关键词 object Detection Deep Learning Computer Vision YOLO Convolutional Neural Networks (CNN) Vision Transformers Neural Networks Transfer Learning Autonomous driving Self-Drive Vehicles
在线阅读 下载PDF
融合动态加权图卷积的三维目标检测 被引量:1
18
作者 李宗民 戎光彩 +2 位作者 白云 徐畅 鲜世洋 《计算机科学》 北大核心 2025年第3期104-111,共8页
三维目标检测是自动驾驶中最关键的技术之一,基于激光雷达的三维目标检测通常在点云构建的场景中进行。目前的三维检测方法不能充分地利用点云的结构信息,这将导致目标物体的误检和漏检。为此,提出了基于动态加权图卷积的DEG R-CNN。首... 三维目标检测是自动驾驶中最关键的技术之一,基于激光雷达的三维目标检测通常在点云构建的场景中进行。目前的三维检测方法不能充分地利用点云的结构信息,这将导致目标物体的误检和漏检。为此,提出了基于动态加权图卷积的DEG R-CNN。首先,在RoI中对节点设置主邻点和次邻点,为目标物体构建点云的图结构,恢复物体的几何信息;然后,在图中利用Gaussian函数和一维卷积,高效地聚合点云的结构特征;最后,使用交叉注意力机制自适应地融合不同粒度的图像特征,为点云补充图像语义信息。在KITTI数据集上进行实验,验证了各个模块的有效性,三维目标检测的3D mAP达到88.80%,相比基线模型提高了1.22%。同时,对三维目标检测的结果进行了可视化,并对可视化结果进行了分析。 展开更多
关键词 点云 三维目标检测 激光雷达 多模态融合 自动驾驶
在线阅读 下载PDF
项目驱动的机器人伺服电动机成果导向实践教学 被引量:1
19
作者 吴敏 赵建勇 +2 位作者 于彦雪 黄晓艳 郑太英 《实验室研究与探索》 北大核心 2025年第8期199-204,221,共7页
为提高学生工程实践能力和创新思维,填补伺服电动机驱动技术实践教学的空白,基于项目化教学理念,提出机器人伺服电动机驱动技术实践教学新模式,并构建以成果导向教育为核心的电动机控制实践教学框架。结合伺服电动机实际工程项目以及电... 为提高学生工程实践能力和创新思维,填补伺服电动机驱动技术实践教学的空白,基于项目化教学理念,提出机器人伺服电动机驱动技术实践教学新模式,并构建以成果导向教育为核心的电动机控制实践教学框架。结合伺服电动机实际工程项目以及电动机控制理论教学知识点,构建包含工程问题的核心实践内容;通过模块化拆分实践任务,细化教学目标,设计涵盖硬件电路设计与软件算法调试的机器人伺服电动机驱动实践教学案例。教学实践表明,该模式显著加深了学生对伺服电动机控制项目设计思路与实现方法的理解,有力推动电动机控制相关课程的实践教学改革。 展开更多
关键词 成果导向教育 伺服电机驱动 实践教学 工程能力
在线阅读 下载PDF
电磁-机械激励下电动汽车电驱动系统振动特性分析及优化 被引量:1
20
作者 葛帅帅 文一州 +2 位作者 张志刚 吴行 谢正邱 《重庆理工大学学报(自然科学)》 北大核心 2025年第1期1-11,共11页
为探究电磁-机械激励下电动汽车电驱动系统振动特性,建立了永磁同步电机有限元模型,研究了电机电磁特性及辅助槽对电机电磁转矩脉动的影响规律;在此基础上,建立了包括电机、传动系统、壳体总成的电驱动系统多柔体动力学模型,分析了电磁... 为探究电磁-机械激励下电动汽车电驱动系统振动特性,建立了永磁同步电机有限元模型,研究了电机电磁特性及辅助槽对电机电磁转矩脉动的影响规律;在此基础上,建立了包括电机、传动系统、壳体总成的电驱动系统多柔体动力学模型,分析了电磁激励和机械激励下电驱动系统壳体总成振动特性;研究了定转子开槽结构参数对电驱动系统振动特性的影响,选取定子槽宽、转子槽宽、转子槽间距等敏感参数作为优化变量,构建Kriging响应面模型,对电机辅助槽参数进行多目标优化。研究结果表明,电机24阶、48阶电磁激励对系统振动的贡献较大,优化后振动加速度频域幅值在24f、f z1、48f处分别下降了32.8%、56.6%、57.3%。 展开更多
关键词 电动汽车 电驱动系统 振动特性 多目标优化
在线阅读 下载PDF
上一页 1 2 85 下一页 到第
使用帮助 返回顶部