A multi-objective optimization and analysis model of the sintering process based on BP neural network is presented. Genetic algorithms are combined to simplify the BP neural network, which can reduce the learning time...A multi-objective optimization and analysis model of the sintering process based on BP neural network is presented. Genetic algorithms are combined to simplify the BP neural network, which can reduce the learning time and increase the forecasting accuracy of the network model. This model has been experimented in the sintering process, and the production cost, the energy consumption, the quality (revolving intensity), and the output are considered at the same time. Moreover, the relation between some factors and the multi-objectives has been analyzed, and the results are consistent with the process. Different objectives are emphasized at different practical periods, and this can provide a theoretical basis for the manager.展开更多
In today’s world, computer vision technology has become a very important direction in the field of Internet applications. As one of the basic problems of computer vision, object detection has become the basis of many...In today’s world, computer vision technology has become a very important direction in the field of Internet applications. As one of the basic problems of computer vision, object detection has become the basis of many vision tasks. Whether we need to realize the interaction between images and text or recognize fine categories, it provides reliable information. This article reviews the development of object detection networks. Starting from RCNN, we introduce object detection based on candidate regions, including Fast R-CNN, Faster R-CNN, etc.;and then start to introduce single-shot networks including YOLO, SSD, and Retina Net, etc. Detectors are the most excellent methods at present. By reviewing the current research status of object detection networks, it provides suggestions for the further development trend and research of object detection.展开更多
This paper concerns the problem of object segmentation in real-time for picking system. A region proposal method inspired by human glance based on the convolutional neural network is proposed to select promising regio...This paper concerns the problem of object segmentation in real-time for picking system. A region proposal method inspired by human glance based on the convolutional neural network is proposed to select promising regions, allowing more processing is reserved only for these regions. The speed of object segmentation is significantly improved by the region proposal method.By the combination of the region proposal method based on the convolutional neural network and superpixel method, the category and location information can be used to segment objects and image redundancy is significantly reduced. The processing time is reduced considerably by this to achieve the real time. Experiments show that the proposed method can segment the interested target object in real time on an ordinary laptop.展开更多
In this Paper, a classification method based on neural networks is presented for recognition of 3D objects. Indeed, the objective of this paper is to classify an object query against objects in a database, which leads...In this Paper, a classification method based on neural networks is presented for recognition of 3D objects. Indeed, the objective of this paper is to classify an object query against objects in a database, which leads to recognition of the former. 3D objects of this database are transformations of other objects by one element of the overall transformation. The set of transformations considered in this work is the general affine group.展开更多
MPEG 4 is a basic tool for interactivity and manipulation of video sequences. Video object segmentation is a key issue in defining the content of any video sequence, which is often divided into two steps: initial obj...MPEG 4 is a basic tool for interactivity and manipulation of video sequences. Video object segmentation is a key issue in defining the content of any video sequence, which is often divided into two steps: initial object segmentation and object tracking. In this paper, an initial object segmentation method for video object plane(VOP) generation using color information is proposed. Based on 3 by 3 linear templates, a cellular neural network (CNN) is used to implemented object segmentation. The Experimental results are presented to verify the efficiency and robustness of this approach.展开更多
In this paper, the modelling and multi-objective optimal control of batch processes, using a recurrent neuro-fuzzy network, are presented. The recurrent neuro-fuzzy network, forms a "global" nonlinear long-range pre...In this paper, the modelling and multi-objective optimal control of batch processes, using a recurrent neuro-fuzzy network, are presented. The recurrent neuro-fuzzy network, forms a "global" nonlinear long-range prediction model through the fuzzy conjunction of a number of "local" linear dynamic models. Network output is fed back to network input through one or more time delay units, which ensure that predictions from the recurrent neuro-fuzzy network are long-range. In building a recurrent neural network model, process knowledge is used initially to partition the processes non-linear characteristics into several local operating regions, and to aid in the initialisation of corresponding network weights. Process operational data is then used to train the network. Membership functions of the local regimes are identified, and local models are discovered via network training. Based on a recurrent neuro-fuzzy network model, a multi-objective optimal control policy can be obtained. The proposed technique is applied to a fed-batch reactor.展开更多
While moving ahead with the object detection technology, especially deep neural networks, many related tasks, such as medical application and industrial automation, have achieved great success. However, the detection ...While moving ahead with the object detection technology, especially deep neural networks, many related tasks, such as medical application and industrial automation, have achieved great success. However, the detection of objects with multiple aspect ratios and scales is still a key problem. This paper proposes a top-down and bottom-up feature pyramid network(TDBU-FPN),which combines multi-scale feature representation and anchor generation at multiple aspect ratios. First, in order to build the multi-scale feature map, this paper puts a number of fully convolutional layers after the backbone. Second, to link neighboring feature maps, top-down and bottom-up flows are adopted to introduce context information via top-down flow and supplement suboriginal information via bottom-up flow. The top-down flow refers to the deconvolution procedure, and the bottom-up flow refers to the pooling procedure. Third, the problem of adapting different object aspect ratios is tackled via many anchor shapes with different aspect ratios on each multi-scale feature map. The proposed method is evaluated on the pattern analysis, statistical modeling and computational learning visual object classes(PASCAL VOC)dataset and reaches an accuracy of 79%, which exhibits a 1.8% improvement with a detection speed of 23 fps.展开更多
The devastating effects of wildland fire are an unsolved problem,resulting in human losses and the destruction of natural and economic resources.Convolutional neural network(CNN)is shown to perform very well in the ar...The devastating effects of wildland fire are an unsolved problem,resulting in human losses and the destruction of natural and economic resources.Convolutional neural network(CNN)is shown to perform very well in the area of object classification.This network has the ability to perform feature extraction and classification within the same architecture.In this paper,we propose a CNN for identifying fire in videos.A deep domain based method for video fire detection is proposed to extract a powerful feature representation of fire.Testing on real video sequences,the proposed approach achieves better classification performance as some of relevant conventional video based fire detection methods and indicates that using CNN to detect fire in videos is efficient.To balance the efficiency and accuracy,the model is fine-tuned considering the nature of the target problem and fire data.Experimental results on benchmark fire datasets reveal the effectiveness of the proposed framework and validate its suitability for fire detection in closed-circuit television surveillance systems compared to state-of-the-art methods.展开更多
The coating on the electrodes contains many kinds of raw materials which affect significantly on the mechanical properties of deposited metals. It is still a problem how to predict and control the mechanical propertie...The coating on the electrodes contains many kinds of raw materials which affect significantly on the mechanical properties of deposited metals. It is still a problem how to predict and control the mechanical properties of deposited metals directly according to the components of coating on the electrodes. In this paper an electrode intelligent design system is developed by means of fuzzy neural network technology and genetic algorithm,, dynamic link library, object linking and embedding and multithreading. The front-end application and customer interface of the system is realized by using visual C ++ program language and taking SQL Server 2000 as background database. It realizes series functions including automatic design of electrode formula, intelligent prediction of electrode properties, inquiry of electrode information, output of process report based on normalized template and electronic storage and search of relative files.展开更多
Machine learning is an integral technology many people utilize in all areas of human life. It is pervasive in modern living worldwide, and has multiple usages. One application is image classification, embraced across ...Machine learning is an integral technology many people utilize in all areas of human life. It is pervasive in modern living worldwide, and has multiple usages. One application is image classification, embraced across many spheres of influence such as business, finance, medicine, etc. to enhance produces, causes, efficiency, etc. This need for more accurate, detail-oriented classification increases the need for modifications, adaptations, and innovations to Deep Learning Algorithms. This article used Convolutional Neural Networks (CNN) to classify scenes in the CIFAR-10 database, and detect emotions in the KDEF database. The proposed method converted the data to the wavelet domain to attain greater accuracy and comparable efficiency to the spatial domain processing. By dividing image data into subbands, important feature learning occurred over differing low to high frequencies. The combination of the learned low and high frequency features, and processing the fused feature mapping resulted in an advance in the detection accuracy. Comparing the proposed methods to spatial domain CNN and Stacked Denoising Autoencoder (SDA), experimental findings revealed a substantial increase in accuracy.展开更多
An approach of adaptive predictive control with a new structure and a fast algorithm of neural network (NN) is proposed. NN modeling and optimal predictive control are combined to achieve both accuracy and good contro...An approach of adaptive predictive control with a new structure and a fast algorithm of neural network (NN) is proposed. NN modeling and optimal predictive control are combined to achieve both accuracy and good control performance. The output of nonlinear network model is adopted as a measured disturbance that is therefore weakened in predictive feed-forward control. Simulation and practical application show the effectiveness of control by the proposed approach.展开更多
目的利用2023年新推出的YOLOv8m网络,开发一款人工智能辅助系统,旨在实现腺瘤性息肉的自动定位和诊断。方法使用4个结肠息肉数据集,总计包括9411张静态图像和25段视频。所涵盖的息肉类别包括增生性息肉和腺瘤性息肉。利用LabelMe工具对...目的利用2023年新推出的YOLOv8m网络,开发一款人工智能辅助系统,旨在实现腺瘤性息肉的自动定位和诊断。方法使用4个结肠息肉数据集,总计包括9411张静态图像和25段视频。所涵盖的息肉类别包括增生性息肉和腺瘤性息肉。利用LabelMe工具对图像进行标注,并将标注数据转换成适用于深度学习模型训练的YOLO格式。在模型训练方面,采用预训练的YOLOv5m和YOLOv8m模型,并结合实时数据增强以及多种图像处理技术进行迁移学习训练。模型性能的评估采用多个指标,包括敏感性、特异性、假阳性率和检测速度(每秒帧数,frames per second,FPS)、平均精度(mean average precision,mAP)等。此外,还使用混淆矩阵进行详细评估,并将模型的性能与不同资历的医师进行比较分析。结果在对1411个息肉的验证集进行评估中,YOLOv8m模型在多项性能指标上超越了YOLOv5。YOLOv8m的整体准确率为98.58%,在腺瘤性息肉、增生性息肉检测的敏感性分别为98.06%和99.32%,特异性分别为99.33%和98.09%,不同类型息肉预测的mAP50为0.994。在与内镜医师的性能比较中,YOLOv8m模型在准确率(98.58%)和处理速度(60.61帧/s)方面均优于低年资(准确率为86.02%)和高年资内镜医师(准确率为93.14%),其处理速度是低年资内镜医师的67.2倍。结论基于YOLOv8m网络的深度学习模型能够快速、精确地检测与分类结直肠息肉,在辅助内镜医师提高腺瘤性息肉检出率方面展现出很大的应用潜力。展开更多
文摘A multi-objective optimization and analysis model of the sintering process based on BP neural network is presented. Genetic algorithms are combined to simplify the BP neural network, which can reduce the learning time and increase the forecasting accuracy of the network model. This model has been experimented in the sintering process, and the production cost, the energy consumption, the quality (revolving intensity), and the output are considered at the same time. Moreover, the relation between some factors and the multi-objectives has been analyzed, and the results are consistent with the process. Different objectives are emphasized at different practical periods, and this can provide a theoretical basis for the manager.
文摘In today’s world, computer vision technology has become a very important direction in the field of Internet applications. As one of the basic problems of computer vision, object detection has become the basis of many vision tasks. Whether we need to realize the interaction between images and text or recognize fine categories, it provides reliable information. This article reviews the development of object detection networks. Starting from RCNN, we introduce object detection based on candidate regions, including Fast R-CNN, Faster R-CNN, etc.;and then start to introduce single-shot networks including YOLO, SSD, and Retina Net, etc. Detectors are the most excellent methods at present. By reviewing the current research status of object detection networks, it provides suggestions for the further development trend and research of object detection.
基金supported by the National Natural Science Foundation of China(61233010 61305106)+2 种基金the Shanghai Natural Science Foundation(17ZR1409700 18ZR1415300)the basic research project of Shanghai Municipal Science and Technology Commission(16JC1400900)
文摘This paper concerns the problem of object segmentation in real-time for picking system. A region proposal method inspired by human glance based on the convolutional neural network is proposed to select promising regions, allowing more processing is reserved only for these regions. The speed of object segmentation is significantly improved by the region proposal method.By the combination of the region proposal method based on the convolutional neural network and superpixel method, the category and location information can be used to segment objects and image redundancy is significantly reduced. The processing time is reduced considerably by this to achieve the real time. Experiments show that the proposed method can segment the interested target object in real time on an ordinary laptop.
文摘In this Paper, a classification method based on neural networks is presented for recognition of 3D objects. Indeed, the objective of this paper is to classify an object query against objects in a database, which leads to recognition of the former. 3D objects of this database are transformations of other objects by one element of the overall transformation. The set of transformations considered in this work is the general affine group.
文摘MPEG 4 is a basic tool for interactivity and manipulation of video sequences. Video object segmentation is a key issue in defining the content of any video sequence, which is often divided into two steps: initial object segmentation and object tracking. In this paper, an initial object segmentation method for video object plane(VOP) generation using color information is proposed. Based on 3 by 3 linear templates, a cellular neural network (CNN) is used to implemented object segmentation. The Experimental results are presented to verify the efficiency and robustness of this approach.
基金This work was supported by the UK EPSRC (GR/N13319, GR/R10875).
文摘In this paper, the modelling and multi-objective optimal control of batch processes, using a recurrent neuro-fuzzy network, are presented. The recurrent neuro-fuzzy network, forms a "global" nonlinear long-range prediction model through the fuzzy conjunction of a number of "local" linear dynamic models. Network output is fed back to network input through one or more time delay units, which ensure that predictions from the recurrent neuro-fuzzy network are long-range. In building a recurrent neural network model, process knowledge is used initially to partition the processes non-linear characteristics into several local operating regions, and to aid in the initialisation of corresponding network weights. Process operational data is then used to train the network. Membership functions of the local regimes are identified, and local models are discovered via network training. Based on a recurrent neuro-fuzzy network model, a multi-objective optimal control policy can be obtained. The proposed technique is applied to a fed-batch reactor.
基金supported by the Program of Introducing Talents of Discipline to Universities(111 Plan)of China(B14010)the National Natural Science Foundation of China(31727901)
文摘While moving ahead with the object detection technology, especially deep neural networks, many related tasks, such as medical application and industrial automation, have achieved great success. However, the detection of objects with multiple aspect ratios and scales is still a key problem. This paper proposes a top-down and bottom-up feature pyramid network(TDBU-FPN),which combines multi-scale feature representation and anchor generation at multiple aspect ratios. First, in order to build the multi-scale feature map, this paper puts a number of fully convolutional layers after the backbone. Second, to link neighboring feature maps, top-down and bottom-up flows are adopted to introduce context information via top-down flow and supplement suboriginal information via bottom-up flow. The top-down flow refers to the deconvolution procedure, and the bottom-up flow refers to the pooling procedure. Third, the problem of adapting different object aspect ratios is tackled via many anchor shapes with different aspect ratios on each multi-scale feature map. The proposed method is evaluated on the pattern analysis, statistical modeling and computational learning visual object classes(PASCAL VOC)dataset and reaches an accuracy of 79%, which exhibits a 1.8% improvement with a detection speed of 23 fps.
基金National Natural Science Foundation of China(No.61573095)Natural Science Foundation of Shanghai,China(No.6ZR1446700)
文摘The devastating effects of wildland fire are an unsolved problem,resulting in human losses and the destruction of natural and economic resources.Convolutional neural network(CNN)is shown to perform very well in the area of object classification.This network has the ability to perform feature extraction and classification within the same architecture.In this paper,we propose a CNN for identifying fire in videos.A deep domain based method for video fire detection is proposed to extract a powerful feature representation of fire.Testing on real video sequences,the proposed approach achieves better classification performance as some of relevant conventional video based fire detection methods and indicates that using CNN to detect fire in videos is efficient.To balance the efficiency and accuracy,the model is fine-tuned considering the nature of the target problem and fire data.Experimental results on benchmark fire datasets reveal the effectiveness of the proposed framework and validate its suitability for fire detection in closed-circuit television surveillance systems compared to state-of-the-art methods.
文摘The coating on the electrodes contains many kinds of raw materials which affect significantly on the mechanical properties of deposited metals. It is still a problem how to predict and control the mechanical properties of deposited metals directly according to the components of coating on the electrodes. In this paper an electrode intelligent design system is developed by means of fuzzy neural network technology and genetic algorithm,, dynamic link library, object linking and embedding and multithreading. The front-end application and customer interface of the system is realized by using visual C ++ program language and taking SQL Server 2000 as background database. It realizes series functions including automatic design of electrode formula, intelligent prediction of electrode properties, inquiry of electrode information, output of process report based on normalized template and electronic storage and search of relative files.
文摘Machine learning is an integral technology many people utilize in all areas of human life. It is pervasive in modern living worldwide, and has multiple usages. One application is image classification, embraced across many spheres of influence such as business, finance, medicine, etc. to enhance produces, causes, efficiency, etc. This need for more accurate, detail-oriented classification increases the need for modifications, adaptations, and innovations to Deep Learning Algorithms. This article used Convolutional Neural Networks (CNN) to classify scenes in the CIFAR-10 database, and detect emotions in the KDEF database. The proposed method converted the data to the wavelet domain to attain greater accuracy and comparable efficiency to the spatial domain processing. By dividing image data into subbands, important feature learning occurred over differing low to high frequencies. The combination of the learned low and high frequency features, and processing the fused feature mapping resulted in an advance in the detection accuracy. Comparing the proposed methods to spatial domain CNN and Stacked Denoising Autoencoder (SDA), experimental findings revealed a substantial increase in accuracy.
基金the National Natural Science Foundation of China (No. 60075012).
文摘An approach of adaptive predictive control with a new structure and a fast algorithm of neural network (NN) is proposed. NN modeling and optimal predictive control are combined to achieve both accuracy and good control performance. The output of nonlinear network model is adopted as a measured disturbance that is therefore weakened in predictive feed-forward control. Simulation and practical application show the effectiveness of control by the proposed approach.
文摘目的利用2023年新推出的YOLOv8m网络,开发一款人工智能辅助系统,旨在实现腺瘤性息肉的自动定位和诊断。方法使用4个结肠息肉数据集,总计包括9411张静态图像和25段视频。所涵盖的息肉类别包括增生性息肉和腺瘤性息肉。利用LabelMe工具对图像进行标注,并将标注数据转换成适用于深度学习模型训练的YOLO格式。在模型训练方面,采用预训练的YOLOv5m和YOLOv8m模型,并结合实时数据增强以及多种图像处理技术进行迁移学习训练。模型性能的评估采用多个指标,包括敏感性、特异性、假阳性率和检测速度(每秒帧数,frames per second,FPS)、平均精度(mean average precision,mAP)等。此外,还使用混淆矩阵进行详细评估,并将模型的性能与不同资历的医师进行比较分析。结果在对1411个息肉的验证集进行评估中,YOLOv8m模型在多项性能指标上超越了YOLOv5。YOLOv8m的整体准确率为98.58%,在腺瘤性息肉、增生性息肉检测的敏感性分别为98.06%和99.32%,特异性分别为99.33%和98.09%,不同类型息肉预测的mAP50为0.994。在与内镜医师的性能比较中,YOLOv8m模型在准确率(98.58%)和处理速度(60.61帧/s)方面均优于低年资(准确率为86.02%)和高年资内镜医师(准确率为93.14%),其处理速度是低年资内镜医师的67.2倍。结论基于YOLOv8m网络的深度学习模型能够快速、精确地检测与分类结直肠息肉,在辅助内镜医师提高腺瘤性息肉检出率方面展现出很大的应用潜力。