The flexibility of traditional image processing system is limited because those system are designed for specific applications. In this paper, a new TMS320C64x-based multi-DSP parallel computing architecture is present...The flexibility of traditional image processing system is limited because those system are designed for specific applications. In this paper, a new TMS320C64x-based multi-DSP parallel computing architecture is presented. It has many promising characteristics such as powerful computing capability, broad I/O bandwidth, topology flexibility, and expansibility. The parallel system performance is evaluated by practical experiment.展开更多
This paper introduces the architecture and implementation of an industrial robot control system based on Windows NT. This robot control system, which is based on a single-processor structure, can run on general indust...This paper introduces the architecture and implementation of an industrial robot control system based on Windows NT. This robot control system, which is based on a single-processor structure, can run on general industrial computers. Owing to using Windows NT's real-time extension RTX, the control system can achieve good realtime performance and friendly user interface in one general-purpose operating system. A three layer hierarchical architecture of control software is proposed to make the system more scalable and flexible. Furthermore a communication and configuration system is implemented to enable modules to communicate with each other, which make the control system scalable and flexible.展开更多
In this study,an automated multimodal system for detecting,classifying,and dating fruit was developed using a two-stage YOLOv11 pipeline.In the first stage,the YOLOv11 detection model locates individual date fruits in...In this study,an automated multimodal system for detecting,classifying,and dating fruit was developed using a two-stage YOLOv11 pipeline.In the first stage,the YOLOv11 detection model locates individual date fruits in real time by drawing bounding boxes around them.These bounding boxes are subsequently passed to a YOLOv11 classification model,which analyzes cropped images and assigns class labels.An additional counting module automatically tallies the detected fruits,offering a near-instantaneous estimation of quantity.The experimental results suggest high precision and recall for detection,high classification accuracy(across 15 classes),and near-perfect counting in real time.This paper presents a multi-stage pipeline for date fruit detection,classification,and automated counting,employing YOLOv11-based models to achieve high accuracy while maintaining real-time throughput.The results demonstrated that the detection precision exceeded 90%,the classification accuracy approached 92%,and the counting module correlated closely with the manual tallies.These findings confirm the potential of reducing manual labour and enhancing operational efficiency in post-harvesting processes.Future studies will include dataset expansion,user-centric interfaces,and integration with harvesting robotics.展开更多
Real-time detection for object size has now become a hot topic in the testing field and image processing is the core algorithm. This paper focuses on the processing and display of the collected dynamic images to achie...Real-time detection for object size has now become a hot topic in the testing field and image processing is the core algorithm. This paper focuses on the processing and display of the collected dynamic images to achieve a real-time image pro- cessing for the moving objects. Firstly, the median filtering, gain calibration, image segmentation, image binarization, cor- ner detection and edge fitting are employed to process the images of the moving objects to make the image close to the real object. Then, the processed images are simultaneously displayed on a real-time basis to make it easier to analyze, understand and identify them, and thus it reduces the computation complexity. Finally, human-computer interaction (HCI)-friendly in- terface based on VC ++ is designed to accomplish the digital logic transform, image processing and real-time display of the objects. The experiment shows that the proposed algorithm and software design have better real-time performance and accu- racy which can meet the industrial needs.展开更多
This paper introduces a computational cognitive architecture that serves as a comprehensive computational theory of the human mind,from cognitive science and computational psychology.The cognitive architecture(named C...This paper introduces a computational cognitive architecture that serves as a comprehensive computational theory of the human mind,from cognitive science and computational psychology.The cognitive architecture(named Clarion)has been justified by,and validated against,psychological data,findings,and theoretical constructs.One important theoretical background for it is the dual-process theories,which led to its overall two-level structuring in a hybrid neuro-symbolic way.Furthermore,given the recent advances in AI and computing technology,LLMs are being incorporated into the model to better capture human intuition and instinct(and implicit processes in general),in order to further enhance Clarion.Integrating Clarion and LLMs can also help to develop AI systems that are more capable,more reliable,and more human-like.Overall,the paper advocates a multidisciplinary approach towards developing better models for cognitive science and for AI.展开更多
This paper proposed a general purpose real-time image processing system based on a flexible DSP-based Network, which is implemented by a high bandwidth communication channel, links. The links is realized using FPGA an...This paper proposed a general purpose real-time image processing system based on a flexible DSP-based Network, which is implemented by a high bandwidth communication channel, links. The links is realized using FPGA and provides a bandwidth of 12. 8 Gbit/s. Using the links, The topologic of multi-DSP system can be changed online to meet the variabilities of the parallel algorithm of image processing. The system can be assembled with utmost tens of boards and maintain the high communication speed. Analysis of the system adaptivity to image processing is testified followed by actual results. Key words real-time image processing - multi-DSP - flexible - scalable - FPGA - links CLC number TP 303 Foundation item: Supported by the National Natural Science Foundation of China (60135020)Biography: MAO Hai-cen(1973-), male, Ph.D. candidate, research direction: artificial intelligence, expert system, pattern recognition and image processing展开更多
In the era of Big Data, typical architecture of distributed real-time stream processing systems is the combination of Flume, Kafka, and Storm. As a kind of distributed message system, Kafka has the characteristics of ...In the era of Big Data, typical architecture of distributed real-time stream processing systems is the combination of Flume, Kafka, and Storm. As a kind of distributed message system, Kafka has the characteristics of horizontal scalability and high throughput, which is manly deployed in many areas in order to address the problem of speed mismatch between message producers and consumers. When using Kafka, we need to quickly receive data sent by producers. In addition, we need to send data to consumers quickly. Therefore, the performance of Kafka is of critical importance to the performance of the whole stream processing system. In this paper, we propose the improved design of real-time stream processing systems, and focus on improving the Kafka's data loading process.We use Kafka cat to transfer data from the source to Kafka topic directly, which can reduce the network transmission. We also utilize the memory file system to accelerate the process of data loading, which can address the bottleneck and performance problems caused by disk I/O. Extensive experiments are conducted to evaluate the performance, which show the superiority of our improved design.展开更多
Radar is an electronic device that uses radio waves to determine the range, angle, or velocity of objects. Real-time signal and information processor is an important module for real-time positioning, imaging, detectio...Radar is an electronic device that uses radio waves to determine the range, angle, or velocity of objects. Real-time signal and information processor is an important module for real-time positioning, imaging, detection and recognition of targets. With the development of ultra-wideband technology, synthetic aperture technology, signal and information processing technology, the radar coverage, detection accuracy and resolution have been greatly improved, especially in terms of one-dimensional(1D) high-resolution radar detection, tracking, recognition, and two-dimensional(2D) synthetic aperture radar imaging technology. Meanwhile, for the application of radar detection and remote sensing with high resolution and wide swath, the amount of data has been greatly increased. Therefore, the radar is required to have low-latency and real-time processing capability under the constraints of size, weight and power consumption. This paper systematically introduces the new technology of high resolution radar and real-time signal and information processing. The key problems and solutions are discussed, including the detection and tracking of 1D high-resolution radar, the accurate signal modeling and wide-swath imaging for geosynchronous orbit synthetic aperture radar, and real-time signal and information processing architecture and efficient algorithms. Finally, the latest research progress and representative results are presented, and the development trends are prospected.展开更多
Complex event processing (CEP) can extract meaningful events for real-time locating system (RTLS) applications. To identify complex event accurately in RTLS, we propose a new RFID complex event processing method GEEP,...Complex event processing (CEP) can extract meaningful events for real-time locating system (RTLS) applications. To identify complex event accurately in RTLS, we propose a new RFID complex event processing method GEEP, which is based on the timed automata (TA) theory. By devising RFID locating application into complex events, we model the timing diagram of RFID data streams based on the TA. We optimize the constraint of the event streams and propose a novel method to derive the constraint between objects, as well as the constraint between object and location. Experiments prove the proposed method reduces the cost of RFID complex event processing, and improves the efficiency of the RTLS.展开更多
Personal desktop platform with teraflops peak performance of thousands of cores is realized at the price of conventional workstations using the programmable graphics processing units(GPUs).A GPU-based parallel Euler/N...Personal desktop platform with teraflops peak performance of thousands of cores is realized at the price of conventional workstations using the programmable graphics processing units(GPUs).A GPU-based parallel Euler/Navier-Stokes solver is developed for 2-D compressible flows by using NVIDIA′s Compute Unified Device Architecture(CUDA)programming model in CUDA Fortran programming language.The techniques of implementation of CUDA kernels,double-layered thread hierarchy and variety memory hierarchy are presented to form the GPU-based algorithm of Euler/Navier-Stokes equations.The resulting parallel solver is validated by a set of typical test flow cases.The numerical results show that dozens of times speedup relative to a serial CPU implementation can be achieved using a single GPU desktop platform,which demonstrates that a GPU desktop can serve as a costeffective parallel computing platform to accelerate computational fluid dynamics(CFD)simulations substantially.展开更多
A novel reconfigurable hardware system which uses both muhi-DSP and FPGA to attain high performance and real-time image processing are presented. The system structure and working principle of mainly processing multi-B...A novel reconfigurable hardware system which uses both muhi-DSP and FPGA to attain high performance and real-time image processing are presented. The system structure and working principle of mainly processing multi-BSP board, extended multi-DSP board are analysed. The outstanding advantage is that the communication among different board components of this system is supported by high speed link ports & serial ports for increasing the system performance and computational power. Then the implementation of embedded real-time operating systems (RTOS) by us is discussed in detail. In this system, we adopt two kinds of parallel structures controlled by RTOS for parallel processing of algorithms. The experimental results show that exploitive period of the system is short, and maintenance convenient. Thus it is suitable for real-time image processing and can get satisfactory effect of image recognition.展开更多
This paper describes a dynamically reconfigurable data-flow hardware architecture optimized for the computation of image and video. It is a scalable hierarchically organized parallel architecture that consists of data...This paper describes a dynamically reconfigurable data-flow hardware architecture optimized for the computation of image and video. It is a scalable hierarchically organized parallel architecture that consists of data-flow clusters and finite-state machine (FSM) controllers. Each cluster contains various kinds of ceils that are optimized for video processing. Furthermore, to facilitate the design process, we provide a C-like language for design specification and associated design tools. Some video applications have been implemented in the architecture to demonstrate the applicability and flexibility of the architecture. Experimental results show that the architecture, along with its video applications, can be used in many real-time video processing.展开更多
This paper analyzes the current difficulties encountered in on-line inspection systems of strip surface quality, specifically relating to problems with real-time processing of huge amounts of data. To address this nee...This paper analyzes the current difficulties encountered in on-line inspection systems of strip surface quality, specifically relating to problems with real-time processing of huge amounts of data. To address this need, this paper describes an FPGA-based high-speed image processing module with both hardware and software aspects. Improving these two aspects together will help the system achieve real-time processing of massive image data, and simplifies the architecture of the strip surface quality on-line inspection system.展开更多
A low-power complementary metal oxide semiconductor(CMOS) operational amplifier (op-amp) for real-time signal processing of micro air vehicle (MAV) is designed in this paper.Traditional folded cascode architectu...A low-power complementary metal oxide semiconductor(CMOS) operational amplifier (op-amp) for real-time signal processing of micro air vehicle (MAV) is designed in this paper.Traditional folded cascode architecture with positive channel metal oxide semiconductor(PMOS) differential input transistors and sub-threshold technology are applied under the low supply voltage.Simulation results show that this amplifier has significantly low power,while maintaining almost the same gain,bandwidth and other key performances.The power required is only 0.12 mW,which is applicable to low-power and low-voltage real-time signal acquisition and processing system.展开更多
A discrete model reference adaptive controller of robot arm is obtained by integrating the reduced dynamic model of robot, model reference adaptive control (MRAC) and digital signal processing (DSP) computer syste...A discrete model reference adaptive controller of robot arm is obtained by integrating the reduced dynamic model of robot, model reference adaptive control (MRAC) and digital signal processing (DSP) computer system into an electromechanical system. With the DSP computer system, the control signal of each joint of the robot arm can be processed in real time and independently. The simulation and experiment results show that with the control strategy, the robot achieved a good trajectory following precision, a good decoupling performance and a high real-time adaptivity.展开更多
Wireless sensor network nodes have only limited resources concerning memory and battery life-time. Mem- ory can be efficiently used by sharing data, and the life-time of a battery can be extended, when the node has lo...Wireless sensor network nodes have only limited resources concerning memory and battery life-time. Mem- ory can be efficiently used by sharing data, and the life-time of a battery can be extended, when the node has long power saving sleep-phases. We propose a publish/subscribe architecture that achieves these two aims. The results of our work are of great interest for sensor application developers, giving them now the opportu- nity to use our architecture for sharing data among different applications on the node as well as the different layers of the operating system. We introduce a blackboard which is used for centrally storing published val- ues, like measured data from a monitored sensor. This makes it possible to share stored data without monitoring the sensors once again, which is advantageously concerning power consumption, memory space, and reaction time. Beside the proposed publish/subscribe method for sensor nodes with its notification possibili- ties, our architecture fulfills also real-time requirements. We show how the well-known sensor operating system MANTIS OS can be extended by a real-time enabled, blackboard-based publish/subscribe architect- ture. This architecture and first of all its implementation is of special interest for cross layer optimization of sensor applications. Cross-layer approaches benefit from our architecture because the available implementa- tion can be used as an efficient framework for central storing and managing of shared values.展开更多
To develop technically feasible and economically favorable dynamic process control(DPC)strategies for an alternating activated sludge(AAS)system,a bench-scale continuous-flow alternating aerobic and anoxic reactor,per...To develop technically feasible and economically favorable dynamic process control(DPC)strategies for an alternating activated sludge(AAS)system,a bench-scale continuous-flow alternating aerobic and anoxic reactor,performing short-cut nitrogen removal from real domestic wastewater was operated under different control strategies for more than five months.A fixed-time control(FTC) study showed that bending-points on pH and oxidation-reduction potential(ORP)profiles accurately coincided with the major biologic...展开更多
Autogenic processes are widely found in various sedimentary systems and they play an important role in the depositional evolution and corresponding sedimentary architecture.However,autogenic processes are often affect...Autogenic processes are widely found in various sedimentary systems and they play an important role in the depositional evolution and corresponding sedimentary architecture.However,autogenic processes are often affected by changing allogenic factors and are difficult to be identified and analyzed from modern and ancient records.Through the flume tank experiment under constant boundary conditions,the depositional process,evolution principles,and the sedimentary architecture of a river-dominated delta was presented,and a corresponding sedimentary architecture model was constructed.The evolution of river-dominated delta controlled only by autogenic process is obviously periodic,and each autogenic cycle can be divided into an initial progradational stage,a middle retrogratational stage,and a late aggradational–progradational stage.In the initial progradational stage,one feeder channel incised into the delta plain,mouth bar(s)was formed in front of the channel mouth,and small-scale crevasse splays were formed on the delta plain.In the middle retrogradational stage,the feeder channel was blocked by the mouth bar(s)which grew out of water at the end of the initial stage,and a set of large-scale distributary splay complexes were formed on the delta plain.These distributary splay complexes were retrogradationally overlapped due to the continuous migration of the bifurcation point of the feeder channel.In the late aggradational–progradational stage,the feeder channel branched into several radial distributary channels,overlapped distributary channels were formed on the delta plain,and terminal lobe complexes were formed at the end of distributary channels.The three sedimentary layers formed in the three stages constituted an autogenic succession.The experimental delta consisted of six autogenic depositional successions.Dynamic allocation of accommodation space and the following adaptive sediments filling were the two main driving factors of the autogenic evolution of deltas.展开更多
A real-time pricing system of electricity is a system that charges different electricity prices for different hours of the day and for different days, and is effective for reducing the peak and flattening the load cur...A real-time pricing system of electricity is a system that charges different electricity prices for different hours of the day and for different days, and is effective for reducing the peak and flattening the load curve. In this paper, using a Markov decision process (MDP), we propose a modeling method and an optimal control method for real-time pricing systems. First, the outline of real-time pricing systems is explained. Next, a model of a set of customers is derived as a multi-agent MDP. Furthermore, the optimal control problem is formulated, and is reduced to a quadratic programming problem. Finally, a numerical simulation is presented.展开更多
To equip data-driven dynamic chemical process models with strong interpretability,we develop a light attention–convolution–gate recurrent unit(LACG)architecture with three sub-modules—a basic module,a brand-new lig...To equip data-driven dynamic chemical process models with strong interpretability,we develop a light attention–convolution–gate recurrent unit(LACG)architecture with three sub-modules—a basic module,a brand-new light attention module,and a residue module—that are specially designed to learn the general dynamic behavior,transient disturbances,and other input factors of chemical processes,respectively.Combined with a hyperparameter optimization framework,Optuna,the effectiveness of the proposed LACG is tested by distributed control system data-driven modeling experiments on the discharge flowrate of an actual deethanization process.The LACG model provides significant advantages in prediction accuracy and model generalization compared with other models,including the feedforward neural network,convolution neural network,long short-term memory(LSTM),and attention-LSTM.Moreover,compared with the simulation results of a deethanization model built using Aspen Plus Dynamics V12.1,the LACG parameters are demonstrated to be interpretable,and more details on the variable interactions can be observed from the model parameters in comparison with the traditional interpretable model attention-LSTM.This contribution enriches interpretable machine learning knowledge and provides a reliable method with high accuracy for actual chemical process modeling,paving a route to intelligent manufacturing.展开更多
基金This project was supported by the National Natural Science Foundation of China (60135020).
文摘The flexibility of traditional image processing system is limited because those system are designed for specific applications. In this paper, a new TMS320C64x-based multi-DSP parallel computing architecture is presented. It has many promising characteristics such as powerful computing capability, broad I/O bandwidth, topology flexibility, and expansibility. The parallel system performance is evaluated by practical experiment.
基金Supported by National Natural Science foundation of China (No. 69975014)
文摘This paper introduces the architecture and implementation of an industrial robot control system based on Windows NT. This robot control system, which is based on a single-processor structure, can run on general industrial computers. Owing to using Windows NT's real-time extension RTX, the control system can achieve good realtime performance and friendly user interface in one general-purpose operating system. A three layer hierarchical architecture of control software is proposed to make the system more scalable and flexible. Furthermore a communication and configuration system is implemented to enable modules to communicate with each other, which make the control system scalable and flexible.
基金supported by the Deanship of Scientific Research,Vice Presidency for Graduate Studies and Scientific Research,King Faisal University,Saudi Arabia,Grant No.KFU250098.
文摘In this study,an automated multimodal system for detecting,classifying,and dating fruit was developed using a two-stage YOLOv11 pipeline.In the first stage,the YOLOv11 detection model locates individual date fruits in real time by drawing bounding boxes around them.These bounding boxes are subsequently passed to a YOLOv11 classification model,which analyzes cropped images and assigns class labels.An additional counting module automatically tallies the detected fruits,offering a near-instantaneous estimation of quantity.The experimental results suggest high precision and recall for detection,high classification accuracy(across 15 classes),and near-perfect counting in real time.This paper presents a multi-stage pipeline for date fruit detection,classification,and automated counting,employing YOLOv11-based models to achieve high accuracy while maintaining real-time throughput.The results demonstrated that the detection precision exceeded 90%,the classification accuracy approached 92%,and the counting module correlated closely with the manual tallies.These findings confirm the potential of reducing manual labour and enhancing operational efficiency in post-harvesting processes.Future studies will include dataset expansion,user-centric interfaces,and integration with harvesting robotics.
基金National Natural Science Foundation of China(No.61302159,61227003,61301259)Natual Science Foundation of Shanxi Province(No.2012021011-2)+2 种基金Specialized Research Fund for the Doctoral Program of Higher Education,China(No.20121420110006)Top Science and Technology Innovation Teams of Higher Learning Institutions of Shanxi Province,ChinaProject Sponsored by Scientific Research for the Returned Overseas Chinese Scholars,Shanxi Province(No.2013-083)
文摘Real-time detection for object size has now become a hot topic in the testing field and image processing is the core algorithm. This paper focuses on the processing and display of the collected dynamic images to achieve a real-time image pro- cessing for the moving objects. Firstly, the median filtering, gain calibration, image segmentation, image binarization, cor- ner detection and edge fitting are employed to process the images of the moving objects to make the image close to the real object. Then, the processed images are simultaneously displayed on a real-time basis to make it easier to analyze, understand and identify them, and thus it reduces the computation complexity. Finally, human-computer interaction (HCI)-friendly in- terface based on VC ++ is designed to accomplish the digital logic transform, image processing and real-time display of the objects. The experiment shows that the proposed algorithm and software design have better real-time performance and accu- racy which can meet the industrial needs.
文摘This paper introduces a computational cognitive architecture that serves as a comprehensive computational theory of the human mind,from cognitive science and computational psychology.The cognitive architecture(named Clarion)has been justified by,and validated against,psychological data,findings,and theoretical constructs.One important theoretical background for it is the dual-process theories,which led to its overall two-level structuring in a hybrid neuro-symbolic way.Furthermore,given the recent advances in AI and computing technology,LLMs are being incorporated into the model to better capture human intuition and instinct(and implicit processes in general),in order to further enhance Clarion.Integrating Clarion and LLMs can also help to develop AI systems that are more capable,more reliable,and more human-like.Overall,the paper advocates a multidisciplinary approach towards developing better models for cognitive science and for AI.
文摘This paper proposed a general purpose real-time image processing system based on a flexible DSP-based Network, which is implemented by a high bandwidth communication channel, links. The links is realized using FPGA and provides a bandwidth of 12. 8 Gbit/s. Using the links, The topologic of multi-DSP system can be changed online to meet the variabilities of the parallel algorithm of image processing. The system can be assembled with utmost tens of boards and maintain the high communication speed. Analysis of the system adaptivity to image processing is testified followed by actual results. Key words real-time image processing - multi-DSP - flexible - scalable - FPGA - links CLC number TP 303 Foundation item: Supported by the National Natural Science Foundation of China (60135020)Biography: MAO Hai-cen(1973-), male, Ph.D. candidate, research direction: artificial intelligence, expert system, pattern recognition and image processing
基金supported by the Research Fund of National Key Laboratory of Computer Architecture under Grant No.CARCH201501the Open Project Program of the State Key Laboratory of Mathematical Engineering and Advanced Computing under Grant No.2016A09
文摘In the era of Big Data, typical architecture of distributed real-time stream processing systems is the combination of Flume, Kafka, and Storm. As a kind of distributed message system, Kafka has the characteristics of horizontal scalability and high throughput, which is manly deployed in many areas in order to address the problem of speed mismatch between message producers and consumers. When using Kafka, we need to quickly receive data sent by producers. In addition, we need to send data to consumers quickly. Therefore, the performance of Kafka is of critical importance to the performance of the whole stream processing system. In this paper, we propose the improved design of real-time stream processing systems, and focus on improving the Kafka's data loading process.We use Kafka cat to transfer data from the source to Kafka topic directly, which can reduce the network transmission. We also utilize the memory file system to accelerate the process of data loading, which can address the bottleneck and performance problems caused by disk I/O. Extensive experiments are conducted to evaluate the performance, which show the superiority of our improved design.
基金supported in part by the National Natural Science Foundation of China under Grant Nos.61427802,31727901,61625103,61501032,61471038the Chang Jiang Scholars Program(T2012122)+1 种基金part by the 111 project of China under Grant B14010supported by the Program for Changjiang Scholars and Innovative Research Team in University of Ministry of Education of China
文摘Radar is an electronic device that uses radio waves to determine the range, angle, or velocity of objects. Real-time signal and information processor is an important module for real-time positioning, imaging, detection and recognition of targets. With the development of ultra-wideband technology, synthetic aperture technology, signal and information processing technology, the radar coverage, detection accuracy and resolution have been greatly improved, especially in terms of one-dimensional(1D) high-resolution radar detection, tracking, recognition, and two-dimensional(2D) synthetic aperture radar imaging technology. Meanwhile, for the application of radar detection and remote sensing with high resolution and wide swath, the amount of data has been greatly increased. Therefore, the radar is required to have low-latency and real-time processing capability under the constraints of size, weight and power consumption. This paper systematically introduces the new technology of high resolution radar and real-time signal and information processing. The key problems and solutions are discussed, including the detection and tracking of 1D high-resolution radar, the accurate signal modeling and wide-swath imaging for geosynchronous orbit synthetic aperture radar, and real-time signal and information processing architecture and efficient algorithms. Finally, the latest research progress and representative results are presented, and the development trends are prospected.
文摘Complex event processing (CEP) can extract meaningful events for real-time locating system (RTLS) applications. To identify complex event accurately in RTLS, we propose a new RFID complex event processing method GEEP, which is based on the timed automata (TA) theory. By devising RFID locating application into complex events, we model the timing diagram of RFID data streams based on the TA. We optimize the constraint of the event streams and propose a novel method to derive the constraint between objects, as well as the constraint between object and location. Experiments prove the proposed method reduces the cost of RFID complex event processing, and improves the efficiency of the RTLS.
基金supported by the National Natural Science Foundation of China (No.11172134)the Funding of Jiangsu Innovation Program for Graduate Education (No.CXLX13_132)
文摘Personal desktop platform with teraflops peak performance of thousands of cores is realized at the price of conventional workstations using the programmable graphics processing units(GPUs).A GPU-based parallel Euler/Navier-Stokes solver is developed for 2-D compressible flows by using NVIDIA′s Compute Unified Device Architecture(CUDA)programming model in CUDA Fortran programming language.The techniques of implementation of CUDA kernels,double-layered thread hierarchy and variety memory hierarchy are presented to form the GPU-based algorithm of Euler/Navier-Stokes equations.The resulting parallel solver is validated by a set of typical test flow cases.The numerical results show that dozens of times speedup relative to a serial CPU implementation can be achieved using a single GPU desktop platform,which demonstrates that a GPU desktop can serve as a costeffective parallel computing platform to accelerate computational fluid dynamics(CFD)simulations substantially.
基金This project was supported by the National Natural Science Foundation of China(60135020) National Key Pre-researchProject of China(413010701 -3) .
文摘A novel reconfigurable hardware system which uses both muhi-DSP and FPGA to attain high performance and real-time image processing are presented. The system structure and working principle of mainly processing multi-BSP board, extended multi-DSP board are analysed. The outstanding advantage is that the communication among different board components of this system is supported by high speed link ports & serial ports for increasing the system performance and computational power. Then the implementation of embedded real-time operating systems (RTOS) by us is discussed in detail. In this system, we adopt two kinds of parallel structures controlled by RTOS for parallel processing of algorithms. The experimental results show that exploitive period of the system is short, and maintenance convenient. Thus it is suitable for real-time image processing and can get satisfactory effect of image recognition.
基金Foundation item: the National Natural Science Foundation of China (No. 61136002), the Key Project of Chinese Ministry of Education (No. 211180), and the Shaanxi Provincial Industrial and Technological Project (No. 2011k06-47).
文摘This paper describes a dynamically reconfigurable data-flow hardware architecture optimized for the computation of image and video. It is a scalable hierarchically organized parallel architecture that consists of data-flow clusters and finite-state machine (FSM) controllers. Each cluster contains various kinds of ceils that are optimized for video processing. Furthermore, to facilitate the design process, we provide a C-like language for design specification and associated design tools. Some video applications have been implemented in the architecture to demonstrate the applicability and flexibility of the architecture. Experimental results show that the architecture, along with its video applications, can be used in many real-time video processing.
文摘This paper analyzes the current difficulties encountered in on-line inspection systems of strip surface quality, specifically relating to problems with real-time processing of huge amounts of data. To address this need, this paper describes an FPGA-based high-speed image processing module with both hardware and software aspects. Improving these two aspects together will help the system achieve real-time processing of massive image data, and simplifies the architecture of the strip surface quality on-line inspection system.
基金Sponsored by the National Natural Science Foundation of China (60843005)the Basic Research Foundation of Beijing Institute of Technology(20070142018)
文摘A low-power complementary metal oxide semiconductor(CMOS) operational amplifier (op-amp) for real-time signal processing of micro air vehicle (MAV) is designed in this paper.Traditional folded cascode architecture with positive channel metal oxide semiconductor(PMOS) differential input transistors and sub-threshold technology are applied under the low supply voltage.Simulation results show that this amplifier has significantly low power,while maintaining almost the same gain,bandwidth and other key performances.The power required is only 0.12 mW,which is applicable to low-power and low-voltage real-time signal acquisition and processing system.
文摘A discrete model reference adaptive controller of robot arm is obtained by integrating the reduced dynamic model of robot, model reference adaptive control (MRAC) and digital signal processing (DSP) computer system into an electromechanical system. With the DSP computer system, the control signal of each joint of the robot arm can be processed in real time and independently. The simulation and experiment results show that with the control strategy, the robot achieved a good trajectory following precision, a good decoupling performance and a high real-time adaptivity.
文摘Wireless sensor network nodes have only limited resources concerning memory and battery life-time. Mem- ory can be efficiently used by sharing data, and the life-time of a battery can be extended, when the node has long power saving sleep-phases. We propose a publish/subscribe architecture that achieves these two aims. The results of our work are of great interest for sensor application developers, giving them now the opportu- nity to use our architecture for sharing data among different applications on the node as well as the different layers of the operating system. We introduce a blackboard which is used for centrally storing published val- ues, like measured data from a monitored sensor. This makes it possible to share stored data without monitoring the sensors once again, which is advantageously concerning power consumption, memory space, and reaction time. Beside the proposed publish/subscribe method for sensor nodes with its notification possibili- ties, our architecture fulfills also real-time requirements. We show how the well-known sensor operating system MANTIS OS can be extended by a real-time enabled, blackboard-based publish/subscribe architect- ture. This architecture and first of all its implementation is of special interest for cross layer optimization of sensor applications. Cross-layer approaches benefit from our architecture because the available implementa- tion can be used as an efficient framework for central storing and managing of shared values.
文摘To develop technically feasible and economically favorable dynamic process control(DPC)strategies for an alternating activated sludge(AAS)system,a bench-scale continuous-flow alternating aerobic and anoxic reactor,performing short-cut nitrogen removal from real domestic wastewater was operated under different control strategies for more than five months.A fixed-time control(FTC) study showed that bending-points on pH and oxidation-reduction potential(ORP)profiles accurately coincided with the major biologic...
基金supported by a National Natural Science Foundation of China(No.41802123)China Postdoctoral Science Foundation funded project(No.2018M630843)Open Fund of Key Laboratory of Exploration Technologies for Oil and Gas Resources(Yangtze University),Ministry of Education(No.K2017-31)
文摘Autogenic processes are widely found in various sedimentary systems and they play an important role in the depositional evolution and corresponding sedimentary architecture.However,autogenic processes are often affected by changing allogenic factors and are difficult to be identified and analyzed from modern and ancient records.Through the flume tank experiment under constant boundary conditions,the depositional process,evolution principles,and the sedimentary architecture of a river-dominated delta was presented,and a corresponding sedimentary architecture model was constructed.The evolution of river-dominated delta controlled only by autogenic process is obviously periodic,and each autogenic cycle can be divided into an initial progradational stage,a middle retrogratational stage,and a late aggradational–progradational stage.In the initial progradational stage,one feeder channel incised into the delta plain,mouth bar(s)was formed in front of the channel mouth,and small-scale crevasse splays were formed on the delta plain.In the middle retrogradational stage,the feeder channel was blocked by the mouth bar(s)which grew out of water at the end of the initial stage,and a set of large-scale distributary splay complexes were formed on the delta plain.These distributary splay complexes were retrogradationally overlapped due to the continuous migration of the bifurcation point of the feeder channel.In the late aggradational–progradational stage,the feeder channel branched into several radial distributary channels,overlapped distributary channels were formed on the delta plain,and terminal lobe complexes were formed at the end of distributary channels.The three sedimentary layers formed in the three stages constituted an autogenic succession.The experimental delta consisted of six autogenic depositional successions.Dynamic allocation of accommodation space and the following adaptive sediments filling were the two main driving factors of the autogenic evolution of deltas.
文摘A real-time pricing system of electricity is a system that charges different electricity prices for different hours of the day and for different days, and is effective for reducing the peak and flattening the load curve. In this paper, using a Markov decision process (MDP), we propose a modeling method and an optimal control method for real-time pricing systems. First, the outline of real-time pricing systems is explained. Next, a model of a set of customers is derived as a multi-agent MDP. Furthermore, the optimal control problem is formulated, and is reduced to a quadratic programming problem. Finally, a numerical simulation is presented.
基金support provided by the National Natural Science Foundation of China(22122802,22278044,and 21878028)the Chongqing Science Fund for Distinguished Young Scholars(CSTB2022NSCQ-JQX0021)the Fundamental Research Funds for the Central Universities(2022CDJXY-003).
文摘To equip data-driven dynamic chemical process models with strong interpretability,we develop a light attention–convolution–gate recurrent unit(LACG)architecture with three sub-modules—a basic module,a brand-new light attention module,and a residue module—that are specially designed to learn the general dynamic behavior,transient disturbances,and other input factors of chemical processes,respectively.Combined with a hyperparameter optimization framework,Optuna,the effectiveness of the proposed LACG is tested by distributed control system data-driven modeling experiments on the discharge flowrate of an actual deethanization process.The LACG model provides significant advantages in prediction accuracy and model generalization compared with other models,including the feedforward neural network,convolution neural network,long short-term memory(LSTM),and attention-LSTM.Moreover,compared with the simulation results of a deethanization model built using Aspen Plus Dynamics V12.1,the LACG parameters are demonstrated to be interpretable,and more details on the variable interactions can be observed from the model parameters in comparison with the traditional interpretable model attention-LSTM.This contribution enriches interpretable machine learning knowledge and provides a reliable method with high accuracy for actual chemical process modeling,paving a route to intelligent manufacturing.