期刊文献+
共找到58篇文章
< 1 2 3 >
每页显示 20 50 100
An Efficient Real-Time Fault-Tolerant Scheduling Algorithm Based on Multiprocessor Systems 被引量:6
1
作者 YANG Fumin LUO Wei PANG Liping 《Wuhan University Journal of Natural Sciences》 CAS 2007年第1期113-116,共4页
In the context of real-time fault-tolerant scheduling in multiprocessor systems, Primary-backup scheme plays an important role. A backup copy is always preferred to be executed as passive backup copy whenever possible... In the context of real-time fault-tolerant scheduling in multiprocessor systems, Primary-backup scheme plays an important role. A backup copy is always preferred to be executed as passive backup copy whenever possible because it can take the advantages of backup copy de-allocation technique and overloading technique to improve schedulability. In this paper, we propose a novel efficient fault-tolerant ratemonotonic best-fit algorithm efficient fault-tolerant rate-monotonic best-fit (ERMBF) based on multiprocessors systems to enhance the schedulability. Unlike existing scheduling algorithms that start scheduling tasks with only one processor. ERMBF pre-allocates a certain amount of processors before starting scheduling tasks, which enlarge the searching spaces for tasks. Besides, when a new processor is allocated, we reassign the task copies that have already been assigned to the existing processors in order to find a superior tasks assignment configuration. These two strategies are all aiming at making as many backup copies as possible to be executed as passive status. As a result, ERMBF can use fewer processors to schedule a set of tasks without losing real-time and fault-tolerant capabilities of the system. Simulation results reveal that ERMBF significantly improves the schedulability over existing, comparable algorithms in literature. 展开更多
关键词 real-time periodic tasks FAULT-TOLERANCE primary/backup copy multiprocessor systems
在线阅读 下载PDF
Dynamic Load Balancing Based on Restricted Multicast Tree in Homogeneous Multiprocessor Systems 被引量:1
2
作者 刘滨 石峰 高玉金 《Journal of Beijing Institute of Technology》 EI CAS 2008年第2期184-188,共5页
To decrease the cost of exchanging load information among processors, a dynamic load-balancing (DLB) algorithm which adopts multieast tree technology is proposed. The muhieast tree construction rules are also propos... To decrease the cost of exchanging load information among processors, a dynamic load-balancing (DLB) algorithm which adopts multieast tree technology is proposed. The muhieast tree construction rules are also proposed to avoid wrongly transferred or redundant DLB messages due to the overlapping of multicast trees. The proposed DLB algorithm is distributed controlled, sender initiated and can help heavily loaded processors with complete distribution of redundant loads with minimum number of executions. Experiments were executed to compare the effects of the proposed DLB algorithm and other three ones, the results prove the effectivity and practicability of the proposed algorithm in dealing with great scale compute-intensive tasks. 展开更多
关键词 dynamic load balancing (DLB) multicast tree RULE MESSAGE MULTIPROCESSOR
在线阅读 下载PDF
Fork-Join program response time on multiprocessors with exchangeable join 被引量:1
3
作者 WANG Yong-cai ZHAO Qian-chuan ZHENG Da-zhong 《Journal of Zhejiang University-Science A(Applied Physics & Engineering)》 SCIE EI CAS CSCD 2006年第6期927-936,共10页
The Fork-Join program consisting of K parallel tasks is a useful model for a large number of computing applications. When the parallel processor has multi-channels, later tasks may finish execution earlier than their ... The Fork-Join program consisting of K parallel tasks is a useful model for a large number of computing applications. When the parallel processor has multi-channels, later tasks may finish execution earlier than their earlier tasks and may join with tasks from other programs. This phenomenon is called exchangeable join (EJ), which introduces correlation to the task’s service time. In this work, we investigate the response time of multiprocessor systems with EJ with a new approach. We analyze two aspects of this kind of systems: exchangeable join (EJ) and the capacity constraint (CC). We prove that the system response time can be effectively reduced by EJ, while the reduced amount is constrained by the capacity of the multiprocessor. An upper bound model is constructed based on this analysis and a quick estimation algorithm is proposed. The approximation formula is verified by extensive simulation results, which show that the relative error of approximation is less than 5%. 展开更多
关键词 Exchangeable join First come first served (FCFS) Fork-Join MULTIPROCESSOR Response time
在线阅读 下载PDF
Timed Petri Net Models of Shared-Memory Bus-Based Multiprocessors 被引量:1
4
作者 Wlodek M. Zuberek 《Journal of Computer and Communications》 2018年第10期1-14,共14页
In shared-memory bus-based multiprocessors, when the number of processors grows, the processors spend an increasing amount of time waiting for access to the bus (and shared memory). This contention reduces the perform... In shared-memory bus-based multiprocessors, when the number of processors grows, the processors spend an increasing amount of time waiting for access to the bus (and shared memory). This contention reduces the performance of processors and imposes a limitation of the number of processors that can be used efficiently in bus-based systems. Since the multi-processor’s performance depends upon many parameters which affect the performance in different ways, timed Petri nets are used to model shared-memory bus-based multiprocessors at the instruction execution level, and the developed models are used to study how the performance of processors changes with the number of processors in the system. The results illustrate very well the restriction on the number of processors imposed by the shared bus. All performance characteristics presented in this paper are obtained by discrete-event simulation of Petri net models. 展开更多
关键词 SHARED-MEMORY multiprocessors BUS-BASED multiprocessors TIMED PETRI NETS Discrete-Event Simulation
暂未订购
A Class of Parallel Runge-Kutta Methods for Differential-Algebraic Systems of Index 2
5
作者 Fei Jinggao(Beijing Institute of Computer Application and Simulation Technology, 100854, P. R. China) 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 1999年第3期64-75,共12页
A class of parallel Runge-Kutta Methods for differential-algebraic equations of index 2are constructed for multiprocessor system. This paper gives the order conditions and investigatesthe convergence theory for such m... A class of parallel Runge-Kutta Methods for differential-algebraic equations of index 2are constructed for multiprocessor system. This paper gives the order conditions and investigatesthe convergence theory for such methods. 展开更多
关键词 MULTIPROCESSOR system PARALLEL algorithm Runges-Kutta method Differential-algebraic system
在线阅读 下载PDF
Study and Analysis of Bus Arbitration Mechanism in PI-MPS Multiprocessor System
6
作者 王申科 冯锡栋 +2 位作者 暴建民 李斌 杨孝宗 《Journal of Harbin Institute of Technology(New Series)》 EI CAS 1997年第3期26-29,共4页
This paper presents the mechanism of the bus arbitration in PI-MPS multiprocessor sys-tem,describes encode approach,arbiter timing states and uniqueness of master modular ininterconnection bus,and measures and analyse... This paper presents the mechanism of the bus arbitration in PI-MPS multiprocessor sys-tem,describes encode approach,arbiter timing states and uniqueness of master modular ininterconnection bus,and measures and analyses latency of bus arbitration as well. 展开更多
关键词 MULTIPROCESSOR system DISTRIBUTED system BUS
在线阅读 下载PDF
Low-cost fault tolerance in evolvable multiprocessor systems:a graceful degradation approach
7
作者 Shervin VAKILI Sied Mehdi FAKHRAIE +1 位作者 Siamak MOHAMMADI Ali AHMADI 《Journal of Zhejiang University-Science A(Applied Physics & Engineering)》 SCIE EI CAS CSCD 2009年第6期922-926,共5页
The evolvable multiprocessor (EvoMP), as a novel multiprocessor system-on-chip (MPSoC) machine with evolvable task decomposition and scheduling, claims a major feature of low-cost and efficient fault tolerance. Non-ce... The evolvable multiprocessor (EvoMP), as a novel multiprocessor system-on-chip (MPSoC) machine with evolvable task decomposition and scheduling, claims a major feature of low-cost and efficient fault tolerance. Non-centralized control and adaptive distribution of the program among the available processors are two major capabilities of this platform, which remarkably help to achieve an efficient fault tolerance scheme. This letter presents the operational as well as architectural details of this fault tolerance scheme. In this method, when a processor becomes faulty, it will be eliminated of contribution in program execution in remaining run-time. This method also utilizes dynamic rescheduling capability of the system to achieve the maximum possible efficiency after processor reduction. The results confirm the efficiency and remarkable advantages of the proposed approach over common redundancy based techniques in similar systems. 展开更多
关键词 Fault tolerance Multiprocessor system-on-chip (MPSoC) Genetic algorithm (GA) Adaptive task scheduling
原文传递
Design of efficient parallel algorithms on shared memory multiprocessors
8
作者 Qiao Xiangzhen (Institute of Computing Technology, Chinese Academg of Science Beijing 100080, P. R. China) 《Wuhan University Journal of Natural Sciences》 CAS 1996年第Z1期344-349,共6页
The design of parallel algorithms is studied in this paper. These algorithms are applicable to shared memory MIMD machines In this paper, the emphasis is put on the methods for design of the efficient parallel algori... The design of parallel algorithms is studied in this paper. These algorithms are applicable to shared memory MIMD machines In this paper, the emphasis is put on the methods for design of the efficient parallel algorithms. The design of efficient parallel algorithms should be based on the following considerationst algorithm parallelism and the hardware-parallelism; granularity of the parallel algorithm, algorithm optimization according to the underling parallel machine. In this paper , these principles are applied to solve a model problem of the PDE. The speedup of the new method is high. The results were tested and evaluated on a shared memory MIMD machine. The practical results were agree with the predicted performance. 展开更多
关键词 parallel algorithm shared memory multiprocessor parallel granularity optimization.
在线阅读 下载PDF
YH-MCS Reconstructable Fault-tolerant Multiprocessor Control System
9
作者 肖刚 《High Technology Letters》 EI CAS 1996年第2期17-20,共4页
FMS is the basic and frontier technology of advanced manufacturing.Its critical compo-nent is FMS control system.Reconstructable fault-tolerant multiprocessor control system,YH-MCS,is the result of the research on the... FMS is the basic and frontier technology of advanced manufacturing.Its critical compo-nent is FMS control system.Reconstructable fault-tolerant multiprocessor control system,YH-MCS,is the result of the research on the high-performance and high-reliable FMS con-trol system.This paper describes its architecture,technology characteristics,academic valueand application potentiality. 展开更多
关键词 FMS MULTIPROCESSOR FAULT-TOLERANCE
在线阅读 下载PDF
一种分片式多核处理器的用户级模拟器 被引量:6
10
作者 黄琨 马可 +2 位作者 曾洪博 张戈 章隆兵 《软件学报》 EI CSCD 北大核心 2008年第4期1069-1080,共12页
随着片上晶体管资源的增多和互连线延迟的加大,分片式多核微处理器已成为多核处理器设计的新方向.为了对这种新型处理器进行体系结构的深入研究和设计空间的探索,设计并实现了针对分片式多核处理器的用户级多核性能模拟器.该多核模拟器... 随着片上晶体管资源的增多和互连线延迟的加大,分片式多核微处理器已成为多核处理器设计的新方向.为了对这种新型处理器进行体系结构的深入研究和设计空间的探索,设计并实现了针对分片式多核处理器的用户级多核性能模拟器.该多核模拟器在龙芯2号单处理器核的基础上,完整地模拟了基于目录的Cache一致性协议和存储转发式片上互联网络的结构模型,详细地刻画了由于系统乱序处理各种请求应答和请求之间的冲突而造成的时序特性,可以通过运行各种串行或并行的工作负载对多核处理器的各种重要性能指标加以评估,为多核处理器的结构设计提供了快速、灵活、高效的研究平台. 展开更多
关键词 分片式CMP(chip multiprocessor) 模拟器 片上网络 性能分析 龙芯2号微处理器
在线阅读 下载PDF
AN OBJECT ORIENTED MODEL SCHEDULING FOR MEDIA-SOC
11
作者 Cheng Xingmei Yao Yingbiao +2 位作者 Zhang Yixiong Liu Peng Yao Qingdong 《Journal of Electronics(China)》 2009年第2期244-251,共8页
This paper proposes an object oriented model scheduling for parallel computing in media MultiProcessors System on Chip(MPSoC).Firstly,the Coarse Grain Data Flow Graph(CGDFG) parallel programming model is used in this ... This paper proposes an object oriented model scheduling for parallel computing in media MultiProcessors System on Chip(MPSoC).Firstly,the Coarse Grain Data Flow Graph(CGDFG) parallel programming model is used in this approach.Secondly,this approach has the feature of unified abstraction for software objects implementing in processor and hardware objects implementing in ASICs,easy for mapping CGDFG programming on MPSoC.This approach cuts down the kernel overhead and reduces the code size effectively.The principle of the oriented object model,the method of scheduling,and how to map a parallel programming through CGDFG to the MPSoC are analyzed in this approach.This approach also compares the code size and execution cycles with conventional control flow scheduling,and presents respective management overhead for one application in me-dia-SoC. 展开更多
关键词 Multimedia computing Coarse Grain Data Flow Graph(CGDFG) Parallel program-ming Real Time Operating system(RTOS) multiprocessors system on Chip(MPSoC)
在线阅读 下载PDF
t/k-fault diagnosis algorithm of n-dimensional hypercube network based on the MM*model 被引量:4
12
作者 LIANG Jiarong ZHOU Ning YUN Long 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2018年第1期216-222,共7页
Compared with accurate diagnosis, the system’s selfdiagnosing capability can be greatly increased through the t/kdiagnosis strategy at most k vertexes to be mistakenly identified as faulty under the comparison model,... Compared with accurate diagnosis, the system’s selfdiagnosing capability can be greatly increased through the t/kdiagnosis strategy at most k vertexes to be mistakenly identified as faulty under the comparison model, where k is typically a small number. Based on the Preparata, Metze, and Chien(PMC)model, the n-dimensional hypercube network is proved to be t/kdiagnosable. In this paper, based on the Maeng and Malek(MM)*model, a novel t/k-fault diagnosis(1≤k≤4) algorithm of ndimensional hypercube, called t/k-MM*-DIAG, is proposed to isolate all faulty processors within the set of nodes, among which the number of fault-free nodes identified wrongly as faulty is at most k. The time complexity in our algorithm is only O(2~n n~2). 展开更多
关键词 hypercube network t/k-diagnosis algorithm multiprocessor systems the Maeng and Malek(MM)* model Preparata Metze and Chien(PMC)
在线阅读 下载PDF
Development of FPGA Based NURBS Interpolator and Motion Controller with Multiprocessor Technique 被引量:2
13
作者 ZHAO Huan ZHU Limin +1 位作者 XIONG Zhenhua DING Han 《Chinese Journal of Mechanical Engineering》 SCIE EI CAS CSCD 2013年第5期940-947,共8页
The high-speed computational performance is gained at the cost of huge hardware resource,which restricts the application of high-accuracy algorithms because of the limited hardware cost in practical use.To solve the p... The high-speed computational performance is gained at the cost of huge hardware resource,which restricts the application of high-accuracy algorithms because of the limited hardware cost in practical use.To solve the problem,a novel method for designing the field programmable gate array(FPGA)-based non-uniform rational B-spline(NURBS) interpolator and motion controller,which adopts the embedded multiprocessor technique,is proposed in this study.The hardware and software design for the multiprocessor,one of which is for NURBS interpolation and the other for position servo control,is presented.Performance analysis and experiments on an X-Y table are carried out,hardware cost as well as consuming time for interpolation and motion control is compared with the existing methods.The experimental and comparing results indicate that,compared with the existing methods,the proposed method can reduce the hardware cost by 97.5% using higher-accuracy interpolation algorithm within the period of 0.5 ms.A method which ensures the real-time performance and interpolation accuracy,and reduces the hardware cost significantly is proposed,and it’s practical in the use of industrial application. 展开更多
关键词 NURBS interpolator FPGA-based interpolation MULTIPROCESSOR system on a programmable chip (SOPC) motion controller
在线阅读 下载PDF
Gradual refinement for application-specific MPSoC design from Simulink model to RTL implementation 被引量:1
14
作者 Kai HUANG Xiao-lang YAN +6 位作者 Sang-il HAN Soo-ik CHAE Ahmed A. JERRAYA Katalin POPOVICI Xavier GUERIN Lisane BRISOLARA Luigi CARRO 《Journal of Zhejiang University-Science A(Applied Physics & Engineering)》 SCIE EI CAS CSCD 2009年第2期151-164,共14页
The application-specific multiprocessor system-on-chip(MPSoC) architecture is becoming an attractive solution to deal with increasingly complex embedded applications,which require both high performance and flexible pr... The application-specific multiprocessor system-on-chip(MPSoC) architecture is becoming an attractive solution to deal with increasingly complex embedded applications,which require both high performance and flexible programmability. As an effective method for MPSoC development,we present a gradual refinement flow starting from a high-level Simulink model to a synthesizable and executable hardware and software specification. The proposed methodology consists of five different abstract levels:Simulink combined algorithm and architecture model(CAAM),virtual architecture(VA),transactional accurate architecture(TA),virtual prototype(VP) and field-programmable gate array(FPGA) emulation. Experimental results of Motion-JPEG and H.264 show that the proposed gradual refinement flow can generate various MPSoC architectures from an original Simulink model,allowing processor,communication and tasks design space exploration. 展开更多
关键词 Multiprocessor system-on-chip (MPSoC) design REFINEMENT Simulink systemC Motion-JPEG H.264
原文传递
DEVELOPMENT OF AN AIRBORNE SAR REAL-TIME DIGITAL IMAGING PROCESSOR 被引量:1
15
作者 Zhu Xixing Jin Lingjiao Huang Ling(institute of Electronics, Academia Sinica, Beijing 100080) 《Journal of Electronics(China)》 1996年第2期116-121,共6页
An airborne SAR real-time digital imaging processor is presented, and its realtime digital imaging principle and main technical parameters are analyzed briefly. The system configuration and logical structure are descr... An airborne SAR real-time digital imaging processor is presented, and its realtime digital imaging principle and main technical parameters are analyzed briefly. The system configuration and logical structure are described in detail. Finally the main features of this system and examples of imagery obtained with the system are also presented. 展开更多
关键词 SAR REAL-TIME IMAGING PROCESSOR MULTIPROCESSOR operation system
在线阅读 下载PDF
Approximation algorithm for multiprocessor parallel job scheduling 被引量:1
16
作者 陈松乔 黄金贵 陈建二 《Journal of Central South University of Technology》 2002年第4期267-272,共6页
P k |fix| C max problem is a new scheduling problem based on the multiprocessor parallel job, and it is proved to be NP hard problem when k ≥3. This paper focuses on the case of k =3. Some new observations and new te... P k |fix| C max problem is a new scheduling problem based on the multiprocessor parallel job, and it is proved to be NP hard problem when k ≥3. This paper focuses on the case of k =3. Some new observations and new techniques for P 3 |fix| C max problem are offered. The concept of semi normal schedulings is introduced, and a very simple linear time algorithm Semi normal Algorithm for constructing semi normal schedulings is developed. With the method of the classical Graham List Scheduling, a thorough analysis of the optimal scheduling on a special instance is provided, which shows that the algorithm is an approximation algorithm of ratio of 9/8 for any instance of P 3|fix| C max problem, and improves the previous best ratio of 7/6 by M.X.Goemans. 展开更多
关键词 MULTIPROCESSOR PARALLEL JOB SCHEDULING APPROXIMATION algorithm NP-HARD problem
在线阅读 下载PDF
一种改进的抗干扰性可预测的协作缓存机制
17
作者 苑风凯 季振洲 《智能计算机与应用》 2017年第6期19-24,共6页
私有Last Level Cache(LLC)配置可以提供更低的访问延迟和更好的性能隔离性,但剔除了共享未利用Cache资源的能力。协作缓存机制通过将本地LLC(溢出者)驱逐的Cache行溢出(Spill)到远程LLC(接收者)达到Cache容量共享的目的。后继的协作缓... 私有Last Level Cache(LLC)配置可以提供更低的访问延迟和更好的性能隔离性,但剔除了共享未利用Cache资源的能力。协作缓存机制通过将本地LLC(溢出者)驱逐的Cache行溢出(Spill)到远程LLC(接收者)达到Cache容量共享的目的。后继的协作缓存研究致力于Cache共享的有效性,然而,没有任何研究关注到接收者对于外来溢出行的抗干扰性。本文提出重用性和抗干扰性可预测的协作缓存(RAPCC)机制,进一步增强私有LLC的Cache容量共享能力。RAPCC借助重用位置分布(RPD)预测私有LLC的重用性和抗干扰性决定其接受溢出角色:溢出者、接收者、NEITHER和EITHER(全新中立的角色,既溢出又接收)。试验结果表明,RAPCC的HPKI和IPC性能表现均超越了CC、DSR、CBS、ASCC。 展开更多
关键词 Chip multiprocessors(CMPs) LAST Level Cache(LLC) 私有LLC 协作缓存
在线阅读 下载PDF
An On-chip Interconnection QoS Verification Platform of Processor of Large Data for Architectural Modeling Analysis 被引量:1
18
作者 Li Qinghua Qin Jilong +2 位作者 Ding Xu Wang Endong Gong Weifeng 《国际计算机前沿大会会议论文集》 2015年第1期126-128,共3页
This paper presents introduction for a QoS verification of on-chip interconnection based on the new progress of the industry, which combined with an AMD processor chip design for big data. Some verification experience... This paper presents introduction for a QoS verification of on-chip interconnection based on the new progress of the industry, which combined with an AMD processor chip design for big data. Some verification experience in architectural modeling and simulation of on-chip interconnection is also introduced in this paper. 展开更多
关键词 INTERCONNECT QOS Verification Modeling MULTIPROCESSOR Computer Architecture Big Data
在线阅读 下载PDF
Locality Aware Optimal Task Scheduling Algorithm for TriBA —— A Novel Scalable Architecture
19
作者 KHAN Haroon-Ur-Rashid 石峰 《Journal of Beijing Institute of Technology》 EI CAS 2008年第3期294-299,共6页
An optimal algorithmic approach to task scheduling for, triplet based architecture(TriBA), is proposed in this paper. TriBA is considered to be a high performance, distributed parallel computing architecture. TriBA ... An optimal algorithmic approach to task scheduling for, triplet based architecture(TriBA), is proposed in this paper. TriBA is considered to be a high performance, distributed parallel computing architecture. TriBA consists of a 2D grid of small, programmable processing units, each physically connected to its three neighbors. In parallel or distributed environment an efficient assignment of tasks to the processing elements is imperative to achieve fast job turnaround time. Moreover, the sojourn time experienced by each individual job should be minimized. The arriving jobs are comprised of parallel applications, each consisting of multiple-independent tasks that must be instantaneously assigned to processor queues, as they arrive. The processors independently and concurrently service these tasks. The key scheduling issues is, when some queue backlogs are small, an incoming job should first spread its tasks to those lightly loaded queues in order to take advantage of the parallel processing gain. Our algorithmic approach achieves optimality in task scheduling by assigning consecutive tasks to a triplet of processors exploiting locality in tasks. The experimental results show that tasks allocation to triplets of processing elements is efficient and optimal. Comparison to well accepted interconnection strategy, 2D mesh, is shown to prove the effectiveness of our algorithmic approach for TriBA. Finally we conclude that TriBA can be an efficient interconnection strategy for computations intensive applications, if tasks assignment is carried out optimally using algorithmic approach. 展开更多
关键词 multiprocessor architecture task scheduling MAPPING parallel processing SPEEDUP
在线阅读 下载PDF
Physical design method of MPSoC
20
作者 LIU Peng XIA Bing-jie TENG Zhao-wei 《Journal of Zhejiang University-Science A(Applied Physics & Engineering)》 SCIE EI CAS CSCD 2007年第4期631-637,共7页
Floorplan, clock network and power plan are crucial steps in deep sub-micron system-on-chip design. A novel di- agonal floorplan is integrated to enhance the data sharing between different cores in system-on-chip. Cus... Floorplan, clock network and power plan are crucial steps in deep sub-micron system-on-chip design. A novel di- agonal floorplan is integrated to enhance the data sharing between different cores in system-on-chip. Custom clock network con- taining hand-adjusted buffers and variable routing rules is constructed to realize balanced synchronization. Effective power plan considering both IR drop and electromigration achieves high utilization and maintains power integrity in our MediaSoC. Using such methods, deep sub-micron design challenges are managed under a fast prototyping methodology, which greatly shortens the design cycle. 展开更多
关键词 Physical design Fast prototyping FLOORPLAN Clock tree synthesis (CTS) Power plan Multiprocessor system-onchip (MPSoC)
在线阅读 下载PDF
上一页 1 2 3 下一页 到第
使用帮助 返回顶部