Presentation is loading. Please wait.

Presentation is loading. Please wait.

End of First Semester Presentation DigiSat Reliable Computer – Multiprocessor Control System. Niv Best, Shai Israeli Instructor: Oren Kerem HS-DS Lab,

Similar presentations


Presentation on theme: "End of First Semester Presentation DigiSat Reliable Computer – Multiprocessor Control System. Niv Best, Shai Israeli Instructor: Oren Kerem HS-DS Lab,"— Presentation transcript:

1 End of First Semester Presentation DigiSat Reliable Computer – Multiprocessor Control System. Niv Best, Shai Israeli Instructor: Oren Kerem HS-DS Lab, Technion, Winter 2003 This presentation will probably involve audience discussion, which will create action items. Use PowerPoint to keep track of these action items during your presentation In Slide Show, click on the right mouse button Select “Meeting Minder” Select the “Action Items” tab Type in action items as they come up Click OK to dismiss this box This will automatically create an Action Item slide at the end of your presentation with your points entered.

2 Project Goals Design & implement a hardware mechanism for multiprocessor monitoring & control. Part of the DigiSat reliable computer project.

3 The DigiSat Computer PowerPC based. Implemented upon the Virtex II-pro platform. Hardware redundancy throughout the entire system. Our project handles processor redundancy & control.

4 Description Satellites contain redundant hardware since servicing in space is not applicable. A monitoring system is required to identify & handle malfunctions. Must be implemented in hardware.

5 DigiSat Computer PPC1 PLB1 DATA ROUTE M1 M2 S1 S2 PPC2 PLB2 Our Project

6 Technology Virtex II-pro FPGA with embedded PowerPC cores.

7 PowerPC 405 32-bit RISC core. Low power consumption. Used in various system-on-chip (SoC) applications (PDAs, network routers, cellular phones…). Embedded within the Virtex II-pro platform.

8 PowerPC Interfaces RST EIC CPU CPM DBG ISPLB DSPLB

9 Software Used

10 Current Status Simulated a PowerPC system using ModelSim 5_6d. Built autonomic monitoring sub-systems that survey PowerPC hardware outputs. Studied Assembler programming of the PowerPC. Built state-machine for controller.

11 The Processor Controller

12 Controller State Machine (General) PPC_1 Online PPC_2 Online Switch 1  2 Switch 2  1 Error detected

13 Controller State-Machine Diagram

14 Signals Involved SignalI/OAffectsComments c405cpmtimerresetreqOCPM Timer Reset Request c405xxxmachinecheckOCPU Machine Check Error c405dbgstopackOEIC Stop Acknowledge c405rstchipresetreqORST Chip Reset Request c405plbdcuwritethruODSPLB Write Thru c405rstcoreresetreqORST Core Reset Request C405rstsysresetreqORST System Reset Request Bus_error_DetOPLBBus Error Dbgc405debughaltIDBG Debug Halt cpmc405clockICPMInput clock rstc405resetcoreIRST Reset Core sys_rstIPLBPLB RST plb_clkIPLBPLB CLK

15 Controller Implementation CPU1 CPU2 PLB1 PLB2 Controller + Arbiter MUX P_Sel To/From Peripherals External Signals

16 Controller Implementation

17 Controller Sub-Modules Core/Chip/System reset requests monitor. Timer reset request monitor. Write-thru monitor. Machine check error monitor. PLB error monitor.

18 Reset Requests Monitor Monitored signals: 1. Chip reset request (c405rstchipresetreq) 2. Core reset request (c405rstcoreresetreq) 3. System reset request (c405rstsysresetreq) If asserted, the required reset action is performed. Processors switch upon core reset request.

19 Timer Reset Monitor Monitored signal: c405cpmtimerresetreq The signal is the logical “OR” of the reset request signals. Serves as another way for resetting the system. Monitored along with the regular reset request signals – checks the watchdog functionality.

20 Timer Monitor Diagram Wt_to_chkr Chip reset request Watchdog timer Error C405rstchipresetreqC405rstcoreresetreqC405rstsysresetreqc405cpmtimerresetreqWt_to_err 00011 0 Timer reset request System reset request Core reset request

21 Write-Thru Monitor Write-back policy is unreliable in space. Monitored signal: c405plbdcuwritethru. Should remain high during normal operation. Low state – error in MMU.

22 Write-Thru Monitor Diagram writethruChecker PPC write-thru policy Write-thru Error c405plbdcuwritethruwritethru_err 01 10 0

23 Bus Error Monitor PPC core & PLB are “tightly coupled”. Monitored signal: bus_error_det Asserted when a bus error interrupt occurs. Implies an error has occurred within the PLB arbiter. Preliminary inspection of the PLB arbiter.

24 Bus Error Monitor Diagram BusErrorChecker Bus_error_detect Bus_err 00 11 0

25 Monitor Arbiters Our system consists of 3 identical monitoring sub-systems and an arbiter that uses majority voting to decide if an error signal is reliable. There are 2 ways of using the arbiters:

26 “ One Monitor to Rule Them All …” 1.Each signal is monitored thrice, passed on to an arbiter, then checked by ERR_Mon. M1 ARB M3 ARB M2 ARB Err_Mon Controller M1 ARB M3 ARB M2 ARB Err_Mon

27 “ The Fellowship of The Signals ” 2.Each signal batch is monitored by Err_Mon, results are passed onto an arbiter. Controller M1 M3 M2Err_Mon M1 M3 M2Err_Mon M1 M3 M2Err_Mon ARB M1 M3 M2Err_Mon M1 M3 M2Err_Mon M1 M3 M2Err_Mon ARB

28 Monitor Arbiters err1err2err3Err_out 0000 0010 0100 0111 1000 1011 1101 1111 0

29 Controller Demonstration

30 Schedule Find a way to initiate CoreReset on PPC. (3 weeks) Find more ways to monitor the PPC. (5 weeks) Write WatchDog software to check internal parts of PPC. (8 weeks) Download hardware to chip and run tests on single PPC. (11 weeks) Wait for chip with dual cores. (T.B.D)


Download ppt "End of First Semester Presentation DigiSat Reliable Computer – Multiprocessor Control System. Niv Best, Shai Israeli Instructor: Oren Kerem HS-DS Lab,"

Similar presentations


Ads by Google