Dtsi/Sol CEA System Software Activities 125/02/2005VD R&D topics Designing tools and system software for: The management of parallelism Mono-processor system Distributed Multi-processor system Real time environment (I&C) Safety-critical applications (reliability, dependability) Main projects: Supercomputer realization of an integrated operating system to manage all the hardware resources of a massively parallel machine OASIS A complete tools suite with a dedicated safety-oriented real time kernel for safety-classified system
Dtsi/Sol CEA System Software Activities 225/02/2005VD Supercomputer system software Main functional features: Management of 3 level of parallelism Jobs Tasks SIMD Automatic decomposition of Jobs and Tasks Automatic dispatching and scheduling Non functional features: Security design Protection against erroneous or malicious user or “administrative” command Detection and confinement mechanisms Safety design Fault-tolerance migration and check/restart-point Protection against critical hardware failure (cooling devices, high temperature, etc…) Single failure criterion
Dtsi/Sol CEA System Software Activities 325/02/2005VD OASIS project: a method and its supporting tools Aim of OASIS project: New rigorous and reliable method to design and implement safety-critical multitasking applications compound of advanced real time functionalities To guarantee responses in specified times To ensure predictable and reproducible behaviors The application behavior is deterministic Code Generation Tools Suite: Compiler of a semi-formal language that includes the ANSI C Complete application code and runtime code generation Automatic sizing of all communication buffers Dedicated memory segmentation of all binaries (data and instructions) Dedicated link edition, with MMU tables optionally Additional tools for testing, CPU sizing and actual simulation (on POSIX) Safety-oriented real time Kernel: Current naked targets are 68040/60 or IA32 Generic, time-triggered et safety-oriented Ensure the detection and the confinement of anomalies The execution is actually real time and deterministic Errors in a multitasking system have deterministic impact
Dtsi/Sol CEA System Software Activities 425/02/2005VD PhDs Automatic sizing of distributed architecture and its mathematical proof: passed by Ch. Aussaguès, 1998, 11/12 Static analysis of dynamic behaviors thanks to automatic abstraction and model-checking: passed by I. Quéteuil, 2001, 29/01 Calculation of safe upper bounds of execution duration with cache memory: passed by S. Louise 2002, 21/01 Design, study and implementation of a fault-tolerant predictable synchronous communication protocol over off- the-shelf components: by D. Chabrol, in progress since 2003, 10/03 => OASIS version for multiprocessor architecture
Dtsi/Sol CEA System Software Activities 525/02/2005VD Industrialization success Supercomputer: Delivered to final users and in operation 24h/24 for 2 years No system software failure recorded OASIS with AREVA/Framatome-ANP: Industrialization planned mid2003-mid2005 « Qualified Display System OASIS based » project A part of 1E safety-classified I&C system of the EPR (European pressurized reactor) qualification of all OASIS tools and its system software Full management of a standard IA32/PC-AT board PS/2, GPU, touch screen, FlashIDE, Ethernet Control of Chipsets (E7501, etc.) OASIS with SMIE (a French SME leading the world market of crane anti-collision system) Initialization achieved in early 2003