Express5800/ft series servers Product Information Fault-Tolerant General Purpose Servers.

Slides:



Advertisements
Similar presentations
1/17/20141 Leveraging Cloudbursting To Drive Down IT Costs Eric Burgener Senior Vice President, Product Marketing March 9, 2010.
Advertisements

Clustering Technology For Scaleability Jim Gray Microsoft Research
NAS vs. SAN 10/2010 Palestinian Land Authority IT Department By Nahreen Ameen 1.
Chapter 4 Infrastructure as a Service (IaaS)
Introduction to DBA.
Network+ Guide to Networks, Fourth Edition
High Availability Group 08: Võ Đức Vĩnh Nguyễn Quang Vũ
Oracle Data Guard Ensuring Disaster Recovery for Enterprise Data
1 © Copyright 2010 EMC Corporation. All rights reserved. EMC RecoverPoint/Cluster Enabler for Microsoft Failover Cluster.
Business Continuity and DR, A Practical Implementation Mich Talebzadeh, Consultant, Deutsche Bank
Copyright 2009 FUJITSU TECHNOLOGY SOLUTIONS PRIMERGY Servers and Windows Server® 2008 R2 Benefit from an efficient, high performance and flexible platform.
Keith Burns Microsoft UK Mission Critical Database.
Lesson 1: Configuring Network Load Balancing
Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin CHAPTER FIVE INFRASTRUCTURES: SUSTAINABLE TECHNOLOGIES CHAPTER.
1© Copyright 2011 EMC Corporation. All rights reserved. EMC RECOVERPOINT/ CLUSTER ENABLER FOR MICROSOFT FAILOVER CLUSTER.
VIRTUALIZATION AND YOUR BUSINESS November 18, 2010 | Worksighted.
Microsoft ® Application Virtualization 4.5 Infrastructure Planning and Design Series.
Virtual Network Servers. What is a Server? 1. A software application that provides a specific one or more services to other computers  Example: Apache.
High Availability Module 12.
11 SERVER CLUSTERING Chapter 6. Chapter 6: SERVER CLUSTERING2 OVERVIEW  List the types of server clusters.  Determine which type of cluster to use for.
Customer Sales Presentation Stoneware webNetwork Powered by ThinkServer.
Server Types Different servers do different jobs. Proxy Servers Mail Servers Web Servers Applications Servers FTP Servers Telnet Servers List Servers Video/Image.
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
ATIF MEHMOOD MALIK KASHIF SIDDIQUE Improving dependability of Cloud Computing with Fault Tolerance and High Availability.
Chapter 10 : Designing a SQL Server 2005 Solution for High Availability MCITP Administrator: Microsoft SQL Server 2005 Database Server Infrastructure Design.
Network+ Guide to Networks, Fourth Edition Chapter 1 An Introduction to Networking.
Express5800/ ft series Fault Tolerant Servers “Why choose a server designed to recover from failure rather than a server designed not to fail in the first.
FMEA-technique of Web Services Analysis and Dependability Ensuring Anatoliy Gorbenko Vyacheslav Kharchenko Olga Tarasyuk National Aerospace University.
Get More out of SQL Server 2012 in the Microsoft Private Cloud environment Guy BowermanMadhan Arumugam DBI208.
Module 12: Designing High Availability in Windows Server ® 2008.
1 Fault Tolerance in the Nonstop Cyclone System By Scott Chan Robert Jardine Presented by Phuc Nguyen.
Microsoft ® Official Course Module 10 Optimizing and Maintaining Windows ® 8 Client Computers.
Version 4.0. Objectives Describe how networks impact our daily lives. Describe the role of data networking in the human network. Identify the key components.
EarthLink Server Management and Monitoring Updated August 6, 2015.
Module 9: Configuring Storage
Chapter 8 Implementing Disaster Recovery and High Availability Hands-On Virtual Computing.
© 2005 Mt Xia Technical Consulting Group - All Rights Reserved. HACMP – High Availability Introduction Presentation November, 2005.
IMPROUVEMENT OF COMPUTER NETWORKS SECURITY BY USING FAULT TOLERANT CLUSTERS Prof. S ERB AUREL Ph. D. Prof. PATRICIU VICTOR-VALERIU Ph. D. Military Technical.
 Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over a network (typically the Internet). 
FireProof. The Challenge Firewall - the challenge Network security devices Critical gateway to your network Constant service The Challenge.
Clustering In A SAN For High Availability Steve Dalton, President and CEO Gadzoox Networks September 2002.
OSIsoft High Availability PI Replication
Speaker Name 00/00/2013. Solution Requirements.
VMware vSphere Configuration and Management v6
CHAPTER 7 CLUSTERING SERVERS. CLUSTERING TYPES There are 2 types of clustering ; Server clusters Network Load Balancing (NLB) The difference between the.
WINDOWS SERVER 2003 Genetic Computer School Lesson 12 Fault Tolerance.
Cloud Computing Lecture 5-6 Muhammad Ahmad Jan.
Install, configure and test ICT Networks
LHC Logging Cluster Nilo Segura IT/DB. Agenda ● Hardware Components ● Software Components ● Transparent Application Failover ● Service definition.
Virtual Machine Movement and Hyper-V Replica
By Harshal Ghule Guided by Mrs. Anita Mahajan G.H.Raisoni Institute Of Engineering And Technology.
Cofax Scalability Document Version Scaling Cofax in General The scalability of Cofax is directly related to the system software, hardware and network.
1 High-availability and disaster recovery  Dependability concepts:  fault-tolerance, high-availability  High-availability classification  Types of.
OSIsoft High Availability PI Replication Colin Breck, PI Server Team Dave Oda, PI SDK Team.
Azure Site Recovery For Hyper-V, VMware, and Physical Environments
Chapter 6: Securing the Cloud
Lab A: Planning an Installation
Scaling Network Load Balancing Clusters
Douglas Potter IBI Minneapolis User Group November 2008
High Availability 24 hours a day, 7 days a week, 365 days a year…
Managing Multi-User Databases
N-Tier Architecture.
Network Load Balancing
Maximum Availability Architecture Enterprise Technology Centre.
Introduction of Week 6 Assignment Discussion
Clustering Technology For Fault Tolerance
An Introduction to Computer Networking
Fault Tolerance Distributed Web-based Systems
Specialized Cloud Architectures
Presentation transcript:

Express5800/ft series servers Product Information Fault-Tolerant General Purpose Servers

Express5800/ft Series Servers High Availability Technologies

© NEC Corporation 2013 Page 3 Approaches to Reliability and Availability Select and combine hardware and software technologies for availability Cluster software Redundant hardware (dual modular architecture) Single server (Typical servers) Fault tolerant server Enhance availability of the system Failover across multiple servers FT server + cluster FT server cluster Continuous operation despite of hardware failures. Simplified installation and operation Enhanced HW/SW failure resilience For Large scale system with scalable nodes etc. Partially redundant hardware (e.g. HDD, PSU) Higher availability of a single server Higher availability of the system Select the best availability solution according to system requirements Enhance fault tolerance of the hardware

© NEC Corporation 2013 Page 4 FT Server and Cluster Solution Comparison Failover process Service during failure Performance enhancement Technology Resilience Aim Operation is interrupted for failover process Operation is interrupted for failover process (some several minutes to 10 minutes) Add CPU or node. Supports servers with 4 or more sockets Add CPU or node. Supports servers with 4 or more sockets EXPRESSCLUSTER FailoverFailure Cluster system Cluster system Hardware/ failures Hardware/ Software failures Failover Load balancing Achieve availability / scalability / load balancing Features load balancing as well as availability Software failure-resilient Suitable for large-scale systems (scalable nodes) Failover to other servers Continuous operation (no interruption) Add CPU Add CPU Supported apps Failover settings is required for each app. (creation of script batch files) General applications General applications No modifications needed Fault tolerant server Fault tolerant server Hardware failures Hardware failures Lockstep (CPU&MEM) and Failover (I/O) (Synchronized in normal conditions) High availability of a single server System configuration requires no app modifications Continuous operation without interruption Ideal for 24-7 systems, and Web servers Isolate faulty component CPU Memory CPU Memory Failure Isolation HDDHDD ft servers provide hardware availability and can be installed quick and easily Ft servers + EXPRESSCLUSTER solution takes advantage of both solutions

© NEC Corporation 2013 Page 5 Express5800/ft series server Express5800/ft series server Failover complete 1. Interruption (a few secs) 2. Determine failover host (a few secs to 1-2 mins) 4. Restart apps (a few secs to a few mins) 3. Takeover of cluster resources (e.g. NW settings and disks) (a few secs to 1 min) Start failover process Cluster system Failure In service Failure Failover Repair / Replace System down for a few mins to 10 mins 1. Instantaneous isolation of the faulty module Non-stop service 2. Resynchronization after replacement Recovery complete Service Intermittence Restart serviceIn service Continuous operation Processing Lockstep Processing Module #0 Module #1 Processing Replacement of faulty module Recovery Process from HW Failures Isolated faulty model

Express5800/ft Series Servers Optional Features to Increase Fault Tolerance

© NEC Corporation 2013 Page 7 Express Report Service Support Express Report Service CPU Mem HDD CPU Mem HDD Failure CPU Mem HDD CPU Mem HDD CPU Mem HDD CPU Mem HDD Isolation NEC (monitoring center) NEC Service Center Client Alert Notification Notification Hardware monitoring & detection Isolate the failed components to continue operation. Monitor hardware status at the service center. Support the system proactively to ensure continuous availability. Isolate the failed components to continue operation. Monitor hardware status at the service center. Support the system proactively to ensure continuous availability. Continuous Operation CPU Mem CPU Mem Replace HDDHDD Recovery Only the alert information will be sent out with dedicated software (secure environment) Via the internet (mail server) public line (modem connection)

© NEC Corporation 2013 Page 8 Support for Redundant Peripheral Devices Double backup configuration is supported to provide for failures during backup LTO or DAT drives are offered for selection Selection of LTO or DAT and support for redundant backup * Double backup configuration is supported to provide for failures during backup LTO or DAT drives are offered for selection A two UPS configuration provides tolerance against UPS defects* Module #1 Module #2 SAS Controller SAS Controller SAS Controller SAS Controller Backup device Backup device Backup device Backup device ft series Data is output from each module to achieve backup redundancy Both backups are created almost simultaneously * Configuration of standalone backup is also supported Module #1 Module #2 PSU ft series Uninterruptable power supply Uninterruptable power supply Uninterruptable power supply Uninterruptable power supply * Single UPS configuration is also supported. UPS is controlled through the network Connecting each UPS to separate power sources helps avoid being affected by failures of the power sources Peripheral Devices

© NEC Corporation 2013 Page 9 ft series + EXPRESSCLUSTER for Higher Availability Clusters with ft servers enhance both HW and SW availability Enhancement SW OS Apps Module #0Module #1 EXPRESSCLUSTER Software failure EXPRESSCLUSTER monitors SW Failover to secondary server ft server (secondary) ft server (primary) OS Apps Module #0Module #1 ft series server Hardware failure Highest level of availability suitable for critical systems

© NEC Corporation 2013 Page 10 Benefits of ft Series + EXPRESSCLUSTER Clusters using ft servers deliver the benefits of both solutions Express5800/ft serverCluster system (configured by normal servers) Cluster system (configured by ft servers) Function Lockstep and Failover (within a server) Failover (between multiple servers) Failover (between multiple servers) HW failure tolerance Treatment Isolate faulty module (within the server) Failover from the primary server to the secondary server Isolate faulty module within the primary server (no failover between nodes) Treatment time Instantaneous Few minutes (Depends on the time necessary to startup apps) Instantaneous SW failure tolerance Treatment - (Apps level failures can be resolved by SingleServerSafe software) Failover from the primary server to the secondary server Failover from the primary server to the secondary server Treatment time - Several minutes (Depends on the time necessary to startup apps) Several minutes (Depends on the time necessary to startup apps) Periodical maintenance (SW update) Active Upgrade enables OS patches to be applied with only short interruption Each node can be separated for upgrade Each node can be separated for upgrade Performance enhancement Add CPU Add CPU or Nodes Add CPU Apps settings General apps can be used without special modifications Takeover process is required for each app Takeover process is required for each app Enhancement SW Legend: : Excellent, : Good, : Fair

© NEC Corporation 2013 Page 11 ft server + Hyper V + EXPRESSCLUSTER Clusters configured on Hyper-V on an ft server Hyper-V 2.0 Guest OS Apps Module #0Module #1 ft server Hardware failure Guest OS Apps ft series server EXPRESSCluster Software failure EXPRESSCluster monitors SW In the event of a SW failure, the operation fails over to another guest OS High HW and SW availability for virtualized environments Enhancement SW

© NEC Corporation 2013 Page 12 OS SingleServerSafe Reboot Service Process Apps Restart ExpressCluster X SingleServerSafe SW is monitored on the ft server to automatically restart the SW in the event of a failure. SingleServerSafe (SSS) monitors the server and SW status at all times. In an event of a failure, SSS restarts the service, process, OS etc. to resume operation. The ft server and SSS in tandem can handle both HW and SW failures SW availability can be improved even for a single ft server Enhancement SW By enabling failure detection and restart/reboot, SSS helps handle a wide range of failures with a single server By using the optional monitoring function of EXPRESSCluster, SSS is capable of further detailed monitoring including the detection of stalling in data bases.

© NEC Corporation 2013 Page 13