1
RAIN Technology (Reliable Array of Independent Nodes)
Presented by: Ravikrishnan.P
2
Contents
- RAID flavors: RAID 0, RAID 5
- RAIN architecture
- Features of RAIN
- RAIN platform
- RAIN project goals
- Distributed storage
- Advantages
3
INTRODUCTION The name of the original research project is RAIN, which stands for Reliable Array of Independent Nodes. The RAIN technology originated in a research project at the California Institute of Technology (Caltech), in collaboration with NASA's Jet Propulsion Laboratory. It is a component that stores data across distributed processors and retrieves the data even if some of the processors fail.
4
RAID flavors
Commonly used ones: RAID 0, RAID 1, RAID 5, RAID 10.
Other types (RAID 2, 3, 4, 6, 50, ...) are used, but rarely.
5
RAID 0 RAID 0 is 'data striping': data is split into blocks written across two or more disks, which improves throughput but provides no redundancy.
6
RAID 1 RAID 1 is 'data mirroring'.
Two copies of the data are held on two physical disks, and the data is always identical.
7
RAID 5 'Distributed parity' is the key phrase here: data and parity blocks are striped across all disks, so the array survives any single disk failure.
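To make the parity idea concrete, here is a minimal sketch (not from the original slides; the block contents and layout are invented for illustration) showing how a parity block, computed as the XOR of the data blocks, lets any single lost block be rebuilt:

```python
# Minimal sketch: RAID 5 style parity is the XOR of the data blocks, so any
# single missing block can be rebuilt from the surviving blocks plus parity.
from functools import reduce

def xor_blocks(blocks):
    """XOR a list of equal-length byte blocks together."""
    return reduce(lambda x, y: bytes(a ^ b for a, b in zip(x, y)), blocks)

data_blocks = [b"AAAA", b"BBBB", b"CCCC"]   # data striped across three disks
parity = xor_blocks(data_blocks)            # parity stored on a fourth disk

# The disk holding the second block fails; rebuild it from survivors + parity.
rebuilt = xor_blocks([data_blocks[0], data_blocks[2], parity])
assert rebuilt == data_blocks[1]
```

Note that in real RAID 5 the parity blocks are rotated across all disks rather than kept on a dedicated disk; this sketch ignores the rotation.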
8
RAIN s/w architecture
Layered stack: Application → MPI/PVM → TCP/IP → RAIN → network hardware (Ethernet, Myrinet, ATM, ServerNet)
- ServerNet: error correction
- ATM: Asynchronous Transfer Mode protocol
- PVM: Parallel Virtual Machine
- MPI: Message Passing Interface
9
The RAIN layer
- Sits between TCP/IP and the network hardware (Ethernet, Myrinet, ATM, ServerNet)
- Global state sharing protocol
- Can be used on different platforms
- Provides group communication
- IP management: IP assign, IP release, pool of IPs
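The IP management bullet (assign, release, pool of IPs) can be pictured with a small, purely hypothetical sketch; the class and method names below are assumptions for illustration, not the RAIN API:

```python
# Hypothetical sketch of a virtual-IP pool: healthy nodes are assigned an
# address from the pool, and a failed node's address is released for reuse.
class IPPool:
    def __init__(self, addresses):
        self.free = list(addresses)   # unassigned addresses
        self.assigned = {}            # node name -> address

    def assign(self, node):
        if node not in self.assigned and self.free:
            self.assigned[node] = self.free.pop(0)
        return self.assigned.get(node)

    def release(self, node):
        address = self.assigned.pop(node, None)
        if address is not None:
            self.free.append(address)
        return address

pool = IPPool(["10.0.0.101", "10.0.0.102"])
print(pool.assign("nodeA"))    # -> 10.0.0.101
print(pool.release("nodeA"))   # -> 10.0.0.101, returned to the pool
```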
10
Myrinet NIC card and Myrinet switch (10 Gb/s).
11
Features of RAIN
- Communication: bundled interfaces, link monitoring, fault-tolerant interconnect topologies
- Group membership: identifies the healthy nodes that are participating in the cluster
- Data storage
12
RAIN platform
- Heterogeneous network of nodes and switches
- Computing/storage nodes with multiple network interfaces
- A variety of types of networks (bus, switched)
13
RAIN testbed
- 10 Pentium boxes with multiple NICs
- Myrinet switches; Myrinet & Ethernet networks
Come by and see our RAIN demos!
14
Proof of Concept: Video Server
Video client & server on every node. [Diagram: nodes A, B, C, D connected through two switches]
15
Data stored in disks Insufficient storage to replicate all the data on each node.
16
Node Failure
17
Node Failure
18
Node Failure Dynamically switch to another node.
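As a hypothetical illustration of "dynamically switch to another node" from a client's point of view (the node names, liveness check, and retry policy are all assumed, not taken from the RAIN video server):

```python
# Hypothetical sketch: when the preferred node fails, the client simply falls
# back to the next node that is still alive and can serve the data.
def fetch_block(block_id, nodes, is_alive):
    """Try each node in order; return (node, data) from the first healthy one."""
    for node in nodes:
        if is_alive(node):
            return node, f"block {block_id} served by node {node}"
    raise RuntimeError("no healthy node can serve this block")

nodes = ["B", "A", "C", "D"]        # B is the preferred server for this block
alive = {"A", "C", "D"}             # node B has failed
print(fetch_block(17, nodes, alive.__contains__))
# ('A', 'block 17 served by node A')
```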
19
k-of-n Code
Erasure-correcting code: each of the n columns stores one data symbol and one parity symbol ('+' denotes XOR).

Column:  1    2    3    4
Data:    a    b    c    d
Parity: d+c  d+a  a+b  b+c

The data can be recovered from any k of the n columns; for example, if columns 2 and 4 are lost:
b = (a+b) + a
d = (d+c) + c
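The arithmetic can be checked with a short sketch (assuming the column layout reconstructed above and '+' meaning XOR); with two of the four columns lost, the missing data symbols are rebuilt from the surviving data and parity:

```python
# Sketch of the slide's 2-of-4 array code with single-bit symbols; '+' is XOR.
a, b, c, d = 1, 0, 1, 1

columns = {
    1: (a, d ^ c),   # column 1 stores data a and parity d+c
    2: (b, d ^ a),   # column 2 stores data b and parity d+a
    3: (c, a ^ b),   # column 3 stores data c and parity a+b
    4: (d, b ^ c),   # column 4 stores data d and parity b+c
}

# Suppose columns 2 and 4 are erased; recover b and d from columns 1 and 3.
a_survived, parity_dc = columns[1]
c_survived, parity_ab = columns[3]
b_recovered = parity_ab ^ a_survived    # b = (a+b) + a
d_recovered = parity_dc ^ c_survived    # d = (d+c) + c
assert (b_recovered, d_recovered) == (b, d)
```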
20
Link Failure
21
Link Failure
22
Link Failure Dynamically switch to another network path.
23
Switch Failure
24
Switch Failure
25
Switch Failure Dynamically switch to another network path.
26
Node Recovery
27
Node Recovery Continuous reconfiguration (e.g., load-balancing).
28
Fault-tolerant interconnect technologies
Goal: to connect compute nodes to a network of switches in order to maximize the network's resistance to partitioning. How do you connect n nodes to a ring of n switches?
29
A ring of switches: a naïve solution. Given degree-2 nodes and degree-4 switches, how do we connect them? [Diagram legend: N = node, S = switch]
30
A ring of switches: a naïve solution.
31
A ring of switches: the naïve solution is easily partitioned.
32
Resistance to partitioning
With switches in a ring (numbered 1-8), connect nodes on diagonals: make connections as non-local as possible.
33
Resistance to partitioning
With switches in a ring (numbered 1-8), connect nodes on diagonals: make connections as non-local as possible.
34
Resistance to partitioning
- Degree-2 compute nodes, degree-4 switches; nodes connected on diagonals.
- Tolerates any 3 switch failures (optimal: for degrees 2 and 4, no construction can tolerate more switch failures while avoiding partitioning).
- Generalizes to arbitrary node/switch degrees and to other switch networks, in particular a fully connected network.
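One plausible way to realize "connections as non-local as possible" is sketched below; the offset of roughly n/2 is an assumption for illustration and is not claimed to be the exact construction from the RAIN papers:

```python
# Hedged sketch: attach each degree-2 node to its local switch and to the
# switch roughly opposite it on the ring, so connections are highly non-local.
def diagonal_connections(n):
    """Return {node: (switch, switch)} for n nodes on a ring of n switches."""
    offset = n // 2
    return {i: (i, (i + offset) % n) for i in range(n)}

for node, switches in diagonal_connections(8).items():
    print(f"node {node} -> switches {switches}")
# Each switch ends up with two node links (plus its two ring links), matching
# degree-4 switches and degree-2 nodes.
```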
35
Point-to-Point Connectivity
Is the path from node A to node B up or down? Local view: what node A sees of the network.
36
Connectivity
- Bi-directional communication: the link is seen as up or down by each node, with respect to both sends and receives.
- Mechanism: pings and time-outs. Each node sends out pings; a node may time out, deciding the link is down.
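A minimal sketch of the ping/time-out mechanism (the time-out value and data structures here are assumptions): a link to a peer is reported UP only while ping replies keep arriving within the time-out.

```python
# Minimal sketch of ping/time-out link monitoring: the link is UP only if a
# ping reply from the peer has been seen within the last TIMEOUT seconds.
import time

TIMEOUT = 2.0   # assumed time-out, in seconds

class LinkMonitor:
    def __init__(self):
        self.last_reply = {}              # peer -> time of last ping reply

    def record_reply(self, peer):
        self.last_reply[peer] = time.time()

    def link_state(self, peer):
        last = self.last_reply.get(peer)
        if last is None or time.time() - last > TIMEOUT:
            return "DOWN"                 # timed out: decide the link is down
        return "UP"

monitor = LinkMonitor()
monitor.record_reply("nodeB")
print(monitor.link_state("nodeB"))   # UP right after a reply
time.sleep(TIMEOUT + 0.1)
print(monitor.link_state("nodeB"))   # DOWN once the time-out has elapsed
```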
37
Group membership
- Goal: a consistent global view of the group (e.g. the membership list ABCD) from local, point-to-point connectivity information.
- Must handle link/node failures and dynamic reconfiguration.
- Crucial for distributed services/computation: who's alive? Who's performing what?
38
Group membership: token-ring based group membership protocol
Our solution is simple yet effective: begin with an initial ordering of the nodes into a ring.
39
Group membership: token-ring protocol
The token carries the group membership list and a sequence number; the sequence number is updated at every node. (Token '1: ABCD' circulates around the ring A-B-C-D.)
40
Group membership: token-ring protocol
The token '1: ABCD' passes node A, which records sequence number 1.
41
Group membership: token-ring protocol
Node B updates the token to '2: ABCD' and passes it on.
42
Group membership: token-ring protocol
Node C updates the token to '3: ABCD' and passes it on.
43
Group membership: token-ring protocol
Node D updates the token to '4: ABCD' and passes it on.
44
Group membership: token-ring protocol
The token returns to node A with sequence number 5; each node keeps the last sequence number it saw (A:5, B:2, C:3, D:4).
45
Group membership: node or link fails.
46
Group membership: node or link fails (node B becomes unreachable).
47
Group membership: node or link fails; the token cannot reach node B.
48
Group membership: node or link fails; the token cannot reach node B.
49
Group membership: node or link fails. If a node is inaccessible, it is excluded and bypassed (the token becomes '5: ACD').
50
Group membership: if a node is inaccessible, it is excluded and bypassed. The token '6: ACD' continues around the reduced ring.
51
Group membership: if a node is inaccessible, it is excluded and bypassed; the token continues around the reduced ring A-C-D.
52
Group membership: if a node is inaccessible, it is excluded and bypassed; the token continues around the reduced ring A-C-D.
53
Group membership: the node holding the token fails.
54
Group membership: the node holding the token fails (node D goes down while holding the token).
55
Group membership: the node holding the token fails; the remaining nodes detect that the token is missing.
56
Group membership: node with token fails. If the token is lost, it is regenerated.
57
Group membership: if the token is lost, it is regenerated.
58
Group membership: if the token is lost, it is regenerated; two candidate tokens may briefly coexist (here '5: ACD' and '6: AD').
59
Group membership: if the token is lost, it is regenerated; the highest sequence number prevails (here '6: AD' wins over '5: ACD').
60
Group membership: the regenerated token continues around the ring with sequence number 7.
61
Group membership: node recovers.
62
Group membership: node recovers; recovering nodes are added.
63
Group membership: recovering nodes are added (token '7: ADC').
64
Group membership: recovering nodes are added (token '8: ADC').
65
Group membership: recovering nodes are added (token '9: ADC').
66
Group membership: recovering nodes are added.
67
Group membership features: dynamic reconfiguration.
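Pulling the walk-through above together, here is a hedged sketch of the token-ring membership idea (the data structures and function names are assumptions, not the RAIN implementation): the token carries the membership list and a sequence number, unreachable nodes are excluded and bypassed, and if two tokens coexist after a regeneration the one with the highest sequence number prevails.

```python
# Hedged sketch of the token-ring membership protocol described above.
class Token:
    def __init__(self, members, seq=0):
        self.members = list(members)   # group membership list carried by the token
        self.seq = seq                 # sequence number, updated at every node

def advance(token, current, reachable):
    """Pass the token from `current` to the next reachable member of the ring,
    excluding and bypassing any member that cannot be reached."""
    while True:
        ring = token.members
        nxt = ring[(ring.index(current) + 1) % len(ring)]
        if nxt == current:             # no other member is left in the group
            return current
        if nxt in reachable:
            token.seq += 1             # sequence number updated at every node
            return nxt
        token.members = [n for n in ring if n != nxt]   # exclude and bypass

def resolve(token_a, token_b):
    """If two tokens coexist after regeneration, the highest sequence number prevails."""
    return token_a if token_a.seq >= token_b.seq else token_b

# Example: D has failed while the token (seq 5) sits at C in the ring A-B-C-D.
token = Token(["A", "B", "C", "D"], seq=5)
holder = advance(token, "C", reachable={"A", "B", "C"})
print(holder, token.members, token.seq)   # A ['A', 'B', 'C'] 6
```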
68
Distributed storage [Diagram: data spread across multiple disks]
69
Distributed storage Focus: reliability and performance. [Diagram: data bits striped across multiple disks]
70
RAIN project: goals
- Identify and develop the key building blocks for efficient, reliable distributed computing and storage systems: networks, communication, storage, APIs.
- Reliability, performance, functionality.
- Distributed solutions that work (applications layered on storage, communication, and networks).
71
Advantages
- There is no limit on the size of a RAIN cluster.
- There is no concept of a master-slave relationship.
- A RAIN cluster can tolerate multiple node failures.
- It is highly efficient in traffic management.
- New nodes can be added to the cluster to participate in load sharing.
72
Conclusion
- High-availability video server
- High-availability web server
- Distributed checkpointing mechanism
73
Thank you. Any queries?