1 Grid vs. Peer-to-Peer Yin Chen 25 June 2003.

1 Grid vs. Peer-to-Peer Yin Chen s0231189@sms.ed.ac.uk 25 June 2003

2 Content zGrid vs. P2P zWhat’s the request zWhy P2P architecture zIssues of P2P zP2P case study- Freenet zDesign

3 Grid vs. P2P z Grid z Standards- based z Persistent z Addresses security issues z Resources are more powerful,more diverse, better connected z Data intensive z Facing problems of autonomic configuration and management z Not much scalable

4 Grid vs. P2P zP2P z Much scalability z Fault tolerance z Self-configuration z Automatic problem determination z Higher variable behaviour z But lack of infrastructure z Security problems z Less concerned with qualities of service

5 What’s the request zA user requests the car service, and keeps logs recording if the request success or fail zThe user may asks all other users about history request records. By statistic, we can know particular service responding ability. zWhich can also gives prediction of further request.

6 Why P2P zNot run-time information zBetter fault tolerance, zPull model efficient and less network traffic

7 Issues of P2P - Topology

8 Issues of P2P - Response Modes

9 Issues of P2P … It turns to be problem of query from distributed data stores, which is different from central database query …

10 Issues of P2P - Query Processing Recursively Partitionable Query

11 Issues of P2P - Abort Timeout (1) zProblems - User no longer interested in query results - Query will forever roaming the network without stop it - The query should be fade away after sometime - Static timeout remains unchanging across hops zSolution ->Dynamic Abort Timeout - Nodes further away from the originator timeout earlier than nodes closer to the originator. - Decrease the timeout at each hop - Exponential decay with halving

12 Issues of P2P - Abort Timeout (2)

13 Issues of P2P - Query Scope (1) zProblems - No necessary to search the whole net - Broadcast model will flooding the network. zSolutions -> Select a neighbour subset - Search only a specific domain, host, owner - Random select half of the neighbours - In a tree-like topology, select all child and ignore all parent - Only find a single result. - Specify the maximum number of result (maxResults) and bytes (maxResultBytes) to be returned.

14 Issues of P2P - Query Scope (2) zMaintain a statistics about its neighbours. Only select neighbours that meet minimum requirements in term of latency, bandwidth or historic (maxLatency, minBandwidth, minHistoricResult) zNeighbour Selection Query zRadius of a query - is a measure of path length. - Set the maximum number of hops a query is allowed to travel - The radius is decreased by one at each hop. - The roaming query and response fade away when a radius of less than zero.

15 Issues of P2P - Routing zRandom forwarding(random walk) zLearning : nodes record the requests answered by other nodes. A request is forwarded to the peer that answered similar requests previously or randomly. zBest neighbour : records the number of answers received from each peer. A request is forwarded to the peer who answered the largest number of requests. zLearning + best neighbour : identical with the learning, when no relevant experience exists, the request is forwarded to the best neighbour.

16 P2P Case Study - Freenet zFreenet provides a file-storage service zThe network is entirely decentralised zInformation publishers and consumers are anonymous zCommunications are encrypted zFiles in the data store are encrypted

17 Adding New File zA user assigns the file a GUID key, sends an insert message, containing file identifier(GUID) and a time-to- live(TTL)value. zGUID is location - independent globally unique identifier. By hashing the contents of the file. zOn receiving an insert, the node checks if the key already exist. If not, stores it, creates a routing entry for it, looks up the closest key, and forwards the message to the related node. zIf TTL expires, the final node returns an “all clear” message. The user then sends the data alone the path.

18 Requesting File zEvery node maintains a routing table, listing addresses of other nodes and GUID keys. zOn receiving a query, it first checks its own store. If it finds the file, it announces itself as the holder. Otherwise, it forwards the query to the node with the closest key. zIf the file is found, each node passes the file alone the chain, and creates a new entry in its routing table. zEach node might also cache a copy locally. zThe query maintains a TTL, decreased at each hop. zIf a node runs out of candidates, it reports failure and back the its predecessor, which then tries its second choice

19 Adding New Node zNew node sends a announcement to an existing node, with a TTL. zThe receiving node forwards the announcement to another node chosen randomly from its routing table. zThe announcement continues to propagate until its TTL runs out.

20 Training Routes zNodes that reliably answer queries will be added to more routing tables. zWell-known nodes tend to see more requests and become better connected. zSimilar keys tend to cluster in the nodes along the same path, because requests will be for similar files which have similar keys.

21 Managing Storage zGiven finite disk space, sometime need to decide which file to keep. zFreenet decides by the frequency of requests per file, keeps the more popular files. zFrequently requested files have more copies in the network. Tree grows in that direction zUnrequested files are subjected to delete. Tree shrinks in that direction.

22 Design z Tree Topology z Each node maintains a Log File z Each node also maintains a Local Data Store for storing the queries result.

23 Design zAdding New Node - When a new node adds to the network, it connects itself to only one existing node. zAdding Log Record - When a user accesses services, a log record will be created - Log records should provide information about service name, service accessing time, success/fail flag

24 Design - Query zQuery - When a node sets up a query, it first looks up its local data store to see if the same query exists. - If it is a new query, the node multicasts a query message to all connecting nodes. The query message contains Query Conditions, Maximum Data Volume value and a Dynamic Abort Timeout(DAT) value. - Query Condition may contains time period which user concerns, services name etc.

25 Query - On receiving the query message, a node first looks up its own local data store, if there is no same query, it multicasts the query to all connecting nodes. - When DAT expires, the final node begins to return data along the chain. - Response using Routed Response mode

26 Design - Query - To reduce network traffic, calculation will operate at each node. Using Recurisively Query Plan. The calculation result will propagate up along the chain.

27 Design - Query - To avoid data flooding, only necessary volume data will be calculated, that is specified by Maximum Data Volume - Each chain will return zero or one result - Dynamic Abort Time (DAT) using Exponential decay with halving model. DAT will decrease at each hop.

28 Design zCalculation - By particular statistics methodology zShowing Result - Final result will be shown in graph style - The query result will also be saved in the Local Data Store zDeleting log records - To save disk space, early log records should be deleted after period of time

29 Grid vs. P2R Thanks !

1 Grid vs. Peer-to-Peer Yin Chen 25 June 2003.

Similar presentations

Presentation on theme: "1 Grid vs. Peer-to-Peer Yin Chen 25 June 2003."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

1 Grid vs. Peer-to-Peer Yin Chen 25 June 2003.

Similar presentations

Presentation on theme: "1 Grid vs. Peer-to-Peer Yin Chen 25 June 2003."— Presentation transcript:

Similar presentations

About project

Feedback