Download presentation
Presentation is loading. Please wait.
Published byDraven Westray Modified over 10 years ago
1
HEC FSIO Workshop August 9, 2011 Communications and Protocols Active Networks and Active Object Storage John A. Chandy Department of Electrical and Computer Engineering Janardhan Singaraju, Ajith Thamarakuzhi, Cengiz Karakoyunlu, Orko Momin, Mike Runde, Paul Wortman
2
Active Storage Networks Active DisksActive Disks –Intelligence at the disk can distribute computation to parallel disks –Process data in streams –Disks only have local view of data Active Storage NetworkActive Storage Network –Network has a global view of data –Distributed caching of file system metadata and data –Redundancy optimizations Active Storage Networks HEC FSIO Workshop August 9, 2011
3
Active storage networks 1 /3 Gbps 1 Gbps An ASN is comprised of a smart switch along with intelligence embedded in the I/O network.An ASN is comprised of a smart switch along with intelligence embedded in the I/O network. Network Switches have global view of the data and can perform in-stream data reduction and transformation operations.Network Switches have global view of the data and can perform in-stream data reduction and transformation operations. ASN can enhance storage node performance as well as improve the computational performance of the parallel I/O systems.ASN can enhance storage node performance as well as improve the computational performance of the parallel I/O systems. Active Storage Networks HEC FSIO Workshop August 9, 2011
4
Network switch topology 2-dilated flattened butterfly2-dilated flattened butterfly Active Storage Networks HEC FSIO Workshop August 9, 2011 1 1 2 5 9 2 3 4 3 5 6 4 7 8 6 7 8 10 11 12 13 14 15 16
5
Hardware Implementation NetFPGA board from StanfordNetFPGA board from Stanford 4 GigE connects4 GigE connects 2 SATA connectors for node to node communication2 SATA connectors for node to node communication PCI bus for node to node communicationPCI bus for node to node communication Active Storage Networks HEC FSIO Workshop August 9, 2011
6
Active Storage Networks Application operationsApplication operations –Reduction operations - min/max, k-means clustering, search –Transformational operations – streaming, sort, File System OperationsFile System Operations –Locking –Redundancy optimizations Active Storage Networks HEC FSIO Workshop August 9, 2011
7
Parallelization techniques Functional units are re-used on reaching the reconfigurable hardware area limits.Functional units are re-used on reaching the reconfigurable hardware area limits. Data level parallelism by distributing the data to several functional units in several switch elements.Data level parallelism by distributing the data to several functional units in several switch elements. Functional level parallelism by distributing functions to several elements.Functional level parallelism by distributing functions to several elements. Active Storage Networks HEC FSIO Workshop August 9, 2011
8
K-means clustering Active Storage Networks HEC FSIO Workshop August 9, 2011
9
Runtime per iteration Active Storage Networks HEC FSIO Workshop August 9, 2011
10
Data search Active Storage Networks HEC FSIO Workshop August 9, 2011
11
Kmin/Kmax Active Storage Networks HEC FSIO Workshop August 9, 2011
12
Data sort Active Storage Networks HEC FSIO Workshop August 9, 2011
13
Redundancy optimizations Active Storage Networks HEC FSIO Workshop August 9, 2011 RAIDed files, parity calculated in switchRAIDed files, parity calculated in switch
14
File locking Lock table in switchLock table in switch Active Storage Networks HEC FSIO Workshop August 9, 2011
15
Active Storage Networks HEC FSIO Workshop August 9, 2011 Active Storage Networks Lessons LearnedLessons Learned –Hardware design is hard –HW Libraries can help –ASNs make most sense for reductions –Storage systems optimizations show promise What needs to get doneWhat needs to get done –Better HW design –Application and FS hooks –When to do ASN and when to do SW?
16
Active Storage Networks HEC FSIO Workshop August 9, 2011 Active Object Storage Active DisksActive Disks –Intelligence at the disk can distribute computation to parallel disks Active Object Storage for Parallel File SystemsActive Object Storage for Parallel File Systems –Active Disks for OSDs –Use Active Storage to improve parallel file system performance –Use Active Storage to improve parallel file system reliability –Application aware storage and autonomic storage using active OSDs.
17
Active Disks Can we use OSDs to make Active Disks a reality?Can we use OSDs to make Active Disks a reality? –Application-aware storage Object attributes can give hints to the diskObject attributes can give hints to the disk Application specificApplication specific –Parallel File Systems Felix et al. added a filtering layer to Lustre to provide active processingFelix et al. added a filtering layer to Lustre to provide active processing –T10 OSD? Active Storage Networks HEC FSIO Workshop August 9, 2011
18
Active Storage Networks HEC FSIO Workshop August 9, 2011 Active Disks using OSD Previous ImplementationPrevious Implementation –Based on disc-osd –Object-oriented (Java) Attach object types to storage objectsAttach object types to storage objects Define methods for object typesDefine methods for object types New ImplementationNew Implementation –Based on osc-osd (supported by Panasas) –RPC - Call functions on OSD remotely –Execute Engines – C, Java, Python, etc.
19
Active Storage Networks HEC FSIO Workshop August 9, 2011 Active Disks using OSD How do you move code from client to target within OSD framework?How do you move code from client to target within OSD framework? –Create an object with the code –Each function object has a special attribute that defines the type of associated execute engine –OSD can support multiple execute engines
20
Active Storage Networks HEC FSIO Workshop August 9, 2011 Active Disks using OSD How do you execute the method remotely within the OSD framework?How do you execute the method remotely within the OSD framework? –New EXECUTE FUNCTION command so that we can invoke a function –We use the CDB continuation to specify the parameters –Results (if any) returned directly or written to a new object From T10/08-185r5 changes to OSD-2
21
Active Storage Networks HEC FSIO Workshop August 9, 2011 Active Disks using OSD Client OSD iSCSI OSD Target C Engine Java Engine Function Object.so,.class
22
Active Storage Networks HEC FSIO Workshop August 9, 2011 Active Disks using OSD Status:Status: –C and Java engines complete –Python engine soon –OrangeFS support for OSDs
23
Summary Active storage networksActive storage networks –Improves performance of computation kernels –Useful in parallel file system optimizations Active storage for improved file system performanceActive storage for improved file system performance Acknowledgements: NSF CCF-0621448, CCF- 093787Acknowledgements: NSF CCF-0621448, CCF- 093787 Active Storage Networks HEC FSIO Workshop August 9, 2011
24
Communication and Protocols Active Storage Networks HEC FSIO Workshop August 9, 2011 Coherence schemesCoherence schemes Scalable abstractions for scientific dataScalable abstractions for scientific data Scalable replication, relocation, failure detection, and fault toleranceScalable replication, relocation, failure detection, and fault tolerance Topology aware storage layoutTopology aware storage layout Wide area storage access protocolsWide area storage access protocols Cloud storage?Cloud storage? Inter-stack communication?Inter-stack communication? Memory hierarchy?Memory hierarchy?
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.