Unistore: A Unified Storage Architecture for Cloud Computing Wei Xie, Jiang Zhou, and Yong Chen Unistore: A Unified Storage Architecture for Cloud Computing Unistore: review of project plan Sheepdog: object storage Benchmark Based on sheepdog distributed store for virtual machine Optimization for heterogeneous storage (SSDs and HDDs) Optimization for heterogeneous workload Planned testing tools fio, dd for generating synthetic workload Real workload benchmark iostat Comparison with other product GlusterFS Ceph Object storage Workload characterization Hot/cold data detection and separation Multiple bloom filter [1] Temporal locality [2] I/O size, write/read ratio, inter-arrival time, queue depth, latency, and IOPS Online vs.. offline workload tracing and characterization Sheepdog: gateway Responsible for where to store objects, or data placement Consistent hashing Add/remove node not significantly change mapping I/O load balance How to make consistent hashing support heterogonous device? Two hash rings for HDD and SSD, respectively Planned Schedule Year: 2015 Q1: investigation and survey about Unistore Q2: characterization component development of Unistore Q3: metadata management of Unistore Q4: data distribution management of Unistore Year: 2016 Q1: VM image store and loading Q2: advanced functions of Unistore Q3: performance optimization of Unistore Q4: module integration and system benchmarking Initial Characterization Result Plan to implement online hot/cold data detection Hot data store on SSD, cold on HDD Initial result collected from OLTP I/O trace. Write_ops Write_to_HDD Write_to_SSD finished on-going to-be-done Sheepdog: component Cluster manager QEMU block Driver Object storage Gateway Object manager Deployment We are grateful to the Nimboxx and the Cloud and Autonomic Computing site at Texas Tech University for the valuable support for this project. Acknowledgements Environment 3 CentOS 6.5 virtual machines on iMac workstation Sheepdog built on the 3 virtual machine and form a cluster Use corosync to manage the cluster (can switch to zookeeper if necessary) Will migrate to a real Linux cluster later (for testing)