Presentation is loading. Please wait.

Presentation is loading. Please wait.

Digital Science Center I

Similar presentations


Presentation on theme: "Digital Science Center I"— Presentation transcript:

1 Digital Science Center I
Geoffrey C. Fox, David Crandall, Judy Qiu, Gregor von Laszewski, Fugang Wang, Badi' Abdul-Wahid Saliya Ekanayake, Supun Kamburugamuva, Jerome Mitchell, Bingjing Zhang, Pulasthi Wickramasinghe, Hyungro Lee, Andrew Younge School of Informatics and Computing, Indiana University Digital Science Center Research Areas DA-MDS speedup for 200K with different optimization techniques Java K-Means 1 mil points and 1k centers performance on 16 nodes for LRT-FJ and LRT-BSP with varying affinity patterns over varying threads and processes. Digital Science Center Facilities RaPyDLI Deep Learning Environment SPIDAL Scalable Data Analytics Library and applications including Bioinformatics and Polar Remote Sensing Data Analysis MIDAS Big Data Software; Harp for HPC-ABDS Big Data Ogres Classification and Big Data Analytics Performance including communication, VM and Java overhead CloudIOT Internet of Things Environment Cloudmesh Cloud and Bare metal Automation and NIST use cases XSEDE TAS Monitoring citations and system metrics Visualization WebPlotviz Best MPI; inter and intra node Ice Layer Detection Algorithm Polar Remote Sensing Algorithms High Performance Molecular Dynamics in Cloud Infrastructure Parallel Sparse LDA using Harp The polar science community has built radars capable of surveying the polar ice sheets, and as a result, have collected terabytes of data and is increasing its repository each year as signal processing techniques improve and the cost of hard drives decrease enabling a new-generation of high resolution ice thickness and accumulation maps. Manually extracting layers from an enormous corpus of ice thickness and accumulation data is time-consuming and requires sparse hand-selection, so developing image processing techniques to automatically aid in the discovery of knowledge is of high importance. Original LDA (orange) compared to LDA exploiting sparseness (blue) Note data analytics making use of Infiniband (i.e. limited by communication!) Java code running under Harp – Hadoop plus HPC plugin Corpus: 3,775,554 Wikipedia documents, Vocabulary: 1 million words; Topics: 10k topics; BR II is Big Red II supercomputer with Cray Gemini interconnect Juliet is Haswell Cluster with Intel (switch) and Mellanox (node) infiniband (not optimized) Large potential for running MD simulations in virtualized infrastructure using KVM hypervisor Advanced HW: InfiniBand and GPUs in VMs LAMMPS MD – 1.9% overhead = near native KVM can outperform native with Huge Pages Building scalable High Performance Virtual Clusters Harp LDA on Juliet (36 core nodes)


Download ppt "Digital Science Center I"

Similar presentations


Ads by Google