Commodity-SC Workshop, Mar00 Cluster-based Visualization Dino Pavlakos Sandia National Laboratories Albuquerque, New Mexico.

Slides:



Advertisements
Similar presentations
University of Chicago Department of Energy The Parallel and Grid I/O Perspective MPI, MPI-IO, NetCDF, and HDF5 are in common use Multi TB datasets also.
Advertisements

Electronic Visualization Laboratory, University of Illinois at Chicago Collaborative Visualization Architecture in Scalable Adaptive Graphics Environment.
♦ Commodity processor with commodity inter- processor connection Clusters Pentium, Itanium, Opteron, Alpha GigE, Infiniband, Myrinet, Quadrics, SCI NEC.
SGI’2000Parallel Programming Tutorial Supercomputers 2 With the acknowledgement of Igor Zacharov and Wolfgang Mertz SGI European Headquarters.
1 Cplant I/O Pang Chen Lee Ward Sandia National Laboratories Scalable Computing Systems Fifth NASA/DOE Joint PC Cluster Computing Conference October 6-8,
Xingfu Wu Xingfu Wu and Valerie Taylor Department of Computer Science Texas A&M University iGrid 2005, Calit2, UCSD, Sep. 29,
Graphics Hardware and Software Architectures
SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA; SAN DIEGO IEEE Symposium of Massive Storage Systems, May 3-5, 2010 Data-Intensive Solutions.
Silicon Graphics, Inc. Poster Presented by: SGI Proprietary Technologies for Breakthrough Research Rosario Caltabiano North East Higher Education & Research.
Computer Graphics Hardware Acceleration for Embedded Level Systems Brian Murray
The Virtual Microscope Umit V. Catalyurek Department of Biomedical Informatics Division of Data Intensive and Grid Computing.
SUMS Storage Requirement 250 TB fixed disk cache 130 TB annual increment for permanently on- line data 100 TB work area (not controlled by SUMS) 2 PB near-line.
INTRODUCTION OS/2 was initially designed to extend the capabilities of DOS by IBM and Microsoft Corporations. To create a single industry-standard operating.
DisplayWall Software Architecture Yuqun Chen, Grant Wallace, Kai Li.
1/13 A MJPEG Encoder for the NOW to visualize and navigate on data- intensive scenarios José María González
DDDDRRaw: A Prototype Toolkit for Distributed Real-Time Rendering on Commodity Clusters Thu D. Nguyen and Christopher Peery Department of Computer Science.
1 Component Description Alice 3d Graphics Software Human Computer Interaction Institute Carnegie Mellon University Prepared by: Randy Pausch,
Introduction to Parallel Rendering: Sorting, Chromium, and MPI Mengxia Zhu Spring 2006.
A Pipelined Execution of Tiled Nested Loops on SMPs with Computation and Communication Overlapping Maria Athanasaki, Aristidis Sotiropoulos, Georgios Tsoukalas,
Earth Simulator Jari Halla-aho Pekka Keränen. Architecture MIMD type distributed memory 640 Nodes, 8 vector processors each. 16GB shared memory per node.
Parallel Rendering Ed Angel
Fast Isosurface Visualization on a High-Resolution Scalable Display Wall Adam Finkelstein Allison Klein Kai Li Princeton University Sponsors: DOE, Intel,
Data-centric computing with Netezza Architecture DISC reading group September 24, 2007.
NPACI: National Partnership for Advanced Computational Infrastructure August 17-21, 1998 NPACI Parallel Computing Institute 1 Cluster Archtectures and.
Parallel Graphics Rendering Matthew Campbell Senior, Computer Science
Workshop on Commodity-Based Visualization Clusters Big Data, Big Displays, and Cluster-Driven Interactive Visualization Sunday, October 27, 2002 Kenneth.
Sort-Last Parallel Rendering for Viewing Extremely Large Data Sets on Tile Displays Paper by Kenneth Moreland, Brian Wylie, and Constantine Pavlakos Presented.
Parallel Rendering 1. 2 Introduction In many situations, standard rendering pipeline not sufficient ­Need higher resolution display ­More primitives than.
SDSC RP Update TeraGrid Roundtable Reviewing Dash Unique characteristics: –A pre-production/evaluation “data-intensive” supercomputer based.
Principles of Scalable HPC System Design March 6, 2012 Sue Kelly Sandia National Laboratories Abstract: Sandia National.
Maximizing The Compute Power With Mellanox InfiniBand Connectivity Gilad Shainer Wolfram Technology Conference 2006.
Trace Generation to Simulate Large Scale Distributed Application Olivier Dalle, Emiio P. ManciniMar. 8th, 2012.
The Red Storm High Performance Computer March 19, 2008 Sue Kelly Sandia National Laboratories Abstract: Sandia National.
Seminar II: Rendering Architectures Yan Cui Love Joy Mendoza Oscar Kozlowski John Tang.
So far we have covered … Basic visualization algorithms Parallel polygon rendering Occlusion culling They all indirectly or directly help understanding.
Syzygy Design overview Distributed Scene Graph Master/slave application framework I/O Device Integration using Syzygy Scaling down: simulators and other.
Image Compositing Hardware The Metabuffer: A Scalable Multiresolution Multidisplay 3-D Graphics System Using Commodity Rendering Engines Lightning-2: A.
National Energy Research Scientific Computing Center (NERSC) Visualization Tools and Techniques on Seaborg and Escher Wes Bethel & Cristina Siegerist NERSC.
Master Program (Laurea Magistrale) in Computer Science and Networking High Performance Computing Systems and Enabling Platforms Marco Vanneschi 1. Prerequisites.
A High-Performance Scalable Graphics Architecture Daniel R. McLachlan Director, Advanced Graphics Engineering SGI.
Stream Processing Main References: “Comparing Reyes and OpenGL on a Stream Architecture”, 2002 “Polygon Rendering on a Stream Architecture”, 2000 Department.
Parallel Rendering. 2 Introduction In many situations, a standard rendering pipeline might not be sufficient ­Need higher resolution display ­More primitives.
1 Oak Ridge Interconnects Workshop - November 1999 ASCI ASCI ASCI Terascale Simulation Requirements and Deployments David A. Nowak ASCI.
Issues Autonomic operation (fault tolerance) Minimize interference to applications Hardware support for new operating systems Resource management (global.
GVis: Grid-enabled Interactive Visualization State Key Laboratory. of CAD&CG Zhejiang University, Hangzhou
Computing Environment The computing environment rapidly evolving ‑ you need to know not only the methods, but also How and when to apply them, Which computers.
1 THE EARTH SIMULATOR SYSTEM By: Shinichi HABATA, Mitsuo YOKOKAWA, Shigemune KITAWAKI Presented by: Anisha Thonour.
Hank Childs, University of Oregon Large Data Visualization.
Site Report DOECGF April 26, 2011 W. Alan Scott Sandia National Laboratories Sandia National Laboratories is a multi-program laboratory managed and operated.
MEAD: Volume Visualization David Porter, U. Minnesota/LCSE Data Pipeline A3D HVR Tiled Commas Output Windows & Unix Formats & Frameworks linking to MEAD.
Gravitational N-body Simulation Major Design Goals -Efficiency -Versatility (ability to use different numerical methods) -Scalability Lesser Design Goals.
MPEG-4: Multimedia Coding Standard Supporting Mobile Multimedia System Lian Mo, Alan Jiang, Junhua Ding April, 2001.
Interconnection network network interface and a case study.
ANU comp2110 Software Design lecture 10 COMP2110 Software Design in 2004 lecture 10 Software Architecture 2 of 2 design lecture 5 of 6 Goal of this small.
Clusters Rule! (SMPs DRUEL!) David R. White Sandia National Labs Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin.
Scientific Computing Goals Past progress Future. Goals Numerical algorithms & computational strategies Solve specific set of problems associated with.
ECG Simulation NCRR Overview Technology for the ECG Simulation project CardioWave BioPSE project background Tools developed to date Tools for the next.
CDA-5155 Computer Architecture Principles Fall 2000 Multiprocessor Architectures.
Background Computer System Architectures Computer System Software.
Electronic visualization laboratory, university of illinois at chicago Sort Last Parallel Rendering for Viewing Extremely Large Data Sets on Tile Displays.
Architecture of a platform for innovation and research Erik Deumens – University of Florida SC15 – Austin – Nov 17, 2015.
Parallel Data Laboratory, Carnegie Mellon University
So far we have covered … Basic visualization algorithms
Mapping the Data Warehouse to a Multiprocessor Architecture
Component Frameworks:
Parallel I/O System for Massively Parallel Processors
Implementation II Ed Angel Professor Emeritus of Computer Science
Physics-based simulation for visual computing applications
Implementation II Ed Angel Professor Emeritus of Computer Science
Jeremy Meredith, Lawrence Livermore National Laboratory
Presentation transcript:

Commodity-SC Workshop, Mar00 Cluster-based Visualization Dino Pavlakos Sandia National Laboratories Albuquerque, New Mexico

Commodity-SC Workshop, Mar00 High End Graphics Platforms Years/Compute Performance PC Graphics SGI Graphics Tflops 10 Tflops 100 Tflops Polygon Rendering Rate (Megapolys/Second)

Commodity-SC Workshop, Mar00 Rendering and Sorting Sort First Sort Middle Sort Last Polygon Rendering Pipe

Commodity-SC Workshop, Mar00 Tiled vs. Single / Composite Displays Renderer Display(s) Tiled Renderer Display Composite

Commodity-SC Workshop, Mar00 Data Exploration Architecture Reduced polys/data Compute Service User Work- station Simulate Compress Decompress User d  I render Vis. Service Dec/C D  D D  d D  I Archive Render Dec/C D  I Data Service ImagesPolys/ Data Data Archive Big Data

Commodity-SC Workshop, Mar00 Tightly Coupled Compute, Data Services and Visualization InfiniBand x12 Link Speed: 6 GB/s Bidirectional (3 GB/s each way) Aggregate bandwidth across vertical plane: 768 GB/s each way, with 256 (16 x 16) rows (exceeds I/O requirements) 16 rows

Commodity-SC Workshop, Mar00 Visualization/Data Service SNL Existing 16-node SGI/320 NT graphics cluster (GigE) 72 node Compaq/NT data service cluster (ServerNet) Coming 64 node graphics cluster (ASCI V1 Corridor) 8-16 node graphics cluster (open testbed)

Commodity-SC Workshop, Mar00 Cluster-based visualization issues Rendering scalability vs. interactive latency –Expect good results for rendering large data –Getting high frame rates (e.g. 60Hz) harder Dynamic resource management Desktop access to large resource Parallel/Scalable IO Parallel inter-process communication (runtime visualization & data services, computational experiments) Classified/Unclassified use

Commodity-SC Workshop, Mar00 Cluster-based Composite Rendering Simplistic Projection 16 node SGI/320 cluster Peak 4 Million polygons/sec. per node 16 x 4 = 64 Million polygons/sec. peak (perfect scaling) Assume –64 Million-Polygon surface data –Sort-last rendering (HW-accelerated) –1K x 1K Display Render 64M polygons in 1 sec. Add.84 sec. composite time (Compaq NT cluster / ServerNet) –gives 35M polygons/sec. Add.16 sec. composite time (ASCI Red) –gives 55M polygons/sec.

Commodity-SC Workshop, Mar00 Parallel Visualization Abstract Partitioning Model Data Repository/ Buffer Data Interface Simulation Code Visualization Module Data Service Module... Control Interface (Parallel Disk, Shared Memory, Dist. Memory Buffer, …) Abstract Data Space

Commodity-SC Workshop, Mar00 Do-It-All on C-Plant Instantiation Node 1 Node 2 Node N Disk 1 Disk 2 Disk N... Comp 1 Comp 2 Comp N Abstract Data Space