Clemson NextNet SDN Use Cases for Life Sciences Research. Kuang-Ching “KC” Wang, Associate Professor, Clemson University. Sponsored by NSF grant OCI-1245936.

Presentation transcript:

Clemson NextNet SDN Use Cases for Life Sciences Research. Kuang-Ching “KC” Wang, Associate Professor, Clemson University. Sponsored by NSF grant OCI-1245936. (Slide footer: KC Wang, Clemson University, July)

Clemson NextNet: An NSF CC-NIE Project. Objectives: direct access to the I2 100G Innovation Platform; Science DMZ from anywhere, w/o manual plumbing; campus production, end-to-end support; flexible, optimized 10~40G access to resources on campus and at other universities; software-defined network (SDN).

What Is the Fuss About SDN? Network Researchers: Industry: Traditional network getting unmanageable (not about bandwidth)! [Figure labels: Traditional Network; SDN]

What Do Our (Life Sciences) Folks Need?
Two Clemson life sciences researchers in attendance today:
Alex Feltus – Associate Professor in Genetics & Biochemistry – Faculty Consultant in Clemson University Genomics Institute – Research: Rapid crop design with massive gene interaction networks
David Kwartowitz – Assistant Professor in Bioengineering – Research: Rapid processing of stereo laparoscopic data for real-time pre- and intra-surgery support
[Figure labels: Real-time medical imaging; Palmetto HPC Cluster; Data Store N …]

The Feltus Lab builds massive gene interaction networks using RNA expression profiles from next-generation sequencing (NGS) and microarray experiments. Rice (Oryza sativa). Goal: rapidly design new crop varieties for a specific environment, including “old” environments with a changed climate… Personalized Agriculture. Slide prepared by Alex Feltus.

Massive amounts of DNA/RNA/Genetic Data in Databases: 1.64 quadrillion base pairs in 5 yrs! Slide prepared by Alex Feltus.

An NGS Biomarker Example Datasets

RAW DATA (uncompressed)
5.7G  Sample_Feltus1_L006_R1.cat.fastq
5.7G  Sample_Feltus1_L006_R2.cat.fastq
5.8G  Sample_Feltus1_L007_R1.cat.fastq
5.8G  Sample_Feltus1_L007_R2.cat.fastq
6.7G  Sample_Feltus2_L006_R1.cat.fastq
6.7G  Sample_Feltus2_L006_R2.cat.fastq
6.8G  Sample_Feltus2_L007_R1.cat.fastq
6.8G  Sample_Feltus2_L007_R2.cat.fastq
6.5G  Sample_Feltus3_L006_R1.cat.fastq
6.5G  Sample_Feltus3_L006_R2.cat.fastq
6.6G  Sample_Feltus3_L007_R1.cat.fastq
6.6G  Sample_Feltus3_L007_R2.cat.fastq
7.3G  Sample_Feltus4_L006_R1.cat.fastq
7.3G  Sample_Feltus4_L006_R2.cat.fastq
7.4G  Sample_Feltus4_L007_R1.cat.fastq
7.4G  Sample_Feltus4_L007_R2.cat.fastq
5.6G  Sample_Feltus5_L006_R1.cat.fastq
5.6G  Sample_Feltus5_L006_R2.cat.fastq
5.7G  Sample_Feltus5_L007_R1.cat.fastq
5.7G  Sample_Feltus5_L007_R2.cat.fastq
8.8G  Sample_Feltus6_L006_R1.cat.fastq
8.8G  Sample_Feltus6_L006_R2.cat.fastq
8.9G  Sample_Feltus6_L007_R1.cat.fastq
8.9G  Sample_Feltus6_L007_R2.cat.fastq

PROCESSED DATA (compressed)
2.4G  Sample_Feltus1_L007_R1.MERGED.BAM
2.7G  Sample_Feltus2_L006_R1.MERGED.BAM
2.7G  Sample_Feltus2_L007_R1.MERGED.BAM
2.6G  Sample_Feltus3_L006_R1.MERGED.BAM
2.6G  Sample_Feltus3_L007_R1.MERGED.BAM
3.0G  Sample_Feltus4_L006_R1.MERGED.BAM
3.0G  Sample_Feltus4_L007_R1.MERGED.BAM
2.2G  Sample_Feltus5_L006_R1.MERGED.BAM
2.9G  Sample_Feltus6_L006_R1.MERGED.BAM
2.9G  Sample_Feltus6_L007_R1.MERGED.BAM

6 RNA Samples in Duplicate GB (raw) GB (processed) = GB of critical data files (<6 hours to process on cluster)
Does not include: intermediate processing files; reference genome (0.72 GB).
Slide prepared by Alex Feltus.
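The totals on this slide did not survive in the transcript. As a rough illustration only, the sketch below adds up the file sizes that are listed above. It assumes the trailing "G" means gigabytes and that the listing is complete; only ten BAM files appear in it, so the processed total covers just those. The names LISTED_SIZES_GB, total_gb, raw_gb, processed_gb and critical_gb are made up for this example, and the results are sums of the listed files, not the slide's own figures.

```python
# Sizes (GB) as they appear in the slide listing, grouped by sample.
# FASTQ entries are L006_R1, L006_R2, L007_R1, L007_R2 for each sample.
LISTED_SIZES_GB = {
    "fastq": {
        "Feltus1": [5.7, 5.7, 5.8, 5.8],
        "Feltus2": [6.7, 6.7, 6.8, 6.8],
        "Feltus3": [6.5, 6.5, 6.6, 6.6],
        "Feltus4": [7.3, 7.3, 7.4, 7.4],
        "Feltus5": [5.6, 5.6, 5.7, 5.7],
        "Feltus6": [8.8, 8.8, 8.9, 8.9],
    },
    # Only the BAM files present in the listing (ten files).
    "bam": {
        "Feltus1": [2.4],
        "Feltus2": [2.7, 2.7],
        "Feltus3": [2.6, 2.6],
        "Feltus4": [3.0, 3.0],
        "Feltus5": [2.2],
        "Feltus6": [2.9, 2.9],
    },
}


def total_gb(sizes_by_sample):
    """Sum every listed file size (GB) across all samples."""
    return sum(sum(sizes) for sizes in sizes_by_sample.values())


raw_gb = total_gb(LISTED_SIZES_GB["fastq"])
processed_gb = total_gb(LISTED_SIZES_GB["bam"])
critical_gb = raw_gb + processed_gb

print(f"raw FASTQ:     {raw_gb:.1f} GB")
print(f"processed BAM: {processed_gb:.1f} GB")
print(f"listed total:  {critical_gb:.1f} GB")
```

Intermediate processing files and the 0.72 GB reference genome are left out, matching the slide's "Does not include" note.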

The CUTTERS (Kwartowitz) lab is working to enable remote processing of stereo laparoscopic data for real-time feedback with surgical robot systems at partner sites (Vanderbilt, Mayo Clinic). [Map labels: Clemson, SC; Vanderbilt, TN; Mayo Clinic, MN; Palmetto HPC Cluster]

How Does It Work Today. [Diagram labels: ISP 1 Internet; ISP 2 Internet; R&E net; …; Data Center; Campus Network; Research Network; R&E net 1; G] Down the road: compliances, user-specific privileges, access control.

What Are We Building NOW

Porting GENI Research Prototype to Production. SOS: Seamless Large Data Transport. Steroid OpenFlow Service (SOS) by Aaron Rosen and KC Wang: seamless TCP throughput upgrade, e.g., 2.5 Mbps → 120 Mbps; multipath support; automatic site agent detection. Upcoming demos of SOS: NSF 12th GENI conference, Kansas City, MO; Supercomputing 2011, Seattle, WA.
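To put the 2.5 Mbps and 120 Mbps figures in terms of the datasets shown earlier, here is a back-of-the-envelope sketch of how long one 6.7 GB FASTQ file would take to move at each rate. It assumes 1 GB = 10^9 bytes, 1 Mbps = 10^6 bits per second, and a sustained rate with no protocol overhead. The function transfer_seconds and the constant names are made up for this example; this is not a model of how SOS itself works.

```python
def transfer_seconds(size_gb, rate_mbps):
    """Ideal transfer time in seconds for size_gb gigabytes at rate_mbps.

    Assumes decimal units (1 GB = 1e9 bytes, 1 Mbps = 1e6 bit/s) and a
    constant sustained rate, which real transfers rarely achieve.
    """
    bits = size_gb * 1e9 * 8
    return bits / (rate_mbps * 1e6)


FILE_GB = 6.7        # one FASTQ file from the NGS example listing
BEFORE_MBPS = 2.5    # throughput quoted on the slide before SOS
AFTER_MBPS = 120.0   # throughput quoted on the slide with SOS

before_s = transfer_seconds(FILE_GB, BEFORE_MBPS)
after_s = transfer_seconds(FILE_GB, AFTER_MBPS)
speedup = before_s / after_s

print(f"at {BEFORE_MBPS} Mbps: {before_s / 3600:.2f} hours")
print(f"at {AFTER_MBPS} Mbps: {after_s / 60:.1f} minutes")
print(f"speedup: {speedup:.0f}x")
```

Under these assumptions the speedup is simply the ratio of the two rates, so it holds for any file size.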

Condo of Condos: Connecting Campus HPC with SDN

Significance of IT Support Team to Bootstrap Researcher Use of HPC and SDN. May 2010: Galen joins CITI and begins recruiting & training users. [Chart: New Palmetto Cluster Users; axis: Number of Users]

And to Create a Transformative University: a unique coalition among academia, IT, and industrial partners within and beyond Clemson. Synergy with other university research centers: Cyberinstitute, ICAR, and Watts Innovation Center.

Synergy with Cross-Communities Momentum: Research Communities, Companies, Open Source Communities, IT Communities, Universities...

FURTHER QUESTIONS