Lin Fang Director, Computer Platform BGI, Shenzhen Oct.10th, 2009

Presentation transcript:

BGI IT Infrastructure
Lin Fang, Director, Computer Platform, BGI, Shenzhen, Oct. 10th, 2009

Summary
Data generation
Computer capacity
Current networks
Future

Data generation
Shenzhen: 20 GAII, 76bp PE reads & 100bp PE reads, 1.2T RTA data/day
Hongkong: 8 GAII, 76bp PE reads, 500G RTA data/day
Beijing: 1 GA, 36bp SE reads, 200G raw data/day
A 4-month queue of jobs is waiting to be sequenced!
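As a rough check on the figures above, the daily output of the three sites can be added up. This is only a sketch: it assumes "T" and "G" mean decimal terabytes and gigabytes (1T = 1000G) and treats Beijing's raw data as comparable with the RTA output of the other two sites, neither of which the slide states. The names `DAILY_OUTPUT_GB` and `total_daily_output_gb` are illustrative.

```python
# Daily data volume per site in gigabytes, copied from the slide.
# Assumption: 1T = 1000G (decimal units).
DAILY_OUTPUT_GB = {
    "Shenzhen": 1200,  # 1.2T RTA data/day
    "Hongkong": 500,   # 500G RTA data/day
    "Beijing": 200,    # 200G raw data/day
}


def total_daily_output_gb(per_site):
    """Sum the per-site daily output, in gigabytes."""
    return sum(per_site.values())


total_gb = total_daily_output_gb(DAILY_OUTPUT_GB)
print(f"Total: {total_gb} GB/day ({total_gb / 1000:.1f} TB/day)")
```

Under these assumptions the three sites together produce a little under 2T of data per day.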

Computer Capacity
300 nodes
4000 CPU cores
4T memory
2.5P storage
30 Tflops peak value

Current Networks Beijing Center Hangzhou Center Shenzhen Center CSTNET 20M CNC SDH 2M Hangzhou Center CNC SDH 8M CNC 5M Shenzhen Center CSTNET 20M China Telecom Green 10M Hongkong Center China Telecom 10M CSTNET 20M PCCW 2M

Data Distributing EBI FTP 10Mb/S Aspera 9Mb/S FTP 10Mb/S Beijing Center NCBI Shenzhen Center

By the end of 2009
150bp PE reads
50Gbp/run, 500Gbp/day
5T/day FASTQ data
15T RTA data/day
Triple the sequencing machines and what will be…

Expand Computer Capacity
1000 nodes
12000 CPU cores
10P storage
100 Tflops peak value
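To put the expansion in perspective, the sketch below compares the planned figures with those on the "Computer Capacity" slide and estimates how long the planned storage would last at the 15T RTA data/day projected for the end of 2009. It assumes decimal units (1P = 1000T), a constant daily rate, and that nothing is ever deleted; memory is omitted because the expansion slide gives no figure for it. All names are illustrative.

```python
# Figures copied from the "Computer Capacity" and
# "Expand Computer Capacity" slides.
CURRENT_CAPACITY = {"nodes": 300, "cpu_cores": 4000, "storage_pb": 2.5, "peak_tflops": 30}
PLANNED_CAPACITY = {"nodes": 1000, "cpu_cores": 12000, "storage_pb": 10, "peak_tflops": 100}


def growth_factors(current, planned):
    """Return planned/current for every resource listed in `current`."""
    return {key: planned[key] / current[key] for key in current}


def days_until_full(storage_pb, daily_tb):
    """Days before storage fills at a constant daily rate (assumes 1P = 1000T)."""
    return storage_pb * 1000 / daily_tb


for name, factor in growth_factors(CURRENT_CAPACITY, PLANNED_CAPACITY).items():
    print(f"{name}: x{factor:.2f}")

days = days_until_full(PLANNED_CAPACITY["storage_pb"], daily_tb=15)
print(f"10P of storage lasts about {days:.0f} days at 15T/day")
```

On these assumptions the CPU core count triples, storage quadruples, and 10P would hold somewhat under two years of RTA output.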

Networks to be… Beijing Center Hangzhou Center Shenzhen Center CSTNET 20M Hangzhou Center 20M VPN Share CSTNET 5M VPN Between CNC & CSTNET Shenzhen Center CSTNET >20M For service & data trans. 20M VPN Share CSTNET China Telecom Green 10M For office Hongkong Center CSTNET 20M

Data Distributing to be… EBI FTP 10Mb/S Aspera 9Mb/S FTP 10Mb/S Aspera 9Mb/S NCBI Shenzhen Center

Cost and Efficiency
100K RMB/month for the internet connection, which transmits 240G/day
400 RMB per hard disk to transport 1T of data
How to distribute 15T of data every day?
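The question on this slide can be made concrete with a little arithmetic. The sketch below converts daily volumes into the sustained bandwidth they imply and compares the per-terabyte cost of the internet link with that of shipping hard disks. It assumes "G" and "T" are decimal gigabytes and terabytes, a 30-day month, and a link that runs at a constant rate around the clock; the function names are illustrative.

```python
SECONDS_PER_DAY = 24 * 60 * 60
DAYS_PER_MONTH = 30            # assumption: the slide does not define a month
HARD_DISK_RMB_PER_TB = 400     # from the slide: 400 RMB/HD to transport 1T


def sustained_mbps(gb_per_day):
    """Average rate in megabits/s needed to move gb_per_day gigabytes in one day."""
    bits_per_day = gb_per_day * 1e9 * 8
    return bits_per_day / SECONDS_PER_DAY / 1e6


def internet_rmb_per_tb(rmb_per_month, gb_per_day):
    """Cost per terabyte when a fixed monthly fee moves gb_per_day every day."""
    tb_per_month = gb_per_day * DAYS_PER_MONTH / 1000
    return rmb_per_month / tb_per_month


link_cost = internet_rmb_per_tb(100_000, 240)
print(f"240G/day needs {sustained_mbps(240):.1f} Mb/s sustained")
print(f"15T/day needs {sustained_mbps(15_000):.1f} Mb/s sustained")
print(f"Internet: {link_cost:.0f} RMB/TB, hard disk: {HARD_DISK_RMB_PER_TB} RMB/TB")
```

Under these assumptions, 240G/day corresponds to about 22 Mb/s, while 15T/day would need roughly 1.4 Gb/s sustained. The link works out to roughly 13,900 RMB per terabyte, about 35 times the hard-disk figure.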

BGI Effort
Mirroring first-class biological databases
Managing data generated by BGI
Building a bioinformatics "cloud" center

Mirrors
EnsEMBL, 6 releases now!
http://ensembl.genomics.org.cn
http://ensembl.genomics.org.cn:8050
http://ensembl.genomics.org.cn:8051
http://ensembl.genomics.org.cn:8052
http://ensembl.genomics.org.cn:8053
http://ensembl.genomics.org.cn:8054
EnsEMBL Bacteria Browser: http://bacteria.genomics.org.cn
EnsEMBL Protists Browser: http://protists.genomics.org.cn
UCSC Genome Browser, processing…: http://ucsc.genomics.org.cn

Plan
CLOUD: http://cloud.genomics.org.cn
Integrated biological data and bioinformatics tools in a single interface
One-click professional pipelines for Digital Gene Expression, RNA analysis, etc.
Flexible workflow design
Knowledge management and mining
It's FREE!

Thanks!