Using Network Processors in Genomics Herbert Bos * † Kaiming Huang * * Leiden Universiteit, Netherlands † Vrije Universiteit,

Slides:



Advertisements
Similar presentations
Employability and Bologna
Advertisements

Oct 28, 2004WPES Off-the-Record Communication, or, Why Not to Use PGP Nikita Borisov Ian Goldberg Eric Brewer.
Sales Presentation The Satellite A60 series TISB, June 2004
ERecruitment Trends, eRecruitment Trends and Techniques NAFSA Conference | Baltimore | 26 May 2004 Session Chair / Co-Presenter: Cheryl Darrup-Boychuck.
Copyright © 2004 Toshiba Corporation. All rights reserved. Please use the speaker notes in PowerPoint for additional information Enter the world of infinite.
Copyright © 2004 Toshiba Corporation. All rights reserved. Sales presentation The Satellite A50 series TISB, May 2004.
Theory and Implementation of Particle Filters
REDS: A Reconfigurable Event Dispatching Infrastructure Gianpaolo Cugola Dipartimento di Elettronica e Informazione Politecnico di Milano, Italy
Trend evaluation and comparison of the use and value of GL in core demography and computer science journals Rosa Di Cesare, Roberta Ruggieri, CNR-IRPPS.
A Direct Conversion CMOS Transceiver for IEEE802.11a WLANs
Demonstration of VO Tools and Technology Tamás Budavári Johns Hopkins University.
Spring Process Control Spring Outline 1.Optimization 2.Statistical Process Control 3.In-Process Control.
2.008 Manufacturing Systems
TBT Special Meeting On Procedures For Information Exchange, 2-3rd November MALAYSIAS EXPERIENCE IN IMPLEMENTING CODE OF GOOD PRACTICE FOR THE PREPARATION,
2/11/20041 Preparation and submission of notifications (2-3 November 2004) by Mrs. Rampaipan Nakasatis (Director) Standards Bureau 1, TISI Special Meeting.
Tim Richards & WP3 Partners NEFIS Project Meeting WP3 Summary 28 – 30 June 2005, Ispra.
ITU-T/SG 17 Mar UML for ODP system specifications Bryan Wood Convenor, JTC1/SC7/WG19 ITU-T/SG 17 Geneva, 14 March 2004.
UML and WSDL for JISC e-Learning Projects INTRODUCTION TO UML Richard Hopkins NeSC Training Team Member
A Tailorable Environment for Assessing the Quality of Deployment Architectures in Highly Distributed Settings Sam Malek and Marija Mikic-Rakic Nels Beckman.
UML and WSDL for JISC e-Learning Projects Major Practical Richard Hopkins NeSC Training Team Member
Workshop Cross-species Mapping between Anatomical Ontologies: Terminological and Structural Support by Sarah Luger, Stuart.
RCUK, Octiber Archiving research data and research publications. Dr Leslie Carr, Intelligence, Agents Multimedia, University of Southampton Dr Simon.
UKOLN is supported by: Future-proofing the Technology – Addressing the Challenge Dr Liz Lyon, UKOLN, University of Bath, UK Collaboration for Sustainability.
D. Elia, R. SantoroITS week / SPD meeting - May 12, Test beam data analysis D. Elia, R. Santoro – Bari SPD Group Alignments, plane rotation for setup.
17 May Multiple Sites. 17 May Multiple Sites This presentation assumes you are already familiar with Doors and all its standard commands It.
21 April Temp Users. 21 April Setup Temp Users Allows you to set activation and expiration dates for cards ahead of time Works only on PXL-500/510.
September 24, Fuji Electric Paperless Recorder.
September 23, Fuji Electric Ultrasonic Flowmeters.
An Advanced Shell Theory Based Tire Model by D. Bozdog, W. W. Olson Department of Mechanical, Industrial and Manufacturing Engineering The 23 rd Annual.
Multi-Resolution Homogenization of Multi-Scale Laminates: Scale Dependent Parameterization or: Homogenization procedure that retains FINITE-scale-related.
Automating Test File Creation Using Excel, UltraEdit, and Batch files to build test data.
Introducing TV Anytime Phase 1 Ronald Tol Technology Manager, Philips Digital System Laboratories Convenor TVAF STC Working Group.
Fitzkilism Production, Putting the Fun in Function By Mrs. Kiley Sandymount Elementary.
© Pearson Education Limited, Chapter 8 Normalization Transparencies.
David Evans CS200: Computer Science University of Virginia Computer Science Lecture 6: Cons car cdr sdr wdr.
Mohawk II Damage report – aug Structural damages Bulb, fishing vessel approx. 10 knots Severe damage to hull from keel to reeling 7-8 ribs broken.
10/24-10/ MWCN Theoretical Capacity of Multi-hop Wireless Ad Hoc Networks Yue Fang A.Bruce McDonald R-WIN Lab ECE Department Northeastern University.
COAT -TRIBUNALS' MODEL PRACTICE GUIDE - AIJA COAT TRIBUNALS MODEL PRACTICE MANUAL Livingston Armytage Centre for Judicial Studies
Søren Poulsen, Ørsted·DTU, Automation Technical University of Denmark NORPIE Integrating switch mode audio power amplifiers and.
Faculty of Arts University of Groningen The acquisition of the weak-strong distinction and the Dutch quantifier allemaal Erik-Jan Smits
EFIMED Advanced course on MODELLING MEDITERRANEAN FOREST STAND DYNAMICS FOR FOREST MANAGEMENT MARC PALAHI Head of EFIMED Office INDIVIDUAL TREE.
Tim Richards, Tim Green, Simo Varis EFIS Information Resource Discovery - Demonstrator (a.k.a EFIS-RD/ Metadata) 28 June 2005.
Full-System Timing-First Simulation Carl J. Mauer Mark D. Hill and David A. Wood Computer Sciences Department University of Wisconsin—Madison.
Interaction of Particles with Matter
Processing of multiple frequency test data of Traction Auto Transformer Helen Di Yu Power Systems Research Group University of Strathclyde.
MB-NG Review – 24 April 2004 Richard Hughes-Jones The University of Manchester, UK MB-NG Review High Performance Network Demonstration 21 April 2004.
1 Cluster Operating System Support For Parallel Autonomic Computing Andrzej M. Goscinski, J. Silcock, M. Hobbs School of Information Technology Deakin.
David Evans CS200: Computer Science University of Virginia Computer Science Class 38: Intractable Problems (Smiley Puzzles.
OBA Forward Air Controllers Museum A Proposal by the OV-10 Bronco Association.
Web Services Choreography Description Language (WS-CDL) Jacek Kopecký June 2004.
12/6/20041 TITLE: Basic Multicarrier EVDV operation SOURCE: Srinivasan
WB-KMV-SEPTEMBER ICT and Global Economic Growth Contribution, Impact, and Policy Implications eDevelopment Services Thematic Group, Informatics.
What Is the Council’s Role in Program Implementation? County Extension Council Training Module Missouri Council Leadership Development — a partnership.
Søren Poulsen, Ørsted·DTU, Automation Technical University of Denmark NORPIE Hysteresis Controller with constant switching frequency.
Søren Poulsen, Ørsted·DTU, Automation Technical University of Denmark NORPIE Integrating switch mode audio power amplifiers and.
22/09/20041 PM and shoebox tests  Hamamatsu 19 and 16 dynodes compared to FEU dynodes  Jyväskylä’s shoebox (T0) V0A V0C.
16 March PXL-500 Installation Site Preparation.
8 April Doors TM Set System Options. 8 April Set System Options Allows you to set certain standard Doors operating parameters and enable certain.
November 2004 MIC PropeRWeb Archetypes and Screen Representations.
Parallel BioInformatics Sathish Vadhiyar. Parallel Bioinformatics  Many large scale applications in bioinformatics – sequence search, alignment, construction.
Sequence Similarity Searching Class 4 March 2010.
1 Bio-Sequence Analysis with Cradle’s 3SoC™ Software Scalable System on Chip Xiandong Meng, Vipin Chaudhary Parallel and Distributed Computing Lab Wayne.
RAID2005 CardGuard: Towards software-based signature detection for intrusion prevention on the network card Herbert Bos and Kaiming Huang presented by.
ECE 526 – Network Processing Systems Design Network Processor Architecture and Scalability Chapter 13,14: D. E. Comer.
Construction of Substitution Matrices
Doug Raiford Phage class: introduction to sequence databases.
David Wishart February 18th, 2004 Lecture 3 BLAST (c) 2004 CGDN.
Gnort: High Performance Network Intrusion Detection Using Graphics Processors Date:101/2/15 Publisher:ICS Author:Giorgos Vasiliadis, Spiros Antonatos,
Ambika Shrestha Chitrakar Prof. Slobodan Petrovic
James Logan CS526 Dr. Chow April 29, 2009
Presentation transcript:

Using Network Processors in Genomics Herbert Bos * † Kaiming Huang * * Leiden Universiteit, Netherlands † Vrije Universiteit, Netherlands H. Bos – Leiden University 13/02/20041

Case study: BLAST ● search nucleotide/protein database for query ● BLAST discovers similarity rather than exact match ● two main phases: 1. scoring (registering where query and DNA DB match) 2. alignment (dynamic programming) ● only the first phase on NPUs H. Bos – Leiden University 13/02/20042

Window matching H. Bos – Leiden University 13/02/20043

Window matching H. Bos – Leiden University 13/02/20044

Window matching H. Bos – Leiden University 13/02/20045

Window matching H. Bos – Leiden University 13/02/20046

Window matching ● naïve approach: roughly W*N*M comparisons ● does not scale ● string search algorithms: Aho-Corasick – all windows matched at the same time – shifting genome one nucleotide at a time – matching algorithm transformed in a DFA ● DFA may be quite large H. Bos – Leiden University 13/02/20047

Aho-Corasick H. Bos – Leiden University 13/02/20048 ● Alphabet: acgt ● Window size: 3 ● Query: acgccga ● Windows: {acg,cgc,gcc,ccg,cga}

Aho-Corasick H. Bos – Leiden University 13/02/ t acg c g gc a g cc c s f(s) ● Alphabet: acgt ● Window size: 3 ● Query: acgccga ● Windows: {acg,cgc,gcc,ccg,cga}

Aho-Corasick H. Bos – Leiden University 13/02/ t acg c g gc a g cc c ● Alphabet: acgt ● Window size: 3 ● Query: acgccga ● Windows: {acg,cgc,gcc,ccg,cga} s f(s) acgcgcgccccgcga

Aho-Corasick H. Bos – Leiden University 13/02/ t acg c g gc a g cc c ● Alphabet: acgt ● Window size: 3 ● Query: acgccga ● Windows: {acg,cgc,gcc,ccg,cga} s f(s) acgcgcgccccgcga tacgcga

H. Bos – Leiden University 13/02/ Control Processor NPU (IXP1200) ME PCI Bus StrongARMMicroengines DRAM SRAM Gbps ports Pentium PCI scratch IXPBlast Architecture

H. Bos – Leiden University 13/02/ Control Processor NPU (IXP1200) ME PCI Bus StrongARMMicroengines DRAM SRAM Gbps ports Pentium PCI scratch IXPBlast Architecture

H. Bos – Leiden University 13/02/ Control Processor NPU (IXP1200) ME PCI Bus StrongARMMicroengines DRAM SRAM Gbps ports Pentium PCI scratch IXPBlast Architecture

H. Bos – Leiden University 13/02/ Control Processor NPU (IXP1200) ME PCI Bus StrongARMMicroengines DRAM SRAM Gbps ports Pentium PCI scratch IXPBlast Architecture t acg c g gc a g cc c

H. Bos – Leiden University 13/02/ Control Processor NPU (IXP1200) ME PCI Bus StrongARMMicroengines DRAM SRAM Gbps ports Pentium PCI scratch IXPBlast Architecture t acg c g gc a g cc c

H. Bos – Leiden University 13/02/ Control Processor NPU (IXP1200) ME PCI Bus StrongARMMicroengines DRAM SRAM Gbps ports Pentium PCI scratch IXPBlast Architecture t acg c g gc a g cc c

IXPBlast: packet handling ● packets read and processed in batches of ● “spilling” must be taken into account ● currently no feedback H. Bos – Leiden University 13/02/

Results ● 232 MHz IXP1200 ~ 1.8GHz Pentium-4 ● 1611 Nucleotide query (MyD88) ● 1.4 GB genome (Zebrafish) – IXP1200: 90 sec with DFA – IXP1200: 129 sec with “trie” – P4: 132: 132 sec with “trie” ● number of matches: H. Bos – Leiden University 13/02/200419

Results H. Bos – Leiden University 13/02/ Query size DNA DB size Impl.Performance GBP4132 sec GBIXP sec GB IXP1200 DFA 90 sec

Conclusions ● NPUs are useful in other application domains ● Newer hardware is expected to perform much better ● “Throughput processors” ● Adapting our current approach to use BLAST tricks/heuristics H. Bos – Leiden University 13/02/200421

Network processors ● geared for high throughput ● used exclusively in network systems ● example: intrusion detection ● similar to looking for gene on in genomes ● differences H. Bos – Leiden University 13/02/ Radisys ixp1200 board

Application domain: “Genomics” ● example: search genome for occurrence of “patterns” ● similar problems as IDS, poor performance on GPP  cannot exploit parallelism – throughput-driven – how about FPGAs? – how about clusters? ● NPU – easier to program than FPGAs – cheaper than cluster computing – “on the desktop”  IP never leaves the room H. Bos – Leiden University 13/02/200423