GRIM-Filter: Fast Seed Location Filtering in DNA Read Mapping

Slides:

Advertisements

Similar presentations

Tuning of Loop Cache Architectures to Programs in Embedded System Design Susan Cotterell and Frank Vahid Department of Computer Science and Engineering.

Advertisements

Indexing DNA Sequences Using q-Grams

Key idea: SHM identifies matching by incrementally shifting the read against the reference Mechanism: Use bit-wise XOR to find all matching bps. Then use.

Tiered-Latency DRAM: A Low Latency and A Low Cost DRAM Architecture

1 Tiered-Latency DRAM: A Low Latency and A Low Cost DRAM Architecture Donghyuk Lee, Yoongu Kim, Vivek Seshadri, Jamie Liu, Lavanya Subramanian, Onur Mutlu.

Heuristic methods for sequence alignment in practice Sushmita Roy BMI/CS 576 Sushmita Roy Sep 27 th,

Accelerating Read Mapping with FastHASH †† ‡ †† Hongyi Xin † Donghyuk Lee † Farhad Hormozdiari ‡ Samihan Yedkar † Can Alkan § Onur Mutlu † † † Carnegie.

Massively Parallel Mapping of Next Generation Sequence Reads Using GPUs Azita Nouri, Reha Oğuz Selvitopi, Özcan Öztürk, Onur Mutlu, Can Alkan Bilkent University,

SHRiMP: Accurate Mapping of Short Reads in Letter- and Colour-spaces Stephen Rumble, Phil Lacroute, …, Arend Sidow, Michael Brudno.

Accelerating Error Correction in High-Throughput Short-Read DNA Sequencing Data with CUDA Haixiang Shi Bertil Schmidt Weiguo Liu Wolfgang Müller-Wittig.

Biomolecular Computation in Virtual Test Tubes 7 th International Meeting on DNA Based Computers, p75-83, June 10-13, 2001 Max Garzon, Chris Oehmen Summarized.

On Adding Bloom Filters to Longest Prefix Matching Algorithms

U N I V E R S I T Y O F S O U T H F L O R I D A Hadoop Alternative The Hadoop Alternative Larry Moore 1, Zach Fadika 2, Dr. Madhusudhan Govindaraju 2 1.

Heuristic Methods for Sequence Database Searching BMI/CS 576 Colin Dewey Fall 2015.

Biosequence Similarity Search on the Mercury System Praveen Krishnamurthy, Jeremy Buhler, Roger Chamberlain, Mark Franklin, Kwame Gyang, and Joseph Lancaster.

Heuristic Methods for Sequence Database Searching BMI/CS 576 Colin Dewey Fall 2010.

Qq q q q q q q q q q q q q q q q q q q Background: DNA Sequencing Goal: Acquire individual’s entire DNA sequence Mechanism: Read DNA fragments and reconstruct.

Simultaneous Multi-Layer Access Improving 3D-Stacked Memory Bandwidth at Low Cost Donghyuk Lee, Saugata Ghose, Gennady Pekhimenko, Samira Khan, Onur Mutlu.

Parallelization of mrFAST on GPGPU Hongyi Xin, Donghyuk Lee Milestone II.

April 21, 2016Introduction to Artificial Intelligence Lecture 22: Computer Vision II 1 Canny Edge Detector The Canny edge detector is a good approximation.

Genome Read In-Memory (GRIM) Filter: Fast Location Filtering in DNA Read Mapping using Emerging Memory Technologies Jeremie Kim 1, Damla Senol 1, Hongyi.

Parallel Programming Models

FastHASH: A New Algorithm for Fast and Comprehensive Next-generation Sequence Mapping Hongyi Xin1, Donghyuk Lee1, Farhad Hormozdiari2, Can Alkan3, Onur.

Transparent Offloading and Mapping (TOM) Enabling Programmer-Transparent Near-Data Processing in GPU Systems Kevin Hsieh Eiman Ebrahimi, Gwangsun Kim,

UH-MEM: Utility-Based Hybrid Memory Management

Seth Pugsley, Jeffrey Jestes,

Nanopore Sequencing Technology and Tools:

1Carnegie Mellon University

Understanding Latency Variation in Modern DRAM Chips Experimental Characterization, Analysis, and Optimization Kevin Chang Abhijith Kashyap, Hasan Hassan,

Concurrent Data Structures for Near-Memory Computing

Genomic Data Clustering on FPGAs for Compression

Ambit In-Memory Accelerator for Bulk Bitwise Operations Using Commodity DRAM Technology Vivek Seshadri Donghyuk Lee, Thomas Mullins, Hasan Hassan, Amirali.

BitWarp Energy Efficient Analytic Data Processing on Next Generation General Purpose GPUs Jason Power || Yinan Li || Mark D. Hill || Jignesh M. Patel.

Paging and Segmentation

Department of Computer Science

Adaptation Behavior of Pipelined Adaptive Filters

Anne Pratoomtong ECE734, Spring2002

Rachata Ausavarungnirun, Joshua Landgraf, Vance Miller

Hasan Hassan, Nandita Vijaykumar, Samira Khan,

Genome Read In-Memory (GRIM) Filter Fast Location Filtering in DNA Read Mapping with Emerging Memory Technologies Jeremie Kim, Damla Senol, Hongyi Xin,

Yu Su, Yi Wang, Gagan Agrawal The Ohio State University

Nanopore Sequencing Technology and Tools:

Nanopore Sequencing Technology and Tools for Genome Assembly:

GateKeeper: A New Hardware Architecture

Jeremie S. Kim Minesh Patel Hasan Hassan Onur Mutlu

Chapter 9: Virtual-Memory Management

Today, DRAM is just a storage device

Genome Read In-Memory (GRIM) Filter:

Accelerator Architecture in Computational Biology and Bioinformatics, February 24th, 2018, Vienna, Austria Exploring Speed/Accuracy Trade-offs in Hardware.

Accelerating Approximate Pattern Matching with Processing-In-Memory (PIM) and Single-Instruction Multiple-Data (SIMD) Programming Damla Senol Cali1 Zülal.

Main Memory Background Swapping Contiguous Allocation Paging

Fast Sequence Alignments

Efficient Document Analytics on Compressed Data: Method, Challenges, Algorithms, Insights Feng Zhang †⋄, Jidong Zhai ⋄, Xipeng Shen #, Onur Mutlu ⋆, Wenguang.

Accelerating Approximate Pattern Matching with Processing-In-Memory (PIM) and Single-Instruction Multiple-Data (SIMD) Programming Damla Senol Cali1, Zülal.

Lecture 3: Main Memory.

Reducing DRAM Latency via

Yaohua Wang, Arash Tavakkol, Lois Orosa, Saugata Ghose,

Sahand Kashani, Stuart Byma, James Larus 2019/02/16

BIOINFORMATICS Fast Alignment

Memory Hierarchy Memory: hierarchy of components of various speeds and capacities Hierarchy driven by cost and performance In early days Primary memory.

Fourier Transform of Boundaries

Hash Functions for Network Applications (II)

A flow aware packet sampling mechanism for high speed links

RAIDR: Retention-Aware Intelligent DRAM Refresh

CSE 542: Operating Systems

By Nuno Dantas GateKeeper: A New Hardware Architecture for Accelerating Pre-Alignment in DNA Short Read Mapping Mohammed Alser §, Hasan Hassan †, Hongyi.

Apollo: A Sequencing-Technology-Independent, Scalable,

BitMAC: An In-Memory Accelerator for Bitvector-Based

SneakySnake: A New Fast and Highly Accurate Pre-Alignment Filter

Presentation transcript:

GRIM-Filter: Fast Seed Location Filtering in DNA Read Mapping using Emerging Memory Technologies Jeremie Kim1, Damla Senol1, Hongyi Xin1, Donghyuk Lee1, Saugata Ghose1, Mohammed Alser2, Hasan Hassan4,3, Oguz Ergin3, Can Alkan2 and Onur Mutlu4,1 4 1 2 3 1: Read Mapping 2: Hash Table Based Mappers 3: Problem Read Mapping: Mapping billions of DNA fragments (reads) against a reference genome to identify genomic variants Requires approximate string matching Computationally expensive alignment using quadratic-time dynamic programming algorithm Bottlenecked by memory bandwidth Three general types of read mappers: Suffix-array based mappers Hash table based mappers Hybrid Seed-and-extend procedure to map reads against a reference genome allowing e indels. They have: High sensitivity (can tolerate many errors) High comprehensiveness (can find more mappings) BUT Low speed The most recent fastest hash table based read mapper, mrFAST with FastHASH [Xin+, BMC Genomics 2013] For lower runtimes, location filters can efficiently determine whether a candidate mapping location will result in an incorrect mapping before performing the computationally expensive incorrect verification by alignment. They should be fast. 4: Our Goal Design and implement a new filter that rejects incorrect mappings before the alignment step Minimize the occurrences of unnecessary alignment Maintain high sensitivity and comprehensiveness Obtain low runtime and low false positive rate Accelerate read mapping by overcoming memory bottleneck with 3D-stacked memory and its PIM for data-intensive computation Very fast parallel operations on big data sets near memory 5: 3D-Stacked Logic-in-Memory DRAM Recent technology that tightly couples memory and logic vertically with very high bandwidth connectors. Numerous Through Silicon Vias (TSVs) connecting layers, enable higher bandwidth and lower latency and energy consumption. Customizable logic layer for application-specific accelerators. Logic layer enables fast, massively parallel operations on large sets of data, and provides the ability to run these operations near memory to alleviate the memory bottleneck. Processing in 3D-stacked memory is extremely good at accelerating embarrassingly parallel simple bit operations. 6: GRIM-Filter Mechanism GRIM-Filter is based on two key ideas: Introduce parallelism to q-gram string matching Utilize a 3D-stacked DRAM to alleviates the memory bandwidth issue of our algorithm and parallelizes most of the filter. GRIM-Filter has two main steps: Precomputation: Divide the reference genome into consecutive bins and generate existence bitvectors for each bin. Filtering Algorithm: Filter locations by quickly determining whether a read can map to a specific segment of the genome. http://www.amd.com/en-us/innovations/software-technologies/hbm 7: Bins & Bitvectors 8: GRIM-Filter Walkthrough Existence bitvectors represent the existence of all possible permutations of q-length nucleotide sequences in each bin. Bitvectors are distributed throughout the memory layers of 3D-stacked DRAM on top of the customized logic layer, which enables processing-in-memory and high parallelism. Memory Array Customized Logic 9: Results & Conclusion False Negative Rates for GRIM-Filter as error threshold varies Runtimes for GRIM-Filter as error threshold varies Baseline: mrFAST with FastHASH mapper code [Xin+, BMC Genomics 2013]. However, GRIM-Filter is fully complementary to other mappers, too. Key Results of GRIM-Filter: 5.59x-6.41x less false negative locations, and 1.81x-3.65x end-to-end speedup over the state-of-the-art read mapper mrFAST with FastHASH. We show the inherent parallelism of our filter and ease of implementation for 3D-stacked memory. There is great promise in adapting DNA read mapping algorithms to state-of-the-art and emerging memory and processing technologies. Other Results: Examined sensitivity to number of bins: 450x65536 Examined sensitivity to q-gram size: 5 Found to be the best tradeoff between memory consumption, filtering efficiency, and runtime. *In the figures shown, smaller bars indicate better performance