WABI: Workshop on Algorithms in Bioinformatics

Slides:



Advertisements
Similar presentations
Andrew Meade School of Biological Sciences.
Advertisements

An Algorithm for Constructing Parsimonious Hybridization Networks with Multiple Phylogenetic Trees Yufeng Wu Dept. of Computer Science & Engineering University.
A Separate Analysis Approach to the Reconstruction of Phylogenetic Networks Luay Nakhleh Department of Computer Sciences UT Austin.
1 An Adaptive GA for Multi Objective Flexible Manufacturing Systems A. Younes, H. Ghenniwa, S. Areibi uoguelph.ca.
Lauritzen-Spiegelhalter Algorithm
Student : Mateja Saković 3015/2011.  Genetic algorithms are based on evolution and natural selection  Evolution is any change across successive generations.
DCJUC: A Maximum Parsimony Simulator for Constructing Phylogenetic Tree of Genomes with Unequal Contents Zhaoming Yin Bader-Polo Joint Group Meeting, Nov.
GENE TREES Abhita Chugh. Phylogenetic tree Evolutionary tree showing the relationship among various entities that are believed to have a common ancestor.
Summer Bioinformatics Workshop 2008 Comparative Genomics and Phylogenetics Chi-Cheng Lin, Ph.D., Professor Department of Computer Science Winona State.
Combinatorial Pattern Matching CS 466 Saurabh Sinha.
High-Performance Algorithm Engineering for Computational Phylogenetics [B Moret, D Bader] Kexue Liu CMSC 838 Presentation.
CISC667, F05, Lec14, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) Phylogenetic Trees (I) Maximum Parsimony.
5 - 1 Chap 5 The Evolution Trees Evolutionary Tree.
Chapter 11: Limitations of Algorithmic Power
Algorithm Animation for Bioinformatics Algorithms.
BNFO 602 Phylogenetics Usman Roshan. Summary of last time Models of evolution Distance based tree reconstruction –Neighbor joining –UPGMA.
. Comput. Genomics, Lecture 5b Character Based Methods for Reconstructing Phylogenetic Trees: Maximum Parsimony Based on presentations by Dan Geiger, Shlomo.
On the Task Assignment Problem : Two New Efficient Heuristic Algorithms.
FPGA Acceleration of Gene Rearrangement Analysis Jason D. Bakos Dept. of Computer Science and Engineering University of South Carolina Columbia, SC USA.
1 Genome Rearrangements João Meidanis São Paulo, Brazil December, 2004.
Phylogenetic Tree Construction and Related Problems Bioinformatics.
Phylogenetic trees Sushmita Roy BMI/CS 576
Novel computational methods for large scale genome comparison PhD Director: Dr. Xavier Messeguer Departament de Llenguatges i Sistemes Informàtics Universitat.
Phylogenetic analyses Kirsi Kostamo. The aim: To construct a visual representation (a tree) to describe the assumed evolution occurring between and among.
Charalampos (Babis) E. Tsourakakis WABI 2013, France WABI '131.
1 Generalized Tree Alignment: The Deferred Path Heuristic Stinus Lindgreen
Binary Encoding and Gene Rearrangement Analysis Jijun Tang Tianjin University University of South Carolina (803)
PHYLOGENETIC TREES Dwyane George February 24,
Genomics Lecture 8 By Ms. Shumaila Azam. 2 Genome Evolution “Genomes are more than instruction books for building and maintaining an organism; they also.
Computational Biology, Part D Phylogenetic Trees Ramamoorthi Ravi/Robert F. Murphy Copyright  2000, All rights reserved.
PERFORMANCE ANALYSIS cont. End-to-End Speedup  Execution time includes communication costs between FPGA and host machine  FPGA consistently outperforms.
RESOURCES, TRADE-OFFS, AND LIMITATIONS Group 5 8/27/2014.
Sorting by Cuts, Joins and Whole Chromosome Duplications
CS 173, Lecture B August 25, 2015 Professor Tandy Warnow.
1 Approximate Algorithms (chap. 35) Motivation: –Many problems are NP-complete, so unlikely find efficient algorithms –Three ways to get around: If input.
Identification of Copy Number Variants using Genome Graphs
Introduction to Phylogenetic trees Colin Dewey BMI/CS 576 Fall 2015.
1 Alignment Matrix vs. Distance Matrix Sequence a gene of length m nucleotides in n species to generate an… n x m alignment matrix n x n distance matrix.
Subtree Prune Regraft & Horizontal Gene Transfer or Recombination.
Phylogenetic Trees - Parsimony Tutorial #13
Finding Motifs Vasileios Hatzivassiloglou University of Texas at Dallas.
Hierarchical clustering approaches for high-throughput data Colin Dewey BMI/CS 576 Fall 2015.
CSE280Stefano/Hossein Project: Primer design for cancer genomics.
Recombination and Pedigrees Genealogies and Recombination: The ARG Recombination Parsimony The ARG and Data Pedigrees: Models and Data Pedigrees & ARGs.
Algorithm Complexity is concerned about how fast or slow particular algorithm performs.
Studying and Manipulating Genomes
P & NP.
Bioinformatics Overview
Gil McVean Department of Statistics
394C, Spring 2012 Jan 23, 2012 Tandy Warnow.
Distance based phylogenetics
Original Synteny Vincent Ferretti, Joseph H. Nadeau, David Sankoff, 1996 Presented by: Suzy Sun.
Section 1: Mutation and Genetic Change
Approximate Algorithms (chap. 35)
Pipelines for Computational Analysis (Bioinformatics)
Genome Rearrangement and Duplication Distance
European Symposium on Algorithms – ESA
Character-Based Phylogeny Reconstruction
Multiple Alignment and Phylogenetic Trees
Hierarchical clustering approaches for high-throughput data
Genomes and Their Evolution
Ranking Tumor Phylogeny Trees by Likelihood
BNFO 602 Phylogenetics Usman Roshan.
BNFO 602 Phylogenetics – maximum parsimony
Mohammed El-Kebir, Gryte Satas, Layla Oesper, Benjamin J. Raphael 
Ch. 13 Genetic Engineering
Gene Tree Estimation Through Affinity Propagation
Double Cut and Join with Insertions and Deletions
What is Computer Science About? Part 2: Algorithms
JAKUB KOVÁĆ, ROBERT WARREN, MARÍLIA D.V. BRAGA and JENS STOYE
Presentation transcript:

WABI: Workshop on Algorithms in Bioinformatics Olivera Perunkovska

Summary This is the 16th Workshop on Algorithms in Bioinformatics (WABI). Held on August 22-24 in Aarhus, Denmark. 56 submitted. 25 papers accepted. It covers researches in all aspects of algorithms in bioinformatics, computational biology and systems biology.

A Linear Time Approximation Algorithm for the DCJ Distance for Genomes with Bounded Number of Duplicates Large-scale mutations or rearrangements can produce complex changes and structural variations in genomes. Rearrangements can be represented by the double cut and join (DCJ) operation These operations cuts a genome in two different positions, creating four open ends, and joins these open ends in a different way The minimum number of rearrangements that transform one genome into the other and to find the rearrangement distance between two given genomes is a basic task in comparative genomics

For genomes without duplicate genes, there are linear time algorithms to compute the distance allowing only DCJ operations Computing the DCJ distance between two genomes with the same content and possibly duplicate genes, with some restriction. This paper describing a linear time approximation algorithm with approximation factor O(k), where k is the maximum number of duplicates of any gene in the input genomes

Gerbil: A Fast and Memory-Efficient k-mer Counter with GPU-Support* A basic task in bioinformatics is the counting of k-mers in genome strings. Counting of k-mers becomes a challenging problem for large instances, when its aims to be resource and time-efficient. Existing tools for k-mer counting are often optimized for k < 32 and lack good performance for larger k. Working with long sequencing reads helps to improve accuracy and counting assembly Among the first software tools that succeeded in counting the k-mers of large genome data sets was Jellysh which uses a lock-free hash table that allows parallel insertion.

Gerbil open source k-mer counting tool. High performance for large values of k. This software is the result of an extensive process of algorithm engineering that tried to bring together the best ideas from the literature. Time efficient and memory frugal. Optionally use GPUs to accelerate the counting step. For large values of k, it reduces the runtime by up to a factor of four.

Copy-Number Evolution Problems: Complexity and Algorithms Computational inference of phylogenetic trees - fundamental problem in species evolution studied for tumor. Cancer is an evolutionary process characterized by the accumulation of somatic mutations in a population of cells that form a tumor. A tumor is made of clones, which are subpopulations of cells sharing a unique combination of somatic mutations. Evolutionary history of these clones can be described by the phylogenetic trees (leaves - clones edges - mutations).

Consider 2 problems in the evolutionary analysis of copy-number profiles: Copy-Number Triplet (CN3) - linear in n Copy-Number Tree (CNT) – NP - hard CN3 aims to find a parental profile that minimizes the sum of distances to its children CNT asks to construct a phylogenetic tree whose k leaves are labeled by the k given profiles, and to assign profiles to the internal vertices so that the sum of distances over all edges is minimum.