BNFO 615 Usman Roshan. Projects and papers An opportunity to do hands on work Proposal presentations due by end of September Papers: present at least.

Slides:



Advertisements
Similar presentations
GPU and machine learning solutions for comparative genomics Usman Roshan Department of Computer Science New Jersey Institute of Technology.
Advertisements

Reference mapping and variant detection Peter Tsai Bioinformatics Institute, University of Auckland.
Metabarcoding 16S RNA targeted sequencing
Introduction BNFO 602 Usman Roshan. Course grade Project: –Find a bioinformatics topic by Feb 5th. This can be a paper or a research question you wish.
Bioinformatics Dr. Aladdin HamwiehKhalid Al-shamaa Abdulqader Jighly Lecture 1 Introduction Aleppo University Faculty of technical engineering.
Bioinformatics for Whole-Genome Shotgun Sequencing of Microbial Communities By Kevin Chen, Lior Pachter PLoS Computational Biology, 2005 David Kelley.
Single nucleotide polymorphisms and applications Usman Roshan BNFO 601.
CIS786, Lecture 7 Usman Roshan Some of the slides are based upon material by Dennis Livesay and David.
JYC: CSM17 BioinformaticsCSM17 Week 10: Summary, Conclusions, The Future.....? Bioinformatics is –the study of living systems –with respect to representation,
Lecture 1 BNFO 240 Usman Roshan. Course overview Perl progamming language (and some Unix basics) Sequence alignment problem –Algorithm for exact pairwise.
BNFO 602 Lecture 1 Usman Roshan.
Machine Learning for Information Retrieval: Neural Networks, Symbolic Learning, and Genetic Algorithms Prerak Sanghvi Paper by: Hsinchun Chen Artificial.
Hidden Markov Models Usman Roshan BNFO 601. Hidden Markov Models Alphabet of symbols: Set of states that emit symbols from the alphabet: Set of probabilities.
Hidden Markov Models Usman Roshan BNFO 601. Hidden Markov Models Alphabet of symbols: Set of states that emit symbols from the alphabet: Set of probabilities.
Pairwise profile alignment Usman Roshan BNFO 601.
Evaluating alignments using motif detection Let’s evaluate alignments by searching for motifs If alignment X reveals more functional motifs than Y using.
BNFO 602 Lecture 2 Usman Roshan. Bioinformatics problems Sequence alignment: oldest and still actively studied Genome-wide association studies: new problem,
Seven Minute Madness: Reconfigurable Computing Dr. Jason D. Bakos.
BNFO 235 Lecture 5 Usman Roshan. What we have done to date Basic Perl –Data types: numbers, strings, arrays, and hashes –Control structures: If-else,
(1) Risk prediction by kernels and (2) Ranking SNPs Usman Roshan.
Phylogeny reconstruction BNFO 602 Roshan. Simulation studies.
BNFO 602 Phylogenetics Usman Roshan. Summary of last time Models of evolution Distance based tree reconstruction –Neighbor joining –UPGMA.
Utilizing Fuzzy Logic for Gene Sequence Construction from Sub Sequences and Characteristic Genome Derivation and Assembly.
Single nucleotide polymorphisms and applications Usman Roshan BNFO 601.
Topic : Phylogenetic Reconstruction I. Systematics = Science of biological diversity. Systematics uses taxonomy to reflect phylogeny (evolutionary history).
CIS786, Lecture 8 Usman Roshan Some of the slides are based upon material by Dennis Livesay and David.
Whole Exome Sequencing for Variant Discovery and Prioritisation
Todd J. Treangen, Steven L. Salzberg
H = -Σp i log 2 p i. SCOPI Each one of the many microbial communities has its own structure and ecosystem, depending on the body environment it exists.
Chapter 26: Phylogeny and the Tree of Life Objectives 1.Identify how phylogenies show evolutionary relationships. 2.Phylogenies are inferred based homologies.
Genome alignment Usman Roshan. Applications Genome sequencing on the rise Whole genome comparison provides a deeper understanding of biology – Evolutionary.
The use of short-read next generation sequences to recover the evolutionary histories in multi-individual samples Systematic biology presentation Yuantong.
MPL Identification of alternative spliced mRNA variants related to cancers by genome-wide ESTs alignment KIM DAE SOO Oncogene Apr.
Chapter 24: Molecular and Genomic Evolution CHAPTER 24 Molecular and Genomic Evolution.
BNFO 615 Usman Roshan. Short read alignment Input: – Reads: short DNA sequences (upto a few hundred base pairs (bp)) produced by a sequencing machine.
The metagenomics sequencing service CD Genomics. Metagenomics: Metagenomics is the study of metagenomes, genetic material recovered directly from environmental.
Big data Usman Roshan CS 675. Big data Typically refers to datasets with very large number of instances (rows) as opposed to attributes (columns). Data.
Short read alignment BNFO 601. Short read alignment Input: –Reads: short DNA sequences (upto a few hundred base pairs (bp)) produced by a sequencing machine.
CS 732: Advance Machine Learning
CS 395T: Computational phylogenetics January 18, 2006 Tandy Warnow.
N. Jacq – Bio informatics Tests WP n° 1 WP6-WP7-WP10 Biology applications on testbed 0 Laboratoire de Biologie des.
Lecture 1 BNFO 601 Usman Roshan. Course overview Perl progamming language (and some Unix basics) –Unix basics –Intro Perl exercises –Dynamic programming.
A brief guide to sequencing Dr Gavin Band Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015 Africa Centre for Health.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
BSP: An iterated local search heuristic for the hyperplane with minimum number of misclassifications Usman Roshan.
Population sequencing using short reads: HIV as a case study Vladimir Jojic et.al. PSB 13: (2008) Presenter: Yong Li.
BNFO 615 Fall 2016 Usman Roshan NJIT. Outline Machine learning for bioinformatics – Basic machine learning algorithms – Applications to bioinformatics.
BLAST BNFO 236 Usman Roshan. BLAST Local pairwise alignment heuristic Faster than standard pairwise alignment programs such as SSEARCH, but less sensitive.
Disease risk prediction
Genome alignment Usman Roshan.
Genomic Data Clustering on FPGAs for Compression
Local alignment and BLAST
BNFO 236 Smith Waterman alignment
H = -Σpi log2 pi.
Distance based phylogeny reconstruction
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
BNFO 602 Phylogenetics Usman Roshan.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
BNFO 602 Phylogenetics – maximum parsimony
Representing binary trees with lists
BNFO 602 Phylogenetics – maximum likelihood
BNFO 602 Phylogenetics Usman Roshan.
Classification.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Bionformatics – Theory and Practice
Deep Learning in Bioinformatics
The power of metagenomic read recruitment
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Integrative analysis of single-nucleus whole-genome sequencing and bulk tumor tissue enables detection of the subclonal architecture of tumors. Integrative.
General overview of the bioinformatic pipelines for the 16S rRNA gene microbial profiling and shotgun metagenomics. General overview of the bioinformatic.
Presentation transcript:

BNFO 615 Usman Roshan

Projects and papers An opportunity to do hands on work Proposal presentations due by end of September Papers: present at least one paper related to the project Python programming (R is also fine)

Projects Genome alignment Metagenomics Variant detection Sequence alignment classification Whole genome phylogeny reconstruction

Genome alignment Contemporary sequence alignment problem Hard due to complex genome evolution Data – Simulated: evolver, alignathon – Real Programs – LASTZ New alignment methods for distant genomes

Metagenomics Contemporary alignment problem Simulated data Real data Comparison of short read mappers for metagenomics New metagenomics methods

Variant detection Key in cancer genomics and understand disease Variant detection in unmapped reads – First we map them – Then look for variants

Sequence alignment classifier Popular problem in bioinformatics Plethora of sequence alignment programs available Can we create a machine learning classifier that would give us the best alignment? K-means style deep learning method

Whole genome phylogeny Phylogenies from whole genome sequences data Hard to study on real data Simulated data via evolver