Next generation sequencing platforms Applications

Slides:



Advertisements
Similar presentations
Molecular Biomedical Informatics Machine Learning and Bioinformatics Machine Learning & Bioinformatics 1.
Advertisements

High throughput sequencing Barbera van Schaik
Functional Genomics with Next-Generation Sequencing
The Past, Present, and Future of DNA Sequencing
Genome Sequence & Gene Expression Chromatin & Nuclear Organization Chromosome Inheritance & Genome Stability.
 Sequencing technology › Roche/454 GS-FLX (‘454’) › Illumina  Prokaryotic profiling › De novo genome sequencing › Metagenomics › SNP profiling › Species.
Bioinformatics Lectures at Rice
Understanding the Human Genome: Lessons from the ENCODE project
Next-generation sequencing
Next-generation sequencing – the informatics angle Gabor T. Marth Boston College Biology Department AGBT 2008 Marco Island, FL. February
What Is Genomics? Genomics is the study of how the entire genome of a species functions as a unit and evolves over time. It is the study of life’s blueprint,
Next-generation sequencing and PBRC. Next Generation Sequencer Applications DeNovo Sequencing Resequencing, Comparative Genomics Global SNP Analysis Gene.
Greg Phillips Veterinary Microbiology
10 Genomics, Proteomics and Genetic Engineering. 2 Genomics and Proteomics The field of genomics deals with the DNA sequence, organization, function,
1 Next Generation Sequencing Itai Sharon November 11th, 2009 Introduction to Bioinformatics.
CS 374: Relating the Genetic Code to Gene Expression Sandeep Chinchali.
High Throughput Sequencing
Department of Bioinformatics and Computational Biology
CS 6293 Advanced Topics: Current Bioinformatics
The impact of next-generation sequencing technology of genetics Elaine R. Mardis – 11 February Washington School of Medicine, Genome Sequencing Center.
Next Now-Generation Genomics: methods and applications for modern disease research Aaron J. Mackey, Ph.D. Center for Public Health.
Next generation sequencing Xusheng Wang 4/29/2010.
Dr Katie Snape Specialist Registrar in Genetics St Georges Hospital
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
AP Biology Ch. 20 Biotechnology.
Mapping protein-DNA interactions by ChIP-seq Zsolt Szilagyi Institute of Biomedicine.
CEITEC BRNO | CZECH REPUBLIC central european institute of technology CEITEC Genomics and proteomics at MU Jiří Fajkus.
Todd J. Treangen, Steven L. Salzberg
CO 10.
The Genome is Organized in Chromatin. Nucleosome Breathing, Opening, and Gaping.
NEXT – GEN SEQUENCING TECHNIQUES
Massive Parallel Sequencing
Genomics and High Throughput Sequencing Technologies: Applications Jim Noonan Department of Genetics.
Next Generation Sequencing and its data analysis challenges Background Alignment and Assembly Applications Genome Epigenome Transcriptome.
P. Tang ( 鄧致剛 ); RRC. Gan ( 甘瑞麒 ); PJ Huang ( 黄栢榕 ) Bioinformatics Center, Chang Gung University. Genome Sequencing Genome Resequencing De novo Genome.
High throughput sequencing: informatics & software aspects Gabor T. Marth Boston College Biology Department BI543 Fall 2013 January 29, 2013.
Chromatin Immunoprecipitation DNA Sequencing (ChIP-seq)
Literature reviews revised is due4/11 (Friday) turn in together: revised paper (with bibliography) and peer review and 1st draft.
I519 Introduction to Bioinformatics, Fall, 2012
How will new sequencing technologies enable the HMP? Elaine Mardis, Ph.D. Associate Professor of Genetics Co-Director, Genome Sequencing Center Washington.
Next Generation Sequencing
Copy Number Variation Eleanor Feingold University of Pittsburgh March 2012.
Eukaryotic Genomes  The Organization and Control of Eukaryotic Genomes.
SEQUENCING – THE BENCHTOPS. Roche 454 Junior Same technology as 454 FLX Read length: 400 bases Paired-end 100,000 reads 12 hours (instrument time) Output.
RNA-Seq Primer Understanding the RNA-Seq evidence tracks on the GEP UCSC Genome Browser Wilson Leung08/2014.
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
Analysis of protein-DNA interactions with tiling microarrays
KEY CONCEPT Biotechnology relies on cutting DNA at specific places.
Recombination breakpoints Family Inheritance Me vs. my brother My dad (my Y)Mom’s dad (uncle’s Y) Human ancestry Disease risk Genomics: Regions  mechanisms.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
Trends Biomedical Science
Lecture-5 ChIP-chip and ChIP-seq
Biases in RNA-Seq data. Transcript length bias Two transcripts of length 50 and 100 have the same abundance in a control sample. The expression of both.
STAT115 STAT225 BIST512 BIO298 - Intro to Computational Biology.
BIOL 433 Plant Genetics Term 2, Instructors: Dr. George Haughn Dr. Ljerka Kunst BioSciences 2239BioSciences Tel
Notes: Human Genome (Right side page)
Using public resources to understand associations Dr Luke Jostins Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015.
KEY CONCEPT DNA sequences of organisms can be changed.
Different microarray applications Rita Holdhus Introduction to microarrays September 2010 microarray.no Aim of lecture: To get some basic knowledge about.
Enhancers and 3D genomics Noam Bar RESEARCH METHODS IN COMPUTATIONAL BIOLOGY.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
YOUR FUTURE STARTS WITH HOPE YOUR FUTURE STARTS WITH HOPE Genome Biology & Applied Bioinformatics Human Genome Mehmet Tevfik DORAK, MD PhD.
Next-generation sequencing technology
Next generation sequencing
Epigenetics 04/04/16.
BIOL 433 Plant Genetics Term 2,
Themes of Biology Chapter 1
Next-generation sequencing technology
BIOL 433 Plant Genetics Term 2,
Next-generation DNA sequencing
Presentation transcript:

Next generation sequencing platforms Applications Anna Esteve Codina

Sanger vs NGS ‘Sanger sequencing’ has been the only DNA sequencing method for 30 years but… …hunger for even greater sequencing throughput and more economical sequencing technology… NGS has the ability to process millions of sequence reads in parallel rather than 96 at a time (1/6 of the cost) Objections: fidelity, read length, infrastructure cost, handle large volum of data .

Sequencing of the human genome Public Consortium Many years of hard work More than 20.000 BAC clones Each containing about 100kb fragment Together provided a tiling path through each human chromosome Amplification in bacterial culture Isolation, select pieces about 2-3 kb Subcloned into plasmid vectors, amplification, isolation recreate contigs Refinement, gap closure, sequence quality improvement (less 1 error/ 40.000 bases) BAC based approaches toward WGS

Platforms Roche/454 FLX: 2004 Illumina Solexa Genome Analyzer: 2006 Applied Biosystems SOLiDTM System: 2007 Helicos HeliscopeTM : recently available Pacific Biosciencies SMRT: launching 2010

Roche 454 technology

Illumina Solexa

454 vs Solexa Homopolymers (AAAAA..) Read length: 400 bp Number of reads: 400.000 Per-base cost greater Novo assembly, metagenomics Read length: 40 bp Number of reads: millions Per-base cost cheaper Ideal for application requiring short reads: ncRNA

Applications Ancient DNA DNA mixtures from diverse ecosystems, metagenomics Resequencing previously published reference strains Identification of all mutations in an organism Errors in published literature Expand the number of available genomes Comparative studies Deciphering cell’s transcripts at sequence level without knowledge of the genome sequence Sequencing extremely large genomes, crop plants Detection of cancer specific alleles avoiding traditional cloning Chip-seq: interactions protein-DNA Epigenomics Detecting ncRNA Genetic human variation : SNP, CNV (diseases)

Ancient Genomes Resurrected Degraded state of the sample  mitDNA sequencing Nuclear genomes of ancient remains: cave bear, mommoth, Neanderthal (106 bp ) Problems: contamination modern humans and coisolation bacterial DNA

Elucidating DNA-protein interactions through chromoatin immunoprecipitation sequencing Key part in regulating gene expression Chip: technique to study DNA-protein interaccions Recently genome-wide ChIP-based studies of DNA-protein interactions Readout of ChIP-derived DNA sequences onto NGS platforms Insights into transcription factor/histone binding sites in the human genome Enhance our understanding of the gene expression in the context of specific environmental stimuli

Discovering noncoding RNAs ncRNA presence in genome difficult to predict by computational methods with high certainty because the evolutionary diversity Detecting expression level changes that correlate with changes in environmental factors, with disease onset and progression, complex disease set or severity Enhance the annotation of sequenced genomes (impact of mutations more interpretable)

Mutation discovery Extreme example: multiplexing the amplification of 10 000 human exons using primers from a programmable microarray and sequencing them using NGS.

Metagenomics Characterizing the biodiversity found on Earth The growing number of sequenced genomes enables us to interpret partial sequences obtained by direct sampling of specif environmental niches. Examples: ocean, acid mine site, soil, coral reefs, human microbiome which may vary according to the health status of the individual

Defining variability in many human genomes Common variants have not yet completly explained complex disease genetics rare alleles also contribute Also structural variants, large and small insertions and deletions Accelerating biomedical research

Epigenomic variation Enable of genome-wide patterns of methylation and how this patterns change through the course of an organism’s development. Enhanced potential to combine the results of different experiments, correlative analyses of genome-wide methylation, histone binding patterns and gene expression, for example.

More about epigenetics… Epigenetics: beyond the sequence. "The major problem, I think, is chromatin. What determines whether a given piece of DNA along the chromosome is functioning, since it's covered with the histones? What is happening at the level of methylation and epigenetics? You can inherit something beyond the DNA sequence. That's where the real excitement of genetics is now." (James D. Watson). Chromatin is defined as the dynamic complex of DNA and histone proteins that makes up chromosomes. Epigenetics is defined as the chemical modification of DNA that affects gene expression but does not involve changes to the underlying DNA sequence. As the emphasis in biology is switching away from genetic sequence and towards the mechanisms by which gene activity is controlled, epigenetics is becoming increasingly popular. Epigenetic processes are essential for packaging and interpreting the genome, are fundamental to normal development and are increasingly recognized as being involved in human disease. Epigenetic mechanisms include, among other things, histone modification, positioning of histone variants, nucleosome remodelling, DNA methylation, small and non-coding RNAs. (Nature, 7 Aug 2008).

Technology improvements Reduced sequencing error Increment read length Developing new bioinformatic tools Align: MAQ, SOAP Assembly: SSAKE Base caller: PyroBayes Variant detection: MAQ, GEM Cost reduction: 1000$ for personal genomics

Bibliografia Schuster 2008. Next-generation sequencing transforms today’s biology. Nature Methods - 5, 16 - 18 (2008). Published online: 19 December 2007; | doi:10.1038/nmeth1156. Mardis ER. 2008. Next-generation DNA sequencing methods. Annu Rev Genomics Hum Genet. 2008;9:387-402. Review. Mardis ER. 2008. The impact of next-generation sequencing technology on genetics.Trends Genet. 2008 Mar;24(3):133-41. Epub 2008 Feb 11. Review. Shendure and Ji. 2008 Next-generation DNA sequencing. Nat Biotechnol. 2008 Oct;26(10):1135-45. Wheeler DA et al. 2008. The complete genome of an individual by massively parallel DBA sequencing. Nature. 2008 Apr 17;452(7189):872-6.