Big Data in Biology: A focus on genomics. Bioinformatics and Genomics O Applications: O Personalized cancer medicines O Disease determination O Pathway.

Slides:



Advertisements
Similar presentations
Linkage and Genetic Mapping
Advertisements

CSC411- Machine Learning and Data Mining Tutorial 10– March 23 th, 2007 University of Toronto (Mississauga Campus)
Test-tube or keyboard? Computation in the life sciences.
Introduction to genomes & genome browsers
Genetic Analysis in Human Disease
Scientific themes in personal genetics Personal Genetics Education Project (pgEd) Harvard Medical School - Wu Laboratory
Unit 1: DNA and the Genome Key area 8: Genomic sequencing.
Key area 6: Mutations.
Biology and Bioinformatics Gabor T. Marth Department of Biology, Boston College BI820 – Seminar in Quantitative and Computational Problems.
Computational Tools for Finding and Interpreting Genetic Variations Gabor T. Marth Department of Biology, Boston College
Evolutionary Genome Biology Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen, Hungary, May 2006.
Polymorphisms – SNP, InDel, Transposon BMI/IBGP 730 Victor Jin, Ph.D. (Slides from Dr. Kun Huang) Department of Biomedical Informatics Ohio State University.
Course Overview Personalized Medicine: Understanding Your Own Genome Fall 2014.
Presented by Karen Xu. Introduction Cancer is commonly referred to as the “disease of the genes” Cancer may be favored by genetic predisposition, but.
Bioinformatics Jan Taylor. A bit about me Biochemistry and Molecular Biology Computer Science, Computational Biology Multivariate statistics Machine learning.
Introduction Basic Genetic Mechanisms Eukaryotic Gene Regulation The Human Genome Project Test 1 Genome I - Genes Genome II – Repetitive DNA Genome III.
KEY CONCEPT Genetics provides a basis for new medical treatments.
Whole Exome Sequencing for Variant Discovery and Prioritisation
Biotechnology SB2.f – Examine the use of DNA technology in forensics, medicine and agriculture.
Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology
Genomes and Genomics.
Ch. 21 Genomes and their Evolution. New approaches have accelerated the pace of genome sequencing The human genome project began in 1990, using a three-stage.
1 Human Genome Project, Gene Therapy, and Cloning Adapted from the University of Utah Genetic Science Learning Center and The National Genome Research.
Your genome: What does your DNA say about you? Personal Genetics Education Project (pgEd) Harvard Medical School personal genetics education.
Data provenance in biomedical discovery Donald Dunbar Queen’s Medical Research Institute University of Edinburgh Workshop on Principles of Provenance in.
KEY CONCEPT Biotechnology relies on cutting DNA at specific places.
KEY CONCEPT Genetics provides a basis for new medical _____________.
Human Genomics. Writing in RED indicates the SQA outcomes. Writing in BLACK explains these outcomes in depth.
Scientific themes in personal genetics Personal Genetics Education Project (pgEd) Harvard Medical School - Wu Laboratory
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
Microbial Genetics.  In bacteria genetic transfer (recombination) can happen three ways:  Transformation  Transduction  Conjugation  The result is.
Computational Biology and Genomics at Boston College Biology Gabor T. Marth Department of Biology, Boston College
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
Evolutionary Genome Biology Gabor T. Marth, D.Sc. Department of Biology, Boston College
3 RD BLOCK WARM-UP 1. Have out your homework (Graphic Organizer). 2. After I check it, go check your answers at the SSS. 3. Open your Biology Handbook.
INTERPRETING GENETIC MUTATIONAL DATA FOR CLINICAL ONCOLOGY Ben Ho Park, M.D., Ph.D. Associate Professor of Oncology Johns Hopkins University May 2014.
Unit 1 – Living Cells.  The study of the human genome  - involves sequencing DNA nucleotides  - and relating this to gene functions  In 2003, the.
Higher Human Biology Unit 1 Human Cells KEY AREA 5: Human Genomics.
Human Genomics Higher Human Biology. Learning Intentions Explain what is meant by human genomics State that bioinformatics can be used to identify DNA.
WHAT IS THE IMPACT OF THE HUMAN GENOME PROJECT FOR DRUG DEVELOPMENT? Arman & Fin.
Enhancers and 3D genomics Noam Bar RESEARCH METHODS IN COMPUTATIONAL BIOLOGY.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Inferences on human demographic history using computational Population Genetic models Gabor T. Marth Department of Biology Boston College Chestnut Hill,
Interpreting exomes and genomes: a beginner’s guide
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Biotechnology.
Nucleotide variation in the human genome
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
DNA Marker Lecture 10 BY Ms. Shumaila Azam
Human Cells Human genomics
New genes can be added to an organism’s DNA.
Scientists use several techniques to manipulate DNA.
KEY CONCEPT Genetics provides a basis for new medical treatments.
Linking Genetic Variation to Important Phenotypes
By Michael Fraczek and Caden Boyer
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Mutations & Genetic Engineering
Applications of DNA Analysis
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Mutations Mutations are changes in DNA.
KEY CONCEPT Genetics provides a basis for new medical treatments.
KEY CONCEPT Genetics provides a basis for new medical treatments.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Genetics provides a basis for new medical treatments.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Genetics provides a basis for new medical treatments.
Mutations.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Genetics provides a basis for new medical treatments.
Presentation transcript:

Big Data in Biology: A focus on genomics

Bioinformatics and Genomics O Applications: O Personalized cancer medicines O Disease determination O Pathway Analysis O Biomarker Discovery

An Interesting Point O “One article estimated that the output from genomics may soon dwarf data heavyweights such as YouTube” O “I don't know if a million genomes is the right number, but clearly we need more than we've got,” says Marc Williams, director of the Geisinger Genomic Medicine Institute.

Stephens, Z. D. et al. PLoS Biol.13, e (2015)

Genomics in the Past O DNA can have 4 different bases, A, C, G, T O Exons (1%): parts of the DNA that code for proteins O Look at nucleotides O ~13,000 single nucleotide variants. O Roughly 2% of these will affect protein composition O Unfortunately, research used cell cultures or animal modes. O However: Many of these associations were made with low levels of evidence.

Genomics Continued O Structural Variants – deletion, duplication, and translocation. O Much harder to detect than single mutations O Many genes do not code for proteins, but can still regulate protein creation, but it’s still not well known the function of many of these regions. O Capturing all such variation is desirable, but not the best in the short term O Tldr; genomics is hard.

Applications O Iceland deCODE Project: medical history records and genome data of 150,000 people O Led to Discovery of: O Genetic risk factors O Breast cancer O Alzheimer’s O Also found 10,000 people missing 1,500 different copies of both genes. O Drug responsiveness: ADHD medicine only works for one of ten preschoolers, cancer drugs are effective for 25% of patients, and depression drugs work with 6 of 10 patients. O Personalized Medicine

Issues with Bioinformatics O Icelandic work helped by a homogeneous population. O 1000 Genomes project captured some diversity, but mainly captured Caucasian populations. O “Because they come from the genetic mother ship, so to speak, people of African ancestry carry a lot more genetic variants than non-Africans… Variants that seem unusual in Caucasians might be common in Africans, and may not actually cause disease.” - says Isaac Kohane, a bioinformatician at Harvard Medical School in Boston, Massachusetts. O Reference genome: the comparison tool that many researchers use is flawed. O 1 st iteration: random donors of unidentified ethnicity. O Currently it incorporates more human genomic diversity.

Solutions O Relationships between doctors and researchers to create models between diseases and genetics. O Harvesting genomes produces up to 40 Petabytes (PB) per a year. O Computational power: The more variables you add, the more people you add, it gets harder and harder. O Silicon Valley Lure: people needed for bioinformatics need to be able to harness massive parallel computation.

Conclusion O Two Main Issues: O Difficulty of bioinformatics due to genomics O Computational power and the need for collaboration O Yet solving these problems, could easily lead to incredible improvements in medicine.