Epidemiology 217 Molecular and Genetic Epidemiology Bioinformatics & Proteomics John Witte.

Slides:



Advertisements
Similar presentations
Martin John Bishop UK HGMP Resource Centre Hinxton Cambridge CB10 1 SB
Advertisements

Genomes and Proteomes genome: complete set of genetic information in organism gene sequence contains recipe for making proteins (genotype) proteome: complete.
A Lite Introduction to (Bioinformatics and) Comparative Genomics Chris Mueller August 10, 2004.
Outline to SNP bioinformatics lecture
Bioinformatics at WSU Matt Settles Bioinformatics Core Washington State University Wednesday, April 23, 2008 WSU Linux User Group (LUG)‏
Introduction to Bioinformatics Yana Kortsarts Bob Morris.
Bioinformatics at IU - Ketan Mane. Bioinformatics at IU What is Bioinformatics? Bioinformatics is the study of the inherent structure of biological information.
Dr. Almut Nebel Dept. of Human Genetics University of the Witwatersrand Johannesburg South Africa Significance of SNPs for human disease.
Predicting the Function of Single Nucleotide Polymorphisms Corey Harada Advisor: Eleazar Eskin.
Biology and Bioinformatics Gabor T. Marth Department of Biology, Boston College BI820 – Seminar in Quantitative and Computational Problems.
Introduction to Genomics, Bioinformatics & Proteomics Brian Rybarczyk, PhD PMABS Department of Biology University of North Carolina Chapel Hill.
Bioinformatics Student host Chris Johnston Speaker Dr Kate McCain.
Evolutionary Genome Biology Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen, Hungary, May 2006.
Whole Genome Polymorphism Analysis of Regulatory Elements in Breast Cancer AAGTCGGTGATGATTGGGACTGCTCT[C/T]AACACAAGCGAGATGAAGAAACTGA Jacob Biesinger Dr.
SNP Resources: Finding SNPs Databases and Data Extraction Mark J. Rieder, PhD SeattleSNPs Variation Workshop March 20-21, 2006.
FROM GENE TO PROTEIN: TRANSCRIPTION & RNA PROCESSING Chapter 17.
From T. MADHAVAN, & K.Chandrasekaran Lecturers in Zoology.. EXIT.
Selecting TagSNPs in Candidate Genes for Genetic Association Studies Shehnaz K. Hussain, PhD, ScM Assistant Professor Department of Epidemiology, UCLA.
Bioinformatics Jan Taylor. A bit about me Biochemistry and Molecular Biology Computer Science, Computational Biology Multivariate statistics Machine learning.
14.3 Studying the Human Genome
Human Genome Project by: Amanda Mosello. What is the Human Genome Project? created in 1990, by the National Institutes of Health and the US Department.
Single Nucleotide Polymorphisms Mrs. Stewart Medical Interventions Central Magnet School.
Doug Brutlag 2011 Genomics & Medicine Doug Brutlag Professor Emeritus of Biochemistry &
Introduction to Bioinformatics Spring 2002 Adapted from Irit Orr Course at WIS.
Epidemiology 217 Omics, Bioinformatics, & Resources at UCSF John Witte.
Copyright © 2009 Pearson Education, Inc. Art and Photos in PowerPoint ® Concepts of Genetics Ninth Edition Klug, Cummings, Spencer, Palladino Chapter 21.
20.1 Structural Genomics Determines the DNA Sequences of Entire Genomes The ultimate goal of genomic research: determining the ordered nucleotide sequences.
A Lite Introduction to (Bioinformatics and) Comparative Genomics Chris Mueller November 18, 2004 Based on the Genomics in Biomedical Research course at.
SNP Haplotypes as Diagnostic Markers Shrish Tiwari CCMB, Hyderabad.
Ch. 21 Genomes and their Evolution. New approaches have accelerated the pace of genome sequencing The human genome project began in 1990, using a three-stage.
Copyright © 2009 Pearson Education, Inc. Genomics, Bioinformatics, and Proteomics Chapter 21 Lecture Concepts of Genetics Tenth Edition.
 The process by which desired traits of certain plants and animals are selected and passed on to their future generations is called selective breeding.
ARE THESE ALL BEARS? WHICH ONES ARE MORE CLOSELY RELATED?
Bioinformatics Core Facility Guglielmo Roma January 2011.
Predicting protein degradation rates Karen Page. The central dogma DNA RNA protein Transcription Translation The expression of genetic information stored.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Julia N. Chapman, Alia Kamal, Archith Ramkumar, Owen L. Astrachan Duke University, Genome Revolution Focus, Department of Computer Science Sources
Using a Single Nucleotide Polymorphism to Predict Bitter Tasting Ability Lab Overview.
Epidemiology 217 Molecular and Genetic Epidemiology Bioinformatics & Proteomics John Witte, Xin Liu & Mark Pletcher.
Central dogma: the story of life RNA DNA Protein.
EB3233 Bioinformatics Introduction to Bioinformatics.
Bioinformatics and Computational Biology
1 From Mendel to Genomics Historically –Identify or create mutations, follow inheritance –Determine linkage, create maps Now: Genomics –Not just a gene,
Chapter 12 Assessment How could manipulating DNA be beneficial?
The Future of Genetics Research Lesson 7. Human Genome Project 13 year project to sequence human genome and other species (fruit fly, mice yeast, nematodes,
Using a Single Nucleotide Polymorphism to Predict Bitter Tasting Ability Lab Overview.
Finding genes in the genome
The Central Dogma of Molecular Biology DNA  RNA  Protein  Trait.
Unit 1 – Living Cells.  The study of the human genome  - involves sequencing DNA nucleotides  - and relating this to gene functions  In 2003, the.
COURSE OF BIOINFORMATICS Exam_30/01/2014 A.
Notes: Human Genome (Right side page)
Using public resources to understand associations Dr Luke Jostins Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015.
Looking Within Human Genome King abdulaziz university Dr. Nisreen R Tashkandy GENOMICS ; THE PIG PICTURE.
Bioinformatics Overview
The Transcriptional Landscape of the Mammalian Genome
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
Human Cells Human genomics
Single Nucleotide Polymorphisms
Bellwork: What is the human genome project. What was its purpose
“Proteomics is a science that focuses on the study of proteins: their roles, their structures, their localization, their interactions, and other factors.”
14-3 Human Molecular Genetics
Concept 18.2: Eukaryotic gene expression can be regulated at any stage
Genome organization and Bioinformatics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
The Future of Genetic Research
From Mendel to Genomics
Problems from last section
.1Sources of DNA and Sequencing Methods 2 Genome Assembly Strategy and Characterization 3 Gene Prediction and Annotation 4 Genome Structure 5 Genome.
Introduction to Bioinformatics
Presentation transcript:

Epidemiology 217 Molecular and Genetic Epidemiology Bioinformatics & Proteomics John Witte

Coding Genotypes CodingCCCTTT Co-dominant Dominant011 Recessive001 Log Additive012

Post-Genomic Era: Lots of Data!

“The study of genetic and other biological information using computer and statistical techniques.” A Genome Glossary, Science, Feb 16, 2001

Bioinformatics in Genetic Epi Some key aspects: Data management Candidate regions / genes (selection and SNP mining) Genetic Analyses (e.g., genotyping) Statistical Analyses

Data Management 5/20 Demogr. Database Laboratory Database Clinical Database Health and Habits Database Nutritional Database Genomic Database CaP Genes Databases Hub

From gene to polymorphisms Given a gene, how do I… Find its polymorphisms? Find its polymorphisms? Find information about those polymorphisms? Find information about those polymorphisms?

Hands-on guide for browsing and analyzing genomic data. Contains worked examples, providing: –overview of the types of data available, –details on how these data can be browsed, and –step-by-step instructions for using many of the most commonly- used tools for sequence based discovery.

Nature Genetics: A User's Guide to the Human Genome 3 of the 13 worked example questions How does one find a gene of interest and determine that gene's structure? How would one retrieve the sequence of a gene, along with all annotated exons and introns, as well as a certain number of flanking bases for use in primer design? A user wishes to find all the single nucleotide polymorphisms that lie between two sequence-tagged sites. Do any of these single nucleotide polymorphisms fall within the coding region of a gene? Where can any additional information about the function of these genes be found?

Look for SNPs in Databases General databases: --- dbSNP ( --- UCSC Genome Bioinformatics ( --- HapMap ( --- The SNP consortium (TSC) ( --- Human gene variation base (HGVbase) ( Special databases: --- The UW-FHCRC Variation Discovery Resource (SeattleSNPs) ( --- Cancer Genome Anatomy Project - SNP500Cancer Database ( ( --- InnateImmunity ( --- Drug response ( More….

UCSC Browser Comparative Genomics SNPs Gene structure

SeattleSNPs Resequencing the complete genomic region of each gene among 24 African-American (AA) subjects and 23 European (CEPH) subjects –2000 bp upstream of first exon –1500 bp downstream of poly-A signal –All exons and introns for genes below 35 kbp Summary data (2/18/05) –Number of genes sequenced: 208 –Total kilobases sequenced: –Number of SNPs found: 23,590 –SNPs in AA sample: 20,765 –SNPs in CEPH sample: 12,937

From Genomics to Proteomics Our ~ 25,000 genes carry the blueprint for making proteins, of which all living matter is made. Each protein has a particular shape and function that determine its role in the body. Proteomics is the study of protein shape, function, and patterns of expression.

5`3`DNA Pre-splicing RNA Post-splicing RNA Protein Exon, non-coding (5`UTR, 3`UTR) Exon, coding Promoter Enhancer Intron Poly-adenilation Anatomy of a gene

Characterize proteins derived from genetic code Compare variations in their expression levels under different conditions Study their interactions Identify their functional role. Proteomics

Proteome Complexity Recall that genome is relatively static. In contrast, many cellular proteins are continually moving and undergoing changes such as: 1.binding to a cell membrane, 2.partnering with another protein, 3.gaining or losing a chemical group such as a sugar, fat, or phosphate, or 4.breaking into two or more pieces.

Size of Proteome? > 1 Million Proteins >>> 25,000 genes in humans. Large number due to complexity (a given gene can make many different proteins) Features such as folds and motifs, allow them to be categorized into groups and families. This should help make it easier to undertake proteomic research. But no proteome has yet been sequenced.

How to Analyze Proteomes Broad range of technologies Central paradigm: –2-D gel electrophoresis (2D- GE), and mass spectrometry (MS). –2D-GE is used to separate the proteins by isoelectric point and then by size. –MS determines their identity and characteristics.

Bioinformatics in Proteomics Creation and maintenance of databases of protein info. Development of methods to predict the structure and/or function of newly discovered proteins and structural RNA sequences. Clustering protein sequences into families of related sequences and the development of protein models. Aligning similar proteins and generating phylogenetic trees to examine evolutionary relationships