Final Final: 2 of the following 3 choices, –1 hour exam covering recent materials (June 11), –2 page review of an assigned paper (due June 11), –Self-study.

Slides:



Advertisements
Similar presentations
Recombinant DNA Technology
Advertisements

Microarray technology and analysis of gene expression data Hillevi Lindroos.
DNA microarray and array data analysis
DNA Microarray: A Recombinant DNA Method. Basic Steps to Microarray: Obtain cells with genes that are needed for analysis. Isolate the mRNA using extraction.
Additional Powerful Molecular Techniques Synthesis of cDNA (complimentary DNA) Polymerase Chain Reaction (PCR) Microarray analysis Link to Gene Therapy.
Chip arrays and gene expression data. With the chip array technology, one can measure the expression of 10,000 (~all) genes at once. Can answer questions.
The Human Genome Project and ~ 100 other genome projects:
Exam #2 Mean = 73% Median = 74% Mode = 90% A range: | | | | | | | | | B range: | | | | | | | | | C range: | | | | | | | D range: | | | | | | | | | | Failing:
DNA Arrays …DNA systematically arrayed at high density, –virtual genomes for expression studies, RNA hybridization to DNA for expression studies, –comparative.
Bacterial Physiology (Micr430)
Final Final: 2 of the following 3 choices, –1 hour exam covering recent materials, –2 page review of an assigned paper (due June 11), –Self-study of a.
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Data analytical issues with high-density oligonucleotide arrays A model for gene expression analysis and data quality assessment.
Information Aspects of Nucleic Acids Measurement Technologies Description of nucleic acid measurement technologies Algorithmic, optimization, data analysis.
Arrays: Narrower terms include bead arrays, bead based arrays, bioarrays, bioelectronic arrays, cDNA arrays, cell arrays, DNA arrays, gene arrays, gene.
Probes/Targets DNA Arrays...Probes: are the tethered nucleic acids with known sequence, –the DNA on the chip,...Target: is the free nucleic acid sample.
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
Microarrays: Theory and Application By Rich Jenkins MS Student of Zoo4670/5670 Year 2004.
Introduce to Microarray
Genomics I: The Transcriptome RNA Expression Analysis Determining genomewide RNA expression levels.
Microarrays: Basic Principle AGCCTAGCCT ACCGAACCGA GCGGAGCGGA CCGGACCGGA TCGGATCGGA Probe Targets Highly parallel molecular search and sort process based.
Analysis of microarray data
with an emphasis on DNA microarrays
Chapter 5 Nucleic Acid Hybridization Assays A. Preparation of nucleic acid probes: 1. Labeling DNA & RNA - Nick Translation - Random primed DNA labeling.
‘Omics’ - Analysis of high dimensional Data
AP Biology Ch. 20 Biotechnology.
-The methods section of the course covers chapters 21 and 22, not chapters 20 and 21 -Paper discussion on Tuesday - assignment due at the start of class.
歐亞書局 PRINCIPLES OF BIOCHEMISTRY Chapter 9 DNA-Based Information Technologies.
DNA Technology Chapter 20.
CDNA Microarrays MB206.
Data Type 1: Microarrays
Gene expression and DNA microarrays Old methods. New methods based on genome sequence. –DNA Microarrays Reading assignment - handout –Chapter ,
Microarray Technology
Finish up array applications Move on to proteomics Protein microarrays.
Microarray - Leukemia vs. normal GeneChip System.
Scenario 6 Distinguishing different types of leukemia to target treatment.
Monday Human and chimp DNA is ~98.7 similar, But, we differ in many and profound ways, Can this difference be attributed, at least in part, to differences.
Microarrays and Gene Expression Analysis. 2 Gene Expression Data Microarray experiments Applications Data analysis Gene Expression Databases.
1 FINAL PROJECT- Key dates –last day to decided on a project * 11-10/1- Presenting a proposed project in small groups A very short presentation (Max.
How are we different? …at the DNA level.
Genomics I: The Transcriptome
Gene expression. The information encoded in a gene is converted into a protein  The genetic information is made available to the cell Phases of gene.
By Melissa Rivera.  GENE CLONING: production of multiple identical copies of DNA  It was developed so scientists could work directly with specific genes.
Gene Expression Analysis. 2 DNA Microarray First introduced in 1987 A microarray is a tool for analyzing gene expression in genomic scale. The microarray.
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
KEY CONCEPT Biotechnology relies on cutting DNA at specific places.
Idea: measure the amount of mRNA to see which genes are being expressed in (used by) the cell. Measuring protein might be more direct, but is currently.
Biotechnology and Genomics Chapter 16. Biotechnology and Genomics 2Outline DNA Cloning  Recombinant DNA Technology ­Restriction Enzyme ­DNA Ligase 
Human Genomics. Writing in RED indicates the SQA outcomes. Writing in BLACK explains these outcomes in depth.
Introduction to Microarrays Kellie J. Archer, Ph.D. Assistant Professor Department of Biostatistics
Overview of Microarray. 2/71 Gene Expression Gene expression Production of mRNA is very much a reflection of the activity level of gene In the past, looking.
Chapter 10: Genetic Engineering- A Revolution in Molecular Biology.
DNA Gene A Transcriptional Control Imprinting Histone Acetylation # of copies of RNA? Post Transcriptional Processing mRNA Stability Translational Control.
Chapter 20: DNA Technology and Genomics - Lots of different techniques - Many used in combination with each other - Uses information from every chapter.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
目录 The Principle and Application of Common Used Techniques in Molecular Biology chapter 18.
Gene expression and DNA microarrays No lab on Thursday. No class on Tuesday or Thursday next week –NCBI training Monday and Tuesday –Feb. 5 during class.
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
DNA Microarray Overview and Application. Table of Contents Section One : Introduction Section Two : Microarray Technique Section Three : Types of DNA.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS) LECTURE 13 ANALYSIS OF THE TRANSCRIPTOME.
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
Introduction to Oligonucleotide Microarray Technology
Rest of Chapter 11 Chapter 12 Genomics, Proteomics, and Transgenics Jones and Bartlett Publishers © 2005.
Unit 1 – Living Cells.  The study of the human genome  - involves sequencing DNA nucleotides  - and relating this to gene functions  In 2003, the.
Human Genomics Higher Human Biology. Learning Intentions Explain what is meant by human genomics State that bioinformatics can be used to identify DNA.
Microarray: An Introduction
Biotechnology.
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
Microarray Technology and Applications
Recombinant DNA Technology
Presentation transcript:

Final Final: 2 of the following 3 choices, –1 hour exam covering recent materials (June 11), –2 page review of an assigned paper (due June 11), –Self-study of a remaining chapter in the text, answers to the “odd” problems (due June 11).

DNA Arrays …DNA systematically arrayed at high density, –virtual genomes for expression studies, RNA hybridization to DNA for expression studies, –comparative genomics, DNA hybridization to DNA, –inter- and intra-species comparisons, etc. –potential yet to be developed.

Arrays solid substrate DNA Chip: oligonucleotides, up to 1000s kb fragments.

Probes/Targets...Probes: are the tethered nucleic acids with known sequence, –the DNA on the chip,...Target: is the free nucleic acid sample whose identity/abundance is being detected, –the labeled nucleic acid that is washed over the chip.

DNA-Probes –cDNA arrays, DNA arrays, DNA Microarrays, –oligonucleotide arrays, DNA chips. nucleic acid is spotted onto the substrate. nucleic acid is synthesized directly onto on the substrate.

DNA Chips …oligonucleotides systematically synthesized in situ at high density. Affymetrix DNA Chip

Allele-Specific Oligonucleotides (DNA Chips) …allele specific oligonucleotides (ASOs) recognize single base pair differences in DNA sequences. --AGTAGCTGTAGCT-- --TCATCGACATCGA-- --AGTAGCTaTAGCT-- --TCATCGACATCGA-- mismatch no binding

Ordered Array of ASOs linker molecule...over a million ASOs and controls can be gridded per cm 2.

Photolithography …the process of using an optical image and a photosensitive substrate to produce a pattern, oligonucleotide synthesis can be inhibited by a ‘protection group’ molecule, the ‘protection group’ can be linked by a photosensitive bond, and thus cleaved by light.

Targets...fluorescent targets, –genomic DNA, –cDNA, mRNA or cRNA for expression studies, …targets are washed over the chip for hybridization.

cDNA Microarrays...denatured, double stranded DNA ( bp) is dotted, or sprayed on a glass or nylon substrate,...up to tens of thousands of spots per array, quill technology...

Hybridization Detection …fluorescent images are read by an optical scanner, and intensities are compared using algorithms to differentiate artifacts.

Screening for Genetic Disease Cystic fibrosis: 75% of mutations are at the  508 deletion site, –8% are in three additional specific locations in the gene, the rest are spread across the length of the gene, Pre-Array tests yielded only an ~83% chance of detecting a mutation.

Cystic fibrosis Detection Create a DNA chip with ASOs for wild- type Cystic fibrosis gene, –approximately 4.5 kb of the 250 kb gene codes for the structural portion of the gene, mers span 4.5 kb, 20 mismatches per 20-mer requires 4500 ASOs, or grids, plus controls.

Creating the Mask …computer algorithms are used to design the mask, –creation of mask is now the limiting process, requires months to accomplish, and about $100,000 per mask, –masks have limited lifetimes, each array costs about $100 currently.

Cystic fibrosis Chip …using photlithography, create a chip with ASOs to identify any difference from wild- type DNA, …match results with mutations at know deleterious loci, …catalog new deleterious loci.

1 Gene of Many …with controls, the Cystic fibrosis gene may require up to 20,000 grids, …new chips can accommodate up to 1 million grids, …can look at 50 similarly sized genes on one chip.

Genetic Diseases …as genes are linked to diseases, quick, inexpensive tests can be performed to determine who carries specific mutations, …computer analysis will provide genome profiles that predict a variety of traits.

Genome Profiling …with 1500 SNPs now, and up to thousands available, genetic profiles can be made, …choose SNPs in or near genes involved in traits or diseases, …compare profiles over large populations.

How are we different? …at the RNA level.

Southern Analysis DNA hybridizing to RNA,

DNA Arrays and Expression …grid gene-specific ASOs onto the DNA chip, or cDNAs onto microarrays, …assay with labeled cDNA, genes that are expressed at a specific time, place or under a specific condition will bind to the chip for display.

Genes and Targets once the Human Genome Project is done, all of the genes can be gridded, –presently, several completely sequenced genomes have been gridded, yeast, E. coli, various bacteria, drug identification, fundamental research, etc.,

Gene Expression Technologies DNA Chips (Affymetrix) and MicroArrays can measure mRNA concentration of thousands of genes simultaneously General scheme: Extract RNA, synthesize labeled cDNA, Hybridize with DNA on chip.

The Experiment After hybridization –Scan the Chip and obtain an image file –Image Analysis (find spots, measure signal and noise) Output File –Affymetrix chips: Measure each gene’s signal and make a present/absent call. –cDNA MicroArrays: competing hybridization of target and control. For each gene the log ratio of target and control.

Preprocessing: From one experiment to many Chip and Channel Normalization –Aim: bring readings of all experiments to be on the same scale –Cause: different RNA amounts, labeling efficiency and image acquisition parameters –Method: Multiply readings of each array/channel by a scaling factor such that: The sum of the scaled readings will be the same for all arrays Find scaling factor by a linear fit of the highly expressed genes

Preprocessing: From one experiment to many Filtering of Genes –Remove genes that are absent in most experiments –Remove genes that are constant in all experiments –Remove genes with low readings which are not reliable.

Noise and Repeats >90% 2 to 3 fold Multiplicative noise Repeat experiments Log scale dist(4,2)=dist(2,1) log – log plot

We can ask many questions? Which genes are expressed differently in two known types of conditions? What is the minimal set of genes needed to distinguish one type of conditions from the others? Which genes behave similarly in the experiments? How many different types of conditions are there? Supervised Methods (use predefined labels) Unsupervised Methods (use only the data)

Goal A: Find groups of genes that have correlated expression profiles. These genes are believed to belong to the same biological process and/or are co-regulated. Goal B: Divide conditions to groups with similar gene expression profiles. Example: divide drugs according to their effect on gene expression. Unsupervised Analysis Clustering Methods

Linear Round What is clustering?

T (RESOLUTION) Cluster Analysis Yields Dendrogram

Applications Monitor expression patterns under the experimental conditions of your choosing to determine the function of the thousands genes, Common expression patterns can be used to identify genes that are members of the same pathway, Explore expression of candidate/unknown genes.

Gene/Drug Discovery …genes involved in cancer and other diseases have been identified through a variety of techniques, –genome expression analysis provides a means of discovering other genes that are concomitantly expressed, –genome expression analysis provides a means of monitoring drug/treatment regimes.

Applications Can study the role of more than 1700 cancer related genes in association with the (rest) of the genome, Define interactions and describe pathways, Measure drug response, Build databases for use in molecular tumor classifications, –benign vs. cancerous, slow vs. aggressive

Extended Applications Water quality testing (4 hours vs. 4 days), Environmental watchdogs, Fundamental research on non-human subjects, Direct sequencing of related species for evolutionary studies, Comparisons of gene regulation between closely related species, etc.

What’s the Question Human and chimp DNA is ~98.7 similar, But, we differ in many and profound ways, Can this difference be attributed, at least in part, to differences in gene expression, rather than differences in the actual gene and gene products?

Huh? Prevailing notion: a gene is mutated, better alleles survive and, in fact, out-compete old alleles…evolution marches on. Paper’s hypothesis: it’s not the genes that are changing, but the REGULATION of the genes.

Regulation? Although the # of genes (~35,000) in the genome remains controversial, it appears to be a lot less than early dogma (100, ,000 genes), One thought, “many” of the additional genes found in complex organisms, are transcription factors.

First...What does it mean that our genomes are 98.7% similar at the DNA level, and how do we know this?

DNA Sequence Comparisons

Bacterial Artificial Chromosomes BACs F plasmid ancestry, –maintain bacterial replication system and copy number control system.

BAC End Sequencing “Mate Pairs”, –sequence both ends of the BAC using vector derived sequencing primers, –yields about 600 bp per sequence.

Contiguous Sequences (contigs)...looks for end-to end overlaps of at least 40 bp with no more than 6% differences in match. What’s the significance?...a one in event.

...if 100% sequenced. x 543bp / read = Science 291 (5507), , September , June 2000

Chimp DNA Sequences 3.3x Coverage of the genome.

Human/Chimp BES Similarity This represents coding (highly conserved) and non-coding (low conservation) regions of the genome.

Are our Phenotypes 98.7% Similar? Some apparent differences, –HIV susceptibility, epithelial neoplasms (cancers), malaria, and Alzheimers, In fact, there is only one well understood biochemical difference, –A 92 bp deletion in a gene that codes for a hydroxylase, results in an un-hydroxylated secretion protein in our immune system.

The Experiment Check patterns of gene expression level, using DNA chips, for 12,000 genes in humans, chimps, orangutans, and macaques, (TRANSCRIPTOME), –brain, liver, and blood Check for protein levels using 2-D gel analysis, (PROTEOME) Controls, –Microarray analysis, (17,997 transcripts), –Rodent tests.

Affymetrix U95A array...

Targets Labeled Human cDNA, Chimp cDNA, Macaque cDNA, –Collect tissue, –Extract RNA, –Label RNA.

Cluster Analysis Distances represent the relative differences in expression changes.

So What? Changes in gene expression are greatest in the Human gene cluster. Primates Mice

Probably Rejected by the Journal Why? –Probe was human, target at least 98.7% different, –At the “allele specific oligonucleotide” level, single base changes may skew the data.

Microarray Spotted 17,997 PCR products onto nylon, probed with labeled cDNAs, –PCR primers are available, in kits, that will amplify just about any part of the human genome, –1000 bp fragments were generated, Base pair differences won’t affect probe sensitivity over this large a target.

Microarray Data 5:1 difference in expression profiles.

Proteomics (2d-gels) Proteins separated by mass, then by charge. Qualitative (positions), Quantitative (amount)

8500 Protein Spots

What do You Think?

Monday Schedule change... *RNAi (June 3) Background: Review of RNAi Specific and heritable genetic interference by double-stranded RNA in Arabidopsis thaliana