Uses of microarrays and related methodologies in animal breeding

Slides:



Advertisements
Similar presentations
Recombinant DNA Technology
Advertisements

Planning breeding programs for impact
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Microarrays Dr Peter Smooker,
Microarray analysis Golan Yona ( original version by David Lin )
Quantitative Genetics
10 Genomics, Proteomics and Genetic Engineering. 2 Genomics and Proteomics The field of genomics deals with the DNA sequence, organization, function,
Office hours Wednesday 3-4pm 304A Stanley Hall Review session 5pm Thursday, Dec. 11 GPB100.
General Microbiology (Micr300) Lecture 11 Biotechnology (Text Chapters: ; )
Uses of microarrays and related methodologies in animal breeding Bruce Walsh, University of Arizona (Depts. of Ecology & Evolutionary.
Methods of Genome Mapping linkage maps, physical maps, QTL analysis The focus of the course should be on analytical (bioinformatic) tools for genome mapping,
Gene Expression Data Qifang Xu. Outline cDNA Microarray Technology cDNA Microarray Technology Data Representation Data Representation Statistical Analysis.
Library screening Heterologous and homologous gene probes Differential screening Expression library screening.
Chapter 5 Characterizing Genetic Diversity: Quantitative Variation Quantitative (metric or polygenic) characters of Most concern to conservation biology.
Fig Chapter 12: Genomics. Genomics: the study of whole-genome structure, organization, and function Structural genomics: the physical genome; whole.
DNA Chips Attach DNA to tiny spots on glass slides (i.e., chip). Hybridize fluorescently-labeled DNA probes to chip. Detect hybridization to different.
Experimental Design and Data Structure Supplement to Lecture 8 Fall
Microarray (Gene Expression) DNA microarrays is a technology that can be used to measure changes in expression levels or to detect SNiPs Microarrays differ.
Overview of Microarray. 2/71 Gene Expression Gene expression Production of mRNA is very much a reflection of the activity level of gene In the past, looking.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
1 Before considering selection, it’s important to characterize how gene expression varies within and between species. What evolutionary forces act on gene.
Microarrays and Other High-Throughput Methods BMI/CS 576 Colin Dewey Fall 2010.
1 Paper Outline Specific Aim Background & Significance Research Description Potential Pitfalls and Alternate Approaches Class Paper: 5-7 pages (with figures)
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
Microarray: An Introduction
Sadia Sayed. The expression of the cloned gene in a selected host organism. It does not necessarily ensure that it will be successfully expressed. A high.
Daniel Kahn, Jean-François Gout & Laurent Duret
Single Nucleotide Polymorphisms (SNPs
EQTLs.
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
MULTIPLE GENES AND QUANTITATIVE TRAITS
Genetical Genomics in the Mouse
Statistical Applications in Biology and Genetics
Genetic Variation Genetic Variation in Populations
Biostatistics?.
Quantitative traits Lecture 13 By Ms. Shumaila Azam
Section 3: Gene Technologies in Detail
Microarray Technology and Applications
The Role of Recombinant DNA Technology in Biotechnology
Genetic Control of Metabolism
Peter John M.Phil, PhD Atta-ur-Rahman School of Applied Biosciences (ASAB) National University of Sciences & Technology (NUST)
The Genetic Basis of Complex Inheritance
Gene Interactions Chapter 6, Part b
A few Simple Applications of Index Selection Theory
University of California at San Diego
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS)
Overview Expression data basics Introduction Biological network data
Mapping Quantitative Trait Loci
MULTIPLE GENES AND QUANTITATIVE TRAITS
Germplasm Issues Chapter 3. Variation: Type, Origin, and Scale
Genome-wide Associations
The ‘V’ in the Tajima D equation is:
Remember: Final Draft of Posters Due at 10 am tomorrow!
Inferring Genetic Architecture of Complex Biological Processes Brian S
Review Warm-Up What is the Central Dogma?
In these studies, expression levels are viewed as quantitative traits, and gene expression phenotypes are mapped to particular genomic loci by combining.
What are BLUP? and why they are useful?
Lecture 5 Artificial Selection
Uses of microarrays and related methodologies in animal breeding
Statistical Analysis and Design of Experiments for Large Data Sets
Copyright Pearson Prentice Hall
Chapter 7 Beyond alleles: Quantitative Genetics
Evan G. Williams, Johan Auwerx  Cell 
Copyright Pearson Prentice Hall
Modes of selection.
Copyright Pearson Prentice Hall
Copyright Pearson Prentice Hall
Data Type 1: Microarrays
Design Issues Lecture Topic 6.
Presentation transcript:

Uses of microarrays and related methodologies in animal breeding Bruce Walsh, jbwalsh@u.arizona.edu University of Arizona (Depts. of Ecology & Evolutionary Biology, Molecular & Cellular Biology, Plant Sciences, Animal Sciences, and Epidemology & Biostatistics)

The basic idea behind gene expression arrays With a complete (or partial) genome sequence in hand, one can array sequences from genes of interest on small chip, glass slide, or a membrane mRNA is extracted from cells of interest and hybridized to the array Genes showing different levels of mRNA can be detected

Types of microarrays Synthetic oligonucleotide arrays Chemically synthesize oligonucleotide sequences directly on slide/chip/membrane (e.g., using photolithography) Affymetrix, Agilent Spotted cDNA arrays PCR products from clones of genes of interest are spotted on a glass slide using a robot Extracted cellular mRNA is reverse-transcribed into cDNAs for hybridization

Cell type 2 Cell Type 1 The color of the spot corresponds to the relative concentrations of mRNAs for that gene in the two cell types

mRNAs for these genes more abundant in cell type 2 mRNAs from these Genes of roughly equal Abundance in both cell types mRNAs from these genes more abundant in cell type 1

Analysis of microarray data Image processing and normalization Detecting significant changes in expression Clustering and classification Clustering: detecting groups of co-expressed genes Classification: finding those genes at which changes in mRNA expression level predict phenotype

Significance testing-- GLM Interaction between gene i and treatment j Yklijk = u + Ak +Rkl + Ti + Gj + TGij +elkijk Replicate l in array k k-th spotting of gene j under treatment i on replicate l of array k Array k Treatment i Gene j

Problem of very many tests (genes) vs. few actual data vectors Expectation: A large number of the GxT interactions will be significant Controlling experiment-wide p value is very overly conservative (further, tests may be strongly correlated) Generating a reduced set of genes for future consideration (data mining) FDR (false discovery rate) PFP (proportion of false positives) Empirical Bayes approaches

Which loci control array-detected changes in mRNA expression? Cis-acting factors Control regions immediately adjacent to the gene Trans-acting factors Diffusable factors unlinked (or loosely linked) to the gene of interest Global (Master) regulators Trans-acting factors that influence a large number of genes

David Treadgill’s (UNC) mouse experiment Recombinant Inbred lines from a cross of DBA/2J and C57BL The level of mRNA expression (measured by array analysis) is treated as a quantitative trait and QTL analysis performed for each gene in the array

Distribution of >12,000 gene interactions TRANS-modifiers MASTER modifiers CIS-modifiers Distribution of >12,000 gene interactions Genomic location of genes on array Genomic location of mRNA level modifiers

Candidate loci : Differences in Gene Expression between lines Correlate differences in levels of expression with trait levels Map factors underlying changes in expression These are (very) often trans-acting factors Difference between structural alleles and regulatory alleles

Expanded selection opportunities offered by microarrays G x E Candidate genes may be suggested by examining levels of mRNA expression over different major environments With candidates in hand, potential for selection of genes showing reduced variance in expression over critical environments Breaking (or at least reducing) potentially deleterious genetic correlations Look for variation in genes that have little (if any) trans-acting effects on other genes

Towards the future Selection decisions using information on gene networks / pathways Microarrays are one tool for reconstructing gene networks Tools for examining protein-protein interactions Two hybrid screens FRET & FRAP

Analysis and Exploitation of Gene and Metabolic Networks Graph theory Most estimation and statistical issues unresolved Major (current) analytic tool: Kascer-Burns Sensitivity Analysis

Gene networks are graphs

Kascer-Burns Sensitivity Analysis (aka. Metabolic Control Analysis) “No theory should fit all the facts because some of the facts are wrong” (N. Bohr) “All models are wrong, although some models are Useful” (Box)

Flux = production rate of a particular product, here F Perhaps we increase the concentration of e1 However, it may be more efficient To increase the concentration of e4 The flux control coefficient, introduced by Kascer and Burns, provides a quantitative solution to this problem How best to increase the flux through this pathway?

Flux Control Coefficients, The control coefficient for the flux at step j in a pathway associated with enzyme j, Roughly speaking, the control coefficient is the percentage change in flux divided by percentage change in enzyme activity

Why many mutations are recessive: a 50% reduction in activity (the heterozygote) results in only a very small change in the flux When the activity of E is large, C is close to zero When the activity of E is near zero, C is close to 1

Kacser-Burns Flux summation theorem: • Coefficients are not intrinsic properties of an enzyme, but rather a (local) system property • If a control coefficient is greatly increased in value, this decreases the values of other control coefficients • While most values of C for proteins are positive, negative regulators (repressors) give negative values, allowing for C values > 1. • Truly rate-limiting steps are rare

“rate-limiting” steps in pathways Small-Kacser theorem: the factor f by which flux is increased by an r-fold increase in activity of E is Hence, the limiting increase in f is

Using estimated Control Coefficients as selection aids Loci with larger C values should respond faster to selection Such loci are obvious targets for screens of natural variation (candidate loci) Selection with reduced correlations Tallis or Kempthorne - Nordskog restricted selection index Select on loci with large C for flux of interest, smallest C for other fluxes not of concern Positive selection on C for flux of interest, selection to reduce flux changes in other pathways

We wish this flux to remain unchanged The initial approach might be to try either e3 or e4, rather than e1 or e2 Flux we wish to increase A more correct approach, however is to Pick the step(s) that maximize CF while minimizing CH

Index selection on pathways The elements of selection include both phenotype and C, and (possibly) marker markers as well Problems: C is a local estimate, changing as the pathway evolves Still have all the standard concerns with a selection index (e.g., stability of inverse of genetic covariance matrix) These are important caveats to consider even under the rosy scenaro where all C’s are know

What to call it? MAS = Marker Assisted Selection CAS = Control Coefficient Assisted Selection CASH $ = Control Activity Selection Helper

Summary Microarray analysis = data mining Potential (immediate) useage: Suggesting candidate loci More efficient use of G X E Reducing/breaking deleterious correlations Cis (easy) vs. trans (hard) control of expression levels Future = analysis of pathways Index selection (and all its problems)

Farewell from the “desert” U of A Campus