Functional Genomics and Bioinformatics Applied to Understanding Oxidative Stress Resistance in Plants Ruth Grene Alscher Lenwood S. Heath Naren Ramakrishnan.

Slides:



Advertisements
Similar presentations
Characterization of Transcriptional Regulatory Networks controlling plant cell adaptation to environmental stresses.
Advertisements

AGEING CAN BE DEFINED AS THE PROGRESSIVE LOSS OF FUNCTION ACCOMPANIED BY DECREASING FERTILITY AND INCREASING MORTALITY.
M. Kathleen Kerr “Design Considerations for Efficient and Effective Microarray Studies” Biometrics 59, ; December 2003 Biostatistics Article Oncology.
2nd IeCAB symposium June Mohamed M. Ibrahim 1* and Sameera O. Bafeel 2 1 Science Department ( Biology section), Teacher's College, King.
Microarray Simultaneously determining the abundance of multiple(100s-10,000s) transcripts.
August 19, 2002Slide 1 Bioinformatics at Virginia Tech David Bevan (BCHM) Lenwood S. Heath (CS) Ruth Grene (PPWS) Layne Watson (CS) Chris North (CS) Naren.
Modeling and Understanding Stress Response Mechanisms with Expresso Ruth G. Alscher Lenwood S. Heath Naren Ramakrishnan Virginia Tech, Blacksburg, VA
1 MicroArray -- Data Analysis Cecilia Hansen & Dirk Repsilber Bioinformatics - 10p, October 2001.
Modeling and Understanding Stress Response Mechanisms with Expresso Ruth G. Alscher Lenwood S. Heath Naren Ramakrishnan Virginia Tech, Blacksburg, VA
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Bioinformatics: A New Frontier for Computer Scientists Ruth G. Alscher Lenwood S. Heath.
Gene expression analysis summary Where are we now?
Microarrays Dr Peter Smooker,
Functional Genomics and Bioinformatics Applied to Understanding Oxidative Stress Resistance in Plants Ruth Grene Alscher Lenwood S. Heath Virginia Tech.
Parallel human genome analysis: Microarray-based expression monitoring of 1000 genes Mark Schena, Dari Shalon, Renu Heller, Andrew Chai, Patrick O. Brown,
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
Kate Milova MolGen retreat March 24, Microarray experiments: Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Indiana University Bloomington, IN Junguk Hur Computational Omics Lab School of Informatics Differential location analysis A novel approach to detecting.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
The Power of Microarray Technology Ruth G. Alscher.
December 14, 2001Slide 1 Some Biology That Computer Scientists Need for Bioinformatics Lenwood S. Heath Virginia Tech Blacksburg, VA 24061
CISC667, F05, Lec24, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) DNA Microarray, 2d gel, MSMS, yeast 2-hybrid.
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
Review of important points from the NCBI lectures. –Example slides Review the two types of microarray platforms. –Spotted arrays –Affymetrix Specific examples.
Applications of Functional Genomics and Bioinformatics Towards an Understanding of Oxidative Stress Resistance in Plants: Expresso and Chips.
Exploring the Metabolic and Genetic Control of Gene Expression on a Genomic Scale Joseph L. DeRisi, Vishwanath R. Iyer, Patrick O. Brown Science Vol. 278.
with an emphasis on DNA microarrays
Statistical Bioinformatics QTL mapping Analysis of DNA sequence alignments Postgenomic data integration Systems biology.
Fine Structure and Analysis of Eukaryotic Genes
HC70AL Spring 2009 Gene Discovery Laboratory RNA and Tools For Studying Differential Gene Expression During Seed Development 4/20/09tratorp.
Unit 1: The Language of Science  communicate and apply scientific information extracted from various sources (3.B)  evaluate models according to their.
CDNA Microarrays Neil Lawrence. Schedule Today: Introduction and Background 18 th AprilIntroduction and Background 25 th AprilcDNA Mircoarrays 2 nd MayNo.
Potato Genomics In Fredericton Dr. Barry Flinn Co-Lead Investigator - Genome Atlantic CPGP Research Director - Solanum Genomics International Inc.
Expresso and Chips Studying Drought Stress in Plants with cDNA Microarrays Lenwood S. Heath Department of Computer Science Virginia Tech, VA
Results Davidson College Biology Department DNA Microarrays: A guide to teaching chips Allison Amore, Sheena Bossie, Max Citrin, Erin Cobain,
Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.
Compare and contrast prokaryotic and eukaryotic cells.[BIO.4A] October 2014Secondary Science - Biology.
Introduction to DNA Microarray Technology Steen Knudsen Uma Chandran.
GTL Facilities Computing Infrastructure for 21 st Century Systems Biology Ed Uberbacher ORNL & Mike Colvin LLNL.
CDNA Microarrays MB206.
Gene expression and DNA microarrays Old methods. New methods based on genome sequence. –DNA Microarrays Reading assignment - handout –Chapter ,
Gene Expression Data Qifang Xu. Outline cDNA Microarray Technology cDNA Microarray Technology Data Representation Data Representation Statistical Analysis.
Fig Chapter 12: Genomics. Genomics: the study of whole-genome structure, organization, and function Structural genomics: the physical genome; whole.
Introduction to Bioinformatics Spring 2002 Adapted from Irit Orr Course at WIS.
Biology The Study of Life. Course Description "Biology of organisms and cells concerns living things, their appearance, different types of life, the scope.
November 16, 2001Slide 1 Opportunities in Bioinformatics for Computer Science Lenwood S. Heath Virginia Tech Blacksburg, VA University.
Principal component analysis Treated Vehicle Treated Vehicle.
3/24/2005 TIGP 1 Bioinformatics for Microarray Studies at IBS Pei-Ing Hwang, Ph.D. Mar. 24, 2005.
Genomics and Arabidopsis. What is ‘genomics’? Study of an organism’s entire genome –All the DNA encoded in the organism –Nucleus, mitochondria, chloroplasts.
Big Ideas in AP Biology. Big Idea I. Evolution Evolution is the core theme in biology Descent from a common ancestor with modification Natural Selection.
MICROARRAY TECHNOLOGY
Lecture 7. Functional Genomics: Gene Expression Profiling using
Data Mining the Yeast Genome Expression and Sequence Data Alvis Brazma European Bioinformatics Institute.
May 23, 2002Slide 1 Networks in Bioinformatics Lenwood S. Heath Virginia Tech Blacksburg, VA, USA I-SPAN’02 Manila, Philippines May 23, 2002.
Microarray (Gene Expression) DNA microarrays is a technology that can be used to measure changes in expression levels or to detect SNiPs Microarrays differ.
Introduction to Microarrays Kellie J. Archer, Ph.D. Assistant Professor Department of Biostatistics
Overview of Microarray. 2/71 Gene Expression Gene expression Production of mRNA is very much a reflection of the activity level of gene In the past, looking.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
IFAFS Meeting Gene Expression – Disease and Water Deficit John Davis.
Microarray Data Analysis The Bioinformatics side of the bench.
1 Genomics Advances in 1990 ’ s Gene –Expressed sequence tag (EST) –Sequence database Information –Public accessible –Browser-based, user-friendly bioinformatics.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS) LECTURE 13 ANALYSIS OF THE TRANSCRIPTOME.
From: Duggan et.al. Nature Genetics 21:10-14, 1999 Microarray-Based Assays (The Basics) Each feature or “spot” represents a specific expressed gene (mRNA).
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
Biotechnology and Bioinformatics: Bioinformatics Essential Idea: Bioinformatics is the use of computers to analyze sequence data in biological research.
Microarray: An Introduction
Risheng Chen et al BMC Genomics
Understanding Stress Response Mechanisms with Expresso
Summary of the Standards of Learning
Presentation transcript:

Functional Genomics and Bioinformatics Applied to Understanding Oxidative Stress Resistance in Plants Ruth Grene Alscher Lenwood S. Heath Naren Ramakrishnan Virginia Tech December 6, 2001

Overview Organization of our group About environmental stress and reactive oxygen species (ROS) Plant responses to ROS Analysis of responses to stress on a chip - microarray technology Expresso: management system for microarrays –Managing expression experiments –Analyzing expression data –Reaching conclusions Where do we go from here?

Boris Chevone Ron Sederoff Dawei Chen Ruth Alscher Lenny Heath Naren Ramakrishnan Keying Ye Len van Zyl Carol Loopstra Jonathan Watkinson Margaret Ellis Logan Hanks Senior Collaborators Students Cecilia Vasquez

Detection of stress - mediated gene expression effects on microarrays Computational tools to infer interaction among genes, pathways Revised / New Tools and Experiments Genetic Regulatory Networks Test inferences with varying conditions and genotypes Iterative strategy for detection of stress -mediated effects on gene expression using microarrays and CS expertise

Proposed Project: Plant Biology (with co-PIs: Ron Sederoff, NCSU; Carol Loopstra, TAMU) An investigation of drought stress responses in lobolly pine in a variety of provenances. Quantitative RT-PCR to confirm and expand results obtained with microarrays. In situ hybridization to stressed and unstressed cell and tissue types.

Plants adapt to changing environmental conditions through global cellular responses involving successive changes in, and interactions among, expression patterns of numerous genes. Our group studies these changes through a combination of bioinformatics and genomic techniques. Plant Response to Stress

Biological: To identify molecular stress resistance mechanisms in tree and crop species. Bioinformatic: To support iterative experimentation in plant genomics, capture and analyze experimental data, integrate biological information from diverse sources, and close the experimental loop. Long Term Goals

The Paradox of Aerobiosis Oxygen is essential, but toxic. Aerobic cells face constant danger from reactive oxygen species (ROS). ROS can act as mutagens, they can cause lipid peroxidation and denature proteins.

ROS Arise as a Result of Exposure to: Ozone Sulfur dioxide High light Paraquat Extremes of temperature Salinity Drought

Redox Regulation of Cellular Systems Membrane Receptors Environmental Stress Metabolite Defense Protein kinases; phosphatases Transcription factors Gene Expression Defense, Repair, Apoptosis Prooxidants (ROS) Antioxidants

Scenarios for Effects of Abiotic Stress on Gene Expression in Plants

Hypotheses There is a group of genes whose expression confers resistance to drought stress. Based on previous work (Alscher and others for superoxide dismutases and glutathione reductases) increased expression of defense genes is co-regulated and is correlated with resistance to oxidative stress. Failure to cope is correlated with little or no defense gene activation. A common core of defense genes exists, which responds to several different stresses.

Components of 1999 Stress Study 1999 Pine Drought Stress Experiments 2000 Expresso Prototype Design and Print Microarrays Select 384 Pine cDNAs Design Functional Hierarchy Capture Spot Intensities Integrate and Analyze Inductive Logic Programming (ILP)

Imposition of Successive Cycles of Mild or Severe Drought Stress on 1-year-old Loblolly Pine Seedlings DAYS  water potential (bars) RNA Harvest I RNA Harvest II RNA Harvest III RNA Harvest IV Cycles of Mild Drought Stress DRY DOWN = P S (photosynthesis) DAYS  water potentional (bars) Cycle I Cycle II Cycle III RNA Harvest I RNA Harvest II RNA Harvest III Cycles of Severe Drought Stress DRY DOWN Water withheld Water given Water withheld Water given Water withheld RECOVERY

Categories within Protective and Protected Processes Plant Growth Regulation Environmental Change Gene Expression Signal Transduction Protective Processes Protected Processes ROS and Stress Cell Wall Related Phenylpropanoid Pathway Development Metabolism Chloroplast Associated Carbon Metabolism Respiration and Nucleic Acids Mitochondrion Cells Tissues Cytoskeleton Secretion Trafficking Nucleus Protease-associated

Protective Processes Stress Cell Wall Related Phenylpropanoid Pathway Abiotic Biotic Antioxidant Processes Drought Heat Non-Plant Xenobiotics NADPH/Ascorbate/ Glutathione Scavenging Pathway Cytosolic ascorbate peroxidase Dehydrins, Aquaporins Heat shock proteins (Chaperones) superoxide dismutase-Fe superoxide dismutase-Cu-Zn glutathione reductase Sucrose Metabolism Cellulose Arabionogalactan proteins Hemicellulose Pectins Xylose Other Cell Wall Proteins isoflavone reductases phenylalanine ammonia-lyases S-adenosylmethionine decarboxylases glycine hydromethyltransferases Lignin Biosynthesis CCoAOMTs 4-coumarate-CoA ligases cinnamyl-alcohol dehydrogenase Chaperones “Isoflavone Reductases” GSTs Extensins and proline rich proteins Categories within “Protective Processes”

Hypotheses versus Results – 1999 Expt oAmong the genes responding to mild stress, there exists a population of genes whose expression confers resistance. –Candidate stress resistance genes. Genes in 69 categories ( e.g. HSP70s and 100s, but not HSP80s, aquaporins) responded positively to mild stress. Effect of severe stress was not detectable or negative.

Genes associated with other stresses responded to drought stress –Isoflavone reductase homologs and GSTs responded positively to mild drought stress. –These categories are previously documented to respond to biotic stress and xenobiotics, respectively. –However, both isoflavone reductase homologs and GSTs responded positively to severe drought stress. Hypotheses versus Results – 1999 Experiment

Flow of a Microarray Experiment Hypotheses Select cDNAs PCR Test of Hypotheses Extract RNA Replication and Randomization Reverse Transcription and Fluorescent Labeling Robotic Printing HybridizationIdentify SpotsIntensitiesStatisticsClusteringData Mining, ILP

Image Analysis: gridding, spot identification, intensity and background calculation, normalization Statistics: Fold or ratio estimation Combining replicates Higher-level Analysis: Clustering methods Inductive logic programming (ILP) Spot and Clone Analysis

Data Mining: Inductive Logic Programming ILP is a data mining algorithm expressly designed for inferring relationships. By expressing relationships as rules, it provides new information and resultant testable hypotheses. ILP groups related data and chooses in favor of relationships having short descriptions. ILP can also flexibly incorporate a priori biological knowledge (e.g., categories and alternate classifications).

Rule Inference in ILP Infers rules relating gene expression levels to categories, both within a probe pair and across probe pairs, without explicit direction Example Rule: [Rule 142] [Pos cover = 69 Neg cover = 3] level(A,moist_vs_severe,not positive) :- level(A,moist_vs_mild,positive). Interpretation: “If the moist versus mild stress comparison was positive for some clone named A, it was negative or unchanged in the moist versus severe comparison for A, with a confidence of 95.8%.”

ILP subsumes two forms of reasoning Unsupervised learning –“Find clusters of genes that have similar/consistent expression patterns” Supervised learning –“Find a relationship between a priori functional categories and gene expression” Hybrid reasoning: Information Integration –“Is there a relationship between genes in a given functional category and genes in a particular expression cluster?” –ILP mines this information in a single step

NGS-Supported Work of 2001: Expresso

NGS-Supported Work of 2001: Expresso Progress to Date Margaret Ellis and Logan Hanks (computer science graduate students): MEL: Semistructured data model for experiment capture Parsing: Automatic parser generators to drive archival storage Database: Loading and cataloging MEL data in a Postgres RDBMS Pipeline: Linkages to data analysis and data mining software

NGS-Supported Work of 2001: Progress to Date Cecilia Vasquez (plant biology graduate student): Loblolly pine seedlings were subjected to the same cycles of drought stress as in 1999, with photosynthesis, water potential measurements, and RNA isolations carried out throughout the time course of the experiment. Jonathan Watkinson (post-doctoral associate): RNA was hybridized to an array of 2400 pine cDNAs at NCSU. Data capture.

Imposition of Successive Cycles of Mild or Severe Drought Stress on 1-year-old Loblolly Pine Seedlings DAYS  water potential (bars) RNA Harvest I RNA Harvest II RNA Harvest III RNA Harvest IV Cycles of Mild Drought Stress DRY DOWN = P S (photosynthesis) DAYS  water potentional (bars) Cycle I Cycle II Cycle III RNA Harvest I RNA Harvest II RNA Harvest III Cycles of Severe Drought Stress DRY DOWN Water withheld Water given Water withheld Water given Water withheld RECOVERY

Cy3 TIFF Image Final Harvest; Control versus Mild Stress; 2001 Cy5 TIFF Image Replication Differential Expression

Final Harvest; Control versus Mild Stress; 2001 Cy5 to Cy3 ratios. Final harvest after four drought cycles. RNA harvested 24 hours after final watering. Cy5 = treated; Cy3 = control. Aquaporins responded positively, while HSP 80’s were unaffected, as in 1999 results.

Drought Stress Responses in Loblolly Pine: Questions to be Addressed Can a hierarchy of drought stress resistance mechanisms be identified ? Can a clear distinction be made between rapidly responding and long term adaptational mechanisms? Can particular subgroups within gene families be associated with drought tolerance?

Drought Stress Responses in Loblolly Pine: Proposed Bioinformatics Goals Support incorporation of biological information in the form of functional hierarchies and gene families. Close the computational and experimental loop to support iterative experimental regimes. Integrate information from multiple experiments involving multiple provenances, drought stresses, and EST sets.

Proposed Project: Sources of cDNAs for arrays NCSU ESTs selected on the basis of function. Stressed cDNA libraries from roots and stems of drought tolerant families from East Texas and Lost Pines, and from the Atlantic Coastal Plain (humid conditions). Homologs of drought-responsive Arabidopsis genes.

Gene Discovery in the Arabidopsis Transcriptome Data Capture Postgres Database Database Queries Statistical Analysis and Clustering Data Mining, ILP Possible Identification of Novel Drought Responsive Genes in Arabidopsis Drought Stress (short and long term) Hybridize to Arabidopsis Transcriptome Scanning, Image Processing

Select Pine cDNAs Via Contigs Robotic Replication and Printing Identification of Drought Responsive Genes and Pathways Across Provenances in Loblolly Pine Data Capture Postgres Database Database Queries Statistical Analysis and Clustering Data Mining, ILP Drought Stress Experiments on NC, TX Pine Hybridization Scanning, Image Processing Identification of Drought Responsive Pine Genes Close The Loop Arabidopsis Drought Responsive genes

Proposed Project: Bioinformatics I (Alscher, Heath, Ramakrishnan) Constraint-based selection of cDNAs, including intelligent use of contigs. Assignment of pine ESTs to subgroups within protein families (ProDom, Pfam). Extend information integration in ILP to include Mendel classification of gene families. Integrating data across provenances and known degrees of drought tolerance.

Proposed Project: Bioinformatics II (Ramakrishnan, Heath) Specialize ILP for particular biological information sources. Automatic tuning of ILP parameters. Pushing data mining functionality into the database. Interleaving and iteration of query, data analysis, and data mining operations.