Bioinformatics: A New Frontier for Computer Scientists. Ruth G. Alscher and Lenwood S. Heath.

The Language of the New Biology: A new language has been created. Words in that language that are useful for today’s talk: genomics, functional genomics, proteomics, cDNA microarrays, global gene expression patterns.

Human Genome Project: How many individuals? Which races? Statistics about sequencing, etc. (Ruth)

New Computational Tools Needed for Biology: sequencing; analyzing experimental data; representing vast quantities of information; searching; pattern matching; data mining; gene discovery; function discovery.

Molecular Biology: cell function; nucleic acids, DNA, RNA, chromosomes, genes; amino acids, proteins.

DNA Strand: A (adenine) complements T (thymine); C (cytosine) complements G (guanine).
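
The pairing rule on this slide can be sketched in a few lines of Python. This is a minimal illustration, not part of the original talk; the function names are hypothetical, and the reverse-complement step relies on the standard fact that the two strands of a double helix run in opposite directions.

```python
# Watson-Crick pairing from the slide: A pairs with T, C pairs with G.
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def complement(strand: str) -> str:
    """Return the base-by-base complement of a DNA strand."""
    return "".join(COMPLEMENT[base] for base in strand.upper())


def reverse_complement(strand: str) -> str:
    """Return the partner strand, read in its own 5'-to-3' direction."""
    return complement(strand)[::-1]


if __name__ == "__main__":
    print(complement("ACGT"))
    print(reverse_complement("AAGC"))
```

A dictionary lookup raises KeyError on any character other than A, C, G, or T, which is a deliberate choice here so that malformed input is not silently passed through.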

Complementary DNA Strands Double-Stranded DNA

RNA Strand: U (uracil) replaces T (thymine).
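
As a rough sketch of this substitution, the snippet below rewrites a DNA sequence as RNA. It assumes the input is the coding (sense) strand, so the RNA has the same sequence with U in place of T; the function name is hypothetical and the biology of transcription is otherwise ignored.

```python
def dna_to_rna(coding_strand: str) -> str:
    """Rewrite a DNA coding strand as RNA by replacing every T with U."""
    return coding_strand.upper().replace("T", "U")


if __name__ == "__main__":
    print(dna_to_rna("ATGGCCTAA"))
```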

Proteins: Unlike DNA, whose information is carried by its linear sequence, a protein functions through its three-dimensional structure. A protein folds into a three-dimensional shape that minimizes energy.

Amino Acids: A protein is a large molecule that is a chain of amino acids (100 to 5000). There are 20 common amino acids (alanine, cysteine, …, tyrosine). Three bases, called a codon, suffice to encode an amino acid. There are also START and STOP codons.
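
Three bases suffice because 4^3 = 64 possible codons is more than enough to cover 20 amino acids plus STOP signals. The sketch below builds the standard genetic code as a lookup table and translates a DNA coding strand. It is an illustration under stated assumptions: translation begins at the first ATG found (the usual START codon), the one-letter amino acid string follows the conventional T, C, A, G ordering of the standard code, and the function name is hypothetical.

```python
from itertools import product

# Standard genetic code, one-letter amino acid symbols, '*' = STOP.
# Codons are enumerated with bases in the order T, C, A, G,
# the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate from the first ATG up to (not including) the first STOP."""
    dna = dna.upper()
    start = dna.find("ATG")
    if start == -1:
        return ""  # no START codon found
    protein = []
    for i in range(start, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    print(len(CODON_TABLE))
    print(translate("ATGGCCTAA"))
```

Because 64 codons map onto only 20 amino acids, the code is redundant: several codons encode the same amino acid.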

Chromosomes: Long molecules of DNA, 10^4 to 10^8 base pairs. 23 matched pairs in humans. A gene is a subsequence of a chromosome that encodes a protein. Proteins are associated with regulation. Only a fraction of the genes are in use at any time. Every gene is present in every cell.

Cell’s Fetch-Execute Cycle: Stored program: DNA, chromosomes, genes. Fetch/decode: RNA, ribosomes. Execute functions: proteins (oxygen transport, cell structures, enzymes). Inputs: nutrients, environmental signals, external proteins. Outputs: waste, response proteins, enzymes.

Evolution: Genotype is the genetic makeup of individuals or species. Mutations are the basis for the evolution of species. Phenotype is the perceived traits of an organism (eye color, number of limbs, etc.), controlled by the interaction of many genes.

Genetics: An individual organism has some set of genes, stored in the DNA of each cell. The gene set determines biological functions and individual characteristics. The genetic makeup of a particular species defines that species.

Protein-Coding Genes

Genomics: Discovery of genetic sequences and the ordering of those sequences into individual genes, into gene families, and into chromosomes. Identification of sequences that code for gene products/proteins and sequences that act as regulatory elements.

Functional Genomics: The biological role of individual genes, mechanisms underlying the regulation of their expression, and regulatory interactions among them.

Biologists Need Computer Scientists: assembling DNA fragments; physical mapping; identifying genes and gene families; protein folding; determining protein function; data analysis (microarrays); data visualization; searching; sequence alignment; data mining.
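
To make one item on this list concrete, here is a minimal sketch of sequence alignment using the classic Needleman-Wunsch dynamic program for global alignment. The slide does not name an algorithm, so this choice is an assumption, as are the scoring parameters (match +1, mismatch -1, gap -1) and the function name. Only the optimal score is computed, not the alignment itself.

```python
def global_alignment_score(a: str, b: str,
                           match: int = 1,
                           mismatch: int = -1,
                           gap: int = -1) -> int:
    """Optimal global alignment score of a and b (Needleman-Wunsch)."""
    # prev[j] holds the best score for aligning a[:i-1] with b[:j].
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]  # a[:i] aligned against an empty prefix of b
        for j in range(1, len(b) + 1):
            pair = match if a[i - 1] == b[j - 1] else mismatch
            curr.append(max(prev[j - 1] + pair,   # align a[i-1] with b[j-1]
                            prev[j] + gap,        # gap in b
                            curr[j - 1] + gap))   # gap in a
        prev = curr
    return prev[-1]


if __name__ == "__main__":
    print(global_alignment_score("ACGT", "ACT"))
```

Keeping only two rows of the table uses memory proportional to the shorter dimension rather than the full product of the two lengths, which matters for long sequences; recovering the alignment itself would require keeping the full table or a divide-and-conquer variant.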

Microarray Data Analysis: How can microarrays be used to learn more about the influence of drought stress on gene expression? Where the biologists need the computer scientists: (A) Confounding factors in the raw data: (1) limitations in accuracy (technique); (2) biological variation (individuals). (B) How to apply corrections for these confounding factors to maximize the predictive power of the data. (C) Modeling regulatory networks.
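
One simple correction of the kind item B asks about can be sketched as follows. This is an assumption for illustration, not the method used in the pilot experiment: each spot's treatment and control intensities are turned into a log2 ratio, and the ratios on a slide are then centered on their median so that a slide-wide technical bias does not masquerade as a change in expression. The function names and the sample intensities are hypothetical.

```python
import math
from statistics import median


def log_ratios(treatment, control):
    """Per-spot log2(treatment / control); intensities must be positive."""
    return [math.log2(t / c) for t, c in zip(treatment, control)]


def median_center(ratios):
    """Subtract the slide median so the typical spot has log ratio 0."""
    m = median(ratios)
    return [r - m for r in ratios]


if __name__ == "__main__":
    # Hypothetical intensities for three spots on one slide.
    treatment = [400.0, 200.0, 100.0]
    control = [100.0, 100.0, 100.0]
    raw = log_ratios(treatment, control)
    print(raw)
    print(median_center(raw))
```

Median centering assumes that most genes on the slide do not change between treatment and control. Biological variation between individuals (item A.2) is a separate issue that is usually addressed with replicate samples rather than a per-slide correction.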

Effects of Drought Stress: Effects of drought stress on loblolly pine, a pilot experiment. Virginia Tech: Plant Biologists: Ruth Alscher, Boris Chevone; CS: Lenny Heath and colleagues; Statistics: Ina Hoeschele, Shun-Hwa Li. NC State (Forest Biotechnology): Ying-Hsuan Sun, Ron Sederoff, Ross Whetten.

[Figure: microarray spots (sequences affixed to slide); treatment and control mix; hybridization; excitation, emission, detection; relative abundance.]

Biological Variation: Biological variation as reflected in a comparison of expression in two trees of the same clone (a subquadrant).

Iterative Strategy: Iterative strategy for detection of genetic interactions using microarrays. [Figure labels: detection of gene expression effects on microarrays; characterize gene function; test mutant phenotypes; genetic regulatory networks; identify mutants.]

Glycolysis, Citric Acid Cycle, and Related Metabolic Processes

Gene Expression: Control Points

Responses to Environmental Signals

Intracellular Decision Making

Drosophila Genome

Expressed Sequence Tags: A publicly accessible collection of cDNAs representing mRNAs present in specific tissues. The cDNAs have been partially sequenced and identified, where possible, as homologs to publicly accessible genes of known function.

Microarray Quotes, from The Chipping Forecast: “A fresh, comprehensive and open-minded look at every problem in biology” (Brown and Botstein, page 33). WOW! “… the construction of a Biological Periodic Table …” (Lander, page 3). “… as model-independent as possible …” (Brown and Botstein, page 33).

ROS (reactive oxygen species) arise throughout the cell.

Free Radicals

Bioinformatics Institute: Research institute based at Virginia Tech. Begins July 1 with $3 million. Will occupy 2 buildings and have 100+ employees in 4 years.

Getting Into Bioinformatics: Get a minor in biology. Get involved with bioinformatics research: Dr. Alscher, Dr. Heath, Dr. Keller, Dr. Ramakrishnan, Dr. Watson.