December 14, 2001Slide 1 Some Biology That Computer Scientists Need for Bioinformatics Lenwood S. Heath Virginia Tech Blacksburg, VA 24061

Slides:



Advertisements
Similar presentations
DNA RNA Protein Synthesis Gene Expression All mixed up 1pt 1 pt 1 pt
Advertisements

Transformation Principle In 1928 Fredrick Griffith heated the S bacteria and mixed with the harmless bacteria thinking that neither would make the mice.
Replication, Transcription, & Translation
GENETIC-CONCEPTS.
Introduction to Bioinformatics Yana Kortsarts Bob Morris.
August 19, 2002Slide 1 Bioinformatics at Virginia Tech David Bevan (BCHM) Lenwood S. Heath (CS) Ruth Grene (PPWS) Layne Watson (CS) Chris North (CS) Naren.
Bioinformatics: A New Frontier for Computer Scientists Ruth G. Alscher Lenwood S. Heath.
RNA and Protein Synthesis
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
Introduction to Bioinformatics Spring 2008 Yana Kortsarts, Computer Science Department Bob Morris, Biology Department.
Basic Biology for CS262 OMKAR DESHPANDE (TA) Overview Structures of biomolecules How does DNA function? What is a gene? How are genes regulated?
DNA and RNA. I. DNA Structure Double Helix In the early 1950s, American James Watson and Britain Francis Crick determined that DNA is in the shape of.
Protein Synthesis Ordinary Level. Lesson Objectives At the end of this lesson you should be able to 1.Outline the steps in protein synthesis 2.Understand.
13.3: RNA and Gene Expression
10-2: RNA and 10-3: Protein Synthesis
Biomolecules Nucleic acids.  Are the genetic materials of all organisms and determine inherited characteristics.  The are two kinds of nucleic acids,
RNA Ribonucleic Acid.
8.4 DNA Transcription 8.5 Translation
Protein Synthesis Chapter 11.
12-3: RNA AND PROTEIN SYNTHESIS Biology 2. DNA double helix structure explains how DNA can be copied, but not how genes work GENES: sequence of DNA that.
How DNA helps make you you. DNA Function Your development and survival depend on… Your development and survival depend on…  which proteins your cells.
Biology 10.1 How Proteins are Made:
What is the structure of DNA? Hw Q 1-4 p. 299.
Lesson Overview 13.1 RNA.
Cellular Metabolism Chapter 4. Introduction Metabolism is many chemical reactionss Metabolism breaks down nutrients and releases energy= catabolism Metabolism.
Protein Synthesis. The DNA Code It is a universal code. The order of bases along the DNA strand codes for the order in which amino acids are chemically.
RNA AND PROTEIN SYNTHESIS RNA vs DNA RNADNA 1. 5 – Carbon sugar (ribose) 5 – Carbon sugar (deoxyribose) 2. Phosphate group Phosphate group 3. Nitrogenous.
Molecular Biology Primer. Starting 19 th century… Cellular biology: Cell as a fundamental building block 1850s+: ``DNA’’ was discovered by Friedrich Miescher.
Chapter 13: RNA and Protein Synthesis
Chapter 13.1 and 13.2 RNA, Ribosomes, and Protein Synthesis
November 16, 2001Slide 1 Opportunities in Bioinformatics for Computer Science Lenwood S. Heath Virginia Tech Blacksburg, VA University.
DNA and RNA Objectives: 8.0 Identify the structure and function of DNA, RNA, and protein. 8.1 Explaining relationships among DNA, genes, and chromosomes.
DNA, RNA, and Proteins Section 3 Section 3: RNA and Gene Expression Preview Bellringer Key Ideas An Overview of Gene Expression RNA: A Major Player Transcription:
Molecular Genetics - From DNA to Trait. How Are Different Types of Cells Created and Maintained? Different types of cells are created by differential.
 During DNA replication, the two strands of the original parent DNA molecule, shown in blue, each serve as a template for making a new strand, shown in.
12.3 DNA, RNA, and Protein Objective: 6(C) Explain the purpose and process of transcription and translation using models of DNA and RNA.
Transcription and Translation
May 23, 2002Slide 1 Networks in Bioinformatics Lenwood S. Heath Virginia Tech Blacksburg, VA, USA I-SPAN’02 Manila, Philippines May 23, 2002.
Transcription and Translation. Protein Structure  Made up of amino acids  Polypeptide- string of amino acids bonded together (peptide bonds) Enzymes.
DNA Deoxyribonucleic Acid. DNA Structure What is DNA? The information that determines an organisms traits. DNA produces proteins which gives it “The.
Nucleic Acids and Protein Synthesis 10 – 1 DNA 10 – 2 RNA 10 – 3 Protein Synthesis.
DNA Structure and Protein Synthesis (also known as Gene Expression)
DNA How are cells structured to do the “right” thing?
RNA And PROTEIN SYNTHESIS. What DNA is for…… Making Proteins Why is this important?
The student is expected to: 4B investigate and explain cellular processes, including homeostasis, energy conversions, transport of molecules, and synthesis.
Ch Gene  Protein A gene is a sequence of nucleotides that code for a polypeptide (protein) Hundreds-thousands of genes are on a typical chromosome.
Transcription Objectives: Trace the path of protein synthesis.
DNA Deoxyribose Nucleic Acid – is the information code to make an organism and controls the activities of the cell. –Mitosis copies this code so that all.
DNA. Unless you have an identical twin, you, like the sisters in this picture will share some, but not all characteristics with family members.
RNA and Protein Synthesis Chapter How are proteins made? In molecular terms, genes are coded DNA instructions that control the production of.
Chapter 10: Nucleic Acids And Protein Synthesis Essential Question: What roles do DNA and RNA play in storing genetic information?
Gene Expression DNA, RNA, and Protein Synthesis. Gene Expression Genes contain messages that determine traits. The process of expressing those genes includes.
Transcription & Translation. Objectives: Relate the concept of the gene to the sequences of nucleotides in DNA Sequence the steps involved in protein.
 Genes are coded DNA instructions that will control the production of proteins  These messages have to change to RNA to be decoded.  RNA will give.
Molecular Genetics Transcription & Translation
Jeopardy: DNA & Protein Synthesis
Section 3: RNA and Gene Expression
12-3 RNA and Protein Synthesis
Transcription.
Ch 12 DNA and RNA.
What is RNA? Do Now: What is RNA made of?
The Cell Cycle and Protein Synthesis
Transcription and Translation
Central Dogma Central Dogma categorized by: DNA Replication Transcription Translation From that, we find the flow of.
Protein Synthesis RNA.
12-3 RNA and Protein Synthesis
An Overview of Gene Expression
Genes and Protein Synthesis Review
Animation: DNA makes DNA
DNA Deoxyribonucleic Acid.
Presentation transcript:

December 14, 2001Slide 1 Some Biology That Computer Scientists Need for Bioinformatics Lenwood S. Heath Virginia Tech Blacksburg, VA University of Maryland December 14, 2001

Slide 2 I. Some Molecular Biology and Genomics II. Language of the New Biology III. Existing bioinformatics tools IV. Bioinformatics challenges V. Bioinformatics at Virginia Tech Overview

December 14, 2001Slide 3 I. Some Molecular Biology The instruction set for a cell is contained in its chromosomes. Each chromosome is a long molecule called DNA. Each DNA molecule contains 100s or 1000s of genes. Each gene encodes a protein. A gene is transcribed to mRNA in the nucleus. An mRNA is translated to a protein on ribosomes.

December 14, 2001Slide 4 Transcription and Translation DNAmRNAProtein TranscriptionTranslation

December 14, 2001Slide 5 Elaborating Cellular Function DNAmRNAProtein TranscriptionTranslation Reverse Transcription Degradation Regulation Functions: Structure Catalyze chemical reactions Respond to environment (Genetic Code) Thousands of Genes!

December 14, 2001Slide 6 Chromosomes Long molecules of DNA: 10^4 to 10^8 base pairs 26 matched pairs in humans A gene is a subsequence of a chromosome that encodes a protein. Proteins associated with cell function, structure, and regulation. Only a fraction of the genes are in use at any time. Every gene is present in every cell.

December 14, 2001Slide 7 DNA Strand C (cytosine) complements G (guanine) CTCAATTGAGCG Bases A (adenine) complements T (thymine) 2’-deoxyribose (sugar) 5’ End3’ End

December 14, 2001Slide 8 Complementary DNA Strands Double-Stranded DNA C G TGA CTCAATTGAGCG C G C G A T A T A T A TA T A T C G C G C G GC C TTAA C G

December 14, 2001Slide 9 RNA Strand CUCAAUUGAGCG Bases U (uracil) replaces T (thymine) Ribose (sugar) 5’ End3’ End

December 14, 2001Slide 10 Transcription of DNA to mRNA C G C G C G A T A T A T A TA T A T C G C G C G TGAGC C TTAA C G CUCAAUUGAGCG mRNA Strand Template DNA Strand Coding DNA Strand Template DNA Strand

December 14, 2001Slide 11 Proteins and Amino Acids Protein is a large molecule that is a chain of amino acids (100 to 5000). There are 20 common amino acids (Alanine, Cysteine, …, Tyrosine) Three bases --- a codon --- suffice to encode an amino acid. There are also START and STOP codons.

December 14, 2001Slide 12 Genetic Code

December 14, 2001Slide 13 Translation to a Protein CUCAAUUGAGCG Phenylalanine ArginineHistidineAlanine Unlike DNA, proteins have three-dimensional structure essential to protein function. Protein folds to a three-dimensional shape that cannot yet be predicted from the primary sequence. mRNA Strand Nascent Polypeptide: Amino Acids Bound Together by Peptide Bonds

December 14, 2001Slide 14 Transcription and Translation DNAmRNAProtein TranscriptionTranslation

December 14, 2001Slide 15 Transcription of DNA to mRNA C G C G C G A T A T A T A TA T A T C G C G C G TGAGC C TTAA C G CUCAAUUGAGCG mRNA Strand Template DNA Strand Coding DNA Strand Template DNA Strand

December 14, 2001Slide 16 Translation to a Protein CUCAAUUGAGCG Phenylalanine ArginineHistidineAlanine mRNA Strand Nascent Polypeptide: Amino Acids Bound Together by Peptide Bonds

December 14, 2001Slide 17 Cell’s Fetch-Execute Cycle Stored Program: DNA, chromosomes, genes Fetch/Decode: RNA, ribosomes Execute Functions: Proteins --- oxygen transport, cell structures, enzymes Inputs: Nutrients, environmental signals, external proteins Outputs: Waste, response proteins, enzymes

December 14, 2001Slide 18 A new language has been created. Words in the language that are useful for today’s talks. Genomics Functional Genomics Proteomics cDNA Microarrays Global Gene Expression Patterns II. The Language of the New Biology

December 14, 2001Slide 19 Discovery of genetic sequences and the ordering of those sequences into individual genes; gene families; chromosomes. Identification of sequences that code for gene products/proteins; sequences that act as regulatory elements. Genomics

December 14, 2001Slide 20 Genome Sequencing Projects Drosophila Yeast Mouse Rat Arabidopsis Human Microbes …

December 14, 2001Slide 21 Drosophila Genome

December 14, 2001Slide 22 The biological role of individual genes. Mechanisms underlying the regulation of their expression. Regulatory interactions among them. Functional Genomics

December 14, 2001Slide 23 Glycolysis, Citric Acid Cycle, and Related Metabolic Processes

December 14, 2001Slide 24 Only certain genes are “turned on” at any particular time. When a gene is transcribed (copied to mRNA), it is said to be expressed. The mRNA in a cell can be isolated. Its contents give a snapshot of the genes currently being expressed. Correlating gene expressions with conditions gives hints into the dynamic functioning of the cell. Gene Expression

December 14, 2001Slide 25 Gene Expression: Control Points

December 14, 2001Slide 26 Responses to Environmental Signals

December 14, 2001Slide 27 Intracellular Decision Making

December 14, 2001Slide 28 Microarray Technology In the past, gene expression and gene interactions were examined known gene by known gene, process by process. With microarray technology: –Simultaneous examination of large groups of genes and associated interactions –Possible discovery of new cellular mechanisms involving gene expression

December 14, 2001Slide 29 Flow of a Microarray Experiment Hypotheses Select cDNAs PCR Test of Hypotheses Extract RNA Replication and Randomization Reverse Transcription and Fluorescent Labeling Robotic Printing HybridizationIdentify SpotsIntensitiesStatisticsClusteringData Mining, ILP

December 14, 2001Slide 30 Spots: (Sequences affixed to slide) TreatmentControl Mix 123 Excitation Emission Detection Relative Abundance Detection Hybridization

December 14, 2001Slide 31 Gene Expression Varies Cy5 to Cy3 ratios

December 14, 2001Slide 32 III. Existing Computational Tools in Bioinformatics Sequence similarity Multiple sequence alignments Database searching Evolutionary (phylogenetic) tree construction Sequence assemblers Gene finders

December 14, 2001Slide 33 Existing Biological Databases Molecular Sequences: Genomic DNA, mRNA, ESTs, proteins Protein domains, motifs, or blocks Protein families Genomes Nomenclature and ontologies Biological literature

December 14, 2001Slide 34 IV. Challenges for Bioinformatics Analyzing and synthesizing complex experimental data Representing and accessing vast quantities of information Pattern matching Data mining --- whole genome analysis Gene discovery Function discovery Modeling the dynamics of cell function

December 14, 2001Slide 35 Computer science interacts with the life sciences. V. Bioinformatics at Virginia Tech Computer Science in Bioinformatics: Joint research with: plant biologists, microbial biologists, biochemists, cell-cycle biologists, animal scientists, crop scientists, statisticians. Projects: Expresso; Nupotato; MURI; Arabidopsis Genome; Barista; Cell-Cycle Modeling Graduate option in bioinformatics Virginia Bioinformatics Institute (VBI)

December 14, 2001Slide 36 Integration of design and procedures Integration of image analysis tools and statistical analysis Data mining using inductive logic programming (ILP) Closing the loop Integrating models Expresso: A Problem Solving Environment (PSE) for Microarray Experiment Design and Analysis

December 14, 2001Slide 37 Nupotato Potatoes originated in the Andes, where there are many varieties. Many varieties survive at high altitude in cold, dry conditions. Microarray technology can be used to investigate genes that are responsible for stress resistance and that are responsible for the production of nutrients.

December 14, 2001Slide 38 MURI Some microorganisms have the ability to survive drying out or intense radiation. Their genomes are just being sequenced. Using microarrays and proteomics, we will try to correlate computationally the genes in the genomes with the special traits of the microorganisms. We are currently using multiple genome analysis.

December 14, 2001Slide 39 Arabidopsis Genome Project Arabidopsis is a model higher plant. It is the first higher plant whose genome has been fully sequenced. Gene finder software has been used to identify putative genes. We are computationally mining the regulatory regions of these genes for promoter patterns.

December 14, 2001Slide 40 Barista Barista serves Expresso! Software development team across projects to minimize duplication of effort. Work with Linux, Perl, C, Python, cvs, Apache, PHP, …

December 14, 2001Slide 41 Virginia Bioinformatics Institute (VBI) Research institute based at Virginia Tech Established July 1, 2000, with $3 million Will occupy 2 building and have 100+ employees in 4 years

December 14, 2001Slide 42 Getting Into Bioinformatics Learn some biology --- genetics, cell biology Study computational (molecular) biology Get involved with bioinformatics research in interdisciplinary teams Work with biologists to solve their problems