DNA as Biological Information Rasmus Wernersson Henrik Nielsen.

Slides:



Advertisements
Similar presentations
DNA as Biological Information Rasmus Wernersson Henrik Nielsen.
Advertisements

Biologisk information Med fokus på DNA. Læringsmål / learning objectives Læringsmål –Hvad er biologisk information –Informations flow –Teknikken bag DNA.
Introduktion til Bioinformatik Hold 01 Oktober 2010.
DNA as Biological Information Rasmus Wenersson. Overview Learning objectives –About Biological Information –A note about DNA sequencing techniques and.
Nucleic acids Nucleic Acids Information storage.
What is bioinformatics? Finding patterns in molecular biological data Implies: managing molecular biological data identifying correlations in molecular.
DNA and RNA. I. DNA Structure Double Helix In the early 1950s, American James Watson and Britain Francis Crick determined that DNA is in the shape of.
DNA: The Molecule of Heredity. DNA Structure Deoxyribonucleic acid. A macromolecule composed of two strands of monomers called nucleotides. These strands.
RNA, DNA, & Proteins Chapter 9 & 10.1 Review
What is bioinformatics?. What are bioinformaticians up to, actually? Manage molecular biological data –Store in databases, organise, formalise, describe...
DNA Chapter 10 The Code of Life. History Griffith Hershey and Chase Chargaff Linus Pauling Maurice Wilkins Rosalind Franklin Francis Crick James Watson.
Essential Bioinformatics and Biocomputing Module (Tutorial) Biological Databases Lecturer: Chen Yuzong Jan 2003 TAs: Cao Zhiwei Lee Teckkwong, Bernett.
DNA: The Molecule of Heredity
DNA: Structure and Function. DNA Structure Deoxyribonucleic acid. A macromolecule composed of two strands of monomers called nucleotides. These strands.
KEY CONCEPT DNA structure is the same in all organisms.
DNA Structure and Replication Honors Biology 2013.
Objectives 12.2 The Structure of DNA
Warm Up Where is DNA located within a cell? Why is DNA important?
Nucleic Acids Genetic Material. Nucleic Acids are macromolecules There are two main types: DNARNA.
Assessment Statements: Describe the structure of DNA.
Inheritance and the Structure of DNA. Deoxyribonucleic Acid.
Quick Review 1.What is genetic information stored as? 2.What organelle is this information found in?
DNA The Molecule of Life. What is DNA? DeoxyriboNucleic Acid Chargaff’s Law A=T, G=C R. Franklin and M. Wilkins Crystal X-ray J Watson and F Crick Model.
AP Biology Nucleic Acids Information storage Energy Transfer.
GENE SEQUENCING. INTRODUCTION CELL The cells contain the nucleus. The chromosomes are present within the nucleus.
Thursday, March 31 st Objective: Explain and apply laws of heredity and their relationship to the structure and function of DNA Agenda: 1. Introduction.
And the RACE BEGINS! Once DNA was identified as the genetic molecule the race was on to determine its structure. The combined work of different researchers.
DNA (Deoxyribonucleic Acid)
DNA DeoxyriboNucleic Acid. What can DNA do? Carries information from one generation to the next Determines the heritable characteristics of organisms.
Week #5 Quarter 2 (11/12/13) Homework: Lab report Due Nov 19 one week! To Do Today : Lab report directions Due Nov 19 Finish coloring DNA Video on Genetics.
Question for Today: DNA is a polymer, which means that it is made up of many repeating single units called monomers. These monomers are called what? Nucleotides.
Gene Expression Gene: contains the recipe for a protein 1. is a specific region of DNA on a chromosome 2. codes for a specific mRNA.
DNA –Was known as a chemical in cells by the end of the nineteenth century –Has the capacity to store genetic information –Can be copied and passed from.
Mark Biodiversity HW DNA Structure Lesson 1 Structure of DNA.
Molecular Genetics Structure of DNA. Phoebus Levene (1920’s) identified the 3 components of DNA molecule –deoxyribose sugars –phosphate groups –nitrogenous.
DNA Introduction. What is DNA? Genetic information of life Type of Nucleic Acid Double Stranded.
GENBANK FILE FORMAT LOCUS –LOCUS NAME Is usually the first letter of the genus and species name, followed by the accession number –SEQUENCE LENGTH Number.
12.2 The Structure of DNA 1)What are the chemical components of DNA? 2)What clues helped scientists solve the structure of DNA? 3)What does the double-helix.
Chapter 12: DNA Mr. Freidhoff 1900’s What is known to man? – Chromosomes carry genetic information. – Some type of heredity is passed on to offspring.
AP Biology Nucleic acids AP Biology Nucleic Acids Information storage.
Molecular Biology. The study of DNA and how it serves as a chemical basis of heredity.
Chapter #12 – DNA, RNA, & Protein Synthesis. I. DNA – experiments & discoveries A. Griffith and Transformation Frederick Griffith – British scientist.
DNA and RNA. Rosalind Franklin Worked with x-ray crystallography Discovered: That DNA had a helical structure with two strands.
DNA Vocabulary Draw a word from the bucket Complete a 4 Corners mini poster about your word! Remember to make your poster neat and colorful!! Vocabulary.
The Structure of DNA -Identify the components of DNA and how they pair up. -Discuss the scientists responsible for the identification of DNA’s structure.
DNA,RNA, and Proteins. In the 1950’s, James Watson and Francis Crick, built a model of DNA. Their model was inspired by the work of Rosalind Franklin.
DNA DNA Deoxyribose Nucleic Acid DNA is a heredity molecule –passed on from parent/s –generation to generation Stores and transmits genetic information.
DNA Sequencing First generation techniques
Lesson Overview 12.2 The Structure of DNA.
And the RACE BEGINS! Once DNA was identified as the genetic molecule the race was on to determine its structure. The combined work of different researchers.
Chapter 10 sections 1, 2,3 Student 2015.
DNA as Biological Information
DNA as Biological Information
Lec 1 Molecular biology.
THE MOLECULE BASIS OF INHERITANCE
Structure of DNA.
DNA The Molecule of Life.
10.2 DNA and RNA are polymers of nucleotides
THE STRUCTURE OF DNA Section 4.2 Page 210.
And the RACE BEGINS! Once DNA was identified as the genetic molecule the race was on to determine its structure. The combined work of different researchers.
Warm-up: DNA What does DNA stand for? Where do we find DNA?
12.2 Notes The Structure of DNA
DNA Notes!.
DNA and its Structure.
Lesson: Structure of DNA Key Questions:
DNA STRUCTURE SBI 4UI – 4.2.
Warm-up: DNA What does DNA stand for? Where do we find DNA?
DNA The Code of Life.
DNA Notes!.
Presentation transcript:

DNA as Biological Information Rasmus Wernersson Henrik Nielsen

Overview Learning objectives –About Biological Information –A note about DNA sequencing techniques and DNA data –File formats used for biological data –Introduction to the GenBank database

Hvad er gener? rough strain & DNA from killed smooth strain

DNA: sammensætning Omkring 1950 vidste man at DNA var en polymer af nukleotider – nærmere bestemt deoxyribonukleotider. De fire nukleotider der udgør DNA er kun forskellige i deres nitrogen-base. Der er to puriner (adenin og guanin) og to pyrimidiner (cytosin og thymin). Uracil (en pyrimidin) forekommer kun i RNA

Deoxyribonukleotider 1’ 2’ 3’ 4’ 5’ deoxy

DNA: sammensætning 2 Chargaff’s regel: Der er lige mængder A og T samt lige mængder C og G. (mens forholdet mellem G+C og A+T kan variere)

DNA: Røntgenkrystallografi Røntgenkrystallografi: Rosalind Franklin DNA præparation: Maurice Wilkins Modelbygning og tolkning af røntgenspektre: Francis Crick & James Watson

Watson & Crick 1953

DNA Struktur

Information flow in biological systems

DNA sequences = summary of information 5’ AGACC 3’ 3’ TCTGG 5’ 5’ ATGGCCAGGTAA 3’ DNA backbone: (Deoxy)ribose: Ribose Deoxyribose ’ 3’ 5’ 3’

PCR Melting 96º, 30 sec Annealing ~55º, 30 sec Extension 72º, 30 sec 35 cycles Animation :

PCR Animation: PCR graph:

Gel electrophoresis DNA fragments are separated using gel electrophoresis –Typically 1% agarose –Colored with EtBr or ZybrGreen (glows in UV light). –A DNA ”ladder” is used for identification of known DNA lengths. Gel picture: PCR setup: + -

The Sanger method of DNA sequencing Images: } Terminator X-ray sequenceing gel OH

Automated sequencing The major break-through of sequencing has happened through automation. Fluorescent dyes. Laser based scanning. Capillary electrophoresis Computer based base- calling and assembly. Images:

Handout exercise: ”base-calling” Handout: Chromotogram Groups of 2-3. Tasks: –Identify “difficult” regions –Identify “difficult” sequence stretches. –Try to estimate the best interval to use.

Biological data on computers The GenBank database File formats –FASTA –GenBank

NCBI GenBank GenBank is one of the main international DNA databases. GenBank is hosted by NCBI: National Center for Biotechnology Information. GenBank has existed since The database is public - no restrictions on the use of the data within.

FASTA format >alpha-D ATGCTGACCGACTCTGACAAGAAGCTGGTCCTGCAGGTGTGGGAGAAGGTGATCCGCCAC CCAGACTGTGGAGCCGAGGCCCTGGAGAGGTGCGGGCTGAGCTTGGGGAAACCATGGGCA AGGGGGGCGACTGGGTGGGAGCCCTACAGGGCTGCTGGGGGTTGTTCGGCTGGGGGTCAG CACTGACCATCCCGCTCCCGCAGCTGTTCACCACCTACCCCCAGACCAAGACCTACTTCC CCCACTTCGACTTGCACCATGGCTCCGACCAGGTCCGCAACCACGGCAAGAAGGTGTTGG CCGCCTTGGGCAACGCTGTCAAGAGCCTGGGCAACCTCAGCCAAGCCCTGTCTGACCTCA GCGACCTGCATGCCTACAACCTGCGTGTCGACCCTGTCAACTTCAAGGCAGGCGGGGGAC GGGGGTCAGGGGCCGGGGAGTTGGGGGCCAGGGACCTGGTTGGGGATCCGGGGCCATGCC GGCGGTACTGAGCCCTGTTTTGCCTTGCAGCTGCTGGCGCAGTGCTTCCACGTGGTGCTG GCCACACACCTGGGCAACGACTACACCCCGGAGGCACATGCTGCCTTCGACAAGTTCCTG TCGGCTGTGTGCACCGTGCTGGCCGAGAAGTACAGATAA >alpha-A ATGGTGCTGTCTGCCAACGACAAGAGCAACGTGAAGGCCGTCTTCGGCAAAATCGGCGGC CAGGCCGGTGACTTGGGTGGTGAAGCCCTGGAGAGGTATGTGGTCATCCGTCATTACCCC ATCTCTTGTCTGTCTGTGACTCCATCCCATCTGCCCCCATACTCTCCCCATCCATAACTG TCCCTGTTCTATGTGGCCCTGGCTCTGTCTCATCTGTCCCCAACTGTCCCTGATTGCCTC TGTCCCCCAGGTTGTTCATCACCTACCCCCAGACCAAGACCTACTTCCCCCACTTCGACC TGTCACATGGCTCCGCTCAGATCAAGGGGCACGGCAAGAAGGTGGCGGAGGCACTGGTTG AGGCTGCCAACCACATCGATGACATCGCTGGTGCCCTCTCCAAGCTGAGCGACCTCCACG CCCAAAAGCTCCGTGTGGACCCCGTCAACTTCAAAGTGAGCATCTGGGAAGGGGTGACCA GTCTGGCTCCCCTCCTGCACACACCTCTGGCTACCCCCTCACCTCACCCCCTTGCTCACC ATCTCCTTTTGCCTTTCAGCTGCTGGGTCACTGCTTCCTGGTGGTCGTGGCCGTCCACTT CCCCTCTCTCCTGACCCCGGAGGTCCATGCTTCCCTGGACAAGTTCGTGTGTGCCGTGGG CACCGTCCTTACTGCCAAGTACCGTTAA (Handout)

GenBank format Originates from the GenBank database. Contains both a DNA sequence and annotation of feature (e.g. Location of genes). (handout)

GenBank format - HEADER LOCUS CMGLOAD 1185 bp DNA linear VRT 18-APR-2005 DEFINITION Cairina moschata (duck) gene for alpha-D globin. ACCESSION X01831 VERSION X GI:62724 KEYWORDS alpha-globin; globin. SOURCE Cairina moschata (Muscovy duck) ORGANISM Cairina moschata Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Archosauria; Aves; Neognathae; Anseriformes; Anatidae; Cairina. REFERENCE 1 (bases 1 to 1185) AUTHORS Erbil,C. and Niessing,J. TITLE The primary structure of the duck alpha D-globin gene: an unusual 5' splice junction sequence JOURNAL EMBO J. 2 (8), (1983) PUBMED COMMENT Data kindly reviewed (13-NOV-1985) by J. Niessing.

GenBank format - ORIGIN section ORIGIN 1 ctgcgtggcc tcagcccctc cacccctcca cgctgataag ataaggccag ggcgggagcg 61 cagggtgcta taagagctcg gccccgcggg tgtctccacc acagaaaccc gtcagttgcc 121 agcctgccac gccgctgccg ccatgctgac cgccgaggac aagaagctca tcgtgcaggt 181 gtgggagaag gtggctggcc accaggagga attcggaagt gaagctctgc agaggtgtgg 241 gctgggccca gggggcactc acagggtggg cagcagggag caggagccct gcagcgggtg 301 tgggctggga cccagagcgc cacggggtgc gggctgagat gggcaaagca gcagggcacc 361 aaaactgact ggcctcgctc cggcaggatg ttcctcgcct acccccagac caagacctac 421 ttcccccact tcgacctgca tcccggctct gaacaggtcc gtggccatgg caagaaagtg 481 gcggctgccc tgggcaatgc cgtgaagagc ctggacaacc tcagccaggc cctgtctgag 541 ctcagcaacc tgcatgccta caacctgcgt gttgaccctg tcaacttcaa ggcaagcggg 601 gactagggtc cttgggtctg ggggtctgag ggtgtggggt gcagggtctg ggggtccagg 661 ggtctgagtt tcctggggtc tggcagtcct gggggctgag ggccagggtc ctgtggtctt 721 gggtaccagg gtcctggggg ccagcagcca gacagcaggg gctgggattg catctgggat 781 gtgggccaga ggctgggatt gtgtttggaa tgggagctgg gcaggggcta gggccagggt 841 gggggactca gggcctcagg gggactcggg gggggactga gggagactca gggccatctg 901 tccggagcag gggtactaag ccctggtttg ccttgcagct gctggcacag tgcttccagg 961 tggtgctggc cgcacacctg ggcaaagact acagccccga gatgcatgct gcctttgaca 1021 agttcttgtc cgccgtggct gccgtgctgg ctgaaaagta cagatgagcc actgcctgca 1081 cccttgcacc ttcaataaag acaccattac cacagctctg tgtctgtgtg tgctgggact 1141 gggcatcggg ggtcccaggg agggctgggt tgcttccaca catcc //

FEATURES Location/Qualifiers source /organism="Cairina moschata" /mol_type="genomic DNA" /db_xref="taxon:8855" CAAT_signal TATA_signal precursor_RNA /note="primary transcript" exon /number=1 CDS join( , , ) /codon_start=1 /product="alpha D-globin" /protein_id="CAA " /db_xref="GI: " /db_xref="GOA:P02003" /db_xref="InterPro:IPR000971" /db_xref="InterPro:IPR002338" /db_xref="InterPro:IPR002340" /db_xref="InterPro:IPR009050" /db_xref="UniProt/Swiss-Prot:P02003" /translation="MLTAEDKKLIVQVWEKVAGHQEEFGSEALQRMFLAYPQTKTYFP HFDLHPGSEQVRGHGKKVAAALGNAVKSLDNLSQALSELSNLHAYNLRVDPVNFKLLA QCFQVVLAAHLGKDYSPEMHAAFDKFLSAVAAVLAEKYR" repeat_region /note="direct repeat 1" intron /number=1 repeat_region /note="direct repeat 1" exon /number=2 intron /number=2 exon /number=3 polyA_signal polyA_signal 1114 GenBank format - FEATURE section

Exercise: GenBank Work in groups of 2-3 people. The exercise guide is linked from the course programme. Read the guide carefully - it contains a lot of information about GenBank.