Reading the blueprint of life DNA sequencing. Introduction The blueprint of life is contained in the DNA in the nuclei of eukaryotic cells and simply.

Slides:



Advertisements
Similar presentations
DNA Technology & Gene Mapping Biotechnology has led to many advances in science and medicine including the creation of DNA clones via recombinant clones,
Advertisements

Recombinant DNA technology
Bioinformatics. Bioinformatics is an applied science that uses computer programs to access molecular biology databanks to make inferences about the information.
ATG GAG GAA GAA GAT GAA GAG ATC TTA TCG TCT TCC GAT TGC GAC GAT TCC AGC GAT AGT TAC AAG GAT GAT TCT CAA GAT TCT GAA GGA GAA AAC GAT AAC CCT GAG TGC GAA.
Supplementary Fig.1: oligonucleotide primer sequences.
13-2 Manipulating DNA.
Biotech Continued… How do forensic scientists determine who’s blood has been left at a crime scene? How do forensic scientists determine who’s blood.
chromosome organization, what about genome organization?
Introduction to Molecular Biology. G-C and A-T pairing.
Manipulating the Genome: DNA Cloning and Analysis 20.1 – 20.3 Lesson 4.8.
What are the three steps in PCR?. Denaturation Hybridization of Primer DNA replication.
Reading the Blueprint of Life
IGEM Arsenic Bioremediation Possibly finished biobrick for ArsR by adding a RBS and terminator. Will send for sequencing today or Monday.
DNA Technology- Cloning, Libraries, and PCR 17 November, 2003 Text Chapter 20.
From Haystacks to Needles AP Biology Fall Isolating Genes  Gene library: a collection of bacteria that house different cloned DNA fragments, one.
Gene Technology Chapters 11 & 13. Gene Expression 0 Genome 0 Our complete genetic information 0 Gene expression 0 Turning parts of a chromosome “on” and.
Nature and Action of the Gene
Chapter 20 DNA Technology. DNA Cloning  Gene cloning allows scientists to work with small sections of DNA (single genes) in isolation. –Exactly what.
1 Genetics Faculty of Agriculture Instructor: Dr. Jihad Abdallah Topic 13:Recombinant DNA Technology.
Biological Dynamics Group Central Dogma: DNA->RNA->Protein.
Technological Solutions. In 1977 Sanger et al. were able to work out the complete nucleotide sequence in a virus – (Phage 0X174) This breakthrough allowed.
More on translation. How DNA codes proteins The primary structure of each protein (the sequence of amino acids in the polypeptide chains that make up.
Module 1 Section 1.3 DNA Technology
Cutting and Pasting DNA The cutters are called restriction enzymes, they cut DNA at specific nucleotide sequences.
DNA Technology Chapter 12. Transgenic Organisms Contain recombinant DNA – Nucleotide sequences from 2+ different sources Cells express original AND newly.
Supplemental Table S1 For Site Directed Mutagenesis and cloning of constructs P9GF:5’ GAC GCT ACT TCA CTA TAG ATA GGA AGT TCA TTT C 3’ P9GR:5’ GAA ATG.
PART 1 - DNA REPLICATION PART 2 - TRANSCRIPTION AND TRANSLATION.
Chapter 21 Eukaryotic Genome Sequences
Cloning of Atrolysin A from Crotulas atrox AJ Goos and Kayla Ohrt.
Success criteria - PCR By the end of this lesson we will be able to: 1. The polymerase chain reaction (PCR) is a technique for the amplification ( making.
Chap. 1 basic concepts of Molecular Biology Introduction to Computational Molecular Biology Chapter 1.
GENE SEQUENCING. INTRODUCTION CELL The cells contain the nucleus. The chromosomes are present within the nucleus.
Human Genomics. Writing in RED indicates the SQA outcomes. Writing in BLACK explains these outcomes in depth.
Chapter 10: Genetic Engineering- A Revolution in Molecular Biology.
Locating and sequencing genes
Manipulating DNA. Scientists use their knowledge of the structure of DNA and its chemical properties to study and change DNA molecules Different techniques.
Molecular Genetic Technologies Gel Electrophoresis PCR Restriction & ligation Enzymes Recombinant plasmids and transformation DNA microarrays DNA profiling.
Recombinant DNA Technology. DNA replication refers to the scientific process in which a specific sequence of DNA is replicated in vitro, to produce multiple.
ESTs Ian Keller Laboratory Techniques in Molecular Bio.
1 PCR: identification, amplification, or cloning of DNA through DNA synthesis DNA synthesis, whether PCR or DNA replication in a cell, is carried out by.
DNA Technology Ch. 20. The Human Genome The human genome has over 3 billion base pairs 97% does not code for proteins Called “Junk DNA” or “Noncoding.
DNA Technology & Genomics
Gene Technologies and Human ApplicationsSection 3 Section 3: Gene Technologies in Detail Preview Bellringer Key Ideas Basic Tools for Genetic Manipulation.
Structure and Function of DNA DNA Replication and Protein Synthesis.
Chapter 14 GENETIC TECHNOLOGY. A. Manipulation and Modification of DNA 1. Restriction Enzymes Recognize specific sequences of DNA (usually palindromes)
Green with envy?? Jelly fish “GFP” Transformed vertebrates.
Example 1 DNA Triplet mRNA Codon tRNA anticodon A U A T A U G C G
Title: Studying whole genomes Homework: learning package 14 for Thursday 21 June 2016.
DNA, RNA and Protein.
bacteria and eukaryotes
Biotechnology.
Introduction to Biotechnology Transformation and more!
Part 3 Gene Technology & Medicine
Success criteria - PCR By the end of this lesson we will be know:
Chapter 7 Recombinant DNA Technology and Genomics
Cloning Overview DNA can be cloned into bacterial plasmids for research or commercial applications. The recombinant plasmids can be used as a source of.
AMPLIFYING AND ANALYZING DNA.
DNA By: Mr. Kauffman.
Jeopardy! Molecular Genetics Edition.
Chapter 14 Bioinformatics—the study of a genome
DNA Technology.
Gene architecture and sequence annotation
DNA Sequencing The DNA from the genome is chopped into bits- whole chromosomes are too large to deal with, so the DNA is broken into manageably-sized overlapping.
The student is expected to: (6H) describe how techniques such as DNA fingerprinting, genetic modifications, and chromosomal analysis are used to study.
CHAPTER 12 DNA Technology and the Human Genome
More on translation.
Reading Frames and ORF’s
DNA Profiling Vocabulary
Using the DNA Sequence Knowing the sequence of an organism’s DNA allows researchers to study specific genes, to compare them with the genes of other organisms,
Presentation transcript:

Reading the blueprint of life DNA sequencing

Introduction The blueprint of life is contained in the DNA in the nuclei of eukaryotic cells and simply within prokaryotic cells. Human genome project – just obtain the list of approximately 3x10 9 bases (As, Cs, Gs and Ts) in the 23 chromosomes. Extraction of useful information from this list and genome sequence of other organisms relies on computer-intensive data handling – Bioinformatics.

Sequencing The DNA from the genome is chopped into bits- whole chromosomes are too large to deal with, so the DNA is broken into manageably-sized overlapping segments. The DNA is amplified by cloning into bacteria (PCR, see later, doesn’t produce enough and requires sequence information for the primers). It is then denatured (ie. melted), so that the two strands split apart.

Sequencing- continued Denatured DNA is added to reaction mix with: – a primer (to start complementary pairing), –DNA polymerase –nucleotides including special ones called dideoxynucleotides. These special nucleotides do not allow further nucleotides to be added to the chain. So in a mix with dideoxy-A, every time a dideoxy-A is added (small proportion of As), the reaction ends. This results in different length fragments. The dideoxynucleotides are fluorescently tagged. Fragments can be separated out on a gel by electrophoresis and their length calculated. Working out DNA sequence ~ jigsaw puzzle.

DNA sequencing – preparation In order to sequence a piece of DNA, first need to amplify it. This is sometimes done by a process called polymerase chain reaction (PCR). PCR: The necessary ingredients for DNA replication are 1) the DNA itself, 2) DNA polymerase, 3) free nucleotides and 4) primers - Place all these in a test tube.

DNA amplification - PCR Step 1 – heat to c. 95°C for 30s – this denatures the DNA and unzips the two strands Step 2 – cool to c. 55°C for 20s, this causes the primer to bind to the DNA Step 3 – heat to c. 72°C for a minute per kb (kilobase)– this allows the polymerase to catalyse the addition of free nucleotides to the primer, replicating the DNA. So in two minutes a c. 1kb piece of DNA is replicated. Repeat for a few hours  a million copies.

DNA Amplification - cloning An alternative to PCR is to insert the piece of DNA into the DNA of a bacterium. Replicating the bacterium thus replicates the DNA. Cf. recombinant DNA technology

Sequencing using gel electrophoresis Here is a gel with 28 DNA samples: green bands represent A, blue C, yellow G and red T. Small molecules move faster.

Sequence assembly using mapping Originally sequencing was performed by cutting the chromosomes into large pieces which were cloned into bacteria, creating a whole library of DNA segments. The segments were cut open to look for common sequence landmarks in overlapping fragments. These were used to fingerprint the fragments, so that it was known where in the chromosome the fragment was- this is called mapping. The fragments were cut into smaller pieces and the process repeated and the small fragments were sequenced. Finally the whole sequence is known (in terms of short fragments and their locations on the chromosome).

Shotgun sequencing Shotgun sequencing dispenses with the need for mapping and so is much faster. It involves chopping the DNA into fragments of size c base pairs (bps) and bps, sequencing the first and last 500 bps of each fragment. It then uses computer algorithms to assemble the entire sequence from the sequenced fragments.

Speed and accuracy of sequencing Shotgun sequencing is much faster- it took a matter of months to obtain a draft sequence of the fruit-fly, Drosophila Melanogaster (135Mbps), when the state-funded conventional sequencing effort had taken several years to achieve a similar level of completion. BUT assembly of pieces, in eg. the human (3x10 9 bps), requires very powerful computers AND repetitive DNA, which is common in eukaryotic genomes, causes great difficulties in the assembly process – may get it wrong.

Acquisition of sequence data Genomes must be sequenced several times over on average, both to ensure complete coverage of the genome is achieved, and because sequencing data is somewhat error-prone. Increases in the efficiency of sequencing have led to a year on year increase in the rate of new sequence data acquisition:

Statistics of genome sequences Statistics can be global or local: Base composition of genomes: Bacteria (E. coli): 25% A, 25% C, 25% G, 25% T Mosquito (P. falciparum): 82%A+T Human: 59%A+T Translation initiation: ATG is the near universal motif (codon) indicating the start of translation in DNA coding sequence.

Databases of sequence information Internet has become a vital resource in making sequence data generally available to the biological community at large. Examples: GenBank ( EMBL ( DDBJ ( Used for: gene prediction, protein structure/ function prediction, homology searching

Extracting important information The most important parts of the genome are the genes. Efforts have been made to identify genes out of sequence data. Expressed sequence tags (ESTs) are short pieces of sequence data that correspond to mRNAs found in cells of the organism. ESTs are produced by purifying mRNA from cells and then using an enzyme called reverse transcriptase to convert these to copy DNA (cDNA). The DNA is then cloned in bacteria and sequenced. The sequence obtained is usually only short (c. 700 base pairs) and may not be very accurate, but ESTs still provide very useful information.

Gene prediction A weakness of ESTs is that it is very difficult to obtain them for genes which are expressed at a low level/ only under certain conditions, also slow, so People try to predict where in sequence the genes are. In prokaryotes, just look for long stretches of DNA without stop codon in any of the 6 reading frames.

Open reading frames There are 6 reading frames, 3 forwards: And 3 backwards (on the other strand). A frame is said to be open if it contains long stretches without a stop codon. [Lower lines are single-letter amino acid codes, *=stop.] 5' 3' atgcccaagctgaatagcgtagaggggttttcatcatttgaggacgatgtataa 1 atg ccc aag ctg aat agc gta gag ggg ttt tca tca ttt gag gac gat gta taa M P K L N S V E G F S S F E D D V * 2 tgc cca agc tga ata gcg tag agg ggt ttt cat cat ttg agg acg atg tat C P S * I A * R G F H H L R T M Y 3 gcc caa gct gaa tag cgt aga ggg gtt ttc atc att tga gga cga tgt ata A Q A E * R R G V F I I * G R C I

Gene prediction in eukaryotes In bacteria, open reading frames (ORFs) are pretty much enough to indicate genes, but in eukaryotes finding genes is more complicated, because 1.Eukaryotic DNA is roughly 97-98% noncoding- in such a large amount, ORFs may exist by chance. 2.Eukaryotic DNA contains introns, so finding the start and the end of a gene is not enough- also have to find which bits (introns) to edit out of sequence. Also introns break up open reading frames.

Introns - reminder Mentioned in “Introduction to Molecular Biology” These are pieces of DNA within genes, which are transcribed but then spliced out of the RNA before it is translated. They make it much harder to find genes, since finding open reading frames is not enough, you also need to find where introns and exons start and end.

Conclusions Sequencing DNA involves: –Amplifying it by PCR or cloning –Chopping it up into manageable bits –Replicating it with fluorescently-tagged dideoxynucleotides –Running the different length fragments on a gel and reading this –Assembling the pieces (sequences of manageable bits). Shotgun sequencing is faster than mapping- based assembly methods, but can have accuracy problems.

Conclusions Sequence data is stored in online databases Extracting useful information and patterns from such data is part of bioinformatics and often employs intelligent systems techniques.

Next block of lectures History of genomics Introduction to bioinformatics More on gene prediction