Data Available Genomic sequences Corresponding mRNA transcripts

Slides:



Advertisements
Similar presentations
RT-PCR lab You have a cell…is a certain gene on (by “on,” we mean active and producing mRNA?)? If a certain gene is on when the cell divides, the gene.
Advertisements

DNA Sequencing and Gene Analysis
MiRNA targets Using undergraduate molecular biology labs to discover targets of miRNAs in humans Adam Idica, Jordan Thompson, Irene Munk Pedersen, Pavan.
Bioinformatics page 12, part of ch. 21 Cell and Mol Biol Lab.
A Novel Third Isoform of Zebrafish Cytochrome Oxidase IV Brandon Smith Dr. Nancy Bachman, Faculty Advisor.
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Chapter 3 Cell Structures and Their Functions Dividing Cells.
DNA Replication DNA mRNA protein transcription translation replication Before each cell division the DNA must be replicated so each daughter cell can get.
Kamila Balušíková.  DNA – sequence of genes, repetitive sequence of noncoding regions  RNA  Proteins gene expression.
7.1 Techniques for Producing and Analyzing DNA SBI4UP MRS. FRANKLIN.
Protein Synthesis The genetic code – the sequence of nucleotides in DNA – is ultimately translated into the sequence of amino acids in proteins – gene.
DNA Technology- Cloning, Libraries, and PCR 17 November, 2003 Text Chapter 20.
Transcription Transcription is the synthesis of mRNA from a section of DNA. Transcription of a gene starts from a region of DNA known as the promoter.
AP Biology: Chapter 14 DNA Technologies
-The methods section of the course covers chapters 21 and 22, not chapters 20 and 21 -Paper discussion on Tuesday - assignment due at the start of class.
Genomic walking (1) To start, you need: -the DNA sequence of a small region of the chromosome -An adaptor: a small piece of DNA, nucleotides long.
1 Genetics Faculty of Agriculture Instructor: Dr. Jihad Abdallah Topic 13:Recombinant DNA Technology.
Biotechnology Methods Producing Recombinant DNAProducing Recombinant DNA Locating Specific GenesLocating Specific Genes Studying DNA SequencesStudying.
DNA Technologies.
POLYMERASE CHAIN REACTION. DNA Structure DNA consists of two molecules that are arranged into a ladder-like structure called a Double Helix. A molecule.
UNIT 1 INFORMATION METHODS OF A CELL. What do you know about DNA? Building blocks are called? –nucleotides The shape is ? –Double helix The three primary.
Recombinant DNA Technology (8)
 DNA (gene mutations, paternity, organs compatibility for transplantations)  RNA  Proteins (gene expression)
Transcription Packet #20 5/31/2016 2:49 AM1. Introduction  The process by which information encoded in DNA specifies the sequences of amino acids in.
DNA replication Semiconservative replication Replicative Structures Replication Fork.
Fig Fig Fig Fig Fig
Chap. 5 Problem 1 Recessive mutations must be present in two copies (homozygous) in diploid organisms to show a phenotype (Fig. 5.2). These mutations show.
TRANSCRIPTION Copying of the DNA code for a protein into RNA Copying of the DNA code for a protein into RNA 4 Steps: 4 Steps: Initiation Initiation Elongation.
Chapter 10: Genetic Engineering- A Revolution in Molecular Biology.
While replication, one strand will form a continuous copy while the other form a series of short “Okazaki” fragments Genetic traits can be transferred.
Molecular Tools. Recombinant DNA Restriction enzymes Vectors Ligase and other enzymes.
Recombinant DNA Technology. DNA replication refers to the scientific process in which a specific sequence of DNA is replicated in vitro, to produce multiple.
The C3HC4-Type RING Zinc Finger and MYB Transcription Factor Families Matthew Taube June 5, 2008 HC70AL.
DNA Technology Ch. 20. The Human Genome The human genome has over 3 billion base pairs 97% does not code for proteins Called “Junk DNA” or “Noncoding.
Transcription and The Genetic Code From DNA to RNA.
Green with envy?? Jelly fish “GFP” Transformed vertebrates.
DNA Sequencing First generation techniques
Mahmuda Akter, Paige Fairrow-Davis, and Rebecca Seipelt-Thiemann
Protein Synthesis - Transcription
Eukaryotic Gene Regulation
Part 3 Gene Technology & Medicine
Figure 1. RT–PCR identification of an abnormal transcript of the PTPN6 gene in normal and leukemic bone marrow cells and cell line. (a) Diagrammatic representation.
Chapter 7 Recombinant DNA Technology and Genomics
Experimental Verification Department of Genetic Medicine
Expression of the Genome
Transcription & Gene Expression
The Role of Recombinant DNA Technology in Biotechnology
Transcription.
Expression of the Genome
How are areas of DNA that don’t code for proteins (genes) used by our cells? How can we make use of these areas?
Jeopardy! Molecular Genetics Edition.
Chapter 14 Bioinformatics—the study of a genome
Screening a Library for Clones Carrying a Gene of Interest
Recombinant DNA Technology
Small RNA Sample Preparation
CHAPTER 12 DNA Technology and the Human Genome
Relationship between Genotype and Phenotype
Recombinant DNA Technology
mRNA Sequencing Sample Preparation
Transcription.
SBI 4U: Metablic Processes
Gene structure DNA replication
Transcription Definition
Transcription Packet #21 12/8/ :59 PM.
TRANSCRIPTION AHL Topic 7.3 IB Biology Miss Werba.
Protein Synthesis The genetic code – the sequence of nucleotides in DNA – is ultimately translated into the sequence of amino acids in proteins – gene.
Volume 3, Issue 4, Pages (April 2013)
Digital Gene Expression – Tag Profiling Sample Preparation
Protein Synthesis The genetic code – the sequence of nucleotides in DNA – is ultimately translated into the sequence of amino acids in proteins – gene.
Chapter 6.2 McGraw-Hill Ryerson Biology 12 (2011)
Figure Genetic characterization of the novel GYG1 gene mutation (A) GYG1_cDNA sequence and position of primers used. Genetic characterization of the novel.
Presentation transcript:

Data Available Genomic sequences Corresponding mRNA transcripts SNP database

What we are searching for A/G discrepancy between gDNA & mRNA Discrepancy encodes nonsynonymous amino acid change Site is not SNP Possible self-complementarity within pre-mRNA

RNA Editing Dataflow System (REDS)

Stage 1: Finding Differences Scans the genome and mRNA for discrepancies

Stage 2: Filter SNPs Removes all validated SNPs from the search

Stage 3: Perform Foldback Searches results for self-complementary sequences

REDS Results

Does REDS work?

Validating REDS Returns all known sites possible within set parameters Exceptions: mRNA not in database used by REDS Largest foldback is less than 5 base pairs Finds self-complementarity present in known RNA secondary structures these results are just a few among 100s of possible self-complementarity sites

Shot of REDS with other foldback predictions

Using REDS Results: Ranked by Basepairing length Hit Chr. mRNA acc. # Reference sequence # Site position Continuous basepairing length 1 chr14 BC034554 NM_001085: serpin peptidase inhibitor, clade A, member 3) 94155441 90 2 chr1 AK127250 NM_000851: glutathione S-transferase M5) 110057883 50 3 CR627441 NM_001077700: mesoderm induction early response 1 isoform 67223383 32 4 chr20 BC027448 4071869 30 5 chr3 BC029869 NM_001248: ectonucleoside triphosphate diphosphohydrolase 40408547 6 AL832131 NM_017895: (Asp-Glu-Ala-Asp) box polypeptide 27 47289277 28 7 chr10 AK123885 NM_207372: domain containing 4B 82338403 26 8 chr2 BC045801 Homo sapiens hypothetical protein LOC554226, mRNA 132622172 9 chr9 AF098483 NM_021144: Homo sapiens transcriptional coactivator p52 mRNA NM_033222: SFRS1 interacting protein 1 isoform 2 15464194 25 10 AF098482 11 chr12 AK001499 NM_018164: hypothetical protein LOC55726 26980918 24 12 BC040032 15476021 23 13 AK074357 NM_021627: specific protease 2 186809754 14 chr21 AJ001866 NM_001001894: tetratricopeptide repeat domain 3 NM_003316: tetratricopeptide repeat domain 3 37389527 15 AK222568 NM_018101: cell division cycle associated 8 37940054 22 16 chr22 U06935 NM_003216: thyrotrophic embryonic factor 40113303 17 chr19 BX648389 NM_173481: hypothetical protein LOC126353 709603 18 709625 19 BX648547 NM_145858: crystallin, zeta-like 1 33916232 20 AK225162 NM_022895: hypothetical protein LOC64897 119926554 21 chr11 CR627226 NM_015065: exophilin 5 107917676 AK223179 NM_006675: tetraspanin 9 3261243 AK057985 NM_002972: binding factor 1 isoform a 49247963 BC094885 NM_014871: ubiquitin specific protease 52 54999370 AK122695 NM_001033: ribonucleoside-diphosphate reductase M1 chain 4116107 L41669 NM_152437: zinc finger protein 664 123062981 27 chr5 AK074318 NM_016340: domain-containing guanine nucleotide 130816647 chr7 X16323 NM_000601: hepatocyte growth factor isoform 1 NM_001010931: hepatocyte growth factor isoform 2 precursor NM_001010932: hepatocyte growth factor isoform 3 precursor NM_001010933: hepatocyte growth factor isoform 4 precursor NM_001010934: hepatocyte growth factor isoform 5 precursor 81197020

Where REDS & Mfold Agree

Hit 6: 28 base pairs AL832131 ((Asp-Glu-Ala-Asp) box polypeptide 27) mfold with 2500nt

Hit 20: 22 base pairs AK225162 (hypothetical protein LOC64897) mfold with 2500nt

Hit 21: 22 base pairs CR627226 (exophilin 5) mfold with 2500nt

Where REDS & MFold Conflict

AK074357 (specific protease 2) Hit 13: 23 base pairs AK074357 (specific protease 2) mfold with 2500nt

Hit 17 & 18: 22 bp BX648389 (hypothetical protein LOC126353) mfold with 2500nt

Reconciling the Discrepancies REDS predicts self-complementarity Mfold uses energy minimization calculations What would be an accurate way of determining the utility of the folding heuristic?

Hit 7: 26 base pairs - mfold with 2500nt AK123885 (domain containing 4B)

Hit 15: 22 base pairs AK222568 (cell division cycle associated 8) mfold with 2500nt

Hit 16: 22 base pairs U06935 (thyrotrophic embryonic factor) mfold with 2500nt

Experimental Validation Reverse transcription on mammalian brain RNA. PCR on cDNA and gDNA pairs. Amplicon purification and isolation for sequencing. Comparison of cDNA and gDNA electropherograms. Possible subcloning of amplicon.

Reverse Transcription Synthesizing a cDNA library to be used as a PCR template Editing is most upregulated in the brain single-stranded RNA is reverse transcribed into complementary DNA (cDNA) by using total cellular RNA or poly(A) RNA, a reverse transcriptase enzyme, a primer, dNTPs and an RNase inhibitor 32

Amplifying a DNA fragment via enzymatic replication PCR Amplifying a DNA fragment via enzymatic replication isolating and exponentially amplifying a fragment of DNA, via enzymatic replication, without using a living organism 33

Designing Primers for PCR Basic Criteria for Primers: Equal amount of all 4 nucleotides About 20-25 base pairs product should extend over 2+ exons BLAST sequence to check uniqueness Product should be 200-800 nucleotides long

Purifying and Isolating an Amplicon Gel Electrophoresis expected size? Change PCR parameters NO YES Phenol Chloroform Extraction to remove proteins Preparative Gel expected size? Product lost during purification NO YES Isolate desired bands for Agarose Gel Purification Gel Electrophoresis expected size? NO Isolated incorrect band Low product yield YES Determine concentration & Ready to be Sequenced 35

PCR Amplicons V0 PCR here

Preparative Gels 15 – 20 V0

Ready for Sequencing V0 15 - 20

Analyzing Sequences V0 cDNA

Validating/Confirming RNA Editing V0 gDNA

Further Characterization … By Sublconing What percentage of gene transcripts are edited? What effect does editing have on alternative splicing or translation? … By knockout studies What functional impact does the modification have in the organism?

Subcloning Moving our amplicon of interest into a destination vector isolating and exponentially amplifying a fragment of DNA, via enzymatic replication, without using a living organism 42

SNX subclones

Our Searches Lab Reference # Gene Name REDS Hit # Analysis Status 3 SNX n/a Subcloned – no further evaluation 4 FZD Subcloning 5 SMAP Amplified incorrect sequence 6 V0 Confirmed 5 sites; to be subcloned 7 UNK Ready for sequencing 8 SHDC To be amplified 9 PSIP 12 Sequenced – subclone? 10 SPP 13 Sequenced – no further evaluation 11 TSP 22 HYP 20 AGA 14 HY2P 17, 18 15 EXPH 21 16 PANK2 29 17 BP4 34 18 GTF2I 40 19 ABP4 50 ACA 51

Summary of Experimental Data

Conclusions 2 found by REDS without Stage 3 do not appear to be edited 16 predicted with Stage 3  1 with 5 sites confirmed to date, maybe more Demonstrates importance of strong foldback structure Experimental REDS data set sorted by longest continuous base pairing ranks known sites low other contributing factors that should be weighted more heavily in ranking Negative sequencing results are disappointing but will be useful in making REDS tool a more accurate predictor of targets