CRISPR-associated Proteins

Slides:



Advertisements
Similar presentations
Presented By Dr. Shazzad Hosain Asst. Prof. EECS, NSU Multiple Sequence Alignment Motif Finding and Gene Prediction.
Advertisements

CITE EVIDENCE THAT ORGANISMS ARE LINKED BY LINES OF DESCENT FROM COMMON ANCESTRY LEARNING GOAL.
Testing Bacterial Proteins for Evidence of Horizontal Gene Transfer James Godde, John Iverson, Kabi Neupane, and Sara Penhale.
Identifying functional residues of proteins from sequence info Using MSA (multiple sequence alignment) - search for remote homologs using HMMs or profiles.
Genomics and bioinformatics summary 1. Gene finding: computer searches, cDNAs, ESTs, 2.Microarrays 3.Use BLAST to find homologous sequences 4.Multiple.
Chromosomes carry genetic information
Bio 178 Lecture 29 DNA and Gene Expression. Reading Chapters 14 & 15 Quiz Material Questions on P 300 & 318 Chapters 14 & 15 Quizzes on Text Website (
© Wiley Publishing All Rights Reserved. Biological Sequences.
Identifying recombination events in phage Giles through presence of repeat sequences MEGAN MAIR.
DNA Structure Replication Functions (Stores and provides copies of genetic material- genes) – Blueprint (genes) for Protein Synthesis (Enzymes and cell.
SC.L.16.3 Describe the basic process of DNA replication and how it relates to the transmission and conservation of the genetic information.
Section 8.4: Transcription.
Multiple Sequence Alignment CSC391/691 Bioinformatics Spring 2004 Fetrow/Burg/Miller (Slides by J. Burg)
DNA Replication Accel Bio 2014.
Replication of Nucleic Acids. 2 Because sometimes this... 3.
Multiple Sequence Alignment May 12, 2009 Announcements Quiz #2 return (average 30) Hand in homework #7 Learning objectives-Understand ClustalW Homework#8-Due.
CRISPRs: An Exploration into Spacers and Captured Viral Similarity By: Katie Richeson November 19, 2009 Genomics Laboratory Methods Davidson College, Davidson,
Genome Organization and Evolution. Assignment For 2/24/04 Read: Lesk, Chapter 2 Exercises 2.1, 2.5, 2.7, p 110 Problem 2.2, p 112 Weblems 2.4, 2.7, pp.
* only 17% of SNPs implicated in freshwater adaptation map to coding sequences Many, many mapping studies find prevalent noncoding QTLs.
Multiple Alignment and Phylogenetic Trees Csc 487/687 Computing for Bioinformatics.
DNA Replication Accel Bio Overview: What is DNA for? The purpose of DNA is to store the information necessary to allow cells & organisms to function.
Nucleic Acids (How much do you recall?) - Structure - Replication - Packaging Refer to chapter 12.1, 12.2 in text.
 DNA must be copied (before a cell divides)  The DNA molecule produces 2 IDENTICAL new complementary strands following the rules of base pairing: A-T,
Do Now!!  Why must DNA be replicated?  Where do you think replication takes place?  Are mistakes ever made while replicating DNA?  Why must DNA be.
DNA to Protein – 12 Part one AP Biology. What is a Gene? A gene is a sequence of DNA that contains the information or the code for a protein or an RNA.
Ch. 10: Nucleic Acids and Protein Synthesis - DNA controls the production of proteins within the cell. - These proteins form the structural units of cells.
Pattern Matching Rhys Price Jones Anne R. Haake. What is pattern matching? Pattern matching is the procedure of scanning a nucleic acid or protein sequence.
Homologous Recombination
Protein and RNA Families
Breakthrough Technology to Improve the Range and Accuracy of Cas9 Target.
Nucleic Acids Comparing DNA and RNA. Both are made of nucleotides that contain  5-carbon sugar,  a phosphate group,  nitrogenous base.
(Foundation Block) Dr. Sumbul Fatma
1 From Mendel to Genomics Historically –Identify or create mutations, follow inheritance –Determine linkage, create maps Now: Genomics –Not just a gene,
LEHNINGER PRINCIPLES OF BIOCHEMISTRY Sixth Edition David L. Nelson and Michael M. Cox © 2013 W. H. Freeman and Company CHAPTER 8 Nucleotides and Nucleic.
Lecture-7 Genome editing CRISPR
CRISPR Direct Repeat Sequences Olivia Ho-Shing 22 November 2009.
RiboNucleic Acid (RNA) -Contrast RNA and DNA. -Explain the process of transcription. - Differentiate between the 3 main types of RNA -Differentiate between.
Protein Structure and Function. Proteins are organic compounds made from amino acids held together by peptide bonds.
DNA Structure Replication Functions (Stores and provides copies of genetic material- genes) – Blueprint (genes) for Protein Synthesis (Enzymes and cell.
Pairwise Sequence Alignment. Three modifications for local alignment The scoring system uses negative scores for mismatches The minimum score for.
CRISPR BTE 302 FINAL PRESENTATION.  Ehsan Sakib ( )  Prateem Das ( )  Olia (136032)
Copyright © 2005 Pearson Education, Inc. publishing as Benjamin Cummings PowerPoint Lectures for Biology: Concepts and Connections, Fifth Edition – Campbell,
Introduction to Genetics 304 Gene Regulation in Prokaryotes Instructor: Dr. Shelagh Campbell rta.ca.
Molecular biology (2) (Foundation Block). Objectives By the end of this lecture, the students should be able to: To understand DNA replication To know.
Sequence based searches:
University of Pittsburgh
13.1 RNA.
CRISPR + CAS = Defensive or Immune System
Regulation of Gene Expression
CRISPR-Cas Systems: Prokaryotes Upgrade to Adaptive Immunity
Do Now!!  Why must DNA be replicated?
There are four levels of structure in proteins
Archaeal CRISPR-based immune systems: exchangeable functional modules
CRISPR-based adaptive and heritable immunity in prokaryotes
CRISPR CRISPRs (clustered regularly interspaced short palindromic repeats) are repetitive nucleotide sequences followed by a short spacer DNA segments.
CRISPR-Cas Systems: Prokaryotes Upgrade to Adaptive Immunity
Addison V. Wright, James K. Nuñez, Jennifer A. Doudna  Cell 
Figure 1 Adaptive immune system of bacteria and archaea
Volume 164, Issue 1, Pages (January 2016)
RNAi: Prokaryotes Get in on the Act
The Evolution of Antiviral Defense Systems
Volume 88, Issue 2, Pages (January 1997)
Zhao Zhang, Yuelin Zhang, Fei Gao, Shuo Han, Kathryn S
Unit Genomic sequencing
A CRISPR Approach to Gene Targeting
Immunological Comparisons
The structure of ribosomal protein S7 at 1
Essential knowledge 1.B.1:
Volume 97, Issue 6, Pages (June 1999)
Molecular biology (2) (Foundation Block).
Presentation transcript:

CRISPR-associated Proteins Sarah Pyfrom sapyfrom@davidson.edu

Research Questions What Cas-proteins does our species share with the 10 other species we chose to study? If so, how do they compare? How do Cas-proteins function in relation to CRISPR units? [Edit]: Why did JGI change its annotation?

Cas Proteins Proteins that are almost always associated with (near) CRISPR sequences Originally four major families Now, at least 45 families total

JGI annotation Cas1 Cas2 Cas3 Cas4 TM1800 TM1801 Cas1 Cas2 Cas4 Cas5 “Old” Cas-Proteins “New” Cas-Proteins Cas1 Cas2 Cas3 Cas4 TM1800 TM1801 Cas1 Cas2 Cas4 Cas5 Cas6 Csh1 Csh2

Changes: TM1800= Cas5 TM1801=Csh2 Hypothetical protein = Csh1 Part of hypothetical protein = Cas6 Cas3 = hypothetical protein Cas4: MTDSSGDPVDRFLAAARDESAELPFRLTGVMFQYYVVCER ELWFLSRDVEIDRDTPAIVRGSDVDDSAYADKRRDVRVDGII AIDVLDSGEILEVKPSSSMTEPARLQLLFYLWYLDRVTGVEK TGVLAHPAEKRRETVELTPETSAEVESAIEGIRAVVTAESPPP AEEKPVCDSCAYHDFCWSC (red = original Cas4)

Map of CRISPR region TM1800 TM1801 Transposases Cas3 Hypothetical proteins CRISPR Cas1 Unidentified Csh1 Cas2 Cas4 Cas5 Cas6 Csh2

Cas1 (from Sulfolobus solfataricus) high-affinity nucleic acid binding protein binds DNA, RNA and DNA–RNA hybrid sequence non-specific in a multi-site binding mode promotes the hybridization of complementary nucleic acid strands. From: SSO1450 – A CAS1 protein from Sulfolobus solfataricus P2 with high affinity for RNA and DNA

Usually similar to helicases Unwinds double-stranded DNA Thought to be involved in DNA metabolism and repair Often resemble Rec-B exonucleases Break down nucleic acid strains Thought to be involved in DNA metabolism Cas2 function unknown From Genbank

Often found with Cas1, and Cas6. Share and N-terminal region of about 43 amino acids in length Are usually 210-265 amino acids long Characterized by GhGxxxxxGhG motif, where h indicates a hydrophobic residue, at the C-terminus From: EMBL IPR013422 profile page (: http://www.ebi.ac.uk/interpro/IEntry?ac=IPR013422) From: Sanger PF09559 Profile page ( http://pfam.sanger.ac.uk/family/PF09559)

Csh1 and Csh2? Protein families determined for ease of alignment Often large differences between species Alignment easier if protein “soup” is divided into more readily- compared subgroups.

CRISPRs thought to create stable secondary RNA structures Spacers remain associated with their DR neighbors. Provide a way for Cas- Proteins to recognize the spacers and facilitate immune response. From: Evolutionary conservation of sequence and secondary structures in CRISPR repeats

Cas-Proteins and Immunity Thought to act like Slicer and Dicer (eukaryotic counterpart) Create siRNA that will inhibit/break down invading RNA Not known if Cas-proteins are involved in integrating pathogenic DNA into spacers Video of eukaryotic siRNA process: http://www.youtube.com/watch?v=D- 77BvIOLd0

Alignments of Cas Compared Cas1, Cas2, Cas3 etc. proteins across all 10 species…

Comparison with other species: (based on “old” proteins) Cas1 Cas2 Cas3 Cas4 TM1800 TM1801 H: vallismortis H. volcanii H. sulfurifontis X H. sinaiiensis H. californiae H. utahensis H. mucosum H. mediteranei H. denitrificans H. mukohataei

Phylogenetic tree comparing amino acid sequences for all CAS-proteins 2 2 Halomicrobium mukohataei 2 Haloarcula sinaiiensis 2 1800 Haloarcula californiae 1 4 Haloferax dentrificans 3 1 Haloferax mediteranei 1801 Haloferax sulfurifontis 1801 1801 Haloferax mucosum 1801 1801 Halorhabdus utahensis 3 1 1 1800 2 3 3 2 1800 1 1800 1800 1 3 1801 4 4 4 4 4 1 4 1800 3 3 1800 1801

Cas 1 and Cas2 did not change

Cas 4 JGI revision shortened this protein Would expect low sequence similarity near end of protein

TM1801 (Csh2) Revision by JGI simply renamed this protein Would expect sequence similarity

Map of CRISPR region TM1800 TM1801 Transposases Cas3 Hypothetical proteins CRISPR Cas1 Unidentified Csh1 Cas2 Cas4 Cas5 Cas6 Csh2

…but we do know everything that everybody else knows. In conclusion: We don’t know much…. …but we do know everything that everybody else knows.

Questions?

References Kunin, V., Sorek, R., Hugenholtz, P. (2007) Evolutionary conservation of sequence and secondary structures in CRISPR repeats. Genome Biology.http://genomebiology.com/2007/8/4/R61. Accessed 24 Nov, 2009. Haft, D.H., Selengut, J. Mongodin, EF., Nelson, K.E. (2005). A guild of 45 CRISPR-associated (Cas) protein families and multiple CRISPR/Cas subtypes exist in prokaryotic genomes. PLoS Comput Biol. http://www.ncbi.nlm.nih.gov/pubmed/16292354. Accessed 24 Nov 2009.