CS374 A Zero-Knowledge Based Introduction to Biology.

Slides:



Advertisements
Similar presentations
BNFO 615 Data Analysis in Bioinformatics Instructor Zhi Wei.
Advertisements

• Exam II Tuesday 5/10 – Bring a scantron with you!
5’ C 3’ OH (free) 1’ C 5’ PO4 (free) DNA is a linear polymer of nucleotide subunits joined together by phosphodiester bonds - covalent bonds between.
A Zero-Knowledge Based Introduction to Biology Cory McLean 26 Sep 2008 Thanks to George Asimenos.
CS262 Introduction to Biology. Sources John Kimball’s Biology Pages Wikipedia Warning: ∀ rule ∃ exception.
© 2010 Pearson Education, Inc. Lectures by Chris C. Romero, updated by Edward J. Zalisko PowerPoint ® Lectures for Campbell Essential Biology, Fourth Edition.
Exciting Developments in Molecular Biology As seen by an amateur.
Welcome to CS262: Computational Genomics Instructor: Serafim Batzoglou TA: Eugene Fratkin Tuesday&Thursday 2:45-4:00 Skilling Auditorium.
RNA Ribonucleic Acid.
The Molecular Biology of the Gene Identifying the Genetic Material Mendel’s experiments—inherit chromosomes that contain genes The Question now: –What.
Transcription and Translation
Unit 7 RNA, Protein Synthesis & Gene Expression Chapter 10-2, 10-3
How does DNA work? What is a gene?
Chapter 2 An Introduction to Genes and Genomes. Introduction to Molecular Biology.
Proteins are made by decoding the Information in DNA Proteins are not built directly from DNA.
Protein Synthesis. DNA RNA Proteins (Transcription) (Translation) DNA (genetic information stored in genes) RNA (working copies of genes) Proteins (functional.
CHAPTER 12 PROTEIN SYNTHESIS AND MUTATIONS -RNA -PROTEIN SYNTHESIS -MUTATIONS.
DNA.
How Proteins Are Made Mrs. Wolfe. DNA: instructions for making proteins Proteins are built by the cell according to your DNA What kinds of proteins are.
Translation PROTEIN SYNTHESIS. 4 Components used in Translation 1.mRNA- the message to be translated into protein. 2.Amino acids- the building blocks.
VII RNA and Protein Synthesis
Welcome to CSE 527: Computational Biology
1 Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Human Biology Sylvia S. Mader Michael Windelspecht Chapter.
Chapter Intro-page 280 What You’ll Learn You will relate the structure of DNA to its function. You will explain the role of DNA in protein production.
© 2010 Pearson Education, Inc. Lectures by Chris C. Romero, updated by Edward J. Zalisko PowerPoint ® Lectures for Campbell Essential Biology, Fourth Edition.
KEY CONCEPT DNA structure is the same in all organisms.
Learning Targets “I Can...” -State how many nucleotides make up a codon. -Use a codon chart to find the corresponding amino acid.
RNA Structure Like DNA, RNA is a nucleic acid. RNA is a nucleic acid made up of repeating nucleotides.
Aim: How does DNA direct the production of proteins in the cell?
CELL REPRODUCTION: MITOSIS INTERPHASE: DNA replicates PROPHASE: Chromatin condenses into chromosomes, centrioles start migrating METAPHASE: chromosomes.
Chapter 11 DNA and Genes.
The Purpose of DNA To make PROTEINS! Proteins give us our traits (ex: one protein gives a person blue eyes, another gives brown Central Dogma of Molecular.
End Show Slide 1 of 39 Copyright Pearson Prentice Hall 12-3 RNA and Protein Synthesis 12–3 RNA and Protein Synthesis.
Online – animated web site 5Storyboard.htm.
Transcription and Translation
1 Human chromosomes: 50->250 million base pairs. Average gene: 3000 base pairs.
CS273a A Zero-Knowledge Based Introduction to Biology Courtesy of George Asimenos.
DANDY Deoxyribonucleic Acid ALL CELLS HAVE DNA… Cells are the basic unit of structure and function of all living things. –Prokaryotes (bacteria) –Eukaryotes.
Parts is parts…. AMINO ACID building block of proteins contain an amino or NH 2 group and a carboxyl (acid) or COOH group PEPTIDE BOND covalent bond link.
UCAG U UUU Phenylalanine (Phe) UCU Serine (Ser)UAU Tyrosine (Tyr)UGU Cysteine (Cys)U UUC PheUCC SerUAC TyrUGC CysC UUA Leucine (Leu)UCA SerUAA STOPUGA.
CH 12.3 RNA & Protein Synthesis. Genes are coded DNA instructions that control the production of proteins within the cell…
Genomics Lecture 3 By Ms. Shumaila Azam. Proteins Proteins: large molecules composed of one or more chains of amino acids, polypeptides. Proteins are.
8.2 KEY CONCEPT DNA structure is the same in all organisms.
G U A C G U A C C A U G G U A C A C U G UUU UUC UUA UCU UUG UCC UCA
Gene Translation:RNA -> Protein How does a particular sequence of nucleotides specify a particular sequence of amino acids?nucleotidesamino acids The answer:
From DNA to Protein.
Translation PROTEIN SYNTHESIS.
Whole process Step by step- from chromosomes to proteins.
Please turn in your homework
A Zero-Knowledge Based Introduction to Biology
BIOLOGY 12 Protein Synthesis.
RNA Ribonucleic Acid.
A Zero-Knowledge Based Introduction to Biology
Chapter 12 DNA and RNA.
Section Objectives Relate the concept of the gene to the sequence of nucleotides in DNA. Sequence the steps involved in protein synthesis.
Protein Synthesis Translation.
Warm Up.
Transcription You’re made of meat, which is made of protein.
BELL RINGER What are the base pairing rules for DNA replication?
Chapter 12 DNA and RNA.
How is the genetic code contained in DNA used to make proteins?
Transcription and Translation
Today’s notes from the student table Something to write with
Transcription and Translation
A Zero-Knowledge Based Introduction to Biology
Translation.
Bellringer Please answer on your bellringer sheet:
REVIEW DNA DNA Replication Transcription Translation.
Aim: How does DNA direct the production of proteins in the cell?
Presentation transcript:

CS374 A Zero-Knowledge Based Introduction to Biology

Biology From the greek word βίος = life Timeline: 1683 – discovery of bacteria 1858 – Darwin’s natural selection 1865 – Mendel’s laws 1953 – double helix suggested by Watson-Crick 1955 – discovery of DNA and RNA polymerase 1978 – sequencing of first genome (5kb virus) 1983 – invention of PCR 1990 – discovery of RNAi 2000 – human genome (draft)

How to learn some? Online sources Wikipedia John Kimball’s Biology Pages Cold Spring Harbor Meetings CSHL Biology of Genomes CSHL Genome Informatics Hang out with biologists

Properties of Biology Still a lot to understand Experimental data… are non-discrete have errors Hard to model ∀ rule ∃ exception

cell, nucleus, cytoplasm, mitochondrion Title The Cell Keywords presented in this slide

The Cell © Coriell Institute for Medical Research cell, nucleus, cytoplasm, mitochondrion

How many? Cells in the human body: >10 14 (100 trillion)

Chromosomes histone, nucleosome, chromatin, chromosome, centromere, telomere H1DNA H2A, H2B, H3, H4 ~146bp telomere centromere nucleosome chromatin

How many? Chromosomes in a human cell: 46 (2x22 + X/Y)

Nucleotide O CC CC H H HH H H H COPO O-O- O to next nucleotide to previous nucleotide to base deoxyribose, nucleotide, base, A, C, G, T, purine, pyrimidine, 3’, 5’ 3’ 5’ Adenine (A) Cytosine (C) Guanine (G) Thymine (T) Let’s write “AGACC”! pyrimidines purines

“AGACC” (backbone)

“AGACC” (DNA) deoxyribonucleic acid (DNA) 5’ 3’

DNA is double stranded 3’ 5’ 3’ DNA is always written 5’ to 3’ AGACC or GGTCT strand, reverse complement

RNA O CC CC H OH HH H H H COPO O-O- O to next ribonucleotide to previous ribonucleotide to base ribose, ribonucleotide, U 3’ 5’ Adenine (A) Cytosine (C) Guanine (G) Uracil (U) pyrimidines purines

How many? Nucleotides in the human genome: ~ 3 billion

Genes & Proteins 3’ 5’ 3’ TAGGATCGACTATATGGGATTACAAAGCATTTAGGGA...TCACCCTCTCTAGACTAGCATCTATATAAAACAGAA ATCCTAGCTGATATACCCTAATGTTTCGTAAATCCCT...AGTGGGAGAGATCTGATCGTAGATATATTTTGTCTT AUGGGAUUACAAAGCAUUUAGGGA...UCACCCUCUCUAGACUAGCAUCUAUAUAA (transcription) (translation) Single-stranded RNA protein Double-stranded DNA gene, transcription, translation, protein

How many? Genes in the human genome: ~ 20,000 – 25,000

Gene Transcription promoter 3’ 5’ 3’ G A T T A C A... C T A A T G T...

Gene Transcription transcription factor, binding site, RNA polymerase 3’ 5’ 3’ Transcription factors recognize transcription factor binding sites and bind to them, forming a complex. RNA polymerase binds the complex. G A T T A C A... C T A A T G T...

Gene Transcription 3’ 5’ 3’ The two strands are separated G A T T A C A... C T A A T G T...

Gene Transcription 3’ 5’ 3’ An RNA copy of the 5’ → 3’ sequence is created from the 3’ → 5’ template G A T T A C A... C T A A T G T... G A U U A C A

Gene Transcription 3’ 5’ 3’ G A U U A C A... G A T T A C A... C T A A T G T... pre-mRNA5’3’

RNA Processing 5’ cap, polyadenylation, exon, intron, splicing, UTR, mRNA 5’ cap poly(A) tail intron exon mRNA 5’ UTR3’ UTR

Gene Structure 5’3’ promoter 5’ UTR exons3’ UTR introns coding non-coding

How many? Exons per gene: ~ 8 on average (max: 148) Nucleotides per exon: 170 on average (max: 12k) Nucleotides per intron: 5,500 on average (max: 500k) Nucleotides per gene: 45k on average (max: 2,2M)

Amino acid amino acid C O N H C H HOH R There are 20 standard amino acids Alanine Arginine Asparagine Aspartate Cysteine Glutamate Glutamine Glycine Histidine Isoleucine Leucine Lysine Methionine Phenylalanine Proline Serine Threonine Tryptophan Tyrosine Valine

Proteins C O N H C H R to previous aato next aa N-terminus HOH C-terminus N-terminus, C-terminus

Translation The ribosome synthesizes a protein by reading the mRNA in triplets (codons). Each codon is translated to an amino acid. ribosome, codon mRNA P siteA site

The Genetic Code UCAG U UUU Phenylalanine (Phe)UCU Serine (Ser)UAU Tyrosine (Tyr)UGU Cysteine (Cys)U UUC PheUCC SerUAC TyrUGC CysC UUA Leucine (Leu)UCA SerUAA STOPUGA STOPA UUG LeuUCG SerUAG STOPUGG Tryptophan (Trp)G C CUU Leucine (Leu)CCU Proline (Pro)CAU Histidine (His)CGU Arginine (Arg)U CUC LeuCCC ProCAC HisCGC ArgC CUA LeuCCA ProCAA Glutamine (Gln)CGA ArgA CUG LeuCCG ProCAG GlnCGG ArgG A AUU Isoleucine (Ile)ACU Threonine (Thr)AAU Asparagine (Asn)AGU Serine (Ser)U AUC IleACC ThrAAC AsnAGC SerC AUA IleACA ThrAAA Lysine (Lys)AGA Arginine (Arg)A AUG Methionine (Met) or STARTACG ThrAAG LysAGG ArgG G GUU Valine (Val)GCU Alanine (Ala)GAU Aspartic acid (Asp)GGU Glycine (Gly)U GUC ValGCC AlaGAC AspGGC GlyC GUA ValGCA AlaGAA Glutamic acid (Glu)GGA GlyA GUG ValGCG AlaGAG GluGGG GlyG

Translation (tRNA) tRNA, anticodon C C A (Tryptophan codon: UGG) Tryptophan anticodon

Translation (tRNA) aminoacylation Tryptophan Aminoacylation C C A C C A Charged tRNAUnloaded tRNA

Translation 5’... A U U A U G G C C U G G A C U U G A... 3’ UTR Met Start Codon AlaTrpThr

Translation 5’... A U U A U G G C C U G G A C U U G A... 3’

Translation MetAla 5’... A U U A U G G C C U G G A C U U G A... 3’ Trp

Errors? What if the transcription / translation machinery makes mistakes? What is the effect of mutations in coding regions? mutation

Reading Frames reading frame G C U U G U U U A C G A A U U A G

Synonymous Mutation G C U U G U U U A C G A A U U A G Ala Cys Leu Arg Ile G C U U G U U U A C G A A U U A G synonymous (silent) mutation, fourfold site G G C U U G U U U G C G A A U U A G Ala Cys Leu Arg Ile

Missense Mutation G C U U G U U U A C G A A U U A G Ala Cys Leu Arg Ile G C U U G U U U A C G A A U U A G missense mutation G G C U U G G U U A C G A A U U A G Ala Trp Leu Arg Ile

Nonsense Mutation G C U U G U U U A C G A A U U A G Ala Cys Leu Arg Ile G C U U G U U U A C G A A U U A G nonsense mutation A G C U U G A U U A C G A A U U A G Ala STOP

Frameshift G C U U G U U U A C G A A U U A G Ala Cys Leu Arg Ile G C U U G U U U A C G A A U U A G frameshift G C U U G U U A C G A A U U A G Ala Cys Tyr Glu Leu

Quality Control Nonsense-Mediated mRNA Decay (NMD) (Destroy mRNA with premature STOP codon) nonsense-mediated decay protein complex bound to exon-exon boundary intron All complexes removed as ribosome moves Complexes remain after the end of translation; signal the destruction

Gene Expression Regulation When should each gene be expressed? Regulate gene expression Examples: Make more of gene A when substance X is present Stop making gene B once you have enough Make genes C 1, C 2, C 3 simultaneously regulation

Regulatory Mechanisms enhancer, silencer Transcription Factor Specificity: Enhancer: Silencer:

Assemblies read, contig, scaffold, sequencing gaps, assembly Hundreds of N’s Thousands of N’s reads contigs scaffolds chromosomes

Retrovirus virus, reverse transcriptase, integrase docking protein envelope protein RNA Integrase Reverse Transcriptase

Infection

Infection

Replication cycle Reverse Transcription DNA RNA

Replication cycle

Are they alive? Polio virus made from scratch ($300,000 DARPA project – 2002) The first part of the sequence was painstakingly pieced together by hand and took over a year. The researchers then hired a commercial laboratory, Integrated DNA Technologies, to synthesise the remaining two thirds of the sequence mechanically. This took an additional two months. “ ”

Are they alive? Polio virus made from scratch ($300,000 DARPA project – 2002) Once the entire sequence was replicated, it was reconverted into RNA by enzymatic means. Viral propagation and replication were accomplished by throwing the virus into a predesigned protein soup that contained all the polymerases and other enzymatic ingredients necessary for RNA transcription and translation. The synthetic virus was able to successfully replicate itself from this mixture. “ ”

Are they alive? Polio virus made from scratch ($300,000 DARPA project – 2002) The viral copies were then injected into the brains of mice, which subsequently developed paralysis indistinguishable from polio. “ ”

The end?

Keywords cell, nucleus, cytoplasm, mitochondrion, histone, nucleosome, chromatin, chromosome, centromere, telomere, deoxyribose, nucleotide, base, A, C, G, T, purine, pyrimidine, 3’, 5’, deoxyribonucleic acid (DNA), strand, reverse complement, ribose, ribonucleotide, U, gene, transcription, translation, protein, promoter, transcription factor, binding site, RNA polymerase, 5’ cap, polyadenylation, exon, intron, splicing, UTR, mRNA, amino acid, N terminus, C terminus, ribosome, codon, tRNA, anticodon, aminoacylation, mutation, reading frame, synonymous (silent) mutation, fourfold site, missense mutation, nonsense mutation, frameshift, nonsense-mediated decay, regulation, enhancer, silencer, read, contig, scaffold, sequencing gaps, assembly, virus, reverse transcriptase, integrase