Presentation is loading. Please wait.

Presentation is loading. Please wait.

A Zero-Knowledge Based Introduction to Biology

Similar presentations


Presentation on theme: "A Zero-Knowledge Based Introduction to Biology"— Presentation transcript:

1 A Zero-Knowledge Based Introduction to Biology
Karthik Jagadeesh, Bo Yoo 28 September 2016

2 Q: What is your genome?

3 Q: What is your genome? A: The sum of your hereditary information.

4 Human Genome 3 billion base pairs: A,T,G,C
Full DNA sequence in virtually all cells DNA is the blueprint for life: Cookbook with many “recipes” for proteins - genes Proteins do most of the work in biology Yet, only ~2% of the genome is protein-coding genes!

5 What does the rest of the genome do?
3 billion base pairs – 2% coding, % regulatory Organism’s complexity NOT correlated with number of genes! Human (20-25k genes) vs. Rice (51k genes) 1 million Regulatory elements (switches) enable: Precise control for turning genes on/off Diverse cell types (lung, heart, skin) Analogy: Making specific recipes (genes) from a large cookbook (genome) at a given time

6 Quick Recap

7 DNA: “Blueprints” for a cell
Genetic information encoded in long strings Deoxyribonucleic acid (DNA) comes in four bases: adenine (A), thymine (T), guanine (G) , and cytosine (C)

8 From DNA to Organism You are composed of ~ 10 trillion cells

9 From DNA to Organism Cell

10 From DNA to Organism Cell Protein
Proteins do most of the work in biology

11 Q: How does one genome encode a variety of cell types in a complex organism?

12 Regulatory Elements ~ 20-25k genes CS analogy:
Expression Modulated by Regulatory elements Enhancer, Promoters, Silencers CS analogy: Genes are like variable assignments (a = 7) Regulatory elements are control flow, complex logic

13 Controlling Gene Expression
Transcription factors (TFs): Proteins that recognize sequence motifs in enhancers, promoters Combinatorial switches that turn genes on/off

14 How does the genome influence human disease?

15 Disease Implications SHH MUTATIONS Brain Limb Other Bejerano Lab

16 Limb Enhancer 1Mb away from Gene
SHH Bejerano Lab

17 Enhancer Deletion limb SHH DELETE Limb Bejerano Lab

18 Enhancer 1bp Substitution
limb SHH MUTATIONS Limb Lettice et al. HMG : Bejerano Lab

19 Genome Wide Association Study (GWAS):
80% of GWAS SNPs are noncoding (hard to interpret) Active area of research GWAS is a study that takes the entire genetic variants from multiple individuals  and try to associate variants to traits. Usually it particularly focuses on single nucleotide polymorphisms or SNPs which is if the majority of the population has A in a particular location in the genome and a person has G instead. if an individual has a distinctive trait and a genetic variant that is not found in any other population,then the researcher may try to associate that trait with the SNPs. However since 80% of GWAS SNPs are noncoding which is difficult to analyze since the genomic region where that SNP is contain in does not have a well-known function compared to a protein coding gene  Bejerano Lab

20 How exactly do genes code for proteins?

21 Central Dogma of Biology

22 DNA: “Blueprints” for a cell
Genetic information encoded in long strings Deoxyribonucleic acid comes in four bases: adenine, thymine, guanine, and cytosine

23 Nucleobase Complementary Pairing
purines Adenine (A) Guanine (G) Thymine (T) Cytosine (C) pyrimidines

24 DNA Double Helix

25 DNA Packaging

26 Q: What is your genome? A:The sum of your hereditary information.

27 Q: What is your genome? A:The sum of your hereditary information. Humans bundle two copies of the genome into 46 chromosomes in every cell

28 Central Dogma of Biology

29 DNA vs RNA 5' end has phosphate group and 3' has hydroxyl group (OH), number comes from the number of carbon in the sugar attached to the base

30 RNA Nucleobases purines pyrimidines Adenine (A) Guanine (G) Uracil (U)
Cytosine (C) pyrimidines

31 Gene Transcription G A T T A C A . . . 5’ 3’ 3’ 5’ C T A A T G T . . .

32 Gene Transcription G A T T A C A . . . 5’ 3’ 3’ 5’ C T A A T G T . . . RNA polymerase binds to the DNA at the promoter site near the transcription starting site, RNA polymerase synthesizes the RNA from the DNA template

33 Gene Transcription Strands are separated (DNA helicase)
G A T T A C A . . . 5’ 3’ 3’ 5’ C T A A T G T . . . DNA helicase are class of enzymes that open the double helix structure of DNA to allow RNA Polymerase to synthesize RNA Strands are separated (DNA helicase)

34 Gene Transcription G A T T A C A . . . 5’ 3’ 3’ G A U U A C A 5’ C T A A T G T . . . Read and add complements of the template An RNA copy of the 5’→3’ sequence is created from the 3’→5’ template

35 Gene Transcription 5’ 3’ 3’ 5’ pre-mRNA 5’ 3’ G A T T A C A . . .
C T A A T G T . . . G A U U A C A . . . pre-mRNA 5’ 3’

36 RNA Processing 5’ cap poly(A) tail exon intron regions near the ends are called UTR or untranslated region and are never translated to protein mRNA 5’ UTR 3’ UTR

37 Gene Structure introns 5’ 3’ promoter exons 3’ UTR 5’ UTR coding
non-coding

38 Central Dogma of Biology

39 From RNA to Protein Proteins are long strings of amino acids joined by peptide bonds Translation from RNA sequence to amino acid sequence performed by ribosomes 20 amino acids → 3 RNA letters required to specify a single amino acid

40 Amino Acid There are 20 standard amino acids H O H N C C OH H R
Alanine Arginine Asparagine Aspartate Cysteine Glutamate Glutamine Glycine Histidine Isoleucine Leucine Lysine Methionine Phenylalanine Proline Serine Threonine Tryptophan Tyrosine Valine H O H N C C OH H R There are 20 standard amino acids

41 Translation G GGG Gly GAG Glu GCG Ala GUG Val A GGA Gly
GAA Glutamic acid (Glu) GCA Ala GUA Val C GGC Gly GAC Asp GCC Ala GUC Val U GGU Glycine (Gly) GAU Aspartic acid (Asp) GCU Alanine (Ala) GUU Valine (Val) AGG Arg AAG Lys ACG Thr AUG Methionine (Met) or START AGA Arginine (Arg) AAA Lysine (Lys) ACA Thr AUA Ile AGC Ser AAC Asn ACC Thr AUC Ile AGU Serine (Ser) AAU Asparagine (Asn) ACU Threonine (Thr) AUU Isoleucine (Ile) CGG Arg CAG Gln CCG Pro CUG Leu CGA Arg CAA Glutamine (Gln) CCA Pro CUA Leu CGC Arg CAC His CCC Pro CUC Leu CGU Arginine (Arg) CAU Histidine (His) CCU Proline (Pro) CUU Leucine (Leu) UGG Tryptophan (Trp) UAG STOP UCG Ser UUG Leu UGA STOP UAA STOP UCA Ser UUA Leucine (Leu) UGC Cys UAC Tyr UCC Ser UUC Phe UGU Cysteine (Cys) UAU Tyrosine (Tyr) UCU Serine (Ser) UUU Phenylalanine (Phe)

42 5’ . . . A U U A U G G C C U G G A C U U G A . . . 3’
Translation 5’ A U U A U G G C C U G G A C U U G A ’ UTR Met Start Codon Ala Trp Thr Stop Codon

43 Translation The ribosome (a complex of protein and RNA) synthesizes a protein by reading the mRNA in triplets (codons). Each codon is translated to an amino acid.

44 Single Nucleotide Changes

45 Single Nucleotide Changes

46 Central Dogma of Biology

47

48

49 Different Cell Types Subsets of the DNA sequence determine the identity and function of different cells

50 Gene Expression Regulation
When should each gene be expressed? Why? Every cell has same DNA but each cell expresses different proteins. Signal transduction: One signal converted to another: cascade has “master regulators” turning on many proteins, which in turn each turn on many proteins

51 Central Dogma of Biology

52 Transcription Regulation
Transcription factors link to binding sites Complex of transcription factors forms Complex assists or inhibits formation of the RNA polymerase machinery

53 Transcription Factor Binding Sites
Short, degenerate DNA sequences recognized by particular transcription factors For complex organisms, cooperative binding of multiple transcription factors required to initiate transcription Binding Sequence Logo

54 Transcription Regulation
TF A Binding Site Gene B Transcription Factor A

55 Q: What if the transcription/translation machinery makes mistakes?
Q:What is the effect in coding regions?

56 Evolution = Mutation + Selection

57 Mutation: Structural Abnormalities

58 Single Nucleotide Changes

59 Single Nucleotide Changes

60 Evolution = Mutation + Selection

61 Selection time Harmful mutation Beneficial mutation

62 Evolution = Mutation + Selection

63 Evolution = Mutation + Selection
Summary Evolution = Mutation + Selection

64 Summary All hereditary information encoded in double- stranded DNA
Each cell in an organism has same DNA DNA → RNA → protein Proteins have many diverse roles in cell Gene regulation diversifies protein products within different cells

65 Further Reading See website: cs273a.stanford.edu

66 Extra Slides

67 Gene Regulatory Region


Download ppt "A Zero-Knowledge Based Introduction to Biology"

Similar presentations


Ads by Google