Download presentation
Presentation is loading. Please wait.
1
A Zero-Knowledge Based Introduction to Biology
Karthik Jagadeesh, Bo Yoo 28 September 2016
2
Q: What is your genome?
3
Q: What is your genome? A: The sum of your hereditary information.
4
Human Genome 3 billion base pairs: A,T,G,C
Full DNA sequence in virtually all cells DNA is the blueprint for life: Cookbook with many “recipes” for proteins - genes Proteins do most of the work in biology Yet, only ~2% of the genome is protein-coding genes!
5
What does the rest of the genome do?
3 billion base pairs – 2% coding, % regulatory Organism’s complexity NOT correlated with number of genes! Human (20-25k genes) vs. Rice (51k genes) 1 million Regulatory elements (switches) enable: Precise control for turning genes on/off Diverse cell types (lung, heart, skin) Analogy: Making specific recipes (genes) from a large cookbook (genome) at a given time
6
Quick Recap
7
DNA: “Blueprints” for a cell
Genetic information encoded in long strings Deoxyribonucleic acid (DNA) comes in four bases: adenine (A), thymine (T), guanine (G) , and cytosine (C)
8
From DNA to Organism You are composed of ~ 10 trillion cells
9
From DNA to Organism Cell
10
From DNA to Organism Cell Protein
Proteins do most of the work in biology
11
Q: How does one genome encode a variety of cell types in a complex organism?
12
Regulatory Elements ~ 20-25k genes CS analogy:
Expression Modulated by Regulatory elements Enhancer, Promoters, Silencers CS analogy: Genes are like variable assignments (a = 7) Regulatory elements are control flow, complex logic
13
Controlling Gene Expression
Transcription factors (TFs): Proteins that recognize sequence motifs in enhancers, promoters Combinatorial switches that turn genes on/off
14
How does the genome influence human disease?
15
Disease Implications SHH MUTATIONS Brain Limb Other Bejerano Lab
16
Limb Enhancer 1Mb away from Gene
SHH Bejerano Lab
17
Enhancer Deletion limb SHH DELETE Limb Bejerano Lab
18
Enhancer 1bp Substitution
limb SHH MUTATIONS Limb Lettice et al. HMG : Bejerano Lab
19
Genome Wide Association Study (GWAS):
80% of GWAS SNPs are noncoding (hard to interpret) Active area of research GWAS is a study that takes the entire genetic variants from multiple individuals and try to associate variants to traits. Usually it particularly focuses on single nucleotide polymorphisms or SNPs which is if the majority of the population has A in a particular location in the genome and a person has G instead. if an individual has a distinctive trait and a genetic variant that is not found in any other population,then the researcher may try to associate that trait with the SNPs. However since 80% of GWAS SNPs are noncoding which is difficult to analyze since the genomic region where that SNP is contain in does not have a well-known function compared to a protein coding gene Bejerano Lab
20
How exactly do genes code for proteins?
21
Central Dogma of Biology
22
DNA: “Blueprints” for a cell
Genetic information encoded in long strings Deoxyribonucleic acid comes in four bases: adenine, thymine, guanine, and cytosine
23
Nucleobase Complementary Pairing
purines Adenine (A) Guanine (G) Thymine (T) Cytosine (C) pyrimidines
24
DNA Double Helix
25
DNA Packaging
26
Q: What is your genome? A:The sum of your hereditary information.
27
Q: What is your genome? A:The sum of your hereditary information. Humans bundle two copies of the genome into 46 chromosomes in every cell
28
Central Dogma of Biology
29
DNA vs RNA 5' end has phosphate group and 3' has hydroxyl group (OH), number comes from the number of carbon in the sugar attached to the base
30
RNA Nucleobases purines pyrimidines Adenine (A) Guanine (G) Uracil (U)
Cytosine (C) pyrimidines
31
Gene Transcription G A T T A C A . . . 5’ 3’ 3’ 5’ C T A A T G T . . .
32
Gene Transcription G A T T A C A . . . 5’ 3’ 3’ 5’ C T A A T G T . . . RNA polymerase binds to the DNA at the promoter site near the transcription starting site, RNA polymerase synthesizes the RNA from the DNA template
33
Gene Transcription Strands are separated (DNA helicase)
G A T T A C A . . . 5’ 3’ 3’ 5’ C T A A T G T . . . DNA helicase are class of enzymes that open the double helix structure of DNA to allow RNA Polymerase to synthesize RNA Strands are separated (DNA helicase)
34
Gene Transcription G A T T A C A . . . 5’ 3’ 3’ G A U U A C A 5’ C T A A T G T . . . Read and add complements of the template An RNA copy of the 5’→3’ sequence is created from the 3’→5’ template
35
Gene Transcription 5’ 3’ 3’ 5’ pre-mRNA 5’ 3’ G A T T A C A . . .
C T A A T G T . . . G A U U A C A . . . pre-mRNA 5’ 3’
36
RNA Processing 5’ cap poly(A) tail exon intron regions near the ends are called UTR or untranslated region and are never translated to protein mRNA 5’ UTR 3’ UTR
37
Gene Structure introns 5’ 3’ promoter exons 3’ UTR 5’ UTR coding
non-coding
38
Central Dogma of Biology
39
From RNA to Protein Proteins are long strings of amino acids joined by peptide bonds Translation from RNA sequence to amino acid sequence performed by ribosomes 20 amino acids → 3 RNA letters required to specify a single amino acid
40
Amino Acid There are 20 standard amino acids H O H N C C OH H R
Alanine Arginine Asparagine Aspartate Cysteine Glutamate Glutamine Glycine Histidine Isoleucine Leucine Lysine Methionine Phenylalanine Proline Serine Threonine Tryptophan Tyrosine Valine H O H N C C OH H R There are 20 standard amino acids
41
Translation G GGG Gly GAG Glu GCG Ala GUG Val A GGA Gly
GAA Glutamic acid (Glu) GCA Ala GUA Val C GGC Gly GAC Asp GCC Ala GUC Val U GGU Glycine (Gly) GAU Aspartic acid (Asp) GCU Alanine (Ala) GUU Valine (Val) AGG Arg AAG Lys ACG Thr AUG Methionine (Met) or START AGA Arginine (Arg) AAA Lysine (Lys) ACA Thr AUA Ile AGC Ser AAC Asn ACC Thr AUC Ile AGU Serine (Ser) AAU Asparagine (Asn) ACU Threonine (Thr) AUU Isoleucine (Ile) CGG Arg CAG Gln CCG Pro CUG Leu CGA Arg CAA Glutamine (Gln) CCA Pro CUA Leu CGC Arg CAC His CCC Pro CUC Leu CGU Arginine (Arg) CAU Histidine (His) CCU Proline (Pro) CUU Leucine (Leu) UGG Tryptophan (Trp) UAG STOP UCG Ser UUG Leu UGA STOP UAA STOP UCA Ser UUA Leucine (Leu) UGC Cys UAC Tyr UCC Ser UUC Phe UGU Cysteine (Cys) UAU Tyrosine (Tyr) UCU Serine (Ser) UUU Phenylalanine (Phe)
42
5’ . . . A U U A U G G C C U G G A C U U G A . . . 3’
Translation 5’ A U U A U G G C C U G G A C U U G A ’ UTR Met Start Codon Ala Trp Thr Stop Codon
43
Translation The ribosome (a complex of protein and RNA) synthesizes a protein by reading the mRNA in triplets (codons). Each codon is translated to an amino acid.
44
Single Nucleotide Changes
45
Single Nucleotide Changes
46
Central Dogma of Biology
49
Different Cell Types Subsets of the DNA sequence determine the identity and function of different cells
50
Gene Expression Regulation
When should each gene be expressed? Why? Every cell has same DNA but each cell expresses different proteins. Signal transduction: One signal converted to another: cascade has “master regulators” turning on many proteins, which in turn each turn on many proteins
51
Central Dogma of Biology
52
Transcription Regulation
Transcription factors link to binding sites Complex of transcription factors forms Complex assists or inhibits formation of the RNA polymerase machinery
53
Transcription Factor Binding Sites
Short, degenerate DNA sequences recognized by particular transcription factors For complex organisms, cooperative binding of multiple transcription factors required to initiate transcription Binding Sequence Logo
54
Transcription Regulation
TF A Binding Site Gene B Transcription Factor A
55
Q: What if the transcription/translation machinery makes mistakes?
Q:What is the effect in coding regions?
56
Evolution = Mutation + Selection
57
Mutation: Structural Abnormalities
58
Single Nucleotide Changes
59
Single Nucleotide Changes
60
Evolution = Mutation + Selection
61
Selection time Harmful mutation Beneficial mutation
62
Evolution = Mutation + Selection
63
Evolution = Mutation + Selection
Summary Evolution = Mutation + Selection
64
Summary All hereditary information encoded in double- stranded DNA
Each cell in an organism has same DNA DNA → RNA → protein Proteins have many diverse roles in cell Gene regulation diversifies protein products within different cells
65
Further Reading See website: cs273a.stanford.edu
66
Extra Slides
67
Gene Regulatory Region
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.