Copyright © 2004 by Limsoon Wong A Biology Review.

Slides:



Advertisements
Similar presentations
Proteins: Structure reflects function….. Fig. 5-UN1 Amino group Carboxyl group carbon.
Advertisements

Review.
• Exam II Tuesday 5/10 – Bring a scantron with you!
5’ C 3’ OH (free) 1’ C 5’ PO4 (free) DNA is a linear polymer of nucleotide subunits joined together by phosphodiester bonds - covalent bonds between.
RNA Say Hello to DNA’s little friend!. EngageEssential QuestionExplain Describe yourself to long lost uncle. How do the mechanisms of genetics and the.
A Zero-Knowledge Based Introduction to Biology Cory McLean 26 Sep 2008 Thanks to George Asimenos.
© 2010 Pearson Education, Inc. Lectures by Chris C. Romero, updated by Edward J. Zalisko PowerPoint ® Lectures for Campbell Essential Biology, Fourth Edition.
Introduction to Bioinformatics Molecular Biology Primer.
Exciting Developments in Molecular Biology As seen by an amateur.
LECTURE 5: DNA, RNA & PROTEINS
Introduction to Molecular Biology. G-C and A-T pairing.
10-2: RNA and 10-3: Protein Synthesis
Today… Genome 351, 8 April 2013, Lecture 3 The information in DNA is converted to protein through an RNA intermediate (transcription) The information in.
Unit 7 RNA, Protein Synthesis & Gene Expression Chapter 10-2, 10-3
How does DNA work? What is a gene?
DNA DNA. DNA is often called the blueprint of life. In simple terms, DNA contains the instructions for making proteins within the cell.
Nature and Action of the Gene
Chapter 12 DNA and RNA. What is DNA again? Deoxyribonucleic acid Long double-stranded molecule of nucleotides Stores genetic code that is transferred.
Protein Synthesis. DNA RNA Proteins (Transcription) (Translation) DNA (genetic information stored in genes) RNA (working copies of genes) Proteins (functional.
Introduction to Bioinformatics Molecular Biology Primer 1.
CHAPTER 12 PROTEIN SYNTHESIS AND MUTATIONS -RNA -PROTEIN SYNTHESIS -MUTATIONS.
More on translation. How DNA codes proteins The primary structure of each protein (the sequence of amino acids in the polypeptide chains that make up.
How Proteins Are Made Mrs. Wolfe. DNA: instructions for making proteins Proteins are built by the cell according to your DNA What kinds of proteins are.
VII RNA and Protein Synthesis
1 Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Human Biology Sylvia S. Mader Michael Windelspecht Chapter.
© 2010 Pearson Education, Inc. Lectures by Chris C. Romero, updated by Edward J. Zalisko PowerPoint ® Lectures for Campbell Essential Biology, Fourth Edition.
LESSON 4: Using Bioinformatics to Analyze Protein Sequences PowerPoint slides to accompany Using Bioinformatics : Genetic Research.
AMINO ACIDS.
Genetics in ~1920: 1. Cells have chromosomes Sketch of Drosophila chromosomes (Bridges, C. 1913)
PART 1 - DNA REPLICATION PART 2 - TRANSCRIPTION AND TRANSLATION.
Learning Targets “I Can...” -State how many nucleotides make up a codon. -Use a codon chart to find the corresponding amino acid.
Fig Second mRNA base First mRNA base (5 end of codon) Third mRNA base (3 end of codon)
Today… Genome 351, 8 April 2013, Lecture 3 The information in DNA is converted to protein through an RNA intermediate (transcription) The information in.
Protein Synthesis Biology 11 preAP Overview and Application.
Outline 1.What is an amino acid / protein naturally occurring amino acids 3.Codon – triplet coding for an amino acid 1.How are proteins synthesized.
DNA & GENETICS. There are four kinds of bases in DNA: adenine guanine cytosine thymine.
Outline What is an amino acid / protein
CELL REPRODUCTION: MITOSIS INTERPHASE: DNA replicates PROPHASE: Chromatin condenses into chromosomes, centrioles start migrating METAPHASE: chromosomes.
End Show Slide 1 of 39 Copyright Pearson Prentice Hall 12-3 RNA and Protein Synthesis 12–3 RNA and Protein Synthesis.
RNA 2 Translation.
Transcription and Translation
DNA, RNA, and Protein Synthesis
Amino Acids ©CMBI 2001 “ When you understand the amino acids, you understand everything ”
Proteins.
1 Human chromosomes: 50->250 million base pairs. Average gene: 3000 base pairs.
CS273a A Zero-Knowledge Based Introduction to Biology Courtesy of George Asimenos.
Chapter 12 DNA and RNA.
DANDY Deoxyribonucleic Acid ALL CELLS HAVE DNA… Cells are the basic unit of structure and function of all living things. –Prokaryotes (bacteria) –Eukaryotes.
Parts is parts…. AMINO ACID building block of proteins contain an amino or NH 2 group and a carboxyl (acid) or COOH group PEPTIDE BOND covalent bond link.
 Molecules of DNA are composed of long chains of _______.
Prepared By: Syed Khaleelulla Hussaini. Outline Proteins DNA RNA Genetics and evolution The Sequence Matching Problem RNA Sequence Matching Complexity.
DNA, RNA and Protein.
Protein Synthesis The Making of Proteins Using the Genetic Information Stored in DNA.
Genomics Lecture 3 By Ms. Shumaila Azam. Proteins Proteins: large molecules composed of one or more chains of amino acids, polypeptides. Proteins are.
Protein Synthesis. Review Questions What is the function of DNA? Stores genetic information and holds the instructions for building proteins Why is DNA.
Unit 5: DNA and Protein Synthesis
Translation PROTEIN SYNTHESIS.
Whole process Step by step- from chromosomes to proteins.
Please turn in your homework
BIOLOGY 12 Protein Synthesis.
Section Objectives Relate the concept of the gene to the sequence of nucleotides in DNA. Sequence the steps involved in protein synthesis.
DNA By: Mr. Kauffman.
Outline What is an amino acid / protein
More on translation.
Fundamentals of Protein Structure
Today’s notes from the student table Something to write with
Translation.
Transcription & Translation
Presentation transcript:

Copyright © 2004 by Limsoon Wong A Biology Review

Body Our body consists of a number of organs Each organ is composed of a number of tissues Each tissue is composed of cells of the same type

Cell Performs two types of function –Chemical reactions necessary to maintain our life –Pass info for maintaining life to next generation In particular –Protein performs chemical reactions –DNA stores & passes info –RNA is intermediate between DNA & proteins

Protein A protein sequence composed from an alphabet of 20 amino acids –Length is usually 20 to 5000 amino acids –Average around 350 amino acids Folds into 3D shape, forming the building blocks & performing most of the chemical reactions within a cell

Classification of Amino Acids Amino acids can be classified into 4 types. Positively charged (basic) –Arginine (Arg, R) –Histidine (His, H) –Lysine (Lys, K) Negatively charged (acidic) –Aspartic acid (Asp, D) –Glutamic acid (Glu, E)

Classification of Amino Acids Polar (overall uncharged, but uneven charge distribution. can form hydrogen bonds with water. they are called hydrophilic) –Asparagine (Asn, N) –Cysteine (Cys, C) –Glutamine (Gln, Q) –Glycine (Gly, G) –Serine (Ser, S) –Threonine (Thr, T) –Tyrosine (Tyr, Y) Nonpolar (overall uncharged and uniform charge distribution. cant form hydrogen bonds with water. they are called hydrophobic) –Alanine (Ala, A) –Isoleucine (Ile, I) –Leucine (Leu, L) –Methionine (Met, M) –Phenylalanine (Phe, F) –Proline (Pro, P) –Tryptophan (Trp, W) –Valine (Val, V)

Genetic Code Each amino acid is composed of three nucleotides Start codon: ATG (code for M) Stop codon: TAA, TAG, TGA Copyright © 2004 by Limsoon Wong

DNA DNA stores instruction needed by the cell to perform daily life function Consists of two strands interwoven together and form a double helix Each strand is a chain of some small molecules called nucleotides Francis Crick shows James Watson the model of DNA in their room number 103 of the Austin Wing at the Cavendish Laboratories, Cambridge Copyright © 2004 Limsoon Wong

ACGT U Classification of Nucleotides 5 different nucleotides: adenine(A), cytosine(C), guanine(G), thymine(T), & uracil(U) A, G are purines. They have a 2-ring structure C, T, U are pyrimidines. They have a 1-ring structure DNA only uses A, C, G, & T Copyright © 2004 by Limsoon Wong

A T  10Å G C Watson-Crick rules Complementary bases: –A with T (two hydrogen-bonds) –C with G (three hydrogen-bonds) Copyright © 004 by Limsoon Wong

Double Stranded DNA DNA is double stranded in a cell. The two strands are anti-parallel. One strand is reverse complement of the other The double strands are interwoven to form a double helix Copyright © 2004 by Limsoon Wong

Locations of DNAs in a Cell? Two types of organisms –Prokaryotes (single-celled organisms with no nuclei. e.g., bacteria) –Eukaryotes (organisms with single or multiple cells. their cells have nuclei. e.g., plant & animal) In Prokaryotes, DNA swims within the cell In Eukaryotes, DNA locates within the nucleus

Chromosome DNA is usually tightly wound around histone proteins and forms a chromosome The total info stored in all chromosomes constitutes a genome In most multi-cell organisms, every cell contains the same complete set of chromosomes –May have some small differences due to mutation Human genome has 3G base pairs, organized in 23 pairs of chromosomes

Gene A gene is a sequence of DNA that encodes a protein or an RNA molecule About 30,000 – 35,000 (protein-coding) genes in human genome For gene that encodes protein –In Prokaryotic genome, one gene corresponds to one protein –In Eukaryotic genome, one gene can corresponds to more than one protein because of the process “alternative splicing”

Complexity of Organism vs. Genome Size Human Genome: 3G base pairs Amoeba dubia (a single cell organism): 600G base pairs  Genome size has no relationship with the complexity of the organism

Number of Genes vs. Genome Size Prokaryotic genome (e.g., E. coli) –Number of base pairs: 5M –Number of genes: 4k –Average length of a gene: 1000 bp Eukaryotic genome (e.g., human) –Number of base pairs: 3G –Estimated number of genes: 30k – 35k –Estimated average length of a gene: bp ~ 90% of E. coli genome are of coding regions. < 3% of human genome is believed to be coding regions  Genome size has no relationship with the number of genes!

RNA vs DNA RNA is single stranded Nucleotides of RNA are similar to that of DNA, except that have an extra OH at position 2’ –Due to this extra OH, it can form more hydrogen bonds than DNA –So RNA can form complex 3D structure RNA use the base U instead of T –U is chemically similar to T –In particular, U is also complementary to A

Central Dogma Gene expression consists of two steps –Transcription DNA  mRNA –Translation mRNA  Protein Copyright © 2004 by Limsoon Wong

Transcription Synthesize mRNA from one strand of DNA –An enzyme RNA polymerase temporarily separates double- stranded DNA –It begins transcription at transcription start site (ATG) –A  A, C  C, G  G, & T  U –Once RNA polymerase reaches transcription stop site, transcription stops (TGA, TAG, and TAA) Additional “steps” for Eukaryotes –Transcription produces pre-mRNA that contains both introns & exons –5’ cap & poly-A tail are added to pre-mRNA –RNA splicing removes introns & mRNA is made –mRNA are transported out of nucleus

Translation Synthesize protein from mRNA Each amino acid is encoded by consecutive seq of 3 nucleotides, called a codon The decoding table from codon to amino acid is called genetic code 4 3 =64 diff codons  Codons are not 1-to-1 corr to 20 amino acids All organisms use the same decoding table Recall that amino acids can be classified into 4 groups. A single-base change in a codon is usually not sufficient to cause a codon to code for an amino acid in different group

Ribosome Translation is handled by a molecular complex, ribosome, which consists of both proteins & ribosomal RNA (rRNA) Ribosome reads mRNA & the translation starts at a start codon (the translation start site) With help of tRNA, each codon is translated to an amino acid Translation stops once ribosome reads a stop codon (the translation stop site)

Introns and exons Eukaryotic genes contain introns & exons –Introns are seq that are ultimately spliced out of mRNA –Introns normally satisfy GT-AG rule, viz. begin w/ GT & end w/ AG –Each gene can have many introns & each intron can have thousands bases Introns can be very long An extreme example is a gene associated with cystic fibrosis in human: –Length of 24 introns ~1Mb –Length of exons ~1kb

Unlike eukaryotic genes, a prokaryotic gene typically consists of only one contiguous coding region Typical Eukaryotic Gene Structure Copyright © 2004 by Limsoon Wong Image credit: Xu

Reading frame #1 ATG GCT TAC GCT TGC Reading frame #2 TGG CTT ACG CTT GA. Reading frame #3 GGC TTA CGC TTG A.. ATGGCTTACGCTTGA Forward strand: Reading frame #4 TCA AGC GTA AGC CAT Reading frame #5 CAA GCG TAA GCC AT. Reading frame #6 AAG CGT AAG CCA T.. TCAAGCGTAAGCCAT Reverse strand: Reading Frame Each DNA segment has six possible reading frames Copyright © 2004 by Limsoon Wong

stop ORF Open Reading Frame (ORF) ORF is a segment of DNA with two in-frame stop codons at the two ends and no in-frame stop codon in the middle Each ORF has a fixed reading frame

Coding Region Each coding region (exon or whole gene) has a fixed translation frame A coding region always sits inside an ORF of same reading frame All exons of a gene are on the same strand Neighboring exons of a gene could have different reading frames

exon1[i, j] in frame A and exon2[m, n] in frame B are consistent if B = (m - j A) mod 3 ATG GCT TGG GCT TTA A GT TTC CCG GAG AT T GGG exon 1 exon 3exon 2 Frame Consistency Neighbouring exons of a gene should be frame-consistent

What is Gene Finding? Find all coding regions from a stretch of DNA sequence, and construct gene structures from the identified exons Can be decomposed into –Find coding potential of a region in a frame –Find boundaries between coding & non-coding regions

Genbank or nr candidate gene BLAST search sequence alignments with known genes, alignment p-values Image credit: Xu Copyright © 2004 by Limsoon Wong Search-by-Homology Example: Gene Finding Using BLAST High seq similarity typically implies homologous genes  Search for genes in yeast seq using BLAST  Extract Feature for gene identification

BLAST hits sequence Searching all ORFs against known genes in nr db helps identify an initial set of (possibly incomplete) genes Image credit: Xu

A (yeast) gene starts w/ ATG and ends w/ a stop codon, in same reading frame of ORF Have “strong” coding potentials, measured by, preference models, Markov chain model,... Have “strong” translation start signal, measured by weight matrix model,... Have distributions wrt length, G+C composition,... Have special seq signals in flanking regions,... known genes 0 % known non- genes coding potential gene length distribution