A Quest for Genes What’s a gene? gene (jēn) n.

Slides:



Advertisements
Similar presentations
Molecular Genetics DNA RNA Protein Phenotype Genome Gene
Advertisements

Cell Division, Genetics, Molecular Biology
Nucleic Acids and Protein Synthesis
Basics of Molecular Biology
RNA and Protein Synthesis
© 2006 W.W. Norton & Company, Inc. DISCOVER BIOLOGY 3/e
2.7 DNA Replication, transcription and translation
DNA & genetic information DNA replication Protein synthesis Gene regulation & expression DNA structure DNA as a carrier Gene concept Definition Models.
Central Dogma of Biology
An Overview of Protein Synthesis. Genes A sequence of nucleotides in DNA that performs a specific function such as coding for a particular protein.
Making of Proteins: Transcription and Translation
Protein Synthesis. DNA acts like an "instruction manual“ – it provides all the information needed to function the actual work of translating the information.
Protein Synthesis Transcription and Translation DNA Transcription RNA Translation Protein.
Protein Synthesis 12-3.
RNA and Protein Synthesis
FROM DNA TO PROTEIN Transcription – Translation. I. Overview Although DNA and the genes on it are responsible for inheritance, the day to day operations.
Chapter 10: RNA & Protein Synthesis Mrs. Cook Biology
Gene Expression How is the information in DNA used to determine an organism’s characteristics?
1 Genes and How They Work Chapter Outline Cells Use RNA to Make Protein Gene Expression Genetic Code Transcription Translation Spliced Genes – Introns.
Protein Synthesis Process that makes proteins
Transcription & Translation Transcription DNA is used to make a single strand of RNA that is complementary to the DNA base pairs. The enzyme used is.
Replication Transcription Translation
12-3 RNA and Protein Synthesis
Protein Synthesis Transcription and Translation DNA Transcription RNA Translation Protein.
GENE EXPRESSION What is a gene? Mendel –Unit of inheritance conferring a phenotype Modern definition –Unit of DNA directing the synthesis of a polypeptide.
Genes and How They Work Chapter The Nature of Genes information flows in one direction: DNA (gene)RNAprotein TranscriptionTranslation.
Transcription and Translation How genes are expressed (a.k.a. How proteins are made) Biology.
Molecular Genetics Protein Synthesis Overview. The Central Dogma DNA contains the blueprint for protein synthesis, but proteins are synthesized outside.
Eukaryotic Gene Structure. 2 Terminology Genome – entire genetic material of an individual Transcriptome – set of transcribed sequences Proteome – set.
DNA, RNA, and Protein Replication Transcription Translation.
Ch Gene  Protein A gene is a sequence of nucleotides that code for a polypeptide (protein) Hundreds-thousands of genes are on a typical chromosome.
RNA and Gene Expression BIO 224 Intro to Molecular and Cell Biology.
PROTEIN SYNTHESIS TRANSCRIPTION AND TRANSLATION. TRANSLATING THE GENETIC CODE ■GENES: CODED DNA INSTRUCTIONS THAT CONTROL THE PRODUCTION OF PROTEINS WITHIN.
RNA, Transcription, and the Genetic Code. RNA = ribonucleic acid -Nucleic acid similar to DNA but with several differences DNARNA Number of strands21.
Chapter 10: Nucleic Acids And Protein Synthesis Essential Question: What roles do DNA and RNA play in storing genetic information?
Microbial Genetics Structure and Function of Genetic Material The Regulation of Bacterial Gene Expression Mutation: Change in Genetic Material Genetic.
CHAPTER 13 RNA & Protein Synthesis. GENE EXPRESSION  When a cell “reads” the DNA, it doesn’t directly say for example blue eyes.  What the DNA actually.
CH 12.3 RNA & Protein Synthesis. Genes are coded DNA instructions that control the production of proteins within the cell…
12-3 RNA and Protein Synthesis Page 300. A. Introduction 1. Chromosomes are a threadlike structure of nucleic acids and protein found in the nucleus of.
Gene Expression = Protein Synthesis.
Section 20.2 Gene Expression
FROM DNA TO PROTEIN Transcription – Translation
Eukaryotic Gene Structure
Cell Biology Unit Six.
Protein Synthesis Human Biology.
Lesson Four Structure of a Gene.
Lesson Four Structure of a Gene.
12-3 RNA & Protein Synthesis
Transcription Part of the message encoded within the sequence of bases in DNA must be transcribed into a sequence of bases in RNA before translation can.
Transfer of RNA molecules serve as interpreters during translation
Enzymes and their functions involved in DNA replication
Protein Synthesis How are they made??.
Transcription and Translation
RNA Ribonucleic Acid.
Introduction to Bioinformatics II
Transcription is the synthesis of RNA under the direction of DNA
Protein synthesis: Overview
PROTEIN SYNTHESIS THE DETAILS.
Genes and How They Work Chapter 15
RNA and Protein Synthesis
Transcription Steps to Transcribe DNA:
RNA & Protein synthesis
Review.
12-3 RNA and Protein Synthesis
Protein Synthesis.
Protein Synthesis.
GENE EXPRESSION / PROTEIN SYNTHESIS
Protein Synthesis The genetic code – the sequence of nucleotides in DNA – is ultimately translated into the sequence of amino acids in proteins – gene.
From DNA to Protein Genotype to Phenotype.
12-3: RNA and Protein Synthesis (part 1)
Presentation transcript:

A Quest for Genes What’s a gene? gene (jēn) n. A hereditary unit consisting of a sequence of DNA that occupies a specific location on a chromosome and determines a particular characteristic in an organism. Genes undergo mutation when their DNA sequence changes. [German Gen, from gen-, begetting, in Greek words (such as genos, race, offspring).] from: http://www.answers.com/topic/gene Genes are the units of heredity in living organisms. They are encoded in the organism's genetic material (usually DNA or RNA), and control the development and behavior of the organism. The word "gene" ... is shared by many disciplines, including classical genetics, molecular genetics, evolutionary biology and population genetics. Because each discipline models the biology of life differently, the usage of the word gene varies between disciplines. It may refer to either material or conceptual entities. Following the discovery that DNA is the genetic material, and with the growth of biotechnology and the project to sequence the human genome, the common usage of the word "gene" has increasingly reflected its meaning in molecular biology, namely the segments of DNA which cells transcribe into RNA and translate, at least in part, into proteins. The Sequence Ontology project defines a gene as: "A locatable region of genomic sequence, corresponding to a unit of inheritance, which is associated with regulatory regions, transcribed regions and/or other functional sequence regions". From: http://en.wikipedia.org/wiki/Gene

some/all of these features can be used to find genes And so the gene is A segment of DNA It is inherited It is transcribed It may be translated It has regulatory and other functional regions it has certain coordinates and sequence genes in different organisms may be homologous its sequence may be optimized for transcription its sequence may be optimized for translation the structure and sequence of these regions may have something in common some/all of these features can be used to find genes

RNA- and protein-coding genes All genes are transcribed. Transcripts are messenger RNAs (mRNAs), ribosomal RNAs (rRNAs), transfer RNAs (tRNAs), etc. Only mRNAs are translated into proteins How to find rRNA- and tRNA-coding genes? Homology Secondary structure

Protein-coding genes are transcribed and translated their sequence is optimized for transcription and translation

Most important contribution: transcription-coupled DNA repair Is performed by a special enzyme – RNA polymerase RNA polymerase binds strongly to a specific promoter sequence, where DNA is unwound RNA is polymerized in 5’>3’ direction (DNA is read from 3’>5’) RNA polymerization is stopped upon reaching a terminator Signals in DNA sequence that can be recognized: promoters and terminators. Promoters are A+T rich for easy strand separation, terminators are G+C rich. Most important contribution: transcription-coupled DNA repair

Transcription-coupled DNA repair and mutational drift Mutations: transitions (purine to purine or pyrimidine to pyrimidine) and transversions (purine to pyrimidine or pyrimidine to purine) Transitions are more common than transversions Transitions change G:C to A:T or C:G to T:A If unrepaired, spontaneous mutations will enrich DNA sequence with A+T, while G+C will disappear Sequences repaired more efficiently will have higher G+C content than the rest of the genome The rate-limiting step in DNA repair is recognition of damaged base pairs Damaged base pairs are efficiently recognized by RNA polymerase, which stalls and attracts DNA repair machinery Actively transcribed sequences are repaired more efficiently than the rest of the genome Protein-coding regions will have higher G+C content than non-coding regions

Translation Ribosome binds to a region upstream of the start codon (Shine-Dalgarno sequence) The start codon defines the reading frame for subsequent codons mRNA is read in 5’>3’ direction Translation is terminated upon reaching a stop codon Signals in DNA sequence that can be recognized: Shine-Dalgarno start codons (AUG, GUG, UUG) stop codons (UAG, UGA, UAA) open reading frames (ORFs) codon, di-codon and tri-codon usage amino acid composition

Computational protein-coding gene prediction algorithms GeneMark Glimmer GeneScan GeneLook GeneHacker EasyGene GS-Finder ZCurve Orpheus Fgenes Critica ... Find a set of ORFs Consider some ORFs as “typical protein-coding sequences” and some other regions as “typical non-coding sequences” for use as a training set Find some features that would distinguish “typical coding sequences” from “typical non-coding sequences” (GC content, codon usage, di-codon usage, patterns, motifs, etc.) Find other ORFs with similar features How typical is your “typical coding” sequence?

Guinness Book of protein-coding genes The longest human gene is 2,220,223 nucleotides long. It has 79 exons, with a total of only 11,058 nucleotides, which specify the sequence of the 3,685 amino acids and codes for a protein dystrophin. It is part of a protein complex located in the cell membrane, which transfers the force generated by the actin-myosin structure inside the muscle fiber to the entire fiber. The smallest human gene is 252 nucleotides long, it specifies a polypeptide of 67 amino acids and codes for an insulin-like growth factor II. The longest bacterial gene is 110,418 nucleotides long, which specify the sequence of 36,805 amino acids. Its function is unknown, most likely a surface protein. The smallest bacterial gene is 54 nucleotides long, it specifies a polypeptide of 17 amino acids and codes for a regulatory protein in cyanobacteria

Manual analysis is required to find the rest. Computational protein-coding gene prediction algorithms identify correctely 90-95% of genes. Manual analysis is required to find the rest.