Fine Structure and Analysis of Eukaryotic Genes

Slides:



Advertisements
Similar presentations
Recombinant DNA Technology
Advertisements

Genomics: READING genome sequences ASSEMBLY of the sequence ANNOTATION of the sequence carry out dideoxy sequencing connect seqs. to make whole chromosomes.
1 Computational Molecular Biology MPI for Molecular Genetics DNA sequence analysis Gene prediction Gene prediction methods Gene indices Mapping cDNA on.
2 March, 2005 Chapter 12 Mutational dissection Normal gene Altered gene with altered phenotype mutagenesis.
Additional Powerful Molecular Techniques Synthesis of cDNA (complimentary DNA) Polymerase Chain Reaction (PCR) Microarray analysis Link to Gene Therapy.
1 Gene Finding Charles Yan. 2 Gene Finding Genomes of many organisms have been sequenced. We need to translate the raw sequences into knowledge. Where.
1 Library Screening, Characterization, and Amplification Screening of libraries Amplification of DNA (PCR) Analysis of DNA (Sequencing) Chemical Synthesis.
DNA Sequencing and Gene Analysis
Genes. Eukaryotic Protein-Coding Gene Structure codingnon-coding.
Characterization, Amplification, Expression
Bacterial Physiology (Micr430)
Human Genome Project. Basic Strategy How to determine the sequence of the roughly 3 billion base pairs of the human genome. Started in Various side.
1 Characterization, Amplification, Expression Screening of libraries Amplification of DNA (PCR) Analysis of DNA (Sequencing) Chemical Synthesis of DNA.
Arrays: Narrower terms include bead arrays, bead based arrays, bioarrays, bioelectronic arrays, cDNA arrays, cell arrays, DNA arrays, gene arrays, gene.
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
Sequence Analysis. Today How to retrieve a DNA sequence? How to search for other related DNA sequences? How to search for its protein sequence? How to.
Bioinformatics page 12, part of ch. 21 Cell and Mol Biol Lab.
General Microbiology (Micr300) Lecture 11 Biotechnology (Text Chapters: ; )
Making, screening and analyzing cDNA clones Genomic DNA clones
Genomics I: The Transcriptome RNA Expression Analysis Determining genomewide RNA expression levels.
Computational Molecular Biology Biochem 218 – BioMedical Informatics Gene Regulatory.
Genome Sequencing & App. of DNA Technologies Genomics is a branch of science that focuses on the interactions of sets of genes with the environment. –
HC70AL Spring 2009 Gene Discovery Laboratory RNA and Tools For Studying Differential Gene Expression During Seed Development 4/20/09tratorp.
Fig 11-1 Chapter 11: recombinant DNA and related techniques.
AP Biology Ch. 20 Biotechnology.
-The methods section of the course covers chapters 21 and 22, not chapters 20 and 21 -Paper discussion on Tuesday - assignment due at the start of class.
PHYSICAL MAPPING AND POSITIONAL CLONING. Linkage mapping – Flanking markers identified – 1cM, for example Probably ~ 1 MB or more in humans Need very.
Chapter 14 Genomes and Genomics. Sequencing DNA dideoxy (Sanger) method ddGTP ddATP ddTTP ddCTP 5’TAATGTACG TAATGTAC TAATGTA TAATGT TAATG TAAT TAA TA.
DNA Technology Chapter 20.
How do you identify and clone a gene of interest? Shotgun approach? Is there a better way?
Restriction Nucleases Cut at specific recognition sequence Fragments with same cohesive ends can be joined.
Fig Chapter 12: Genomics. Genomics: the study of whole-genome structure, organization, and function Structural genomics: the physical genome; whole.
Gene Expression and Gene Regulation. The Link between Genes and Proteins At the beginning of the 20 th century, Garrod proposed: – Genetic disorders such.
1. Bacterial genomes - genes tightly packed, no introns... HOW TO FIND GENES WITHIN A DNA SEQUENCE? Scan for ORFs (open reading frames) - check all 6 reading.
Remember the limitations? –You must know the sequence of the primer sites to use PCR –How do you go about sequencing regions of a genome about which you.
Expression of the Genome The transcriptome. Decoding the Genetic Information  Information encoded in nucleotide sequences contained in discrete units.
Biotechnology.
LECTURES 3/4. CONSTRUCTING and SCREENING cDNA LIBRARIES to ISOLATE NEW GENES ORIGINAL ARTICLES: CLONING BY COMPLEMENTATION: Lew, D, Dulic, V, and Reed.
Chapter 21 Eukaryotic Genome Sequences
Recombinant DNA Technology and Genomics A.Overview: B.Creating a DNA Library C.Recover the clone of interest D.Analyzing/characterizing the DNA - create.
DNA TECHNOLOGY AND GENOMICS CHAPTER 20 P
MCB 317 Genetics and Genomics Topic 11 Genomics. Readings Genomics: Hartwell Chapter 10 of full textbook; chapter 6 of the abbreviated textbook.
Complexities of Gene Expression Cells have regulated, complex systems –Not all genes are expressed in every cell –Many genes are not expressed all of.
Chapter 2 From Genes to Genomes. 2.1 Introduction We can think about mapping genes and genomes at several levels of resolution: A genetic (or linkage)
Chapter 20: DNA Technology and Genomics - Lots of different techniques - Many used in combination with each other - Uses information from every chapter.
目录 The Principle and Application of Common Used Techniques in Molecular Biology chapter 18.
Jan. 13, 2011 B4730/5730 Plant Physiological Ecology Introduction to Physiology and Genetics.
While replication, one strand will form a continuous copy while the other form a series of short “Okazaki” fragments Genetic traits can be transferred.
Bioinformatics Workshops 1 & 2 1. use of public database/search sites - range of data and access methods - interpretation of search results - understanding.
DNA Technology and Genomics
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS) LECTURE 13 ANALYSIS OF THE TRANSCRIPTOME.
Finding genes in the genome
Recombinant DNA Reverse genetics Synthesis of DNA probes Restriction enzymes, plasmids and recombinant DNA Genomic and cDNA libraries Applications.
Chapter 14 GENETIC TECHNOLOGY. A. Manipulation and Modification of DNA 1. Restriction Enzymes Recognize specific sequences of DNA (usually palindromes)
Gene Cloning & Creating DNA Libraries. Клонирование генов Что означает термин «клонирование»? Как происходит клонирование генов? Чем это отличается от.
DNA Technology & Genomics CHAPTER 20. Restriction Enzymes enzymes that cut DNA at specific locations (restriction sites) yielding restriction fragments.
bacteria and eukaryotes
Human Genome Project.
Lecture 8 A toolbox for mechanistic biologists (continued)
Expression of the Genome
DNA Technology and Genomics
Genomes and Their Evolution
Expression of the Genome
Peter John M.Phil, PhD Atta-ur-Rahman School of Applied Biosciences (ASAB) National University of Sciences & Technology (NUST)
Chapter 14 Bioinformatics—the study of a genome
CHAPTER 12 DNA Technology and the Human Genome
Introduction to Bioinformatics II
Material for Quiz 5 from Chapter 8
Gene Structure.
Gene Structure.
Presentation transcript:

Fine Structure and Analysis of Eukaryotic Genes Split genes Multigene families Functional analysis of eukaryotic genes

Split genes and introns The mRNA-coding portion of a gene can be split by DNA sequences that do not encode mature mRNA Exons code for mRNA, introns are segments of genes that do not encode mRNA. Introns are found in most genes in eukaryotes Also found in some bacteriophage genes and in some genes in archae

R-loops can reveal introns

Examples of R-loops in mammalian hemoglobin genes

Comparison of cDNA and genomic clone maps can reveal introns, 1.6.3

Types of exons Transcription start polyA Stop 5’ Gene 3’ promoter GT AG GT AG GT AG GT AG Gene 3’ promoter Open reading frame Initial exon Internal exon Internal coding exon Terminal exon Translation Translation Start Stop mRNA 5’ 3’ 5’ untranslated region Protein coding region 3’ untranslated region

Finding exons with computers Ab initio computation E.g. Genscan: http://genes.mit.edu/GENSCAN.html Uses an explicit, sophisticated model of gene structure, splice site properties, etc to predict exons Compare cDNA sequence with genomic sequence BLAST2 alignments between cDNA and genomic sequences http://www.ncbi.nlm.nih.gov/blast/ Better: Use sim4 Takes into account terminal redundancy at ends of introns http://bio.cse.psu.edu Follow link to “sim4 server in France”

Sequence for human beta-globin gene (HBB): Find exons for HBB Sequence for human beta-globin gene (HBB): Accession number L48217 Thalassemia variant Sequence for HBB mRNA NM_000518 Retrieve those from GenBank at NCBI (or the course website) http://www.ncbi.nlm.nih.gov Get the files in FASTA format Run Genscan and BLAST2 sequences

Genscan analysis of HBB gene

BLAST2: HBB gene vs. cDNA gene cDNA Score = 275 bits (143), Expect = 1e-71 Identities = 143/143 (100%), Positives = 143/143 (100%) Query: 167 acatttgcttctgacacaactgtgttcactagcaacctcaaacagacaccatggtgcacc 226 |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| Sbjct: 1 acatttgcttctgacacaactgtgttcactagcaacctcaaacagacaccatggtgcacc 60 hemoglobin, beta 1 M V H Query: 227 tgactcctgaggagaagtctgccgttactgccctgtggggcaaggtgaacgtggatgaag 286 Sbjct: 61 tgactcctgaggagaagtctgccgttactgccctgtggggcaaggtgaacgtggatgaag 120 hemoglobin, beta 4 L T P E E K S A V T A L W G K V N V D E Query: 287 ttggtggtgaggccctgggcagg 309 ||||||||||||||||||||||| Sbjct: 121 ttggtggtgaggccctgggcagg 143 hemoglobin, beta 24 V G G E A L G R

Introns are removed by splicing RNA precursors

Alternative splicing can generate multiple polypeptides from a single gene

Alternative splicing can generate multiple polypeptides from a single gene, part 2

Multigene families, e.g. encoding hemoglobin

Blot-hybridization analysis showing multiple beta-like globin genes in mammals A: clones, gel B: clones, blot- Hybridization C: genomic DNA, blot- hybridization Rabbit HBE HBG HBD HBB Genomic DNA 3.3 2.8 6.3 2.6 Size of EcoRI fragments that hybridize to globin cDNA, in kb Clones

Maintaining sequence similarity in gene families: Unequal cross-overs and gene conversions

Functional analysis of isolated genes

Gene Expression: where and how much? A gene is expressed when a functional product is made from it. One wants to know many things about how a gene is expressed, e.g. In which tissues? At what developmental stages? In response to which environmental conditions? At which stages of the cell cycle? How much product is made?

RNA blot-hybridizations = Northerns

RNA blot-hybridization: Stage specificity

RT-PCR to detect RNA Gene 5’ 3’ Transcription start start Translation polyA promoter stop mRNA AAAA Duplex PCR product, distinctive for mRNA Reverse transcriptase, dNTPs Random sequence primers cDNAs, or reverse transcripts PCR: primers from adjacent exons, dNTPs, Taq polymerase

In situ hybrid-ization and immuno-reactions

Sequence everything, find function later Determine the sequence of hundreds of thousands of cDNA clones from libraries constructed from many different tissues and stages of development of organism of interest. Initially, the sequences are partials, and are referred to as expressed sequence tags (ESTs). Use these cDNAs in high-throughput screening and testing, e.g. expression microarrays (next presentation).

Massively parallel screening of high-density chip arrays Once the sequence of an entire genome has been determined, a diagnostic sequence can be generated for all the genes. Synthesize this diagnostic sequence (a tag) for each gene on a high-density array on a chip, e.g. 6000 to 20,000 gene tags per chip. Hybridize the chip with labeled cDNA from each of the cellular states being examined. Measure the level of hybridization signal from each gene under each state. Identify the genes whose expression level differs in each state. The genes are already available.

Hybridization of RNA to “Gene chips” Gene chip = high density microarray of sequences from many (all) genes of an organism

Expression profiling using microarrays

PSU’s microarray spotting robot

Find clusters of co-regulated genes Yeast cell-cycle regulated genes, 2.5 cycles Human genes expressed in fibroblasts in response to serum Yeast sporulation associated genes Spellman et al, (1998) Mol. Biol. Cell 9:3273; Chu et al. (1998) Science 282:699; Iyer et al. (1999) Science 283:83.

Search the databases What can be learned from the DNA sequence of a novel gene or polypeptide? Many metabolic functions are carried out by proteins conserved from bacteria or yeast to humans - one may find a homolog with a known function. Many sequence motifs are associated with a specific biochemical function (e.g. kinase, ATPase). A match to such a motif identifies a potential class of reactions for the novel polypeptide.

Databases, cont’d One may find a match to other genes with no known function, but their pattern of expression may be known. Types of databases: Whole and partial genomic DNA sequences Partial cDNAs from tissues (ESTs = expressed sequence tags) Databases on gene expression Genetic maps

Express the protein product Express the protein in large amounts In bacteria In mammalian cells In insect cells (baculovirus vectors) Purify it Assay for various enzymatic or other activities, guided by (e.g.) The way you screened for the clone Sequence matches

Phenotype of directed mutation Mutate the gene in the organism of interest, and then test for a phenotype Gain of function Over-expression Ectopic expression (where normally is silent) Loss of function Knock-out expression of the endogenous gene (homologous recombination, antisense) Express dominant negative alleles Conditional loss-of-function, e.g. knock-out by recombination only in selected tissues

Localization on a gene map E.g., use gene-specific probes for in situ hybridizations to mitotic chromosomes. Align the hybridization pattern with the banding pattern Are there any previously mapped genes in this region that provide some insight into your gene?