Charting 2D Gels making use of the different Chlamydomonas databases available Christine Markert, Universität Jena.

Slides:



Advertisements
Similar presentations
Advancing Science with DNA Sequence Maize Missouri 17 chromosome 10 project update Dan Rokhsar 3 October 2006.
Advertisements

Annotation of Gene Function …and how thats useful to you.
Genomes and Proteomes genome: complete set of genetic information in organism gene sequence contains recipe for making proteins (genotype) proteome: complete.
Scott E. Baker Pacific Northwest National Laboratory BMS Annual Scientific Meeting: Exploitation of Fungi Manchester, UK September 6, 2005 Genome and proteomic.
Doug Brutlag 2011 Sequencing the Human Genome Doug Brutlag Professor Emeritus of Biochemistry.
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display Human Genetics Concepts and Applications Eighth Edition.
Biological Databases Chi-Cheng Lin, Ph.D. Associate Professor Department of Computer Science Winona State University – Rochester Center
CHAPTER 15 Microbial Genomics Genomic Cloning Techniques Vectors for Genomic Cloning and Sequencing MS2, RNA virus nt sequenced in 1976 X17, ssDNA.
Sequencing Informatics Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics.
Central Dogma Information storage in biological molecules DNA RNA Protein transcription translation replication.
BI420 – Course information Web site: Instructor: Gabor Marth Teaching.
Sequencing Informatics Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics.
Protein-protein Interactions Hsueh-Fen Juan 2003, Mar 31 NTNU.
Human Genome Project. Basic Strategy How to determine the sequence of the roughly 3 billion base pairs of the human genome. Started in Various side.
The Integrated Molecular Analysis of Genomes and their Expression Consortium’s Data Mining Tools: Introducing the IQ Peg Folta Lawrence Livermore National.
Compartmentalized Shotgun Assembly ? ? ? CSA Two stated motivations? ?
Reminder: Class on Friday, Discussion of Li et al. Proposal/Projects CAMERA feedback?
Computational studies of intramolecular disulfide bonded catenanes as a novel stabilizing mechanism in thermophilic microbes August 23, 2007 Daniel Park.
Genome sequencing. Vocabulary Bac: Bacterial Artificial Chromosome: cloning vector for yeast Pac, cosmid, fosmid, plasmid: cloning vectors for E. coli.
Human Genome Project Seminal achievement. Scientific milestone. Scientific implications. Social implications.
GTL User Facilities Facility II: Whole Proteome Analysis Michelle V. Buchanan.
Presentation on genome sequencing. Genome: the complete set of gene of an organism Genome annotation: the process by which the genes, control sequences.
歐亞書局 PRINCIPLES OF BIOCHEMISTRY Chapter 9 DNA-Based Information Technologies.
Chapter 14 Genomes and Genomics. Sequencing DNA dideoxy (Sanger) method ddGTP ddATP ddTTP ddCTP 5’TAATGTACG TAATGTAC TAATGTA TAATGT TAATG TAAT TAA TA.
Human Genome Project by: Amanda Mosello. What is the Human Genome Project? created in 1990, by the National Institutes of Health and the US Department.
CUGI Pilot Sequencing/Assembly Projects Christopher Saski.
What is comparative genomics? Analyzing & comparing genetic material from different species to study evolution, gene function, and inherited disease Understand.
DOE Resources & Facilities for Biological Discovery : Realizing the Potential Presentation to the BERAC 25 April 2002.
Bioinformatics Overview, NCBI & GenBank JanPlan 2012.
20.1 Structural Genomics Determines the DNA Sequences of Entire Genomes The ultimate goal of genomic research: determining the ordered nucleotide sequences.
Genome Sequencing in the Legumes Le et al Phylogeny Major sequencing efforts Minor sequencing efforts ~14 MY ~45 MY.
Steps in a genome sequencing project Funding and sequencing strategy source of funding identified / community drive development of sequencing strategy.
Biological Motivation for Fragment Assembly Rhys Price Jones Anne R. Haake.
>5000 The length of non-redundant consensus sequences (bp) Number of non-redundant consensus.
The Gene Ontology project Jane Lomax. Ontology (for our purposes) “an explicit specification of some topic” – Stanford Knowledge Systems Lab Includes:
DNA TECHNOLOGY AND BIOTECHNOLOGY PAGES Chapter 10.
Gramene Objectives Provide researchers working on grasses and plants in general with a bird’s eye view of the grass genomes and their organization. Work.
Initial sequencing and analysis of the human genome Averya Johnson Nick Patrick Aaron Lerner Joel Burrill Computer Science 4G October 18, 2005.
Chromosome 12 M. Pietrella 1, G. Falcone 1, E. Fantini 1, A. Fiore 1, C. Perla 1, M.R. Ercolano 2, A. Barone 2, M.L. Chiusano 2, S. Grandillo 3, N. D’Agostino.
Chromosome 12 M. Pietrella 1, G. Falcone 1, E. Fantini 1, A. Fiore 1, M.R. Ercolano 2, A. Barone 2, M.L. Chiusano 2, S. Grandillo 3, N. D’Agostino 2, A.
Genomics.
ORNL scientists report the most comprehensive characterization of the subcellular proteome of Populus xylem. Contact: Udaya Kalluri,
EB3233 Bioinformatics Introduction to Bioinformatics.
Scope of the Gene Ontology Vocabularies. Compile structured vocabularies describing aspects of molecular biology Describe gene products using vocabulary.
Center for Integrated Fungal Research
1 From Mendel to Genomics Historically –Identify or create mutations, follow inheritance –Determine linkage, create maps Now: Genomics –Not just a gene,
MAPPING OF SEQUENCES TO GENE ONTOLOGY. GO consortium.
Genome Analysis Assaad text book slides only Lectures by F. Assaad can be downlaoded from muenchen.de/~farhah/index.htm.
Drosophila Genomics Where are we now? Where are we going? Christopher Shaffer, Wilson Leung, Sarah Elgin Dept of Biology; Washington University in St.
Plasmodium falciparum (3D7) - published in Draft coverage. No sequence updates for a year. No new annotation since? Leishmania major Friedlin - version.
Welcome to the combined BLAST and Genome Browser Tutorial.
High throughput biology data management and data intensive computing drivers George Michaels.
Using public resources to understand associations Dr Luke Jostins Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015.
生物資料庫搜尋 ( 第八組 ) 連威森 王鼎 黃智楹 張鈞淵
Why is Drug Target Identification important for Drug Discovery? I. Introduction.
Virginia Commonwealth University
Human Genome Project.
Department of Genetics • Stanford University School of Medicine
Greg Challis Department of Chemistry, University of Warwick, UK
Genomic Data Manipulation
Genomes and Their Evolution
Chromatophore Genome Sequence of Paulinella Sheds Light on Acquisition of Photosynthesis by Eukaryotes  Eva C.M. Nowack, Michael Melkonian, Gernot Glöckner 
Schematic of cellular role categories of theoretical (open bars) and identified proteins on a 2-D electrophoresis gel, pH 4–7 (black bars), in L. casei.
From Mendel to Genomics
Introduction to Sequencing
Sequence the 3 billion base pairs of human
Pangenomes and core genomes of 13 M. florum strains.
Human Genome Project Seminal achievement. Scientific milestone.
Milk-associated proteomes.
Presentation transcript:

Charting 2D Gels making use of the different Chlamydomonas databases available Christine Markert, Universität Jena

Outline Project: Mapping Thylakoid Proteins 2D-PAGE Databases Types Limitations

Thesis Project Comparison of different mutant strains Isolation of Thylakoids 2D-PAGE Mass Spectrometry Database Analysis  PROTEIN INFORMATION

Information of Interest Amino acid sequence Database Entry Annotation possible Modifications New Database Information

Types of Databases Assembled Lhc Proteins EST Databases Genomic Database (first release February 2003)

Protein dynamics of photosystem I and its light- harvesting antenna proteins in assembly and adaptation processes Variation of physiological conditions Mutant analysis (PSI defective) Functional Proteomics of Transmembrane Multiprotein Complexes

Data Mining Annotation of Database Genome  Introns Modifications (e.g. Phosphorylation) Homologues

Chlamydomonas reinhardtii is a unicellular green alga which swims with two flagella and has chloroplasts for photosynthesis. The genome is ~100 Mbp and is distributed in 17 chromosomes. The initial goal of the Chlamydomonas reinhardtii genome project is to generate EST sequences, and to date there are over 55,860 Chlamydomonas ESTs in GenBank. Because Chlamydomonas reinhardtii is quite GC rich (~65% G+C), we have begun a pilot project to sequence several BACs in collaboration with Dr. Susan K. Dutcher in the Department of Genetics at Washington University School of Medicine in St. Louis, Missouri. These BAC clones cover a region of ~600 kbp in the Chlamydomonas reinhardtii genome.Dr. Susan K. Dutcher The BAC library prepared by Pete Lefebvre in collaboration with Genome Systems (now IncyteGenomics) is available from IncyteGenomics. Requests for these BACs should be directed to Incyte GenomicsIncyte Genomics A table viewing the BAC clones being sequenced can be viewed at URL: In accord with the Bermuda Agreement and NHGRI policy, we are depositing all our human genomic sequence data into the High-Throughput Genomic Sequences (HTGS) division of GenBank as soon as a target large insert clone sequencing project can be assembled into contigs greater than 2 kb. The following BAC clones are being sequenced. This sequence data has been deposited into GenBank, and given accession numbers:High-Throughput Genomic Sequences (HTGS) division of GenBank

Status The current draft release, version 2.0 of the Chlamydomonas reinhardtii genome, was generated using the whole genome shotgun strategy, using only data generated here at the JGI. Gene models for this release are currently undergoing manual curation by members of the Chlamydomonas community.This massive effort began during the Chlamydomonas Jamboree, held at the JGI during the week of Dec 8th, Annotations can be viewed immediately from the protein pages as they become available. To search for genes that have manual annotations, please read the directions on our advanced search page. ReleaseThis assembly was constructed with JAZZ, the JGI assembler, capitalizing on paired-end sequencing reads. After trimming for vector and quality, 1.8 Million reads assembled into 3211 scaffolds totaling 125 Mbp. Roughly half of the genome is contained in 72 scaffolds, all of at least 504 Kb in length. Gene models have been updated using the depth of EST/cDNA information publicly available for C. reinhardtii. Future plansManual review of gene models will continue through the spring, and results will be discussed at a second Jamboree due to be scheduled this coming summer. Stanford will begin finishing the genome later this year. We will update this site once finishing efforts are complete. FundingThis work was performed under the auspices of the US Department of Energy's Office of Science, Biological and Environmental Research Program and the by the University of California, Lawrence Livermore National Laboratory under Contract No. W-7405-Eng-48, Lawrence Berkeley National Laboratory under contract No. DE-AC03-76SF00098 and Los Alamos National Laboratory under contract No. W-7405-ENG-36.advanced search

Assembly C. reinhardtii release 2.0 assembled scaffolds (unmasked): chlre2.fasta.gz (29054 Kb) C. reinhardtii release 2.0 assembled scaffolds (masked): chlre2.allmasked.gz (28364 Kb) C. reinhardtii release 2.0 predicted proteins: proteins.finalModelsV2.fasta.gz (5009 Kb) chlre2.fasta.gz chlre2.allmasked.gz proteins.finalModelsV2.fasta.gz

SEARCH BY KOG ID SEARCH BY KOG KEYWORDCELLULAR PROCESSES AND SIGNALING SEARCH BY KOG IDSEARCH BY KOG KEYWORD MCell wall/membrane/envelope biogenesis 80 gene models NCell motility 6 gene models OPosttranslational modification, protein turnover, chaperones 712 gene models TSignal transduction mechanisms 666 gene models UIntracellular trafficking, secretion, and vesicular transport 308 gene models VDefense mechanisms 55 gene models WExtracellular structures 28 gene models YNuclear structure 33 gene models ZCytoskeleton 170 gene models Total Gene Count2058INFORMATION STORAGE AND PROCESSING 80 gene models 6 gene models 712 gene models 666 gene models 308 gene models 55 gene models 28 gene models 33 gene models 170 gene models ARNA processing and modification 319 gene models BChromatin structure and dynamics 239 gene models JTranslation, ribosomal structure and biogenesis 369 gene models KTranscription 304 gene models LReplication, recombination and repair 219 gene models Total Gene Count1450METABOLISM 319 gene models 239 gene models 369 gene models 304 gene models 219 gene models CEnergy production and conversion 255 gene models DCell cycle control, cell division, chromosome partitioning 199 gene models EAmino acid transport and metabolism 278 gene models FNucleotide transport and metabolism 111 gene models GCarbohydrate transport and metabolism 265 gene models HCoenzyme transport and metabolism 100 gene models ILipid transport and metabolism 232 gene models PInorganic ion transport and metabolism 203 gene models QSecondary metabolites biosynthesis, transport and catabolism 172 gene models Total Gene Count1815POORLY CHARACTERIZED 255 gene models 199 gene models 278 gene models 111 gene models 265 gene models 100 gene models 232 gene models 203 gene models 172 gene models RGeneral function prediction only 1057 gene models SFunction unknown 381 gene models Total Gene Count gene models 381 gene models

Summary

9 2D-map of Lhca proteins from C. reinhardtii