August 20, 2007 BDGP modENCODE Data Production

BDGP Data Production

Project Goals:
- 21,000 RACE experiments
- 6,000 cDNAs from directed screening and full insert sequencing
- 3,000 RT-PCR experiments and insert sequencing

Data Tracking Requirements:
- Identification of genomic regions for interrogation
- Tracking and associations of experiments
- Analysis of experimental data
- Submission of results to GenBank and DCC

Data Resources

The identification of experiments is based on existing resources:
- Affymetrix microarray data
- BDGP EST/cDNA clones

Embryonic RNA Expression on Genome Tiling Arrays
Manak et al. (2006) Nat. Genet. 38(10):

BDGP EST and cDNA Projects

Data Resources:
- 295,379 BDGP EST end sequences
- 109,398 Exelixis EST end sequences
- 15,015 BDGP clone full-length sequences

Project Resources:
- Production tracking and analysis in an integrated database LIMS

BDGP Production Tracking

Existing production is tracked through an internal web-based LIMS.

Production Data Workflow
- Benchwork Registration
- Gel Processing
- Clone Data Processing

BDGP Data Analysis

BDGP 5’ RACE
- Identification of 2,074 5’ RACE primers from the set of CGs from Ohler et al.
- 96 selected for experiments

Directed cDNA Screening using iPCR

The congo exon screen is a model for the 5’ RACE, directed cDNA, and RT-PCR screening.

congo:
- 41,564 protein-coding exons from comparative analysis by Manolis Kellis
- 434 exons did not overlap Rel 4.3 annotations or existing EST/cDNA data
- 267 (61.5%) completed full insert sequencing
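
The filtering step in the congo screen (keeping predicted exons that overlap neither Rel 4.3 annotations nor existing EST/cDNA data) can be sketched as an interval-overlap test. This is only an illustration under stated assumptions, not the BDGP pipeline: the `Interval` type, the `overlaps` and `novel_exons` helpers, and the toy coordinates are all hypothetical. The last line re-derives the slide's percentage from its own counts (267 of 434).

```python
from typing import NamedTuple


class Interval(NamedTuple):
    """Hypothetical genomic interval: chromosome arm, 1-based inclusive coordinates."""
    arm: str
    start: int
    end: int


def overlaps(a: Interval, b: Interval) -> bool:
    """True if the two intervals share at least one base on the same arm."""
    return a.arm == b.arm and a.start <= b.end and b.start <= a.end


def novel_exons(predicted, known):
    """Keep predicted exons that overlap none of the known features."""
    return [p for p in predicted if not any(overlaps(p, k) for k in known)]


# Toy data (made up for illustration; not real congo predictions).
predicted = [
    Interval("2L", 100, 200),
    Interval("2L", 500, 600),
    Interval("3R", 100, 200),
]
known = [Interval("2L", 150, 300)]  # stands in for annotations plus EST/cDNA data

novel = novel_exons(predicted, known)

# Counts taken from the slide: 267 of the 434 novel exons were fully sequenced.
completion_pct = round(100 * 267 / 434, 1)
```

A linear scan over `known` is adequate for a few hundred candidates; a genome-scale comparison would more likely use sorted intervals or an interval tree.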

cDNA Clone Capture using iPCR
- Identification of Exon
- Primer Design and Experiment Registration
- PCR Plate Production
- Cloning, end and internal sequencing
- Assembly and Analysis of screen data
- Full insert sequencing of positive matches
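
One way a tracking system might represent the clone-capture steps above is as an ordered set of stages that each experiment moves through. The sketch below is an assumption for illustration only: the `Stage` names paraphrase the slide, and `advance` is a hypothetical helper, not part of the BDGP LIMS.

```python
from enum import IntEnum


class Stage(IntEnum):
    """Hypothetical stage names paraphrasing the slide's six steps, in order."""
    EXON_IDENTIFIED = 1
    PRIMERS_REGISTERED = 2
    PCR_PLATE_PRODUCED = 3
    CLONED_AND_SEQUENCED = 4
    ASSEMBLED_AND_ANALYZED = 5
    FULL_INSERT_SEQUENCED = 6


def advance(stage: Stage) -> Stage:
    """Return the next stage; refuse to move past the final one."""
    if stage is Stage.FULL_INSERT_SEQUENCED:
        raise ValueError("experiment is already at the final stage")
    return Stage(stage + 1)


# Walk a single experiment through every stage.
history = [Stage.EXON_IDENTIFIED]
while history[-1] is not Stage.FULL_INSERT_SEQUENCED:
    history.append(advance(history[-1]))
```

Using an ordered enumeration makes it cheap to ask how many experiments have reached a given step, which is the kind of question the data tracking requirements imply.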

Computationally predicted conserved exons validated by cDNA screening and sequencing
I. Gene modifications
II. Identification of New Genes
Predictions - Kellis

BDGP Data Production

The remaining work on the LIMS and data production system:
- Completion of migration from EST/cDNA project to new code
- Identification and prioritization of experiments
- Integration of microarray data
- Specification of success
- Definition of data transfer to DCC

BDGP Data Production

cDNA Sequencing Corrects Gene Models