Arrowsmith extensions to bio-informatics Vetle I. Torvik.

Slides:



Advertisements
Similar presentations
After 13 years of scientist work predominatly in USA & UK the DNA sequence of the human genome was completed in 2003 Any ideas how they did it? What would.
Advertisements

CAVEAT 1 MICROARRAY EXPERIMENTS ARE EXPENSIVE AND COMPLICATED. MICROARRAY EXPERIMENTS ARE THE STARTING POINT FOR RESEARCH. MICROARRAY EXPERIMENTS CANNOT.
A Systematic approach to the Large-Scale Analysis of Genotype- Phenotype correlations Paul Fisher Dr. Robert Stevens Prof. Andrew Brass.
Collaborative Information Management: Advanced Information Processing in Bioinformatics Joost N. Kok LIACS - Leiden Institute of Advanced Computer Science.
Genome-wide prediction and characterization of interactions between transcription factors in S. cerevisiae Speaker: Chunhui Cai.
Gene expression analysis summary Where are we now?
Biological Databases Notes adapted from lecture notes of Dr. Larry Hunter at the University of Colorado.
CHAPTER 15 Microbial Genomics Genomic Cloning Techniques Vectors for Genomic Cloning and Sequencing MS2, RNA virus nt sequenced in 1976 X17, ssDNA.
Bacterial Physiology (Micr430)
An Introduction to DNA Microarrays Jack Newton University of Alberta
Literature Mining Tools for Analysis of Genomic Data Ramin Homayouni, Ph.D. Associate Professor of Biology Director of Bioinformatics UTHSC BINF April.
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
Sequence Analysis. Today How to retrieve a DNA sequence? How to search for other related DNA sequences? How to search for its protein sequence? How to.
Why microarrays in a bioinformatics class? Design of chips Quantitation of signals Integration of the data Extraction of groups of genes with linked expression.
Human Genome Project Seminal achievement. Scientific milestone. Scientific implications. Social implications.
Arabidopsis Gene Project GK-12 April Workshop Karolyn Giang and Dr. Mulligan.
On line (DNA and amino acid) Sequence Information
Biotechnology SB2.f – Examine the use of DNA technology in forensics, medicine and agriculture.
Gramene Objectives Develop a database and tools to store, visualize and analyze data on genetics, genomics, proteomics, and biochemistry of grass plants.
A systems biology approach to the identification and analysis of transcriptional regulatory networks in osteocytes Angela K. Dean, Stephen E. Harris, Jianhua.
DNA MICROARRAYS WHAT ARE THEY? BEFORE WE ANSWER THAT FIRST TAKE 1 MIN TO WRITE DOWN WHAT YOU KNOW ABOUT GENE EXPRESSION THEN SHARE YOUR THOUGHTS IN GROUPS.
Introduction to Bioinformatics CPSC 265. Interface of biology and computer science Analysis of proteins, genes and genomes using computer algorithms and.
BioQUEST / SCALE-IT Module From Omics Data to Knowledge Case 1: Microarrays Namyong Lee Minnesota State University, Mankato Matthew Macauley Clemson University.
Bioinformatics Brad Windle Ph# Web Site:
20.1 Structural Genomics Determines the DNA Sequences of Entire Genomes The ultimate goal of genomic research: determining the ordered nucleotide sequences.
HUMAN-MOUSE CONSERVED COEXPRESSION NETWORKS PREDICT CANDIDATE DISEASE GENES Ala U., Piro R., Grassi E., Damasco C., Silengo L., Brunner H., Provero P.
Copyright © 2010 Pearson Education Inc. Lecture 01 – Genetics & Genomics: An Introduction Based on Chapter 1 – Genetics: An introduction.
What is Genetic Research?. Genetic Research Deals with Inherited Traits DNA Isolation Use bioinformatics to Research differences in DNA Genetic researchers.
Gramene Objectives Provide researchers working on grasses and plants in general with a bird’s eye view of the grass genomes and their organization. Work.
Mining Biological Data. Protein Enzymatic ProteinsTransport ProteinsRegulatory Proteins Storage ProteinsHormonal ProteinsReceptor Proteins.
BIOLOGICAL DATABASES. BIOLOGICAL DATA Bioinformatics is the science of Storing, Extracting, Organizing, Analyzing, and Interpreting information in biological.
Central dogma: the story of life RNA DNA Protein.
Gene, MicroArray and GAs Ashish Anand Kanpur Genetic Algorithms Laboratory (KanGAL) IIT Kanpur.
Bioinformatics and Computational Biology
Integrated Genomic and Proteomic Analyses of a Systematically Perturbed Metabolic Network Science, Vol 292, Issue 5518, , 4 May 2001.
Aiding Biomedical Researchers with Tools to Assist Discovery Neil R. Smalheiser May 18, 2006.
The Future of Genetics Research Lesson 7. Human Genome Project 13 year project to sequence human genome and other species (fruit fly, mice yeast, nematodes,
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
1 Genomics Advances in 1990 ’ s Gene –Expressed sequence tag (EST) –Sequence database Information –Public accessible –Browser-based, user-friendly bioinformatics.
An Introduction to NCBI & BLAST National Center for Biotechnology Information Richard Johnston Pasadena City College.
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
Biotechnology and Bioinformatics: Bioinformatics Essential Idea: Bioinformatics is the use of computers to analyze sequence data in biological research.
Genetics of Gene Expression BIOS Statistics for Systems Biology Spring 2008.
Notes: Human Genome (Right side page)
Human Genomics Higher Human Biology. Learning Intentions Explain what is meant by human genomics State that bioinformatics can be used to identify DNA.
CAMPBELL BIOLOGY IN FOCUS © 2014 Pearson Education, Inc. Urry Cain Wasserman Minorsky Jackson Reece 18 Genomes and Their Evolution Questions prepared by.
CHAPTER 1 Genetics: An Introduction Authored by Peter J. Russell.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Higher Human Biology Sub topic 5 (a)
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
Bioinformatics Madina Bazarova. What is Bioinformatics? Bioinformatics is marriage between biology and computer. It is the use of computers for the acquisition,
New genes can be added to an organism’s DNA.
Functional Annotation of the Horse Genome
Access to Sequence Data and Related Information
Bioinformatics and BLAST
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Functional Impact of Transposable Element using Bioinformatic Analysis
Mutations Mutations are changes in DNA.
The Future of Genetic Research
Biological Databases BI420 – Introduction to Bioinformatics
Basic Local Alignment Search Tool (BLAST)
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Gene Safari (Biological Databases)
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Basic Local Alignment Search Tool
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presentation transcript:

Arrowsmith extensions to bio-informatics Vetle I. Torvik

Discovering new gene sequences 4 Start with a novel DNA sequence 4 find overlapping sequences within the expressed sequence tag (EST) database  find others that overlap with that one, until one has identified an entire new full-length gene ATGATAGGAGA GGAGAGCTGAGA TGAGATGCGCTG CGCTGATACTAGA CTAGATGATAGAGATGCC ATGATAGGAGAGCTGAGATGCGCTGATACTAGATGATAGAGATGCC

The Arrowsmith approach applied to nucleotide or protein sequences 4 begin with two different sets A and C of sequences that do not overlap 4 search for sequences B in the database that overlap with one or more sequences in both A and C AB 1 B 1 BC 1 ATGCTCTCGCGCTACGACTAGCATACTG ACTGATCGCTAGCTATGA ATCGACAAGCTATGTGCAACTG CCTGATCGCTACTACTAGCTGA TCTCGCTACTAGATCACTAGCTTA CTCGATGAGCGATGATCGCTAGCTATGGG ATCTGATACTAGCTACGACTAGC GTGAGGATCGCGATGATGATG

Linking to microarray experimental data 4 A = set of microarray experiments that measured reelin 4 C = set of microarray experiments that measured tooth development 4 A and C might be in the same or different databases 4 B-terms = genes whose expression was correlated with reelin in some system, and that were expressed during tooth developing on the other 4 If reelin regulates certain genes that have roles during tooth development, one may hypothesize a role for reelin in tooth development as well, even if none of the tooth microarray studies had examined reelin explicitly

This might stimulate someone to test... 4 if reelin is expressed at specific times and places within the developing toothbud 4 if reelin actively regulates the genes on the B-list 4 if tooth development is abnormal in the reeler mouse that genetically lacks reelin

Linking PubMed to bio- informatics databases PubMed A-literature PubMed C-literature Microarray gene A Microarray gene C B-gene list

Other databases 4 Genomic 4 Quantitative trait loci (QTL) 4 Atlases 4 Images 4 ETC

Using the literature to link genes 4 If genes A strongly co-occurs with gene B in the literature due to a biologically significant relationship, and 4 gene B and C similarly co-occur, 4 Then genes A and C are likely to be biologically related as well 4 When A and C do not co-occur above the chance level, then the relation between A and C may not be previously known or documented

4 Special case of the Arrowsmith 1-node search Gene B Gene CGene A