Detecting Orthologs Using Molecular Phenotypes a case study: human and mouse Alice S Weston.

Slides:



Advertisements
Similar presentations
Periodic clusters. Non periodic clusters That was only the beginning…
Advertisements

1 Orthologs: Two genes, each from a different species, that descended from a single common ancestral gene Paralogs: Two or more genes, often thought of.
Finding regulatory modules from local alignment - Department of Computer Science & Helsinki Institute of Information Technology HIIT University of Helsinki.
Phylogenetic Trees Understand the history and diversity of life. Systematics. –Study of biological diversity in evolutionary context. –Phylogeny is evolutionary.
Basics of Comparative Genomics Dr G. P. S. Raghava.
Phylogenetic reconstruction
Yeast - why it simply has a lot to say about human disease.
At5g A mutant phenotype? Emily Eder HC70AL - Spring 2005.
Genome evolution There are both proximate and ultimate explanations in molecular biology Mutation continually generates variation in genome content and.
Xenolog: Homologs resulting from horizontal gene transfer.
Identification of a Novel cis-Regulatory Element Involved in the Heat Shock Response in Caenorhabditis elegans Using Microarray Gene Expression and Computational.
1 Gene Finding Charles Yan. 2 Gene Finding Genomes of many organisms have been sequenced. We need to translate the raw sequences into knowledge. Where.
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
Bioinformatics and Phylogenetic Analysis
Alternative splicing and evolution Daniel Jeffares.
Microarray Data Analysis Using R Studies in Tissue Databases Mark Reimers, NCI.
Comparative Expression Moran Yassour +=. Goal Build a multi-species gene-coexpression network Find functions of unknown genes Discover how the genes.
Protein Interactions and Disease Audry Kang 7/15/2013.
Human Molecular Genetics Section 14–3
DEMO CSE fall. What is GeneMANIA GeneMANIA finds other genes that are related to a set of input genes, using a very large set of functional.
BTN323: INTRODUCTION TO BIOLOGICAL DATABASES Day2: Specialized Databases Lecturer: Junaid Gamieldien, PhD
Multiple Sequence Alignment CSC391/691 Bioinformatics Spring 2004 Fetrow/Burg/Miller (Slides by J. Burg)
BIONFORMATIC ALGORITHMS Ryan Tinsley Brandon Lile May 9th, 2014.
Chapter 26: Phylogeny and the Tree of Life Objectives 1.Identify how phylogenies show evolutionary relationships. 2.Phylogenies are inferred based homologies.
20.1 Structural Genomics Determines the DNA Sequences of Entire Genomes The ultimate goal of genomic research: determining the ordered nucleotide sequences.
Molecular Biology Primer. Starting 19 th century… Cellular biology: Cell as a fundamental building block 1850s+: ``DNA’’ was discovered by Friedrich Miescher.
HUMAN-MOUSE CONSERVED COEXPRESSION NETWORKS PREDICT CANDIDATE DISEASE GENES Ala U., Piro R., Grassi E., Damasco C., Silengo L., Brunner H., Provero P.
The Rise of Genomics AP Biology Fall The Human Genome Project  With the invention of PCR and automated sequencing, scientists argued for the sequencing.
Construction of Substitution Matrices
Web Databases for Drosophila Introduction to FlyBase and Ensembl Database Wilson Leung6/06.
Predicting protein degradation rates Karen Page. The central dogma DNA RNA protein Transcription Translation The expression of genetic information stored.
MCB 317 Genetics and Genomics Topic 11 Genomics. Readings Genomics: Hartwell Chapter 10 of full textbook; chapter 6 of the abbreviated textbook.
P HYLO P AT : AN UPDATED VERSION OF THE PHYLOGENETIC PATTERN DATABASE CONTAINS GENE NEIGHBORHOOD Presenter: Reihaneh Rabbany Presented in Bioinformatics.
BIOLOGICAL DATABASES. BIOLOGICAL DATA Bioinformatics is the science of Storing, Extracting, Organizing, Analyzing, and Interpreting information in biological.
MEME homework: probability of finding GAGTCA at a given position in the yeast genome, based on a background model of A = 0.3, T = 0.3, G = 0.2, C = 0.2.
November 18, 2000ICTCM 2000 Introductory Biological Sequence Analysis Through Spreadsheets Stephen J. Merrill Sandra E. Merrill Marquette University Milwaukee,
Using blast to study gene evolution – an example.
While gene expression data is widely available describing mRNA levels in different cancer cells lines, the molecular regulatory mechanisms responsible.
Cédric Notredame (08/12/2015) Molecular Evolution Cédric Notredame.
Gene expression & Clustering. Determining gene function Sequence comparison tells us if a gene is similar to another gene, e.g., in a new species –Dynamic.
Functional prediction methods. The usual troubles of the molecular and cellular biology labs What are the functions of a previously non characterized.
1 The Genome Gamble, Knowledge or Carnage? Comparative Genomics Leading the Organon Tim Hulsen, Oss, November 11, 2003.
Construction of Substitution matrices
Ayesha M.Khan Spring Phylogenetic Basics 2 One central field in biology is to infer the relation between species. Do they possess a common ancestor?
Motif Search and RNA Structure Prediction Lesson 9.
Case Study: Characterizing Diseased States from Expression/Regulation Data Tuck et al., BMC Bioinformatics, 2006.
1 Genomics Advances in 1990 ’ s Gene –Expressed sequence tag (EST) –Sequence database Information –Public accessible –Browser-based, user-friendly bioinformatics.
1 Computational functional genomics Lital Haham Sivan Pearl.
Fanconi Anemia Anthony Winchell. What is Fanconi Anemia?
Finding Motifs Vasileios Hatzivassiloglou University of Texas at Dallas.
Biotechnology and Bioinformatics: Bioinformatics Essential Idea: Bioinformatics is the use of computers to analyze sequence data in biological research.
Prader Willi Syndrome & Necdin
Bioinformatics What is a genome? How are databases used? What is a phylogentic tree?
The Transcriptional Landscape of the Mammalian Genome
Basics of Comparative Genomics
Gene Hunting: Design and statistics
The Making of the Fittest Evidence of Evolution youtube
Peter John M.Phil, PhD Atta-ur-Rahman School of Applied Biosciences (ASAB) National University of Sciences & Technology (NUST)
Prediction of Regulatory Elements for Non-Model Organisms Rachita Sharma, Patricia.
Genomes and Their Evolution
Bioinformatics Biological Data Computer Calculations +
Genome organization and Bioinformatics
Introduction to Bioinformatics II
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
SEG5010 Presentation Zhou Lanjun.
Basics of Comparative Genomics
Problems from last section
Relationship between Genotype and Phenotype
Volume 11, Issue 7, Pages (May 2015)
Presentation transcript:

Detecting Orthologs Using Molecular Phenotypes a case study: human and mouse Alice S Weston

What is a “molecular phenotype”? mRNA expression patterns detected using microarray techniques can reveal the co-expression of two genes in the same tissue or sample

Hypothesis It is speculated that orthologous genes between Human and Mouse will be co- expressed with a similar set of partners compared to a pair of non-orthologous genes that are similar at the sequence level.

Why do we care? categorize the biological function of mammalian core proteins learn about the similarity of genes with little sequence homology see how orthologous genes have changed since their divergence

evolutionary sequence changes can mask orthologs coding region active site = mutation BLAST hit #1 paralog BLAST hit #2 true ortholog!

Methods calculated Spearman correlations used z-Fisher transform because of missing data—consider dimension built co-expression neighborhood for each human gene (center), 100 each found related genes in neighborhoods of top two BLAST hits in mouse for the central human gene

Example 10270h m m BLAST hit #1BLAST hit #2

Methods (cont.) determined the ranks of the neighbors in relation to their central mouse gene used a sign system to tell which mouse gene was more co-expressed with the human gene asked: Are there any instances where BLAST hit #2 is the true ortholog?

Results

cases where the second best mouse gene in BLAST has the best co-expression with the human gene the best mouse gene in BLAST has the best co-expression with the human gene ~62.9% of the time results look promising if sample size is increased

problems along the way…. not enough microarray co-expression data to rank most of the neighbors some human genes have no predicted orthologs in mouse— for building mouse neighborhoods limited sample size: some human genes do not have two orthologs in mouse— to test hypothesis

….solutions do more microarray experiments, increase amount of data compare human genes to another species with more known orthologs

Acknowledgments Josh Stuart Reading: Barak A Cohen, Yitzhak Pilpel, Robi D. Mitra, and George M. Church. (2002) Discrimination between Paralogs using Microarray Analysis: Application to the Yap1p and Yap2p Transcriptional Networks. Molecular Biology of the Cell. 13, 1608 – 1614.

Happy Spring Break!!!