Christopher Roberts Supervisor: Dr. Luis Mur (former IBS) and Dr. Ian Armstead (formerly IGER) Institute of Biological, Environmental.

Slides:



Advertisements
Similar presentations
Table of Contents Human Inheritance Human Genetic Disorders
Advertisements

Slide 6.2. Slide 6.3 Dominance and recessiveness - both alleles may encode proteins; example - eye colour -the gene that encodes the protein that "does.
HCS806 “Methods in Horticulture and Crop Science” Introduction to methods in Bioinformatics for plant science. David Francis (Coordinator) Ian Holford.
Chapter 19 Evolutionary Genetics 18 and 20 April, 2004
Analysis of Physiological Characteristics and Gene Expression Associated with Different Light Conditions in Brachypodium distachyon By Jared Streich, Research.
Wheat chr 3B T. urartuAe. taushii Class I With LTR Gypsy Copia LTR unclass Without LTR LINE SINE0.01 Class.
Basics of Comparative Genomics Dr G. P. S. Raghava.
History, protohistory and prehistory of the Arabidopsis thaliana chromosome complement Henry Yves et al 2006, in press.
MGH-PGA Genomic Analysis of Stress and Inflammation: Sequence Analysis of Pseudomonas aeruginosa Strain PA14 Nicole T. Liberati, Dan G. Lee, Jacinto M.
Introduction to DNA Microarrays Todd Lowe BME 88a March 11, 2003.
Genetica per Scienze Naturali a.a prof S. Presciuttini Human and chimpanzee genomes The human and chimpanzee genomes—with their 5-million-year history.
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
Fuzzy K means.
We are developing a web database for plant comparative genomics, named Phytome, that, when complete, will integrate organismal phylogenies, genetic maps.
BLOSUM Information Resources Algorithms in Computational Biology Spring 2006 Created by Itai Sharon.
Supported by the NSF Plant Genome Research and REU Programs *Supported by the NSF Plant Genome Research and REU Programs A tutorial on the Cell Wall Genomics.
Plant genomes: phenotypes evolving by new rules Todd J. Vision Department of Biology University of North Carolina at Chapel Hill.
Materials and Methods Abstract Conclusions Introduction 1. Korber B, et al. Br Med Bull 2001; 58: Rambaut A, et al. Nat. Rev. Genet. 2004; 5:
Quantitative Genetics
Evolution of Populations
Chapter 15 Table of Contents Section 1 History of Evolutionary Thought
What is comparative genomics? Analyzing & comparing genetic material from different species to study evolution, gene function, and inherited disease Understand.
Natural Variation in Arabidopsis ecotypes. Using natural variation to understand diversity Correlation of phenotype with environment (selective pressure?)
Chapter 5 Characterizing Genetic Diversity: Quantitative Variation Quantitative (metric or polygenic) characters of Most concern to conservation biology.
Dr Andrew Harrison Departments of Mathematical Sciences and Biological Sciences University of Essex Looking for signals in tens of thousands.
Figure S1_Yao Qin et al. Figure S1 Occurrence and distribution of trihelix family in different plant species. Red branches in the cladogram indicate that.
Genomes and Their Evolution. GenomicsThe study of whole sets of genes and their interactions. Bioinformatics The use of computer modeling and computational.
20.1 Structural Genomics Determines the DNA Sequences of Entire Genomes The ultimate goal of genomic research: determining the ordered nucleotide sequences.
1 The Interrupted Gene. Ex Biochem c3-interrupted gene Introduction Figure 3.1.
ANALYSIS AND VISUALIZATION OF SINGLE COPY ORTHOLOGS IN ARABIDOPSIS, LETTUCE, SUNFLOWER AND OTHER PLANT SPECIES. Alexander Kozik and Richard W. Michelmore.
1 Genome Evolution Chapter Introduction Genomes contain the raw material for evolution; Comparing whole genomes enhances – Our ability to understand.
Construction of Substitution Matrices
© 2010 by The Samuel Roberts Noble Foundation, Inc. 1 The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK, 73401, USA 2 National Center.
Fea- ture Num- ber Feature NameFeature description 1 Average number of exons Average number of exons in the transcripts of a gene where indel is located.
Experimental Design and Data Structure Supplement to Lecture 8 Fall
MEME homework: probability of finding GAGTCA at a given position in the yeast genome, based on a background model of A = 0.3, T = 0.3, G = 0.2, C = 0.2.
The “ABC’s” of Floral Madness Architecture of a Prototypical Problem Space John Greenler and Doug Green.
Chapter 3 The Interrupted Gene.
Genomes & The Tree of Life
Construction of Substitution matrices
CuffDiff ran successfully. Output files include gene_exp.diff What are the next steps? Use Navigation bar to find files; they may be under DNA Subway if.
How many genes are there?
Supplementary Fig. 1 Supplementary Figure 1. Distributions of (A) exon and (B) intron lengths in O. sativa and A. thaliana genes. Green bars are used for.
Graduate Research with Bioinformatics Research Mentors Nancy Warter-Perez, ECE Robert Vellanoweth Chem and Biochem Fellow Sean Caonguyen 8/20/08.
Protein Modularity and Evolution : An examination of organism complexity via protein domain structure Presented by Jennelle Heyer and Jonathan Ebbers December.
Looking Within Human Genome King abdulaziz university Dr. Nisreen R Tashkandy GENOMICS ; THE PIG PICTURE.
Supplementary Fig. 1 Supplementary Figure 1. Distributions of (A) exon and (B) intron lengths in O. sativa and A. thaliana genes. Green bars are used.
Genetics and Evolutionary Biology
Basics of Comparative Genomics
Genetic Variation Genetic Variation in Populations
Pipelines for Computational Analysis (Bioinformatics)
Prediction of Regulatory Elements for Non-Model Organisms Rachita Sharma, Patricia.
Dynamic epigenetic enhancer signatures reveal key transcription factors associated with monocytic differentiation states by Thu-Hang Pham, Christopher.
There are four levels of structure in proteins
HIS-24 regulates expression of infection-inducible genes.
Evolutionary Rewiring of Human Regulatory Networks by Waves of Genome Expansion  Davide Marnetto, Federica Mantica, Ivan Molineris, Elena Grassi, Igor.
Anastasia Baryshnikova  Cell Systems 
Mapping Global Histone Acetylation Patterns to Gene Expression
History of Biological Diversity Evolution: Darwin’s travel
Cereal Genome Evolution: Grasses, line up and form a circle
Volume 128, Issue 6, Pages (March 2007)
Luis Sanchez-Pulido, John F.X. Diffley, Chris P. Ponting 
Basics of Comparative Genomics
Volume 8, Issue 7, Pages (July 2015)
Qian Cong, Dominika Borek, Zbyszek Otwinowski, Nick V. Grishin 
Single-cell atlas of lincRNA expression during mESC to NPC differentiation. Single-cell atlas of lincRNA expression during mESC to NPC differentiation.
Volume 7, Issue 4, Pages (April 2014)
Volume 21, Issue 23, Pages (December 2011)
Volume 11, Issue 7, Pages (May 2015)
Volume 6, Issue 3, Pages (May 2013)
Presentation transcript:

Christopher Roberts Supervisor: Dr. Luis Mur (former IBS) and Dr. Ian Armstead (formerly IGER) Institute of Biological, Environmental and Rural Sciences, Aberystwyth University Introduction In spite of evolutionary divergence and the pressures of domestication, there’s been a noticeable conservation of genetic synteny between related plant species. From the angles of both evolutionary genetics and plant breeding, there’s been considerable interest in defining these interrelationships. Comparative studies will continue to be of great use in transferring information from the model species to the crop species, as physically ordered complete genome sequence are only available for rice (Oryza sativa) and Arabidopsis thaliana, both of which have relatively small genomes. Using rice as an anchor, DNA sequences from both crop and model species can be aligned with the annotated rice pseudomolecules, creating heat maps (Armstead et al 2007) ‘Hot spots’ within the genomes may be revealed where gene sequences are well or poorly conserved between species, making it possible to establish an overall picture of gene similarities, and observe where genes with differing levels of conservation are clustered. The wild grass species Brachypodium distachyon has recently gained favor as a new model system for grass crop genomics research because it possesses a suite of biological traits desired in a model system. It has a small genome (380 Mbp), low degree of repetitive DNA, availability of diploid ecotypes, self fertility promoting inbreeding, transformability, small stature, and a moderately rapid life cycle. (Garvin et al 2008) The aim of this project was to compare the genomes of Rice and Brachapodium, developing heat maps based on array intensity scores obtained from Rice Affymetrix microarrays probed with genomic DNA from Brachypodium accessions Bd21 and Bd3-1, produced as part of their on going Brachypodium project by the Mur Group collaborated with Nottingham Arabidopsis Stock Centre (NASC). Materials and Method Sequence alignments were performed using standalone MegaBLAST obtained from NCBI, BLAST FTP site (ftp://ftp.ncbi.nih.gov/blast/executables/release/) with default settings except for a window size of 16 (-W 16), maximum expectation value of 1 × (-e 1e-010) and alignment output (-D 3). Colour coded moving windows (MWs) of 100 genes along each pseudomolecule where used (in Excel) to generate the ‘heat maps’ consisting of 10 colours (blue<red) with each colour representing 10% (or as close to as the data allowed) of the MWs ranked according to array intensity scores Results and Discussion Figure 1 shows the two Brachypodium accessions are very similar. The ‘hot’ (red) and ‘cold’ (blue) spots relate to relative numbers of significant alignments. Hot spots may represent regions in the rice genome which are more conserved within Brachypodium and have more generalised functions, while cold spots may represent regions with more rice specific functions and more rapid evolution so they are less conserved between the species. Cold spots suggest a greater degree of DNA coding sequence differences and so possible divergence in structure and function of the protein products from those regions in the genome. The conserved blocks are not huge showing there are not large duplications or translocations, suggesting Brachypodium may not be as good a model/bridge species as originally proposed. Differences between the 2 Brachypodium accessions where then observed and explored in more detail looking at blocks that had particularly high (red) or low (blue) array intensities, to see if analysis of the annotations of rice genes from ‘hot spots’ and ‘cold spots’ within the heat maps indicated possible functional differences. Figure 2 and 3 show one such region This is a hot spot (high array intensity) on pseudomolecule 3. A large proportion was contributed by transposons/retrotransposons, proteins with catalytic activity, hypothetical proteins, and expressed proteins. Figure 3 shows that when this red region of BD1 was compared with the same region in the image where each individual loci is colour coded by array intensity (figure 1) it can be seen that 11 of the loci have scores assigned the red colour, with the other values ranging right through to dark blue accounting for 9 of the loci. There are differences between the images in figure 1 and 2 because with the moving windows individual high scores can have a disproportionate effect on the overall average Clearly these families of proteins are annotated on the basis of the presence of particular structural motifs in the proteins, rather than by well developed knowledge of their precise functions. There is also likely to be a degree of overlap between members of some of these families functionally. Fold difference was used to show the differences between the two Brachypodium accessions with traditional microarray colours, highlighting the differences on an individual loci basis, which can be viewed in Figure 4. Figure 5 shows a moving widows image generated for bd1 only, with the loci coding for expressed proteins, hypothetical proteins and transposons removed next to the original bd1 MW image. This shows that when the loci with these most numerous annotations are removed, regions are still conserved, most markably at the end of pseudomolecule 10, but the pseudomolecules reduce in size by greater than 50% Figure 1 Heat maps for array intensity Colour coded moving windows used to score array intensity for each of the pseudomolecules (1-12) for both accession bd1 and bd3 Figure 2 Investigation of red region on pseudomolecule 3 Table of the frequency of putative functions of a red section of bd1, together with pie chart Figure 3 Further investigation of red region on pseudomolecule 3 Figure 4 Array intensity 2 fold difference between bd1 and bd3 Red indicates loci with a fold difference below 0.5. Black indicates loci with a fold difference between 0.5 and 2.0. Green indicates loci with a fold difference greater than 2.0. Green therefore highlights loci that are found more in BD1, while red allows us to visualise loci present in BD3. Figure 5 Bd1 Heat maps for array intensity Colour coded moving windows used to score array intensity for each of the pseudomolecules (1-12) for accession bd1. For each rice pseudomolecule column 1 contains all the loci. Column 2 has had the loci coding for expressed proteins, hypothetical proteins and transposons removed References DAVID F. GARVIN, YONG-QIANG GU, ROBERT HASTEROK, SAMUEL P. HAZEN, GLYN JENKINS, TODD C. MOCKLER, LUIS A. J. MUR, AND JOHN P. VOGEL (2008) Development of Genetic and Genomic Research Resources for Brachypodium distachyon, a New Model System for Grass Crop Research. The plant Genome (A Supplement to Crop Science) IAN ARMSTEAD, LIN HUANG, JULIE KING, HELEN OUGHAM, HOWARD THOMAS AND IAN KING (2007) Rice pseudomolecule- anchored cross species DNA sequence alignments indicate regional genomic variation in expressed sequence conservation. BMC Genomics