Analysis of crop plant genomes
Jo Dicks, John Innes Centre


Data
- We want to compare the genomes of crop plants (e.g. wheat, rice, maize, millets, barley, pea).
- At present, we mainly compare:
  - Whole genome sequences
  - Genetic markers (comparative mapping)
  - Transposable elements

What can we learn from the data?
- Understand evolutionary processes in crop plants.
- Use comparative mapping to predict gene/marker location and function across species.
- Use transposable elements to maximise diversity within a subset of a germplasm collection (core collection).

Whole genome sequences
- Linear streams of data, where each element is represented by one of four letters (A, C, G or T).
- Streams can be long: billions of letters.
- Blocks of sequence can be meaningful (e.g. they encode genes or transposable elements) or are deemed ‘junk’.
Species 1: caggaaaacacacactcacatacatgaacaatatctc
           ||||| ||  |||||   |||||||| |||| || ||
Species 2: caggataatgcacac   catacatgcacaaaat tc
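As a rough illustration of comparing two such letter streams, the sketch below counts matches, mismatches and gaps in a pairwise alignment. The gap positions (written here as `-`) are an assumption, reconstructed so that the slide's two sequences line up with its match bars; the function name `alignment_stats` is hypothetical.

```python
def alignment_stats(seq_a: str, seq_b: str, gap: str = "-"):
    """Return (matches, mismatches, gaps) for two equal-length aligned sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have the same length")
    matches = mismatches = gaps = 0
    for x, y in zip(seq_a, seq_b):
        if x == gap or y == gap:
            gaps += 1          # column where one sequence has no letter
        elif x == y:
            matches += 1       # identical letters
        else:
            mismatches += 1    # substitution
    return matches, mismatches, gaps


# Sequences from the slide; gap placement is an assumed reconstruction.
species1 = "caggaaaacacacactcacatacatgaacaatatctc"
species2 = "caggataatgcacac---catacatgcacaaaat-tc"

matches, mismatches, gaps = alignment_stats(species1, species2)
identity = matches / len(species1)
print(matches, mismatches, gaps, round(identity, 2))
```

Counts of this kind are one simple way to quantify how far two sequences have diverged; real genome comparisons would first have to compute the alignment itself.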

Comparative mapping data
[Figure: links between the chromosomes of two species] In most data sets, links (homologies) may be spread across chromosomes.
- Markers have a location and an orientation.
- When markers in two species are related by descent from a common ancestor, they are called homologues.
- Comparative mapping data are combinatorial.
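One possible way to hold such data is sketched below: each marker carries a chromosome, a position and an orientation, and homologies are pairs of marker names. The class `Marker`, the function `chromosome_links`, the marker names and the positions are all made up for illustration; the map unit is also an assumption.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Marker:
    name: str
    chromosome: str
    position: float  # map position; the unit (e.g. cM) is an assumption
    strand: int      # orientation: +1 or -1


def chromosome_links(species_a, species_b, homologies):
    """Count homology links per (chromosome in A, chromosome in B) pair."""
    by_name_a = {m.name: m for m in species_a}
    by_name_b = {m.name: m for m in species_b}
    counts = Counter()
    for name_a, name_b in homologies:
        pair = (by_name_a[name_a].chromosome, by_name_b[name_b].chromosome)
        counts[pair] += 1
    return counts


# Hypothetical toy data: three markers per species, three homologous pairs.
species_a = [Marker("a1", "1", 10.0, +1), Marker("a2", "1", 25.0, +1),
             Marker("a3", "2", 5.0, -1)]
species_b = [Marker("b1", "4", 3.0, +1), Marker("b2", "5", 40.0, -1),
             Marker("b3", "4", 12.0, +1)]
homologies = [("a1", "b1"), ("a2", "b2"), ("a3", "b3")]

links = chromosome_links(species_a, species_b, homologies)
for (chrom_a, chrom_b), n in sorted(links.items()):
    print(f"A chr {chrom_a} <-> B chr {chrom_b}: {n} link(s)")
```

In this toy data set, chromosome 1 of the first species links to two different chromosomes of the second, which is the "spread across chromosomes" situation the slide describes.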

Retrotransposons
[Figure: retrotransposon presence/absence patterns for Accession 1 and Accession 2]
- Retrotransposons are a type of transposable element.
- There are various locations in a genome where they are either present or absent.
- An entry in a germplasm collection (called an accession) is therefore essentially a barcode representing multiple retrotransposon locations.
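The barcode view can be sketched as tuples of 0/1 values, one per retrotransposon location, compared by Hamming distance. The greedy subset selection shown is only one simple heuristic for picking a diverse core collection, not necessarily the method used in the projects described here; the accession names, barcodes and the function `greedy_core` are hypothetical.

```python
from itertools import combinations


def hamming(a, b):
    """Number of locations at which two barcodes differ."""
    return sum(x != y for x, y in zip(a, b))


def greedy_core(barcodes, k):
    """Pick k accessions: start from the most distant pair, then repeatedly
    add the accession whose nearest chosen neighbour is furthest away."""
    if not 2 <= k <= len(barcodes):
        raise ValueError("k must be between 2 and the number of accessions")
    names = sorted(barcodes)
    chosen = list(max(combinations(names, 2),
                      key=lambda p: hamming(barcodes[p[0]], barcodes[p[1]])))
    while len(chosen) < k:
        rest = [n for n in names if n not in chosen]
        best = max(rest, key=lambda n: min(hamming(barcodes[n], barcodes[c])
                                           for c in chosen))
        chosen.append(best)
    return chosen


# Hypothetical barcodes: 1 = retrotransposon present, 0 = absent.
barcodes = {
    "acc1": (1, 0, 1, 1, 0, 0),
    "acc2": (1, 0, 1, 0, 0, 0),
    "acc3": (0, 1, 0, 0, 1, 1),
    "acc4": (0, 1, 1, 0, 1, 0),
}

core = greedy_core(barcodes, 3)
print(core)
```

Here `acc2` is left out of the three-accession core because it differs from `acc1` at only one location.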

Evolution
- Data change in time due to errors known as mutations (there are several distinct types of mutation).
- Differences between species are often quantified in terms of the number and type of such mutations.
- The relationship between species is often represented as a tree of evolution (often called a phylogenetic tree).

An evolutionary tree
[Figure: tree leading from an ancestral species to Species 1, Species 2, Species 3 and Species 4]
Mutations occur through time, along the tree branches.
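To make the idea of mutations along branches concrete, the sketch below uses Fitch's small-parsimony algorithm, a standard textbook method swapped in here for illustration, to count the minimum number of changes a single character needs on a fixed tree. The tree shape, the character states and the function name `fitch` are assumptions and are not taken from the slide.

```python
def fitch(tree, states):
    """Return (candidate ancestral states, minimum number of changes).

    `tree` is a leaf name (str) or a pair (left_subtree, right_subtree);
    `states` maps each leaf name to its observed character state.
    """
    if isinstance(tree, str):
        return {states[tree]}, 0
    left, right = tree
    left_set, left_cost = fitch(left, states)
    right_set, right_cost = fitch(right, states)
    common = left_set & right_set
    if common:
        # Children agree on at least one state: no change needed here.
        return common, left_cost + right_cost
    # Children disagree: one change on a branch below this node.
    return left_set | right_set, left_cost + right_cost + 1


# Assumed topology: ((Species 1, Species 2), (Species 3, Species 4)).
tree = (("Species 1", "Species 2"), ("Species 3", "Species 4"))
# Hypothetical states of one site in each species.
states = {"Species 1": "A", "Species 2": "A", "Species 3": "G", "Species 4": "A"}

root_states, changes = fitch(tree, states)
print(root_states, changes)
```

For this toy character a single mutation, on the branch leading to Species 3, is enough to explain the data.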

Data problems
- In comparative mapping studies, there may be elements between the markers that are important but of which we know nothing (i.e. missing data) and erroneous links between data items (i.e. data errors).
- Missing data will be largely alleviated by whole genome sequences (when will this be though?) but there will still be errors in the data.

Projects
- UK CropNet (data)
- CHROMTREE (analysis)
- GENE-MINE (data)
- Germinate (analysis)
- JIC are also involved in Arabidopsis and Brassica IGF projects

UK CropNet databases
- UK CropNet curates and develops databases and data analysis tools for:
  - Arabidopsis thaliana (AGR)
  - Brassicas (BrassicaDB)
  - Cereals (BarleyDB, CeResDB and MilletGenes)
  - Forage grasses (FoggDB)
  - Potato (SpudBase)
- as well as developing a database for:
  - Comparative mapping data (CropSeqDB and ComapDB)

Problems
- To get hold of comparative mapping data from the crop plant community, we need to access disparate data sources of differing quality (not necessarily electronic).
- We need to link the data sources to form a single, queryable entity.

The UK CropNet single- and related-species databases
[Figure: BarleyDB, BrassicaDB, CerealsDB, FoggDB, MilletGenes, SpudBase and AGR, together with ComapDB and ARCADE]
Will the GRID be a better solution than ARCADE?

Analysing chromosomal evolution

Chromosomes evolve over time
[Figure: examples of an inversion and a translocation]
Mutation events can be mathematically modelled and used to construct a phylogenetic tree.
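One common way to model such events, sketched here under stated assumptions, is to write each chromosome as a list of signed integers (marker identity plus orientation). An inversion reverses a segment and flips its signs, a reciprocal translocation swaps the tails of two chromosomes, and the number of breakpoints gives a crude measure of rearrangement. The function names and toy chromosomes are hypothetical, and this is not presented as the model used by CHROMTREE.

```python
def invert(chromosome, i, j):
    """Reverse the segment chromosome[i:j] and flip each marker's orientation."""
    segment = [-marker for marker in reversed(chromosome[i:j])]
    return chromosome[:i] + segment + chromosome[j:]


def translocate(chrom_a, chrom_b, cut_a, cut_b):
    """Reciprocal translocation: exchange the tails after the two cut points."""
    return (chrom_a[:cut_a] + chrom_b[cut_b:],
            chrom_b[:cut_b] + chrom_a[cut_a:])


def breakpoints(chromosome, n):
    """Count adjacencies that differ from the reference order 1..n.

    The chromosome is framed by 0 and n + 1; an adjacency (x, y) is
    conserved when y - x == 1, which also covers inverted pairs such
    as (-4, -3).
    """
    framed = [0] + chromosome + [n + 1]
    return sum(1 for x, y in zip(framed, framed[1:]) if y - x != 1)


ancestral = [1, 2, 3, 4, 5]
derived = invert(ancestral, 1, 4)            # inverts markers 2..4
print(derived, breakpoints(derived, 5))

new_a, new_b = translocate([1, 2, 3], [4, 5, 6], 1, 2)
print(new_a, new_b)
```

A single inversion creates at most two breakpoints, so half the breakpoint count is a lower bound on the number of inversions separating two marker orders.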

Problems
- Unlike DNA sequences, data are combinatorial, not linear.
- Algorithms are very slow (many require optimisation over a multi-dimensional space) and analysis of large data sets is not currently possible on JIC machines.
- Parallelisation of algorithms may help, as it has done for DNA sequence phylogenetic analysis. However, is it the only answer?
- In some cases (due to mutations such as allo-polyploidy) we may wish to consider phylogenetic networks instead of trees, which is an even harder computational problem.

Analysing germplasm collections: GENE-MINE and GERMINATE

Germplasm projects
- GENE-MINE: An EU-funded project to develop a data-management and analysis computer system for plant germplasm collections.
- GERMINATE: A BBSRC-funded project allied to GENE-MINE and another EU project, TEGERM, to develop specialist tools for analysis of the TEGERM data.
- The problems seen in these projects are essentially the same as those of UK CropNet and CHROMTREE.

Retrotransposon insertion
[Figure: retrotransposon insertion, panels labelled 1, 2, 3]
Like chromosomal mutations, retrotransposon insertion can be mathematically modelled.

Relationship between accessions
[Figure: accessions related by a tree, labelled INS]
Again, sometimes we may need to estimate a phylogenetic network (due to introgression between accessions).