Download presentation
Presentation is loading. Please wait.
Published byMaliyah Childers Modified over 10 years ago
1
1 L U N D U N I V E R S I T Y Comparative Genomics in Basidiomycetes - Analyzing multigene families Balaji Rajashekar Anders Tunlid Dag Ahrén Jason Stajich
2
2 L U N D U N I V E R S I T Y Basidiomycete genome data Protein coding genes Genome size (Mb) Laccaria bicolor20,61464.9 Coprinopsis cinerea13,54436.25-37.5 Phanerochaete chrysosporium10,04835.1 Cryptococcus neoformans730219.5 Ustilago maydis652219.7 58,030
3
3 L U N D U N I V E R S I T Y Sequence similarity & clustering BLASTP Gene 1 Gene 2 Gene 3 Gene 4 Gene 5 Gene 6 Gene 7 Gene 8 Gene 9 Gene 10
4
4 L U N D U N I V E R S I T Y TribeMCL (Enright et al. NAR 2002) TribeMCL animation BLASTP: All against all for the basidiomycete genomes 58,000 versus 58,000 proteins Split generated network into families Data and settings dependent
5
5 L U N D U N I V E R S I T Y Gene family distribution LaccariaCoprinopsisPhanerochaeteCryptococcusUstilago Families present59475148412630562583 Families not present14052204322642964769 Total7352
6
6 L U N D U N I V E R S I T Y Global view of proteins vs genome size
7
7 L U N D U N I V E R S I T Y Gene family size distribution
8
8 L U N D U N I V E R S I T Y Statistical analyses of gene families CAFE (Bie et al, Bioinformatics 2006) Model the evolution of gene family sizes Takes phylogeny into account Calculates birth and death of genes in all nodes Identifies families with accelerated gene gain/loss including extinction
9
9 L U N D U N I V E R S I T Y Gene family expansions/contractions Branch Divergence time (MYA) ExpansionNo changeContractionsAverage expansion 12461095248260.036 21674264873840.178 35739348551350.130 484106438444750.695 58445941118130.056 614037132911721-0.169 730830722722804-0.519 85549620433244-0.655
10
10 L U N D U N I V E R S I T Y Protein families in Laccaria 5383 Protein families analysed by CAFE 1969 Unique protein families 7352 Protein families in total
11
11 L U N D U N I V E R S I T Y Example of families >25 Laccaria proteins Protein familyLacCoprPhaeCrypUstPfam accessionPfam description Significantly Expanded 1*21697917574PF00400WD domain, G-beta repeat 2*1501131098674PF00069, PF07714Protein kinase domain, Protein tyrosine kinase 2210213210 Unique 5206000PF00931, PF05729NB-ARC domain, NACHT domain 17*128000 6456000
12
12 L U N D U N I V E R S I T Y Identification of significant families
13
13 L U N D U N I V E R S I T Y PCA of expression data Protein family 2 11 experiments Mycelia Mycorrhiza Fruiting bodies Axis 1
14
14 L U N D U N I V E R S I T Y Comparative Genomics in Basidiomycetes - Analyzing multigene families Balaji Rajashekar Anders Tunlid Dag Ahrén Jason Stajich
15
15 L U N D U N I V E R S I T Y Identification of significant families
16
16 L U N D U N I V E R S I T Y
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.