The Scarlet Runner Bean Genome: Contig 230613 By Eden Maloney.

Slides:



Advertisements
Similar presentations
44 D (3 Khipu elements) Phaseolus vulgaris B4 locus 410 Kb contig 158 kb Sub- cluster C 400 Kb 300 Kb 250 Kb 200 Kb 150 Kb 100 Kb 50 Kb
Advertisements

Class I-A Class II-A Class II-B Class II- Basal Class I-B Class I Class II 0.1 Arabidopsis thaliana PHO1;H2 Capsella rubella PHO1;H Thellungiella.
The Trihelix Transcription Factor Family Heather Hernandez.
Scarlet Runner Bean Genome Annotation: Contig
Scarlet Runner Bean Genome Sequence: Contig By Elaine Chiu.
Annotating a Scarlet Runner Bean genome fragment put together by shotgun sequencing Scarlet Runner ean Max Bachour.
Characterization of sugar-response Arabidopsis (Arabidopsis thaliana) mutants to engineer plants for higher ethanol, soydiesel and soy protein production.
Max BachourJessica Chen. Shotgun or 454 sequencing High throughput sequencing technique that can collect a large amount of data at a fast rate. Works.
Topic 7 Nucleic Acids and Proteins. DNA Structure.
Scarlet Runner Bean Genome Assembly Nancy Phang June 4, 2004.
Genes. Outline  Genes: definitions  Molecular genetics - methodology  Genome Content  Molecular structure of mRNA-coding genes  Genetics  Gene regulation.
An Analysis of “Gene Finding in Novel Genomes” Michael Sneddon.
Many genes have unknown function 30% have unknown function only 9% are experimentally verified The Arabidopsis Genome Initiative, Nature 2000 of the 25,498.
Goals of the Human Genome Project determine the entire sequence of human DNA identify all the genes in human DNA store this information in databases improve.
Human Genome Project Seminal achievement. Scientific milestone. Scientific implications. Social implications.
Lim et al, Supplemental Figure S Arsenic Plant height (Cm) As[μM] b/c g f e d c/d a/b a c/d a a/b Cadmium
Arabidopsis Gene Project GK-12 April Workshop Karolyn Giang and Dr. Mulligan.
Figure S1: Sequence of the mtr-miR159b backbone for amiRNA expression in pBluescriptII SK+ vector. AmiRNA constructs are generated from this template using.
Genome projects and model organisms Level 3 Molecular Evolution and Bioinformatics Jim Provan.
Bikash Shakya Emma Lang Jorge Diaz.  BLASTx entire sequence against 9 plant genomes. RepeatMasker  55.47% repetitive sequences  82.5% retroelements.
Arabidopsis Genome Annotation TAIR7 Release. Arabidopsis Genome Annotation  Overview of releases  Current release (TAIR7)  Where to find TAIR7 release.
Transcriptome sequencing - a case study in Piper
HC70AL Final Presentation Chris McQuilkin June 4 th, 2009.
MAIZE GENOME ANNOTATION PROJECT AGRY GROUP 2 KARTHIK PADMANABHAN SHUAI CHEN SHAYLYN WIARDA 12/06/12.
20.1 Structural Genomics Determines the DNA Sequences of Entire Genomes The ultimate goal of genomic research: determining the ordered nucleotide sequences.
Gene Prediction and Phylogenetic Trees
Glycopeptide MS/MS Spectra Supplemental Data 2. gi| Vacuolar invertase 1 [Gossypium hirsutum] R.LFLFNNASGVNVK.A + Deamidated (NQ)
Figure S1. Effects of AVG, DIECA, DPI, NMMA, STA, and OKA on IbRPK expression in sweet potato (Ipomoea batatas cv. Tainung 57). Leaves with petiole cuts.
Genomics and Arabidopsis. What is ‘genomics’? Study of an organism’s entire genome –All the DNA encoded in the organism –Nucleus, mitochondria, chloroplasts.
The SET-Domain Containing Protein and MYB-related Families: Genes AT2G05900 & AT1G17460 Kristin Gill HC70AL Spring 2008.
Cluster I. Cluster II Cluster III (contiued) Cluster IV.
: from the White House to another celebrated breakthrough 26 Jun 2000: Craig Venter, Bill Clinton, Francis Collins.
Plant Biology Division Post-process of IMGAG M.t. 2.0 Release Affymetrix Medicago Probe set – IMGAG 2.0 / MTGI 8.0 Mapping Zhao Bioinformatics Lab.
Fig. S1. Amino acid sequence alignment of MYBS3 proteins. MYBS3 protein sequences of Arabidopsis thaliana (MYBH; NP_199550); (At3g16350; NP_188256), Glycine.
.1Sources of DNA and Sequencing Methods.1Sources of DNA and Sequencing Methods 2 Genome Assembly Strategy and Characterization 2 Genome Assembly.
Fig. S1. Carotenoid and chlorophyll composition of Arabidopsis Z-ISO mutants compared to wild type. Seedlings were grown for 6d on half strength MS plus.
August 20, 2007 BDGP modENCODE Data Production. BDGP Data Production Project Goals 21,000 RACE experiments 6,000 cDNA’s from directed screening and full.
SRB Genome Assembly and Analysis From 454 Sequences HC70AL S Brandon Le & Min Chen.
Journal: Molecular Breeding Molecular characterization of allelic variation in spontaneous brown midrib mutants of sorghum (Sorghum bicolor L Moench) Sunita.
Supplementary Fig. 1. (A) PCR amplification of wheat TaHSP26 genomic, cDNA and ORF clones. (B) ORF and protein sequence of TaHSP26. An arrowhead indicates.
Genome Analysis Assaad text book slides only Lectures by F. Assaad can be downlaoded from muenchen.de/~farhah/index.htm.
454 Genome Sequence Assembly and Analysis HC70AL S Brandon Le & Min Chen.
WSSP Chapter 10 Literature Search Where do you learn about the function of your gene? atttaccgtg ttggattgaa attatcttgc atgagccagc tgatgagtat gatacagttt.
Myb Transcription Factors Dylan Coughtrey Laboratory Methods in Genomics Spring 2011.
H1 H2 LB LE H3H4H5H6 Bioinformatics study of Aquaporins in Dicot and Monocot plants. Neel Duti Prabh, Ravi Kumar Verma, Ramasubbu Sankararamakrishnan*
Genomic Characterisation of Nitrogen Assimilation Genes in Cassava (Manihot esculenta Crantz) T.G. Chabikwa, M.E Rauwane, and D.A Odeny ARC-Biotechnology.
Fragaria vesca Herbaceous, perennial Genotypic diversity
The Transcriptional Landscape of the Mammalian Genome
Anne Brown Josh Fitzgerald Jieqing Ping
Gapless genome assembly of Colletotrichum higginsianum reveals chromosome structure and association of transposable elements with secondary metabolite.
S1 Table. The protein sequences Glycine max St8 MER3 and 18 homologous proteins used for phylogenetic analysis. S. No. Gene Name/ ID Protein type 1 Glyma.06G
Prediction of Regulatory Elements for Non-Model Organisms Rachita Sharma, Patricia.
GEP Annotation Workflow
Volume 6, Issue 3, Pages (May 2013)
HC70AL Final Presentation
Put Your Dukes Up AT5G03220! Studying Embryo Lethality of
HC70AL Oral Presentation
Strategies for annotation of a genome
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
A Metagenome Letdown By Rebekah McCurdy.
Identify D. melanogaster ortholog
1. Shen et al (MOL Biol Rep (2006) Ginko biloba) 2. Melon
Ion Channels and Synaptic Organization
Amanda Ooi, Fouad Lemtiri-Chlieh, Aloysius Wong, Christoph Gehring 
The Presence and Localization of Thioredoxins in Diatoms, Unicellular Algae of Secondary Endosymbiotic Origin  Weber Till , Gruber Ansgar , Kroth Peter.
Figure 9. Categories of pha-siRNA-yielding genes
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
.1Sources of DNA and Sequencing Methods 2 Genome Assembly Strategy and Characterization 3 Gene Prediction and Annotation 4 Genome Structure 5 Genome.
Human Genome Project Seminal achievement. Scientific milestone.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presentation transcript:

The Scarlet Runner Bean Genome: Contig By Eden Maloney

CONTIG Part of Scarlet Runner Bean (Phaseolus coccineus) Genome 18, 403 base pairs Two Predicted Genes – Conserved Hypothetical Protein – Known from Soybean: Gylcine Max 2 possibly Repeats

FGENESH Length of Genes (bp) BLASTNTBLASTNBLASTPDescription Gene 13968e-736e bp 1.1 XP_ PREDICTED: microtubule associated monoxygenase, calponin and LIM domain containing 2 isoform 1 [Macaca mulatta] Gene e-53 6e bp 5e-15 EEF27784 conserved hypothetical protein [Ricinus communis] Gene e-23 5e bp 2e-09 EEF27117 conserved hypothetical protein [Ricinus communis] Gene e bp 6e-115 CAA33227 emb|CAA |emb|CAA | unnamed protein product [Glycine max] 416 6e Genescan GeneMark

Genescan GENESCAN Length of Genes (bp) BLASTNTBLASTNBLASTPDescription Gene No significant Results 9.3 EDK39880 hypothetical protein PGUG_03978 [Pichia guilliermondii ATCC 6260] Gene 29907e-855e bp 1e-45 XP_ predicted protein [Populus trichocarpa] Gene 34898e-365e bp 1e-09 EEF27117 conserved hypothetical protein [Ricinus communis] Gene 44891e-26 2e bp1e-08 XP_ predicted protein [Populus trichocarpa]

GeneMark GENEMARK Length of Proteins (aa) BLASTNTBLASTNBLASTPDescription Gene 1250NA 5e bp 0.65 AAD28219 cysteine aminopeptidase; PepC [Enterococcus faecium] Gene 294NA 2e bp ACB28472 polyprotein [Ananas comosus] Gene 385NA 2e bp No Significant Results N/A Gene 4305NA 2e bp 4.3 YP_ lysyl-tRNA synthetase [Methanococcus vannielii SB] Gene 5113NA 4e bp 5e-16 EEF27794 conserved hypothetical protein [Ricinus communis] Gene 6197NA 9e bp 2e-06 ABK28031 unknown [Arabidopsis thaliana] Gene 7160NA 7e-ll 555 bp 7e-10 XP_ multidrug/pheromone exporter, MDR family, ABC transporter family [Populus trichocarpa] Gene 8170NA 5e bp 2e-07 EEF27421 conserved hypothetical protein [Ricinus communis] Gene 9231NA 2e bp 2.4 YP_ hypothetical protein Igni_0134 [Ignicoccus hospitalis KIN4/I] Gene 10521NA9e bp 5e-148 CAA33227 unnamed protein product [Glycine max]

Diagram of Predicted Genes Fgenesh Genescan GeneMark Repeat

Overlapping Predicted Genes Fgenesh Genescan GeneMark Repeat

REPEATS? Plant Repeat Database IdDescriptionE value MRSiTETN gi| |emb|AJ |M TR Medicago truncatula CACTA type transposable element, clone 70N13, sequence GRSiTETNOOT00004 gi|170080|nt Soybean transposable element tgm1 165 MRSiTETN gi| |emb|AJ |M TR Medicago truncatula CACTA type transposable element, clone 75I04 163