Welcome to DNA Subway: Classroom-friendly Bioinformatics

DNA Subway: commonly used bioinformatics tools in streamlined workflows; teaches important concepts in biology and bioinformatics; supports inquiry-based experiments for novel discovery and publication of data.

DNA Subway Red Line: Genome annotation. Analyze up to 150 KB of DNA sequence; run de novo gene prediction; construct evidence-based gene models; visualize genome sequence in a browser.
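Before uploading your own sequence to the Red Line, it can help to check its length against the 150 KB limit. The following is a minimal sketch, assuming "150 KB" means 150,000 bases and that the input is plain FASTA text; the function names and the sample record are hypothetical and not part of DNA Subway.

```python
# Assumption: "150 KB" is read here as 150,000 bases.
MAX_BASES = 150_000


def sequence_length(fasta_text):
    """Count sequence characters in FASTA text, skipping '>' header lines."""
    return sum(
        len(line.strip())
        for line in fasta_text.splitlines()
        if not line.startswith(">")
    )


def fits_red_line(fasta_text, limit=MAX_BASES):
    """Return True if the total sequence length is within the assumed limit."""
    return sequence_length(fasta_text) <= limit


# Hypothetical two-line FASTA record used only for illustration.
example = ">hypothetical_sample\nATGCATGC\nGGCC\n"
print(sequence_length(example), fits_red_line(example))  # prints: 12 True
```

If a file holds several records, this sketch sums them all, which is the conservative choice when checking against a size limit.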

DNA Subway Yellow Line: Genome prospecting. Analyze DNA or protein sequence; search plant genomes using TARGeT; explore gene duplications, transposons, and non-coding sequences not detectable in conventional BLAST searches.

DNA Subway Blue Line: DNA barcoding and phylogenetics. Analyze DNA or protein sequence; analyze DNA barcoding sequence to identify plant, animal, and fungal species; generate phylogenetic trees and publish sequence to GenBank.

DNA Subway Green Line: Transcriptome analysis. Examine RNA-Seq data for differential expression; use high-performance computing to analyze complete datasets; generate lists of genes and fold changes; add results to Red Line projects.

DNA Subway Red Line: Annotate Genome Sequence. Detect genes and build gene models.

DNA Subway Red Line: Genome annotation. Requires Java 6 or above; check that Java is enabled in your web browser.

Log in to DNA Subway

DNA Subway Red Line: Demo analysis – determine a structure for an Arabidopsis gene
Task: Analyze a ~3 kb sequence from chromosome 1 of A. thaliana.
The analysis has five parts: create a project; detect all the genes present; import data from BLAST results and visualize in the local browser; construct a gene model; verify the gene model at Phytozome.

Create a Red Line project

1. Click the Red Square to begin a project.
2. Choose Plant and select Dicotyledon.
3. Select the sample sequence Arabidopsis thaliana (mouse-ear cress) Chr1, 3.40 kb.
4. Name the project and click Continue.

Detect genes in the project sequence

5. Click Sequence to view the input sequence.

6. Click Repeat Masker.
7. When the View icon appears, click Repeat Masker again to examine the results.

Tip: Before gene prediction, RepeatMasker attempts to identify repetitive sequences such as low-complexity regions, simple repeats, AT/GC-rich regions, and several types of transposons. Results are presented in a table. The Attributes column describes the type of repeat detected in the ‘description=’ field. In this example, an AT-rich sequence is detected at 1667 bp.
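To make the ‘description=’ field concrete, here is a minimal sketch of pulling it out of an Attributes string. It assumes the column follows the common GFF-style `key=value;key=value` layout; the exact text DNA Subway shows may differ, and the sample string and function name are hypothetical.

```python
def parse_attributes(attributes):
    """Split a GFF-style attributes string into a dict of key/value pairs."""
    fields = {}
    for part in attributes.split(";"):
        part = part.strip()
        if not part or "=" not in part:
            continue  # skip empty or malformed entries
        key, value = part.split("=", 1)
        fields[key.strip()] = value.strip()
    return fields


# Hypothetical Attributes value for an AT-rich repeat (not real tool output).
example = "ID=repeat1;description=AT_rich"
print(parse_attributes(example).get("description"))  # prints: AT_rich
```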

8. Click one or more gene predictors (Augustus, FGenesH, SNAP, tRNA Scan).
9. When the View icon appears, click the gene predictor again to examine the results.

Tip: De novo gene predictors predict genes within a given sequence. Each program is optimized differently, so results vary between programs. The Attributes column details the features that make up a single predicted gene (e.g. the whole gene, mRNA, CDS, and exons). Sub-features are listed in the Type column. In this example, Augustus predicts a single gene (designated ‘g1’) with 4 exons.
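The relationship between a predicted gene and its sub-features can be sketched by counting exon rows per gene. This is an illustration only: the rows below are hypothetical `(type, gene)` pairs standing in for the Type column and the gene named in the Attributes column, not actual Augustus output.

```python
from collections import Counter


def count_exons(rows):
    """Count rows of type 'exon' for each gene name in (type, gene) pairs."""
    counts = Counter()
    for feature_type, gene in rows:
        if feature_type == "exon":
            counts[gene] += 1
    return dict(counts)


# Hypothetical rows for one predicted gene, 'g1', with four exons.
rows = [
    ("gene", "g1"),
    ("mRNA", "g1"),
    ("exon", "g1"),
    ("exon", "g1"),
    ("exon", "g1"),
    ("exon", "g1"),
    ("CDS", "g1"),
]
print(count_exons(rows))  # prints: {'g1': 4}
```

Running the same count on the output of each predictor is a quick way to see where the programs disagree about exon structure.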

Import data from BLAST results and visualize in local browser

10. Click BLASTN to search and import similar DNA sequences.
11. Click BLASTX to search and import similar sequences based on protein evidence.
12. When the searches complete, click each again to examine the results.

Tip: BLAST results are derived from the UniGene or UniProt databases and contain experimentally derived evidence (e.g. cDNAs) that can be used to infer a probable gene structure. The Attributes column has details on the sequence matches that were found (e.g. gene name, GenBank IDs).

13. Click Local Browser to visualize results.

Tip: You can use the local browser (GBrowse) at any time to visualize the output of any tool.

Construct a gene model

14. Click Apollo to start the program.

15. Hide the reverse strand: click the View menu and select Hide Reverse Strand.
16. Expand tiers: click the Tiers menu and select Expand all tiers.
17. If too many tiers are displayed, click the Tiers menu, select Show Types Panel, and uncheck Show for any evidence you wish to hide.

18. Double-click the Augustus model and drag it into the workspace.
19. Double-click the new temporary model, then right-click to open the Annotation info editor.
20. Name the model ‘Augustus1’ in both ‘Symbol’ fields.

21. Double-click the BLASTN model and drag it into the workspace.
22. Double-click the new temporary model, then right-click to open the Annotation info editor.
23. Name the model ‘BLASTN1’ in both ‘Symbol’ fields.

24. Zoom in to examine the 5’ and 3’ ends of the gene models.
25. Double-click the Augustus1 model and right-click to open the Exon detail editor.
26. Adjust the 5’ and 3’ ends of the Augustus1 model to match the evidence provided by the BLASTN1 model.

Tip: The BLASTN evidence is most useful for determining the transcript length (e.g. the 5’ and 3’ ends).

27. Use any other available evidence* (e.g. BLASTN, User BLAST(N/X)) to make alternative models if supported.
28. Use the BLASTX evidence to determine start/stop codons. Drag any needed start and stop codons into your model.
*If you have hidden evidence, show it again from the Show Types Panel in the Tiers menu.

29. Delete the BLASTN1 model and any other extraneous models.
30. Save your work back to DNA Subway: click the File menu and select Upload to DNA Subway, then close Apollo.

Verify gene model at Phytozome

31. Click Phytozome Browser and compare the created model(s) to the accepted transcript(s).

Tip: Phytozome accepted transcripts are only available for DNA Subway sample sequences.
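Comparing a created model to an accepted transcript comes down to checking whether the exon boundaries agree. The sketch below shows one way to express that comparison, assuming each model is written down as a list of (start, end) exon coordinates; the coordinates and the function name are hypothetical, and the Phytozome browser itself presents this comparison visually.

```python
def compare_models(created, accepted):
    """Report which exons two models share and which are unique to each."""
    created_set, accepted_set = set(created), set(accepted)
    return {
        "shared": sorted(created_set & accepted_set),
        "only_created": sorted(created_set - accepted_set),
        "only_accepted": sorted(accepted_set - created_set),
    }


# Hypothetical exon coordinates; the models differ only at the 5' end.
created = [(500, 700), (900, 1100), (1300, 1500), (1700, 2150)]
accepted = [(420, 700), (900, 1100), (1300, 1500), (1700, 2150)]

result = compare_models(created, accepted)
print(result["only_created"])   # prints: [(500, 700)]
print(result["only_accepted"])  # prints: [(420, 700)]
```

An exon that appears in only one list points to the boundary worth re-examining in Apollo against the BLASTN and BLASTX evidence.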