DNA BLAST Lab.



Background Between 1990 and 2003, scientists working on an international research project known as the Human Genome Project identified and mapped the 20,000-25,000 genes that define a human being. The project also mapped the genomes of other species, such as the fruit fly, the mouse, and E. coli. The location and complete sequence of the genes in each of these species are available on the internet for anyone in the world to access.

Background Why is this information important? Being able to identify the precise location and sequence of human genes allows us to better understand genetic diseases. Learning about the genes in other species helps us understand evolutionary relationships among organisms. Many of our genes are identical or similar to those found in other species.

Background Bioinformatics: a field that combines statistics, mathematical modeling, and computer science to analyze biological data. Using bioinformatics methods, entire genomes can be quickly compared in order to detect genetic similarities and differences. BLAST (Basic Local Alignment Search Tool): a bioinformatics tool that allows you to input a gene sequence of interest and search entire genomic libraries for identical or similar sequences in a matter of seconds.

Lab Goals Students will use BLAST to input a gene sequence and then search a large database for related gene sequences. They will use that information to construct a cladogram or phylogenetic tree (a visualization of the evolutionary relatedness of species).

Cladograms Review how to build a cladogram by following this link and watching the short video: http://ccl.northwestern.edu/simevolution/obonu/cladograms/Open-This-File.swf Then practice building cladograms with this interactive activity, which uses derived anatomical characteristics and derived molecular characteristics: http://www.phschool.com/atschool/phbio/active_art/cladograms/cladograms.swf

Lab Procedure Now that you are familiar with how to build a cladogram, use the following data (1 = trait present, 0 = trait absent) to construct a cladogram of some major plant groups:

Organism           Vascular Tissue   Flowers   Seeds
Mosses             0                 0         0
Pine Trees         1                 0         1
Flowering Plants   1                 1         1
Ferns              1                 0         0
Total              3                 1         2
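One way to think about turning a character table into a cladogram is sketched below in Python. This is a minimal illustration, not the lab's required method: the 0/1 values follow standard plant biology (mosses lack all three traits, ferns have only vascular tissue, pine trees add seeds, flowering plants add flowers), and the function names are made up for this example. Traits shared by the most groups arose earliest, and organisms with the fewest derived traits branch off first.

```python
# Character matrix: 1 = trait present, 0 = trait absent.
# Values are an assumption based on standard plant biology.
plants = {
    "Mosses": {"vascular tissue": 0, "seeds": 0, "flowers": 0},
    "Pine Trees": {"vascular tissue": 1, "seeds": 1, "flowers": 0},
    "Flowering Plants": {"vascular tissue": 1, "seeds": 1, "flowers": 1},
    "Ferns": {"vascular tissue": 1, "seeds": 0, "flowers": 0},
}


def order_traits_and_organisms(matrix):
    """Return traits (most widely shared first) and organisms
    (fewest derived traits first)."""
    trait_names = list(next(iter(matrix.values())))
    totals = {t: sum(row[t] for row in matrix.values()) for t in trait_names}
    trait_order = sorted(totals, key=totals.get, reverse=True)
    organism_order = sorted(matrix, key=lambda name: sum(matrix[name].values()))
    return trait_order, organism_order


def ladder_cladogram(organism_order):
    """Nest organisms into a parenthesized (Newick-like) ladder.
    The first organism in the list becomes the outermost branch."""
    tree = organism_order[-1]
    for name in reversed(organism_order[:-1]):
        tree = f"({name},{tree})"
    return tree


trait_order, organism_order = order_traits_and_organisms(plants)
cladogram = ladder_cladogram(organism_order)
print(trait_order)
print(cladogram)
```

Each nested pair of parentheses marks a branch point. This simple ladder only works when every organism has a different number of derived traits, as in this table.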

Lab Procedure The groups you just organized in a cladogram have some differences and similarities. In a similar way, species differ in a cellular respiration (glycolytic) enzyme called GAPDH (glyceraldehyde 3-phosphate dehydrogenase). The following data table shows the percentage similarity of the GAPDH gene and the protein it encodes in humans versus other species.
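To make "percentage similarity" concrete, here is a minimal Python sketch that counts matching positions between two sequences that are already aligned and of equal length. The two short sequences are invented for illustration and are not real GAPDH data; BLAST itself also handles gaps and local alignment, which this sketch does not.

```python
def percent_identity(seq_a, seq_b):
    """Percentage of positions at which two pre-aligned,
    equal-length sequences carry the same letter."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and the same length")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


# Hypothetical 10-base fragments that differ at one position.
species_1 = "ATGGTGAAGG"
species_2 = "ATGGTTAAGG"
print(percent_identity(species_1, species_2))
```

The higher the percentage, the fewer changes have accumulated between the two species in that gene.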

Lab Procedure First, understand your goals for using BLAST

Lab Procedure Now that you’ve made some simple comparisons, you will use BLAST to do the same with more complex gene sequences. Your next step is to find and BLAST some gene sequences of interest to you, such as DNA polymerase or human actin, which is used in muscles. Before jumping into BLAST, first locate the gene of your choice by searching the “Entrez Gene” section of the NIH website.

Lab Procedure Follow this link to start Entrez Gene: http://www.ncbi.nlm.nih.gov/gene and search for your gene of interest. The example that follows uses human actin as the gene to search for.

Lab Procedure In the Search field, type human actin and then click ‘Search.’ Click the top link that appears – GNA12.

Lab Procedure Scroll down to the ‘Reference Sequences’ section. Under the ‘mRNA and Proteins’ section, click the first link: NM_007353.2.

Lab Procedure Just below the gene title, click ‘FASTA.’ This displays the human nucleotide sequence for the actin gene.
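A FASTA record is plain text: a header line that starts with ">" followed by one or more lines of sequence. The Python sketch below shows one simple way to read such text. The record shown is a made-up placeholder, not the sequence NCBI returns, and `parse_fasta` is a name chosen for this example.

```python
def parse_fasta(text):
    """Return {header: sequence} for every record in FASTA-formatted text."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            header = line[1:]  # drop the leading ">"
            records[header] = []
        elif header is not None:
            records[header].append(line)
    # Join the wrapped sequence lines into one string per record.
    return {name: "".join(parts) for name, parts in records.items()}


# Hypothetical record for illustration only.
example = """>example_seq hypothetical record
ATGGATGAT
GATATCGCC
"""
print(parse_fasta(example))
```

When you copy a sequence for BLAST, you can paste the whole record, header line included.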

Lab Procedure Copy the gene sequence. Go to the BLAST homepage (type “ncbi.nlm.nih.gov/blast”). In the left column, find ‘nucleotide blast’ and click it.

Lab Procedure Paste the gene sequence into the ‘Enter…FASTA sequence’ box. Give the search a descriptive title.

Lab Procedure Choose a search set (most likely the human genome). In the ‘Optimize for’ section, choose ‘highly similar’. Click ‘BLAST.’

Lab Procedure Examine the graphic summary. Click on the question mark next to “Distribution of 17 Blast…” and read the explanation.

BLAST/Cladogram procedure Scenario: A team of scientists has uncovered the fossil specimen in Figure 3 near Liaoning Province, China. Make some general observations about the morphology (physical structure) of the fossil, and then record your observations below:

Procedure Figure 3

Procedure Little is known about the fossil. It appears to be a new species. Upon careful examination of the fossil, small amounts of soft tissue have been discovered. Normally, soft tissue does not survive fossilization; however, rare situations of such preservation do occur. Scientists were able to extract DNA nucleotides from the tissue and use the information to sequence several genes. Your task is to use BLAST to analyze these genes and determine the most likely placement of the fossil species on Figure 4.


Procedure Form an initial hypothesis as to where you believe the fossil specimen should be placed on the cladogram based on the morphological observations you made earlier. Draw your hypothesis on Figure 4.

Procedure Locate and download gene files. Download three gene files from http://blogging4biology.edublogs.org/2010/08/28/college-board-lab-files/ Upload the gene sequence into BLAST by doing the following: a. Go to the BLAST homepage: http://blast.ncbi.nlm.nih.gov/Blast.cgi b. Click on “Saved Strategies” from the menu at the top of the page.


Procedure Under “Upload Search Strategy,” click on “Browse” and locate one of the gene files you saved onto your computer. Click “View.”

Procedure A screen will appear with the parameters for your query already configured. NOTE: Do not alter any of the parameters. Scroll down the page and click on the “BLAST” button at the bottom, as shown in Figure 7 below.


Procedure After collecting and analyzing all of the data for that particular gene (see instructions below), repeat this procedure for the other two gene sequences. The results page has two sections. The first section, shown in Figure 8, is a graphical display of the matching sequences.


Procedure Scroll down to the section titled “Sequences producing significant alignments.” The species in the list that appears in Figure 9 are those with sequences identical to or most similar to the gene of interest. The most similar sequences are listed first, and as you move down the list, the sequences become less similar to your gene of interest.


Procedure If you click on a particular species listed, you’ll get a full report that includes the classification scheme of the species, the research journal in which the gene was first reported, and the sequence of bases that appear to align with your gene of interest.

Procedure Click on the link titled “Distance tree of results” to see a cladogram of the species whose sequences are similar to your gene of interest. Each species is placed on the cladogram according to how closely its matched gene aligns with your gene of interest.

Analyzing Results Recall that species with common ancestry will share similar genes. The more similar genes two species have in common, the more recent their common ancestor and the closer the two species will be located on a cladogram.
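The idea that the most similar species sit closest together on a cladogram can be sketched in Python as follows. The species names and similarity percentages are invented for illustration, and picking the single most similar pair is only the first step of a real tree-building method.

```python
# Hypothetical pairwise gene similarity (percent) between species.
similarity = {
    ("Species A", "Species B"): 96.5,
    ("Species A", "Species C"): 88.0,
    ("Species B", "Species C"): 87.2,
}


def closest_pair(pairwise):
    """Return the pair of species with the highest similarity."""
    return max(pairwise, key=pairwise.get)


def rank_relatives(species, pairwise):
    """List the other species from most to least similar to `species`."""
    scores = {}
    for (first, second), value in pairwise.items():
        if species == first:
            scores[second] = value
        elif species == second:
            scores[first] = value
    return sorted(scores, key=scores.get, reverse=True)


print(closest_pair(similarity))
print(rank_relatives("Species C", similarity))
```

In this made-up data, Species A and Species B would share the most recent common ancestor, and Species C would branch off earlier.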

Analyzing Results As you collect information from BLAST for each of the gene files, you should be thinking about your original hypothesis and whether the data support or cause you to reject your original placement of the fossil species on the cladogram. For each BLAST query, consider the following: The higher the score, the closer the alignment. The lower the e value, the closer the alignment. Sequences with e values less than 1e-04 (1 x 10^-4) can be considered related with an error rate of less than 0.01%.
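The two rules of thumb above (higher score is better, lower e value is better) can be applied to a list of hits as in the Python sketch below. The hit list is invented for illustration. The `expected_hits` function uses the standard relation between bit score and e value, E = m x n x 2^(-S), where m is the query length and n is the database length; using raw lengths rather than BLAST's adjusted "effective" lengths is a simplification.

```python
def expected_hits(bit_score, query_len, db_len):
    """Approximate e value from a bit score: E = m * n * 2**(-S)."""
    return query_len * db_len * 2.0 ** (-bit_score)


def significant_hits(hits, threshold=1e-4):
    """Keep hits with an e value below the threshold,
    best (lowest e value, then highest score) first."""
    kept = [h for h in hits if h["e_value"] < threshold]
    return sorted(kept, key=lambda h: (h["e_value"], -h["bit_score"]))


# Hypothetical BLAST hits for illustration only.
hits = [
    {"species": "Species C", "bit_score": 35.0, "e_value": 0.2},
    {"species": "Species B", "bit_score": 420.0, "e_value": 3e-115},
    {"species": "Species A", "bit_score": 850.0, "e_value": 0.0},
]

for hit in significant_hits(hits):
    print(hit["species"], hit["bit_score"], hit["e_value"])
```

Species C is dropped because an e value of 0.2 means a match that good is fairly likely to appear by chance alone.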

Analyzing Results What species in the BLAST result has the most similar gene sequence to the gene of interest? Where is that species located on your cladogram? How similar is that gene sequence? Based on what you have learned from the sequence analysis and what you know from the structure, decide where the new fossil species belongs on the cladogram with the other organisms. If necessary, redraw the cladogram you created before.

Evaluating Results Compare and discuss your cladogram with your classmates. Does everyone agree on the placement of the fossil specimen? If not, what is the basis of the disagreement?

Evaluating Results 2. On the main page of BLAST, click on the link “List All Genomic Databases”. How many genomes are currently available for making comparisons using BLAST? ____ How does this limitation impact the proper analysis of the gene data used in this lab? 3. What other data could be collected from the fossil specimen to help properly identify its evolutionary history?