Bioinformatics What is a genome? How are databases used? What is a phylogentic tree?

Slides:



Advertisements
Similar presentations
1 Orthologs: Two genes, each from a different species, that descended from a single common ancestral gene Paralogs: Two or more genes, often thought of.
Advertisements

Organizing Life’s Diversity
Basics of Comparative Genomics Dr G. P. S. Raghava.
Bioinformatics Unit 1: Data Bases and Alignments Lecture 2: “Homology” Searches and Sequence Alignments.
BIOINFORMATICS Ency Lee.
Ion Channels. Cell membrane Voltage-gated Ion Channels voltage-gated because they open and close depending on the electrical potential across the membrane.
Expect value Expect value (E-value) Expected number of hits, of equivalent or better score, found by random chance in a database of the size.
Molecular Evidence Using DNA, RNA or Protein Sequences to Classify Organisms.
Bioinformatics Unit 1: Data Bases and Alignments Lecture 3: “Homology” Searches and Sequence Alignments (cont.) The Mechanics of Alignments.
Mutations Section 12–4 This section describes and compares gene mutations and chromosomal mutations.
Genome Evolution: Duplication (Paralogs) & Degradation (Pseudogenes)
Arabidopsis Gene Project GK-12 April Workshop Karolyn Giang and Dr. Mulligan.
Making Sense of DNA and protein sequence analysis tools (course #2) Dave Baumler Genome Center of Wisconsin,
© Wiley Publishing All Rights Reserved. Searching Sequence Databases.
Introduction to Gene Mining Part B: How similar are plant and human versions of a gene? After completing part B, you will demonstrate How to use NCBI BLASTp.
T-COFFEE Multiple Alignments of Orthologous Sequences Horizontal Gene Transfer (Phylogenetic Trees) WebLogo.
Pathway Assignments. The assignment – Annotating Pathways KEGG Pathway Database.
Introduction to Bioinformatics CPSC 265. Interface of biology and computer science Analysis of proteins, genes and genomes using computer algorithms and.
20.1 Structural Genomics Determines the DNA Sequences of Entire Genomes The ultimate goal of genomic research: determining the ordered nucleotide sequences.
ANALYSIS AND VISUALIZATION OF SINGLE COPY ORTHOLOGS IN ARABIDOPSIS, LETTUCE, SUNFLOWER AND OTHER PLANT SPECIES. Alexander Kozik and Richard W. Michelmore.
Sequence-based Similarity Module (BLAST & CDD only ) & Horizontal Gene Transfer Module (Ortholog Neighborhood & GC content only)
A gene is a particular sequence (a string) of nucleotides on a particular site of a chromosome. It is made up of combinations of A, T, C, and G. These.
Cladograms cont’d Using morphology, DNA or amino acid sequences.
ARE THESE ALL BEARS? WHICH ONES ARE MORE CLOSELY RELATED?
Web Databases for Drosophila Introduction to FlyBase and Ensembl Database Wilson Leung6/06.
A Tutorial of Sequence Matching in Oracle Haifeng Ji* and Gang Qian** * Oklahoma City Community College ** University of Central Oklahoma.
Bioinformatic Tools for Comparative Genomics of Vectors Comparative Genomics.
Condor: BLAST Rob Quick Open Science Grid Indiana University.
Basic Local Alignment Search Tool BLAST Why Use BLAST?
Annotation of Drosophila virilis Chris Shaffer GEP workshop, 2006.
What do we already know ? The rice disease resistance gene Pi-ta Genetically mapped to chromosome 12 Rybka et al. (1997). It has also been sequenced Bryan.
David Wishart February 18th, 2004 Lecture 3 BLAST (c) 2004 CGDN.
SRB Genome Assembly and Analysis From 454 Sequences HC70AL S Brandon Le & Min Chen.
Copyright OpenHelix. No use or reproduction without express written consent1.
Medical Science I.  Community  Group of populations that live together in a defined area (Ex: businesses, people, pets, etc. in Alvin)  Population.
What is BLAST? Basic BLAST search What is BLAST?
From DNA to Proteins Section 2.3 BC Science Probe 9 Pages
Protein Evolution Introducing the use of Biology Workbench as a Bioinformatics Tool.
Biocomputational Languages December 1, 2011 Greg Antell & Khoa Nguyen.
Teacher’s Guide: Computer Lab on Bioinformatics Introduction This lab introduces you to the how and why of bioinformatics. You will learn how to use databases.
Bioinformatics Computing 1 CMP 807 – Day 4 Kevin Galens.
BLAST: Basic Local Alignment Search Tool Robert (R.J.) Sperazza BLAST is a software used to analyze genetic information It can identify existing genes.
What is BLAST? Basic BLAST search What is BLAST?
Phylogeny and the Tree of Life
Using BLAST to Identify Species from Proteins
Sequence similarity, BLAST alignments & multiple sequence alignments
Basics of Comparative Genomics
Comparative Genomics.
Pipelines for Computational Analysis (Bioinformatics)
In-Text Art, Ch. 16, p. 316 (1).
Saccharomyces Genome Database (SGD)
Using BLAST to Identify Species from Proteins
Genome Center of Wisconsin, UW-Madison
Predict Protein Sequence by Fuzzy-Association Rules
Bioinformatics and BLAST
Overview Bioinformatics: Analyzing biological data using statistics, math modeling, and computer science BLAST = Basic Local Alignment Search Tool Input.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Basic Local Alignment Search Tool
Basic Local Alignment Search Tool (BLAST)
BSC1010: Intro to Biology I K. Maltz Chapter 21.
-The relationship between genes and traits. -Fields of Genetics.
Classification of Organisms
Basics of Comparative Genomics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Basic Local Alignment Search Tool
Using BLAST to Identify Species from Proteins
Condor: BLAST Tuesday, Dec 7th, 10:45am
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presentation transcript:

Bioinformatics What is a genome? How are databases used? What is a phylogentic tree?

The Genome Every organism, including man, is specified by a genome. A genome is composed of DNA segments, called genes. Genes code for proteins

Bioinformatics is used for… Determining evolutionary relationships between organisms Looking at locations of particular genes Crop/Pharmaceutical Bioengineering Medical research

Bioinformatics is used for…. Making new drugs Crop bioengineering Gene sequencing Making phylogenetic trees

The Human Genome Project Began in the year 2000, published report in 2001 (Nature) Goal: to sequence all 20,000-25,000 genes on all the chromosomes in the human body New research is focusing on bioenergy— –developing plant feedstocks (fast-growing plants bred to produce electricity or liquid fuels) –using microorganisms (like bacteria) to break down cellulose in plant cell walls –converting sugars into biofuels.

Databases are… a storehouse of organized, indexed computerized data

Databases are used to … locate a gene within a sequence predict protein structure and/or function cluster protein sequences into families of related sequences view and analyze the data on millions of genomes

Step 1: Go to IMG Home Page Go to Find Genes

Step 2: Click on BLAST

Step 3: Copy and paste a nucleotide or protein sequence This is your query sequence

Step 4: BLAST sequence Take the top hit and click on it: this will give you a list of the genomes with the most similarity to your query sequence

Step 5: Click on the top hit

Step 6: Look at Gene Detail OID number Name of protein

Look at “gene neighborhood” Red shows query gene

Step 7:Click on IMG Genome BLAST

Step 8: Choose a Phylum or organism to BLAST against

Step 9: Set Maximum E-value Run BLAST The lower this number is, the more significant the alignment is between the query sequence and the other genomes you are comparing it against A low E Value shows genes with the most homology Run BLAST

Look at Genome Blast Results

Blast hits on a particular sequence

Look at: Alignment and E-values O= orthologs- genes from different species which are similar P= paralogs- genes from the same species which are similar

For a detailed view of alignment,click on Do Alignment: Letters denote amino acids; colors denote type of amino acid Dashed lines indicate deletions Look down this column- what do you notice?

Categories of amino acids

Homologs Gene sequences that are similar and show a close evolutionary relationship Two types of homologs: –Paralogs- similar gene sequences between members of the same species –Orthologs- similar gene sequences between 2 different species

10. Click on the hits with the best alignment

11. Add Selections to Gene Cart

12. Click: Show Gene Neighborhoods

13. Gene of interest is shown in red Look at colored genes next to gene of interest Gene of interest

14. To make a phylogenetic tree, go to Phylogeny.fr Copy and Paste amino acid sequence Click submit

15. Click Blast Explorer Paste sequence Click submit

A phylogenetic tree of cyanobacteria with homology for a particular gene sequence