You have worked for 2 years to isolate a gene involved in axon guidance. You sequence the cDNA clone that contains axon guidance activity. The sequence.

Slides:



Advertisements
Similar presentations
Tutorial 1 Biology background for the course. Genome sizes and number of genes OrganismGenome SizeNo. of genes E. coli4.6 Mb~4,300 genes Baker’s Yeast12.
Advertisements

Regulatory Motifs. Contents Biology of regulatory motifs Experimental discovery Computational discovery PSSM MEME.
Basics of Comparative Genomics Dr G. P. S. Raghava.
Xenolog: Homologs resulting from horizontal gene transfer.
Sequence Similarity Searching Class 4 March 2010.
Protein RNA DNA Predicting Protein Function. Biochemical function (molecular function) What does it do? Kinase??? Ligase??? Page 245.
Expect value Expect value (E-value) Expected number of hits, of equivalent or better score, found by random chance in a database of the size.
"Nothing in biology makes sense except in the light of evolution" Theodosius Dobzhansky.
Sequence Comparison Intragenic - self to self. -find internal repeating units. Intergenic -compare two different sequences. Dotplot - visual alignment.
Protein Modules An Introduction to Bioinformatics.
Sequence Analysis. Today How to retrieve a DNA sequence? How to search for other related DNA sequences? How to search for its protein sequence? How to.
MCB 317 Genetics and Genomics MCB 317 Topic 10, part 3 A Story of Transcription.
Arabidopsis Gene Project GK-12 April Workshop Karolyn Giang and Dr. Mulligan.
Bioinformatics for biomedicine Protein domains and 3D structure Lecture 4, Per Kraulis
Wellcome Trust Workshop Working with Pathogen Genomes Module 3 Sequence and Protein Analysis (Using web-based tools)
From Haystacks to Needles AP Biology Fall Isolating Genes  Gene library: a collection of bacteria that house different cloned DNA fragments, one.
Protein Bioinformatics Course
Basic Introduction of BLAST Jundi Wang School of Computing CSC691 09/08/2013.
NCBI Review Concepts Chuong Huynh. NCBI Pairwise Sequence Alignments Purpose: identification of sequences with significant similarity to (a)
Identification of Protein Domains. Orthologs and Paralogs Describing evolutionary relationships among genes (proteins): Two major ways of creating homologous.
1 Orthology and paralogy A practical approach Searching the primaries Searching the secondaries Significance of database matches DB Web addresses Software.
발표자 석사 2 년 김태형 Vol. 11, Issue 3, , March 2001 Comparative DNA Sequence Analysis of Mouse and Human Protocadherin Gene Clusters 인간과 마우스의 PCDH 유전자.
1. Bacterial genomes - genes tightly packed, no introns... HOW TO FIND GENES WITHIN A DNA SEQUENCE? Scan for ORFs (open reading frames) - check all 6 reading.
Aim: To understand how the olfactory transduction system is organized Are there several receptor protein “species” each of which detect a class of odorant.
Remember the limitations? –You must know the sequence of the primer sites to use PCR –How do you go about sequencing regions of a genome about which you.
ANALYSIS AND VISUALIZATION OF SINGLE COPY ORTHOLOGS IN ARABIDOPSIS, LETTUCE, SUNFLOWER AND OTHER PLANT SPECIES. Alexander Kozik and Richard W. Michelmore.
Functional Annotation of Proteins via the CAFA Challenge Lee Tien Duncan Renfrow-Symon Shilpa Nadimpalli Mengfei Cao COMP150PBT | Fall 2010.
Construction of Substitution Matrices
You have worked for 2 years to isolate a gene involved in axon guidance. You sequence the cDNA clone that contains axon guidance activity. What do you.
Basic terms:  Similarity - measurable quantity. Similarity- applied to proteins using concept of conservative substitutions Similarity- applied to proteins.
Pattern Matching Rhys Price Jones Anne R. Haake. What is pattern matching? Pattern matching is the procedure of scanning a nucleic acid or protein sequence.
Predicting protein degradation rates Karen Page. The central dogma DNA RNA protein Transcription Translation The expression of genetic information stored.
Protein and RNA Families
Introduction to NCBI & Ensembl tools including BLAST and database searching Incorporating Bioinformatics into the High School Biology Curriculum Fran Lewitter,
Genomics.
Basic Local Alignment Search Tool BLAST Why Use BLAST?
Molecular and Genomic Evolution Getting at the Gene Pool.
COT 6930 HPC and Bioinformatics Sequence Alignment Xingquan Zhu Dept. of Computer Science and Engineering.
Cédric Notredame (08/12/2015) Molecular Evolution Cédric Notredame.
Genome annotation and search for homologs. Genome of the week Discuss the diversity and features of selected microbial genomes. Link to the paper describing.
Human Influence on Genes. Why Analyze DNA? Check for diseases Check for diseases Identify parents Identify parents Crime scene investigations Crime scene.
Step 3: Tools Database Searching
What is BLAST? Basic BLAST search What is BLAST?
BIOINFORMATICS Ayesha M. Khan Spring 2013 Lec-8.
Summer Bioinformatics Workshop 2008 BLAST Chi-Cheng Lin, Ph.D., Professor Department of Computer Science Winona State University – Rochester Center
What is a macromolecule? There are four main types of biological molecules called macromolecules. The four types of macromolecules are carbohydrates, lipids,
CAMPBELL BIOLOGY IN FOCUS © 2014 Pearson Education, Inc. Urry Cain Wasserman Minorsky Jackson Reece 18 Genomes and Their Evolution Questions prepared by.
What is phage display? An in vitro selection technique using a peptide or protein genetically fused to the coat protein of a bacteriophage.
Bioinformatics What is a genome? How are databases used? What is a phylogentic tree?
What is BLAST? Basic BLAST search What is BLAST?
Part 3 Gene Technology & Medicine
Figure 20.0 DNA sequencers DNA Technology.
Basics of BLAST Basic BLAST Search - What is BLAST?
Basics of Comparative Genomics
Protein Sequence Alignments
Using BLAST to Identify Species from Proteins
Chapter 14 Bioinformatics—the study of a genome
Genome Projects Maps Human Genome Mapping Human Genome Sequencing
Bioinformatics and BLAST
Protein Bioinformatics Course
Introduction to Bioinformatics II
Ensembl Genome Repository.
Protein Synthesis Step 2: Translation
What do you with a whole genome sequence?
Basic Local Alignment Search Tool
Basic Local Alignment Search Tool (BLAST)
BSC1010: Intro to Biology I K. Maltz Chapter 21.
Basics of Comparative Genomics
Basic Local Alignment Search Tool
Using BLAST to Identify Species from Proteins
Presentation transcript:

You have worked for 2 years to isolate a gene involved in axon guidance. You sequence the cDNA clone that contains axon guidance activity. The sequence gets ed to your account. What do you do next?

BLAST BLAST WHAT?

Amino vs Nucleic?? -Which is more likely to be the same: a match of 10 amino acids or a match of 10 nucleotides? -4 bases vs 20 amino acids. -amino acids more have more degeneracy. If see similar amino acid, we assume that it did not occur through chance.

What steps would you take to Blast the Amino Acid sequence if you start with the nucleic acid sequence? -tblastn vs blastx -Look for ORF …how? When might need to blast nucleic acid?

-paralogs or orthologs exist where amino acid is highly similar. I.e… Mouse smad2 and frog smad2 are 98% identical. Activin Receptor has many isoforms IA, IB IIA IIB, etc. That are very similar at the protein level. -blat searches in genome for cis regulatory elements. - your impatient and the protein blast searches are slow.

You blast the protein sequence..… There is nothing like it in the database. Now what? - motifs/domains

You blast the protein. There is similarity over certain regions to several molecules containing kinase domains. What does this tell you? - it’s a kinase! - location in the cell? Furthermore, it has great similarity to the Erk family of kinases, meaning similarity outside the kinase domains. Does this help?

Why is it useful to ask if your unknown is like something else??? Clues to function!!!!! In a huge and broad context that might mean… -what it interacts with -what its biology is known for.. -pathway - is it like a molecule in a more tractable and studied system (yeast that doesn’t even have neurons!)

SIMILARITY…. What it gets you looking for new proteins with similarities to known proteins with interesting activities. Seratonin receptors, Tyrosine Kinases, Hedgehogs, TGFbs, etc, etc….. Domain similarity tells you tons… RING fingers (E3 ligases), Kinase domains, TM domains, signal sequences, HLH dna binding domains….etc…etc..

Methods to look for similarity -blast/blat -Zoo blots -Function…. Can it complement? (h ras into yeast) -Protein structure -degenerate PCR over similarity regions.

Ortholog - gene copies derived from speciation. Usually functional orthologs. Paralogs - gene copies derived from duplication. Within same species…. Ie. Members of a human gene family share sequence similarity, but may have distinct functions.

Is sequence similarity relevant??

DOMAINS Whats a domain? A contiguous segment of the primary sequence of a molecule that - in isolation- displays a significant property of the intact molecule. The most stringent use of the term requires the domain to be structurally stable and display some function. It is usually associated with a function, including providing a structural element to the protein…, but can also be a folding domain.