Perl IV Part V: Hashing Out the Genetic Code, Bioperl.

Slides:



Advertisements
Similar presentations
Proteins: Structure reflects function….. Fig. 5-UN1 Amino group Carboxyl group carbon.
Advertisements

Traducción. Molécula de aminoácido Sitio de fijación del aminoácido Adaptador (RNAt) RNAm Triplete nucleotídico que codifica un aminoácido + -O 2 C—C—NH.
©2000 Timothy G. Standish Mutations Timothy G. Standish, Ph. D.
Mutations. DNA mRNA Transcription Introduction of Molecular Biology Cell Polypeptide (protein) Translation Ribosome.
Transcription & Translation Worksheet
Decoding the Flu Norris Armstrong University of Georgia
Transcription and Translation
Richard Deem, Paradoxes Class, March 16, Nucleus A A T T G G C C A A T T A A G G T T G G C C C C A A T T T T G G C C A A T T A A T T G G C C T.
Perl Part I: A Biology Primer. Conceptual Biology H. sapiens did not create the genetic code – but they did invent the transistor Biological life is not.
FEATURES OF GENETIC CODE AND NON SENSE CODONS
Computer Programming for Biologists Class 7 Nov 27 th, 2014 Karsten Hokamp
How Proteins are Produced
Overview: The Flow of Genetic Information
Sec 5.1 / 5.2. One Gene – One Polypeptide Hypothesis early 20 th century – Archibald Garrod physician that noticed that some metabolic errors were found.
More on translation. How DNA codes proteins The primary structure of each protein (the sequence of amino acids in the polypeptide chains that make up.
PowerPoint ® Lecture Slides prepared by Janice Meeking, Mount Royal College C H A P T E R Copyright © 2010 Pearson Education, Inc. 3 Cells: The Living.
TRNA Activation (charging) by aminoacyl tRNA synthetases Aminoacyl tRNA synthetase Two important functions: 1.Implement genetic code 2.Activate amino acids.
Gene Expression: From Gene to Protein
Learning Targets “I Can...” -State how many nucleotides make up a codon. -Use a codon chart to find the corresponding amino acid.
Figure 14.1 Figure 14.1 How does a single faulty gene result in the dramatic appearance of an albino deer? 1.
7. Protein Synthesis and the Genetic Code a). Overview of translation i). Requirements for protein synthesis ii). messenger RNA iii). Ribosomes and polysomes.
A program of ITEST (Information Technology Experiences for Students and Teachers) funded by the National Science Foundation Background Session #3 DNA &
Introduction to Human Genetics
Cell Division and Gene Expression
Chapter 14 Genetic Code and Transcription. You Must Know The differences between replication (from chapter 13), transcription and translation and the.
Chapter 17 From Gene to Protein. Protein Synthesis  The information content of DNA  Is in the form of specific sequences of nucleotides along the DNA.
1 Human chromosomes: 50->250 million base pairs. Average gene: 3000 base pairs.
DNA sequence analysis IT Carlow Bioinformatics October 2006.
1 Mona Singh What is computational biology?. 2 Mona Singh Genome The entire hereditary information content of an organism.
Swetlana Nikolajewa and Thomas Wilhelm Institute of Molecular Biotechnology Jena, Germany, Jena Institute of Molecular Biotechnology.
©1998 Timothy G. Standish From DNA To RNA To Protein Timothy G. Standish, Ph. D.
Stephen Taylor i-Biology.net Photo credit: Firefly with glow, by Terry Priest on Flickr (Creative Commons)
Parts is parts…. AMINO ACID building block of proteins contain an amino or NH 2 group and a carboxyl (acid) or COOH group PEPTIDE BOND covalent bond link.
Jena Institute of Molecular Biotechnology Swetlana Nikolajewa, Andreas Beyer, Maik Friedel, Jens Hollunder, Thomas Wilhelm Institute of Molecular Biotechnology,
Dept. of Animal Breeding and Genetics Programming basics & introduction to PERL Mats Pettersson.
Today 14.2 & 14.4 Transcription and Translation /student_view0/chapter3/animation__p rotein_synthesis__quiz_3_.html.
Genetic code From Wikipedia, the free encyclopedia Edited by Jungho Kim.
Figure 17.4 DNA molecule Gene 1 Gene 2 Gene 3 DNA strand (template) TRANSCRIPTION mRNA Protein TRANSLATION Amino acid ACC AAACCGAG T UGG U UU G GC UC.
Decoding the Flu Adapted from: Norris Armstrong University of Georgia Adapted by Carole Twichell 1 Image Source:
Arginine, who are you? Why so important?. Release 2015_01 of 07-Jan-15 of UniProtKB/Swiss-Prot contains sequence entries, comprising
How Genes Work: From DNA to RNA to Protein Chapter 17.
Gene Translation:RNA -> Protein How does a particular sequence of nucleotides specify a particular sequence of amino acids?nucleotidesamino acids The answer:
Ms. Hatch, What are we doing today?
Ms. Hatch, What are we doing today?
F. PROTEIN SYNTHESIS [or translating the message]
DNA.
Translation PROTEIN SYNTHESIS.
Whole process Step by step- from chromosomes to proteins.
Please turn in your homework
The blueprint of life; from DNA to Protein
Where is Cytochrome C? What is the role? Where does it come from?
Mutations.
Warm-Up 3/12/13 After transcription, an mRNA molecule with the sequence A U A C G C A G U was created. What was the sequence of the original DNA strand?
What is Transcription and who is involved?
From Gene to Phenotype- part 2
Ch. 17 From Gene to Protein Thought Questions
Gene Expression: From Gene to Protein
Overview: The Flow of Genetic Information
Overview: The Flow of Genetic Information
Gene Expression: From Gene to Protein
NOTE SHEET 13 – Protein Synthesis
Warm Up 3 2/5 Can DNA leave the nucleus?
Protein Structure Timothy G. Standish, Ph. D..
Today’s notes from the student table Something to write with
Central Dogma and the Genetic Code
Normal DNA Strand DNA : TAC AAA GGA CGA GTA GTT TAA GCA AGA ATT
DNA, RNA, Amino Acids, Proteins, and Genes!.
DNA to proteins.
Mutations Timothy G. Standish, Ph. D..
Presentation transcript:

Perl IV Part V: Hashing Out the Genetic Code, Bioperl

Hashes There are three main data types in Perl – scalar variables, arrays, and hashes. Hashes provide VERY fast nested-array look-up Format is similar to that of array: %hash = (‘key’ => ‘value’); $value = $hash{‘key’};

Hashes %array = ( ‘key1’,‘value1’, ‘key2’,‘value2’, ‘key3’,‘value3’, ); %array = ( ‘key1’=>‘value1’, ‘key2’=>‘value2’, ‘key3’=>‘value3’, );

= keys = values %hash

The Binary Search of Arrays The ‘halving’ method is considerably faster then doing a comparison. e.g. finding a match in a set array takes 15 times through a loop max. Good method for one sort and multiple comparisons

Comparing Strings To compare 2 strings alphabetically in Perl, you use the cmp operator, which returns 0 if the two strings are the same, -1 if they are in alphabetical order, and 1 if they are in reverse order. ‘zzz’ cmp ‘zzz’ returns 0 ‘AAA’ cmp ‘ZZZ’ returns -1 ‘ZZZ’ cmp ‘AAA’ returns 1

Sorting Arrays Sorting an array of strings = if given numbers this will sort them lexically Sorting an array of numbers in ascending = sort { $a $b the values $a and $b must be used

Sorting Hashes Sorting keys and values foreach ( sort keys (%hash)) { print “$_\t”, “*” x $hash{$_},”\n”; } Sorting keys in ascending order foreach (sort {$hash{$b} $hash{$_}} keys (%hash)) { …… }

Nested Arrays $array[$i] -> [$j]; produces $array[$i][$j] Or use hashes: %hash = (duck => [‘Huey’,’Louie’,’Dewey’], horse => [‘Mr. Ed’], dog => [‘Benji’, ‘Lassie’] ); print $array will give ARRAY(0x85d3ad0) but print $array[$i] gives array of $j $value = $hash{$key}[$i]

The Genetic Code is Redundant Second Position UCAG First Posit ion U UUU Phe UCU Ser UAU Tyr UGU Cys U Thi rd Pos itio n UUCUCCUACUGCC UUA Leu UCAUAAStopUGAStopA UUGUCGUAGStopUGGTrpG C CUU Leu CCU Pro CAU His CGU Arg U CUCCCCCACCGCC CUACCACAA Gln CGAA CUGCCGCAGCGGG A AUU Ile ACU Thr AAU Asn AGU Ser U AUCACCAACAGCC AUAACAAAA Lys AGA Arg A AUGMet sACGAAGAGGG G GUU Val GCU Ala GAU Asp GGU Gly U GUCGCCGACGGCC GUAGCAGAA Glu GGAA GUGGCGGAGGGGG

Searching for codons DIFFICULT: my($codon) return s if ($codon =~ /TCA/i ); return s elseif ($codon =~ /TCC/i); return s elseif ($codon =~ /TCG/i); blah blah

Searching for codons BETTER: my($codon) return A if ($codon =~ /GC./i ); return C elseif ($codon =~ /TG[TC]/i); return D elseif ($codon =~ /GA[TC]/i); blah blah

Searching for codons BEST: my($codon) $codon uc $codon; my(%genetic_code) = ( ‘TCA’ => ‘S’, ‘TCC’ => ‘S’, ‘TCG’ => ‘S’ …. yadda yadda yadda ); return $genetic_code{$codon} if (exists $genetic_code{$codon})

Modules Perl contains the ability to deal with methods in an object-orientated manner classes are contained in packages These are often referred to as modules OO structure is: objectName ->method(arguments) Note to Self --- how many objects?

BioPerl ( The main focus of Bioperl modules is to perform sequence manipulation, provide access to various biology databases (both local and web-based), and parse the output of various programs. Its modules rely heavily on additional Perl modules available from CPAN (

How to go about comparing an unknown sequence... $in = Bio::SeqIO->new(‘file’=>$infile, ‘-format’=>’genbank’); $seqobj = = $seqobj->all_SeqFeatures(); $feat = $allfeatures[0]; $feature_start = $feat->start; $feature_strand = $feat->strand; If ($seqobj->species->{common_name} =~ {elegans}) { $seq = $seqobj->primary_seq->{seq} $id = $seqobj->id; }