Perl Part I: A Biology Primer. Conceptual Biology H. sapiens did not create the genetic code – but they did invent the transistor Biological life is not.

Slides:



Advertisements
Similar presentations
Molecular Genetics PaCES Summer Program in Environmental Science.
Advertisements

• Exam II Tuesday 5/10 – Bring a scantron with you!
5’ C 3’ OH (free) 1’ C 5’ PO4 (free) DNA is a linear polymer of nucleotide subunits joined together by phosphodiester bonds - covalent bonds between.
Mutations. DNA mRNA Transcription Introduction of Molecular Biology Cell Polypeptide (protein) Translation Ribosome.
Amino Acids and Proteins 1.What is an amino acid / protein 2.Where are they found 3.Properties of the amino acids 4.How are proteins synthesized 1.Transcription.
© 2010 Pearson Education, Inc. Lectures by Chris C. Romero, updated by Edward J. Zalisko PowerPoint ® Lectures for Campbell Essential Biology, Fourth Edition.
Transcription & Translation Biology 6(C). Learning Objectives Describe how DNA is used to make protein Explain process of transcription Explain process.
Principles of Biology By Frank H. Osborne, Ph. D. Molecular Genetics.
Transcription and Translation
Nucleic Acids DNA vs. RNA
Unit 7 RNA, Protein Synthesis & Gene Expression Chapter 10-2, 10-3
How does DNA work? What is a gene?
Nature and Action of the Gene
Chapter 12 DNA and RNA. What is DNA again? Deoxyribonucleic acid Long double-stranded molecule of nucleotides Stores genetic code that is transferred.
Protein Synthesis. DNA RNA Proteins (Transcription) (Translation) DNA (genetic information stored in genes) RNA (working copies of genes) Proteins (functional.
FEATURES OF GENETIC CODE AND NON SENSE CODONS
CHAPTER 12 PROTEIN SYNTHESIS AND MUTATIONS -RNA -PROTEIN SYNTHESIS -MUTATIONS.
How Proteins are Produced
How Proteins Are Made Mrs. Wolfe. DNA: instructions for making proteins Proteins are built by the cell according to your DNA What kinds of proteins are.
BIOCHEMISTRY REVIEW Overview of Biomolecules Chapter 4 Protein Sequence.
PowerPoint ® Lecture Slides prepared by Janice Meeking, Mount Royal College C H A P T E R Copyright © 2010 Pearson Education, Inc. 3 Cells: The Living.
Eric C. Rouchka, University of Louisville Sequence Database Searching Eric Rouchka, D.Sc. Bioinformatics Journal Club October.
TRNA Activation (charging) by aminoacyl tRNA synthetases Aminoacyl tRNA synthetase Two important functions: 1.Implement genetic code 2.Activate amino acids.
© 2010 Pearson Education, Inc. Lectures by Chris C. Romero, updated by Edward J. Zalisko PowerPoint ® Lectures for Campbell Essential Biology, Fourth Edition.
Objective: Understand the process of Translation
Learning Targets “I Can...” -State how many nucleotides make up a codon. -Use a codon chart to find the corresponding amino acid.
Chapter 8 Microbial Genetics part A. Life in term of Biology –Growth of organisms Metabolism is the sum of all chemical reactions that occur in living.
Outline 1.What is an amino acid / protein naturally occurring amino acids 3.Codon – triplet coding for an amino acid 1.How are proteins synthesized.
Bioinformatics 生物信息学理论和实践 唐继军
Maik Friedel, Swetlana Nikolajewa, Thomas Wilhelm Theoretical Systems Biology, FLI-Jena, Germany Codons and the reverse codons.
Outline What is an amino acid / protein
CELL REPRODUCTION: MITOSIS INTERPHASE: DNA replicates PROPHASE: Chromatin condenses into chromosomes, centrioles start migrating METAPHASE: chromosomes.
Chap. 1 basic concepts of Molecular Biology Introduction to Computational Molecular Biology Chapter 1.
DNA, RNA, & Protein Synthesis
End Show Slide 1 of 39 Copyright Pearson Prentice Hall 12-3 RNA and Protein Synthesis 12–3 RNA and Protein Synthesis.
RNA 2 Translation.
LO: SWBAT explain how protein shape is determined and differentiate between the different types of mutations. DN: h/0 protein synthesis HW: Read pp #
Cell Division and Gene Expression
DNA. Week 2 Review 1.Draw and label a diagram showing the cell membrane. 2.Define Osmosis 3.Define Active and Passive Transport 4.Describe the difference.
Chapter 14 Genetic Code and Transcription. You Must Know The differences between replication (from chapter 13), transcription and translation and the.
DNA mRNA Transcription Chapter 8 The Central Dogma of Molecular Biology Cell Polypeptide (protein) Translation Ribosome.
1 Human chromosomes: 50->250 million base pairs. Average gene: 3000 base pairs.
Swetlana Nikolajewa and Thomas Wilhelm Institute of Molecular Biotechnology Jena, Germany, Jena Institute of Molecular Biotechnology.
Chapter 17 How to read a table of codons. These are two forms in which you might see a table of codons.
©1998 Timothy G. Standish From DNA To RNA To Protein Timothy G. Standish, Ph. D.
Stephen Taylor i-Biology.net Photo credit: Firefly with glow, by Terry Priest on Flickr (Creative Commons)
Parts is parts…. AMINO ACID building block of proteins contain an amino or NH 2 group and a carboxyl (acid) or COOH group PEPTIDE BOND covalent bond link.
CHAPTER 10 DNA REPLICATION & PROTEIN SYNTHESIS. DNA and RNA are polymers of nucleotides – The monomer unit of DNA and RNA is the nucleotide, containing.
Date: March 18, 2016 Aim #64: How is protein shape determined? HW: 1)Protein Synthesis Review Worksheets (2) 2)Genetics Test next Tuesday (p.5) & Wednesday.
Prepared By: Syed Khaleelulla Hussaini. Outline Proteins DNA RNA Genetics and evolution The Sequence Matching Problem RNA Sequence Matching Complexity.
Genomics Lecture 3 By Ms. Shumaila Azam. Proteins Proteins: large molecules composed of one or more chains of amino acids, polypeptides. Proteins are.
NUCLEIC ACIDS AND PROTEIN SYNTHESIS. DNA complex molecule contains the complete blueprint for every cell in every living thing Amount of DNA that would.
Morse Code.. / / _..... __ _. dot short sound - dash long sound 1 st Copy code 2 nd Translate code AIM:
DNA, RNA and Protein Synthesis
Please turn in your homework
Transcription, Translation & Protein Synthesis
The blueprint of life; from DNA to Protein
Where is Cytochrome C? What is the role? Where does it come from?
Warm-Up 3/12/13 After transcription, an mRNA molecule with the sequence A U A C G C A G U was created. What was the sequence of the original DNA strand?
The DNA Connection.
RNA 3 Differences Between DNA and RNA RNA - ribonucleic acid
Outline What is an amino acid / protein
PROTEIN SYNTHESIS.
NOTE SHEET 13 – Protein Synthesis
Today’s notes from the student table Something to write with
Central Dogma and the Genetic Code
DNA, RNA, Amino Acids, Proteins, and Genes!.
RIBONUCLEIC ACID (RNA)
RNA Structure and Protein Synthesis
Unit 3: Heredity: Inheritance and Variation of Traits
Presentation transcript:

Perl Part I: A Biology Primer

Conceptual Biology H. sapiens did not create the genetic code – but they did invent the transistor Biological life is not optimized – the modern synthesis Nature vs. Nurture What are the best ways to understand the important differences the make the difference?

A Molecular Primer Hierarchy of the eukaryote Organism > System > Organ > Tissue > Cell > Organelle > Protein > RNA > DNA Put Simply: DNA → RNA → Protein

The Building Blocks DNA is composed of four building blocks Nucleic acids, nucleotides, bases Adenine, Cytosine, Guanine, Thymine RNA also has four building blocks Adenine, Cytosine, Guanine, Uracil Proteins are composed of 20 building blocks Amino acids, residues Fragments of proteins are called peptides DNA, RNA and Proteins are polymers

CodeNucleic Acid(s) w/ Sugarw/P AAdenineAdenosineAdenylic Acid CCytosineCytodineCytidylic Acid GGuanineGuanosineGuanylic Acid TThymineTymidineThymidylic Acid UUracilUridineUridylic Acid MA or C (amino)CodeNucleic Acid RA or G (purine)VA or C or G WA or T (weak)HA or C or T SC or G (strong)DA or G or T YC or T (pyrimidine) BC or G or T KG or T (keto)NA, G, C, T (any)

CodeNucleic Acid(s) w/ Sugar w/P AAdenineAdenosineAdenylic Acid CCytosineCytodineCytidylic Acid GGuanineGuanosin e Guanylic Acid TThymineTymidineThymidyli c Acid UUracilUridineUridylic Acid MA or C (amino) CodeNucleic Acid RA or G (purine) VA or C or G WA or T (weak) HA or C or T SC or G (strong) DA or G or T YC or T (pyrimidin e) BC or G or T KG or T (keto) NA, G, C, T (any) DNARNAA=T→A C=G→C G=C→G C=G→C T=A→U T=A→U M=K→M W=W→? N=N→N C=G→C C=G→C T=A→U Y=R→? B=V→? N=N→N K=M→? S=S→S T=A→U T=A→U

DNARNA A=T→A C=G→C G=C→G C=G→C T=A→U T=A→U M=K→M W=W→? N=N→N C=G→C C=G→C T=A→U Y=R→? B=V→? N=N→N K=M→? S=S→S T=A→U T=A→U One Dimensional Two Dimensional Three Dimensional

DNARNA A=T→A C=G→C G=C→G C=G→C T=A→U T=A→U M=K→M W=W→? N=N→N C=G→C C=G→C T=A→U Y=R→? B=V→? N=N→N K=M→? S=S→S T=A→U T=A→U

DNARNA A=T→A T=A→U G=C→G C=G→C T=A→U T=A→U M=K→M W=W→? N=N→N C=G→C C=G→C T=A→U Y=R→? B=V→? N=N→N K=M→? S=S→S T=A→U T=A→U

DNARNA A=T→A T=A→U G=C→G C=G→C T=A→U T=A→U M=K→M W=W→? N=N→N C=G→C C=G→C T=A→U Y=R→? B=V→? N=N→N K=M→? S=S→S T=A→U T=A→U

One-Letter Code Amino AcidThree- Letter Code One-Letter Code Amino AcidThree- Letter Code CCysteineCysDAspartic acid Asp EGlutamic Acid GluFPhenylalaninPhe GGlycineGlyHHistidineHis IIsoleucineIleKLysineLys LLeucineLeuMMethionineMet NAsparagineAsnPProlinePro QGlutamineGlnRArgineArg SSerineSerTThreonineThr VValineValWTryptophanTrp XUnknownXxxYTyrosineTyr ZGlutamic acid or GlutimineGlx

DNARNA A=T→A T=A→U G=C→G C=G→C T=A→U T=A→U M=K→M W=W→? N=N→N C=G→C C=G→C T=A→U Y=R→? B=V→? N=N→N K=M→? S=S→S T=A→U T=A→U Met (Start) Leu AA?, AU?, CA?, CU? -> Asn, Lys, Ile, Met, His, Gln, Val Pro UU?, UG?, UC?, CU?, CG?, CC? -> Phe, Leu, Cys, Stop, Trp, Ser, Leu, Arg, Pro UCU, UGU, GCU, GGU -> Ser, Cys, Ala, Gly

DNARNA A=T→A T=A→U G=C→G C=G→C T=A→U T=A→U M=K→M W=W→? N=N→N C=G→C C=G→C T=A→U Y=R→? B=V→? N=N→N K=M→? S=S→S T=A→U T=A→U Cys Phe, Leu A?C, U?C -> Ile, Thr, Asn, Ser, Phe, Ser, Tyr, Cys Leu U?U, U?G, C?U, C?G -> Phe, Ser, Tyr, Cys, Leu, Stop, Trp, Leu, Pro, His, Arg, Gln GUU, CUU -> Val, Leu

DNA RNA Protein

Lecture II Part II: One-Dimensional Strings

Hello World… A few perls of wisdom Concatenating Sequences Making a reverse complement Read sequences from data files

Every journey starts with a first 10bp #!/usr/bin/perl –w #storing DNA in a variable, and printing it out #First, storing DNA in a variable called $DNA $DNA = ‘CGGGCTATTC’; #Next, print the DNA onto the screen print $DNA; #Finally, specifically tell the program to end exit;

Every journey starts with a first 10bp #!/usr/bin/perl –w #storing DNA in a variable, and printing it out #First, storing DNA in a variable called $DNA $DNA = ‘CGGGCTATTC’; #Next, print the DNA onto the screen print $DNA; #Finally, specifically tell the program to end exit;

Every journey starts with a first 10bp #!/usr/bin/perl –w #storing DNA in a variable, and printing it out #First, storing DNA in a variable called $DNA $DNA = ‘CGGGCTATTC’; #Next, print the DNA onto the screen print $DNA; #Finally, specifically tell the program to end exit;

Every journey starts with a first 10bp #!/usr/bin/perl –w #storing DNA in a variable, and printing it out #First, storing DNA in a variable called $DNA $DNA = ‘CGGGCTATTC’; #Next, print the DNA onto the screen print $DNA; #Finally, specifically tell the program to end exit;

Concatenating DNA Fragments #!/usr/bin/perl –w #Store DNA in 2 variables $DNA1 = ‘AGTGCGTCGCTAG’; $DNA2 = ‘ACCGCATGCATTG’; #using string interpolation $DNA3 = “$DNA1$DNA2”; print “$DNA3\n\n”; #dot operator $DNA3 = $DNA1. $DNA2; print “$DNA3\n\n”; Print $DNA1,$DNA2,”\n”; exit;

Transcription: DNA to RNA #!/usr/bin/perl –w $DNA = ‘ACGACTGCACGATCGTACG’; #print the DNA onto the screen print “$DNA\n\n”; #Transcribe the DNA->RNA by substituting all T’s with U’s $RNA = $DNA; $RNA =~ s/T/U/g; #print the result to the screen print “Here is the result of DNA->RNA:\t$RNA\n\n”; exit;

$RNA =~ s/T/U/g; VariableBinding Operator Delimiters to separate the operator Substitute operator Pattern to be replaced Replacement Text of replace pattern Pattern modifier g = globally i = case insensititve m = multiline s = single line x = permit comments o = compile only once for speed e = treat replacement as Perl code

Calculating the Reverse Complement #!usr/bin/perl –w $DNA = ‘ACGTCAGTCGAGCT’; #print the starting DNA onto the screen print “Here is the starting DNA:\t$DNA\n\n”; #Calculate the reverse complement, first copying the DNA onto #a new variable called $revcom $revcom = reverse $DNA; #substitute all bases by their complement $revcom =~ s/A/T/g; $revcom =~ s/T/A/g; $revcom =~ s/C/G/g; $revcom =~ s/G/C/g; print “$revcom\n”;

Calculating the Reverse Complement #!usr/bin/perl –w $DNA = ‘ACGTCAGTCGAGCT’; #print the starting DNA onto the screen print “Here is the starting DNA:\t$DNA\n\n”; #Calculate the reverse complement, first copying the DNA onto #a new variable called $revcom $revcom = reverse $DNA; #substitute all bases by their complement $revcom =~ tr/ACGTacgt/TGCAtgca/; print “$revcom\n”;

Reading Data from Files #### Sample Data in FASTA Format #### >NM_ | Sample Data | Muppet Stuffing Protein MNIDDKLEFGDEMGOSSRTMV FGDLVRSMPHOEILAADEVLISHEE GLOYAKLEFGDEMGOGHDDEFGVY

Reading Files #!/usr/bin/perl –w #The filename of the file containing the sequence data $proteinFilename = ‘NM_ pep’; #open the file, and associate a ‘filehandle’ with it open (PROTEINFILE {IN}, $proteinFilename); #assign file with an input operator $muppetProtein = ; #print the protein file print “Here is the protein:\t$muppetProtein\n\n”; exit;

Reading Data from Files #### Sample Data in FASTA Format #### >NM_ | Sample Data | Muppet Stuffing Protein MNIDDKLEFGDEMGOSSRTMV FGDLVRSMPHOEILAADEVLISHEE GLOYAKLEFGDEMGOGHDDEFGVY

Lets try this again … #!usr/bin/perl –w $proteinFilename = ‘NM_ pep’; open(PROTEINFILE, $proteinFilename); $muppetProtein = ; print “Here is the first line:\t$muppetProtein\n\n”; $muppetProtein = ; print “Here is the second line:\t$muppetProtein\n\n”; $muppetProtein = ; print “Here is the third line:\t$muppetProtein\n\n”; close PROTEINFILE; exit;

Using Arrays to Read Files #!usr/bin/perl –w $proteinFilename = ‘NM_012345’; #open the file open(PROTEINFILE, $proteinFilename); #Read the sequence data from the file, and store it in the = ; #print the protein onto the screen close PROTEINFILE; exit;

Arrays #Here’s one way to declare an = (‘A’,’C’,’G’,’T’); #Now print each element of the array print “\nFirst element: “, $bases[0]; print “\nSecond Element: “, $bases[1]; print “\nThird Element: “, $bases[2]; print “\nFourth Element: “, $bases[3];

Arrays #Here’s one way to declare an = (‘A’,’C’,’G’,’T’); #Now print each element of the array in a row print “\nHere are all of the bases: #This prints out: ‘Here are all of the bases: ACGT’ #But, you can print them out with spaces in between print “\nHere they are with spaces”,

Arrays #Here’s one way to declare an = (‘A’,’C’,’G’,’T’); #Here’s how to take an element off of the end $base1 = print “Here’s the last element: “, $base1, “\n\n”; #The other elements still remain print “\nHere are the remaining elements: ”,

Arrays #Here’s one way to declare an = (‘A’,’C’,’G’,’T’); #Here’s how to take an element off of the front $base2 = print “Here’s the first element: “, $base2, “\n\n”; #The other elements still remain print “\nHere are the remaining elements: ”,

Arrays #Here’s one way to declare an = (‘A’,’C’,’G’,’T’); #Here’s how you put an element at the beginning of an array #Our example will put the last element at the beginning $base1 = unshift $base1); print “Here’s the last element put first: “,

Arrays #Here’s one way to declare an = (‘A’,’C’,’G’,’T’); #Here’s how you put an element at the end of an array #Our example will put the first element at the end $base1 = push $base1); print “Here’s the first element put last: “,

Arrays #Here’s one way to declare an = (‘A’,’C’,’G’,’T’); #Here’s how to reverse an = #Here’s how to get the length print “\n\n”; #Here’s how to insert an element at an arbitrary place splice 2, 0, ‘X’);

Arrays #Arrays can be evaluated as lists and = (‘A’,’C’,’G’,’T’); #Here’s how to print the array print #Here’s how to assign it to a scaler $a print $a; #Here’s how to assign an array to a list ($a) print $a;