1.Overall amino acid structure 2.Amino acid stereochemistry 3.Amino acid sidechain structure & classification 4.‘Non-standard’ amino acids 5.Amino acid ionization 6.Formation of the peptide bond 7.Disulfide bonds 8.Comparing protein sequences to describe evolutionary processes.
Q: How many amino acids are there?
The twenty alpha-amino acids that are encoded by the genetic code share the generic structure…
Atom nomenclature within amino acids (as used within the PDB) CA CB C O N OG1CG2
77
Lys Arg To Do: Learn how to name the atoms of all amino acids. Hint: look at any generic PDB file to get a list of atom types. -The alpha carbon (CA) is immediately adjacent the most oxidized carbon (which is the CO 2 - in amino acids) -All the other heavy nuclei are named according to the Greek alphabet. -Put otherwise, LYS can be described by: CA, CB, CG, CD, CE, and NZ. Atom nomenclature within amino acids (as used within the PDB)
Numbers are used to discriminate between similar positions… CB CG OD1 ND2 CB CG ND1 CE1NE2 CD2 Here are some harder examples… CB CG CD2 CE2 CZ OH CD1 CE2 CB CG CD2 CD1 NE1 CE2 CH2 CE3 CZ2 CZ3 CB CD2 CD1 CG CB OG1CG2
Side-chain torsion angles -With the exception of Ala and Gly, all sidechains also have torsion angles. -To Do on your own: -Count the # of chi’s in each amino acid. -Determine why Ala doesn’t have a chi angle.
1.Overall amino acid structure 2.Amino acid stereochemistry 3.Amino acid sidechain structure & classification 4.‘Non-standard’ amino acids 5.Amino acid ionization 6.Formation of the peptide bond 7.Disulfide bonds 8.Comparing protein sequences to describe evolutionary processes.
Fischer projection
1.Overall amino acid structure 2.Amino acid stereochemistry 3.Amino acid sidechain structure & classification 4.‘Non-standard’ amino acids 5.Amino acid ionization 6.Formation of the peptide bond 7.Disulfide bonds 8.Comparing protein sequences to describe evolutionary processes.
Terminologies Hydrophobic: Amino acids are those with side chains that do not like to reside in an aqueous environment. Hence, these amino acids buried within the hydrophobic core of the protein. –Aliphatic: Hydrophobic group that contains only carbon or hydrogen atoms. –Aromatic: A side chain is considered aromatic when it contains an aromatic ring system. Polar: Polar amino acids are those with side-chains that prefer to reside in an aqueous environment and hence can be generally found exposed on the surface of a protein.
-OH -SH Twenty Amino acids Hydrophobic (non polar) Polar Polar NeutralCharged Aromatic (PHE, TRP) Aliphatic (ALA, VAL, LEU, ILE, MET, PRO) AmideAcidic Basic (ASN, GLN) (THR, SER) (CYS) (ASP, GLU) (HIS, LYS,ARG) TYR: Amphipathic GLY: Unclassifiable
It’s actually a bit more complicated…
1.Overall amino acid structure 2.Amino acid stereochemistry 3.Amino acid sidechain structure & classification 4.‘Non-standard’ amino acids 5.Amino acid ionization 6.Formation of the peptide bond 7.Disulfide bonds 8.Comparing protein sequences to describe evolutionary processes.
Not uncommon amino acids in biochemistry, but they are not encoded within the genetic code (meaning not incorporated into proteins)…
1.Overall amino acid structure 2.Amino acid stereochemistry 3.Amino acid sidechain structure & classification 4.‘Non-standard’ amino acids 5.Amino acid ionization 6.Formation of the peptide bond 7.Disulfide bonds 8.Comparing protein sequences to describe evolutionary processes.
1.Overall amino acid structure 2.Amino acid stereochemistry 3.Amino acid sidechain structure & classification 4.‘Non-standard’ amino acids 5.Amino acid ionization 6.Formation of the peptide bond 7.Disulfide bonds 8.Comparing protein sequences to describe evolutionary processes.
Primary structure = the complete set of covalent bonds within a protein
Polypeptides Linear arrangement of n amino acid residues linked by peptide bonds. Polymers composed of two, three, a few, and many amino acid residues are called as dipeptides, tripeptides, oligopeptides and polypeptides. Proteins are molecules that consist of one or more polypeptide chains.
Q: why is the pentapeptide SGYAL different than LAYGS?
Amino acid to Dipeptide Amino Acid 1 Amino Acid 2 Peptide bond is the amide linkage that is formed between two amino acids, which results in (net) release of a molecule of water (H 2 O). The four atoms in the yellow box form a rigid planar unit and, as we will see next, there is no rotation around the C-N bond. Peptide bond Note: this chemistry will not work as drawn!
The peptide bond has a partial double bond character, estimated at 40% under typical conditions. It is this fact that makes the peptide bond planar and rigid.
A quick aside… A horrible leaving group A viable leaving group + +..
1.Overall amino acid structure 2.Amino acid stereochemistry 3.Amino acid sidechain structure & classification 4.‘Non-standard’ amino acids 5.Amino acid ionization 6.Formation of the peptide bond 7.Disulfide bonds 8.Comparing protein sequences to describe evolutionary processes.
1.Overall amino acid structure 2.Amino acid stereochemistry 3.Amino acid sidechain structure & classification 4.‘Non-standard’ amino acids 5.Amino acid ionization 6.Formation of the peptide bond 7.Disulfide bonds 8.Comparing protein sequences to describe evolutionary processes.
Multiple sequence alignments Given the sequences: INDUSTRY INTERESTING IMPORTANT One example of a MSA is:But is it better than: IN-DUST--RYINDU--ST-RYINTERESTING IMPOR--TANTIMPOR-T-ANT
Multiple sequence alignments I-N-DU-ST-RYI--NDU-ST-RY I-NTERESTINGI--NTERESTING IMPO-R--TANTI-MPO-R--TANT IN-DUTS--RYINDU--ST-RYINTERESTING IMPOR--TANTIMPOR-T-ANT I-NDUS--T-RY-I-N-D-U-S-T-RY INT-ERES-TINGI-NTERE-S-TING IMPOR--TAN--TIMPO-RTA-NT---
Multiple sequence alignments Possible MSAEntire column can NOT have only gaps! I-N-DU-ST-RYI--NDU-ST-RY I-NTERESTINGI--NTERESTING IMPO-R--TANTI-MPO-R--TANT Can NOT move residues aroundPossible IN-DUTS--RYINDU--ST-RYINTERESTING IMPOR--TANTIMPOR-T-ANT Nothing matches!Too many opening gaps! I-NDUS--T-RY-I-N-D-U-S-T-RY INT-ERES-TINGI-NTERE-S-TING IMPOR--TAN--TIMPO-RTA-NT---
Which alignment pairs make the most sense? AVGTLE VLASID AVGTLE EKWVKV VS. A-VT-G-R-L-E AA-TA-Q-V-IE AVTG-RLE AATAQ-IE VS. AVWF----VLIM ALWFAMVFILIM ESQG----KTD DTQADGKCRTD VS. More similar amino acids Fewer gapsGap location makes more sense
A multiple sequence alignment: -CAPSRPLNENDDGR-QAFELIGTAVNM... -CVPGRGEMEHDD-RDQVLELFGTVVNL... -AVPKRAALQNDDGR-QGWELYGTVSAQ... -AVPTKMNCFNDDGR-QSVNLIGTVSGN... -ILPARTSMCNDDGR-QTIEMKGTPAGG... --APGK--NGHKLV--Q-FELKGTYSRT... AFAPRRIKMVNKLGR-QNFTLLGTFERT... AYRPDRCNTCNKLGR-QDVELMGTDART... -YRPEEWFGENKLGR-QSAELIGTDERS... --APL-ETYWPKLGR-QTGALAGTNSAV... --RPY-KAGWNKLGR-QSYELGGTNPYI PARAKNMG---R-QSYHL--TMEWQ...