A Lot More Advanced Biotechnology Tools (Part 2): Sequencing
Human Genome Project
On June 26, 2000, the HGP announced the “working draft” of the DNA sequence of the human genome; the draft was published in February 2001.
Historic Event!
– blueprint of a human
– the potential to change science & medicine
Sequence of 46 Human Chromosomes
3 billion base pairs
about 3 GB of data
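The data-size figure above can be checked with a quick calculation. The sketch below is only illustrative: the function name `genome_storage_bytes` and the two encodings (one text character per base, or the four bases A, C, G, T packed into 2 bits each) are assumptions, not something the slides specify.

```python
def genome_storage_bytes(num_bases: int, bits_per_base: int) -> int:
    """Bytes needed to store num_bases at a fixed number of bits per base, rounded up."""
    total_bits = num_bases * bits_per_base
    return (total_bits + 7) // 8  # round up to whole bytes


BASES = 3_000_000_000  # base-pair count quoted on the slide

as_text = genome_storage_bytes(BASES, 8)  # one ASCII character per base
packed = genome_storage_bytes(BASES, 2)   # A, C, G, T packed into 2 bits each

print(f"1 byte per base: {as_text / 1e9:.2f} GB")
print(f"2 bits per base: {packed / 1e9:.2f} GB")
```

Stored as plain text, one letter per base, the sequence takes roughly 3 GB, which matches the slide. A packed 2-bit encoding would need about a quarter of that.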
TACGCACATTTACGTACGCGGATGCCGCGACT
ATGATCACATAGACATGCTGTCAGCTCTAGTAG
ACTAGCTGACTCGACTAGCATGATCGATCAGC
TACATGCTAGCACACYCGTACATCGATCCTGA
CATCGACCTGCTCGTACATGCTACTAGCTACTG
ACTCATGATCCAGATCACTGAAACCCTAGATC
GGGTACCTATTACAGTACGATCATCCGATCAGA
TCATGCTAGTACATCGATCGATACTGCTACTGA
TCTAGCTCAATCAAACTCTTTTTGCATCATGAT
ACTAGACTAGCTGACTGATCATGACTCTGATCC
CGTAGATCGGGTACCTATTACAGTACGATCATC
CGATCAGATCATGCTAGTACATCGATCGATACT
GCTACTGATCTAGCTCAATCAAACTCTTTTTGC
ATCATGATACTAGACTAGCTGACTGATCATGAC
TCTGATCCCGTAGATCGGGTACCTATTACAGTA
CGATCATCCGATCAGATCATGCTAGTACATCGA
TCGATACT
human genome: 3.2 billion bases
Raw genome data
NCBI GenBank
Database of genetic sequences gathered from research
Publicly available on the Web!
Organizing the data
Defining a gene…
“Defining a gene is problematic because… one gene can code for several protein products, some genes code only for RNA, two genes can overlap, and there are many other complications.”
– Elizabeth Pennisi, Science 2003
[Diagram labels: gene, polypeptide 1, polypeptide 2, polypeptide 3, protein; gene, RNA]
It’s hard to hunt for wabbits, if you don’t know what a wabbit looks like.
And we didn’t stop there…
The Progress
[Chart: # of DNA base pairs (billions) in GenBank, over the official “15 year” Human Genome Project]
– first 2 bacterial genomes complete
– first eukaryote complete (yeast)
– first metazoan complete (roundworm)
– 17 eukaryotic genomes complete or near completion, including Homo sapiens, mouse and fruit fly
– 122+ bacterial genomes
Data from NCBI and TIGR
How does the human genome stack up?
Organism                              Genome Size (bases)   Estimated Genes
Human (Homo sapiens)                  3 billion             30,000
Laboratory mouse (M. musculus)        2.6 billion           30,000
Mustard weed (A. thaliana)            100 million           25,000
Roundworm (C. elegans)                97 million            19,000
Fruit fly (D. melanogaster)           137 million           13,000
Yeast (S. cerevisiae)                 12.1 million          6,000
Bacterium (E. coli)                   4.6 million           3,200
Human Immunodeficiency Virus (HIV)    9,700                 9
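One way to compare these genomes is to divide genome size by the estimated gene count. The sketch below does that with the figures copied from the table. The name `bases_per_gene` and the choice of this ratio as the comparison are illustrative assumptions, not part of the original slides.

```python
# (organism, genome size in bases, estimated genes), copied from the table above
GENOMES = [
    ("Human (Homo sapiens)", 3_000_000_000, 30_000),
    ("Laboratory mouse (M. musculus)", 2_600_000_000, 30_000),
    ("Mustard weed (A. thaliana)", 100_000_000, 25_000),
    ("Roundworm (C. elegans)", 97_000_000, 19_000),
    ("Fruit fly (D. melanogaster)", 137_000_000, 13_000),
    ("Yeast (S. cerevisiae)", 12_100_000, 6_000),
    ("Bacterium (E. coli)", 4_600_000, 3_200),
    ("Human Immunodeficiency Virus (HIV)", 9_700, 9),
]


def bases_per_gene(genome_size: int, genes: int) -> float:
    """Average number of bases of genome per estimated gene (hypothetical helper)."""
    return genome_size / genes


# Sort from the most gene-dense genome to the least gene-dense one.
ranked = sorted(GENOMES, key=lambda g: bases_per_gene(g[1], g[2]))

for name, size, genes in ranked:
    print(f"{name:<36} {bases_per_gene(size, genes):>10,.0f} bases per gene")
```

With these numbers, humans and mice come out near 100,000 bases per estimated gene, while yeast, E. coli and HIV come out in the low thousands. This suggests that the larger genomes in the table carry far more DNA per gene, rather than proportionally more genes.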