The Human Genome Project at UC Santa Cruz Phoenix Eagleshadow November 9, 2004.

Slides:



Advertisements
Similar presentations
© Wiley Publishing All Rights Reserved. Using Nucleotide Sequence Databases.
Advertisements

Human Genome Project What did they do? Why did they do it? What will it mean for humankind? Animation OverviewAnimation Overview - Click.
Chapter 15 The Human Genome Project and Genomics
Let’s investigate some of the Hot Areas of Life Sciences in more detail: Genomics –Human Genome Project –Use of Microarrays or DNA chips Bioinformatics.
Bioinformaticion Extraordinaire.  Born 1944  United States Citizen  B.A. in Mathematics, Whitman College, Walla Walla Washington  M.S. in Computer.
By: Bryan Vinzant Kimberly Black Matthew Hehemann Ravon Williams.
9 Genomics and Beyond Brief Chapter Outline
The Human Genome Project (Lecture 7)
Slide 1 of 24 Copyright Pearson Prentice Hall 14–3 Human Molecular Genetics 14-3 Human Molecular Genetics.
Data visualization in the post-genomics era Carol Morita Genentech, Inc.
The Human Genome Race. Collins vs. Venter Collins Venter.
Public-Private Cooperation for the Future of Genomics Cindy Fung & Miranda Ip Stanford-in-Washington "How Health & Science Policy Decisions Are Made”
Genome Analysis Determine locus & sequence of all the organism’s genes More than 100 genomes have been analysed including humans in the Human Genome Project.
Human Genome Project Seminal achievement. Scientific milestone. Scientific implications. Social implications.
The Human Genome Project (H.G.P.) By Ben Fuhr. What is the Human Genome Project? The Human Genome Project was a great scientific endeavor designed to.
Human Molecular Genetics Section 14–3
Bioinformatics Curriculum Guidelines: Toward a Definition of Core Competencies Lonnie Welch School of Electrical Engineering & Computer Science Biomedical.
Michael Cummings David Reisman University of South Carolina Genomes and Genomics Chapter 15.
LESSON 2 FLORAL DEVELOPMENT. Warm Up 1.When do plants normally flower? 2.What are some factors that you think plants use to decide that it is time to.
Careers and Degrees in Computing Stuart Hansen Department of Computer Science UW - Parkside.
Doug Brutlag Professor Emeritus Biochemistry & Medicine (by courtesy) Genome Databases Computational Molecular Biology Biochem 218 – BioMedical Informatics.
Why It Might Change Your Life! By C. Rhein - Hazelwood Central Next Teacher’s Page Next.
Lesson 10 Bioinformatics
Copyright Pearson Prentice Hall 14–3 Human Molecular Genetics 14-3 Human Molecular Genetics.
What is the Human Genome Project? Identify all the approximately 35,000 genes in human DNA Determine the sequences of the 3,000,000,000 bases ( = 200 phone.
Lesson Overview Lesson Overview Studying the Human Genome Lesson Overview 14.3 Studying the Human Genome.
AP Biology A Lot More Advanced Biotechnology Tools Sequencing.
2015 CSU Counselor Conference. College of Science Seven Departments 23 undergrad degrees 120 full-time tenured/tenure track faculty 150 lecturers 2,100.
Lesson Overview Lesson Overview Studying the Human Genome Lesson Overview 14.3 Studying the Human Genome.
CS 790 – Bioinformatics Introduction and overview.
Section 4 Lesson 1– The Human Genome Project. Applications of DNA Technology Advances in gene manipulation have made many things possible. This section.
A Career as a Marine Biologist Nicole J. Ibrahim Mrs. Bernard Period: 6 Copyright 2007 © Nicole J. Ibrahim.
Slide 1 of 24 Copyright Pearson Prentice Hall Biology.
Lesson Overview Lesson Overview Studying the Human Genome Lesson Overview 14.3 Studying the Human Genome.
Human Genome Project Bioinformatics.
Genomics and Arabidopsis. What is ‘genomics’? Study of an organism’s entire genome –All the DNA encoded in the organism –Nucleus, mitochondria, chloroplasts.
+ => Bioinformatics: from Sequence to Knowledge Outline: Introduction to bioinformatics The TAU Bioinformatics unit Useful bioinformatics issues and databases:
How do you handle huge amounts of information? When looking in an encyclopedia you use an index When biologists search the volumes of the human genome.
COMPUTATIONAL BIOLOGIST DR. MARTIN TOMPA Place of Employment: University of Washington Type of Work: Develops computer programs and algorithms to identify.
1 From Mendel to Genomics Historically –Identify or create mutations, follow inheritance –Determine linkage, create maps Now: Genomics –Not just a gene,
© 2010 Pittsburgh Supercomputing Center Pittsburgh Supercomputing Center RP Update July 1, 2010 Bob Stock Associate Director
A guided tour of Ensembl This quick tour will give you an outline view of what Ensembl is all about. You will learn: –Why we need Ensembl –What is in the.
An Introduction to NCBI & BLAST National Center for Biotechnology Information Richard Johnston Pasadena City College.
Wake-up 1.What process is responsible for creating the gel below? 2.Who is guilty? Explain how you know. 3.Which fragments will be at the bottom of a gel:
Chapters 13 & 14 GENETIC ENGINEERING & THE HUMAN GENOME.
STEM CELL RESEARCH. Overview In this activity, you will learn how cell specialization takes place in vertebrate embryos. –Explore a gallery of different.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Human Genome Project By: Scott Kutschke.
Genomes and Their Evolution
Genomics: Sequencing Is the Basis for Identifying and Mapping All Genes in a Genome Genomics, the study of genomes, encompasses structural genomics, functional.
14-3 Human Molecular Genetics
New genes can be added to an organism’s DNA.
Bellwork: What is the human genome project. What was its purpose
Scientists use several techniques to manipulate DNA.
14-3 Human Molecular Genetics
Genomes and Their Evolution
Genome organization and Bioinformatics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
3.1 Genes Genes and hence genetic information is inherited from parents, but the combination of genes inherited from parents by each offspring will be.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
The Future of Genetic Research
Hgp april 2008.
3.1 Genes Genes and hence genetic information is inherited from parents, but the combination of genes inherited from parents by each offspring will be.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Human Genome Project Seminal achievement. Scientific milestone.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
A Lot More Advanced Biotechnology Tools
Presentation transcript:

The Human Genome Project at UC Santa Cruz Phoenix Eagleshadow November 9, 2004

The Human Genome Project Began in 1990 The Mission of the HGP: The quest to understand the human genome and the role it plays in both health and disease. “The true payoff from the HGP will be the ability to better diagnose, treat, and prevent disease.” --- Francis Collins, Director of the HGP and the National Human Genome Research Institute (NHGRI)

The genome is our Genetic Blueprint Nearly every human cell contains 23 pairs of chromosomes – and XY or XX XY = Male XX = Female Length of chr 1-22, X, Y together is ~3.2 billion bases (about 2 meters diploid)

The Genome is Who We Are on the inside! Chromosomes consist of DNA –molecular strings of A, C, G, & T –base pairs, A-T, C-G Genes –DNA sequences that encode proteins –less than 3% of human genome Information coded in DNA

CACACTTGCATGTGAGAGCTTCTAATATCTAAATTAATGTTGAATCATTATTCAGAAACAGAGAGCTAACTGTTATCCCATCCTGACTTTATTCTTTATG AGAAAAATACAGTGATTCC AAGTTACCAAGTTAGTGCTGCTTGCTTTATAAATGAAGTAATATTTTAAAAGTTGTGCATAAGTTAAAATTCAGAAATAAAACTTCATCCTAAAACTCTGTGTGTTGCTTTAAATAATC AGAGCATCTGC TACTTAATTTTTTGTGTGTGGGTGCACAATAGATGTTTAATGAGATCCTGTCATCTGTCTGCTTTTTTATTGTAAAACAGGAGGGGTTTTAATACTGGAGGAACAA CTGATGTACCTCTGAAAAGAGA AGAGATTAGTTATTAATTGAATTGAGGGTTGTCTTGTCTTAGTAGCTTTTATTCTCTAGGTACTATTTGATTATGATTGTGAAAATAGAATTTATCC CTCATTAAATGTAAAATCAACAGGAGAATAGCAAAAACTTATGAGATAGATGAACGTTGTGTGAGTGGCATGGTTTAATTTGTTTGGAAGAAGCACTTGCCCCAGAAGATACACAAT GAAATTCATGTTATTGAGTAGAGTAGTAATACAGTGTGTTCCCTTGTGAAGTTCATAACCAAGAATTTTAGTAGTGGATAGGTAGGCTGAATAACTGACTTCCTATC ATTTTCAGGTT CTGCGTTTGATTTTTTTTACATATTAATTTCTTTGATCCACATTAAGCTCAGTTATGTATTTCCATTTTATAAATGAAAAAAAATAGGCACTTGCAAATGTCAGATCACTTGCCTGTGGT CATTCGGGTAGAGATTTGTGGAGCTAAGTTGGTCTTAATCAAATGTCAAGCTTTTTTTTTTCTTATAAAATATAGGTTTTAATATGAGTTTTAAAATAAAATTAATTAGAAAAAGGCAA ATTACTCAATATATATAAGGTATTGCATTTGTAATAGGTAGGTATTTCATTTTCTAGTTATGGTGGGATATTATTCAGACTATAATTCCCAATGAAAAAACTTTAAAAAATGCTAGTGA TTGCACACTTAAAACACCTTTTAAAAAGCATTGAGAGCTTATAAAATTTTAATGAGTGATAAAACCAAATTTGAAGAGAAAAGAAGAACCCAGAGAGGTAAGGATATAACCTTACC AGTTGCAATTTGCCGATCTCTACAAATATTAATATTTATTTTGACAGTTTCAGGGTGAATGAGAAAGAAACCAAAACCCAAGACTAGCATATGTTGTCTTCTTAAGGAGCCCTCCCCT AAAAGATTGAGATGACCAAATCTTATACTCTCAGCATAAGGTGAACCAGACAGACCTAAAGCAGTGGTAGCTTGGATCCACTACTTGGGTTTGTGTGTGGCGTGACTCAGGTAATCT CAAGAATTGAACATTTTTTTAAGGTGGTCCTACTCATACACTGCCCAGGTATTAGGGAGAAGCAAATCTGAATGCTTTATAAAAATACCCTAAAGCTAAATCTTACAATATTCTCAAG AACACAGTGAA ACAAGGCAAAATAAGTTAAAATCAACAAAAACAACATGAAACATAATTAGACACACAAAGACTTCAAACATTGGAAAATACCAGAGAAAGATAATAAATAT TTTACTCTTTAAAAATTTAGTTAAAAGCTTAAACTAATTGTAGAGAAAA AACTATGTTAGTATTATATTGTAGATGAAATAAGCAAAACATTTAAAATACAAATGTGATTACTTAAAT TAAATATAATAGATAATTTACCACCAGATTAGATACCATTGAAGGAATAATTAATATACTGAAATACAGGTCAGTAGAATTTTTTTCAATTCAGCATGGAGATGTAAAAAATGAAAA TTAATGCAAAAAATAAGGGCACAAAAAGAAATGAGTAATTTTGATCAGAAATGTATTAAAATTAATAAACTGGAAATTTGACATTTAAAAAAAGCATTGTCATCCAAGTAGATGTG TCTATTAAATAGTTGTTCTCATATCCAGTAATGTAATTATTATTCCCTCTCATGCAGTTCAGATTCTGGGGTAATCTTTAGACATCAGTTTTGTCTTTTATATTATTTATTCTGTTTACTAC ATTTTATTTTGCTAATGATATTTTTAATTTCTGACATTCTGGAGTATTGCTTGTAAAAGGTATTTTTAAAAATACTTTATGGTTATTTTTGTGATTCCTATTCCTCTATGGACACCAAGGCT ATTGACATTTTCTTTGGTTTCTTCTGTTACTTCTATTTTCTTAGTGTTTATATCATTTCATAGATAGGATATTCTTTATTTTTTATTTTTATTTAAATATTTGGTGATTCTTGGTTTTCTCAGCC ATCTATTGTCAAGTGTTCTTATTAAGCATTATTATTAAATAAAGATTATTTCCTCTAATCACATGAGAATCTTTATTTCCCCCAAGTAATTGAAAATTGCAATGCCATGCTGCCATGTGG TACAGCATGGGTTTGGGCTTGCTTTCTTCTTTTTTTTTTAACTTTTATTTTAGGTTTGGGAGTACCTGTGAAAGTTTGTTATATAGGTAAACTCGTGTCACCAGGGTTTGTTGTACAGATCA TTTTGTCACCTAGGTACCAAGTACTCAACAATTATTTTTCCTGCTCCTCTGTCTCCTGTCACCCTCCACTCTCAAGTAGACTCCGGTGTCTGCTGTTCCATTCTTTGTGTCCATGTGTTCTC ATAATTTAGTTCCCCACTTGTAAGTGAGAACATGCAGTATTTTCTAGTATTTGGTTTTTTGTTCCTGTGTTAATTTGCCCAGTATAATAGCCTCCAGCTCCATCCATGTTACTGCAAAGAA CATGATCTCATTCTTTTTTATAGCTCCATGGTGTCTATATACCACATTTTCTTTATCTAAACTCTTATTGATGAGCATTGAGGTGGATTCTATGTCTTTGCTATTGTGCATATTGCTGCAAG AACATTTGTGTGCATGTGTCTTTATGGTAGAATGATATATTTTCTTCTGGGTATATATGCAGTAATGCGATTGCTGGTTGGAATGGTAGTTCTGCTTTTATCTCTTTGAGGAATTGCCATG CTGCTTTCCACAATAGTTGAACTAACTTACACTCCCACTAACAGTGTGTAAGTGTTTCCTTTTCTCCACAACCTGCCAGCATCTGTTATTTTTTGACATTTTAATAGTAGCCATTTTAACT GGTATGAAATTATATTTCATTGTGGTTTTAATTTGCATTTCTCTAATGATCAGTGATATTGAGTTTGTTTTTTTTCACATGCTTGTTGGCTGCATGTATGTCTTCTTTTAAAAAGTGTCTGTT CATGTACTTTGCCCACATTTTAATGGGGTTGTTTTTCTCTTGTAAATTTGTTTAAATTCCTTATAGGTGCTGGATTTTAGACATTTGTCAGACGCATAGTTTGCAAATAGTTTCTCCCATTC TGTAGGTTGTCTGTTTATTTTGTTAATAGTTTCTTTTGCTATGCAGAAGCTCTTAATAAGTTTAATGAGATCCTGATATGTTAGGCTTTGTGTCCCCACCCAAATCTCATCTTGAATTATA TCTCCATAATCACCACATGGAGAGACCAGGTGGAGGTAATTGAATCTGGGGGTGGTTTCACCCATGCTGTTCTTGTGATAGTGAATGAGTTCTCACGAGATCTAATGGTTTTATGAGG GGCTCTTCCCAGCTTTGCCTGGTACTTCTCCTTCCTGCCGCTTTGTGAAAAAGGTGCATTGCGTCCCTTTCACCTTCTTCTATAATTGTAAGTTTCCTGAGGCCTTCCCAGCCATGCTGAA CTTCAAGTCAATTAAACCTTTTTCTTTATAAATTACTCAGTCTCTGGTGGTTCTTTATAGCAGTGTGAAAATGGACTAATGAAGTTCCCATTTATGAATTTTTGCTTTTGTTGCAATTGCTT TTGACATCTTAGTCATGAAATCCTTGCCTGTTCTAAGTACAGGACGGTATTGCCTAGGTTGTCTTCCAGGGTTTTTCTAATTTTGTGTTTTGCATTTAAGTGTTTAATCCATCTTGAGTTGA TTTTTGTATATTGTGTAAGGAAGGGGTCCAGTTTCAATCTTTTGCATATGGCTAGTTAGTTATCCCAGTACCATTTATTGAAAAGACAGTCTTTTCCCCATCGCTCGTTTTTGTCAGTTTT ATTGATGATCAGATAATCATAGCTGTGTGGCTTTATTTCTGGGTTCTTTATTCTGTTCTATTGGTTTATGTCCCTGTTTTTGTGCCAGTACCATGCTGTTTTGGTTAACATAGCCCTGTAGT ATAGTTTGAGGTCAGATAGCCTGATGCTTCCAGCTTTGTTCTTTTTCTTAAGATTGCCTTGGCTATTTGGCCTCTTTTTTGGTTCCACATGAATTTTAAAACAGTTGTTTCTAGTTTTTGAA GAATGTCATTGGTAGTTTGATAGAAATAGCATTTAATCTGTAAATTGATTTGTGCAGTATGGCCTTTTAATGATATTGATTCTTCCTATCCATGAGCATGATATGTTTTCCATTTTGTTTG TATCCTCTCTGATTTCTTTGTGCAGTGTTTTGTAATTCTCAT TGTAGAGATTTTTCACCTCCCTGGTTAGTTGTATTTTACCCTAGATATTT TATTCTTTTTGTGAAAATTGTGAATGGGAT TGCCTTCCTGATTTGACTGC CAGCTTGGTTACTGTTGGTTTATAGAAATGCTAGTGATTTTTGTACATTG ATTTTCTTTCTAAAACTTTGCTGAAGTTTTTTTTATTAGCAGAAGGAGCT TTGGGGCTGAGACTATGGGGTTTTCTAGATATAGAATCATGTCAGCTTCAAATAGGGATAATTTTACTTCCTCTCTTCCTATTTGGATGCCCTTTATTTCTTTCTCTTGCCTGATTACTCTG GCTGGGATTTCCTATGTTGAATAGGAGT CATGAGAGAGGGCATCAAATCTACACATATCAAATACTAACCTTGAATGTCTAGATATTT TATTCTTTTTGTGAAAATTGTGAATGGGAT 5000 bases per page

How much data make up the human genome? 3 pallets with 40 boxes per pallet x 5000 pages per box x 5000 bases per page = 3,000,000,000 bases! To get accurate sequence requires 6-fold coverage. Now: Shred 18 pallets and reassemble.

The Beginning of the Project Most the first 10 years of the project were spent improving the technology to sequence and analyze DNA. Scientists all around the world worked to make detailed maps of our chromosomes and sequence model organisms, like worm, fruit fly, and mouse.

UC Santa Cruz gets Involved Computational biology (or Bioinformatics) is a research field that uses computers to help solve biological problems Because of the work Professor David Haussler was doing in the field of computational biology, UC Santa Cruz was invited to participate in the HGP in late of 1999.

The Tech Awards honors the UCSC Genome Bioinformatics Group in 2003!

The Challenges were Overwhelming First there was the Assembly The DNA sequence is so long that no technology can read it all at once, so it was broken into pieces. There were millions of clones (small sequence fragments). The assembly process included finding where the pieces overlapped in order to put the draft together. 3,200,000 piece puzzle anyone?

Assembly generated by UCSC Freeze of sequence data generated by NCBI Clone layouts generated By Washington University ACCTTGG CCTGAAT CTAGGCT TTGCATC CCTAGTC CTGATCG sequenceClone maps Working draft assembly The “Working Draft” of the human genome

UCSC put the human genome sequence on the web July 7, 2000 UCSC put the human genome sequence on CD in October 2000, with varying results Cyber geeks Searched for hidden Messages, and “GATTACA”

The Completion of the Human Genome Sequence June 2000 White House announcement that the majority of the human genome (80%) had been sequenced (working draft). Working draft made available on the web July 2000 at genome.ucsc.edu. Publication of 90 percent of the sequence in the February 2001 issue of the journal Nature. Completion of 99.99% of the genome as finished sequence on July 2003.

The Project is not Done… Next there is the Annotation: The sequence is like a topographical map, the annotation would include cities, towns, schools, libraries and coffee shops! So, where are the genes? How do genes work? And, how do scientists use this information for scientific understanding and to benefit us?

What do genes do anyway? We only have ~27,000 genes, so that means that each gene has to do a lot. Genes make proteins that make up nearly all we are (muscles, hair, eyes). Almost everything that happens in our bodies happens because of proteins (walking, digestion, fighting disease). Eye Color and Hair Color are determined by genes OR

Of Mice and Men: It’s all in the genes Humans and Mice have about the same number of genes. But we are so different from each other, how is this possible? One human gene can make many different proteins while a mouse gene can only make a few! Did you say cheese? Mmm, Cheese!

Genes are important By selecting different pieces of a gene, your body can make many kinds of proteins. (This process is called alternative splicing.) If a gene is “expressed” that means it is turned on and it will make proteins.

What we’ve learned from our genome so far… There are a relatively small number of human genes, less than 30,000, but they have a complex architecture that we are only beginning to understand and appreciate. -We know where 85% of genes are in the sequence. -We don’t know where the other 15% are because we haven’t seen them “on” (they may only be expressed during fetal development). -We only know what about 20% of our genes do so far. So it is relatively easy to locate genes in the genome, but it is hard to figure out what they do.

How do scientists find genes? The genome is so large that useful information is hard to find. Researchers at UCSC decided to make a computational microscope to help scientists search the genome. Just as you would use “google” to find something on the internet, researchers can use the “UCSC Genome Browser” to find information in the human genome. Explore it at

The UCSC Genome Browser

The browser takes you from early maps of the genome...

... to a multi-resolution view...

... at the gene cluster level...

... the single gene level...

... the single exon level...

... and at the single base level caggcggactcagtggatctggccagctgtgacttgacaag caggcggactcagtggatctagccagctgtgacttgacaag

The Continuing Project Finding the complete set of genes and annotating the entire sequence. Annotation is like detailing; scientists annotate sequence by listing what has been learn experimentally and computationally about its function. Proteomics is studying the structure and function of groups of proteins. Proteins are really important, but we don’t really understand how they work. Comparative Genomics is the process of comparing different genomes in order to better understand what they do and how they work. Like comparing humans, chimpanzees, and mice that are all mammals but all very different.

Who works on this stuff anyway? Biologists and Chemists understand the physical sciences-they take biology and chemistry classes. Computer Scientists program the computers (the same people who make video games!)-they take math and computer classes. Computer Engineers try to build better, faster, smarter computers-they take math, physics and computer classes. Social Scientists try to understand how this new information and technology will impact our lives- they take sociology and philosophy classes.

UCSC Summer Workshop on Human Genome Research Held annually in July It’s a free event for students and teachers Workshops by faculty and researchers on a wide array of topics Tours of our laboratories and kilocluster Free breakfast and lunch Travel funds are available RSVP: or

How can I work on this project, or something like it? Read about it, online at or in Nature, Science, or other scientific magazines. Take classes in biology, chemistry, math, physics and English classes at high school. OR take classes at your local community college or University-Extension in biology, bioinformatics, or genetics. Go to college and get a degree in science, engineering, math, or social sciences.

Bioinformatics Opportunities Entry-Level - Company National Laboratory Teaching – Private Schools BS (BA) MS (MA) Research Staff - Company/University National Laboratory Research Foundation Teaching - Community College Public Schools PhD Director/Professor - University Company National Laboratory Research Foundation Bioinformatics Biochemistry Biology Computer Science Computer Engineering Mathematics Ocean Sciences Physics (Education, Sociology, Philosophy, Psychology, Community Studies) A research degree in any of these majors will take you far!

Thank you for letting us come talk to you today and share what we do! Bye! Come to UCSC, Slugs are cool!