The Human Genome Project

Slides:



Advertisements
Similar presentations
The Human Genome Project
Advertisements

Beyond the Human Genome Project New Discovery Paths and Diverse Applications.
The Human Genome Project And It’s Impact on the 21 st century Made by RA-A Made by RA-A.
Study of Gene Expression: Statistics, Biology, and Microarrays Ker-Chau Li Statistics Department UCLA
Workshop: computational gene prediction in DNA sequences (intro)
Genomics 101: An Introduction HAS 4320 Genomics 101: An Introduction What factors influence our behavior?
Genomics MUPGRET Weekend Workshop Timeline Answers ne_2.html ne_2.html.
Bioinformatics Kick Ass Cool Stuff. Bioinformatics Def.: where the rubber meets the road (i.e., where computer science meets biology) "The mathematical,
HAS 4320 Pharmaceutical Industry Dr Burton. “…prescription drugs represent approximately 10 percent of the total national personal health care spending,
Human Genetics Overview.
Genomics MUPGRET Weekend Workshop Timeline Answers ne_2.html ne_2.html.
Summer Bioinformatics Workshop 2008 Chi-Cheng Lin, Ph.D., Professor Department of Computer Science Winona State University – Rochester
Chi-Cheng Lin, Ph.D. Associate Professor Department of Computer Science Winona State University – Rochester Center Introduction to Bioinformatics.
Goals of the Human Genome Project determine the entire sequence of human DNA identify all the genes in human DNA store this information in databases improve.
Genomics and Its Impact on Medicine and Society: A 2001 Primer Human Genome Program, U.S. Department of Energy.
Chapter 9: Biotechnology
The Human Genome Project (H.G.P.) By Ben Fuhr. What is the Human Genome Project? The Human Genome Project was a great scientific endeavor designed to.
DNA sequencing: Importance The DNA sequences making up any organism comprise the basic blueprint for that organism.
What is genomics? Study of genomes. What is the genome? Entire genetic compliment of an organism.
Elements of Molecular Biology All living things are made of cells All living things are made of cells Prokaryote, Eukaryote Prokaryote, Eukaryote.
Georgia Wiesner, MD CREC June 20, GATACAATGCATCATATG TATCAGATGCAATATATC ATTGTATCATGTATCATG TATCATGTATCATGTATC ATGTATCATGTCTCCAGA TGCTATGGATCTTATGTA.
What is the Human Genome Project? Identify all the approximately 35,000 genes in human DNA Determine the sequences of the 3,000,000,000 bases ( = 200 phone.
Human Genome Project by: Amanda Mosello. What is the Human Genome Project? created in 1990, by the National Institutes of Health and the US Department.
The Human Genome Project Jacob D. Schroeder Department of Chemistry and Chemical Engineering South Dakota School of Mines and Technology Rapid City, SD.
Human Genome Project. In 2003 scientists in the Human Genome Project obtained the DNA sequence of the 3 billion base pairs making up the human genome.
Beyond the Human Genome Project Future goals and projects based on findings from the HGP.
HUMAN GENOMICS Human Genome Program, U.S. Department of Energy, Genomics and Its Impact on Medicine and Society: A 2001 Primer, 2001.
U.S. Department of Energy Genome Programs
Lesson Overview Lesson Overview Studying the Human Genome Lesson Overview 14.3 Studying the Human Genome.
Write down what you know about the human genome project.
Genomics and Its Impact on Science and Society: The Human Genome Project and Beyond U.S. Department of Energy Genome Programs
Ch. 21 Genomes and their Evolution. New approaches have accelerated the pace of genome sequencing The human genome project began in 1990, using a three-stage.
Aim: What have we learned from the Human Genome Project ? Human Genome Project Progress Project goals were togoals 1.identify all the approximately 20,000-25,000.
Bioinformatics field of science in which biology, computer science, and information technology merge to form a single discipline.
Chapter 21 Eukaryotic Genome Sequences
Human Genome Project Daniel Ospina Joaquín Llano.
HUMAN GENOME PROJECT International effort of 13 years (1990 – 2003) Identified all the approximate 20,000 – 25,000 genes in human DNA Determined the sequences.
KEY CONCEPT Biotechnology relies on cutting DNA at specific places.
1 From Mendel to Genomics Historically –Identify or create mutations, follow inheritance –Determine linkage, create maps Now: Genomics –Not just a gene,
The Human Genome Race. Collins vs. Venter Collins Venter.
Benefits of Human Genome Project Research
The Human Genome Project
The Secret of Life! DNA. 2/4/20162 SOMETHING HAPPENS GENE PROTEIN.
The Human Genome Project
Chapter 2 Genome Organization and Evolution Dr Achraf El Allali.
Notes: Human Genome (Right side page)
Chapter 13 Section 13.3 The Human Genome. Genomes contain all the information needed for an organism to grow and survive The Human Genome Project (HGP)
Looking Within Human Genome King abdulaziz university Dr. Nisreen R Tashkandy GENOMICS ; THE PIG PICTURE.
What does the draft human genome sequence tell us?
Biotechnology.
Chapter 9: Biotechnology
What does the draft human genome sequence tell us?
U.S. Department of Energy Genome Programs
Biotechnology.
Human Cells Human genomics
Introduction to Genetics and Biotechnology
U.S. Department of Energy Genome Programs
U.S. Department of Energy Genome Programs
Scientists use several techniques to manipulate DNA.
U.S. Department of Energy Genome Programs
What is Technology?.
Genomes and Their Evolution
Aka DNA finger printing
The Human Genome Project
U.S. Department of Energy Genome Programs
The Study of Biological Information
U.S. Department of Energy Genome Programs
U.S. Department of Energy Genome Programs
In 2003 scientists in the Human Genome Project achieved a long-sought goal by obtaining the DNA sequence of the 3.2 billion base pairs (the order of As,
A Lot More Advanced Biotechnology Tools
Presentation transcript:

The Human Genome Project Prepared By Ihsan Ullah Roll No 117106 BS Bioinformatics

Introduction Until the early 1970’s, DNA was the most difficult cellular molecule for biochemists to analyze. DNA is now the easiest molecule to analyze – we can now isolate a specific region of the genome, produce a virtually unlimited number of copies of it, and determine its nucleotide sequence overnight.

Introduction At the height of the Human Genome Project, sequencing factories were generating DNA sequences at a rate of 1000 nucleotides per second 24/7. Technical breakthroughs that allowed the Human Genome Project to be completed have had an enormous impact on all of biology…..

Human Genome Project Goals: Milestones: ■ identify all the approximate 30,000 genes in human DNA, ■ determine the sequences of the 3 billion chemical base pairs that make up human DNA, ■ store this information in databases, ■ improve tools for data analysis, ■ transfer related technologies to the private sector, and ■ address the ethical, legal, and social issues (ELSI) that may arise from the project.   Milestones: ■ 1990: Project initiated as joint effort of U.S. Department of Energy and the National Institutes of Health ■ June 2000: Completion of a working draft of the entire human genome (covers >90% of the genome to a depth of 3-4x redundant sequence) ■ February 2001: Analyses of the working draft are published ■ April 2003: HGP sequencing is completed and Project is declared finished two years ahead of schedule

What does the draft human genome sequence tell us? By the Numbers • The human genome contains 3 billion chemical nucleotide bases (A, C, T, and G).  • The average gene consists of 3000 bases, but sizes vary greatly, with the largest known human gene being dystrophin at 2.4 million bases.  • The total number of genes is estimated at around 30,000--much lower than previous estimates of 80,000 to 140,000.  • Almost all (99.9%) nucleotide bases are exactly the same in all people.  • The functions are unknown for over 50% of discovered genes.

What does the draft human genome sequence tell us? How It's Arranged • The human genome's gene-dense "urban centers" are predominantly composed of the DNA building blocks G and C.   • In contrast, the gene-poor "deserts" are rich in the DNA building blocks A and T. GC- and AT-rich regions usually can be seen through a microscope as light and dark bands on chromosomes. • Genes appear to be concentrated in random areas along the genome, with vast expanses of noncoding DNA between. • Stretches of up to 30,000 C and G bases repeating over and over often occur adjacent to gene-rich areas, forming a barrier between the genes and the "junk DNA." These CpG islands are believed to help regulate gene activity. • Chromosome 1 has the most genes (2968), and the Y chromosome has the fewest (231).

What does the draft human genome sequence tell us? The Wheat from the Chaff • Less than 2% of the genome codes for proteins.   • Repeated sequences that do not code for proteins ("junk DNA") make up at least 50% of the human genome. • Repetitive sequences are thought to have no direct functions, but they shed light on chromosome structure and dynamics. Over time, these repeats reshape the genome by rearranging it, creating entirely new genes, and modifying and reshuffling existing genes. • The human genome has a much greater portion (50%) of repeat sequences than the mustard weed (11%), the worm (7%), and the fly (3%).

What does the draft human genome sequence tell us? How the Human Compares with Other Organisms • Unlike the human's seemingly random distribution of gene-rich areas, many other organisms' genomes are more uniform, with genes evenly spaced throughout.  • Humans have on average three times as many kinds of proteins as the fly or worm because of mRNA transcript "alternative splicing" and chemical modifications to the proteins. This process can yield different protein products from the same gene.  • Humans share most of the same protein families with worms, flies, and plants; but the number of gene family members has expanded in humans, especially in proteins involved in development and immunity.  • Although humans appear to have stopped accumulating repeated DNA over 50 million years ago, there seems to be no such decline in rodents. This may account for some of the fundamental differences between hominids and rodents, although gene estimates are similar in these species. Scientists have proposed many theories to explain evolutionary contrasts between humans and other organisms, including those of life span, litter sizes, inbreeding, and genetic drift.

What does the draft human genome sequence tell us? Variations and Mutations • Scientists have identified about 3 million locations where single-base DNA differences (SNPs) occur in humans. This information promises to revolutionize the processes of finding chromosomal locations for disease-associated sequences and tracing human history.   • The ratio of germline (sperm or egg cell) mutations is 2:1 in males vs females. Researchers point to several reasons for the higher mutation rate in the male germline, including the greater number of cell divisions required for sperm formation than for eggs.

What does the draft human genome sequence tell us? Led to the discovery of whole new classes of proteins and genes, while revealing that many proteins have been much more highly conserved in evolution than had been suspected. Provided new tools for determining the functions of proteins and of individual domains within proteins, revealing a host of unexpected relationships between them.

What does the draft human genome sequence tell us? By making large amounts of protein available, it has yielded an efficient way to mass produce protein hormones and vaccines Dissection of regulatory genes has provided an important tool for unraveling the complex regulatory networks by which eukaryotic gene expression is controlled.

How does the human genome stack up? Organism Genome Size (Bases) Estimated Genes Human (Homo sapiens) 3 billion 30,000 Laboratory mouse (M. musculus) 2.6 billion Mustard weed (A. thaliana) 100 million 25,000 Roundworm (C. elegans) 97 million 19,000 Fruit fly (D. melanogaster) 137 million 13,000 Yeast (S. cerevisiae) 12.1 million 6,000 Bacterium (E. coli) 4.6 million 3,200 Human immunodeficiency virus (HIV) 9700 9

What We Still Don’t Know Future Challenges: What We Still Don’t Know • Gene number, exact locations, and functions • Gene regulation • DNA sequence organization • Chromosomal structure and organization • Noncoding DNA types, amount, distribution, information content, and functions • Coordination of gene expression, protein synthesis, and post-translational events • Interaction of proteins in complex molecular machines • Predicted vs experimentally determined gene function • Evolutionary conservation among organisms • Protein conservation (structure and function) • Proteomes (total protein content and function) in organisms • Correlation of SNPs (single-base DNA variations among individuals) with health and disease • Disease-susceptibility prediction based on gene sequence variation • Genes involved in complex traits and multigene diseases • Complex systems biology including microbial consortia useful for environmental restoration • Developmental genetics, genomics

Anticipated Benefits of Genome Research Molecular Medicine • improve diagnosis of disease • detect genetic predispositions to disease • create drugs based on molecular information • use gene therapy and control systems as drugs • design “custom drugs” (pharmacogenomics) based on individual genetic profiles Microbial Genomics • rapidly detect and treat pathogens (disease-causing microbes) in clinical practice • develop new energy sources (biofuels) • monitor environments to detect pollutants • protect citizenry from biological and chemical warfare • clean up toxic waste safely and efficiently

Anticipated Benefits of Genome Research-cont. Risk Assessment • evaluate the health risks faced by individuals who may be exposed to radiation (including low levels in industrial areas) and to cancer-causing chemicals and toxins Bioarchaeology, Anthropology, Evolution, and Human Migration • study evolution through germline mutations in lineages • study migration of different population groups based on maternal inheritance • study mutations on the Y chromosome to trace lineage and migration of males • compare breakpoints in the evolution of mutations with ages of populations and historical events

Anticipated Benefits of Genome Research-cont. DNA Identification (Forensics) • identify potential suspects whose DNA may match evidence left at crime scenes • exonerate persons wrongly accused of crimes • identify crime and catastrophe victims • establish paternity and other family relationships • identify endangered and protected species as an aid to wildlife officials (could be used for prosecuting poachers) • detect bacteria and other organisms that may pollute air, water, soil, and food • match organ donors with recipients in transplant programs • determine pedigree for seed or livestock breeds • authenticate consumables such as caviar and wine  

Anticipated Benefits of Genome Research-cont. Agriculture, Livestock Breeding, and Bioprocessing • grow disease-, insect-, and drought-resistant crops • breed healthier, more productive, disease-resistant farm animals • grow more nutritious produce • develop biopesticides • incorporate edible vaccines incorporated into food products • develop new environmental cleanup uses for plants like tobacco

Medicine and the New Genetics Gene Testing  Pharmacogenomics  Gene Therapy Anticipated Benefits: • improved diagnosis of disease earlier detection of genetic predispositions to disease rational drug design gene therapy and control systems for drugs personalized, custom drugs

ELSI: Ethical, Legal, and Social Issues • Privacy and confidentiality of genetic information. • Fairness in the use of genetic information by insurers, employers, courts, schools, adoption agencies, and the military, among others. • Psychological impact, stigmatization, and discrimination due to an individual’s genetic differences. • Reproductive issues including adequate and informed consent and use of genetic information in reproductive decision making. • Clinical issues including the education of doctors and other health-service providers, people identified with genetic conditions, and the general public about capabilities, limitations, and social risks; and implementation of standards and quality‑control measures.

ELSI Issues (cont.) • Uncertainties associated with gene tests for susceptibilities and complex conditions (e.g., heart disease, diabetes, and Alzheimer’s disease). • Fairness in access to advanced genomic technologies. • Conceptual and philosophical implications regarding human responsibility, free will vs genetic determinism, and concepts of health and disease. • Health and environmental issues concerning genetically modified (GM) foods and microbes. • Commercialization of products including property rights (patents, copyrights, and trade secrets) and accessibility of data and materials.

Advances The following advances in robotics and automation reduced the labor by 80% while combining the microbiological advances: Development of Perkin-Elmer (ABI PRISM 3700) gene sequence. 1000 sample per day 15 minutes instead of 8 hours for first automated sequencers A parallel system of 300 sequencers ($300,000 each) Use of supercomputers to assemble fragments Development of process support instrumentation to process 100 K template preps and 200 K sequence reactions per day. 24 hour per day unattended operation of sequencers

Advances In addition to the above advances the field of computational biology (bioinformatics) became increasingly important as the software and processors required to assemble a puzzle of this size still needed to be developed. The solution came in advances in processor speeds that have doubled every 18 months and the development of better overlap detection algorithms. It is expected that using 100 networked workstations the entire genome could be assembled in 30-60 computational days.

Costs of Human Genomic Sequencing Clone by clone $0.30 per finished base $130 million per year for 7 years Total $900 million spent by end of 2003 Shotgun $0.01 per raw base $130 million for 3 years would provide 10× coverage/redundancy plus an additional $90 million for informatics

Human Genome Projects Progress

Algorithms Human Genome Project: GigAssembler Celera: Celera Assembler Euler Algorithm

References http://www.pnas.org/cgi/content/full/98/17/9748 http://portal.acm.org/citation.cfm?id=369190&jmp=citings&dl=GUIDE&dl=ACM http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pubmed&pubmedid=11504945 http://www.genome.org/cgi/content/full/11/9/1461 http://www.genome.org/cgi/content/full/7/5/401 http://www.pnas.org/cgi/content/full/98/17/9748 http://www.cse.ucsc.edu/~learithe/browser/goldenPath/algo.html http://www.eecs.berkeley.edu/~gene/Papers/greedy.path.merging.pdf --> ***detailed algorithm description***

“MAN IS A NUISANCE MAN IS A CRACKPOT BUT ONLY MAN CAN HIT THE JACKPOT” THANK YOU