Introduction to Genetic Analysis

Introduction to Genetic Analysis
Griffiths • Wessler • Carroll • Doebley Introduction to Genetic Analysis ELEVENTH EDITION CHAPTER 14 Genomes and Genomics © 2015 W. H. Freeman and Company

CHAPTER OUTLINE 14.1 The genomics revolution
14.2 Obtaining the sequence of a genome 14.3 Bioinformatics: meaning from genomic sequence 14.4 The structure of the human genome 14.5 The comparative genomics of humans with other species 14.6 Comparative genomics and human medicine 14.7 Functional genomics and reverse genetics

Genomes and Genomics

Dideoxy (Sanger) sequencing and now several
DNA Sequencing Dideoxy (Sanger) sequencing and now several next generation sequencing (NGS) methods are used for whole-genome sequencing (WGS). Thus, Traditional WGS and Next Gen WGS. See video:

Genomics: Completed genomes as 2018
 Currently the genomes of over 5,000 eukaryotes and 134,000 prokaryotes are sequenced. This generates large amounts of information to be handled by individual computers.

Cloning/libraries used in traditional WGS BAC, YAC and ESTs
BAC = bacterial artificial chromosome 150 kb, replicate in E.coli YAC = yeast artificial chromosome 150 kb -1.5 Mb, replicate in yeast EST = expressed sequence tags essentially cDNA sequences; reveals exons

The logic of obtaining a genome sequence

End reads from multiple inserts may be overlapped to produce a contig

Pyrosequencing reactions take place on beads in tiny wells

Pyrosequencing is based on detecting synthesis reactions

Paired-end reads may be used to join two sequence contigs

Strategy for whole-genome shotgun sequencing assembly

How to handle the large amount of information?
Drew Sheneman, New Jersey--The Newark Star Ledger Answer: bioinformatics and Internet

Annotation-the identification of all the genes and functional elements of a genome
ORFs – open reading frames Various binding sites for proteins and RNAs ESTs- expressed sequence tags Based on mRNAs Comparative genomics

Bioinformatics history
In1960s: the birth of bioinformatics IBM 7090 computer The 1960s marked the beginning of bioinformatics. Prior to the advent of high-level computer languages in 1957, programmers needed a detailed knowledge of a computer’s design and were forced to use languages that were unintuitive to humans. High-level computer languages allowed computer scientists to spend more time designing complex algorithms and less time worrying about the technical details of the particular computer model they were using. By the 1960s, mainframe computers like the one pictured in the slide were becoming common at universities and research institutions, giving academics unprecedented access to computers. (As useful as these computers were, they filled entire rooms and had processing power far below that of consumer-grade personal computers today!) Margaret Oakley Dayhoff and colleagues took advantage of these developments and the accumulation of protein sequence data to create some of the first bioinformatics applications. For example, Dayhoff wrote the first computer program to automate sequence assembly, enabling a task that previously took human workers months to be accomplished in minutes. She and her colleagues also published (in paper form) the first protein sequence database and performed many groundbreaking studies regarding phylogeny and scoring sequence comparisons. For these reasons, she is considered one of the great pioneers of computational biology and bioinformatics. Margaret Oakley Dayhoff created: The first protein database The first program for sequence assembly There is a need for computers and algorithms that allow: Access, processing, storing, sharing, retrieving, visualizing, annotating…

The information content of the genome includes binding sites
The information content of the genome includes binding sites. A gene within DNA may be viewed as a series of binding sits for proteins and RNAs.

cDNAs and ESTs reveal exons or gene ends in genome searches

Genome searches hunt for various binding sites

Many forms of evidence are integrated to make gene predictions

DNA (nucleotide sequences) databases
They are big databases and searching either one should produce similar results because they exchange information routinely. -GenBank (NCBI): -Arabidopsis: (TAIR) Specialized databases:Tissues, species… -ESTs (Expressed Sequence Tags) ~at NCBI ~at TIGR

Comparative genomics Comparative genomics considers genomes of closely and more distantly related species for evolutionary insight. BLAST – basic local alignment and search tool; identifies closely related DNA/gene sequences (see ) Homologs-closely related genes Orthologs-closely related genes in two or more genomes inherited from a common ancestor Paralogs-closely related genes within a genome as a result of gene duplication

Question cDNA sequences Protein sequences Genomic DNA sequences
You are a researcher who has tentatively identified a human homolog of a yeast gene. You determine the DNA sequence of cDNAs of both your yeast gene and the human gene and decide to compare the gene sequences, as well as the predicted protein sequence of each, using alignment software. You would expect the greatest sequence identity from comparisons of the: cDNA sequences Protein sequences Genomic DNA sequences Both (a) and (b) will give you equivalent sequence similarity All will give equivalent sequence similarity

Phylogeny (evolutionary history) of living mammals and other amniotes

The human genome carries relics of our egg-laying ancestors
The human genome carries relics of our egg-laying ancestors. Note, pseudogenes are mutationally inactivated (non-functional) genes.

The mouse and human genome have large syntenic blocks of genes in common

Exome sequencing and personalized genomics/medicine

Functional Genomics Functional Genomics: the use of an expanding variety of methods to understand gene and protein function in biological processes. Genomics: New sequence information is being produced at increasing rates. (The contents of GenBank double every year) Transcriptomics: Includes RNA seq and Microarray analysis; Global expression analysis: RNA levels of every gene in the genome analyzed in parallel. Proteomics: Global protein analysis generates large mass spectra libraries. Metabolomics:Global metabolite analysis: 25,000 secondary metabolites characterized Glycomics:Global sugar metabolism analysis

Microarrays can detect differences in gene expression
See

Microarray technology directly involves:
Question Microarray technology directly involves: PCR DNA sequencing Hybridization RFLP detection None of the above

Protein – protein interactions
Yeast two hybrid ChIP (chomatin immunoprecipitation) and ChIP-chip Bi Molecular Fluorescence Complementation (BMFC)

Studying protein interactions with the use of the yeast two-hybrid system
See

Steps in a chromatin immunoprecipitation assay (ChIP)
ChIP-Seq vs ChIP-chip; see

Bi Molecular Fluorescence Complementation (BMFC)
Citovsky et al., 2006

Reverse genetics-used to study gene function (functional genomics)
Gene knockouts RNAi Overexpression Altered expression

Disrupting gene function with the use of targeted mutagenesis
CRISPR-Cas9 is a new and exciting way to make gene knockouts. See

Disrupting gene function with the use of RNA interference

Inserting transgenes into a nonmodel organism

Examples of nonmodel insects expressing a transgene

Summary DNA sequencing and the rise of genomics
Annotation of genome sequences Comparative genomics Functional genomics Protein-protein interactions ESTs Reverse genetics

Introduction to Genetic Analysis

Similar presentations

Presentation on theme: "Introduction to Genetic Analysis"— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Introduction to Genetic Analysis

Similar presentations

Presentation on theme: "Introduction to Genetic Analysis"— Presentation transcript:

Similar presentations

About project

Feedback