Introduction to Bioinformatics

Slides:



Advertisements
Similar presentations
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
Advertisements

© 2006 W.W. Norton & Company, Inc. DISCOVER BIOLOGY 3/e
Biological Databases Chi-Cheng Lin, Ph.D. Associate Professor Department of Computer Science Winona State University – Rochester Center
Sequence Analysis. Today How to retrieve a DNA sequence? How to search for other related DNA sequences? How to search for its protein sequence? How to.
Create a tally sheet for scoring and give each player a bell or buzzer. To begin a game, go to the next slide and select “Slide Show” from the “View”
Changes in DNA can produce variation
On line (DNA and amino acid) Sequence Information
Genes (3.1) IB Diploma Biology Essential Idea: Heritable traits are passed down to offspring through genes.
Transcription and Translation
Bioinformatics.
Mutations Mutation- a change in the DNA nucleotide sequence
Introduction to Bioinformatics CPSC 265. Interface of biology and computer science Analysis of proteins, genes and genomes using computer algorithms and.
Doug Brutlag 2011 Genomics & Medicine Doug Brutlag Professor Emeritus of Biochemistry &
Genomes and Their Evolution. GenomicsThe study of whole sets of genes and their interactions. Bioinformatics The use of computer modeling and computational.
DNA Structure & Function. Perspective They knew where genes were (Morgan) They knew what chromosomes were made of Proteins & nucleic acids They didn’t.
Chapter 24: Molecular and Genomic Evolution CHAPTER 24 Molecular and Genomic Evolution.
5.2.  Differences or variation in DNA are what makes us different from each other!  Each human cell contains 46 chromosomes  Each cell has 3 billion.
Genes and Genomic Datasets. DNA compositional biases Base composition of genomes: E. coli: 25% A, 25% C, 25% G, 25% T P. falciparum (Malaria parasite):
EB3233 Bioinformatics Introduction to Bioinformatics.
Biotechnology and Genomics Chapter 16. Biotechnology and Genomics 2Outline DNA Cloning  Recombinant DNA Technology ­Restriction Enzyme ­DNA Ligase 
Human Genomics. Writing in RED indicates the SQA outcomes. Writing in BLACK explains these outcomes in depth.
Gene Expression: from DNA to protein to phenotype How is DNA transcribed to RNA? How is mRNA translated to protein? How do alterations in DNA lead to alterations.
Human Evolution. Where did hominids evolve? Where do we live now?
Bioinformatics Workshops 1 & 2 1. use of public database/search sites - range of data and access methods - interpretation of search results - understanding.
The Genetic Code. The DNA that makes up the human genome can be subdivided into information bytes called genes. Each gene encodes a unique protein that.
CHANGES IN DNA CAN PRODUCE VARIATIONS
Mutations Chapter Types of Mutations The sequence of bases in DNA are like the letters of a coded message or even the letters of a simple alphabet.
Gene Technologies and Human ApplicationsSection 3 Section 3: Gene Technologies in Detail Preview Bellringer Key Ideas Basic Tools for Genetic Manipulation.
Selective Breeding and Natural Selection. DNA Technology.
Genetics 3.1 Genes. Essential Idea: Every living organism inherits a blueprint for life from its parents.
Human Genomics Higher Human Biology. Learning Intentions Explain what is meant by human genomics State that bioinformatics can be used to identify DNA.
Explain how scientists manipulate DNA. Describe the importance of recombinant DNA. Define transgenic and describe the usefulness of some transgenic organisms.
Ch.20 Biotechnology. Overview: The DNA Toolbox History  1970’s Recombinant DNA  2001 Human Genome Project  10 years, $3 billion  Genomes.
Genetic Code and Interrupted Gene Chapter 4. Genetic Code and Interrupted Gene Aala A. Abulfaraj.
Disorder, Cause, and Result
Higher Human Biology Sub topic 5 (a)
Gil McVean Department of Statistics
6 C H A P T E R HUMAN VARIATION AND ADAPTATION 6-2.
Genes (3.1) Essential Idea: Heritable traits are passed down to offspring through genes.
Section 3: Gene Technologies in Detail
The ABC’s of DNA Barry Bowman.
DNA Marker Lecture 10 BY Ms. Shumaila Azam
Mutations and Genetic Engineering
Changes in DNA can produce Variation
Genes 3.1.
Genomes and Their Evolution
Relationship between Genotype and Phenotype
ABO Blood Type: An Example of Genetic Variation
Biology, 9th ed,Sylvia Mader
Introduction to Genetics
How do individuals and groups with different genes arise?
Genomes and Their Evolution
Warm Up – Visual Analysis
There are four levels of structure in proteins
DNA and Modern Genetics
Genes 3.1.
Genome organization and Bioinformatics
Lesson Overview 13.3 Mutations.
Where do Blue Eyes Come From?
Every living organism inherits a blueprint for life from its parents.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Overview of Genetics.
Gene and Chromosomal Mutations
Biology, 9th ed,Sylvia Mader
From Mendel to Genomics
What has happened? Substitution mutation
Unit Genomic sequencing
Introduction to Bioinformatics
Biology 331 Genetics Introduction.
Relationship between Genotype and Phenotype
Presentation transcript:

Introduction to Bioinformatics Topic 1 Introduction to Bioinformatics and Sequence Analysis

Session 1 Learning Outcomes: The scope of bioinformatics The origins & growth of DNA databases Evidence of evolution from bioinformatics Example sequence analysis and displays using human Factor IX

Bioinformatics: Concerns the generation, visualization, analysis, storage, and retrieval of large quantities of biological information.

GenBank growth: How much data we are talking about? The amount of DNA sequence data in public databases NCBI: US national centre for Biotechnology Information DDJB: DNA Data Bank Japan EBI: European Bioinformatics Institute The contest of these data base are synchronized.

What DATA??? Human Genome Project Projecting now come from scientists in numerous field of biology, medicine, agriculture, ecology, history, energy, and forensic. Lets give some examples which you can explore in your own interest:

http://www.1000genomes.org The genomes of 1000 people to identify genetic variants that affect 1% of the human population

www.1001genomes.org The genomes of 1001 strains that differ in phenotype including adaptation to growth in a wide variety conditions.

https://genome10k.soe.ucsc.edu/ An effort to sequence the genomes of 10,000 species, one from each genus.

http://www.arthropodgenomes.org/wiki/i5K

http://www.ncbi.nlm.nih.gov/genome/browse/ Metagenomics database

Cancer genome atlas

ANNOTATION: The information describing genetic and protein sequences structures, similarities, functions, and prediction associated with these sequences.

Advantageous Deleterious Neutral WITNESSING EVOLUTION THROUGH BIOINFORMATICS Random mutation in sequences is a common phenomenon. Advantageous Organism kept it for future population Deleterious Quickly eliminated from the population Neutral May or may not be retained

Recent evolutionary changes to plants & animals 10,000 years ago hunter-gather life-style to practicing agriculture. Domestication of animals. Cows milk production Horses  speed or strength Sheep  wool quantity and quantity Poultry  more breast meat Fish  speed of maturation

LARGE SOURCES OF HUMAN SEQUENCE VARIATION First time sequencing of human genome both cost and time was high. Resequencing cost decline sharply as using the first sequence as template. Resequencing show considerable differnces seen between individual people.

Single Nucleotide polymorphisms (SNPs): Human genome 3.2 billions bp Approximately 3 million nucleotides differ between two individual genomes The common differences are found in about 1% of the population.

Copy Number Variations (CNVs): Comparing your DNA sequence to that of the human “standard genome”, there are thousands of DNA segments which range from 1000 to several million nucleotides in length and they are either present, present in multiple copies or absent from your genome.

Africa (50,000 years ago)  Middle East  Europe  Neanderthals RECENT EVOLUTIONARY CHANGES TO HUMAN POPULATIONS Africa (50,000 years ago)  Middle East  Europe  Neanderthals Eastern Europe  Lithuania

block damaging of uv light Examples of genetic changes associated with adaptation (diet and lifestyle): Skin Color: African Indian Southern European Northern European Near pole Paler skin color make vitamin D Near equator Darker skin color block damaging of uv light Sequence variation in number of genes, one of it is SLC24A5

Other examples: (self study) Lactose intolerance Digestion of starch Malaria resistance and sickle cell anemia Life at high altitude

DNA SEQUENCE IN DATABASES

Two types of DNA sequences are available in databases: Genomic DNA cDNA

Genomic DNA assembly

cDNA:

SEQUENCE ANALYSIS AND DATABASE DISPLAY The sequence of the mRNA for human Factor IX Accession number: NM_000133

Applying two rules for describing the human Factor IX mRNA sequence: Coding regions begins with ATG Coding regions end with one of three terminator sequences: TAA TGA TAG

Coding regions are read at triplets. Others are 5’ and 3’ UTR

Coding region triplets are translated into amino acids.

The protein sequence of human factor IX (461 amino acids)

Pairwise alignment: Factor IX gene which is over 38000 nt. A single mutation, changing a G to T at coordinate 25531, results in hemophilia B, a severe bleeding disorder.

Alignment of human (Query) and chimpanzee (Subject/Subjct) Factor IX proteins

Factor IX has five major domains Cleaved by signal peptidase, 12 Gla residues in the second domain. Activated by cleaving the protein into 2 peptide Cleave X protein, clotting cascade pathway To direct the protein to the ER of liver cells, from where it secreted into the blood. Epidermal growth factor- like domain bind Ca++

The entire 38000 nt gene is shown as the black arrow F9.

Location of Factor IX gene in chr X.

THE END