Introduction to BioInformatics GCB/CIS535

Slides:



Advertisements
Similar presentations
Bioinformatics Motif Detection Revised 27/10/06. Overview Introduction Multiple Alignments Multiple alignment based on HMM Motif Finding –Motif representation.
Advertisements

Ab initio gene prediction Genome 559, Winter 2011.
Regulatory Motifs. Contents Biology of regulatory motifs Experimental discovery Computational discovery PSSM MEME.
Gene Prediction Methods G P S Raghava. Prokaryotic gene structure ORF (open reading frame) Start codon Stop codon TATA box ATGACAGATTACAGATTACAGATTACAGGATAG.
CISC667, F05, Lec18, Liao1 CISC 467/667 Intro to Bioinformatics (Fall 2005) Gene Prediction and Regulation.
Basic Biology for CS262 OMKAR DESHPANDE (TA) Overview Structures of biomolecules How does DNA function? What is a gene? How are genes regulated?
Introduction to Computational Biology Topics. Molecular Data Definition of data  DNA/RNA  Protein  Expression Basics of programming in Matlab  Vectors.
Gene Finding (DNA signals) Genome Sequencing and assembly
Dimension reduction : PCA and Clustering by Agnieszka S. Juncker
Multidimensional Analysis If you are comparing more than two conditions (for example 10 types of cancer) or if you are looking at a time series (cell cycle.
CSE182-L12 Gene Finding.
CISC667, F05, Lec24, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) DNA Microarray, 2d gel, MSMS, yeast 2-hybrid.
Review of important points from the NCBI lectures. –Example slides Review the two types of microarray platforms. –Spotted arrays –Affymetrix Specific examples.
Bioinformatics page 12, part of ch. 21 Cell and Mol Biol Lab.
CSE182-L10 MS Spec Applications + Gene Finding + Projects.
Gene Finding Genome Annotation. Gene finding is a cornerstone of genomic analysis Genome content and organization Differential expression analysis Epigenomics.
Biological Motivation Gene Finding in Eukaryotic Genomes
Presented by Liu Qi An introduction to Bioinformatics Algorithms Qi Liu
Genome Analysis & Gene Prediction. Overview about Genes Gene : whole nucleic acid sequence necessary for the synthesis of a functional protein (or functional.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
International Livestock Research Institute, Nairobi, Kenya. Introduction to Bioinformatics: NOV David Lynn (M.Sc., Ph.D.) Trinity College Dublin.
In-Text Art, Ch. 9, p In-Text Art, Ch. 3, p. 37.
Applications of HMMs Yves Moreau Overview Profile HMMs Estimation Database search Alignment Gene finding Elements of gene prediction Prokaryotes.
Genome Annotation BBSI July 14, 2005 Rita Shiang.
Grupo 5. 5’site 3’site branchpoint site exon 1 intron 1 exon 2 intron 2 AG/GT CAG/NT.
Genome Organization & Evolution. Chromosomes Genes are always in genomic structures (chromosomes) – never ‘free floating’ Bacterial genomes are circular.
Genome Analysis & Gene Prediction. Overview about Genes Gene : whole nucleic acid sequence necessary for the synthesis of a functional protein (or functional.
Gene finding and gene structure prediction M. Fatih BÜYÜKAKÇALI Computational Bioinformatics 2012.
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
Recombinant DNA Technology and Genomics A.Overview: B.Creating a DNA Library C.Recover the clone of interest D.Analyzing/characterizing the DNA - create.
Computational Genomics and Proteomics Lecture 8 Motif Discovery C E N T R F O R I N T E G R A T I V E B I O I N F O R M A T I C S V U E.
Pattern Matching Rhys Price Jones Anne R. Haake. What is pattern matching? Pattern matching is the procedure of scanning a nucleic acid or protein sequence.
Lecture 9. Functional Genomics at the Protein Level: Proteomics.
Gene expression. The information encoded in a gene is converted into a protein  The genetic information is made available to the cell Phases of gene.
Gene Prediction: Similarity-Based Methods (Lecture for CS498-CXZ Algorithms in Bioinformatics) Sept. 15, 2005 ChengXiang Zhai Department of Computer Science.
Mark D. Adams Dept. of Genetics 9/10/04
From Genomes to Genes Rui Alves.
Gene, Proteins, and Genetic Code. Protein Synthesis in a Cell.
Central dogma: the story of life RNA DNA Protein.
Prokaryotic cells turn genes on and off by controlling transcription.
Recombination breakpoints Family Inheritance Me vs. my brother My dad (my Y)Mom’s dad (uncle’s Y) Human ancestry Disease risk Genomics: Regions  mechanisms.
Basic Overview of Bioinformatics Tools and Biocomputing Applications II Dr Tan Tin Wee Director Bioinformatics Centre.
CSE182 CSE182-L11 Protein sequencing and Mass Spectrometry.
Alternative Splicing (a review by Liliana Florea, 2005) CS 498 SS Saurabh Sinha 11/30/06.
Genes and Genomes. Genome On Line Database (GOLD) 243 Published complete genomes 536 Prokaryotic ongoing genomes 434 Eukaryotic ongoing genomes December.
1 From Mendel to Genomics Historically –Identify or create mutations, follow inheritance –Determine linkage, create maps Now: Genomics –Not just a gene,
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Microarray analysis Quantitation of Gene Expression Expression Data to Networks BIO520 BioinformaticsJim Lund Reading: Ch 16.
Bioinformatics Workshops 1 & 2 1. use of public database/search sites - range of data and access methods - interpretation of search results - understanding.
Finding genes in the genome
CFE Higher Biology DNA and the Genome Transcription.
BIOINFORMATICS Ayesha M. Khan Spring 2013 Lec-8.
Using public resources to understand associations Dr Luke Jostins Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015.
HOW DO CELLS KNOW WHEN TO EXPRESS A GENE? DO NOW:.
1 Gene Finding. 2 “The Central Dogma” TranscriptionTranslation RNA Protein.
bacteria and eukaryotes
Bioinformatics Overview
The Transcriptional Landscape of the Mammalian Genome
High-throughput Biological Data The data deluge
Genome organization and Bioinformatics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Introduction to Bioinformatics II
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Proteomics Informatics David Fenyő
EXTENDING GENE ANNOTATION WITH GENE EXPRESSION
From Mendel to Genomics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Basic Local Alignment Search Tool
Proteomics Informatics David Fenyő
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presentation transcript:

Introduction to BioInformatics GCB/CIS535

Course Overview Sequence alignment Gene prediction Gene expression Dynamic programming Blast and its variants statistical significance Motif and promoter prediction Gene prediction Homology and HMMs Gene expression Experiment design Interpretation: clustering Proteomics Use of mass spectrometry

Sequence Alignment Choices Motif finding nucleotide vs. amino acid global vs. local repeat masking Motif finding Position weight matrices PAM, BLOSUM CONSENSUS EM and Gibbs sampling methods

Promoter Finding CpG islands Transcription Factor Binding sites TATA, GC, and CAAT boxes Transfac and Jasper libraries FirstExon

Gene Finding Homology Hidden Markov Models (HMMs) Regression Future Conservation between species Hidden Markov Models (HMMs) Acceptors & donors Coding & non-coding Frame shifts Regression Linear regression Artificial neural networks Future Conditional Random Fields (CRFs)

Gene Expression

Gene Expression Uses Technology Experimental design Analysis Finding Differentially Expressed Genes Gene List Annotation Technology Spotted array (two color) and Affimetrics (one color) Experiment Execution (Process Control) Experimental design Replicates Matched experiments Controls / reference samples Analysis Probes to Genes Normalization Sample Quality Control Statistical Significance of Over Representation

Clustering Clustering methods Key decisions Dimension reduction Hierarchical K-means Key decisions Standardize data? How many clusters? Dimension reduction PCA - Principal Components Analysis SOM - Self Organizing Maps Assessment Cluster purity

Methods for protein identification Proteomics Methods for protein identification

Proteomics Uses Mass spectrometry Toxicology Compare diseased vs. normal cells Alternative splicing Post-translational modifications Together with genomics Mass spectrometry Mass fingerprinting Sequence tags Cross correlation with simulated mass spectra E.g. Sequest and mascot Problem with introns Y-ions and b-ions Tandem mass spec

Future Directions Regulatory mechanisms Binding between Transcription (“gene expression”) Translation (“protein production”) Acetylation (of lycine) Phosphorylation, Other protein, RNA and DNA modification Binding between DNA, RNA, Protein Comparison across species Systems biology Metabolic modeling Combining data

Gene Regulatory Network Sea urchin development

Metabolic Networks