Introduction to Bioinformatics February 13, 2017

Slides:



Advertisements
Similar presentations
Martin John Bishop UK HGMP Resource Centre Hinxton Cambridge CB10 1 SB
Advertisements

Machine learning methods for the analysis of heterogeneous, multi- source data Ilkka Huopaniemi Statistical machine learning and.
Molecular Biomedical Informatics Machine Learning and Bioinformatics Machine Learning & Bioinformatics 1.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
©2001 Timothy G. Standish Psalm 102:25 25In the beginning you laid the foundations of the earth, and the heavens are the work of your hands.
Bioinformatics For MNW 2 nd Year Jaap Heringa FEW/FALW Integrative Bioinformatics Institute VU (IBIVU) Tel ,
Bioinformatics at IU - Ketan Mane. Bioinformatics at IU What is Bioinformatics? Bioinformatics is the study of the inherent structure of biological information.
Assembly of RNA polymerase II preinitiation complex Heather Drilling.
University of Louisville The Department of Bioinformatics and Biostatistics.
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
Introduction to BioInformatics GCB/CIS535
BI420 – Course information Web site: Instructor: Gabor Marth Teaching.
Why microarrays in a bioinformatics class? Design of chips Quantitation of signals Integration of the data Extraction of groups of genes with linked expression.
Bioinformatics Jan Taylor. A bit about me Biochemistry and Molecular Biology Computer Science, Computational Biology Multivariate statistics Machine learning.
Bioinformatics.
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
Master’s Degrees in Bioinformatics in Switzerland: Past, present and near future Patricia M. Palagi Swiss Institute of Bioinformatics.
©2001 Timothy G. Standish Psalm 102:24 25In the beginning you laid the foundations of the earth, and the heavens are the work of your hands.
© 2004 by Genomatix Software GmbH Genomatix Dr. Martin Seifert Genomatix Software GmbH Landsberger Strasse 6, D München
Bioinformatics For MNW 2 nd Year Jaap Heringa FEW/FALW Centre for Integrative Bioinformatics VU (IBIVU) Tel ,
Gene expression. The information encoded in a gene is converted into a protein  The genetic information is made available to the cell Phases of gene.
AdvancedBioinformatics Biostatistics & Medical Informatics 776 Computer Sciences 776 Spring 2002 Mark Craven Dept. of Biostatistics & Medical Informatics.
Epidemiology 217 Molecular and Genetic Epidemiology Bioinformatics & Proteomics John Witte.
Central dogma: the story of life RNA DNA Protein.
EB3233 Bioinformatics Introduction to Bioinformatics.
Gene Expression Analysis Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics.
Computational Biology and Genomics at Boston College Biology Gabor T. Marth Department of Biology, Boston College
High-throughput data used in bioinformatics
Introduction to Bioinformatics
Cancer Genomics and Class Discovery
Gil McVean Department of Statistics
Monica Britton, Ph.D. Sr. Bioinformatics Analyst June 2016 Workshop
Gene Expression Analysis
Statistical Applications in Biology and Genetics
Lecture 2.1.
Bioinformatics Madina Bazarova. What is Bioinformatics? Bioinformatics is marriage between biology and computer. It is the use of computers for the acquisition,
Lecture 5. Transcription: DNA→RNA
Gene Hunting: Design and statistics
Day 2: Session 8: Questions and follow-up…. James C. Fleet, PhD
Post-GWAS and Mechanistic Analyses
“Proteomics is a science that focuses on the study of proteins: their roles, their structures, their localization, their interactions, and other factors.”
Genomes and Their Evolution
Concept 18.2: Eukaryotic gene expression can be regulated at any stage
Gene Expression Analysis and Proteins
Genome organization and Bioinformatics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
9 Future Challenges for Bioinformatics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Agenda 3/16 Eukaryotic Control Introduction and Reading
Data Mining Functionalities (2)
Bioinformatics For MNW 2nd Year
In these studies, expression levels are viewed as quantitative traits, and gene expression phenotypes are mapped to particular genomic loci by combining.
Hebrews 1 1 God, who at sundry times and in divers manners spake in time past unto the fathers by the prophets, 2 Hath in these last days spoken unto us.
Introduction to Bioinformatic
Comparison of Nuclear, Eukaryotic RNA Polymerases
A systems view of genetics in chronic kidney disease
Gene Expression Analysis
The gene: structure, function and location
Summarized by Sun Kim SNU Biointelligence Lab.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Session 1: WELCOME AND INTRODUCTIONS
Introduction to Bioinformatics
Vermont Genetics Network Outreach Proteomics Module
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
GN434 - Genes and Development
Vermont Genetics Network Outreach Proteomics Module
Biotechnology & Bioinformatics
Presentation transcript:

Introduction to Bioinformatics February 13, 2017 Dr. ir. Perry Moerland Bioinformatics Laboratory Academic Medical Center p.d.moerland@amc.uva.nl Graduate School ‘Bioinformatics’

Aim of course Get acquainted with the basic principles and algorithms of commonly used bioinformatics tools Gain sufficient theoretical knowledge and practical skills to be able to apply bioinformatics adequately in your own work

Topics (I) Possibilities and limitations of public biological DBs Statistical concepts for ‘omics data analysis DNA microarray analysis Proteomics data analysis Metabolomics data analysis Pathways and networks Genetical genomics Capita selecta: DNA methylation array analysis, sample misannotation, …

Topics (I) Possibilities and limitations of public biological DBs Statistical concepts for ‘omics data analysis DNA microarray analysis Proteomics data analysis Metabolomics data analysis Pathways and networks Genetical genomics Capita selecta

Topics (II) Methods for the analysis of data generated with high-throughput technologies Microarrays Mass spectrometry Next generation sequencing Course: Bioinformatics Sequence Analysis

What has been left out Almost anything sequence-based Phylogenetics Construction of evolutionary trees Image from Florian Markowetz’s blog: https://scientificbsides.wordpress.com/2014/12/08/the-biggest-problem-in-cancer-evolution-that-mostly-people-like-me-are-doing-it/

What has been left out Almost anything sequence-based Phylogenetics Construction of evolutionary trees Modeling of intra-tumour heterogeneity Source: Florian Markowetz’s blog: https://scientificbsides.wordpress.com/2014/12/08/

What has been left out Almost anything sequence-based Phylogenetics Construction of evolutionary trees Modeling of intra-tumour heterogeneity Comparative genomics Protein modeling, protein docking Systems biology Information management Programming e-Science Multi-omic approaches, exception: eQTL

Related AMC Graduate School courses Computing in R Unix e-Science (Big Data) Bioinformatics Sequence Analysis Systems Medicine Practical Biostatistics Advanced Biostatistics Genetic Epidemiology BioSB Research School: http://biosb.nl/education/course-portfolio-2/ Pattern Recognition (Machine Learning) DNA Technology Mass Spectrometry, Proteomics and Protein Research

Possibilities and limitations of public biological databases Most high-throughput data is publicly available Often enforced by journals Possibilities Limitations Errors in databases GPL11012 (Gene Expression Omnibus)

Possibilities and limitations of public biological databases Most high-throughput data is publicly available Often enforced by journals Possibilities Limitations Errors in databases GPL11012 (Gene Expression Omnibus) Zeeberg et al., BMC Bioinformatics. 5:80 2004

Possibilities and limitations of public biological databases Most high-throughput data is publicly available Often enforced by journals Possibilities Limitations Errors in databases GPL11012 (Gene Expression Omnibus)

Statistical concepts for ‘omics data analysis High-dimensional data 10,000s of genes, transcript variants, proteins, metabolites 100,000s of single nucleotide polymorphisms, epigenetic markers In general, much less samples: ~100s Experimental design Quality control Pre-processing: normalization Differential expression: statistical tests, multiple testing Unsupervised: clustering Supervised: classification, prediction Widely applicable: next-generation sequence analysis, for example

‘Omics technologies Microarrays mRNA Single nucleotide polymorphisms Methylation Transcription factor binding Chromosal aberrations – aCGH (comparative genomic hybridization) Mass spectrometry Proteins: identification Metabolites: pre-processing

Pathways and networks activated pathways Interorgan coordination of the murine adaptive response to fasting : 5 tissues 5 timepoints 5 mice per timepoint Hakvoort et al., J Biol Chem, 286(18):16332-43, 2011

lipid steroid carbohydrates metabolism amino acid FoxOs cell turnover transcriptional network FASTING CHALLENGE ‘to serve and protect’ metabolic regulators central controller lipid steroid carbohydrates amino acid metabolism cell turnover immune response ox. stress defense cMyc Sp1 p53 EGF AP-1 HNF4α FoxOs NRs

Genetical genomics Locus SNP X modulates expression of gene Y = Expression quantitative trait locus (eQTL) SNP X TFIIB TFIIE TFIIH IN R TFIID TFIIF RNA polymerase II TFIIA TBP proximal promoter core distal promoter/ enhancer TF binding sites „DNA-looping“ TATA TF binding sites Expression gene Y Gene Y Genotype SNP X Source: Michiel Adriaens

Bioinformatics Laboratory Department of Clinical Epidemiology, Biostatistics and Bioinformatics You are welcome if you need bioinformatics expertise The earlier, the better! wiki.bioinformaticslaboratory.nl

Practical things Certificate Other things Attend all sessions (half a day can be skipped, ask for possibility for self-study) Active participation Other things Lunch is not included Coffee, tea, … is available at the machines (with your AMC badge) Slides and exercises will be made available on http://wiki.bioinformaticslaboratory.nl/ under ‘Education’

Schedule