BF528 - Applications in Translational Bioinformatics

Slides:



Advertisements
Similar presentations
HCS806 “Methods in Horticulture and Crop Science” Introduction to methods in Bioinformatics for plant science. David Francis (Coordinator) Ian Holford.
Advertisements

Dale Beach, Longwood University Lisa Scheifele, Loyola University Maryland.
Bioinformatics at WSU Matt Settles Bioinformatics Core Washington State University Wednesday, April 23, 2008 WSU Linux User Group (LUG)‏
Introduction to Bioinformatics Spring 2008 Yana Kortsarts, Computer Science Department Bob Morris, Biology Department.
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
Welcome to Chem 434 Bioinformatics March 25, 2008 Review of course prerequisites Review of syllabus Review of Bioinformatics Course website Course objectives.
Workshop in Bioinformatics 2010 What is it ? The goals of the class… How we do it… What’s in the class Why should I take the class..
Workshop in Bioinformatics 2010 Class # Class 8 March 2010.
STAT115 STAT215 BIO512 BIST298 Introduction to Computational Biology and Bioinformatics Spring 2015 Xiaole Shirley Liu Please Fill Out Student Sign In.
Workshop in Bioinformatics 2009 Class 2 Michal Linial Room A-515
Ayesha Masrur Khan Spring Course Outline Introduction to Bioinformatics Definition of Bioinformatics and Related Fields Earliest Bioinformatics.
EECS 395/495 Algorithmic Techniques for Bioinformatics General Introduction 9/27/2012 Ming-Yang Kao 19/27/2012.
BIO337 Systems Biology/Bioinformatics (course # 50524) Spring 2014 Tues/Thurs 11 – 12:30 PM BUR 212 Edward Marcotte/Univ. of Texas/BIO337/Spring 2014.
Bioinformatics Jan Taylor. A bit about me Biochemistry and Molecular Biology Computer Science, Computational Biology Multivariate statistics Machine learning.
Bioinformatics.
Bioinformatics Sean Langford, Larry Hale. What is it?  Bioinformatics is a scientific field involving many disciplines that focuses on the development.
Manifestations of a Code Genes, genomes, bioinformatics and cyberspace – and the promise they hold for biology education.
Bioinformatics and medicine: Are we meeting the challenge?
Introduction to Bioinformatics Prologue. Bioinformatics Living things have the ability to store, utilize, and pass on information Bioinformatics strives.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Objectives.
Intelligent systems in bioinformatics Introduction to the course.
Introduction to Science Informatics Lecture 1. What Is Science? a dependence on external verification; an expectation of reproducible results; a focus.
Bioinformatics Core Facility Guglielmo Roma January 2011.
Overview of Bioinformatics 1 Module Denis Manley..
AdvancedBioinformatics Biostatistics & Medical Informatics 776 Computer Sciences 776 Spring 2002 Mark Craven Dept. of Biostatistics & Medical Informatics.
WMU CS 6260 Parallel Computations II Spring 2013 Presentation #1 about Semester Project Feb/18/2013 Professor: Dr. de Doncker Name: Sandino Vargas Xuanyu.
Central dogma: the story of life RNA DNA Protein.
EB3233 Bioinformatics Introduction to Bioinformatics.
Bioinformatics Curriculum Issues, goals, curriculum.
Algorithms for Biological Sequence Analysis Kun-Mao Chao ( 趙坤茂 ) Department of Computer Science and Information Engineering National Taiwan University,
The iPlant Collaborative Vision Enable life science researchers and educators to use and extend cyberinfrastructure.
The iPlant Collaborative Vision Enable life science researchers and educators to use and extend cyberinfrastructure.
An Introduction to NCBI & BLAST National Center for Biotechnology Information Richard Johnston Pasadena City College.
A Librarian’s Guide to NCBI. The Course provided… B asic knowledge and skills for librarians interested in helping patrons use online molecular databases.
High throughput biology data management and data intensive computing drivers George Michaels.
BCH339N Systems Biology/Bioinformatics (course # 54040) Spring 2016 Tues/Thurs 11 – 12:30 PM BUR 212.
STAT115 STAT215 BIO512 BIST298 Introduction to Computational Biology and Bioinformatics Spring 2016 Xiaole Shirley Liu.
1 Delivering Bioinformatics Training: Bridging the Gaps Between Computer Science and Biomedicine Christopher Dubay Ph.D. James M. Brundege Ph.D. William.
Bioinformatics Educated by Zhenglin Zhu School of Life Sciences, Chongqing U.
1 Using DLESE: Finding Resources to Enhance Teaching Shelley Olds Holly Devaul 11 July 2004.
Computer Applications and Bioinformatics
BME435 BIOINFORMATICS.
Bioinformatics Overview
Pathway Informatics 16th August, 2017
Introduction to Bioinformatics
Introduction to Bioinformatics and Functional Genomics
Biological Databases By: Komal Arora.
Theory and Practice of Web Technology
Making “Open Data” Work: Challenges for Data Integration in Genomics Research
Lecture 2.1.
생물정보학 Bioinformatics.
LEARN WHY COMPUTERS ARE REVOLUTIONIZING BIOLOGY!
Cpt S 471/571: Computational Genomics
Algorithms for Biological Sequence Analysis
Mangaldai College, Mangaldai
PABIO 590B Advanced Topics in Bioinformatics
Genomes and Their Evolution
Genome organization and Bioinformatics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Cpt S 471/571: Computational Genomics
LESSON 1 INTNRODUCTION HYE-JOO KWON, Ph.D /
The Study of Biological Information
CSE 5290: Algorithms for Bioinformatics Fall 2009
Gene Expression Analysis
MCBIOS 2016 – University of Memphis, TN
Applying principles of computer science in a biological context
BCH339N Systems Biology/Bioinformatics (course # 54510)
Session 1: WELCOME AND INTRODUCTIONS
BF528 - Applications in Translational Bioinformatics
Introduction to Bioinformatics
Presentation transcript:

BF528 - Applications in Translational Bioinformatics 1/19/2018

Instructor Introductions Adam Labadorf David Bray Rachael Ivison Kritika Karri Marzie Rasekh Meghan Thommes

Course Overview Survey course in bioinformatics Focus on high-throughput sequencing data, tools, and techniques Focus on practical skills Group work simulates real-world collaborative environment

Course Goals Survey current bioinformatics techniques in translational studies Give you hands-on experience working with high-throughput biological data and tools Read and understand papers that use bioinformatics in translational studies Develop shared vocabulary between biology and computation

Prerequisites Molecular and cell biology BF527, BE505/605 or equivalent Good-to-haves: Basic statistics knowledge Programming/linux cluster experience But don’t panic...

Course Organization http://bf528.readthedocs.io Wed/Fri 2:30-4:15 SED 208 Semi-flipped paradigm Online content limited to ~1 hr/class Class period split into two segments: Lecture or discussion of online material Project group meeting and discussion

Course Organization cont’d Students assigned into groups of 4 4 projects over the course of the semester No homeworks No exams

Schedule of Topics Class Day Date Topic Project Out/Due Lecturer 1 Fri Jan 19 Introduction Adam 2 Wed Jan 24 Computational Skills Primer Kritika 3 Jan 26 Genomics, Genes, and Genomes 4 Jan 31 Array Technologies 5 Feb 2 2nd Gen Sequencing Marzie 6 Feb 7 Sequence Analysis 1 - WGS/WES 7 Feb 9 Genomic Variation and SNP Analysis 8 Feb 14 Biological Data Formats 9 Feb 16 Databases 10 Feb 21 Sequence Analysis 2 - RNA-Seq 1/2 11 Feb 23 Biomarkers Marc 12 Feb 28 Sequence Analysis 3 - ChIP-Seq David 13 Mar 2 Phylogenetics Spring Break

Schedule of Topics cont’d 14 Wed Mar 14 Gene sets and enrichment 2/3 Kritika 15 Fri Mar 16 Replicability vs Reproducibility Strategies Adam 16 Mar 21 Computational Pipeline Strategies 17 Mar 23 Computational Environment Management 18 Mar 28 Sequence Visualization David 19 Mar 30 Microbiome: 16S Meghan 20 Apr 4 Microbiome: Metagenomics 3/4 21 Apr 6 Metabolomics 22 Apr 11 Proteomics/Mass Spectrometry Andrew 23 Apr 13 Single Cell Techniques 24 Apr 18 Integrative Genomics 25 Apr 20 Network Biology 26 Apr 25 Systems Biology 4 Trevor 27 Apr 27 The Future + Retrospective Ensemble

Projects Assigned into groups based on experience Groups are for the entire semester You will reproduce published findings from published manuscripts Each project has a full writeup

Project Groups Group members will play one of four roles: Data Curator - find, download, and organize data Programmer - process data into analyzable form Analyst - transform processed data into interpretable form Biologist - understand paper and biological context, help interpret results Roles rotate for each project Structured class time to help facilitate group work and help each other!

Project Group Meeting : Wednesdays Time allotted for groups to meet and discuss progress “Stand-up” meeting structure: “What did I work on since our last meeting?” “What challenges did I encounter?” “Are there any obstacles to completing my work?” “What will I be working on for next meeting?” Each group will make a brief status report at the end of class

Project Group Meeting : Fridays Time allotted for roles to meet and discuss progress Similar structure to Wednesdays Share challenges and solutions among roles Each role group will make a brief status report at the end of class

Project Report Organized like a published study Sections (primary role): Intro - background and motivation (Biologist) Data - data description (Data Curator) Methods - processing and tools (Programmer) Results - findings (Analyst) Discussion - interpret findings (Biologist) Conclusion (all)

Assessment Each project is 25% of your total grade Broken down: Intro, Conclusion - 2.5% Data, Methods, Results, Discussion, 20% Stand-up participation: 15%

Translational Bioinformatics

Biology as Data Science 1953 - DNA structure published in Nature 1972 - first genetic sequence determined, protein DataBank 1977 - Sanger sequencing, first genome sequenced 1983 - PCR technique invented 1990 - Human Genome Project begun 1995 - first bacterial genome sequenced, microarray technology first described 1997 - yeast genome on a microarray, sequencing by synthesis concept established 1998 - first multicellular eukaryote sequenced 2001 - first draft of human genome 2006 - Solexa Genome Analyzer released

“Big” Data Single Microarray dataset: ~500Mb Single short read dataset: ~2Gb-300Gb Human genome reference sequence: ~2Gb One run of Illumina instruments: HiSeq 2500: ~1Tb NovaSeq 6000: ~6Tb Gene Expression Omnibus (GEO): 2014: 1,237,138 samples, ~28 Tb 2018: 2,335,694 samples, ?? Tb

What is Bioinformatics? “Bioinformatics is an interdisciplinary field that develops methods and software tools for understanding biological data. As an interdisciplinary field of science, bioinformatics combines computer science, statistics, mathematics, and engineering to study and process biological data.” Wikipedia

Conceptual History of Bioinformatics Biological sequences digitized Biological databases needed to store sequences Search tools needed for databases Tools for analyzing data from searches Computational tools required to analyze human genome Sophisticated sequence analysis tools enable analysis of large amounts of sequencing data Sequencing data volume explodes, requiring new tools And here we are

The Biologist’s Tools Wet lab biologists: Bioinformaticians:

Sequence: The Fundamental Datatype Computer Science genome assembly, homology, phylogeny Physics DNA/RNA/protein structure, drug prediction Statistics gene expression, population genetics, biomarkers Mathematics metabolic modeling, synthetic biology, systems biology

Genbank Sequences

Translational Bioinformatics “Translational Bioinformatics is an emerging field in the study of health informatics, focused on the convergence of molecular bioinformatics, biostatistics, statistical genetics, and clinical informatics.” Wikipedia

Workshop 0. Basic Linux and Command Line Usage For Next Time Assignment: familiarize yourself with the material on basic command line usage found here: Workshop 0. Basic Linux and Command Line Usage

SSH and SCC SCC - Shared Compute Cluster You all have accounts on SCC You will need an ssh client program to connect: Mac, Linux: Terminal (included) Windows: MobaXTerm Connect to: scc1.bu.edu with your BU username/password Demonstration

Survey Results

How comfortable are you with the following programming languages/concepts?

How comfortable are you with the following statistics concepts?

How comfortable are you with the following biology concepts?

How comfortable are you with the following bioinformatics concepts?

Rank the following roles that you might play in a project in order of preference

What do you hope to learn?

How do you plan to use what you learn?