Lecture 1.01 Developing the Tools Montreal 2004 Course Introduction John J. Salama
Lecture 1.02 Today’s Talk Course Introduction Bioinformatics Introduction Data Introduction Applications Introduction
Lecture 1.03 COURSE INTRODUCTION
Lecture 1.04 Introductions What is your work experience? How will bioinformatics help you in your work? What do you hope to learn from this course?
Lecture 1.05 Canadian Bioinformatics Workshops Bioinformatics Genomics Proteomics Developing the Tools
Lecture 1.06 Course Objectives Data modeling Using existing open source bioinformatics packages Using and creating databases Algorithms and data mining Web application design Foundations of software architecture
Lecture 1.07 Detailed Schedule Day 1 (May 3 rd ) Day 2 (May 4 th ) Day 3 (May 5 th ) Day 4 (May 6 th ) Day 5 (May 7 th ) Day 6 (May 8 th ) 08:30Registration & Breakfast Breakfast 09:001.0 Welcome and Course Introduction John Salama 2.0 Data Modeling I XML & XSLT Marc Dumontier 3.0 Bio Packages I Doron Betel 4.0 Data Mining Sebastien Lemieux 5.0 Java Servlets & Java Server Pages Francois Major 6.0 Existing Tools in Bioinformatics Howard Feldman 10.30Break 11:001.1 Introduction to J2EE Development (MVC) John Salama 2.1 Data Modeling II XML Schema & JAXB Marc Dumontier 3.1 Bio Packages II Sohrab Shah 4.1 Data Mining Lab Sohrab Shah 5.1 Development Servlets and JSPs Lab John Salama Open Lab 12:30Lunch on your own 13:451.2 Preparing your Development Environment Lab Dominik Gehl 2.2 Microarray Data Modeling Lab Sebastien Lemieux 3.2 MySQL/JDBC Sohrab Shah 4.2 Algorithms Sebastien Lemieux/ Francois Major 5.2 Transforming Data: Using Apache Xalan to apply an XSLT in Java Marc Dumontier Open Lab 15:15Break Evaluation survey and roundtable discussion 15:451.3 Eclipse, Tomcat, and ANT (CVS) Steve Montgomery 2.3 Auto-generating Java classes using JAXB Marc Dumontier 3.3 Creating a mySQL Database Lab Dominik Gehl 4.3 Algorithms Lab Sebastien Lemieux/ Francois Major Open Lab 17:15Dinner on your own No-host social 19:00 – 21: Microarray Guest Lecture Rob Kearney (Mc Gill Univeristy) Open Lab
Lecture 1.08 Assignments & Marking Scheme DayBrief DescriptionNumber % of Final 2 Modelling microarray data and auto-generating Java 1a25 4Data Mining225 4Algorithms325 5 Selecting from a database and displaying XML in a browser 1b25
Lecture 1.09 Administration Login / Password –Will be given to you in the first lab After First Login – $ cbw.setup To start services (apache, tomcat, mysql) – $ cbw.start mySQL –Database location /u/$USER/mysql/data –phpMyAdmin -
Lecture Administration CVS –$ cvs -d login –$ cvs -d checkout Services –apache - (/u/$USER/apache/htdocs) –tomcat - (/u/$USER/jakarta-tomcat/webapps) –axis -
Lecture Sponsors UOttawa
Lecture BIOINFORMATICS INTRODUCTION
Lecture What is Bioinformatics The application of computer technology to the management of biological information Software applications used to gather, store, analyze and integrate biological information Databases and algorithms designed for the purpose of enhancing the process of biological research
Lecture What is Bioinformatics NCBI: “Research, development, or application of computational tools and approaches for expanding the use of biological, medical, behavioral or health data including those to acquire, store, archive, analyze, or visualize such data.” Lincoln Stein: “Biologists using computers, or the other way around.”
Lecture “Hot” Bioinformatics Topics Gene Expression / Regulation Protein / RNA Structure Ontologies Genome Sequencing / Annotation Molecular Interactions
Lecture Where is Bioinformatics Used Pharmaceuticals Universities Biotech Companies Public Good / Health Research Institutes Hardware Manufacturers Government Agencies THESE ARE OUR CLIENTS
Lecture Why Do We Need Bioinformatics Accessibility of biological data Data integration… at least within an organization Processing of data (data mining) Prediction and analysis Storage of mass amounts of data (high-throughput experiments)
Lecture DATA INTRODUCTION
Lecture How Much Data - GenBank Source: NCBI
Lecture How Much Data - PDB Source: RSCB
Lecture How Much Data - BIND Source: Blueprint North America
Lecture How Much Data - PubMed Source: Israel Institute of Technology
Lecture What Do We Do With All This Data? Design data structures to represent this information unambiguously Develop databases to house the data Develop accessible software to submit new data Develop fast applications to query the data Develop fast applications to analyze the data (data mining)
Lecture APPLICATIONS INTRODUCTION
Lecture Bioinformatics Application Trends Web based GUI tool accessibility Data marts Web services Integration Services Pre-analyzed Data Services
Lecture Languages of Bioinformatics Perl Python Java C++ C And More…
Lecture Today’s World of Bioinformatics Note: This is not intended to be an extensive list of bioinformatics institutions
Lecture All Sorts of Bioinformatics Tools Note: This is not intended to be an extensive list of bioinformatics tools