1 CISC 841 Bioinformatics (Fall 2008) Review Session.

Slides:



Advertisements
Similar presentations
Support Vector Machine
Advertisements

Lecture 9 Support Vector Machines
ECG Signal processing (2)
SVM - Support Vector Machines A new classification method for both linear and nonlinear data It uses a nonlinear mapping to transform the original training.
Support Vector Machines
Computer vision: models, learning and inference Chapter 8 Regression.
Separating Hyperplanes
Consistent probabilistic outputs for protein function prediction William Stafford Noble Department of Genome Sciences Department of Computer Science and.
Support Vector Machines
CISC667, F05, Lec26, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) Genetic networks and gene expression data.
Hidden Markov Models I Biology 162 Computational Genetics Todd Vision 14 Sep 2004.
Lecture 5: Learning models using EM
Support Vector Classification (Linearly Separable Case, Primal) The hyperplanethat solves the minimization problem: realizes the maximal margin hyperplane.
L15:Microarray analysis (Classification). The Biological Problem Two conditions that need to be differentiated, (Have different treatments). EX: ALL (Acute.
Classification Problem 2-Category Linearly Separable Case A- A+ Malignant Benign.
Support Vector Machine (SVM) Classification
Support Vector Machines
6. Gene Regulatory Networks
Lecture 10: Support Vector Machines
Pattern Recognition. Introduction. Definitions.. Recognition process. Recognition process relates input signal to the stored concepts about the object.
Mehdi Ghayoumi MSB rm 132 Ofc hr: Thur, a Machine Learning.
Greg GrudicIntro AI1 Support Vector Machine (SVM) Classification Greg Grudic.
Optimization Theory Primal Optimization Problem subject to: Primal Optimal Value:
Cristina Manfredotti D.I.S.Co. Università di Milano - Bicocca An Introduction to the Use of Bayesian Network to Analyze Gene Expression Data Cristina Manfredotti.
CISC667, F05, Lec27, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) Review Session.
Gaussian Processes for Transcription Factor Protein Inference Neil D. Lawrence, Guido Sanguinetti and Magnus Rattray.
Whole Genome Expression Analysis
CSE182 L14 Mass Spec Quantitation MS applications Microarray analysis.
机器学习 陈昱 北京大学计算机科学技术研究所 信息安全工程研究中心. Concept Learning Reference : Ch2 in Mitchell’s book 1. Concepts: Inductive learning hypothesis General-to-specific.
Learning Structure in Bayes Nets (Typically also learn CPTs here) Given the set of random variables (features), the space of all possible networks.
Machine Learning Lecture 11 Summary G53MLE | Machine Learning | Dr Guoping Qiu1.
计算机学院 计算感知 Support Vector Machines. 2 University of Texas at Austin Machine Learning Group 计算感知 计算机学院 Perceptron Revisited: Linear Separators Binary classification.
CISC841, F08, Lec2, Liao CISC 841 Bioinformatics (Fall 2008) A Primer on Molecular Biology & Bioinformatics challenges.
1/15 Strengthening I-ReGEC classifier G. Attratto, D. Feminiano, and M.R. Guarracino High Performance Computing and Networking Institute Italian National.
Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.
+ Get Rich and Cure Cancer with Support Vector Machines (Your Summer Projects)
Blind Information Processing: Microarray Data Hyejin Kim, Dukhee KimSeungjin Choi Department of Computer Science and Engineering, Department of Chemical.
CISC667, F05, Lec22, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) Support Vector Machines I.
CS 478 – Tools for Machine Learning and Data Mining SVM.
Sparse Kernel Methods 1 Sparse Kernel Methods for Classification and Regression October 17, 2007 Kyungchul Park SKKU.
1 CISC 841 Bioinformatics (Fall 2007) Kernel engineering and applications of SVMs.
Class 23, 2001 CBCl/AI MIT Bioinformatics Applications and Feature Selection for SVMs S. Mukherjee.
1  The Problem: Consider a two class task with ω 1, ω 2   LINEAR CLASSIFIERS.
Lecture 2: Statistical learning primer for biologists
Christopher M. Bishop Object Recognition: A Statistical Learning Perspective Microsoft Research, Cambridge Sicily, 2003.
CZ5225: Modeling and Simulation in Biology Lecture 7, Microarray Class Classification by Machine learning Methods Prof. Chen Yu Zong Tel:
Final Exam Review CS479/679 Pattern Recognition Dr. George Bebis 1.
Maximum Entropy Discrimination Tommi Jaakkola Marina Meila Tony Jebara MIT CMU MIT.
ECE 471/571 - Lecture 19 Review 11/12/15. A Roadmap 2 Pattern Classification Statistical ApproachNon-Statistical Approach SupervisedUnsupervised Basic.
CS Statistical Machine learning Lecture 12 Yuan (Alan) Qi Purdue CS Oct
CSE182 L14 Mass Spec Quantitation MS applications Microarray analysis.
Greg GrudicIntro AI1 Support Vector Machine (SVM) Classification Greg Grudic.
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL Classification COMP Seminar BCB 713 Module Spring 2011.
Learning by Loss Minimization. Machine learning: Learn a Function from Examples Function: Examples: – Supervised: – Unsupervised: – Semisuprvised:
SUPERVISED AND UNSUPERVISED LEARNING Presentation by Ege Saygıner CENG 784.
CISC667, S07, Lec25, Liao1 CISC 467/667 Intro to Bioinformatics (Spring 2007) Review Session.
CMPS 142/242 Review Section Fall 2011 Adapted from Lecture Slides.
Chapter 3: Maximum-Likelihood and Bayesian Parameter Estimation (part 2)
Pattern Recognition CS479/679 Pattern Recognition Dr. George Bebis
Combining HMMs with SVMs
CISC 841 Bioinformatics (Fall 2007) Hybrid models
The following slides are taken from:
CISC 841 Bioinformatics (Fall 2007) A Primer on Molecular Biology & Bioinformatics challenges CISC841, F07, Lec2, Liao.
CISC 667 Intro to Bioinformatics (Spring 2007) Review session for Mid-Term CISC667, S07, Lec14, Liao.
CISC 667 Intro to Bioinformatics (Fall 2005) Hidden Markov Models (IV)
Multivariate Methods Berlin Chen
CISC 667 Intro to Bioinformatics (Spring 2007) Genetic networks and gene expression data CISC667, S07, Lec24, Liao.
Linear Discrimination
Deep Learning in Bioinformatics
Chapter 3: Maximum-Likelihood and Bayesian Parameter Estimation (part 2)
Presentation transcript:

1 CISC 841 Bioinformatics (Fall 2008) Review Session

2 Basics of Molecular biology Central dogma –Transcription –Translation Genetic code DNA –Double helix –Watson-Crick binding RNA –Secondary structures Genes –Compositional Structure –Reading frames Proteins –Secondary structure: alpha helices, beta sheets, and coils –3-d structure Gene Regulations (Operons) DNA Microarray Cloning PCR Gel electrophoresis 2D gel + MS Yeast 2 hybrid system

3 Computational methods Kernel based methods -Linear SVM -Rosenblatt algorithm (Primal and dual forms) -Novikoff theorem (You do not need to memorize the proof) -Maximum margin (primal and dual form) -Lagrangian multiplier, KKT condition -Gradient descent algorithm for the dual form -Support vectors -Nonlinear SVM -Mapping to feature space (high dimension) -Kernel functions: generic kernels -Mercer’s theorem -Soft margin (slack variables) -Principal component analysis (PCA) -Dimension reduction (projection onto a few most differentiating directions) -Kernel based PCA (capable of nonlinear projection) -Binary versus multiclass classification -Applications: classifying genes based on expression profiles

4 Computational methods (cont’d) Bayesian networks -Joint probability, factorization based on chain rules -Bayes’ rule -Bayesian networks -Conditionally independence, D-separation, Markov condition -Model construction (scored-based, maximum posterior probability) -Parameter estimation (Maximum likelihood) -Model averaging -Bootstrap -Applications: inferring regulatory networks from gene expression data

5 Computational methods (cont’d) Hidden Markov models -Three major problems -Decoding -Likelihood -Training : Parameter estimation -Model structure -Incorporating domain knowledge -Genetic algorithm -Model equivalence -Mutual entropy -NP-Hard -Heuristics: quasi-consensus based -Applications: predicting transmembrane topology and classifying protein families Gradient descent algorithm Genetic algorithm Evaluation metrics -Sensitivity -Specificity -ROC

6 About the Exam –Time and Place: 3:30PM-4:45PM,Thursday, November A Smith Hall –closed-book –Four parts Basics of Molecular Biology[10 points] Kernel Based Methods [40 points] Bayesian Networks [35 points] Hidden Markov models [15 points]