Protein Encoding Optimization Student: Logan Everett Mentor: Endre Boros Funded by DIMACS REU 2004.

Slides:



Advertisements
Similar presentations
Ordinary Least-Squares
Advertisements

Standard Minimization Problems with the Dual
Sections 4.1 and 4.2 The Simplex Method: Solving Maximization and Minimization Problems.
Section 4.6 (Rank).
Ch 2. 6 – Solving Systems of Linear Inequalities & Ch 2
Lecture 8 – Nonlinear Programming Models Topics General formulations Local vs. global solutions Solution characteristics Convexity and convex programming.
10/11/2001Random walks and spectral segmentation1 CSE 291 Fall 2001 Marina Meila and Jianbo Shi: Learning Segmentation by Random Walks/A Random Walks View.
The Simplex Method: Standard Maximization Problems
Linear Systems of Equations Ax = b Marco Lattuada Swiss Federal Institute of Technology - ETH Institut für Chemie und Bioingenieurwissenschaften ETH Hönggerberg/
Lecture 6 Intersection of Hyperplanes and Matrix Inverse Shang-Hua Teng.
Protein String Encoding Student: Logan Everett Mentor: Endre Boros Funded by DIMACS REU 2004.
Procrustes analysis Purpose of procrustes analysis Algorithm Various modifications.
Solving the Protein Threading Problem in Parallel Nocola Yanev, Rumen Andonov Indrajit Bhattacharya CMSC 838T Presentation.
1 A Novel Binary Particle Swarm Optimization. 2 Binary PSO- One version In this version of PSO, each solution in the population is a binary string. –Each.
1 Systems of Linear Equations Error Analysis and System Condition.
Lecture 7: Matrix-Vector Product; Matrix of a Linear Transformation; Matrix-Matrix Product Sections 2.1, 2.2.1,
Unit 1 Equations, Inequalities, and Functions. Unit 1: Equations, Inequalities, and Functions Overview: In this unit you will model real-world solutions.
Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall.
Duality Theory 對偶理論.
Section 3.6 – Solving Systems Using Matrices
1 / 20 Arkadij Zakrevskij United Institute of Informatics Problems of NAS of Belarus A NEW ALGORITHM TO SOLVE OVERDEFINED SYSTEMS OF LINEAR LOGICAL EQUATIONS.
Section 4-1: Introduction to Linear Systems. To understand and solve linear systems.
Chapter 9 Matrices and Determinants Copyright © 2014, 2010, 2007 Pearson Education, Inc Multiplicative Inverses of Matrices and Matrix Equations.
1 Ch. 4 Linear Models & Matrix Algebra Matrix algebra can be used: a. To express the system of equations in a compact manner. b. To find out whether solution.
Algebra 3: Section 5.5 Objectives of this Section Find the Sum and Difference of Two Matrices Find Scalar Multiples of a Matrix Find the Product of Two.
Systems of Equations and Inequalities Systems of Linear Equations: Substitution and Elimination Matrices Determinants Systems of Non-linear Equations Systems.
Elementary Linear Algebra Anton & Rorres, 9th Edition
Section 2.7 Solving Inequalities. Objectives Determine whether a number is a solution of an inequality Graph solution sets and use interval notation Solve.
DIGITAL COMMUNICATIONS Linear Block Codes
Slide 14.1 Nonmetric Scaling MathematicalMarketing Chapter 14 Nonmetric Scaling Measurement, perception and preference are the main themes of this section.
Linear Programming Advanced Math Topics Mrs. Mongold.
© 2010 Pearson Prentice Hall. All rights reserved. CHAPTER 7 Algebra: Graphs, Functions, and Linear Systems.
Learning Spectral Clustering, With Application to Speech Separation F. R. Bach and M. I. Jordan, JMLR 2006.
Information Theory Linear Block Codes Jalal Al Roumy.
Pre-Calculus Section 1.7 Inequalities Objectives: To solve linear inequalities. To solve absolute value inequalities.
Solving Linear Inequalities Lesson 5.5 linear inequality: _________________________________ ________________________________________________ solution of.
Solving Systems of Equations by Elimination (Addition) Section 3.2, Part II.
Implicit Hitting Set Problems Richard M. Karp Erick Moreno Centeno DIMACS 20 th Anniversary.
Review of Matrix Operations Vector: a sequence of elements (the order is important) e.g., x = (2, 1) denotes a vector length = sqrt(2*2+1*1) orientation.
GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function Sara Mostafavi, Debajyoti Ray, David Warde-Farley,
2.5 – Determinants and Multiplicative Inverses of Matrices.
Copyright © 2011 Pearson Education, Inc. Slide
1.1 The row picture of a linear system with 3 variables.
Sullivan Algebra and Trigonometry: Section 12.9 Objectives of this Section Set Up a Linear Programming Problem Solve a Linear Programming Problem.
Regularized Least-Squares and Convex Optimization.
Algebra 1 Section 7.6 Solve systems of linear inequalities The solution to a system of linear inequalities in two variable is a set of ordered pairs making.
Use Inverse Matrices to Solve Linear Systems
PreCalculus Section 14.3 Solve linear equations using matrices
Linear Inequalities in One Variable
Part 2 Linear block codes
Algebra 1 Section 6.5 Graph linear inequalities in two variables.
Review of Matrix Operations
Quadratic Inequalities
The Inverse of a Square Matrix
Systems of Equations and Inequalities
Chapter 2 Equations and Inequalities in One Variable
Linear Inequalities and Absolute Value Inequalities
3-3 Optimization with Linear Programming
Algebra: Graphs, Functions, and Linear Systems
Systems of Linear Equations in Two Variables
Graphing Linear Equations
Linear Inequalities in Two Variables
6 minutes Warm-Up Find each product..
9.3 Linear programming and 2 x 2 games : A geometric approach
Lecture 19 Linear Program
Sec 3.5 Inverses of Matrices
4 minutes Warm-Up Solve and graph. 1) 2).
Graph Linear Inequalities in Two Variables
Chapter 7: Systems of Equations and Inequalities; Matrices
Solving Linear Systems of Equations - Inverse Matrix
Presentation transcript:

Protein Encoding Optimization Student: Logan Everett Mentor: Endre Boros Funded by DIMACS REU 2004

Project Overview Model Biological Scoring Matrices Weighted Binary Hamming Space Optimize Using Linear Programming Accurate Random Generation

Scoring Matrices A Q M K R H… A R M I F L… –3 –3 -3…

Encode To Binary Strings Hamming Distances Easy to Approximate on Binary Strings Statistically Proven Methods More Efficient How Do Similarity and Distance Relate? Inverse Relationship First Create “Real” Distance Vector: D

Precise Problem: Distortion D ij (1–  )  h[  i,  j ]  D ij (1+  )  unique pairs i,j ( n C 2 ) s.t. 0    1 and 0 

Encoding Scheme as Vector C = S = T = P = A = G = y2y1y2y1

Modified Inequality D(1–  )  Ax  D(1+  ) s.t. 0    1 and 0  Let x = y

Linear Programming Problem Need All Linear Expressions D(1 –  )  Ax and Ax  D(1 +  ) -Ax – D   -D and Ax – D   D All x i,   0 Goal: Minimize  Solve with CPLEX

Problem Size Number of Constraints (Rows) 2( n C 2 ) = 380 Number of Variables (Columns) 2 n-1 = 524,288 Total Size – App. 2x10 8 CPLEX – App. 1 Minute

Linear Programming Solution Solution Contains: Min Value of  Scaled Weight Vector x Non-Integral Values in x Convert to p Vector X =  x i p i = x i / X

Random Encodings Randomly Select Cross Sections Based on Percent Weights Can Scale For Any N-Length Encoding Longer Encodings Should Approach Minimum Distortion

Results

Courtesy of DIMACS Mentor: Endre Boros – RUTCOR Logan Everett – DIMACS REU 2004