Presentation is loading. Please wait.

Presentation is loading. Please wait.

Stabbing balls and simplifying proteins Ovidiu Daescu and Jun Luo Department of Computer Science University of Texas at Dallas Richardson, TX 75080.

Similar presentations


Presentation on theme: "Stabbing balls and simplifying proteins Ovidiu Daescu and Jun Luo Department of Computer Science University of Texas at Dallas Richardson, TX 75080."— Presentation transcript:

1 Stabbing balls and simplifying proteins Ovidiu Daescu and Jun Luo Department of Computer Science University of Texas at Dallas Richardson, TX 75080

2 Problem definition Input: indexed sequence of balls B= {B 1, B 2, …, B n } in R 3, with each B i specified by a center and radius pair (p i, ε i ).  Let C= {p 1, p 2, …, p n }, be the set of center points. Find set of stablers defined by a subset P= {p i 1, p i 2, …, p i m } of C such that: 1.i 1 =1, i m =n, and i j є {1,2,…,n}, for j= 1,2,…,m, 2.i j < i j+1, for j=1,2,…,m-1, 3.The line segment p i j p i j+1 (or the line p i j p i j+1 ) stabs each of the balls {B i j, B i j +1, …, B i j+1 }, 4.There is no other subset P’ of C satisfying the first three conditions and of smaller size than P, i.e., m is minimized. Stabler Ball B i pipi εiεi

3 Applications Simplification of proteins for visualization, manipulation, (approximate) matching and searching in protein database, and neural map representation. The problem is a generalization of the polygonal chain simplification problem. Approximating segment pipi ε

4 Key difference from chain simplification: Chain simplification ε Our simplification εiεi

5 Related Works Sergey Bereg, Cylindrical Hierarchy for Deforming Necklaces, International Journal of Computational Geometry & Applications, 14(1-2): 3-18, 2004 –Compute optimal cylindrical cover of a necklace with n beads (balls) in R 3 in polynomial time. –The n balls are ordered in sequence; if not, the problem is NP-hard.

6 Related Works Binhai Zhu, Approximating 3D points with Cylindrical Segments, International Journal of Computational Geometry & Applications, 14(3),189-201,2004. –Given a set S of n points in R 3, compute k cylindrical segments enclosing S such that the sum of their radii is minimized. –For unordered points: NP hard. –Polynomial time approximation scheme (PTAS) for any fixed k>1 is possible. –Used for constructing neural maps and some other computational biology applications.

7 Related Works Frederic Vivien, Nicolas Wicker, Minimal Enclosing Parallelepiped in R 3, CG:T&A, 29(2004), 177-190. –Find min. volume parallelepiped enclosing a set of n points. –O(n 6 ) time.

8 Our results Quadratic or near quadratic time solutions for line segment stablers. Subcubic, O(n 2.4 log O(1) n) time for line stablers Experimental results: –for proteins with thousands of atoms, our solutions have much better performance than previous solutions; –actual running time is much smaller than the worst case time shows.

9 Line segment based simplification L p metric: distance between two points X and Y = For example, X=(x 1,y 1 ), Y = (x 2,y 2 ) –L 1 = |x 1 -x 2 | + |y 1 -y 2 | –L 2 = –L ∞ =|x 1 -x 2 | if |x 1 -x 2 | ≥ |y 1 -y 2 | or |y 1 -y 2 | if |x 1 -x 2 | < |y 1 -y 2 |, used for protein simplification X Y 3 4 5 L 1 =7 L 2 =5 L ∞ =4

10 Line segment based simplification L 2 metric: O(n 2 logn) time, O(n 2 ) space algorithm. –similar to the polygonal chain simplification algorithm of Daescu et.al. –replace the line segment p i p j by two rays; for each ray, intersect it and the projections from p i (or p j ) of the balls {B i+1,B i+2,…,B j } with a plane. –reduces to deciding whether the projection of p i along the ray is within the common intersection of some disks (the projected balls).

11 Line segment based simplification L 1 or L ∞ metric: O(n 2 ) time, O(n) space. L 1 (L ∞ ) “balls” are cubes (crosspolytopes). Main idea: the common intersection of the projections of n L 1 or L ∞ “balls”, from any view point onto any plane, if not empty, is a convex region bound by O(1) edges.

12

13 Line segment based simplification O(n 2 ) time and space if each B i is a convex polytope and the complexities of the projections Proj(B i, W, p) of the B i ’s from any point p onto any plane W satisfy the condition: –The intersection of the projection Proj(B i, W, p) of the B i ’s from any point p onto any plane W is a convex polygon of size O(n). –The algorithm is similar to the one for the L 2 metric.

14 Line based simplification For n indexed points P = {p 1, p 2, …, p n } in R d, d≥ 3, with O(n 3-3/( └ f(d)/2 ┘ +1) * log O(1) n) time and space one can report for each line p i p j, 1≤i<j≤n, the farthest point p k with i<k<j  f(d) = O(d 2 ). For protein chains, the radius of each ball B i takes value from a small set, and f(d) = 3 2 = 9  the minimum size set P of stablers can be found with O(n 2.4 log O(1) n) time and space.

15 Main idea: –Use a (constant number of) balanced binary tree structure. –At each node, construct a farthest-point-from-line data structure, balanced with respect to the number of queries. –O(n 2 ) queries overall.

16 Experimental Results Use RMSD to measure the similarity between the original and the simplified chains. Different number of atoms in the original and the simplified chains.

17 Experimental Results 1CA2: 256 alpha carbons Simplified 1CA2: 168 alpha carbons RMSD= 0.62 Å

18 Experimental Results 1DDZ_A: 481 alpha carbons Simplified 1DDZ_A : 340 alpha carbons RMSD= 0.44 Å

19 Experimental Results 1DDZ_B: 481 alpha carbons Simplified 1DDZ_B : 351 alpha carbons RMSD= 0.43 Å

20 Conclusions The RMSDs are very small, the simplified backbones are similar to the originals, while having significantly simpler representations (e.g., about 33% reduction in size for 1CA2). The simplified chains can be used in place of the original ones in visualization, alignment, classification of protein structures, etc.


Download ppt "Stabbing balls and simplifying proteins Ovidiu Daescu and Jun Luo Department of Computer Science University of Texas at Dallas Richardson, TX 75080."

Similar presentations


Ads by Google