Download presentation
Presentation is loading. Please wait.
1
Physical Mapping Problem D7526010@csie.ntu.edu.tw
2
Problem Definition Physical mapping 的定義
3
A B C D E F G H J K L M N O P Q H J DNA Fragment of DNA
4
Why We Need Physical Mapping 可以利用這個地圖將 DNA 做完全排序 可以知道基因到底如何對人類產生作用 利用人造蛋白質... 等等來改進遺傳體質
5
AGACTAGTCGTAACGATCGCTAATTTAAGGCTACT..... 人類染色體 ( 約 bp) Physical map ( 約 bp) DNA Sequencing ( 約 bp)
6
Why We Need Physical Mapping 可以利用這個地圖將 DNA 做完全排序 可以知道基因到底如何對人類產生作用 利用人造蛋白質... 等等來改進遺傳體質 可以得知基因 ( 或標記 ) 的大約位置 對於一些遺傳疾病可以得到較多的資訊 可以幫助偵測是否具有遺傳疾病
7
A B C D E F G H J K L M N O P Q H J DNA Fragment of DNA α
9
target DNA 加入酵素
10
Partial Digest Problem by single enzyme A restriction sites: a1< a2< a3<.....< ap multiset of fragment lengths {aj- ai,i<j}
11
target DNA
12
Double Digest Problem (DDP) Clones first completely digested by enzyme A,then by B, finally A and B together restriction sites: by A: a 1 < a 2 < a 3 <.....< a p by B: b 1 < b 2 < b 3 <.....< b q by A+B : c 1 < c 2 < c 3 <.....< c p+q Reconstruct the restriction sites from these multisets
13
Example : DDP Enzyme A 3 6 8 10 Enzyme B 4 5 7 11 Enzyme A+B 1 2 3 5 6 7
14
Solution
15
Double Digest Problem (DDP)
16
target DNA........
17
By Probe Approach
18
ATGCGCTAACTGGACTTCAAGCCTAAACTGCATCAGACTT................. TACGCGATTGACCTGAAGT........ Complementary probe target DNA The Spirit of Hybridization
19
target DNA A B C D E F GHGH I J 1234512345
20
12345 A B C D E F G H I J
21
12345 A111 B1 C111 D11 E11 F11 G11 H111 I111 J1
22
JDFIEGACHBJDFIEGACHB 1 2 3 4 5
23
12345 B1 H11 C111 A111 G11 E11 I111 F111 D11 J1
24
1 2 3 4 5 6 A、CA、C C、D、EC、D、E E、FE、F A、F、GA、F、G G、H、IG、H、I E、F、I、J、KE、F、I、J、K 1 2 3 4 5 6 A、B、CA、B、C C、D、EC、D、E E、FE、F F、GF、G G、H、IG、H、I I、J、KI、J、K False Negative
25
1 2 3 4 5 6 A、CA、C C、D、EC、D、E E、FE、F A、F、GA、F、G G、H、IG、H、I E、F、I、J、KE、F、I、J、K 1 2 3 4 5 6 A、B、CA、B、C C、D、EC、D、E E、FE、F F、GF、G G、H、IG、H、I I、J、KI、J、K False Positive
26
1 2 3 4 5 6 A、CA、C C、D、EC、D、E E、FE、F A、F、GA、F、G G、H、IG、H、I E、F、I、J、KE、F、I、J、K 1 2 3 4 5 6 A、B、CA、B、C C、D、EC、D、E E、FE、F F、GF、G G、H、IG、H、I I、J、KI、J、K Chimeric Clones
27
5 6 3 4 1 2 A、B、CA、B、C C、D、EC、D、E E、FE、F F、GF、G I、J、KI、J、K G、H、IG、H、I Clones Probes 1 2 3 4 5 6 A B C D E F G H I J K
28
5 6 3 4 1 2 A、B、CA、B、C C、D、EC、D、E E、F、KE、F、K I、J、K、F、GI、J、K、F、G I、J、KI、J、K G、H、IG、H、I Clones Probes 1 2 3 4 5 6 A B C D E F G H I J K
29
How To Use Traveling Salesman Problem to Solve Physical Mapping Problem
30
How to Convert to TSP? Hamming distance
31
ABCDEFGHIJ A0 B20 C020 D3330 E23320 F233200 G2312220 H12235530 I244111340 J4231333220 12345 A111 B1 C111 D11 E11 F11 G11 H111 I111 J1
32
How to Convert to TSP? Hamming distance Cycle weight = number of gaps transitions +2n
33
ABCDEFGHIJ A0 B20 C020 D3330 E23320 F233200 G2312220 H12235530 I244111340 J4231333220 12345 A111 B1 C111 D11 E11 F11 G11 H111 I111 J1
34
How to Convert to TSP? Hamming distance Cycle weight = number of gaps transitions +2n So, minimize the cycle weight is to the gap number
37
Our approach We also convert it to optimization problem Using more complicated model Using Genetic Algorithm to solve it. F(A) = X*C(A)+Y*P(A)+Z*N(A)+T*M(A)+ P*L(A).
38
(a) (b) The results of our approach tested on simulated data. The false negative rate is set as 0.1. The false positive rate is 0.05. The false negative rate is set as 0.1. The false positive rate is 0.01.
39
Experimental Results of our GA tested on Real data from chromosome 1 (a) It shows the results of our GA run with the data which is a contig with about 95 clones and about 120 probes (b) It shows the results of our GA run with the data which is a contig with about 172 clones and about 136 probes
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.