Download presentation
Presentation is loading. Please wait.
Published byViolet Walker Modified over 9 years ago
1
Aliya Sadeque BIOC 599 Supervisory Committee Meeting Wednesday December 19, 2007.
2
Outline About me Thesis project blueprint Course selection
3
Curriculum Vitae Queen’s University. Bachelor of Science (Honours) in Biochemistry. Minor in Computing. Graduated May, 2007
4
Previous Coursework Undergraduate Level Biochemistry: Proteins and Enzymes Physical Biochemistry Metabolism Molecular Biology Introductory Biochemistry Laboratory Protein Structure and Function Current Topics in Biochemistry Biochemistry of the Cell Advanced Molecular Biology
5
Previous Coursework Undergraduate Level Computing: Database Management Systems Neural and Genetic Computing Introduction to Data Mining System Level Programming Operating Systems Undergraduate Level Mathematics: Introduction to Statistics Discrete Math for Computer Scientists Modeling Techniques in Biology
6
Thesis Project Blueprint Context Why is this work necessary What kind of tools have been used to address it Longest Common Subsequence Part I: Explore LCSs in poxvirus Visualization Threshold frequency equation Part II: Develop an interface for use by biologists
7
Background “Promoter sequences might be identified as conserved islands in a divergent sea” Observed: 42-bp sequence showing “unusually high degree of sequence conservation” (Brunetti et al.) Are these claims reasonable? How can they be tested?
8
Tools Alignment 0 mismatch suffix tree Longest Common Subsequence Algorithm
9
Visualization
10
Threshold Frequency Figure 1. Table showing number of hits resulting from LCS trials with varying values of n and k, or subsequence length and error number, respectively. k = 1k=2k=3 length# solutionslength# solutionslength# solutions 1011864315584925127 12638451765545224 13237231821055320 1459661910045416 151350206675512 17344252165610 2019130114577 251014046597 30484524605 35285014612 3625536620 4013545630 456554640 501572650
11
User Interface Design with usability in mind Selection of inputs – What kind of genomes can/will this tool be used for? Format of results – How should these be presented in order to allow interpretation? Visualization Further processing of output
12
Timeline Part I: Poxvirus LCS data collection and analysis 2 months Part II: Interface 4-6 months
13
Course Selection BIOC 570 - completed MICR 502 - Virology Courses to sit in for: Biochemistry courses? Computing courses? Data mining Bioinformatics Statistics
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.