Presentation is loading. Please wait.

Presentation is loading. Please wait.

The Wold Lab BioHub Cory Tobin. Collaborators Brandon King Joe Roden Diane Trout Dr. Barbara.

Similar presentations


Presentation on theme: "The Wold Lab BioHub Cory Tobin. Collaborators Brandon King Joe Roden Diane Trout Dr. Barbara."— Presentation transcript:

1 The Wold Lab BioHub Cory Tobin

2 Collaborators Brandon King Joe Roden Diane Trout Dr. Barbara

3 Goal Standardize the relationship between biological data Integrate all of the data seamlessly Provide novel methods to search for and analyze data

4 Adapted from http://woldlab.caltech.edu/biohub/

5 My Contribution Implement a database for homology data

6 Background Species A Species B Paralogs Orthologs The more general term is “homology” Gene

7 Requirements Be more accurate and flexible than HomoloGene Work in real time Make sense of HomoloGene’s misleading data

8 Rationale Gene They are similar Gene HomoloGene BioHub They are related like this

9 Rationale Continued Human Genome Mouse Genome Seq ASeq B HomoloGene would BLAST seq A against mouse and determine that seq C is an ortholog of seq A. Seq C HomoloGene would also BLAST seq B against mouse and detrmine that seq C is an ortholog of seq B. BioHub will BLAST seq A against mouse, find seq C, then BLAST C back against human to see if there are any better matches. It will find seq B to be better.

10 Methods Design data relationships that make sense biologically Generate the low-level database interaction code Parse and load HomoloGene’s data into our database Write biologically useful functions Create a web-based interface for easy use

11 Materials ArgoUML – Design Aid Pymerase – Design Implementation PostgreSQL – Database HomoloGene – Data Source Python – Programming Language

12 Current State Design data relationships that make sense biologically Generate the low-level database interaction code Parse and load HomoloGene’s data into our database Write biologically useful functions Create a web-based interface for easy use

13 Example Usage Sequence of Interest …GGATACAAAATTCCTC… Are there any known genes in this sequence? acetyl - coenzyme A dehydrogenase ( Human ) (cont.)

14 acetyl - coenzyme A dehydrogenase ( Human ) Are there any homologs? Mouse Rat Mosquito Fruit fly Nematode (cont.)

15 How are those genes related?

16 Where do you want to go?

17 More Info BioHubwoldlab.caltech.edu / biohub HomoloGenewww.ncbi.nlm.nih.gov Pythonpython.org Pymerasepymerase.sf.net PostgreSQLpostgresql.org


Download ppt "The Wold Lab BioHub Cory Tobin. Collaborators Brandon King Joe Roden Diane Trout Dr. Barbara."

Similar presentations


Ads by Google