US Sequencing Project Funded by NSF Two-year project Start date: Sept 1, 2004 Follow-up project for full sequencing of chromosomes 1, 10 and 11
Aims 400,000 BAC end sequences 2400 reads from sheared library 20 full BAC sequences FISH analysis for selected BACs Bioinformatics hub for project –Analysis pipelines –Data Archiving for entire project –Summary data for entire project –Links to other groups
Sequencing Outsourced sequencing BAC ends and sheared library: –SeqWright Inc, Houston, Texas –ABI Prism 3730xl sequencers 20 full BACs –Phred/Phrap assembly Bioinformatics Clustering of BAC Ends, sheared sequences using SGN pipeline Gene coding potential analysis, repeat identification BAC annotation using SGN pipeline based on GeneSeqer, Apollo, gene finding programs
Sequence pipelines Clustering of BAC Ends, sheared sequences using SGN pipeline Gene coding potential analysis, repeat identification BAC annotation using SGN pipeline based on GeneSeqer, Apollo, gene finding programs