For the UCSC Genome Informatics Group aka Browser Staff.

Slides:



Advertisements
Similar presentations
Library Resources in the Networked Environment or, Its all about service(s) (and data…) Kevin Kidd Library Applications & Systems Manager Boston College.
Advertisements

Design Validation CSCI 5801: Software Engineering.
The Diagnostic Laboratory ……the ideal system……. Molecular Genetics Diagnostic Laboratory Exciting area of medical pathology Need to continually up-date.
Enyang Guo Millersville University September 19, 2014 Simulations and Integrated Learning in Investment Education EFA 2014.
Why Blogging is Essential. Agenda Why? What are the characteristics of the most successful blogs? What distinguishes the successful from unsuccessful.
A Lite Introduction to (Bioinformatics and) Comparative Genomics Chris Mueller August 10, 2004.
Linking Actions for Unmet Needs in Children’s Health
I started school with the intention of becoming a web developer and I have been here a year so far working for a degree in Web Technologies. My grades.
Genome Browsers Carsten O. Daub Omics Science Center RIKEN, Japan May 2008.
ENCODE Data Coordination at UCSC Kate Rosenbloom ENCODE DCC Technical Project Manager UCSC Genome Bioinformatics Group September 2010 Genome Browser SAB.
Biological Databases Notes adapted from lecture notes of Dr. Larry Hunter at the University of Colorado.
Informatics Support for Vaccine Projects Using and extending the UCSC bioinformatics infrastructure.
Core Application Software Activities Ian Fisk US-CMS Physics Meeting April 20, 2001.
Genome Browsers Ensembl (EBI, UK) and UCSC (Santa Cruz, California)
Sequence Analysis. Today How to retrieve a DNA sequence? How to search for other related DNA sequences? How to search for its protein sequence? How to.
Visualization of genomic data Genome browsers. UCSC browser Ensembl browser Others ? Survey.
Discussion examples Andrea Zhok.
Presented by Karen Xu. Introduction Cancer is commonly referred to as the “disease of the genes” Cancer may be favored by genetic predisposition, but.
Samuvel Johnson nd MCA B. Contents  Introduction to Real-time systems  Two main types of system  Testing real-time software  Difficulties.
The UCSC Team A professional informatics team of 30 FTEs with expertise in –genomics –work with domain experts worldwide –complex data and applications.
Center for Biomolecular Science and Engineering University of California, Santa Cruz Robert Kuhn, PhD Center for Biomolecular Science and Engineering University.
Open sharing and maintenance of scientific code Jordan S Read; Luke A Winslow
Resource Curation and Automated Resource Discovery.
Andy Conley 3/26/ James Kent. Know that name. He is one of greatest, perhaps the greatest, bioinformatics programmers ever. He was deeply involved.
A Lite Introduction to (Bioinformatics and) Comparative Genomics Chris Mueller November 18, 2004 Based on the Genomics in Biomedical Research course at.
Chapter Quality Network Asthma Pilot Project LS4 Reflections from Maine in the Summer Amy Belisle, MD August 2010.
Training Role Module 8 – User Admin Ver. 10 Oct 2009.
Jessica Dantzer Mooney Lab Center for Computational Biology and Bioinformatics Indiana University School of Medicine
Adam Siepel Assistant Professor Biological Statistics & Computational Biology Cornell University.
Chapter 13 Table of Contents Section 1 DNA Technology
An Orientation: General Psychology Online. The Course Menu Shown on the far left is the menu used to navigate our Psychology course.
Government IT Professionals Online Survey Results FINAL REPORT September 2010.
Course Introduction CSE250. Course Overview This course will be difficult Work hard and start early You are adults and I will treat you as such – I won’t.
Pathway Interaction Database (PID) Market Research BioPortals Tiger Team Meeting Mervi Heiskanen January 31, 2013.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
Improvement Forum    A webinar series for QI Managers, Nurse Leaders and others supporting healthcare improvement in Wisconsin’s hospitals    June.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
Disciplined Software Engineering Lecture #15 Software Engineering Institute Carnegie Mellon University Pittsburgh, PA Sponsored by the U.S. Department.
Comprehensive Marketing Plan Presentation February 22, 2012 Indiana University Kelley MBA Students Lauren Feldman Medha Hazari Chris Mahon Jen Michuda.
A L I M E N T A T I O N A G R I C U L T U R E E N V I R O N N E M E N T 1 G20 – 12th May 2011 An International Research Initiative for Wheat Improvement.
Thoughts on ENCODE Annotations Mark Gerstein. Simplified Comprehensive (published annotation, mostly in '12 & '14 rollouts)
D4FF55A0-6B6F BF422A9BA9 Present by: Xiao Chen On December 7, 2015.
Integration of Bioinformatics into Inquiry Based Learning by Kathleen Gabric.
Human Physiology. Warm Up ( ) Given the list of the following sensors and their functions, write down a few ideas of experiments that you might.
Accessing and visualizing genomics data
Grundtvig Multilateral project CAMP2.0: Challenging Attractiveness of lifelong learning: web 2.0 tools for strengthening Marketing and Public relations.
The Structure of a Paragraph. Paragraphs A paragraph is a collection of related sentences dealing with one topic. Most paragraphs contain between five.
Visualization of genomic data Genome browsers. UCSC browser Ensembl browser Others ? Survey.
100,000 Genomes Project North East and North Cumbria GMC An Introduction Mike Pratt (Genomic Education Development Officer) Susan Goldstein (100,000 Genomes.
School of Teaching & Learning Development Advanced Lecturer in Education: Julie-Ann Stobo BA (Hons)
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Genomics and the Growing World Steve Rounsley Dow Agrosciences.
CyVerse Workshop Discovery Environment Overview. Welcome to the Discovery Environment A Simple Interface to Hundreds of Bioinformatics Apps, Powerful.
Thoughts on How to Initiate An Academic Career - Research
Helping you succeed in promoting your club
Lesson: Sequence processing
CyVerse Discovery Environment
Making “Open Data” Work: Challenges for Data Integration in Genomics Research
Collaboration and Communication Tools in Teaching and Business – Who’s doing what? How can busy professionals work more effectively with the help of Web.
Department of Genetics • Stanford University School of Medicine
Marine Managed Area Mapping
Rick McGee, PhD and Bill Lowe, MD Faculty Affairs and NUCATS
Shared Genomics Sharing paths of exploration to support collaborative reasoning in genomic data analysis David Hoyle, Mark.
FaceBase Consortium: NIDCR Update 2018
Approach Section: The “Meat” of the Proposal
TAMU Bovine QTL db and viewer
Chapter 12 Power Analysis.
Following Patterns of Inheritance in Humans
Presentation transcript:

For the UCSC Genome Informatics Group aka Browser Staff

Our Goals Encourage understanding of basic biology Help develop of new medical therapies Promote cooperation and collaboration in research Enjoy our work

Projects in the works Basic Bio Genome Browser and associated web works + database Data Coordinating Center for scientific consortia Gene sets Comparative genomics Regulatory elements 1000 Genome Project (Normal human variation) Vertebrate Genomes Medical Cancer browsers Copy number variation in pediatric patients FACE-base Linking variations in genotype to variations in phenotype Games to promote physical activity

Basic bio

Genome Browser etc. Will be subject to it’s very own meeting in a few weeks. Likely will need 40%-60% of staff time for the next few years anyway. Changes to further improve handling of distributed data Continued integration of high quality data sets Web services development to make it easier for third party tools to integrate our data Work to make it easier to set up “modified mirrors” and to make it possible for UCSC to easily import data from them.

Data Coordinating Center Work Almost every new large scale research effort now includes a data coordinating center (DCC). Our grant to be the DCC for ENCODE runs 2 more years Do we want to do more work like this? The grants are relatively easy to get for us Gets interesting data into our web works Puts us in a bureacratic/enforcement role Maybe wait and see how the new wranglers enjoy it Funding – currently ~6 FTE ENCODE?

Gene Sets At ENSEMBL the gene build is considered ~50% of their work, here it’s Hiram and me maybe 20% time. Worth investing more? High value, relatively easy to make our own RNA gene set. High value, difficult to expand set to new organisms. (Would need new techniques.) High value, difficult to integrate RNA-seq data into gene set. Moderate value, moderate difficulty to update UCSC genes more often. We have automated about as much as it can go, would need to train and devote an engineer or two half time to it. Funding? 1 FTE browser + 1 FTE various?

Comparative Genomics Current lastz/multiz approach invaluable to research but will only scale to 50 or so. David H. has group working on newer approaches with the hopes that they will scale and also yield more info. Should browser staff get more involved? Will Brian want to leap back in after ENCODE? Do we want to build and publically release browsers on reconstructed ancestors? Have we already reached point of diminishing returns on vertebrates? Really do need 100’s if not 1000’s to do good protein level conservation plots on just vertebrates. Funding? 1 FTE to maintain 50 genome lastz/multiz on browser.

Regulatory Elements Predicting regulatory elements (REs) today much like predicting exon/introns was 10 year ago: Enough experimental data available (from ENCODE, elsewhere) to paint >50% of elements Data is scattered all over the place, needs synthesis Reasonable to think that could develop methods that would predict 80-90% of true REs with 80-90% specificity. ENSEMBL has done this, but it is pretty weak. Strong personal research interest for Jim Funding – ENCODE + Browser. ~1 FTE

Cataloging human variation We’re just playing a peripheral role in the 1000 genomes project. In general though having a catalog of normal human variation is critical to most medical genomics projects. What mutations are tolerated in one copy of a gene What mutations are tolerated in both copies What genes can function with just a single working copy What genes can function with extra copies Likely our role will be mostly integrating in work from outside, but some new browser displays and some diploid displays may be helpful. Funding? Browser grant mostly. ~1 FTE

10000 Vertebrate Genome Project Another of David H’s grand ideas Lots of enthusiasm for it in the sequencing community, the wildlife conservation groups, and the organismal biologists Mark D has done some preliminary work, useful in keeping track of samples Would need to design alignment, gene prediction, visualization systems from scratch to handle the scale. Funding? HHMI 0.5 FTE?

Medically Oriented Projects

Cancer Genome Browser Prototypes are very well received Very productive group of grad students and post docs working on it. Would be good to start having more regular engineering staff and quality assurance involvement Have funding now to turn into products. 5 FTE on top of existing team??

Face Base Image database for purposes of pediatric genetic diagnosis. Images need to be kept very confidential – parents do *not* want to see their kid’s face on the internet. Might be good for Galt in that involves security and likely to share code with VisiGene. Good to keep Chris in cancer group in the loop as he has also been making an image browser that leverages the VisiGene source. Funding - Bob K is PI, ~1 FTE UCSC

Copy # Variation in Pediatrics Collaborative grant with David Ledbetter from Emory and various clinical genetic testing facilities. Organize and display copy number variation genotype data from pediatric patients with developmental problems. Ideal outcome would include a haplosufficiency map of human genome. Confidentiality of data map is built on must be maintained. Thorgier Thorgierson has scientific interest in this. Funding: Bob K is PI, ~1 FTE UCSC

Genotype/Phenotype Associations Another place we hope we can work with Thorgeir. We can no longer put up most medical genome association studies in detail. With so many studies these days though, a summary track of just the biggest significant peaks of peer reviewed studies might be of more interest to more people. Non-disease genotype/phenotype associations are interesting too, and are produced by many research groups including thouse at 23andme. Funding? Probably could justify at least 2 fte on browser grant.

Games to promote physical activity Perhaps a potent way to battle childhood obesity May be helpful to many adults as well – cardiovascular problems bigger killer than cancer! “Pocket” games put an acceleration sensor in the pocket. Game-play requires ongoing motion of the thighs (and hopefully other major muscles). I-phone pocket apps that link music to motion Wii pocket apps put main controller in pocket, navigate with nunchuk Hula hips propel boats Side to side sway propels fish Jump to jump…. Walk in place to climb Work with video game development class here at UCSC? Funding – Jim 1 month concept development at this stage. Apply for NIH grants once have storyboards?

Or give it all up for a nice job at the boardwalk?

Concluding thoughts Have successfully diversified funding Work will become likely more closely linked to medicine over time. More data will be private May increasingly have semi-specialized sub-teams I’ll put up a doodle poll shortly to get an idea what projects people are most interested in working on.