Overview  Introduction  Biological network data  Text mining  Gene Ontology  Expression data basics  Expression, text mining, and GO  Modules and.

Slides:



Advertisements
Similar presentations
Useful tricks in studying reading papers doing research writing papers publishing papers English e-manuscripts.
Advertisements

Pathways analysis Iowa State Workshop 11 June 2009.
A New Biclustering Algorithm for Analyzing Biological Data Prashant Paymal Advisor: Dr. Hesham Ali.
Example Presentation Something to Show. Basic Title & Content Point to be made Number One Point to be made Number Two Point to be made Number Three.
Process- Related Requirement Management By Viswaketan Reddy.
C E N T R F O R I N T E G R A T I V E B I O I N F O R M A T I C S V U E Lecture 9 Clustering Algorithms Bioinformatics Data Analysis and Tools.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Kate Milova MolGen retreat March 24, Microarray experiments: Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
‘Gene Shaving’ as a method for identifying distinct sets of genes with similar expression patterns Tim Randolph & Garth Tan Presentation for Stat 593E.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Biological Interpretation of Microarray Data Helen Lockstone DTC Bioinformatics Course 9 th February 2010.
An Overview of Relevance Feedback, by Priyesh Sudra 1 An Overview of Relevance Feedback PRIYESH SUDRA.
Gene Ontology and Functional Enrichment Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
B IOMEDICAL T EXT M INING AND ITS A PPLICATION IN C ANCER R ESEARCH Henry Ikediego
Immune Cell Ontology for Networks (ICON) Immunology Ontologies and Their Applications in Processing Clinical Data June 11-13, Buffalo, NY.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Malignant Melanoma and CDKN2A
Expression profiling of peripheral blood cells for early detection of breast cancer Introduction Early detection of breast cancer is a key to successful.
A Multivariate Biomarker for Parkinson’s Disease M. Coakley, G. Crocetti, P. Dressner, W. Kellum, T. Lamin The Michael L. Gargano 12 th Annual Research.
The Art of Scientific Writing. Goals of Scientific Writing  Making a clear presentation of a complex scientific problem/accomplishment  Addressing a.
Chapter 7 Essential Concepts in Molecular Pathology Companion site for Molecular Pathology Author: William B. Coleman and Gregory J. Tsongalis.
Multiple Examples of tumor tissue (public data from Whitehead/MIT) SVM Classification of Multiple Tumor Types DNA Microarray Data Oracle Data Mining 78.25%
Enabling biomarker validation in breast cancer molecular subtypes: sensitivity and specificity of array-based subtype classification in 983 patients Balázs.
Chris Luszczek Biol2050 week 3 Lecture September 23, 2013.
The Art of Arguing… How to use Language and Logic to write in a purposely persuasive manner.
Sample classification using Microarray Data. AB We have two sample entities malignant vs. benign tumor patient responding to drug vs. patient resistant.
ANSWERING APUSH ESSAY QUESTIONS (FRQ). Essay Prompt All college-level essay test answers are written in response to an essay “prompt.” All college-level.
A /01 Evaluation of English and Spanish Health Information on the Internet Gretchen Berland, M.D. The RAND Corporation.
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
How to write a professional paper. 1. Developing a concept of the paper 2. Preparing an outline 3. Writing the first draft 4. Topping and tailing 5. Publishing.
BIOS6660 shRNAseq Gene Set Enrichment Analysis Tzu L Phang PhD Robert Stearman PhD April 16, 2014.
Ranjit Ganta, Raj Acharya, Shruthi Prabhakara Department of Computer Science and Engineering, Penn State University DATA WAREHOUSE FOR BIO-GEO HEALTH CARE.
Overview  Introduction  Biological network data  Text mining  Gene Ontology  Expression data basics  Expression, text mining, and GO  Modules and.
Evolutionary Algorithms for Finding Optimal Gene Sets in Micro array Prediction. J. M. Deutsch Presented by: Shruti Sharma.
CSE 8331 Spring CSE 8331 Spring 2010 Image Mining Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University.
Preparation for Final Exam
Copyright OpenHelix. No use or reproduction without express written consent1.
Computational Approaches for Biomarker Discovery SubbaLakshmiswetha Patchamatla.
Comp. Genomics Recitation 10 4/7/09 Differential expression detection.
1 Semantic Relations for Interpreting DNA Microarray Data and for Novel Hypotheses Generation Dimitar Hristovski, 1 PhD, Andrej Kastrin, 2 Borut Peterlin,
Principals of Research Writing. What is Research Writing? Process of communicating your research  Before the fact  Research proposal  After the fact.
CSC, Dec.15-16,2005. Cytoscape Team Trey Ideker Mark Anderson Nerius Landys Ryan Kelley Chris Workman Past contributors: Nada Amin Owen Ozier Jonathan.
 Writing 5 English Language Program. In creating a thesis statement for your paper, you must consider these things. Does your thesis…  Give a topic.
Standard Form Bingo. Use any 9 of these numbers
A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb Analysis of (cDNA) Microarray.
Overview  Introduction  Biological network data  Text mining  Gene Ontology  Expression data basics  Expression, text mining, and GO  Modules and.
MA 6.3 A food web identifies producers, consumers, and decomposers, and explains the transfer of energy through trophic levels. Relationships among organisms.
PINALOG Protein Interaction Network Alignment and its implication in function prediction and complex detection Hang Phan Prof. Michael J.E. Sternberg.
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
Health Services Administration
Genomics research paper presentation
Putting it all together: Writing research proposals and reports
Presenting an opinion Facts Ideas Linking words, ideas, arguments
Pathway Visualization
Title of My Design and Prototype Project
Type Topic in here! Created by Educational Technology Network
Materials & Methods Introduction Abstract Results Conclusion
Overview Expression data basics Introduction Biological network data
Overview Gene Ontology Introduction Biological network data
Standard Form Bingo.
Reading Visuals.
Materials & Methods Introduction Abstract Results Conclusion
Overview Domains and conclusion Introduction Biological network data
Materials & Methods Introduction Abstract Results Conclusion
Pathway Visualization
Add Main Topic Here Created by Educational Technology Network
<Add authors and affiliation>
Materials & Methods Introduction Abstract Results Conclusion
Materials & Methods Introduction Abstract Results Conclusion
Presentation transcript:

Overview  Introduction  Biological network data  Text mining  Gene Ontology  Expression data basics  Expression, text mining, and GO  Modules and complexes  Domains and conclusion

Scenario  Ran a set of expression experiments to study a given disease state.  Need to put the results into a functional context.

Atherosclerosis  Most common fatal disease in the U.S., and not well-understood.

Microarray analysis  Analyzed 51 artery segments from the hearts from 22 heart transplant patients.  Classified segments by their disease pathology.  Will assess the differences between Type I (moderate) and Type V (severe) atherosclerosis.  Performed microarray analysis of each segment.  Agilent expression array with probesets for 13,000 human genes.

SAM microarray statistic  For each gene i, contrasted expression in Type I and Type V lesions with SAM ( Proc Natl Acad Sci USA 98: , 2001).  High positive SAM score: gene expressed more highly in Type V lesions.  Large negative SAM score: gene expressed more highly in Type I lesions.

Analysis pipeline 1. Biomarker identification  For formal studies, use machine learning methods  For exploratory work, select several genes with extreme SAM scores.

Analysis pipeline, continued 2. Biomarker association  Basic question: for this context, what is common among the biomarker genes?  Approaches  Exhaustive reading  GO analysis  Literature searching

pros and cons of this approach  Pro: associations are specific to this disease context  Pro: identifies relevant literature  Con: might not find associations on all of your biomarkers  Con: might find associations on other genes

Iterative literature searching  Perform an initial search  Color the network by SAM d-score  Identify any new “responsive” genes  Add to biomarker list  Repeat

Discussion topic Why not use all genes with extreme SAM scores as biomarkers? Why iterate?

Once you have a good network: 1. Use BiNGO to identify the enriched GO terms 2. Look at the genes corresponding to selected enriched terms 3. Check the literature search sentences for those genes 4. Choose one or two sentences, look at the abstracts. 5. Iterate if desired (or go to lunch)

Final points  No right or wrong answers, only plausible or novel hypotheses.  You can take any approach you wish.  “If it was easy, everyone would be doing it”.